Jianpeng is a Master’s student at the department of Computing and
Software, with a focus on mobile crowdsourcing. He also completed his
Bachelor’s Degree in Mechatronics Engineering at McMaster University
in 2019. Driven by his passion in applying the data science
technologies to real life, the proposed MacDATA fellowship project
will focus on utilizing the semi-supervised learning techniques to
develop a WiFi fingerprint based indoor localization model and
deploying the model to the MacQuest McMaster Campus Navigation Mobile
App to further provide a better indoor-navigation service for the
Somayeh is a 2nd-year doctoral student in Health Management in the area of Health Policy and Management. She is interested in Machine Learning, Big Data, Knowledge Discovery, Personalize Medicine, and Medical Decision Making. Coming with a mathematical background, Somayeh became more interested in Mathematical Programming and pursued her master’s degree in industrial engineering focused on healthcare optimization. Before starting her graduate studies at the DeGroote School of Business, she worked in different healthcare R&D centers for more than six years. She has contributed to several data analytics projects in healthcare.
I am a PhD student at the Cognitive Science of Language program, and a research fellow at the ARiEAL (Centre for Advanced Research in Experimental and Applied Linguistics). I did my BA in Teaching Chinese as a Foreign Language at Beijing Language and Culture University, and an MA in Chinese Linguistics and Pedagogy at the University of Alberta. My research interests include mechanisms of visual word processing under natural reading conditions, which I study with the help of concurrent recording of electroencephalography and eye-tracking.
Yujiao Hao is a second-year Ph.D. student in the Computer and Software department of McMaster whose current research field is activity recognition and mobile computing. Yujiao received both MA.Sc. and B.Sc. degrees in a Software Engineering major from Northeastern University, China. For the master’s phase, her research direction was multimedia medical information technology, such as the medical image registration algorithm. After graduation, she served the mobile phone industry for 5 years, accumulating rich experience in software testing and the at-hand skill of Android application development.
Her current research topic is related to the wellness of senior people, analyzing their mobility status with inertial measurement unit (IMU) sensory data.
I am a marketing Ph.D. student at DeGroote School of Business. My MacData fellowship will be co-supervised by Dr. Sourav Ray and Dr. Matheus Grasselli. In this research project, I will explore the impact of the sharing economy rental market, such as Airbnb, on the housing prices. In particular, I want to study how different aspects of the short-term rental markets of the sharing economy impact the utilization and prices of the traditional rental market. I will use a series of statistical and machine learning techniques to achieve my goals.
Jing Wang is a PhD candidate in the School of Computational Science and Engineering. His research topic is data analytics in supply chain optimization. The goal of his Fellowship project is to investigate the application of reinforcement learning to the optimal decision making in supply chain systems.
I am a first year PhD student in the Department of Civil Engineering working under the supervision of Dr. Zoe Li. My research focuses on implementing uncertainty analysis for wastewater simulation models. This MacDATA project is being co-supervised by Dr. Wenbo He and Dr. Zoe Li. It will establish an integrated toolkit for the prediction of influent and effluent emerging contaminants at wastewater treatment plants. In the short-term, the research program will focus on the development of data preprocessing, feature selection, predictive model development, and uncertainty analysis approaches for the real-time influent and effluent emerging contaminant control. In the long-term, the generated knowledge might be extended to various environmental engineering fields and multiple regions internationally.
I am currently a MSc. student in the Computational Science and Engineering program at McMaster University and have completed my bachelor degree in Electrical Engineering at Amirkabir University of Technology, Tehran, Iran. Working under the supervision of Dr. Sue Becker and Dr. John Connolly, my MSc thesis aims to design and implement a Brain-Computer Interface (BCI) for communication with coma patients (speechless communication).
I have focused my academic pathway on machine learning and data science since the third year of my undergraduate studies and due to my profound interest in brain and neuroscience, I am currently applying these techniques in analysing brain signals (esp. EEG) and designing BCIs. Thus, it brought me to MacDATA institute to propose a project at the intersection of several paradigms: neuroscience, psychology, electrical engineering, machine learning and data science.
Mahdis is a PhD student in the Health Research Methodology program, under the supervisor of Dr. Kathy Georgiades. During her time at McMaster, she plans to establish a strong research foundation in the mental health of immigrant and refugee children in Canada by understanding the social and economic factors that contribute to inequities in their mental health service use. Her MacDATA fellowship will utilize machine learning techniques to identify the specific factors that predict levels of mental health service use among youth in Ontario.
I’m a PhD student studying Statistics at McMaster University. My research focuses primarily on machine learning, clustering, and processes. Specifically I’m interested in the applications of functional data analysis methods in the context of environmental, health, and sports data. This fellowship will be co-supervised by Dr. Dawn Bowdish from the Department of Pathology and Molecular Medicine and my PhD supervisor Dr. Paul McNicholas. This fellowship is aimed at developing and implementing statistical methods for problems related to monocyte function and disease.
Fatemeh Armanfard is a Master’s student at the Department of Electrical and Computer Engineering, co-supervised by Profs. Jim Reilly and John Connolly. The focus of her thesis is on developing supervised machine learning techniques for analyzing electroencephalogram (EEG) signals recorded from comatose patients.
With a background in both humanities and cognitive science, my graduate research focuses on the psychology of migratory populations. By combining millions of demographic and psychological data points, I will adopt machine learning techniques to identify the psychological and demographic predictors of human migration. Further, I will analyze the attitudes towards migration phenomena by considering language from online sources (i.e., Reddit) and use computational methods to detect linguistic properties -e.g., emotional valence and arousal – when referring to immigration issues. This work will enable testing of current social-psychological models of personality using novel data sets that comprise many responders from multiple countries.
Bhanu is a PhD student under the supervision of Dr. Brian Timmons. His background is in concussion and more severe forms of brain injury. In working with Dr. Timmons and the Child Health and Exercise Medicine Program, Bhanu hopes to bridge his background in brain injury with the field of pediatric exercise medicine. By doing so, Bhanu hopes to better understand the complex relationship between exercise and concussion recovery, and thus inform the rest vs. activity debate that remains active in the field of brain injury and exercise medicine.
1 am a first year MSc. Candidate in the Chemical Biology Graduate Program at McMaster University, working under the supervision of Dr. Jennifer Stearns at the Farncombe Family Digestive Health Research Institute. My MSc thesis explores the impact of solid food introduction on bacterial abundance and function in the infant gut microbiome. Specifically, my project aims to uncover bacterial carbohydrate metabolism genes and map their abundance in relation to the introduction of solid food in a cohort of exclusively breastfed infants. While metagenomic assembly is a powerful tool to analyze the infant gut microbiome, the accuracy of metagenomic assemblers is heavily impacted by the relative proportions of genera in these samples, strain diversity of the species, and the size of their pangenome. The proposed MacDATA project, co-supervised by Dr. Jennifer Stearns and Dr. Paul McNicholas, is an investigation into metagenomic assembly in order to improve taxonomic and gene prediction. This project specifically aims to identify the reasons for metagenomic assembly errors pertaining to the infant gut microbiome, understand whether these errors are impacted by relative abundance and strain diversity, and improve assembly with novel clustering methods.
I am an MSc student at McMaster University, studying statistics. I also completed my BSc (Hons.) at McMaster. My research focuses on machine learning, specifically outlier detection in Gaussian mixture modelling. This project will use machine learning algorithms to investigate the relationship between brain structure and psychosocial functioning and the link between changes in sleep and circadian rhythms and risk of postpartum depression. This project will be co-supervised by Dr. Benicio Frey from the Department of Psychiatry and Behavioural Neurosciences and Dr. Paul McNicholas from the Department of Mathematics and Statistics.
I am a third year PhD student in Cognitive Science of Language Department. I am also a researcher at the Reading-Lab in The Centre for Advanced Research in Experimental and Applied Linguistics (ARiEAL). My interests include newly emerging words and word learning. My research focuses on the dynamics behind the spread of newly popular words and the question of how these words are learnt by adult speakers. I have been conducting eye-tracking, EEG, and computational linguistics studies to answer these questions.
Project: Twitter data has been a popular way to examine newly popular words. My project aims to answer two questions by looking at the United States and Canada geo-tagged tweets: (1) what makes a word popular? (2) how have been these words spreading?
Data Optimization with Applications to Assemble-to-Order Systems
Ning Huang is a Master’s student of the Department of Computing and Software, supervised by Dr. Antoine Deza. The focus of her research is on Data Optimization with Applications to Assemble-to-order Systems.The fellowship project aims to optimize the base stocks to achieve a higher reward for a period-review ATO system based on an independent base policy and an FCFS allocation rule.
A Weighted Estimation Method for Improved Precision and Efficiency: Applications in Health Service
Rabiul Islam is an international PhD student from Department of Economics at McMaster University. He is currently working on a weighted method to improve the efficiency and precision of estimation in the presence of self-selection bias in survey data. Pursuing his interest in Health Economics, he is attempting to utilize this estimation method for health data under the supervision of two eminent professors, namely, Professor Arthur Sweetman from Department of Economics and Professor Emmanuel Guindon from Department of Health Research Methods, Evidence, and Impact (HEI). Additionally, Islam is interested in research related to Labour Economics, and Applied Econometrics.
I am currently pursuing my PhD in the Marketing Department at the DeGroote School of Business, McMaster under the supervision of Dr Ruhai Wu. Before coming to DeGroote I spent some time working in the marketing research and management consulting industries. My current research involves examining the market of a homogenous good and estimating how much players in the market can gain by exploiting available information.
Identifying Risk Factors for Severe Influenza Infection
I’m a 2nd year MSc. student in the Department of Health Research, Evidence and Impact studying within the Health Research Methodology program. My research for the fellowship is focused on 2 projects; firstly, identifying potential risk factors for severe influenza infection using both conventional statistical techniques (i.e. multi-levelling modelling of independent patient data) and modern machine learning methods. The second half of my research project involves identifying phenotypes or subgroups of patients at higher risk for severe influenza infection using cluster analysis methods. The proposed MacDATA fellowship project will be co-supervised by my MSc. thesis supervisor, Dr. Domnik Mertz from the Department of Medicine, and Dr. Shikrant Bangdiwala from the Department of Health Research Methods, Evidence and Impact.
Data Optimization with Applications to Shared Electric Vehicles
I am a Master’s student in the department of Computing and Software at McMaster University. My Master’s project is entered on general computational method for combinatorial optimization, specifically the optimization formulations and algorithms. This fellowship will be co-supervised by Dr. Kai Huang from DeGroote School of Business and my Master’s supervisor Dr. Antoine Deza. This fellowship project is data optimization with applications to shared electric vehicles, which focuses on analyzing and exploiting the features of traffic and geographic information system location data used to optimize shared electric vehicle charging stations.
Predicting Adverse Health Events Among Older Emergency Department Patients
I am a PhD student in the department of Health Research Methods, Evidence, and Impact at McMaster University. I received both my Bachelor (2014) and Master of Science in Nursing (2017) from the University of Windsor. Clinically, I work as an emergency department nurse at the Detroit Medical Center, with previous experience in critical care and pediatrics. For my doctoral research I am interested in using big data to evaluate and inform geriatric models of care in the emergency department, and creating decisions support systems for emergency clinicians. My MacData fellowship will be co-supervised by Dr. Manaf Zargoush and Dr. Andrew Costa where we will utilize supervised machine learning methods to predict poor patient outcomes for older adults who present to the emergency department for care.
Physical activity, sedentary time, sleep duration and health outcomes in preschoolers:
exploring the relationships with compositional analyses
The objective of the proposed project is to establish the relationships between
movement behaviours (physical activity, sedentary time and sleep duration) and health
outcomes in preschoolers (i.e., less than 5 years old) using compositional analysis.
Compositional analysis uses data that are proportional of a finite whole (e.g., 24 hours) and
can be used when all parts or just some parts of the whole have been measured (1,2). This
method considers the effects of a behaviour as a proportion relative to the other behaviours
instead of an independent behaviour. This is a novel approach to analyzing movement
behaviour data and it aligns with Canada’s newest Physical Activity Guidelines for Children
and Youth (i.e., 5 years and older). In 2016, the Canadian 24-Hour Movement Guidelines for
Children and Youth were published and this new version focuses on the interplay between
sleep, physical activity and sedentary time, rather than focusing on these movement
My name is Kara Tsang, I am a first year Masters of Science candidate in the Department
of Biochemistry and Biomedical Sciences, performing my graduate research with Dr. Andrew
McArthur (Cisco Research Chair in Bioinformatics). I am applying to the MacDATA Graduate
Fellowship Program because my multidisciplinary project aligns to the aims of the MacDATA
Institute, the Biochemistry and Biomedical Sciences Graduate program, as well as my
professional learning goals.
The aim of this project is to develop a robust and computationally ecient framework for
the statistical analyses of genetic variants using quantile regression. The proposed framework
aims to achieve similar performances with respect to power and computational eciency as
OLS models under the correct model specication, while outperforming OLS models under
true complex interactions with unknown/unmeasured variables. The project will involve
empirical data analysis in the discovery of novel genetic variants for Body Mass Index (BMI)
as a complex phenotype with gene-environment components. This project will utilize the
publically data base of genotypes and phenotypes (dbGAP) on the NIH servers, and aims
to publish and disseminate any knowledge as a result of this work.
Sophiya is a PhD candidate in the Health Policy program, specializing in health economics. Sophiya obtained her Master’s degree in Public Health from the University of Guelph and her Honours Bachelor of Science degree from McMaster University. She has previous professional and research experience in epidemiology and health communications. Sophiya is interested in areas of health equity and data analytics, and in working at the intersection of health and social sciences to improve the health of under-served populations in Canada. Her current thesis under the supervision of Emmanuel Guindon, Arthur Sweetman and Kathy Georgiades, focuses on the health and health care experiences of immigrant and refugee populations in Canada.
How Prescription Drug Coverage affects Health Services Utilization among Immigrants in Canada using Linked Data
I am a PhD student in the Health Policy Program, working under the supervision of Dr. Lisa Schwartz. My thesis explores how health insurance coverage impacts health service utilization and outcomes of vulnerable populations, such as refugees. I completed my Honours BSc in Human Biology (Health and Disease) at the University of Toronto and MSc in Global Health at McMaster University. My MacDATA fellowship is being co-supervised by Dr. Emmanuel Guindon and Dr. Arthur Sweetman. This research will examine how supplementary drug insurance coverage affects health care utilization by immigrants in Canada, combining health research methods, economics, political science and global health disciplines. Given the current federal government’s interest to increase the intake of immigrants into the nation, and the steady rise of pharmacare options onto the agenda, provincially and nationally, this research aims to inform these key policy decisions by presenting the effects of health policies pertaining to the health and well-being of future Canadians.
Trained both as a computer scientist (B.Sc., Dalhousie University) and a neuroscientist (M.Sc., McMaster University), my PhD work in biomedical engineering targets the development of automated assessment tools for brain function. Specifically, I utilize machine learning techniques to analyze brain signals, recorded using electroencephalography (EEG), to assess brain injury and its effects on different aspects of cognition.
Machine Learning and Bayesian Inference in Finance
I am a Masters student at the School of Computational Science and Engineering, with a focus on Machine learning, finance, and economics. Currently, my supervisor is Matheus Grasselli; the professor of financial mathematics and my co-supervisor is Dr. Anand for this fellowship. My main research is focused on how to apply machine learning techniques into finance due to the specialty of the capital markets. In other words, how to grasp the complexity of machine learning applications into investments. For this project, my supervisors will guide me using the Bayesian method to calibrate the work for option pricing.
Big Data Genomics Infectious Disease Biosurveillance
I am a PhD Candidate in the Department of Anthropology, and I study a disease which has had a long and turbulent history with humans: The Plague. My thesis explores how ancient pandemics of plague are connected to modern outbreaks, and what interplay of factors leads to its re-emergence and dispersal. To examine this connection between the past and present, I work with a novel form of evidence: ancient DNA. My work in the McMaster Ancient DNA Centre involves extracting DNA from archaeological plague victims to capture, sequence, and reconstruct the genome of this ancient and deadly bacterium. However, to interpret this ancient genetic evidence, I need access to an extensive comparative dataset of genomes to robustly test the relationships between my ancient samples and modern disease outbreaks. With recent advances in sequencing technology, comparative datasets are plentiful but remain largely inaccessible and underutilized due to the challenges inherent in working with Big Data. In response, my project in collaboration with the MacDATA Institute, is to develop software tools to assist in querying online repositories for infectious disease metadata, and to streamline the data transformation steps needed to acquire and integrate raw data into comparative genomic projects. By improving data accessibility in the realm of infectious disease biosurveillance, this project seeks to advance current models of disease emergence and spread by giving researchers the tools needed to acquire crucial comparative data, particularly from currently underrepresented areas of the globe.
Analysis of Social Inequalities in Child and Youth Mental Health
I am a second year PhD student and Vanier Scholar in the Department of Mathematics and Statistics working under the supervision of Dr. Paul McNicholas. I completed my Honours B.Sc. and M.Sc. at McMaster. My research focuses on model-based clustering and classification which generally makes use of finite mixture models. Recently, I have been looking at model-based clustering and classification for matrix variate data examples of which include image data and multivariate longitudinal data. During of the course of this fellowship, I plan to apply clustering and classification techniques to better understand social inequalities in child and youth mental health and associated academic outcomes, specifically in migrant groups. Due to social and political unrest in many countries throughout the world, there has been a recent influx of immigrants and refugees to Canada, and it will therefore be important to determine which factors might affect the mental health of some of these migrants and help them to more easily settle into Canadian life
Analyzing Astrophysics Data using Machine Learning Techniques
Currently, I am a PhD student in the School of Computational Science and Engineering at McMaster University under the supervision of Dr. Paul McNicholas and Dr. Sharon McNicholas. Prior to coming to McMaster University for my degree in MSc Statistics, I obtained my Honours BSc in Mathematics and Statistics from University of Ghana. My PhD is focused on developing evolutionary algorithms, where some elements of the biological theory of evolution are incorporated into computer algorithms. Specifically, since missing data is inevitable when dealing with multivariate data, our interest lies in applying our developed algorithms to combining the problems of clustering and missing data
Clustering Analysis of Infant General Movements for Neurodevelopmental Evaluation
My name is Omar Nassif and I am a Master’s student in the department of electrical engineering. My research interests include high dimensional data analysis, deep learning networks, and computer vision. My research project involves characterization of infant motor movement for assessment of potential neurological deficit. The proposed methodology is to extract movement primitives from video sequences for analysis by machine learning methods. My hobbies include badminton, jogging, and piano/classical music.
Unsupervised Machine Learning: Robust Model-Based Clustering for High-Dimensional Data
I am a first year PhD student at the School of Computational Science and Engineering, with a focus in computational statistics. I completed my HBSc and MSc degrees through the Mathematics and Statistics Department. My research is focused on unsupervised machine learning methods for very high-dimensional or “big” data. Specifically, I have developed methods for clustering high-dimensional data containing outliers, which has shown a drastic improvement in image recovery of the Martian surface. I am now working on extending this approach for cluster analysis of skewed, asymmetrical data and looking to extend this method to areas other than astrophysics. These clustering approaches are very flexible and can be applied to data originating from many different research areas such as health, technology, etc. Given the recent influx of data, machine learning has become very popular and extending the literature in this area of statistics is crucial to understanding and utilizing all the incoming information.
Clustering Glioblastoma Multiforme Subtypes Using Variable Selection
I am a PhD student in the School of Computational Science and Engineering at McMaster University under the supervision of Dr. Paul McNicholas. I completed both my Honours BSc and MSc in statistics at McMaster University. The focus of my PhD is on computational statistics as my research interests lie in clustering and high-performance computing. My current research has been in developing new mixture model-based approaches for clustering discrete valued time series. I hope to apply my research to real world applications in areas such as healthcare, economics, and behavioural psychology.
Bryor Snefjella is PhD student at McMaster University, in the Cognitive Science of Language program, with interests in corpus linguistics, quantitative linguistics, psychology, and psycholinguistics. He uses corpora – including representative academic copora, social media, and other sources, to examine how the sensorimotor and affective connotations of words affect language processing, and reflect how we mentally represent people, events, and places.
Imputing Missing Matrix Variate Data: Applied to Missing Accelerometer Data in Children
I am a first year PhD student at the School of Computational Science and Engineering, working under the supervision of Dr. Paul McNicholas. My research focus is multivariate and computational statistics. I completed my Hons BSc in Statistics at the University of Toronto and after a number of years working in the software industry, completed my MSc in Statistics at McMaster University. My MacData fellowship is being co-supervised by Dr. Joyce Obeid in the Child Health & Exercise Medicine Program (CHEMP) group in the Department of Pediatrics at McMaster University and Dr. Paul McNicholas in the Department of Mathematics and Statistics. The research I will be conducting is directed towards developing an effective and reliable means of imputing missing accelerometer data, a common problem in studies measuring physical activity and something that has never been done before. I plan on implementing our methodology in easy to use software that will be accessible to clinician scientists. This will improve the quality of accelerometer studies across all health science disciplines.
Artificial Intelligence Algorithms for in-Silico Discovery of Novel Therapeutics for Acute Myeloid Leukemia
I am a 1st year MSc. candidate in the Faculty of Health Sciences at McMaster University, working in the Hope Cancer Lab at the Stem Cell and Cancer Research Institute (SCC-RI). My research focuses on the development of artificial intelligence (AI) pipelines for clinical next-generation sequencing (NGS) data analysis in acute myeloid leukemia (AML), as well as the real-world validation of resulting analytics in animal models of human AML. My research efforts have culminated thus far in the development of an AI program that has been able to discover promising weaknesses in the gene expression program of leukemic stem cells (LSCs), the speculated driver cells of the leukemia disease. The AI applies genetic algorithms to perform multifactorial analyses across multiple transcriptome NGS datasets, looking out for genes that best fit factors of uniqueness to the LSC expression program and association to disease aggressiveness and post-therapy relapse, but also safety as clinical targets. It has yielded new potential targets for pharmaceutical and gene therapies, some of which have only very recently been discovered in classical ways and have only just become actively pursued by other international research groups. My work in collaboration with the MacDATA Institute endeavors to expand upon this AI’s abilities to allow it to intelligently sift through the volumes of literature in fields of medicine, biochemistry and pharmacology published online in order to create self-updating data repositories for pharmaceutical repurposing. These datasets would allow it to “recycle” drugs that have already passed approval and completed clinical trials for other diseases, but that nobody ever imagined could be used against AML. The successful completion of this project and its coupling with robotic high-throughput screening facilities could lend possibility to an unthinkable concept: fully autonomous and precise drug discovery for AML, and perhaps many other types of human cancer.
Understanding How Physicians Read Ultrasound Images
I am a MSc candidate in the Department of Psychology, Neuroscience and Behaviour working under the supervision of Dr. Ranil Sonnadara and Dr. Sue Becker. The purpose of my graduate research is to examine how physicians interpret medical images and to provide a tool to help physicians make better diagnostic decisions and improve training for new physicians. This investigation will be broken into four major phases: creating a machine learning classifier, interviewing physicians to understand how they inspect medical images, tracking physician’s eye movements during image interpretation, and using these findings to develop an educational tool for medical trainees. These four tasks involved in each phase will use 2D ultrasound images from patients with prenatal hydronephrosis (PH), a condition where blockages cause urine to accumulate in the kidneys of fetal infants. The current study will use PH as a model in each phase, however, our long-term goal is to generalize the findings and concepts from these studies to other types of medical images and diseases. We hope that this work will lead to the development of data analytic tools which will help reduce diagnostic error associated with medical image interpretation.