Research
We are broadly interested in the research of machine learning, natural language processing, time series analysis, integrative genomics and computational phenotyping, with a focus on medical and clinical applications. Some of our recent works are on multi-modal machine learning (including deep learning) models applied to better understanding complex diseases, informing targeted therapies, improving patient outcomes, and reducing bias and disparity in health care. The common theme of our works aims at building AI/ML models that improve both prediction accuracy and interpretability, by exploring relational information in each data modality.
We have delved into different modalities of the healthcare data (e.g., unstructured clinical notes, structured EHR data, imaging data, genetic data etc.) and build methods to enable these data modalities to be individually and/or jointly mined to derive actionable intelligence. We have been actively working on developing flagship datasets to power high impact research.
Selected projects:
- Multi-modal machine learning for precision medicine
- Improving fairness and reducing bias for AI/machine learning
- Automated computational phenotyping
- Large Language Models to understand biomedical text
- Interpretable and explainable machine learning models for patient risk prediction
- Missing data imputation for health care data
- Software and resource
Overview paper summarizing our efforts in Collaborative AI in Healthcare.
For a list of publications from this lab, see the principal investigator’s Google Scholar page.