Welcome to Dr. Donghan "Mo" Yang Lab!

I am an Assistant Professor of Data Science in the Peter O’Donnell Jr. School of Public Health at UT Southwestern Medical Center (UTSW) and a Texas Health Resources (THR) Clinical Scholar. My research focuses on developing methods, platforms, and infrastructure for the integration and analysis of multimodal healthcare and biomedical data to address important clinical questions. I have extensive experience in working with real-world data including electronic health records (EHRs), claims, medical notes, imaging, and molecular profiling data. Outcomes from my research include new clinical insights and applications, assessments of health and healthcare disparities, and data commons platforms for diverse disease domains.


More About the PI

THR Clinical Scholar

As an awardee of the THR Clinical Scholars Program, I am the PI leading an internally funded research project, conducted at UTSW and THR, that pilots the use of large language models (LLMs) to mine EHR data for clinical and non-clinical insights. I have led and participated in multiple other projects that use LLMs to extract various features from free-text pathology reports, visit summaries, and progress notes. These works demonstrate the potential of LLMs in disease diagnosis, prognosis, assisting human chart review, and identifying healthcare disparities.



Health Informatics Lead, QBRC

As the health informatics lead at UTSW’s Quantitative Biomedical Research Center, I spearhead efforts to develop comprehensive data commons and resources for various diseases, including adult and childhood cancers, cardiovascular diseases, liver diseases, and COVID-19. My data science and health informatics expertise is strengthened by a solid training in biomedical imaging sciences, where I gained extensive hands-on experience from benchtop to in silico and clinical settings.



Director, Biostatistics and Data Science Core

As the Director of Biostatistics and Data Science Core at UTSW, I manage a team of 10 faculty and staff members to offer analytics, technological, and infrastructural support to the clinical research community. In this role, I oversee staffing, budgeting, regulatory affairs, and project timelines, and have developed strong capabilities in project management and resource.


Research Interests

  • Large Language Models for Clinical Research and Care
  • Integration of Real-World Data
  • Analysis of Real-World Data
  • Biomedical Imaging

Latest Publications

A complete publication list can be found here.

MORE PUBLICATIONS
A critical assessment of using ChatGPT for extracting structured data from clinical notes

Huang J, Yang DM, Rong R, Nezafati K, Treager C, Chi Z, Wang S, Cheng X, Guo Y, Klesse LJ, Xiao G, Peterson ED, Zhan X, Xie Y. (2024).
npj Digit Med. DOI: 10.1038/s41746-024-01079-8

Osteosarcoma Explorer: A Data Commons With Clinical, Genomic, Protein, and Tissue Imaging Data for Osteosarcoma Research

Yang DM#, Zhou Q#, Furman-Cline L, ..., Xie Y. (2023).
JCO Clin Cancer Inform. PMC10681418. DOI: 10.1200/CCI.23.00104

Association of Healthcare Access With Intensive Care Unit Utilization and Mortality in Patients of Hispanic Ethnicity Hospitalized With COVID-19

Velasco F#, Yang DM#, Zhang M, Nelson T, Sheffield T, Keller T, Wang Y, Walker C, Katterapalli C, Zimmerman K, Masica A, Lehmann CU, Xie Y, Hollingsworth JW. (2021)
J Hosp Med. PMC8577697. DOI: 10.12788/jhm.3717

Research Interests


  • Large Language Models for Clinical Research and Care

    The emergence of large language models (LLMs) unlocks unprecedented opportunity for extracting valuable insights from previously inaccessible or underutilized free-text medical notes. My latest research centers on developing LLM-powered approaches for extracting structured data elements from these notes, with a focus on practical implementation in real-world clinical settings.

    Publications

    Huang J, Yang DM, Rong R, Nezafati K, Treager C, Chi Z, Wang S, Cheng X, Guo Y, Klesse LJ, Xiao G, Peterson ED, Zhan X, Xie Y. A critical assessment of using ChatGPT for extracting structured data from clinical notes. npj Digit Med (2024). DOI: 10.1038/s41746-024-01079-8
    Wang L, Nezafati K, Rong R, Park AJ, Zhu J, Xiao G, Xie Y, Yang DM*, Chong BE*. Assessing disease severity in cutaneous lupus patients using natural language processing: preliminary data from a cohort study. JAAD (accepted).
  • Integration of Real-World Data

    A key foundation for precision medicine is the effective integration and analysis of real-world data. My core expertise lies in developing health informatics methods, platforms, and infrastructure to harmonize and integrate multimodal healthcare and biomedical research data. In this area, I led the design and development of data models, extract transform load (ETL) pipelines, and data commons featuring user-friendly web interfaces that facilitate exploration and analytics across different data sources and types. Collaborating with a multidisciplinary team, I successfully coordinated the collection, integration, and management of diverse data assets, including EHRs, medical notes, imaging, and molecular profiling data from various healthcare systems (e.g., UTSW, Children’s Health) as well as cross-system organizations (e.g., Children’s Oncology Group, Malignant Germ Cell International Consortium).

    Publications

    Yang DM#, Zhou Q#, Furman-Cline L, Cheng X, Luo D, Lai H, Li Y, Jin KW, Yao B, Leavey PJ, Rakheja D, Lo T, Hall D, Barkauskas DA, Shulman DS, Janeway K, Khanna C, Gorlick R, Menzies C, Zhan X, Xiao G, Skapek SX, Xu L, Klesse LJ, Crompton BD, Xie Y. Osteosarcoma Explorer: A Data Commons With Clinical, Genomic, Protein, and Tissue Imaging Data for Osteosarcoma Research. JCO Clin Cancer Inform (2023). PMC10681418. DOI: 10.1200/CCI.23.00104
    Ci B#, Yang DM#, Krailo M, Xia C, Yao B, Luo D, Zhou Q, Xiao G, Xu L, Skapek SX, Murray MM, Amatruda JF, Klosterkemper L, Shaikh F, Faure-Conter C, Fresneau B, Volchenboum SL, Stoneham S, Lopes LF, Nicholson J, Frazier AL, Xie Y. Development of a Data Model and Data Commons for Germ Cell Tumors. JCO Clinical Cancer Informatics (2020). PMC7328105. DOI: 10.1200/CCI.20.00025
    Zhang M, Sheffield T, Zhan X, Li Q, Yang DM, Wang Y, Wang S, Xie Y, Wang T, Xiao G. Spatial molecular profiling: platforms, applications and analysis tools. Briefings in Bioinformatics (2021). PMC8138878. DOI: 10.1093/bib/bbaa145
  • Analysis of Real-World Data

    Advanced data analytics strategies, particularly deep learning-based approaches, have significant potential for uncovering hidden insights from complex real-world healthcare data. Working with multidisciplinary teams, I have applied deep learning and statistical methods to analyzing EHR, claims, and registry data, with a focus on characterizing disease and care patterns on both individual and group levels. My analytics works have spanned a diverse range of disease settings, including cardiovascular diseases, cancer, and COVID-19. These studies yielded novel findings on identifying risk factors, predicting clinical outcomes, and addressing healthcare disparities.

    Publications

    Chen HW, Liu J, Yang DM, Xie Y, Peterson ED, Navar A, Chong BF. Incidence and prevalence of atherosclerotic cardiovascular disease in cutaneous lupus erythematosus. JAMA Determatol (accepted).
    Rong R#, Yang DM#, Gu Z#, Lai H, Nelson T, Keller T, Walker C, Jin KW, Chen C, Peterson ED, Navar A, Velasco F, Xiao G, Xie Y. A deep learning model for clinical outcome prediction using longitudinal inpatient electronic health records. UT System AI Syposium 2024. Dallas, TX.
    Velasco F#, Yang DM#, Zhang M, Nelson T, Sheffield T, Keller T, Wang Y, Walker C, Katterapalli C, Zimmerman K, Masica A, Lehmann CU, Xie Y, Hollingsworth JW. Association of Healthcare Access With Intensive Care Unit Utilization and Mortality in Patients of Hispanic Ethnicity Hospitalized With COVID-19. J Hosp Med (2021). PMC8577697. DOI: 10.12788/jhm.3717
    He X, Yin S, Liu H, Lu R, Kernstine K, Gerber DE, Xie Y*, Yang DM*. Upfront Brain Treatments Followed by Lung Surgery Improves Survival for Stage IV Non-small Cell Lung Cancer Patients With Brain Metastases: A Large Cohort Analysis. Front Surg (2021). PMC8549861. DOI: 10.3389/fsurg.2021.649531
  • Biomedical Imaging

    Biomedical imaging technologies reveal intricate biological and pathological details spanning from the cellular to the systemic level. I have obtained extensive techniques and experience in magnetic resonance imaging and digital pathology. I have developed both experimental and computational methods for generating and analyzing image data to improve disease diagnosis and prognosis. I have also led the development of a robust experimentation platform for quantifying intracellular water preexchange lifetime in neurons and astrocytes, a fundamental measure that impacts the design of various magnetic resonance imaging techniques for studying the nervous system.

    Publications

    Wang S, Rong R, Yang DM, Fujimoto J, Bishop JA, Yan S, Cai L, Behrens C, Berry LD, Wilhelm C, Aisner D, Sholl L, Johnson BE, Kwiatkowski DJ, Wistuba, II, Bunn PA, Jr., Minna J, Xiao G, Kris MG, Xie Y. Features of tumor-microenvironment images predict targeted therapy survival benefit in patients with EGFR-mutant lung cancer. J Clin Invest (2023). PMC9843059. DOI: 10.1172/JCI160330
    Wang S, Yang DM, Rang R, Zhan X, Xiao G. Pathology Image Analysis Using Segmentation Deep Learning Algorithms. Am J Pathol (2019). DOI: 10.1016/j.ajpath.2019.05.007
    Yang DM, Arai TJ, Campbell JW, Gerberich JL, Zhou H, Mason RP. Oxygen-sensitive MRI assessment of tumor response to hypoxic gas breathing challenge. NMR Biomed (2019). PMC6581571. DOI: 10.1002/nbm.4101
    Yang DM, Huettner JE, Bretthorst GL, Neil JJ, Garbow JR, Ackerman JJH. Intracellular water preexchange lifetime in neurons and astrocytes. Magnet Reson Med (2018). PMC5754269. DOI: 10.1002/mrm.26781

Publications


Assessing disease severity in cutaneous lupus patients using natural language processing: preliminary data from a cohort study

Wang L, Nezafati K, Rong R, Park AJ, Zhu J, Xiao G, Xie Y, Yang DM*, Chong BE*
2024JAAD (accepted)

Incidence and prevalence of atherosclerotic cardiovascular disease in cutaneous lupus erythematosus

Chen HW, Liu J, Yang DM, Xie Y, Peterson ED, Navar A, Chong BF.
2024JAMA Determatol (accepted)

Enhancing Medical Imaging Segmentation with GB-SAM: A Novel Approach to Tissue Segmentation Using Granular Box Prompts

Villanueva-Miranda I, Rong R, Quan P, Wen Z, Zhan X, Yang DM, Chi Z, Xie Y, Xiao G.
2024Cancers (Basel). PMC11240495. DOI: 10.3390/cancers16132391

Deep Learning-Based Automated Measurement of Murine Bone Length in Radiographs

Rong R, Denton K, Jin KW, Quan P, Wen Z, Kozlitina J, Lyon S, Wang A, Wise CA, Beutler B, Yang DM, Li Q, Rios JJ, Xiao G.
2024nBioengineering. DOI: 10.3390/bioengineering11070670

A critical assessment of using ChatGPT for extracting structured data from clinical notes

Huang J, Yang DM, Rong R, Nezafati K, Treager C, Chi Z, Wang S, Cheng X, Guo Y, Klesse LJ, Xiao G, Peterson ED, Zhan X, Xie Y.
2024npj Digit Med. DOI: 10.1038/s41746-024-01079-8

Osteosarcoma Explorer: A Data Commons With Clinical, Genomic, Protein, and Tissue Imaging Data for Osteosarcoma Research

Yang DM#, Zhou Q#, Furman-Cline L, Cheng X, Luo D, Lai H, Li Y, Jin KW, Yao B, Leavey PJ, Rakheja D, Lo T, Hall D, Barkauskas DA, Shulman DS, Janeway K, Khanna C, Gorlick R, Menzies C, Zhan X, Xiao G, Skapek SX, Xu L, Klesse LJ, Crompton BD, Xie Y.
2023JCO Clin Cancer Inform. PMC10681418. DOI: 10.1200/CCI.23.00104

Deep learning in digital pathology for personalized treatment plans of cancer patients

Wen Z, Wang S, Yang DM, Xie Y, Chen M, Bishop J, Xiao G.
2023Semin Diagn Pathol. DOI: 10.1053/j.semdp.2023.02.003

Deep-Learning-Based Hepatic Ploidy Quantification Using H&E Histopathology Images

Wen Z, Lin YH, Wang S, Fujiwara N, Rong R, Jin KW, Yang DM, Yao B, Yang S, Wang T, Xie Y, Hoshida Y, Zhu H, Xiao G.
2023Genes (Basel). PMC10137944. DOI: 10.3390/genes14040921

Features of tumor-microenvironment images predict targeted therapy survival benefit in patients with EGFR-mutant lung cancer

Wang S, Rong R, Yang DM, Fujimoto J, Bishop JA, Yan S, Cai L, Behrens C, Berry LD, Wilhelm C, Aisner D, Sholl L, Johnson BE, Kwiatkowski DJ, Wistuba, II, Bunn PA, Jr., Minna J, Xiao G, Kris MG, Xie Y.
2023J Clin Invest. PMC9843059. DOI: 10.1172/JCI160330

Enhanced Pathology Image Quality with Restore-Generative Adversarial Network

Rong R, Wang S, Zhang X, Wen Z, Cheng X, Jia L, Yang DM, Xie Y, Zhan X, Xiao G.
2023Am J Pathol. PMC10123520. DOI: 10.1016/j.ajpath.2022.12.011

A Deep Learning Approach for Histology-Based Nucleus Segmentation and Tumor Microenvironment Characterization

Rong R, Sheng H, Jin KW, Wu F, Luo D, Wen Z, Tang C, Yang DM, Jia L, Amgad M, Cooper LAD, Xie Y, Zhan X, Wang S, Xiao G.
2023Mod Pathol. DOI: 10.1016/j.modpat.2023.100196

Spatial molecular profiling: platforms, applications and analysis tools

Zhang M, Sheffield T, Zhan X, Li Q, Yang DM, Wang Y, Wang S, Xie Y, Wang T, Xiao G.
2021Briefings in Bioinformatics. PMC8138878. DOI: 10.1093/bib/bbaa145

A deep learning-based model for screening and staging pneumoconiosis

Zhang L, Rong R, Li Q, Yang DM, Yao B, Luo D, Zhang X, Zhu X, Luo J, Liu Y, Yang X, Ji X, Liu Z, Xie Y, Sha Y, Li Z, Xiao G.
2021Sci Rep. PMC7838184. DOI: 10.1038/s41598-020-77924-z

Association of Healthcare Access With Intensive Care Unit Utilization and Mortality in Patients of Hispanic Ethnicity Hospitalized With COVID-19

Velasco F#, Yang DM#, Zhang M, Nelson T, Sheffield T, Keller T, Wang Y, Walker C, Katterapalli C, Zimmerman K, Masica A, Lehmann CU, Xie Y, Hollingsworth JW.
2021J Hosp Med. PMC8577697. DOI: 10.12788/jhm.3717

Upfront Brain Treatments Followed by Lung Surgery Improves Survival for Stage IV Non-small Cell Lung Cancer Patients With Brain Metastases: A Large Cohort Analysis

He X, Yin S, Liu H, Lu R, Kernstine K, Gerber DE, Xie Y*, Yang DM*
2021Front Surg. PMC8549861. DOI: 10.3389/fsurg.2021.649531

Oxygen-Sensitive MRI: A Predictive Imaging Biomarker for Tumor Radiation Response?

Arai TJ, Yang DM, Campbell JW, Chiu T, Cheng X, Stojadinovic S, Peschke P, Mason RP.
2021Int J Radiat Oncol. PMC8286313. DOI: 10.1016/j.ijrobp.2021.03.039

Computational Staining of Pathology Images to Study the Tumor Microenvironment in Lung Cancer

Wang S, Rong R, Yang DM, Fujimoto J, Yan S, Cai L, Yang L, Luo D, Behrens C, Parra ER, Yao B, Xu L, Wang T, Zhan X, Wistuba, II, Minna J, Xie Y, Xiao G.
2020Cancer Res. DOI: 10.1158/0008-5472.CAN-19-1629

Development of a Data Model and Data Commons for Germ Cell Tumors

Ci B#, Yang DM#, Krailo M, Xia C, Yao B, Luo D, Zhou Q, Xiao G, Xu L, Skapek SX, Murray MM, Amatruda JF, Klosterkemper L, Shaikh F, Faure-Conter C, Fresneau B, Volchenboum SL, Stoneham S, Lopes LF, Nicholson J, Frazier AL, Xie Y.
2020JCO Clinical Cancer Informatics. PMC7328105. DOI: 10.1200/CCI.20.00025

Molecular differences across invasive lung adenocarcinoma morphological subgroups

Ci B, Yang DM, Cai L, Yang L, Girard L, Fujimoto J, Wistuba, II, Xie Y, Minna JD, Travis W, Xiao G.
2020Translational Lung Cancer Research. PMC7481608. DOI: 10.21037/tlcr-19-321

Examining correlations of oxygen sensitive MRI (BOLD/TOLD) with [(18)F]FMISO PET in rat prostate tumors

Zhou H, Chiguru S, Hallac RR, Yang DM, Hao G, Peschke P, Mason RP.
2019American Journal of Nuclear Medicine and Molecular Imaging. PMC6526364. DOI: none

Oxygen-sensitive MRI assessment of tumor response to hypoxic gas breathing challenge

Yang DM, Arai TJ, Campbell JW, Gerberich JL, Zhou H, Mason RP.
2019NMR Biomed. PMC6581571. DOI: 10.1002/nbm.4101

Pathology image analysis using segmentation deep learning algorithms

Wang S, Yang DM, Rang R, Zhan X, Xiao G
2019The American journal of pathology 189 (9), 1686-1698

Artificial Intelligence in Lung Cancer Pathology Image Analysis

Wang S, Yang DM, Rong R, Zhan X, Fujimoto J, Liu H, Minna J, Wistuba, II, Xie Y, Xiao G
2019Cancers. PMC6895901. DOI: 10.3390/cancers11111673

ConvPath: A software tool for lung adenocarcinoma digital pathological image analysis aided by a convolutional neural network

Wang S, Wang T, Yang L, Yang DM, Fujimoto J, Yi F, Luo X, Yang Y, Yao B, Lin S, Moran C, Kalhor N, Weissferdt A, Minna J, Xie Y, Wistuba, II, Mao Y, Xiao G
2019EBioMedicine. PMC6921240. DOI: 10.1016/j.ebiom.2019.10.033

Type and case volume of health care facility influences survival and surgery selection in cases with early-stage non-small cell lung cancer

Wang S, Lai S, von Itzstein MS, Yang L, Yang DM, Zhan X, Xiao G, Halm EA, Gerber DE, Xie Y.
2019Cancer. PMC7678405. DOI: 10.1002/cncr.32377

Systematic Analysis of Gene Expression in Lung Adenocarcinoma and Squamous Cell Carcinoma with a Case Study of FAM83A and FAM83B

Cai L, Luo D, Yao B, Yang DM, Lin S, Girard L, DeBerardinis RJ, Minna JD, Xie Y, Xiao G.
2019Cancers. PMC6627508. DOI: 10.3390/cancers11060886

Intracellular water preexchange lifetime in neurons and astrocytes

Yang DM, Huettner JE, Bretthorst GL, Neil JJ, Garbow JR, Ackerman JJH.
2018Magnet Reson Med. PMC5754269. DOI: 10.1002/mrm.26781

Sodium NMR relaxation in mesoporous systems

Kausik R, Fellah K, Yang DM.
2018Microporous and Mesoporous Materials. DOI: 10.1016/j.micromeso.2017.03.009

Na-23 and H-1 NMR Relaxometry of Shale at High Magnetic Field

Yang D, Kausik R.
2016Energ Fuel. DOI: 10.1021/acs.energyfuels.6b00130

Sacrificial-Template-Assisted Syntheses of Aluminate and Titanate Nanonets via Interfacial Reaction Growth

Shang J, Yu JF, Wang Y, Jiang MJ, Huang YN, Yang D, Tang X, Gao C, Li JL, Chen W, Xu GQ, Teo BK, Wu K.
2016Journal of Cluster Science. DOI: 10.1007/s10876-015-0916-4

Nicotinamide mononucleotide adenylyl transferase 1 protects against acute neurodegeneration in developing CNS by inhibiting excitotoxic-necrotic cell death

Verghese PB, Sasaki Y, Yang D, Stewart F, Sabar F, Finn MB, Wroge CM, Mennerick S, Neil JJ, Milbrandt J, Holtzman DM.
2011P Natl Acad Sci USA. PMC3223466. DOI: 10.1073/pnas.1107325108

Aluminothermal Reaction Approach for Micro-/Nanofabrications: Syntheses of In2O3 Micro-/Nanostructures and InN Octahedral Nanoshells

Yu JF, Wang Y, Wen W, Yang D, Huang B, Li JL, Wu K.
2010Advanced Materials. DOI: 10.1002/adma.200903656

Members


Card image cap
Yujia Guo, MS
Biostatistician
Card image cap
Lesi He, MS
Data Scientist
Card image cap
Xian Cheng, PhD
Data Manager
Card image cap
Jacqueline Abundis, BS
Research Assistant
Card image cap
Ivan Gu, MS
Graduate Research Assistant

About PI


Donghan "Mo" Yang, Ph.D.

Assistant Professor
Director, Biostatistics and Data Science Core
Manager, Pediatric Cancer Data Core
O'Donnell School of Public Health
UT Southwestern Medical Center


  Donghan.Yang@UTSouthwestern.edu
  5323 Harry Hines Blvd. Dallas, TX 75390

Biography


I am an Assistant Professor of Data Science in the Peter O’Donnell Jr. School of Public Health at UT Southwestern Medical Center (UTSW) and a Texas Health Resources (THR) Clinical Scholar. My research focuses on developing methods, platforms, and infrastructure for the integration and analysis of multimodal healthcare and biomedical data to address important clinical questions. I have extensive experience in working with real-world data including electronic health records (EHRs), claims, medical notes, imaging, and molecular profiling data. Outcomes from my research include new clinical insights and applications, assessments of health and healthcare disparities, and data commons platforms for diverse disease domains.

Positions and Appointments


  • Present 2023
    Director
    Biostatistics and Data Science Core, UTSW, Dallas, TX
  • Present 2021
    Assistant Professor
    O’Donnell School of Public Health, UTSW, Dallas, TX
  • 2021 2019
    Bioinformatics Project Manager
    Department of Population and Data Sciences, UTSW, Dallas, TX
  • 2019 2018
    Data Scientist
    Department of Population and Data Sciences, UTSW, Dallas, TX

Education


  • 2018 2015
    Postdoctoral Researcher, Radiology (Biomedical Imaging)
    UT Southwestern Medical Center, Dallas, TX
  • 2014 2009
    Ph.D., Chemistry (Biomedical Imaging)
    Washington University, St. Louis, MO
  • 2008 2004
    B.S., Chemistry (Physical Chemistry)
    Peking University, Beijing, China