12/17/2023

Large Language Models with Retrieval-Augmented Generation for Zero-Shot Disease Phenotyping

NeurIPS 2023 PRESENTATION
Authors Will E. Thompson, David M. Vidmar, Jessica K. De Freitas, Gabriel Altay, Kabir Manghnani, Andrew C. Nelsen*, Kellie Morland*, John M. Pfeifer, Brandon K. Fornwalt, Ruijun Chen, Martin C. Stumpe, Riccardo Miotto

Abstract

Identifying disease phenotypes from electronic health records (EHRs) is critical for numerous secondary uses. Manually encoding physician knowledge into rules is particularly challenging for rare diseases due to inadequate EHR coding, necessitating review of clinical notes. Large language models (LLMs) offer promise in text understanding but may not efficiently handle real-world clinical documentation. We propose a zero-shot LLM-based method enriched by retrieval-augmented generation and MapReduce, which pre-identifies disease-related text snippets to be used in parallel as queries for the LLM to establish diagnosis. We show that this method as applied to pulmonary hypertension (PH), a rare disease characterized by elevated arterial pressures in the lungs, significantly outperforms physician logic rules (F1 score of 0.62 vs. 0.75). This method has the potential to enhance rare disease cohort identification, expanding the scope of robust clinical research and care gap identification.

VIEW THE POSTER

Related Content

View more
  • post image
    06/12/2025

    AI & ML in action: Demonstrating real-world impact in trial design & patient care

    Discover how the Tempus platform leverages AI and ML to inform standard of care practices through health equity guidelines and drive insights that help refine clinical trial design. Engage with live demonstrations showcasing how our tools identify patients by modifying inclusion/exclusion criteria and leveraging patient queries. Explore how our tools integrate NCCN guidelines and empower life science teams to access current, actionable patient-journey insights. Learn how these real-world applications can drive progress in your clinical development initiatives.

    Secure your recording now.

    Watch replay
  • post image
    06/09/2025

    Bridging the translational gap: The role of organoids in oncology R&D

    This white paper explores the evolving role of organoids in oncology R&D, highlighting their potential as predictive preclinical models and their ability to reduce translational risk. Download for a comprehensive overview of the scientific landscape, key adoption barriers, emerging innovations, and how pharma companies leverage organoids to accelerate precision medicine.

    Read more
  • post image
    03/25/2025

    The RNA advantage: A multimodal approach to accelerating oncology R&D

    Discover how transcriptomic data and multi-omics data integrated with AI is helping revolutionize oncology drug development and patient care and fueling precision medicine 2.0. Learn about RNA sequencing’s role in companion diagnostics and Tempus Loop’s capabilities to support target identification and validation.

    Secure your recording now.

    Watch replay