Zero-shot learning for clinical phenotyping: comparing LLMs and rule-based methods

Neves, Bernardo; Moreira, José Maria; Gonçalves, Simão; Cerejo, Jorge; Silva, Nuno A. da; Leite, Francisca; Silva, Mário J.

Publication

Zero-shot learning for clinical phenotyping: comparing LLMs and rule-based methods

2025-06-01Research article

dc.contributor.author	Neves, Bernardo
dc.contributor.author	Moreira, José Maria
dc.contributor.author	Gonçalves, Simão
dc.contributor.author	Cerejo, Jorge
dc.contributor.author	Silva, Nuno A. da
dc.contributor.author	Leite, Francisca
dc.contributor.author	Silva, Mário J.
dc.date.accessioned	2025-08-13T14:42:21Z
dc.date.available	2025-08-13T14:42:21Z
dc.date.issued	2025-06-01
dc.description.abstract	Background: Phenotyping, the process of systematically identifying and classifying conditions within clinical data, is a crucial first step in any data science work involving Electronic Health Records (EHRs). Traditional approaches require extensive manual annotation efforts and face challenges with scalability. Methods: We investigated the use of Large Language Models (LLMs) for zero-shot phenotyping of 20 prevalent chronic conditions based on synthetic patient summaries generated from real structured EHRs codes. We evaluated the performance of multiple LLMs, including GPT-4o, GPT-3.5, and LLaMA 3 models with 8-billion, 70-billion, and 405-billion parameters, comparing them against traditional rule-based methods. For the analysis we used a dataset of 1,000 patients from Hospital da Luz Lisboa. Results: GPT-4o outperformed both traditional rule-based methods and alternative LLMs, achieving superior recall (0.97) and macro-F1 score (0.92). Rule-based phenotyping, while highly precise (0.92), showed lower recall (0.36). The integration of rule-based methods with LLMs optimized phenotyping accuracy by targeting manual annotation efforts on discordant cases. Conclusion: Zero-shot learning with LLMs, particularly GPT-4o, offers a powerful and efficient approach for phenotyping chronic conditions from EHRs, significantly reducing the need for extensive labeled datasets while maintaining high accuracy and interpretability.	eng
dc.identifier.citation	Neves, B., Moreira, J. M., Gonçalves, S., & Cerejo, J. et al. (2025). Zero-shot learning for clinical phenotyping: comparing LLMs and rule-based methods. Computers in Biology and Medicine, 192, Article 110181. https://doi.org/10.1016/j.compbiomed.2025.110181
dc.identifier.doi	10.1016/j.compbiomed.2025.110181
dc.identifier.eid	105003218284
dc.identifier.issn	0010-4825
dc.identifier.other	1903ac83-59bd-4d5e-98e0-cd0ae07e7fe4
dc.identifier.pmid	40273817
dc.identifier.uri	http://hdl.handle.net/10400.14/54550
dc.language.iso	eng
dc.peerreviewed	yes
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject	Large language models
dc.subject	Multimorbidity
dc.subject	Phenotyping
dc.subject	Zero-shot learning
dc.title	Zero-shot learning for clinical phenotyping: comparing LLMs and rule-based methods	eng
dc.type	research article
dspace.entity.type	Publication
oaire.citation.title	Computers in Biology and Medicine
oaire.citation.volume	192
oaire.version	http://purl.org/coar/version/c_970fb48d4fbd8a85

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 118221386.pdf
Size:: 5.17 MB
Format:: Adobe Portable Document Format

Download

Collections

CIIS - Contribuições em Revistas Científicas / Contribution to Journals