Publication
Zero-shot learning for clinical phenotyping: comparing LLMs and rule-based methods
dc.contributor.author | Neves, Bernardo | |
dc.contributor.author | Moreira, José Maria | |
dc.contributor.author | Gonçalves, Simão | |
dc.contributor.author | Cerejo, Jorge | |
dc.contributor.author | Silva, Nuno A. da | |
dc.contributor.author | Leite, Francisca | |
dc.contributor.author | Silva, Mário J. | |
dc.date.accessioned | 2025-08-13T14:42:21Z | |
dc.date.available | 2025-08-13T14:42:21Z | |
dc.date.issued | 2025-06-01 | |
dc.description.abstract | Background: Phenotyping, the process of systematically identifying and classifying conditions within clinical data, is a crucial first step in any data science work involving Electronic Health Records (EHRs). Traditional approaches require extensive manual annotation efforts and face challenges with scalability. Methods: We investigated the use of Large Language Models (LLMs) for zero-shot phenotyping of 20 prevalent chronic conditions based on synthetic patient summaries generated from real structured EHRs codes. We evaluated the performance of multiple LLMs, including GPT-4o, GPT-3.5, and LLaMA 3 models with 8-billion, 70-billion, and 405-billion parameters, comparing them against traditional rule-based methods. For the analysis we used a dataset of 1,000 patients from Hospital da Luz Lisboa. Results: GPT-4o outperformed both traditional rule-based methods and alternative LLMs, achieving superior recall (0.97) and macro-F1 score (0.92). Rule-based phenotyping, while highly precise (0.92), showed lower recall (0.36). The integration of rule-based methods with LLMs optimized phenotyping accuracy by targeting manual annotation efforts on discordant cases. Conclusion: Zero-shot learning with LLMs, particularly GPT-4o, offers a powerful and efficient approach for phenotyping chronic conditions from EHRs, significantly reducing the need for extensive labeled datasets while maintaining high accuracy and interpretability. | eng |
dc.identifier.citation | Neves, B., Moreira, J. M., Gonçalves, S., & Cerejo, J. et al. (2025). Zero-shot learning for clinical phenotyping: comparing LLMs and rule-based methods. Computers in Biology and Medicine, 192, Article 110181. https://doi.org/10.1016/j.compbiomed.2025.110181 | |
dc.identifier.doi | 10.1016/j.compbiomed.2025.110181 | |
dc.identifier.eid | 105003218284 | |
dc.identifier.issn | 0010-4825 | |
dc.identifier.other | 1903ac83-59bd-4d5e-98e0-cd0ae07e7fe4 | |
dc.identifier.pmid | 40273817 | |
dc.identifier.uri | http://hdl.handle.net/10400.14/54550 | |
dc.language.iso | eng | |
dc.peerreviewed | yes | |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | |
dc.subject | Large language models | |
dc.subject | Multimorbidity | |
dc.subject | Phenotyping | |
dc.subject | Zero-shot learning | |
dc.title | Zero-shot learning for clinical phenotyping: comparing LLMs and rule-based methods | eng |
dc.type | research article | |
dspace.entity.type | Publication | |
oaire.citation.title | Computers in Biology and Medicine | |
oaire.citation.volume | 192 | |
oaire.version | http://purl.org/coar/version/c_970fb48d4fbd8a85 |
Files
Original bundle
1 - 1 of 1