European Journal of Human Genetics, Published online: 24 February 2026; doi:10.1038/s41431-026-02054-5
Systematic benchmarking demonstrates large language models have not reached the diagnostic accuracy of traditional rare-disease decision support tools