Cookies on this website

We use cookies to ensure that we give you the best experience on our website. If you click 'Accept all cookies' we'll assume that you are happy to receive all cookies and you won't see this message again. If you click 'Reject all non-essential cookies' only necessary cookies providing core functionality such as security, network management, and accessibility will be enabled. Click 'Find out more' for information on how to change your cookie settings.

Neurodegenerative diseases such as Alzheimer's disease (AD) and Parkinson's disease (PD) are clinically heterogeneous, hampering the success of nonselective treatment strategies. Here we apply a transformer-based unsupervised clustering framework to longitudinal electronic health record data from over 100,000 patients across two UK cohorts, Clinical Practice Research Datalink Aurum and UK Biobank, to identify, validate and characterize subtypes of AD and PD. We uncover five reproducible subtypes for each condition, characterized by distinct comorbidity patterns, symptom trajectories, outcomes and genetic profiles. These include a high-mortality AD subtype with motor and cardiovascular features, and a genetically susceptible but clinically resilient PD subtype. We also identify metabolic-inflammatory and vascular-psychiatric phenotypes shared across AD and PD, suggesting cross-disease mechanisms. By integrating routinely collected electronic health record data with genetic analyses, our study provides a scalable framework for early, biologically informed subtyping, laying the groundwork for future targeted interventions in neurodegenerative diseases.

Original publication

DOI

10.1038/s43587-026-01085-3

Type

Journal article

Journal

Nat Aging

Publication Date

26/02/2026