Resources


Database Credentialed Access

PRediction Of Disease PHEnoTypes (PROPHET)

Niels Turley, Marta Fernandes, Shadi Sartipi, Han Wu, Alice Lam, Lydia Petersen, Catherine Clive, Daniel Sumsion, Ruoqi Wei, Bram Overmeer, Jaden Searle, Gregory Hooke, Spencer Boris, Wan-Yee Kong, Arjun Singh, Marjan Sarami, Alihan Yaramis, Imad Akbar, Rebecca Milde, Jet Veltink, Elijah Davis, Aditya Gupta, Manohar Ghanta, Aidan McDonald Wojciechowski, Shibani Mukerji, Haoqi Sun, M Brandon Westover, Sahar Zafar

Multicenter expert-annotated EHR dataset and NLP phenotyping framework for 17 neurological conditions spanning diagnoses, severity scales, and outcomes across six U.S. health systems.

Published: March 31, 2026. Version: 1.0


Database Credentialed Access

Narcolepsy Risk Estimation from Clinical Notes

Niels Turley, Haoqi Sun, M Brandon Westover

Dataset and code for developing and validating machine learning models to phenotype narcolepsy type 1 (NT1) and narcolepsy type 2/idiopathic hypersomnia (NT2/IH) from multi-site electronic health record data, including cross-sectional classification

Published: March 2, 2026. Version: 1.0


Database Restricted Access

Cerebrospinal Fluid Testing for Neuroinvasive West Nile Virus and Measures to Improve Guideline Adherence

Carson Quinn, Karan Singh, Erik Klontz, Isaac Solomon, Shibani Mukerji

De-identified, dataset of 1,304 adult encounters with CSF-fluid West Nile virus testing patterns at two Mass General Brigham hospitals (2016-2023). Includes demographics, immune status, CSF/serum labs, WNV PCR & IgM results, guideline-adherence flags

Published: July 31, 2025. Version: 1.0.0


Database Restricted Access

Sleep Recordings from Wearable Devices in Meditators

Jayme Banks, Haoqi Sun, Robert Thomas, M Brandon Westover, Balachundhar Subramaniam

initial upload

Published: Feb. 28, 2025. Version: 1.0


Database Restricted Access

Targeted Metabolomics in the REGARDS Stroke Case-Cohort Study

Zsuzsanna Ament, Naruchorn Kijpaisalratana, Varun Bhave, Catharine Couch, Ana-Lucia Garcia Guarniz, Amit Patki, Mary Cushman, Suzanne Judd, Ryan Irvin, William Kimberly

This dataset contains metabolomics data from the REasons for Geographic and Racial Differences in Stroke (REGARDS) Study, a large, biracial cohort investigating metabolic biomarkers linked to stroke risk and health disparities across the United State

Published: Feb. 26, 2025. Version: 1.0.0


Software Open Access

Sleep Electroencaphalography-Based Brain Age Index

Haoqi Sun, Jefferson Tales Oliva, Oluwaseun Akeju, Robert Thomas, M Brandon Westover

Sleep EEG based brain age index is included in Luna v0.99 release.

Published: Nov. 13, 2024. Version: 0.99