Participating cohorts include nearly 1,400 sickle cell patients at St. Jude Children’s Research Hospital, Baylor College of Medicine, and other collaborating institutions (learn more). More than 800 whole genome sequences are currently available, with plans to continually expand data offerings. Access raw genomics data »

  • Our major cohort consists of 1,084 participants in the Sickle Cell Clinical Research and Intervention Program (SCCRIP). This longitudinal St. Jude-led observational study is prospectively following individuals with sickle cell disease throughout their lifetimes. We also collect retrospective data to create a record beginning at birth for each participant. View publication. Whole genome sequencing data is available for 503 participants sequenced through SCCRIP (498/503) and other St. Jude studies.
  • A second major cohort consists of 304 individuals with sickle cell disease who have undergone whole genome sequencing through studies at Baylor College of Medicine in Houston, TX. Genomic DNA was collected from pediatric sickle cell patients receiving care at Texas Children’s Sickle Cell Center, with IRB approval. The Center is the second largest pediatric sickle cell center in the United States.

SCCRIP/Baylor Demographics

Clinical Diagnosis

Genetic Composition

Plot of ethnic subpopulations most closely resembled by individuals in the aggregated St. Jude and Baylor College of Medicine cohorts. The majority of participants are African American and have a mixed genetic background of European white (CEU) and African subpopulations (YRI). Plots are generated from genetic data using principal components analysis (PCA; learn more ). PC1 = principal component 1; PC2 = principal component 2. Explore the genetic compositions by changing different PCs and subpopulations. Access raw data »


Blood phenotype values for total hemoglobin (Hb), fetal hemoglobin (HbF), hemoglobin A2 (HbA2) and the mean corpuscular volume (MCV) were plotted against age of individual participants at the time of measurement. These interactive plots include points for individual patients and regression lines for the mean and +/- one and two standard deviations (sd) for the value being plotted.