In Focus

A platform contributing multiple insights into human health

The Recipient Epidemiology and Donor Evaluation Study Brazil Sickle Cell Disease Cohort Study

Brian Custer

Vitalant Research Institute, San Francisco, CA, USA

Read bio >

Shannon Kelly

Benioff Children's Hospital Oakland, Oakland, CA, USA

Read bio >

Carla DiNardo

Universidade de São Paulo and Fundacao Pro Sangue, São Paulo, Brazil

Read bio >

The ISBT Big Data Working Party (WP) brings together a range of expertise and data to advance our understanding of transfusion biology from donor health to recipient outcomes. The success of the Big Data WP will lie in building cross-country approaches to gain insights into which factors are most important in the chain from donating blood or biologics to improved health for the patients who receive those products.

To achieve these insights international collaboration is central. First, pipelines for collection of complex data need to be developed and then approaches for harmonizing data need to be implemented so that analytical advancements can be achieved. Even if data are held in separate databases for ethical or regulatory reasons, common approaches for analysis and interpretation of findings are necessary to optimize transfusion. An example of how this can be achieved is the Recipient Epidemiology and Donor Evaluation Study-III (REDS-III) Brazil Sickle Cell Disease (SCD) cohort study funded by the US National Heart Lung and Blood Institute (NHLBI) of the National Institutes of Health (NIH).

The Brazil SCD cohort study was initiated at six clinical care centers in four states in Brazil in 2013. The objective for establishing the cohort was to create a longitudinal study of paediatric and adult patients that would provide a path for understanding the pathogenesis of SCD and role of transfusion in the care of patients in an upper middle-income country. The expectation was that insights from the Brazil SCD cohort would be applicable in countries with both lower and higher levels of development. A central motivation was to understand the role of transfusion therapy, both chronic and as needed episodic transfusion, on SCD disease progression and adverse events including those associated with transfusion. The approaches REDS-III Brazil developed for data acquisition were then shared with those seeking to develop similar initiatives in Africa.

Figure 1. Areas of completed research leveraging big data from the REDS-III/REDS-IV-P Brazil SCD cohort study. Each oval includes subject matter investigated using SCD cohort data with topics in blue areas of focus in the REDS program, topics in orange areas of focus in the TOPMed program, and joint analyses shown in both colours.

The cohort of nearly 2800 participants was conceived with the recognition both phenotypes and genotypes are relevant in SCD pathogenesis. As SCD is caused by mutations in the HBB gene, which encodes the beta-globin subunit of hemoglobin, genetic considerations are central to understanding and treating SCD. The Brazil SCD cohort study was designed to collect data in three broad domains; patient reported information, medical data including transfusion history abstracted from medical records, and collection and storage of blood specimens for biomarker studies including array typing of participants for single nucleotide and copy number variants. In the REDS-III program we used an ~800,000 feature Transfusion Medicine Array (TMA) which was specifically developed for two studies conducted by REDS-III (RBC-Omics and the Brazil SCD cohort). With imputation, this array can inform close to 11,000,000 features of the human genome. Analyses of the SCD cohort data sources have been informative to a broad range of SCD topics from analyses of chronic transfusion therapy, including the risk of alloimmunization to broad epidemiological studies of the causes of mortality in the SCD population (Figure 1).

In addition to the array typing conducted by the REDS-III program, whole genome sequencing (WGS) was performed through the successful application to include the Brazil SCD cohort in the NHLBI Trans-Omics for Precision Medicine (TOPMed) program. The TOPMed program aims to advance precision medicine by integrating WGS and other omics data (metabolomics, epigenomics, transcriptomics, and proteomics) with clinical, behavioral, and environmental data. The objective is to gain unprecedented insights into the fundamental biological processes that underlie heart, lung, blood, and sleep disorders and assess the predictive ability of omics data in health for these conditions. (see https://topmed.nhlbi.nih.gov/)

Currently TOPMed has WGS and associated phenotype data for ~200,000 individuals, including the REDS-III Brazil SCD cohort members as well as over 3000 other persons with SCD from other cohort studies. So far 17 manuscripts have been published using WGS data from the Brazil SCD cohort and other TOPMed cohorts. Several of these manuscripts have been led by the REDS Brazil team and have focused on SCD genetics related to outcomes, including ischaemic stroke, genetic modifiers of vaso-occlusive events, white blood cell counts, and the diversity of variant alleles encoding Kidd, Duffy, and Kell antigens1-5. The SCD cohort also contributes to a wider set of analyses ranging from human telomere length regulatory genes to factors driving clonal hematopoiesis.6-9

During REDS-IV-P, two additional clinical sites were included in the SCD cohort study. The cohort now has participation of patients from five states in Brazil. The REDS-IV-P Brazil SCD cohort includes 3059 members (2053 continuing participants from REDS-III with an additional 1006 new cohort participants). In addition to blood specimen collection during REDS-IV-P, we collected urine samples, expanding the types of analyses we can achieve to include renal function and disease. We also have completed array typing of the REDS-III new cohort members using the TMA, allowing us to continue our investigation of genetic factors associated with adverse events in SCD. Further in REDS-IV-P, members of the SCD cohort participated in the Red Blood Cell – IMProving trAnsfusions for Chronically Transfused Recipients (RBC-IMPACT) study which seeks to identify the blood donor, component manufacturing and recipient factors associated with red cell survival following transfusion among chronically transfused SCD and Thalassemia patients in the U.S. and Brazil10 and https://clinicaltrials.gov/study/NCT05255445). The Brazil SCD cohort study started with the specific objective to centralize detailed clinical, laboratory, transfusion, and medical history data to conduct studies of SCD pathogenesis and transfusion complications. From the inception the scope and complexity of the data collected have increased. The SCD cohort study has become a major contributor to transfusion, hematology, and human biology questions directly through the work of REDS investigators and through our collaboration with other researchers. Opportunities to further characterize SCD from patient quality of life to specific genetic modifiers and metabolomic factors that alter the trajectory of disease have stemmed from the wide and deep characterization of the cohort members and the health care they receive. This approach of capacity building and active collaboration serves as a model for the Big Data WP. The insights we have gained from array and WGS typing of the SCD cohort and array typing of blood donors whose units were transfused to patients coupled with clinical and laboratory data provide a relevant example for how complex data from international sources can be brought together to advance patient care.

References

1. Dinardo CL, Oliveira TGM, Kelly S, Ashley-Koch A, Telen M, Schmidt LC, et al. Diversity of variant alleles encoding Kidd, Duffy, and Kell antigens in individuals with sickle cell disease using whole genome sequencing data from the NHLBI TOPMed Program. Transfusion. 2021;61(2):603-16. 2. Earley EJ, Kelly S, Fang F, Alencar CS, Rodrigues DOW, Soares Cruz DT, et al. Genome-wide association study of early ischaemic stroke risk in Brazilian individuals with sickle cell disease implicates ADAMTS2 and CDK18 and uncovers novel loci. Br J Haematol. 2023;201(2):343-52. 3. Cintho Ozahata M, Guo Y, Gomes I, Malta B, Belisário A, Amorim L, et al. Genetic variants associated with white blood cell count amongst individuals with sickle cell disease. Br J Haematol. 2024;205(5):1974-84. 4. Cintho Ozahata M, Oliveira M, Gomes I, Maximo C, Ferraz A, Belisário AR, et al. UHRF2 as a genetic correlate of hospitalization in sickle cell disease. Br J Haematol. 2025. 5. Malta B, Cintho Ozahata M, Gomes Moura IC, Amorim L, Ferraz A, Belisário AR, et al. Clinics and genetics of hyperhemolysis syndrome in patients with sickle cell disease. Transfusion. 2025;65(5):992-1000. 6. Liggett LA, Cato LD, Weinstock JS, Zhang Y, Nouraie SM, Gladwin MT, et al. Clonal hematopoiesis in sickle cell disease. J Clin Invest. 2022;132(4). 7. Taub MA, Conomos MP, Keener R, Iyer KR, Weinstock JS, Yanek LR, et al. Genetic determinants of telomere length from 109,122 ancestrally diverse whole-genome sequences in TOPMed. Cell Genom. 2022;2(1). 8. Cato LD, Li R, Lu HY, Yu F, Wissman M, Mkumbe BS, et al. Genetic regulation of fetal hemoglobin across global populations. medRxiv. 2023. 9. Weinstock JS, Gopakumar J, Burugula BB, Uddin MM, Jahn N, Belk JA, et al. Aberrant activation of TCL1A promotes stem cell expansion in clonal haematopoiesis. Nature. 2023;616(7958):755-63. 10. Karafin MS, Kelly S, Chapman KM, Kreuziger LB, Manis JP, Dinardo C, et al. The Red Blood Cell-Improving Transfusions for Chronically Transfused Recipients (RBC-IMPACT) study: protocol description of an international multi-site observational clinical study. Blood Transfus. 2025;23(5):418-32.

Contents