U.S. flag

An official website of the United States government

The Sequence Read Archive (SRA): Getting Started


The SRA is NIH's primary archive of high-throughput sequencing data and is part of the International Nucleotide Sequence Database Collaboration (INSDC) that includes at the NCBI Sequence Read Archive (SRA), the European Bioinformatics Institute (EBI), and the DNA Database of Japan (DDBJ). Data submitted to any of the three organizations are shared among them.

SRA mission

  • Archives raw sequencing data and alignment information from high-throughput sequencing platforms, including Roche 454 GS System®, Illumina Genome Analyzer®, Applied Biosystems SOLiD System®, Helicos Heliscope®, Complete Genomics®, and Pacific Biosciences SMRT®.
  • Makes sequence data available to the research community to enhance reproducibility and allow for new discoveries by comparing data sets.

SRA data

SRA accepts data from all kinds of sequencing projects including clinically important studies that involve human subjects or their metagenomes, which may contain human sequences. These data often have a controlled access via dbGaP (the database of Genotypes and Phenotypes) .

Contact SRA

Contact SRA staff for assistance at sra@ncbi.nlm.nih.gov

Submitting to the SRA

Search and Download


Support Center

Last updated: 2019-10-03T15:17:10Z