U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

NIH NLM Logo
Log in

Account

Logged in as:
username
  • Dashboard
  • Publications
  • Account settings
  • Log out
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
  • Datasets
  • Taxonomy
  • Genome
  • Gene
  • Command-line tools
  • Documentation
  • Documentation
    • Getting started
    • Download and install
    • How-to guides
      • Genes
        • Get gene metadata
        • Download genes
        • Download gene orthologs
        • Get the longest isoform
      • Genomes
        • Get genome metadata
        • Download genome data
        • Large genome downloads
      • Virus
        • SARS-CoV-2 genomes
        • SARS-CoV-2 proteins
      • Working with JSON
        • Working with data reports
    • Supported programming languages
      • Python
        • Python API
          • ncbi.datasets
            • metadata
              • gene
              • genome
            • openapi
              • api
                • gene_api
                • genome_api
                • prokaryote_api
                • taxonomy_api
                • version_api
                • virus_api
              • api_client
              • apis
              • configuration
              • exceptions
              • model_utils
              • models
                • protobuf_any
                • rpc_status
                • v1_accessions
                • v1_annotated_assemblies
                • v1_annotation
                • v1_annotation_for_assembly
                • v1_annotation_for_assembly_file
                • v1_annotation_for_assembly_type
                • v1_annotation_for_virus_type
                • v1_assembly_dataset_availability
                • v1_assembly_dataset_descriptor
                • v1_assembly_dataset_descriptor_chromosome
                • v1_assembly_dataset_descriptors
                • v1_assembly_dataset_descriptors_filter
                • v1_assembly_dataset_descriptors_filter_assembly_level
                • v1_assembly_dataset_descriptors_filter_assembly_source
                • v1_assembly_dataset_descriptors_filter_assembly_version
                • v1_assembly_dataset_descriptors_request
                • v1_assembly_dataset_descriptors_request_content_type
                • v1_assembly_dataset_request
                • v1_assembly_dataset_request_resolution
                • v1_assembly_match
                • v1_assembly_metadata
                • v1_assembly_metadata_request
                • v1_assembly_metadata_request_bioprojects
                • v1_assembly_metadata_request_content_type
                • v1_bio_project
                • v1_bio_project_lineage
                • v1_busco_stat
                • v1_count_type
                • v1_dataset_request
                • v1_download_summary
                • v1_download_summary_available_files
                • v1_download_summary_dehydrated
                • v1_download_summary_file_summary
                • v1_download_summary_hydrated
                • v1_element_flank_config
                • v1_error
                • v1_error_assembly_error_code
                • v1_error_gene_error_code
                • v1_error_virus_error_code
                • v1_fasta
                • v1_feature_counts
                • v1_gene_counts
                • v1_gene_dataset_request
                • v1_gene_dataset_request_content_type
                • v1_gene_dataset_request_sort
                • v1_gene_dataset_request_sort_field
                • v1_gene_dataset_request_symbols_for_taxon
                • v1_gene_descriptor
              • models
              • rest
            • package
              • dataset
      • R
    • Reference
      • Command line
        • dataformat
          • tsv
            • genome
            • genome-seq
            • gene
            • virus-genome
            • microbigge
            • prok-gene
            • prok-gene-location
          • excel
            • genome
            • genome-seq
            • gene
            • virus-genome
            • microbigge
            • prok-gene
            • prok-gene-location
          • catalog
          • completion
            • bash
            • zsh
            • fish
            • powershell
          • version
        • datasets
          • summary
            • virus
              • genome
                • taxon
                • accession
            • gene
              • gene-id
              • symbol
              • accession
              • taxon
            • genome
              • accession
              • taxon
            • ortholog
              • gene-id
              • symbol
              • accession
          • download
            • gene
              • gene-id
              • symbol
              • accession
              • taxon
            • genome
              • accession
              • taxon
            • virus
              • genome
                • accession
                • taxon
              • protein
            • ortholog
              • gene-id
              • symbol
              • accession
          • rehydrate
          • completion
            • bash
            • zsh
            • fish
            • powershell
          • version
      • File formats
        • GBFF
        • GFF3
      • Report schemas
        • Gene
        • Genome assembly
        • Genome sequence
        • MicroBIGG-E
        • Prok. gene
        • Prok. gene location
        • Virus
      • Data packages
        • Gene package
        • Genome package
        • SARS-CoV-2 data package
      • GCA and GCF genomes
      • jq cheatsheet
      • REST API
        • Authentication
        • Retired Endpoints
    • FAQs and troubleshooting
      • Frequently asked Questions
      • Mac zip bug
Documentation version
Learn more
  1. Documentation

Datasets Documentation

These documentation pages describe how to use NCBI Datasets websites and tools. We love user feedback! Please reach out to us using the yellow Feedback button in the lower right hand corner.

Getting started

See how NCBI Datasets can help you gather genomic data from a variety of NCBI tools and platforms

Download and install

Step-by-step instructions on downloading and installing NCBI Datasets command-line tools

How-to guides

Use NCBI Datasets to gather metadata, download data packages, view reports and more

Programming languages

NCBI Datasets Python and R resources

Reference

Additional information about the NCBI Datasets CLI and API, file formats and data packages

FAQs and troubleshooting

FAQs, known issue resolutions and helpful resources
Generated June 7, 2023
Follow NCBI
TwitterFacebookLinkedInGitHub

Connect with NLM

  • SM-Twitter
  • SM-Facebook
  • SM-Youtube

National Library of Medicine
8600 Rockville Pike
Bethesda, MD 20894

Web Policies
FOIA
HHS Vulnerability Disclosure

Help
Accessibility
Careers

  • NLM
  • NIH
  • HHS
  • USA.gov