Release Notes For GenBank Release 176
GBREL.TXT Genetic Sequence Data Bank
February 15 2010
NCBI-GenBank Flat File Release 176.0
Distribution Release Notes
116461672 loci, 112326229652 bases, from 116461672 reported sequences
This document describes the format and content of the flat files that
comprise releases of the GenBank nucleotide sequence database. If you
have any questions or comments about GenBank or this document, please
contact NCBI via email at info@ncbi.nlm.nih.gov or:
GenBank
National Center for Biotechnology Information
National Library of Medicine, 38A, 8N805
8600 Rockville Pike
Bethesda, MD 20894
USA
Phone: (301) 496-2475
Fax: (301) 480-9241
GenBank releases do not include sequence records that originate from
third-parties (TPA) or from NCBI's Reference Sequence (RefSeq) project.
Rather, GenBank is the archival/primary resource which those other
efforts draw upon. For information about TPA and RefSeq, please visit:
http://www.ncbi.nih.gov/Genbank/TPA.html
http://0-www-ncbi-nlm-nih-gov.brum.beds.ac.uk/RefSeq
==========================================================================
TABLE OF CONTENTS
==========================================================================
1. INTRODUCTION
1.1 Release 176.0
1.2 Cutoff Date
1.3 Important Changes in Release 176.0
1.4 Upcoming Changes
1.5 Request for Direct Submission of Sequence Data
1.6 Organization of This Document
2. ORGANIZATION OF DATA FILES
2.1 Overview
2.2 Files
2.2.1 File Descriptions
2.2.5 File Sizes
2.2.6 Per-Division Statistics
2.2.7 Selected Per-Organism Statistics
2.2.8 Growth of GenBank
3. FILE FORMATS
3.1 File Header Information
3.2 Directory Files
3.2.1 Short Directory File
3.3 Index Files
3.3.1 Accession Number Index File
3.3.2 Keyword Phrase Index File
3.3.3 Author Name Index File
3.3.4 Journal Citation Index File
3.3.5 Gene Name Index
3.4 Sequence Entry Files
3.4.1 File Organization
3.4.2 Entry Organization
3.4.3 Sample Sequence Data File
3.4.4 LOCUS Format
3.4.5 DEFINITION Format
3.4.5.1 DEFINITION Format for NLM Entries
3.4.6 ACCESSION Format
3.4.7 VERSION Format
3.4.8 KEYWORDS Format
3.4.9 SEGMENT Format
3.4.10 SOURCE Format
3.4.11 REFERENCE Format
3.4.12 FEATURES Format
3.4.12.1 Feature Key Names
3.4.12.2 Feature Location
3.4.12.3 Feature Qualifiers
3.4.12.4 Cross-Reference Information
3.4.12.5 Feature Table Examples
3.4.13 ORIGIN Format
3.4.14 SEQUENCE Format
3.4.15 CONTIG Format
4. ALTERNATE RELEASES
5. KNOWN PROBLEMS OF THE GENBANK DATABASE
5.1 Incorrect Gene Symbols in Entries and Index
6. GENBANK ADMINISTRATION
6.1 Registered Trademark Notice
6.2 Citing GenBank
6.3 GenBank Distribution Formats and Media
6.4 Other Methods of Accessing GenBank Data
6.5 Request for Corrections and Comments
6.6 Credits and Acknowledgments
6.7 Disclaimer
==========================================================================
1. INTRODUCTION
1.1 Release 176.0
The National Center for Biotechnology Information (NCBI) at the National
Library of Medicine (NLM), National Institutes of Health (NIH) is responsible
for producing and distributing the GenBank Sequence Database. NCBI handles
all GenBank direct submissions and authors are advised to use the address
below. Submitters are encouraged to use the free Sequin software package
for sending sequence data, or the newly developed World Wide Web submission
form. See Section 1.5 below for details.
*****************************************************************************
The address for direct submissions to GenBank is:
GenBank Submissions
National Center for Biotechnology Information
Bldg 38A, Rm. 8N-803
8600 Rockville Pike
Bethesda, MD 20894
E-MAIL: gb-sub@ncbi.nlm.nih.gov
Updates and changes to existing GenBank records:
E-MAIL: update@ncbi.nlm.nih.gov
URL for GenBank's web-based submission tool (BankIt) :
http://0-www-ncbi-nlm-nih-gov.brum.beds.ac.uk/BankIt
(see Section 1.5 for additional details about submitting data to GenBank.)
*****************************************************************************
GenBank Release 176.0 is a release of sequence data by NCBI in the GenBank
Flatfile format. GenBank is a component of a tri-partite collaboration of
sequence databases in the U.S., Europe, and Japan, known as the International
Nucleotide Sequence Database Collaboration (INSDC). The collaborating databases
are the European Molecular Biology Laboratory (EMBL) at Hinxton Hall, UK, and
the DNA Database of Japan (DDBJ) in Mishima. Japan. Patent sequences are
incorporated through arrangements with the U.S. Patent and Trademark Office,
and via the collaborating international databases from other international
patent offices. The database is converted to various output formats, including
the GenBank Flatfile and Abstract Syntax Notation 1 (ASN.1) versions. The ASN.1
and Flatfile forms of the data are available at NCBI's anonymous FTP server :
ftp://ftp.ncbi.nih.gov/ncbi-asn1
ftp://ftp.ncbi.nih.gov/genbank
A mirror of the GenBank FTP site at the NCBI is available at the University
of Indiana, courtesy of the Bio-Mirror project:
ftp://bio-mirror.net/biomirror/genbank/
Some users who experience slow FTP transfers of large files might realize
an improvement in transfer rates from this alternate site when the volume
of traffic at the NCBI is high. For a list of other Bio-Mirror nodes, visit:
http://www.bio-mirror.net/
1.2 Cutoff Date
This full release, 176.0, incorporates data available to the collaborating
databases as of February 19, 2010 at approximately 1:30am EST. For more recent
data, users are advised to:
o Download GenBank Incremental Update (GIU) files by anonymous FTP from NCBI:
ftp://ftp.ncbi.nih.gov/ncbi-asn1/daily-nc (ASN.1 format)
ftp://ftp.ncbi.nih.gov/genbank/daily-nc (flatfile format)
o Use the interactive Network-Entrez or Web-Entrez applications to query
the 'Entrez: Nucleotides' database (see Section 6.4 of this document).
1.3 Important Changes in Release 176.0
1.3.1 Organizational changes
The total number of sequence data files decreased by 619 with this release:
- the BCT division is now composed of 52 files (+2)
- the CON division is now composed of 134 files (+4)
- the EST division is now composed of 410 files (-494)
- the GSS division is now composed of 208 files (-148)
- the MAM division is now composed of 6 files (+1)
- the PAT division is now composed of 100 files (+12)
- the TSA division is now composed of 4 files (+2)
- the VRL division is now composed of 14 files (+1)
- the VRT division is now composed of 21 files (+1)
This is a planned decrease, see Section 1.3.2 for more information.
The total number of 'index' files increased by 3 with this release:
- the AUT (author) index is now composed of 73 files (+2)
- the JOU (journal) index is now composed of 8 files (+1)
1.3.2 Increase in the size of EST and GSS sequence files
The average size of EST and GSS sequence files, uncompressed, was
approximately 222 MB prior to GenBank Release 176.0. But as of this
February 2010 GenBank release, we have increased the average
uncompressed file size to about 500 MB. This has reduced the total
number of EST/GSS files by roughly 50% (from 1260 in Release 175.0 to
618 in Release 176.0).
This change should make the FTP transfer of an entire release easier
to manage, because the number of files is reduced, and their compressed
sizes remain fairly small (with a maximum of about 80MB). Although the
uncompressed files are larger than previously, 500MB should still be
tractable for most users, especially given the overall disk space needs
for an entire release.
1.3.3 Changes in the content of index files
As described in the GB 153 release notes, the 'index' files which accompany
GenBank releases (see Section 3.3) are considered to be a legacy data product by
NCBI, generated mostly for historical reasons. FTP statistics of January 2005
seem to support this: the index files were transferred only half as frequently as
the files of sequence records. The inherent inefficiencies of the index file
format also lead us to suspect that they have little serious use by the user
community, particularly for EST and GSS records.
The software that generated the index file products received little
attention over the years, and finally reached its limitations in
February 2006 (Release 152.0). The required multi-server queries which
obtained and sorted many millions of rows of terms from several different
databases simply outgrew the capacity of the hardware used for GenBank
Release generation.
Our short-term solution is to cease generating some index-file content
for all EST sequence records, and for GSS sequence records that originate
via direct submission to NCBI.
The three gbacc*.idx index files continue to reflect the entirety of the
release, including all EST and GSS records, however the file contents are
unsorted.
These 'solutions' are really just stop-gaps, and we will likely pursue
one of two options:
a) Cease support of the 'index' file products altogether.
b) Provide new products that present some of the most useful data from
the legacy 'index' files, and cease support for other types of index data.
If you are a user of the 'index' files associated with GenBank releases, we
encourage you to make your wishes known, either via the GenBank newsgroup,
or via email to NCBI's Service Desk:
info@ncbi.nlm.nih.gov
Our apologies for any inconvenience that these changes may cause.
1.3.4 GSS File Header Problem
GSS sequences at GenBank are maintained in two different systems, depending
on their origin, and the dumps from those systems occur in parallel. Because
the second dump (for example) has no prior knowledge of exactly how many GSS
files will be dumped by the first, it does not know how to number its own
output files.
There is thus a discrepancy between the filenames and file headers for
eighty-seven of the GSS flatfiles in Release 176.0. Consider gbgss122.seq :
GBGSS1.SEQ Genetic Sequence Data Bank
February 15 2010
NCBI-GenBank Flat File Release 176.0
GSS Sequences (Part 1)
87169 loci, 64174921 bases, from 87169 reported sequences
Here, the filename and part number in the header is "1", though the file
has been renamed as "122" based on the number of files dumped from the other
system. We hope to resolve this discrepancy at some point, but the priority
is certainly much lower than many other tasks.
1.4 Upcoming Changes
1.4.1 New /pseudogene and /non_functional qualifiers
The GenBank 173.0 release notes described an anticipated conversion
of the /pseudo qualifer to /non_functional, based on the results of
the May 2009 INSDC annual meeting:
"Because the term "pseudo" is often assumed to mean 'pseudogene',
the /pseudo qualifier will be renamed as /non_functional, to
better reflect its actual usage in the sequence databases."
During follow-up discussions, the INSDC members decided that existing
uses of /pseudo can include both of the possible meanings of the term,
and that a more conservative course would be to introduce two new
qualifiers:
/pseudogene
/non_functional
Sequence submission tools will be updated to utilize these, and
the ambiguous /pseudo qualifier will be deprecated. If it is
possible, existing instances of /pseudo would then be converted
to one of the two new qualifiers.
/pseudogene and /non_functional will become legal for the Feature
Table as of the April 2010 GenBank Release.
1.5 Request for Direct Submission of Sequence Data
A successful GenBank requires that sequence data enter the database as
soon as possible after publication, that the annotations be as complete as
possible, and that the sequence and annotation data be accurate. All
three of these requirements are best met if authors of sequence data
submit their data directly to GenBank in a usable form. It is especially
important that these submissions be in computer-readable form.
GenBank must rely on direct author submission of data to ensure that
it achieves its goals of completeness, accuracy, and timeliness. To
assist researchers in entering their own sequence data, GenBank
provides a WWW submission tool called BankIt, as well as a stand-alone
software package called Sequin. BankIt and Sequin are both easy-to-use
programs that enable authors to enter a sequence, annotate it, and
submit it to GenBank. Through the international collaboration of DNA
sequence databases, GenBank submissions are forwarded daily for inclusion
in the EMBL and DDBJ databases.
SEQUIN. Sequin is an interactive, graphically-oriented program based
on screen forms and controlled vocabularies that guides you through the
process of entering your sequence and providing biological and
bibliographic annotation. Sequin is designed to simplify the sequence submission
process, and to provide increased data handling capabilities to accomodate
very long sequences, complex annotations, and robust error checking. E-mail
the completed submission file to : gb-sub@ncbi.nlm.nih.gov
Sequin is provided for Macintosh, PC/Windows, UNIX and VMS computers.
It is available by annonymous ftp from ftp.ncbi.nih.gov; login as
anonymous and use your e-mail address as the password. It is located in
the sequin directory. Or direct your web browser to this URL:
ftp://ftp.ncbi.nih.gov/sequin
BANKIT. BankIt provides a simple forms-based approach for submitting your
sequence and descriptive information to GenBank. Your submission will
be submitted directly to GenBank via the World Wide Web, and
immediately forwarded for inclusion in the EMBL and DDBJ databases.
BankIt may be used with Netscape, Internet Explorer, and other common
WWW clients. You can access BankIt from GenBank's home page:
http://0-www-ncbi-nlm-nih-gov.brum.beds.ac.uk/
AUTHORIN. Authorin sequence submissions are no longer accepted by
GenBank, and the Authorin application is no longer distributed by NCBI.
If you have questions about GenBank submissions or any of the data
submission tools, contact NCBI at: info@ncbi.nlm.nih.gov or 301-496-2475.
1.6 Organization of This Document
The second section describes the contents of GenBank releases. The third
section illustrates the formats of the flat files. The fourth section
describes other versions of the data, the fifth section identifies known prob-
lems, and the sixth contains administrative details.
2. ORGANIZATION OF DATA FILES
2.1 Overview
GenBank releases consist of a set of ASCII text files, most of which
contain sequence data. A few supplemental "index" files are also supplied,
containing comprehensive lists of author names, journal citations,
gene names, and keywords, along with the accession numbers of the records
in which they can be found (see Section 3.3). The line-lengths of
these files is variable.
2.2 Files
This GenBank flat file release consists of 1369 files. The lists
that follow describe each of the files included in the distribution.
Their sizes and base pair content are also summarized.
2.2.1 File Descriptions
Files included in this release are:
1. gbacc1.idx - Index of the entries according to accession number, part 1.
2. gbacc2.idx - Index of the entries according to accession number, part 2.
3. gbacc3.idx - Index of the entries according to accession number, part 3.
4. gbaut1.idx - Index of the entries according to author name, part 1.
5. gbaut10.idx - Index of the entries according to author name, part 10.
6. gbaut11.idx - Index of the entries according to author name, part 11.
7. gbaut12.idx - Index of the entries according to author name, part 12.
8. gbaut13.idx - Index of the entries according to author name, part 13.
9. gbaut14.idx - Index of the entries according to author name, part 14.
10. gbaut15.idx - Index of the entries according to author name, part 15.
11. gbaut16.idx - Index of the entries according to author name, part 16.
12. gbaut17.idx - Index of the entries according to author name, part 17.
13. gbaut18.idx - Index of the entries according to author name, part 18.
14. gbaut19.idx - Index of the entries according to author name, part 19.
15. gbaut2.idx - Index of the entries according to author name, part 2.
16. gbaut20.idx - Index of the entries according to author name, part 20.
17. gbaut21.idx - Index of the entries according to author name, part 21.
18. gbaut22.idx - Index of the entries according to author name, part 22.
19. gbaut23.idx - Index of the entries according to author name, part 23.
20. gbaut24.idx - Index of the entries according to author name, part 24.
21. gbaut25.idx - Index of the entries according to author name, part 25.
22. gbaut26.idx - Index of the entries according to author name, part 26.
23. gbaut27.idx - Index of the entries according to author name, part 27.
24. gbaut28.idx - Index of the entries according to author name, part 28.
25. gbaut29.idx - Index of the entries according to author name, part 29.
26. gbaut3.idx - Index of the entries according to author name, part 3.
27. gbaut30.idx - Index of the entries according to author name, part 30.
28. gbaut31.idx - Index of the entries according to author name, part 31.
29. gbaut32.idx - Index of the entries according to author name, part 32.
30. gbaut33.idx - Index of the entries according to author name, part 33.
31. gbaut34.idx - Index of the entries according to author name, part 34.
32. gbaut35.idx - Index of the entries according to author name, part 35.
33. gbaut36.idx - Index of the entries according to author name, part 36.
34. gbaut37.idx - Index of the entries according to author name, part 37.
35. gbaut38.idx - Index of the entries according to author name, part 38.
36. gbaut39.idx - Index of the entries according to author name, part 39.
37. gbaut4.idx - Index of the entries according to author name, part 4.
38. gbaut40.idx - Index of the entries according to author name, part 40.
39. gbaut41.idx - Index of the entries according to author name, part 41.
40. gbaut42.idx - Index of the entries according to author name, part 42.
41. gbaut43.idx - Index of the entries according to author name, part 43.
42. gbaut44.idx - Index of the entries according to author name, part 44.
43. gbaut45.idx - Index of the entries according to author name, part 45.
44. gbaut46.idx - Index of the entries according to author name, part 46.
45. gbaut47.idx - Index of the entries according to author name, part 47.
46. gbaut48.idx - Index of the entries according to author name, part 48.
47. gbaut49.idx - Index of the entries according to author name, part 49.
48. gbaut5.idx - Index of the entries according to author name, part 5.
49. gbaut50.idx - Index of the entries according to author name, part 50.
50. gbaut51.idx - Index of the entries according to author name, part 51.
51. gbaut52.idx - Index of the entries according to author name, part 52.
52. gbaut53.idx - Index of the entries according to author name, part 53.
53. gbaut54.idx - Index of the entries according to author name, part 54.
54. gbaut55.idx - Index of the entries according to author name, part 55.
55. gbaut56.idx - Index of the entries according to author name, part 56.
56. gbaut57.idx - Index of the entries according to author name, part 57.
57. gbaut58.idx - Index of the entries according to author name, part 58.
58. gbaut59.idx - Index of the entries according to author name, part 59.
59. gbaut6.idx - Index of the entries according to author name, part 6.
60. gbaut60.idx - Index of the entries according to author name, part 60.
61. gbaut61.idx - Index of the entries according to author name, part 61.
62. gbaut62.idx - Index of the entries according to author name, part 62.
63. gbaut63.idx - Index of the entries according to author name, part 63.
64. gbaut64.idx - Index of the entries according to author name, part 64.
65. gbaut65.idx - Index of the entries according to author name, part 65.
66. gbaut66.idx - Index of the entries according to author name, part 66.
67. gbaut67.idx - Index of the entries according to author name, part 67.
68. gbaut68.idx - Index of the entries according to author name, part 68.
69. gbaut69.idx - Index of the entries according to author name, part 69.
70. gbaut7.idx - Index of the entries according to author name, part 7.
71. gbaut70.idx - Index of the entries according to author name, part 70.
72. gbaut71.idx - Index of the entries according to author name, part 71.
73. gbaut72.idx - Index of the entries according to author name, part 72.
74. gbaut73.idx - Index of the entries according to author name, part 73.
75. gbaut8.idx - Index of the entries according to author name, part 8.
76. gbaut9.idx - Index of the entries according to author name, part 9.
77. gbbct1.seq - Bacterial sequence entries, part 1.
78. gbbct10.seq - Bacterial sequence entries, part 10.
79. gbbct11.seq - Bacterial sequence entries, part 11.
80. gbbct12.seq - Bacterial sequence entries, part 12.
81. gbbct13.seq - Bacterial sequence entries, part 13.
82. gbbct14.seq - Bacterial sequence entries, part 14.
83. gbbct15.seq - Bacterial sequence entries, part 15.
84. gbbct16.seq - Bacterial sequence entries, part 16.
85. gbbct17.seq - Bacterial sequence entries, part 17.
86. gbbct18.seq - Bacterial sequence entries, part 18.
87. gbbct19.seq - Bacterial sequence entries, part 19.
88. gbbct2.seq - Bacterial sequence entries, part 2.
89. gbbct20.seq - Bacterial sequence entries, part 20.
90. gbbct21.seq - Bacterial sequence entries, part 21.
91. gbbct22.seq - Bacterial sequence entries, part 22.
92. gbbct23.seq - Bacterial sequence entries, part 23.
93. gbbct24.seq - Bacterial sequence entries, part 24.
94. gbbct25.seq - Bacterial sequence entries, part 25.
95. gbbct26.seq - Bacterial sequence entries, part 26.
96. gbbct27.seq - Bacterial sequence entries, part 27.
97. gbbct28.seq - Bacterial sequence entries, part 28.
98. gbbct29.seq - Bacterial sequence entries, part 29.
99. gbbct3.seq - Bacterial sequence entries, part 3.
100. gbbct30.seq - Bacterial sequence entries, part 30.
101. gbbct31.seq - Bacterial sequence entries, part 31.
102. gbbct32.seq - Bacterial sequence entries, part 32.
103. gbbct33.seq - Bacterial sequence entries, part 33.
104. gbbct34.seq - Bacterial sequence entries, part 34.
105. gbbct35.seq - Bacterial sequence entries, part 35.
106. gbbct36.seq - Bacterial sequence entries, part 36.
107. gbbct37.seq - Bacterial sequence entries, part 37.
108. gbbct38.seq - Bacterial sequence entries, part 38.
109. gbbct39.seq - Bacterial sequence entries, part 39.
110. gbbct4.seq - Bacterial sequence entries, part 4.
111. gbbct40.seq - Bacterial sequence entries, part 40.
112. gbbct41.seq - Bacterial sequence entries, part 41.
113. gbbct42.seq - Bacterial sequence entries, part 42.
114. gbbct43.seq - Bacterial sequence entries, part 43.
115. gbbct44.seq - Bacterial sequence entries, part 44.
116. gbbct45.seq - Bacterial sequence entries, part 45.
117. gbbct46.seq - Bacterial sequence entries, part 46.
118. gbbct47.seq - Bacterial sequence entries, part 47.
119. gbbct48.seq - Bacterial sequence entries, part 48.
120. gbbct49.seq - Bacterial sequence entries, part 49.
121. gbbct5.seq - Bacterial sequence entries, part 5.
122. gbbct50.seq - Bacterial sequence entries, part 50.
123. gbbct51.seq - Bacterial sequence entries, part 51.
124. gbbct52.seq - Bacterial sequence entries, part 52.
125. gbbct6.seq - Bacterial sequence entries, part 6.
126. gbbct7.seq - Bacterial sequence entries, part 7.
127. gbbct8.seq - Bacterial sequence entries, part 8.
128. gbbct9.seq - Bacterial sequence entries, part 9.
129. gbchg.txt - Accession numbers of entries updated since the previous release.
130. gbcon1.seq - Constructed sequence entries, part 1.
131. gbcon10.seq - Constructed sequence entries, part 10.
132. gbcon100.seq - Constructed sequence entries, part 100.
133. gbcon101.seq - Constructed sequence entries, part 101.
134. gbcon102.seq - Constructed sequence entries, part 102.
135. gbcon103.seq - Constructed sequence entries, part 103.
136. gbcon104.seq - Constructed sequence entries, part 104.
137. gbcon105.seq - Constructed sequence entries, part 105.
138. gbcon106.seq - Constructed sequence entries, part 106.
139. gbcon107.seq - Constructed sequence entries, part 107.
140. gbcon108.seq - Constructed sequence entries, part 108.
141. gbcon109.seq - Constructed sequence entries, part 109.
142. gbcon11.seq - Constructed sequence entries, part 11.
143. gbcon110.seq - Constructed sequence entries, part 110.
144. gbcon111.seq - Constructed sequence entries, part 111.
145. gbcon112.seq - Constructed sequence entries, part 112.
146. gbcon113.seq - Constructed sequence entries, part 113.
147. gbcon114.seq - Constructed sequence entries, part 114.
148. gbcon115.seq - Constructed sequence entries, part 115.
149. gbcon116.seq - Constructed sequence entries, part 116.
150. gbcon117.seq - Constructed sequence entries, part 117.
151. gbcon118.seq - Constructed sequence entries, part 118.
152. gbcon119.seq - Constructed sequence entries, part 119.
153. gbcon12.seq - Constructed sequence entries, part 12.
154. gbcon120.seq - Constructed sequence entries, part 120.
155. gbcon121.seq - Constructed sequence entries, part 121.
156. gbcon122.seq - Constructed sequence entries, part 122.
157. gbcon123.seq - Constructed sequence entries, part 123.
158. gbcon124.seq - Constructed sequence entries, part 124.
159. gbcon125.seq - Constructed sequence entries, part 125.
160. gbcon126.seq - Constructed sequence entries, part 126.
161. gbcon127.seq - Constructed sequence entries, part 127.
162. gbcon128.seq - Constructed sequence entries, part 128.
163. gbcon129.seq - Constructed sequence entries, part 129.
164. gbcon13.seq - Constructed sequence entries, part 13.
165. gbcon130.seq - Constructed sequence entries, part 130.
166. gbcon131.seq - Constructed sequence entries, part 131.
167. gbcon132.seq - Constructed sequence entries, part 132.
168. gbcon133.seq - Constructed sequence entries, part 133.
169. gbcon134.seq - Constructed sequence entries, part 134.
170. gbcon14.seq - Constructed sequence entries, part 14.
171. gbcon15.seq - Constructed sequence entries, part 15.
172. gbcon16.seq - Constructed sequence entries, part 16.
173. gbcon17.seq - Constructed sequence entries, part 17.
174. gbcon18.seq - Constructed sequence entries, part 18.
175. gbcon19.seq - Constructed sequence entries, part 19.
176. gbcon2.seq - Constructed sequence entries, part 2.
177. gbcon20.seq - Constructed sequence entries, part 20.
178. gbcon21.seq - Constructed sequence entries, part 21.
179. gbcon22.seq - Constructed sequence entries, part 22.
180. gbcon23.seq - Constructed sequence entries, part 23.
181. gbcon24.seq - Constructed sequence entries, part 24.
182. gbcon25.seq - Constructed sequence entries, part 25.
183. gbcon26.seq - Constructed sequence entries, part 26.
184. gbcon27.seq - Constructed sequence entries, part 27.
185. gbcon28.seq - Constructed sequence entries, part 28.
186. gbcon29.seq - Constructed sequence entries, part 29.
187. gbcon3.seq - Constructed sequence entries, part 3.
188. gbcon30.seq - Constructed sequence entries, part 30.
189. gbcon31.seq - Constructed sequence entries, part 31.
190. gbcon32.seq - Constructed sequence entries, part 32.
191. gbcon33.seq - Constructed sequence entries, part 33.
192. gbcon34.seq - Constructed sequence entries, part 34.
193. gbcon35.seq - Constructed sequence entries, part 35.
194. gbcon36.seq - Constructed sequence entries, part 36.
195. gbcon37.seq - Constructed sequence entries, part 37.
196. gbcon38.seq - Constructed sequence entries, part 38.
197. gbcon39.seq - Constructed sequence entries, part 39.
198. gbcon4.seq - Constructed sequence entries, part 4.
199. gbcon40.seq - Constructed sequence entries, part 40.
200. gbcon41.seq - Constructed sequence entries, part 41.
201. gbcon42.seq - Constructed sequence entries, part 42.
202. gbcon43.seq - Constructed sequence entries, part 43.
203. gbcon44.seq - Constructed sequence entries, part 44.
204. gbcon45.seq - Constructed sequence entries, part 45.
205. gbcon46.seq - Constructed sequence entries, part 46.
206. gbcon47.seq - Constructed sequence entries, part 47.
207. gbcon48.seq - Constructed sequence entries, part 48.
208. gbcon49.seq - Constructed sequence entries, part 49.
209. gbcon5.seq - Constructed sequence entries, part 5.
210. gbcon50.seq - Constructed sequence entries, part 50.
211. gbcon51.seq - Constructed sequence entries, part 51.
212. gbcon52.seq - Constructed sequence entries, part 52.
213. gbcon53.seq - Constructed sequence entries, part 53.
214. gbcon54.seq - Constructed sequence entries, part 54.
215. gbcon55.seq - Constructed sequence entries, part 55.
216. gbcon56.seq - Constructed sequence entries, part 56.
217. gbcon57.seq - Constructed sequence entries, part 57.
218. gbcon58.seq - Constructed sequence entries, part 58.
219. gbcon59.seq - Constructed sequence entries, part 59.
220. gbcon6.seq - Constructed sequence entries, part 6.
221. gbcon60.seq - Constructed sequence entries, part 60.
222. gbcon61.seq - Constructed sequence entries, part 61.
223. gbcon62.seq - Constructed sequence entries, part 62.
224. gbcon63.seq - Constructed sequence entries, part 63.
225. gbcon64.seq - Constructed sequence entries, part 64.
226. gbcon65.seq - Constructed sequence entries, part 65.
227. gbcon66.seq - Constructed sequence entries, part 66.
228. gbcon67.seq - Constructed sequence entries, part 67.
229. gbcon68.seq - Constructed sequence entries, part 68.
230. gbcon69.seq - Constructed sequence entries, part 69.
231. gbcon7.seq - Constructed sequence entries, part 7.
232. gbcon70.seq - Constructed sequence entries, part 70.
233. gbcon71.seq - Constructed sequence entries, part 71.
234. gbcon72.seq - Constructed sequence entries, part 72.
235. gbcon73.seq - Constructed sequence entries, part 73.
236. gbcon74.seq - Constructed sequence entries, part 74.
237. gbcon75.seq - Constructed sequence entries, part 75.
238. gbcon76.seq - Constructed sequence entries, part 76.
239. gbcon77.seq - Constructed sequence entries, part 77.
240. gbcon78.seq - Constructed sequence entries, part 78.
241. gbcon79.seq - Constructed sequence entries, part 79.
242. gbcon8.seq - Constructed sequence entries, part 8.
243. gbcon80.seq - Constructed sequence entries, part 80.
244. gbcon81.seq - Constructed sequence entries, part 81.
245. gbcon82.seq - Constructed sequence entries, part 82.
246. gbcon83.seq - Constructed sequence entries, part 83.
247. gbcon84.seq - Constructed sequence entries, part 84.
248. gbcon85.seq - Constructed sequence entries, part 85.
249. gbcon86.seq - Constructed sequence entries, part 86.
250. gbcon87.seq - Constructed sequence entries, part 87.
251. gbcon88.seq - Constructed sequence entries, part 88.
252. gbcon89.seq - Constructed sequence entries, part 89.
253. gbcon9.seq - Constructed sequence entries, part 9.
254. gbcon90.seq - Constructed sequence entries, part 90.
255. gbcon91.seq - Constructed sequence entries, part 91.
256. gbcon92.seq - Constructed sequence entries, part 92.
257. gbcon93.seq - Constructed sequence entries, part 93.
258. gbcon94.seq - Constructed sequence entries, part 94.
259. gbcon95.seq - Constructed sequence entries, part 95.
260. gbcon96.seq - Constructed sequence entries, part 96.
261. gbcon97.seq - Constructed sequence entries, part 97.
262. gbcon98.seq - Constructed sequence entries, part 98.
263. gbcon99.seq - Constructed sequence entries, part 99.
264. gbdel.txt - Accession numbers of entries deleted since the previous release.
265. gbenv1.seq - Environmental sampling sequence entries, part 1.
266. gbenv10.seq - Environmental sampling sequence entries, part 10.
267. gbenv11.seq - Environmental sampling sequence entries, part 11.
268. gbenv12.seq - Environmental sampling sequence entries, part 12.
269. gbenv13.seq - Environmental sampling sequence entries, part 13.
270. gbenv14.seq - Environmental sampling sequence entries, part 14.
271. gbenv15.seq - Environmental sampling sequence entries, part 15.
272. gbenv16.seq - Environmental sampling sequence entries, part 16.
273. gbenv17.seq - Environmental sampling sequence entries, part 17.
274. gbenv18.seq - Environmental sampling sequence entries, part 18.
275. gbenv19.seq - Environmental sampling sequence entries, part 19.
276. gbenv2.seq - Environmental sampling sequence entries, part 2.
277. gbenv20.seq - Environmental sampling sequence entries, part 20.
278. gbenv21.seq - Environmental sampling sequence entries, part 21.
279. gbenv22.seq - Environmental sampling sequence entries, part 22.
280. gbenv23.seq - Environmental sampling sequence entries, part 23.
281. gbenv3.seq - Environmental sampling sequence entries, part 3.
282. gbenv4.seq - Environmental sampling sequence entries, part 4.
283. gbenv5.seq - Environmental sampling sequence entries, part 5.
284. gbenv6.seq - Environmental sampling sequence entries, part 6.
285. gbenv7.seq - Environmental sampling sequence entries, part 7.
286. gbenv8.seq - Environmental sampling sequence entries, part 8.
287. gbenv9.seq - Environmental sampling sequence entries, part 9.
288. gbest1.seq - EST (expressed sequence tag) sequence entries, part 1.
289. gbest10.seq - EST (expressed sequence tag) sequence entries, part 10.
290. gbest100.seq - EST (expressed sequence tag) sequence entries, part 100.
291. gbest101.seq - EST (expressed sequence tag) sequence entries, part 101.
292. gbest102.seq - EST (expressed sequence tag) sequence entries, part 102.
293. gbest103.seq - EST (expressed sequence tag) sequence entries, part 103.
294. gbest104.seq - EST (expressed sequence tag) sequence entries, part 104.
295. gbest105.seq - EST (expressed sequence tag) sequence entries, part 105.
296. gbest106.seq - EST (expressed sequence tag) sequence entries, part 106.
297. gbest107.seq - EST (expressed sequence tag) sequence entries, part 107.
298. gbest108.seq - EST (expressed sequence tag) sequence entries, part 108.
299. gbest109.seq - EST (expressed sequence tag) sequence entries, part 109.
300. gbest11.seq - EST (expressed sequence tag) sequence entries, part 11.
301. gbest110.seq - EST (expressed sequence tag) sequence entries, part 110.
302. gbest111.seq - EST (expressed sequence tag) sequence entries, part 111.
303. gbest112.seq - EST (expressed sequence tag) sequence entries, part 112.
304. gbest113.seq - EST (expressed sequence tag) sequence entries, part 113.
305. gbest114.seq - EST (expressed sequence tag) sequence entries, part 114.
306. gbest115.seq - EST (expressed sequence tag) sequence entries, part 115.
307. gbest116.seq - EST (expressed sequence tag) sequence entries, part 116.
308. gbest117.seq - EST (expressed sequence tag) sequence entries, part 117.
309. gbest118.seq - EST (expressed sequence tag) sequence entries, part 118.
310. gbest119.seq - EST (expressed sequence tag) sequence entries, part 119.
311. gbest12.seq - EST (expressed sequence tag) sequence entries, part 12.
312. gbest120.seq - EST (expressed sequence tag) sequence entries, part 120.
313. gbest121.seq - EST (expressed sequence tag) sequence entries, part 121.
314. gbest122.seq - EST (expressed sequence tag) sequence entries, part 122.
315. gbest123.seq - EST (expressed sequence tag) sequence entries, part 123.
316. gbest124.seq - EST (expressed sequence tag) sequence entries, part 124.
317. gbest125.seq - EST (expressed sequence tag) sequence entries, part 125.
318. gbest126.seq - EST (expressed sequence tag) sequence entries, part 126.
319. gbest127.seq - EST (expressed sequence tag) sequence entries, part 127.
320. gbest128.seq - EST (expressed sequence tag) sequence entries, part 128.
321. gbest129.seq - EST (expressed sequence tag) sequence entries, part 129.
322. gbest13.seq - EST (expressed sequence tag) sequence entries, part 13.
323. gbest130.seq - EST (expressed sequence tag) sequence entries, part 130.
324. gbest131.seq - EST (expressed sequence tag) sequence entries, part 131.
325. gbest132.seq - EST (expressed sequence tag) sequence entries, part 132.
326. gbest133.seq - EST (expressed sequence tag) sequence entries, part 133.
327. gbest134.seq - EST (expressed sequence tag) sequence entries, part 134.
328. gbest135.seq - EST (expressed sequence tag) sequence entries, part 135.
329. gbest136.seq - EST (expressed sequence tag) sequence entries, part 136.
330. gbest137.seq - EST (expressed sequence tag) sequence entries, part 137.
331. gbest138.seq - EST (expressed sequence tag) sequence entries, part 138.
332. gbest139.seq - EST (expressed sequence tag) sequence entries, part 139.
333. gbest14.seq - EST (expressed sequence tag) sequence entries, part 14.
334. gbest140.seq - EST (expressed sequence tag) sequence entries, part 140.
335. gbest141.seq - EST (expressed sequence tag) sequence entries, part 141.
336. gbest142.seq - EST (expressed sequence tag) sequence entries, part 142.
337. gbest143.seq - EST (expressed sequence tag) sequence entries, part 143.
338. gbest144.seq - EST (expressed sequence tag) sequence entries, part 144.
339. gbest145.seq - EST (expressed sequence tag) sequence entries, part 145.
340. gbest146.seq - EST (expressed sequence tag) sequence entries, part 146.
341. gbest147.seq - EST (expressed sequence tag) sequence entries, part 147.
342. gbest148.seq - EST (expressed sequence tag) sequence entries, part 148.
343. gbest149.seq - EST (expressed sequence tag) sequence entries, part 149.
344. gbest15.seq - EST (expressed sequence tag) sequence entries, part 15.
345. gbest150.seq - EST (expressed sequence tag) sequence entries, part 150.
346. gbest151.seq - EST (expressed sequence tag) sequence entries, part 151.
347. gbest152.seq - EST (expressed sequence tag) sequence entries, part 152.
348. gbest153.seq - EST (expressed sequence tag) sequence entries, part 153.
349. gbest154.seq - EST (expressed sequence tag) sequence entries, part 154.
350. gbest155.seq - EST (expressed sequence tag) sequence entries, part 155.
351. gbest156.seq - EST (expressed sequence tag) sequence entries, part 156.
352. gbest157.seq - EST (expressed sequence tag) sequence entries, part 157.
353. gbest158.seq - EST (expressed sequence tag) sequence entries, part 158.
354. gbest159.seq - EST (expressed sequence tag) sequence entries, part 159.
355. gbest16.seq - EST (expressed sequence tag) sequence entries, part 16.
356. gbest160.seq - EST (expressed sequence tag) sequence entries, part 160.
357. gbest161.seq - EST (expressed sequence tag) sequence entries, part 161.
358. gbest162.seq - EST (expressed sequence tag) sequence entries, part 162.
359. gbest163.seq - EST (expressed sequence tag) sequence entries, part 163.
360. gbest164.seq - EST (expressed sequence tag) sequence entries, part 164.
361. gbest165.seq - EST (expressed sequence tag) sequence entries, part 165.
362. gbest166.seq - EST (expressed sequence tag) sequence entries, part 166.
363. gbest167.seq - EST (expressed sequence tag) sequence entries, part 167.
364. gbest168.seq - EST (expressed sequence tag) sequence entries, part 168.
365. gbest169.seq - EST (expressed sequence tag) sequence entries, part 169.
366. gbest17.seq - EST (expressed sequence tag) sequence entries, part 17.
367. gbest170.seq - EST (expressed sequence tag) sequence entries, part 170.
368. gbest171.seq - EST (expressed sequence tag) sequence entries, part 171.
369. gbest172.seq - EST (expressed sequence tag) sequence entries, part 172.
370. gbest173.seq - EST (expressed sequence tag) sequence entries, part 173.
371. gbest174.seq - EST (expressed sequence tag) sequence entries, part 174.
372. gbest175.seq - EST (expressed sequence tag) sequence entries, part 175.
373. gbest176.seq - EST (expressed sequence tag) sequence entries, part 176.
374. gbest177.seq - EST (expressed sequence tag) sequence entries, part 177.
375. gbest178.seq - EST (expressed sequence tag) sequence entries, part 178.
376. gbest179.seq - EST (expressed sequence tag) sequence entries, part 179.
377. gbest18.seq - EST (expressed sequence tag) sequence entries, part 18.
378. gbest180.seq - EST (expressed sequence tag) sequence entries, part 180.
379. gbest181.seq - EST (expressed sequence tag) sequence entries, part 181.
380. gbest182.seq - EST (expressed sequence tag) sequence entries, part 182.
381. gbest183.seq - EST (expressed sequence tag) sequence entries, part 183.
382. gbest184.seq - EST (expressed sequence tag) sequence entries, part 184.
383. gbest185.seq - EST (expressed sequence tag) sequence entries, part 185.
384. gbest186.seq - EST (expressed sequence tag) sequence entries, part 186.
385. gbest187.seq - EST (expressed sequence tag) sequence entries, part 187.
386. gbest188.seq - EST (expressed sequence tag) sequence entries, part 188.
387. gbest189.seq - EST (expressed sequence tag) sequence entries, part 189.
388. gbest19.seq - EST (expressed sequence tag) sequence entries, part 19.
389. gbest190.seq - EST (expressed sequence tag) sequence entries, part 190.
390. gbest191.seq - EST (expressed sequence tag) sequence entries, part 191.
391. gbest192.seq - EST (expressed sequence tag) sequence entries, part 192.
392. gbest193.seq - EST (expressed sequence tag) sequence entries, part 193.
393. gbest194.seq - EST (expressed sequence tag) sequence entries, part 194.
394. gbest195.seq - EST (expressed sequence tag) sequence entries, part 195.
395. gbest196.seq - EST (expressed sequence tag) sequence entries, part 196.
396. gbest197.seq - EST (expressed sequence tag) sequence entries, part 197.
397. gbest198.seq - EST (expressed sequence tag) sequence entries, part 198.
398. gbest199.seq - EST (expressed sequence tag) sequence entries, part 199.
399. gbest2.seq - EST (expressed sequence tag) sequence entries, part 2.
400. gbest20.seq - EST (expressed sequence tag) sequence entries, part 20.
401. gbest200.seq - EST (expressed sequence tag) sequence entries, part 200.
402. gbest201.seq - EST (expressed sequence tag) sequence entries, part 201.
403. gbest202.seq - EST (expressed sequence tag) sequence entries, part 202.
404. gbest203.seq - EST (expressed sequence tag) sequence entries, part 203.
405. gbest204.seq - EST (expressed sequence tag) sequence entries, part 204.
406. gbest205.seq - EST (expressed sequence tag) sequence entries, part 205.
407. gbest206.seq - EST (expressed sequence tag) sequence entries, part 206.
408. gbest207.seq - EST (expressed sequence tag) sequence entries, part 207.
409. gbest208.seq - EST (expressed sequence tag) sequence entries, part 208.
410. gbest209.seq - EST (expressed sequence tag) sequence entries, part 209.
411. gbest21.seq - EST (expressed sequence tag) sequence entries, part 21.
412. gbest210.seq - EST (expressed sequence tag) sequence entries, part 210.
413. gbest211.seq - EST (expressed sequence tag) sequence entries, part 211.
414. gbest212.seq - EST (expressed sequence tag) sequence entries, part 212.
415. gbest213.seq - EST (expressed sequence tag) sequence entries, part 213.
416. gbest214.seq - EST (expressed sequence tag) sequence entries, part 214.
417. gbest215.seq - EST (expressed sequence tag) sequence entries, part 215.
418. gbest216.seq - EST (expressed sequence tag) sequence entries, part 216.
419. gbest217.seq - EST (expressed sequence tag) sequence entries, part 217.
420. gbest218.seq - EST (expressed sequence tag) sequence entries, part 218.
421. gbest219.seq - EST (expressed sequence tag) sequence entries, part 219.
422. gbest22.seq - EST (expressed sequence tag) sequence entries, part 22.
423. gbest220.seq - EST (expressed sequence tag) sequence entries, part 220.
424. gbest221.seq - EST (expressed sequence tag) sequence entries, part 221.
425. gbest222.seq - EST (expressed sequence tag) sequence entries, part 222.
426. gbest223.seq - EST (expressed sequence tag) sequence entries, part 223.
427. gbest224.seq - EST (expressed sequence tag) sequence entries, part 224.
428. gbest225.seq - EST (expressed sequence tag) sequence entries, part 225.
429. gbest226.seq - EST (expressed sequence tag) sequence entries, part 226.
430. gbest227.seq - EST (expressed sequence tag) sequence entries, part 227.
431. gbest228.seq - EST (expressed sequence tag) sequence entries, part 228.
432. gbest229.seq - EST (expressed sequence tag) sequence entries, part 229.
433. gbest23.seq - EST (expressed sequence tag) sequence entries, part 23.
434. gbest230.seq - EST (expressed sequence tag) sequence entries, part 230.
435. gbest231.seq - EST (expressed sequence tag) sequence entries, part 231.
436. gbest232.seq - EST (expressed sequence tag) sequence entries, part 232.
437. gbest233.seq - EST (expressed sequence tag) sequence entries, part 233.
438. gbest234.seq - EST (expressed sequence tag) sequence entries, part 234.
439. gbest235.seq - EST (expressed sequence tag) sequence entries, part 235.
440. gbest236.seq - EST (expressed sequence tag) sequence entries, part 236.
441. gbest237.seq - EST (expressed sequence tag) sequence entries, part 237.
442. gbest238.seq - EST (expressed sequence tag) sequence entries, part 238.
443. gbest239.seq - EST (expressed sequence tag) sequence entries, part 239.
444. gbest24.seq - EST (expressed sequence tag) sequence entries, part 24.
445. gbest240.seq - EST (expressed sequence tag) sequence entries, part 240.
446. gbest241.seq - EST (expressed sequence tag) sequence entries, part 241.
447. gbest242.seq - EST (expressed sequence tag) sequence entries, part 242.
448. gbest243.seq - EST (expressed sequence tag) sequence entries, part 243.
449. gbest244.seq - EST (expressed sequence tag) sequence entries, part 244.
450. gbest245.seq - EST (expressed sequence tag) sequence entries, part 245.
451. gbest246.seq - EST (expressed sequence tag) sequence entries, part 246.
452. gbest247.seq - EST (expressed sequence tag) sequence entries, part 247.
453. gbest248.seq - EST (expressed sequence tag) sequence entries, part 248.
454. gbest249.seq - EST (expressed sequence tag) sequence entries, part 249.
455. gbest25.seq - EST (expressed sequence tag) sequence entries, part 25.
456. gbest250.seq - EST (expressed sequence tag) sequence entries, part 250.
457. gbest251.seq - EST (expressed sequence tag) sequence entries, part 251.
458. gbest252.seq - EST (expressed sequence tag) sequence entries, part 252.
459. gbest253.seq - EST (expressed sequence tag) sequence entries, part 253.
460. gbest254.seq - EST (expressed sequence tag) sequence entries, part 254.
461. gbest255.seq - EST (expressed sequence tag) sequence entries, part 255.
462. gbest256.seq - EST (expressed sequence tag) sequence entries, part 256.
463. gbest257.seq - EST (expressed sequence tag) sequence entries, part 257.
464. gbest258.seq - EST (expressed sequence tag) sequence entries, part 258.
465. gbest259.seq - EST (expressed sequence tag) sequence entries, part 259.
466. gbest26.seq - EST (expressed sequence tag) sequence entries, part 26.
467. gbest260.seq - EST (expressed sequence tag) sequence entries, part 260.
468. gbest261.seq - EST (expressed sequence tag) sequence entries, part 261.
469. gbest262.seq - EST (expressed sequence tag) sequence entries, part 262.
470. gbest263.seq - EST (expressed sequence tag) sequence entries, part 263.
471. gbest264.seq - EST (expressed sequence tag) sequence entries, part 264.
472. gbest265.seq - EST (expressed sequence tag) sequence entries, part 265.
473. gbest266.seq - EST (expressed sequence tag) sequence entries, part 266.
474. gbest267.seq - EST (expressed sequence tag) sequence entries, part 267.
475. gbest268.seq - EST (expressed sequence tag) sequence entries, part 268.
476. gbest269.seq - EST (expressed sequence tag) sequence entries, part 269.
477. gbest27.seq - EST (expressed sequence tag) sequence entries, part 27.
478. gbest270.seq - EST (expressed sequence tag) sequence entries, part 270.
479. gbest271.seq - EST (expressed sequence tag) sequence entries, part 271.
480. gbest272.seq - EST (expressed sequence tag) sequence entries, part 272.
481. gbest273.seq - EST (expressed sequence tag) sequence entries, part 273.
482. gbest274.seq - EST (expressed sequence tag) sequence entries, part 274.
483. gbest275.seq - EST (expressed sequence tag) sequence entries, part 275.
484. gbest276.seq - EST (expressed sequence tag) sequence entries, part 276.
485. gbest277.seq - EST (expressed sequence tag) sequence entries, part 277.
486. gbest278.seq - EST (expressed sequence tag) sequence entries, part 278.
487. gbest279.seq - EST (expressed sequence tag) sequence entries, part 279.
488. gbest28.seq - EST (expressed sequence tag) sequence entries, part 28.
489. gbest280.seq - EST (expressed sequence tag) sequence entries, part 280.
490. gbest281.seq - EST (expressed sequence tag) sequence entries, part 281.
491. gbest282.seq - EST (expressed sequence tag) sequence entries, part 282.
492. gbest283.seq - EST (expressed sequence tag) sequence entries, part 283.
493. gbest284.seq - EST (expressed sequence tag) sequence entries, part 284.
494. gbest285.seq - EST (expressed sequence tag) sequence entries, part 285.
495. gbest286.seq - EST (expressed sequence tag) sequence entries, part 286.
496. gbest287.seq - EST (expressed sequence tag) sequence entries, part 287.
497. gbest288.seq - EST (expressed sequence tag) sequence entries, part 288.
498. gbest289.seq - EST (expressed sequence tag) sequence entries, part 289.
499. gbest29.seq - EST (expressed sequence tag) sequence entries, part 29.
500. gbest290.seq - EST (expressed sequence tag) sequence entries, part 290.
501. gbest291.seq - EST (expressed sequence tag) sequence entries, part 291.
502. gbest292.seq - EST (expressed sequence tag) sequence entries, part 292.
503. gbest293.seq - EST (expressed sequence tag) sequence entries, part 293.
504. gbest294.seq - EST (expressed sequence tag) sequence entries, part 294.
505. gbest295.seq - EST (expressed sequence tag) sequence entries, part 295.
506. gbest296.seq - EST (expressed sequence tag) sequence entries, part 296.
507. gbest297.seq - EST (expressed sequence tag) sequence entries, part 297.
508. gbest298.seq - EST (expressed sequence tag) sequence entries, part 298.
509. gbest299.seq - EST (expressed sequence tag) sequence entries, part 299.
510. gbest3.seq - EST (expressed sequence tag) sequence entries, part 3.
511. gbest30.seq - EST (expressed sequence tag) sequence entries, part 30.
512. gbest300.seq - EST (expressed sequence tag) sequence entries, part 300.
513. gbest301.seq - EST (expressed sequence tag) sequence entries, part 301.
514. gbest302.seq - EST (expressed sequence tag) sequence entries, part 302.
515. gbest303.seq - EST (expressed sequence tag) sequence entries, part 303.
516. gbest304.seq - EST (expressed sequence tag) sequence entries, part 304.
517. gbest305.seq - EST (expressed sequence tag) sequence entries, part 305.
518. gbest306.seq - EST (expressed sequence tag) sequence entries, part 306.
519. gbest307.seq - EST (expressed sequence tag) sequence entries, part 307.
520. gbest308.seq - EST (expressed sequence tag) sequence entries, part 308.
521. gbest309.seq - EST (expressed sequence tag) sequence entries, part 309.
522. gbest31.seq - EST (expressed sequence tag) sequence entries, part 31.
523. gbest310.seq - EST (expressed sequence tag) sequence entries, part 310.
524. gbest311.seq - EST (expressed sequence tag) sequence entries, part 311.
525. gbest312.seq - EST (expressed sequence tag) sequence entries, part 312.
526. gbest313.seq - EST (expressed sequence tag) sequence entries, part 313.
527. gbest314.seq - EST (expressed sequence tag) sequence entries, part 314.
528. gbest315.seq - EST (expressed sequence tag) sequence entries, part 315.
529. gbest316.seq - EST (expressed sequence tag) sequence entries, part 316.
530. gbest317.seq - EST (expressed sequence tag) sequence entries, part 317.
531. gbest318.seq - EST (expressed sequence tag) sequence entries, part 318.
532. gbest319.seq - EST (expressed sequence tag) sequence entries, part 319.
533. gbest32.seq - EST (expressed sequence tag) sequence entries, part 32.
534. gbest320.seq - EST (expressed sequence tag) sequence entries, part 320.
535. gbest321.seq - EST (expressed sequence tag) sequence entries, part 321.
536. gbest322.seq - EST (expressed sequence tag) sequence entries, part 322.
537. gbest323.seq - EST (expressed sequence tag) sequence entries, part 323.
538. gbest324.seq - EST (expressed sequence tag) sequence entries, part 324.
539. gbest325.seq - EST (expressed sequence tag) sequence entries, part 325.
540. gbest326.seq - EST (expressed sequence tag) sequence entries, part 326.
541. gbest327.seq - EST (expressed sequence tag) sequence entries, part 327.
542. gbest328.seq - EST (expressed sequence tag) sequence entries, part 328.
543. gbest329.seq - EST (expressed sequence tag) sequence entries, part 329.
544. gbest33.seq - EST (expressed sequence tag) sequence entries, part 33.
545. gbest330.seq - EST (expressed sequence tag) sequence entries, part 330.
546. gbest331.seq - EST (expressed sequence tag) sequence entries, part 331.
547. gbest332.seq - EST (expressed sequence tag) sequence entries, part 332.
548. gbest333.seq - EST (expressed sequence tag) sequence entries, part 333.
549. gbest334.seq - EST (expressed sequence tag) sequence entries, part 334.
550. gbest335.seq - EST (expressed sequence tag) sequence entries, part 335.
551. gbest336.seq - EST (expressed sequence tag) sequence entries, part 336.
552. gbest337.seq - EST (expressed sequence tag) sequence entries, part 337.
553. gbest338.seq - EST (expressed sequence tag) sequence entries, part 338.
554. gbest339.seq - EST (expressed sequence tag) sequence entries, part 339.
555. gbest34.seq - EST (expressed sequence tag) sequence entries, part 34.
556. gbest340.seq - EST (expressed sequence tag) sequence entries, part 340.
557. gbest341.seq - EST (expressed sequence tag) sequence entries, part 341.
558. gbest342.seq - EST (expressed sequence tag) sequence entries, part 342.
559. gbest343.seq - EST (expressed sequence tag) sequence entries, part 343.
560. gbest344.seq - EST (expressed sequence tag) sequence entries, part 344.
561. gbest345.seq - EST (expressed sequence tag) sequence entries, part 345.
562. gbest346.seq - EST (expressed sequence tag) sequence entries, part 346.
563. gbest347.seq - EST (expressed sequence tag) sequence entries, part 347.
564. gbest348.seq - EST (expressed sequence tag) sequence entries, part 348.
565. gbest349.seq - EST (expressed sequence tag) sequence entries, part 349.
566. gbest35.seq - EST (expressed sequence tag) sequence entries, part 35.
567. gbest350.seq - EST (expressed sequence tag) sequence entries, part 350.
568. gbest351.seq - EST (expressed sequence tag) sequence entries, part 351.
569. gbest352.seq - EST (expressed sequence tag) sequence entries, part 352.
570. gbest353.seq - EST (expressed sequence tag) sequence entries, part 353.
571. gbest354.seq - EST (expressed sequence tag) sequence entries, part 354.
572. gbest355.seq - EST (expressed sequence tag) sequence entries, part 355.
573. gbest356.seq - EST (expressed sequence tag) sequence entries, part 356.
574. gbest357.seq - EST (expressed sequence tag) sequence entries, part 357.
575. gbest358.seq - EST (expressed sequence tag) sequence entries, part 358.
576. gbest359.seq - EST (expressed sequence tag) sequence entries, part 359.
577. gbest36.seq - EST (expressed sequence tag) sequence entries, part 36.
578. gbest360.seq - EST (expressed sequence tag) sequence entries, part 360.
579. gbest361.seq - EST (expressed sequence tag) sequence entries, part 361.
580. gbest362.seq - EST (expressed sequence tag) sequence entries, part 362.
581. gbest363.seq - EST (expressed sequence tag) sequence entries, part 363.
582. gbest364.seq - EST (expressed sequence tag) sequence entries, part 364.
583. gbest365.seq - EST (expressed sequence tag) sequence entries, part 365.
584. gbest366.seq - EST (expressed sequence tag) sequence entries, part 366.
585. gbest367.seq - EST (expressed sequence tag) sequence entries, part 367.
586. gbest368.seq - EST (expressed sequence tag) sequence entries, part 368.
587. gbest369.seq - EST (expressed sequence tag) sequence entries, part 369.
588. gbest37.seq - EST (expressed sequence tag) sequence entries, part 37.
589. gbest370.seq - EST (expressed sequence tag) sequence entries, part 370.
590. gbest371.seq - EST (expressed sequence tag) sequence entries, part 371.
591. gbest372.seq - EST (expressed sequence tag) sequence entries, part 372.
592. gbest373.seq - EST (expressed sequence tag) sequence entries, part 373.
593. gbest374.seq - EST (expressed sequence tag) sequence entries, part 374.
594. gbest375.seq - EST (expressed sequence tag) sequence entries, part 375.
595. gbest376.seq - EST (expressed sequence tag) sequence entries, part 376.
596. gbest377.seq - EST (expressed sequence tag) sequence entries, part 377.
597. gbest378.seq - EST (expressed sequence tag) sequence entries, part 378.
598. gbest379.seq - EST (expressed sequence tag) sequence entries, part 379.
599. gbest38.seq - EST (expressed sequence tag) sequence entries, part 38.
600. gbest380.seq - EST (expressed sequence tag) sequence entries, part 380.
601. gbest381.seq - EST (expressed sequence tag) sequence entries, part 381.
602. gbest382.seq - EST (expressed sequence tag) sequence entries, part 382.
603. gbest383.seq - EST (expressed sequence tag) sequence entries, part 383.
604. gbest384.seq - EST (expressed sequence tag) sequence entries, part 384.
605. gbest385.seq - EST (expressed sequence tag) sequence entries, part 385.
606. gbest386.seq - EST (expressed sequence tag) sequence entries, part 386.
607. gbest387.seq - EST (expressed sequence tag) sequence entries, part 387.
608. gbest388.seq - EST (expressed sequence tag) sequence entries, part 388.
609. gbest389.seq - EST (expressed sequence tag) sequence entries, part 389.
610. gbest39.seq - EST (expressed sequence tag) sequence entries, part 39.
611. gbest390.seq - EST (expressed sequence tag) sequence entries, part 390.
612. gbest391.seq - EST (expressed sequence tag) sequence entries, part 391.
613. gbest392.seq - EST (expressed sequence tag) sequence entries, part 392.
614. gbest393.seq - EST (expressed sequence tag) sequence entries, part 393.
615. gbest394.seq - EST (expressed sequence tag) sequence entries, part 394.
616. gbest395.seq - EST (expressed sequence tag) sequence entries, part 395.
617. gbest396.seq - EST (expressed sequence tag) sequence entries, part 396.
618. gbest397.seq - EST (expressed sequence tag) sequence entries, part 397.
619. gbest398.seq - EST (expressed sequence tag) sequence entries, part 398.
620. gbest399.seq - EST (expressed sequence tag) sequence entries, part 399.
621. gbest4.seq - EST (expressed sequence tag) sequence entries, part 4.
622. gbest40.seq - EST (expressed sequence tag) sequence entries, part 40.
623. gbest400.seq - EST (expressed sequence tag) sequence entries, part 400.
624. gbest401.seq - EST (expressed sequence tag) sequence entries, part 401.
625. gbest402.seq - EST (expressed sequence tag) sequence entries, part 402.
626. gbest403.seq - EST (expressed sequence tag) sequence entries, part 403.
627. gbest404.seq - EST (expressed sequence tag) sequence entries, part 404.
628. gbest405.seq - EST (expressed sequence tag) sequence entries, part 405.
629. gbest406.seq - EST (expressed sequence tag) sequence entries, part 406.
630. gbest407.seq - EST (expressed sequence tag) sequence entries, part 407.
631. gbest408.seq - EST (expressed sequence tag) sequence entries, part 408.
632. gbest409.seq - EST (expressed sequence tag) sequence entries, part 409.
633. gbest41.seq - EST (expressed sequence tag) sequence entries, part 41.
634. gbest410.seq - EST (expressed sequence tag) sequence entries, part 410.
635. gbest42.seq - EST (expressed sequence tag) sequence entries, part 42.
636. gbest43.seq - EST (expressed sequence tag) sequence entries, part 43.
637. gbest44.seq - EST (expressed sequence tag) sequence entries, part 44.
638. gbest45.seq - EST (expressed sequence tag) sequence entries, part 45.
639. gbest46.seq - EST (expressed sequence tag) sequence entries, part 46.
640. gbest47.seq - EST (expressed sequence tag) sequence entries, part 47.
641. gbest48.seq - EST (expressed sequence tag) sequence entries, part 48.
642. gbest49.seq - EST (expressed sequence tag) sequence entries, part 49.
643. gbest5.seq - EST (expressed sequence tag) sequence entries, part 5.
644. gbest50.seq - EST (expressed sequence tag) sequence entries, part 50.
645. gbest51.seq - EST (expressed sequence tag) sequence entries, part 51.
646. gbest52.seq - EST (expressed sequence tag) sequence entries, part 52.
647. gbest53.seq - EST (expressed sequence tag) sequence entries, part 53.
648. gbest54.seq - EST (expressed sequence tag) sequence entries, part 54.
649. gbest55.seq - EST (expressed sequence tag) sequence entries, part 55.
650. gbest56.seq - EST (expressed sequence tag) sequence entries, part 56.
651. gbest57.seq - EST (expressed sequence tag) sequence entries, part 57.
652. gbest58.seq - EST (expressed sequence tag) sequence entries, part 58.
653. gbest59.seq - EST (expressed sequence tag) sequence entries, part 59.
654. gbest6.seq - EST (expressed sequence tag) sequence entries, part 6.
655. gbest60.seq - EST (expressed sequence tag) sequence entries, part 60.
656. gbest61.seq - EST (expressed sequence tag) sequence entries, part 61.
657. gbest62.seq - EST (expressed sequence tag) sequence entries, part 62.
658. gbest63.seq - EST (expressed sequence tag) sequence entries, part 63.
659. gbest64.seq - EST (expressed sequence tag) sequence entries, part 64.
660. gbest65.seq - EST (expressed sequence tag) sequence entries, part 65.
661. gbest66.seq - EST (expressed sequence tag) sequence entries, part 66.
662. gbest67.seq - EST (expressed sequence tag) sequence entries, part 67.
663. gbest68.seq - EST (expressed sequence tag) sequence entries, part 68.
664. gbest69.seq - EST (expressed sequence tag) sequence entries, part 69.
665. gbest7.seq - EST (expressed sequence tag) sequence entries, part 7.
666. gbest70.seq - EST (expressed sequence tag) sequence entries, part 70.
667. gbest71.seq - EST (expressed sequence tag) sequence entries, part 71.
668. gbest72.seq - EST (expressed sequence tag) sequence entries, part 72.
669. gbest73.seq - EST (expressed sequence tag) sequence entries, part 73.
670. gbest74.seq - EST (expressed sequence tag) sequence entries, part 74.
671. gbest75.seq - EST (expressed sequence tag) sequence entries, part 75.
672. gbest76.seq - EST (expressed sequence tag) sequence entries, part 76.
673. gbest77.seq - EST (expressed sequence tag) sequence entries, part 77.
674. gbest78.seq - EST (expressed sequence tag) sequence entries, part 78.
675. gbest79.seq - EST (expressed sequence tag) sequence entries, part 79.
676. gbest8.seq - EST (expressed sequence tag) sequence entries, part 8.
677. gbest80.seq - EST (expressed sequence tag) sequence entries, part 80.
678. gbest81.seq - EST (expressed sequence tag) sequence entries, part 81.
679. gbest82.seq - EST (expressed sequence tag) sequence entries, part 82.
680. gbest83.seq - EST (expressed sequence tag) sequence entries, part 83.
681. gbest84.seq - EST (expressed sequence tag) sequence entries, part 84.
682. gbest85.seq - EST (expressed sequence tag) sequence entries, part 85.
683. gbest86.seq - EST (expressed sequence tag) sequence entries, part 86.
684. gbest87.seq - EST (expressed sequence tag) sequence entries, part 87.
685. gbest88.seq - EST (expressed sequence tag) sequence entries, part 88.
686. gbest89.seq - EST (expressed sequence tag) sequence entries, part 89.
687. gbest9.seq - EST (expressed sequence tag) sequence entries, part 9.
688. gbest90.seq - EST (expressed sequence tag) sequence entries, part 90.
689. gbest91.seq - EST (expressed sequence tag) sequence entries, part 91.
690. gbest92.seq - EST (expressed sequence tag) sequence entries, part 92.
691. gbest93.seq - EST (expressed sequence tag) sequence entries, part 93.
692. gbest94.seq - EST (expressed sequence tag) sequence entries, part 94.
693. gbest95.seq - EST (expressed sequence tag) sequence entries, part 95.
694. gbest96.seq - EST (expressed sequence tag) sequence entries, part 96.
695. gbest97.seq - EST (expressed sequence tag) sequence entries, part 97.
696. gbest98.seq - EST (expressed sequence tag) sequence entries, part 98.
697. gbest99.seq - EST (expressed sequence tag) sequence entries, part 99.
698. gbgen.idx - Index of the entries according to gene symbols.
699. gbgss1.seq - GSS (genome survey sequence) sequence entries, part 1.
700. gbgss10.seq - GSS (genome survey sequence) sequence entries, part 10.
701. gbgss100.seq - GSS (genome survey sequence) sequence entries, part 100.
702. gbgss101.seq - GSS (genome survey sequence) sequence entries, part 101.
703. gbgss102.seq - GSS (genome survey sequence) sequence entries, part 102.
704. gbgss103.seq - GSS (genome survey sequence) sequence entries, part 103.
705. gbgss104.seq - GSS (genome survey sequence) sequence entries, part 104.
706. gbgss105.seq - GSS (genome survey sequence) sequence entries, part 105.
707. gbgss106.seq - GSS (genome survey sequence) sequence entries, part 106.
708. gbgss107.seq - GSS (genome survey sequence) sequence entries, part 107.
709. gbgss108.seq - GSS (genome survey sequence) sequence entries, part 108.
710. gbgss109.seq - GSS (genome survey sequence) sequence entries, part 109.
711. gbgss11.seq - GSS (genome survey sequence) sequence entries, part 11.
712. gbgss110.seq - GSS (genome survey sequence) sequence entries, part 110.
713. gbgss111.seq - GSS (genome survey sequence) sequence entries, part 111.
714. gbgss112.seq - GSS (genome survey sequence) sequence entries, part 112.
715. gbgss113.seq - GSS (genome survey sequence) sequence entries, part 113.
716. gbgss114.seq - GSS (genome survey sequence) sequence entries, part 114.
717. gbgss115.seq - GSS (genome survey sequence) sequence entries, part 115.
718. gbgss116.seq - GSS (genome survey sequence) sequence entries, part 116.
719. gbgss117.seq - GSS (genome survey sequence) sequence entries, part 117.
720. gbgss118.seq - GSS (genome survey sequence) sequence entries, part 118.
721. gbgss119.seq - GSS (genome survey sequence) sequence entries, part 119.
722. gbgss12.seq - GSS (genome survey sequence) sequence entries, part 12.
723. gbgss120.seq - GSS (genome survey sequence) sequence entries, part 120.
724. gbgss121.seq - GSS (genome survey sequence) sequence entries, part 121.
725. gbgss122.seq - GSS (genome survey sequence) sequence entries, part 122.
726. gbgss123.seq - GSS (genome survey sequence) sequence entries, part 123.
727. gbgss124.seq - GSS (genome survey sequence) sequence entries, part 124.
728. gbgss125.seq - GSS (genome survey sequence) sequence entries, part 125.
729. gbgss126.seq - GSS (genome survey sequence) sequence entries, part 126.
730. gbgss127.seq - GSS (genome survey sequence) sequence entries, part 127.
731. gbgss128.seq - GSS (genome survey sequence) sequence entries, part 128.
732. gbgss129.seq - GSS (genome survey sequence) sequence entries, part 129.
733. gbgss13.seq - GSS (genome survey sequence) sequence entries, part 13.
734. gbgss130.seq - GSS (genome survey sequence) sequence entries, part 130.
735. gbgss131.seq - GSS (genome survey sequence) sequence entries, part 131.
736. gbgss132.seq - GSS (genome survey sequence) sequence entries, part 132.
737. gbgss133.seq - GSS (genome survey sequence) sequence entries, part 133.
738. gbgss134.seq - GSS (genome survey sequence) sequence entries, part 134.
739. gbgss135.seq - GSS (genome survey sequence) sequence entries, part 135.
740. gbgss136.seq - GSS (genome survey sequence) sequence entries, part 136.
741. gbgss137.seq - GSS (genome survey sequence) sequence entries, part 137.
742. gbgss138.seq - GSS (genome survey sequence) sequence entries, part 138.
743. gbgss139.seq - GSS (genome survey sequence) sequence entries, part 139.
744. gbgss14.seq - GSS (genome survey sequence) sequence entries, part 14.
745. gbgss140.seq - GSS (genome survey sequence) sequence entries, part 140.
746. gbgss141.seq - GSS (genome survey sequence) sequence entries, part 141.
747. gbgss142.seq - GSS (genome survey sequence) sequence entries, part 142.
748. gbgss143.seq - GSS (genome survey sequence) sequence entries, part 143.
749. gbgss144.seq - GSS (genome survey sequence) sequence entries, part 144.
750. gbgss145.seq - GSS (genome survey sequence) sequence entries, part 145.
751. gbgss146.seq - GSS (genome survey sequence) sequence entries, part 146.
752. gbgss147.seq - GSS (genome survey sequence) sequence entries, part 147.
753. gbgss148.seq - GSS (genome survey sequence) sequence entries, part 148.
754. gbgss149.seq - GSS (genome survey sequence) sequence entries, part 149.
755. gbgss15.seq - GSS (genome survey sequence) sequence entries, part 15.
756. gbgss150.seq - GSS (genome survey sequence) sequence entries, part 150.
757. gbgss151.seq - GSS (genome survey sequence) sequence entries, part 151.
758. gbgss152.seq - GSS (genome survey sequence) sequence entries, part 152.
759. gbgss153.seq - GSS (genome survey sequence) sequence entries, part 153.
760. gbgss154.seq - GSS (genome survey sequence) sequence entries, part 154.
761. gbgss155.seq - GSS (genome survey sequence) sequence entries, part 155.
762. gbgss156.seq - GSS (genome survey sequence) sequence entries, part 156.
763. gbgss157.seq - GSS (genome survey sequence) sequence entries, part 157.
764. gbgss158.seq - GSS (genome survey sequence) sequence entries, part 158.
765. gbgss159.seq - GSS (genome survey sequence) sequence entries, part 159.
766. gbgss16.seq - GSS (genome survey sequence) sequence entries, part 16.
767. gbgss160.seq - GSS (genome survey sequence) sequence entries, part 160.
768. gbgss161.seq - GSS (genome survey sequence) sequence entries, part 161.
769. gbgss162.seq - GSS (genome survey sequence) sequence entries, part 162.
770. gbgss163.seq - GSS (genome survey sequence) sequence entries, part 163.
771. gbgss164.seq - GSS (genome survey sequence) sequence entries, part 164.
772. gbgss165.seq - GSS (genome survey sequence) sequence entries, part 165.
773. gbgss166.seq - GSS (genome survey sequence) sequence entries, part 166.
774. gbgss167.seq - GSS (genome survey sequence) sequence entries, part 167.
775. gbgss168.seq - GSS (genome survey sequence) sequence entries, part 168.
776. gbgss169.seq - GSS (genome survey sequence) sequence entries, part 169.
777. gbgss17.seq - GSS (genome survey sequence) sequence entries, part 17.
778. gbgss170.seq - GSS (genome survey sequence) sequence entries, part 170.
779. gbgss171.seq - GSS (genome survey sequence) sequence entries, part 171.
780. gbgss172.seq - GSS (genome survey sequence) sequence entries, part 172.
781. gbgss173.seq - GSS (genome survey sequence) sequence entries, part 173.
782. gbgss174.seq - GSS (genome survey sequence) sequence entries, part 174.
783. gbgss175.seq - GSS (genome survey sequence) sequence entries, part 175.
784. gbgss176.seq - GSS (genome survey sequence) sequence entries, part 176.
785. gbgss177.seq - GSS (genome survey sequence) sequence entries, part 177.
786. gbgss178.seq - GSS (genome survey sequence) sequence entries, part 178.
787. gbgss179.seq - GSS (genome survey sequence) sequence entries, part 179.
788. gbgss18.seq - GSS (genome survey sequence) sequence entries, part 18.
789. gbgss180.seq - GSS (genome survey sequence) sequence entries, part 180.
790. gbgss181.seq - GSS (genome survey sequence) sequence entries, part 181.
791. gbgss182.seq - GSS (genome survey sequence) sequence entries, part 182.
792. gbgss183.seq - GSS (genome survey sequence) sequence entries, part 183.
793. gbgss184.seq - GSS (genome survey sequence) sequence entries, part 184.
794. gbgss185.seq - GSS (genome survey sequence) sequence entries, part 185.
795. gbgss186.seq - GSS (genome survey sequence) sequence entries, part 186.
796. gbgss187.seq - GSS (genome survey sequence) sequence entries, part 187.
797. gbgss188.seq - GSS (genome survey sequence) sequence entries, part 188.
798. gbgss189.seq - GSS (genome survey sequence) sequence entries, part 189.
799. gbgss19.seq - GSS (genome survey sequence) sequence entries, part 19.
800. gbgss190.seq - GSS (genome survey sequence) sequence entries, part 190.
801. gbgss191.seq - GSS (genome survey sequence) sequence entries, part 191.
802. gbgss192.seq - GSS (genome survey sequence) sequence entries, part 192.
803. gbgss193.seq - GSS (genome survey sequence) sequence entries, part 193.
804. gbgss194.seq - GSS (genome survey sequence) sequence entries, part 194.
805. gbgss195.seq - GSS (genome survey sequence) sequence entries, part 195.
806. gbgss196.seq - GSS (genome survey sequence) sequence entries, part 196.
807. gbgss197.seq - GSS (genome survey sequence) sequence entries, part 197.
808. gbgss198.seq - GSS (genome survey sequence) sequence entries, part 198.
809. gbgss199.seq - GSS (genome survey sequence) sequence entries, part 199.
810. gbgss2.seq - GSS (genome survey sequence) sequence entries, part 2.
811. gbgss20.seq - GSS (genome survey sequence) sequence entries, part 20.
812. gbgss200.seq - GSS (genome survey sequence) sequence entries, part 200.
813. gbgss201.seq - GSS (genome survey sequence) sequence entries, part 201.
814. gbgss202.seq - GSS (genome survey sequence) sequence entries, part 202.
815. gbgss203.seq - GSS (genome survey sequence) sequence entries, part 203.
816. gbgss204.seq - GSS (genome survey sequence) sequence entries, part 204.
817. gbgss205.seq - GSS (genome survey sequence) sequence entries, part 205.
818. gbgss206.seq - GSS (genome survey sequence) sequence entries, part 206.
819. gbgss207.seq - GSS (genome survey sequence) sequence entries, part 207.
820. gbgss208.seq - GSS (genome survey sequence) sequence entries, part 208.
821. gbgss21.seq - GSS (genome survey sequence) sequence entries, part 21.
822. gbgss22.seq - GSS (genome survey sequence) sequence entries, part 22.
823. gbgss23.seq - GSS (genome survey sequence) sequence entries, part 23.
824. gbgss24.seq - GSS (genome survey sequence) sequence entries, part 24.
825. gbgss25.seq - GSS (genome survey sequence) sequence entries, part 25.
826. gbgss26.seq - GSS (genome survey sequence) sequence entries, part 26.
827. gbgss27.seq - GSS (genome survey sequence) sequence entries, part 27.
828. gbgss28.seq - GSS (genome survey sequence) sequence entries, part 28.
829. gbgss29.seq - GSS (genome survey sequence) sequence entries, part 29.
830. gbgss3.seq - GSS (genome survey sequence) sequence entries, part 3.
831. gbgss30.seq - GSS (genome survey sequence) sequence entries, part 30.
832. gbgss31.seq - GSS (genome survey sequence) sequence entries, part 31.
833. gbgss32.seq - GSS (genome survey sequence) sequence entries, part 32.
834. gbgss33.seq - GSS (genome survey sequence) sequence entries, part 33.
835. gbgss34.seq - GSS (genome survey sequence) sequence entries, part 34.
836. gbgss35.seq - GSS (genome survey sequence) sequence entries, part 35.
837. gbgss36.seq - GSS (genome survey sequence) sequence entries, part 36.
838. gbgss37.seq - GSS (genome survey sequence) sequence entries, part 37.
839. gbgss38.seq - GSS (genome survey sequence) sequence entries, part 38.
840. gbgss39.seq - GSS (genome survey sequence) sequence entries, part 39.
841. gbgss4.seq - GSS (genome survey sequence) sequence entries, part 4.
842. gbgss40.seq - GSS (genome survey sequence) sequence entries, part 40.
843. gbgss41.seq - GSS (genome survey sequence) sequence entries, part 41.
844. gbgss42.seq - GSS (genome survey sequence) sequence entries, part 42.
845. gbgss43.seq - GSS (genome survey sequence) sequence entries, part 43.
846. gbgss44.seq - GSS (genome survey sequence) sequence entries, part 44.
847. gbgss45.seq - GSS (genome survey sequence) sequence entries, part 45.
848. gbgss46.seq - GSS (genome survey sequence) sequence entries, part 46.
849. gbgss47.seq - GSS (genome survey sequence) sequence entries, part 47.
850. gbgss48.seq - GSS (genome survey sequence) sequence entries, part 48.
851. gbgss49.seq - GSS (genome survey sequence) sequence entries, part 49.
852. gbgss5.seq - GSS (genome survey sequence) sequence entries, part 5.
853. gbgss50.seq - GSS (genome survey sequence) sequence entries, part 50.
854. gbgss51.seq - GSS (genome survey sequence) sequence entries, part 51.
855. gbgss52.seq - GSS (genome survey sequence) sequence entries, part 52.
856. gbgss53.seq - GSS (genome survey sequence) sequence entries, part 53.
857. gbgss54.seq - GSS (genome survey sequence) sequence entries, part 54.
858. gbgss55.seq - GSS (genome survey sequence) sequence entries, part 55.
859. gbgss56.seq - GSS (genome survey sequence) sequence entries, part 56.
860. gbgss57.seq - GSS (genome survey sequence) sequence entries, part 57.
861. gbgss58.seq - GSS (genome survey sequence) sequence entries, part 58.
862. gbgss59.seq - GSS (genome survey sequence) sequence entries, part 59.
863. gbgss6.seq - GSS (genome survey sequence) sequence entries, part 6.
864. gbgss60.seq - GSS (genome survey sequence) sequence entries, part 60.
865. gbgss61.seq - GSS (genome survey sequence) sequence entries, part 61.
866. gbgss62.seq - GSS (genome survey sequence) sequence entries, part 62.
867. gbgss63.seq - GSS (genome survey sequence) sequence entries, part 63.
868. gbgss64.seq - GSS (genome survey sequence) sequence entries, part 64.
869. gbgss65.seq - GSS (genome survey sequence) sequence entries, part 65.
870. gbgss66.seq - GSS (genome survey sequence) sequence entries, part 66.
871. gbgss67.seq - GSS (genome survey sequence) sequence entries, part 67.
872. gbgss68.seq - GSS (genome survey sequence) sequence entries, part 68.
873. gbgss69.seq - GSS (genome survey sequence) sequence entries, part 69.
874. gbgss7.seq - GSS (genome survey sequence) sequence entries, part 7.
875. gbgss70.seq - GSS (genome survey sequence) sequence entries, part 70.
876. gbgss71.seq - GSS (genome survey sequence) sequence entries, part 71.
877. gbgss72.seq - GSS (genome survey sequence) sequence entries, part 72.
878. gbgss73.seq - GSS (genome survey sequence) sequence entries, part 73.
879. gbgss74.seq - GSS (genome survey sequence) sequence entries, part 74.
880. gbgss75.seq - GSS (genome survey sequence) sequence entries, part 75.
881. gbgss76.seq - GSS (genome survey sequence) sequence entries, part 76.
882. gbgss77.seq - GSS (genome survey sequence) sequence entries, part 77.
883. gbgss78.seq - GSS (genome survey sequence) sequence entries, part 78.
884. gbgss79.seq - GSS (genome survey sequence) sequence entries, part 79.
885. gbgss8.seq - GSS (genome survey sequence) sequence entries, part 8.
886. gbgss80.seq - GSS (genome survey sequence) sequence entries, part 80.
887. gbgss81.seq - GSS (genome survey sequence) sequence entries, part 81.
888. gbgss82.seq - GSS (genome survey sequence) sequence entries, part 82.
889. gbgss83.seq - GSS (genome survey sequence) sequence entries, part 83.
890. gbgss84.seq - GSS (genome survey sequence) sequence entries, part 84.
891. gbgss85.seq - GSS (genome survey sequence) sequence entries, part 85.
892. gbgss86.seq - GSS (genome survey sequence) sequence entries, part 86.
893. gbgss87.seq - GSS (genome survey sequence) sequence entries, part 87.
894. gbgss88.seq - GSS (genome survey sequence) sequence entries, part 88.
895. gbgss89.seq - GSS (genome survey sequence) sequence entries, part 89.
896. gbgss9.seq - GSS (genome survey sequence) sequence entries, part 9.
897. gbgss90.seq - GSS (genome survey sequence) sequence entries, part 90.
898. gbgss91.seq - GSS (genome survey sequence) sequence entries, part 91.
899. gbgss92.seq - GSS (genome survey sequence) sequence entries, part 92.
900. gbgss93.seq - GSS (genome survey sequence) sequence entries, part 93.
901. gbgss94.seq - GSS (genome survey sequence) sequence entries, part 94.
902. gbgss95.seq - GSS (genome survey sequence) sequence entries, part 95.
903. gbgss96.seq - GSS (genome survey sequence) sequence entries, part 96.
904. gbgss97.seq - GSS (genome survey sequence) sequence entries, part 97.
905. gbgss98.seq - GSS (genome survey sequence) sequence entries, part 98.
906. gbgss99.seq - GSS (genome survey sequence) sequence entries, part 99.
907. gbhtc1.seq - HTC (high throughput cDNA sequencing) sequence entries, part 1.
908. gbhtc10.seq - HTC (high throughput cDNA sequencing) sequence entries, part 10.
909. gbhtc11.seq - HTC (high throughput cDNA sequencing) sequence entries, part 11.
910. gbhtc12.seq - HTC (high throughput cDNA sequencing) sequence entries, part 12.
911. gbhtc13.seq - HTC (high throughput cDNA sequencing) sequence entries, part 13.
912. gbhtc14.seq - HTC (high throughput cDNA sequencing) sequence entries, part 14.
913. gbhtc15.seq - HTC (high throughput cDNA sequencing) sequence entries, part 15.
914. gbhtc2.seq - HTC (high throughput cDNA sequencing) sequence entries, part 2.
915. gbhtc3.seq - HTC (high throughput cDNA sequencing) sequence entries, part 3.
916. gbhtc4.seq - HTC (high throughput cDNA sequencing) sequence entries, part 4.
917. gbhtc5.seq - HTC (high throughput cDNA sequencing) sequence entries, part 5.
918. gbhtc6.seq - HTC (high throughput cDNA sequencing) sequence entries, part 6.
919. gbhtc7.seq - HTC (high throughput cDNA sequencing) sequence entries, part 7.
920. gbhtc8.seq - HTC (high throughput cDNA sequencing) sequence entries, part 8.
921. gbhtc9.seq - HTC (high throughput cDNA sequencing) sequence entries, part 9.
922. gbhtg1.seq - HTGS (high throughput genomic sequencing) sequence entries, part 1.
923. gbhtg10.seq - HTGS (high throughput genomic sequencing) sequence entries, part 10.
924. gbhtg100.seq - HTGS (high throughput genomic sequencing) sequence entries, part 100.
925. gbhtg101.seq - HTGS (high throughput genomic sequencing) sequence entries, part 101.
926. gbhtg102.seq - HTGS (high throughput genomic sequencing) sequence entries, part 102.
927. gbhtg103.seq - HTGS (high throughput genomic sequencing) sequence entries, part 103.
928. gbhtg104.seq - HTGS (high throughput genomic sequencing) sequence entries, part 104.
929. gbhtg105.seq - HTGS (high throughput genomic sequencing) sequence entries, part 105.
930. gbhtg106.seq - HTGS (high throughput genomic sequencing) sequence entries, part 106.
931. gbhtg107.seq - HTGS (high throughput genomic sequencing) sequence entries, part 107.
932. gbhtg108.seq - HTGS (high throughput genomic sequencing) sequence entries, part 108.
933. gbhtg109.seq - HTGS (high throughput genomic sequencing) sequence entries, part 109.
934. gbhtg11.seq - HTGS (high throughput genomic sequencing) sequence entries, part 11.
935. gbhtg110.seq - HTGS (high throughput genomic sequencing) sequence entries, part 110.
936. gbhtg111.seq - HTGS (high throughput genomic sequencing) sequence entries, part 111.
937. gbhtg112.seq - HTGS (high throughput genomic sequencing) sequence entries, part 112.
938. gbhtg113.seq - HTGS (high throughput genomic sequencing) sequence entries, part 113.
939. gbhtg114.seq - HTGS (high throughput genomic sequencing) sequence entries, part 114.
940. gbhtg115.seq - HTGS (high throughput genomic sequencing) sequence entries, part 115.
941. gbhtg116.seq - HTGS (high throughput genomic sequencing) sequence entries, part 116.
942. gbhtg117.seq - HTGS (high throughput genomic sequencing) sequence entries, part 117.
943. gbhtg118.seq - HTGS (high throughput genomic sequencing) sequence entries, part 118.
944. gbhtg119.seq - HTGS (high throughput genomic sequencing) sequence entries, part 119.
945. gbhtg12.seq - HTGS (high throughput genomic sequencing) sequence entries, part 12.
946. gbhtg120.seq - HTGS (high throughput genomic sequencing) sequence entries, part 120.
947. gbhtg121.seq - HTGS (high throughput genomic sequencing) sequence entries, part 121.
948. gbhtg122.seq - HTGS (high throughput genomic sequencing) sequence entries, part 122.
949. gbhtg123.seq - HTGS (high throughput genomic sequencing) sequence entries, part 123.
950. gbhtg124.seq - HTGS (high throughput genomic sequencing) sequence entries, part 124.
951. gbhtg125.seq - HTGS (high throughput genomic sequencing) sequence entries, part 125.
952. gbhtg126.seq - HTGS (high throughput genomic sequencing) sequence entries, part 126.
953. gbhtg127.seq - HTGS (high throughput genomic sequencing) sequence entries, part 127.
954. gbhtg128.seq - HTGS (high throughput genomic sequencing) sequence entries, part 128.
955. gbhtg129.seq - HTGS (high throughput genomic sequencing) sequence entries, part 129.
956. gbhtg13.seq - HTGS (high throughput genomic sequencing) sequence entries, part 13.
957. gbhtg130.seq - HTGS (high throughput genomic sequencing) sequence entries, part 130.
958. gbhtg131.seq - HTGS (high throughput genomic sequencing) sequence entries, part 131.
959. gbhtg132.seq - HTGS (high throughput genomic sequencing) sequence entries, part 132.
960. gbhtg133.seq - HTGS (high throughput genomic sequencing) sequence entries, part 133.
961. gbhtg134.seq - HTGS (high throughput genomic sequencing) sequence entries, part 134.
962. gbhtg14.seq - HTGS (high throughput genomic sequencing) sequence entries, part 14.
963. gbhtg15.seq - HTGS (high throughput genomic sequencing) sequence entries, part 15.
964. gbhtg16.seq - HTGS (high throughput genomic sequencing) sequence entries, part 16.
965. gbhtg17.seq - HTGS (high throughput genomic sequencing) sequence entries, part 17.
966. gbhtg18.seq - HTGS (high throughput genomic sequencing) sequence entries, part 18.
967. gbhtg19.seq - HTGS (high throughput genomic sequencing) sequence entries, part 19.
968. gbhtg2.seq - HTGS (high throughput genomic sequencing) sequence entries, part 2.
969. gbhtg20.seq - HTGS (high throughput genomic sequencing) sequence entries, part 20.
970. gbhtg21.seq - HTGS (high throughput genomic sequencing) sequence entries, part 21.
971. gbhtg22.seq - HTGS (high throughput genomic sequencing) sequence entries, part 22.
972. gbhtg23.seq - HTGS (high throughput genomic sequencing) sequence entries, part 23.
973. gbhtg24.seq - HTGS (high throughput genomic sequencing) sequence entries, part 24.
974. gbhtg25.seq - HTGS (high throughput genomic sequencing) sequence entries, part 25.
975. gbhtg26.seq - HTGS (high throughput genomic sequencing) sequence entries, part 26.
976. gbhtg27.seq - HTGS (high throughput genomic sequencing) sequence entries, part 27.
977. gbhtg28.seq - HTGS (high throughput genomic sequencing) sequence entries, part 28.
978. gbhtg29.seq - HTGS (high throughput genomic sequencing) sequence entries, part 29.
979. gbhtg3.seq - HTGS (high throughput genomic sequencing) sequence entries, part 3.
980. gbhtg30.seq - HTGS (high throughput genomic sequencing) sequence entries, part 30.
981. gbhtg31.seq - HTGS (high throughput genomic sequencing) sequence entries, part 31.
982. gbhtg32.seq - HTGS (high throughput genomic sequencing) sequence entries, part 32.
983. gbhtg33.seq - HTGS (high throughput genomic sequencing) sequence entries, part 33.
984. gbhtg34.seq - HTGS (high throughput genomic sequencing) sequence entries, part 34.
985. gbhtg35.seq - HTGS (high throughput genomic sequencing) sequence entries, part 35.
986. gbhtg36.seq - HTGS (high throughput genomic sequencing) sequence entries, part 36.
987. gbhtg37.seq - HTGS (high throughput genomic sequencing) sequence entries, part 37.
988. gbhtg38.seq - HTGS (high throughput genomic sequencing) sequence entries, part 38.
989. gbhtg39.seq - HTGS (high throughput genomic sequencing) sequence entries, part 39.
990. gbhtg4.seq - HTGS (high throughput genomic sequencing) sequence entries, part 4.
991. gbhtg40.seq - HTGS (high throughput genomic sequencing) sequence entries, part 40.
992. gbhtg41.seq - HTGS (high throughput genomic sequencing) sequence entries, part 41.
993. gbhtg42.seq - HTGS (high throughput genomic sequencing) sequence entries, part 42.
994. gbhtg43.seq - HTGS (high throughput genomic sequencing) sequence entries, part 43.
995. gbhtg44.seq - HTGS (high throughput genomic sequencing) sequence entries, part 44.
996. gbhtg45.seq - HTGS (high throughput genomic sequencing) sequence entries, part 45.
997. gbhtg46.seq - HTGS (high throughput genomic sequencing) sequence entries, part 46.
998. gbhtg47.seq - HTGS (high throughput genomic sequencing) sequence entries, part 47.
999. gbhtg48.seq - HTGS (high throughput genomic sequencing) sequence entries, part 48.
1000. gbhtg49.seq - HTGS (high throughput genomic sequencing) sequence entries, part 49.
1001. gbhtg5.seq - HTGS (high throughput genomic sequencing) sequence entries, part 5.
1002. gbhtg50.seq - HTGS (high throughput genomic sequencing) sequence entries, part 50.
1003. gbhtg51.seq - HTGS (high throughput genomic sequencing) sequence entries, part 51.
1004. gbhtg52.seq - HTGS (high throughput genomic sequencing) sequence entries, part 52.
1005. gbhtg53.seq - HTGS (high throughput genomic sequencing) sequence entries, part 53.
1006. gbhtg54.seq - HTGS (high throughput genomic sequencing) sequence entries, part 54.
1007. gbhtg55.seq - HTGS (high throughput genomic sequencing) sequence entries, part 55.
1008. gbhtg56.seq - HTGS (high throughput genomic sequencing) sequence entries, part 56.
1009. gbhtg57.seq - HTGS (high throughput genomic sequencing) sequence entries, part 57.
1010. gbhtg58.seq - HTGS (high throughput genomic sequencing) sequence entries, part 58.
1011. gbhtg59.seq - HTGS (high throughput genomic sequencing) sequence entries, part 59.
1012. gbhtg6.seq - HTGS (high throughput genomic sequencing) sequence entries, part 6.
1013. gbhtg60.seq - HTGS (high throughput genomic sequencing) sequence entries, part 60.
1014. gbhtg61.seq - HTGS (high throughput genomic sequencing) sequence entries, part 61.
1015. gbhtg62.seq - HTGS (high throughput genomic sequencing) sequence entries, part 62.
1016. gbhtg63.seq - HTGS (high throughput genomic sequencing) sequence entries, part 63.
1017. gbhtg64.seq - HTGS (high throughput genomic sequencing) sequence entries, part 64.
1018. gbhtg65.seq - HTGS (high throughput genomic sequencing) sequence entries, part 65.
1019. gbhtg66.seq - HTGS (high throughput genomic sequencing) sequence entries, part 66.
1020. gbhtg67.seq - HTGS (high throughput genomic sequencing) sequence entries, part 67.
1021. gbhtg68.seq - HTGS (high throughput genomic sequencing) sequence entries, part 68.
1022. gbhtg69.seq - HTGS (high throughput genomic sequencing) sequence entries, part 69.
1023. gbhtg7.seq - HTGS (high throughput genomic sequencing) sequence entries, part 7.
1024. gbhtg70.seq - HTGS (high throughput genomic sequencing) sequence entries, part 70.
1025. gbhtg71.seq - HTGS (high throughput genomic sequencing) sequence entries, part 71.
1026. gbhtg72.seq - HTGS (high throughput genomic sequencing) sequence entries, part 72.
1027. gbhtg73.seq - HTGS (high throughput genomic sequencing) sequence entries, part 73.
1028. gbhtg74.seq - HTGS (high throughput genomic sequencing) sequence entries, part 74.
1029. gbhtg75.seq - HTGS (high throughput genomic sequencing) sequence entries, part 75.
1030. gbhtg76.seq - HTGS (high throughput genomic sequencing) sequence entries, part 76.
1031. gbhtg77.seq - HTGS (high throughput genomic sequencing) sequence entries, part 77.
1032. gbhtg78.seq - HTGS (high throughput genomic sequencing) sequence entries, part 78.
1033. gbhtg79.seq - HTGS (high throughput genomic sequencing) sequence entries, part 79.
1034. gbhtg8.seq - HTGS (high throughput genomic sequencing) sequence entries, part 8.
1035. gbhtg80.seq - HTGS (high throughput genomic sequencing) sequence entries, part 80.
1036. gbhtg81.seq - HTGS (high throughput genomic sequencing) sequence entries, part 81.
1037. gbhtg82.seq - HTGS (high throughput genomic sequencing) sequence entries, part 82.
1038. gbhtg83.seq - HTGS (high throughput genomic sequencing) sequence entries, part 83.
1039. gbhtg84.seq - HTGS (high throughput genomic sequencing) sequence entries, part 84.
1040. gbhtg85.seq - HTGS (high throughput genomic sequencing) sequence entries, part 85.
1041. gbhtg86.seq - HTGS (high throughput genomic sequencing) sequence entries, part 86.
1042. gbhtg87.seq - HTGS (high throughput genomic sequencing) sequence entries, part 87.
1043. gbhtg88.seq - HTGS (high throughput genomic sequencing) sequence entries, part 88.
1044. gbhtg89.seq - HTGS (high throughput genomic sequencing) sequence entries, part 89.
1045. gbhtg9.seq - HTGS (high throughput genomic sequencing) sequence entries, part 9.
1046. gbhtg90.seq - HTGS (high throughput genomic sequencing) sequence entries, part 90.
1047. gbhtg91.seq - HTGS (high throughput genomic sequencing) sequence entries, part 91.
1048. gbhtg92.seq - HTGS (high throughput genomic sequencing) sequence entries, part 92.
1049. gbhtg93.seq - HTGS (high throughput genomic sequencing) sequence entries, part 93.
1050. gbhtg94.seq - HTGS (high throughput genomic sequencing) sequence entries, part 94.
1051. gbhtg95.seq - HTGS (high throughput genomic sequencing) sequence entries, part 95.
1052. gbhtg96.seq - HTGS (high throughput genomic sequencing) sequence entries, part 96.
1053. gbhtg97.seq - HTGS (high throughput genomic sequencing) sequence entries, part 97.
1054. gbhtg98.seq - HTGS (high throughput genomic sequencing) sequence entries, part 98.
1055. gbhtg99.seq - HTGS (high throughput genomic sequencing) sequence entries, part 99.
1056. gbinv1.seq - Invertebrate sequence entries, part 1.
1057. gbinv10.seq - Invertebrate sequence entries, part 10.
1058. gbinv11.seq - Invertebrate sequence entries, part 11.
1059. gbinv12.seq - Invertebrate sequence entries, part 12.
1060. gbinv13.seq - Invertebrate sequence entries, part 13.
1061. gbinv14.seq - Invertebrate sequence entries, part 14.
1062. gbinv15.seq - Invertebrate sequence entries, part 15.
1063. gbinv16.seq - Invertebrate sequence entries, part 16.
1064. gbinv17.seq - Invertebrate sequence entries, part 17.
1065. gbinv18.seq - Invertebrate sequence entries, part 18.
1066. gbinv19.seq - Invertebrate sequence entries, part 19.
1067. gbinv2.seq - Invertebrate sequence entries, part 2.
1068. gbinv3.seq - Invertebrate sequence entries, part 3.
1069. gbinv4.seq - Invertebrate sequence entries, part 4.
1070. gbinv5.seq - Invertebrate sequence entries, part 5.
1071. gbinv6.seq - Invertebrate sequence entries, part 6.
1072. gbinv7.seq - Invertebrate sequence entries, part 7.
1073. gbinv8.seq - Invertebrate sequence entries, part 8.
1074. gbinv9.seq - Invertebrate sequence entries, part 9.
1075. gbjou1.idx - Index of the entries according to journal citation, part 1.
1076. gbjou2.idx - Index of the entries according to journal citation, part 2.
1077. gbjou3.idx - Index of the entries according to journal citation, part 3.
1078. gbjou4.idx - Index of the entries according to journal citation, part 4.
1079. gbjou5.idx - Index of the entries according to journal citation, part 5.
1080. gbjou6.idx - Index of the entries according to journal citation, part 6.
1081. gbjou7.idx - Index of the entries according to journal citation, part 7.
1082. gbjou8.idx - Index of the entries according to journal citation, part 8.
1083. gbkey1.idx - Index of the entries according to keyword phrase, part 1.
1084. gbkey2.idx - Index of the entries according to keyword phrase, part 2.
1085. gbkey3.idx - Index of the entries according to keyword phrase, part 3.
1086. gbkey4.idx - Index of the entries according to keyword phrase, part 4.
1087. gbmam1.seq - Other mammalian sequence entries, part 1.
1088. gbmam2.seq - Other mammalian sequence entries, part 2.
1089. gbmam3.seq - Other mammalian sequence entries, part 3.
1090. gbmam4.seq - Other mammalian sequence entries, part 4.
1091. gbmam5.seq - Other mammalian sequence entries, part 5.
1092. gbmam6.seq - Other mammalian sequence entries, part 6.
1093. gbnew.txt - Accession numbers of entries new since the previous release.
1094. gbpat1.seq - Patent sequence entries, part 1.
1095. gbpat10.seq - Patent sequence entries, part 10.
1096. gbpat100.seq - Patent sequence entries, part 100.
1097. gbpat11.seq - Patent sequence entries, part 11.
1098. gbpat12.seq - Patent sequence entries, part 12.
1099. gbpat13.seq - Patent sequence entries, part 13.
1100. gbpat14.seq - Patent sequence entries, part 14.
1101. gbpat15.seq - Patent sequence entries, part 15.
1102. gbpat16.seq - Patent sequence entries, part 16.
1103. gbpat17.seq - Patent sequence entries, part 17.
1104. gbpat18.seq - Patent sequence entries, part 18.
1105. gbpat19.seq - Patent sequence entries, part 19.
1106. gbpat2.seq - Patent sequence entries, part 2.
1107. gbpat20.seq - Patent sequence entries, part 20.
1108. gbpat21.seq - Patent sequence entries, part 21.
1109. gbpat22.seq - Patent sequence entries, part 22.
1110. gbpat23.seq - Patent sequence entries, part 23.
1111. gbpat24.seq - Patent sequence entries, part 24.
1112. gbpat25.seq - Patent sequence entries, part 25.
1113. gbpat26.seq - Patent sequence entries, part 26.
1114. gbpat27.seq - Patent sequence entries, part 27.
1115. gbpat28.seq - Patent sequence entries, part 28.
1116. gbpat29.seq - Patent sequence entries, part 29.
1117. gbpat3.seq - Patent sequence entries, part 3.
1118. gbpat30.seq - Patent sequence entries, part 30.
1119. gbpat31.seq - Patent sequence entries, part 31.
1120. gbpat32.seq - Patent sequence entries, part 32.
1121. gbpat33.seq - Patent sequence entries, part 33.
1122. gbpat34.seq - Patent sequence entries, part 34.
1123. gbpat35.seq - Patent sequence entries, part 35.
1124. gbpat36.seq - Patent sequence entries, part 36.
1125. gbpat37.seq - Patent sequence entries, part 37.
1126. gbpat38.seq - Patent sequence entries, part 38.
1127. gbpat39.seq - Patent sequence entries, part 39.
1128. gbpat4.seq - Patent sequence entries, part 4.
1129. gbpat40.seq - Patent sequence entries, part 40.
1130. gbpat41.seq - Patent sequence entries, part 41.
1131. gbpat42.seq - Patent sequence entries, part 42.
1132. gbpat43.seq - Patent sequence entries, part 43.
1133. gbpat44.seq - Patent sequence entries, part 44.
1134. gbpat45.seq - Patent sequence entries, part 45.
1135. gbpat46.seq - Patent sequence entries, part 46.
1136. gbpat47.seq - Patent sequence entries, part 47.
1137. gbpat48.seq - Patent sequence entries, part 48.
1138. gbpat49.seq - Patent sequence entries, part 49.
1139. gbpat5.seq - Patent sequence entries, part 5.
1140. gbpat50.seq - Patent sequence entries, part 50.
1141. gbpat51.seq - Patent sequence entries, part 51.
1142. gbpat52.seq - Patent sequence entries, part 52.
1143. gbpat53.seq - Patent sequence entries, part 53.
1144. gbpat54.seq - Patent sequence entries, part 54.
1145. gbpat55.seq - Patent sequence entries, part 55.
1146. gbpat56.seq - Patent sequence entries, part 56.
1147. gbpat57.seq - Patent sequence entries, part 57.
1148. gbpat58.seq - Patent sequence entries, part 58.
1149. gbpat59.seq - Patent sequence entries, part 59.
1150. gbpat6.seq - Patent sequence entries, part 6.
1151. gbpat60.seq - Patent sequence entries, part 60.
1152. gbpat61.seq - Patent sequence entries, part 61.
1153. gbpat62.seq - Patent sequence entries, part 62.
1154. gbpat63.seq - Patent sequence entries, part 63.
1155. gbpat64.seq - Patent sequence entries, part 64.
1156. gbpat65.seq - Patent sequence entries, part 65.
1157. gbpat66.seq - Patent sequence entries, part 66.
1158. gbpat67.seq - Patent sequence entries, part 67.
1159. gbpat68.seq - Patent sequence entries, part 68.
1160. gbpat69.seq - Patent sequence entries, part 69.
1161. gbpat7.seq - Patent sequence entries, part 7.
1162. gbpat70.seq - Patent sequence entries, part 70.
1163. gbpat71.seq - Patent sequence entries, part 71.
1164. gbpat72.seq - Patent sequence entries, part 72.
1165. gbpat73.seq - Patent sequence entries, part 73.
1166. gbpat74.seq - Patent sequence entries, part 74.
1167. gbpat75.seq - Patent sequence entries, part 75.
1168. gbpat76.seq - Patent sequence entries, part 76.
1169. gbpat77.seq - Patent sequence entries, part 77.
1170. gbpat78.seq - Patent sequence entries, part 78.
1171. gbpat79.seq - Patent sequence entries, part 79.
1172. gbpat8.seq - Patent sequence entries, part 8.
1173. gbpat80.seq - Patent sequence entries, part 80.
1174. gbpat81.seq - Patent sequence entries, part 81.
1175. gbpat82.seq - Patent sequence entries, part 82.
1176. gbpat83.seq - Patent sequence entries, part 83.
1177. gbpat84.seq - Patent sequence entries, part 84.
1178. gbpat85.seq - Patent sequence entries, part 85.
1179. gbpat86.seq - Patent sequence entries, part 86.
1180. gbpat87.seq - Patent sequence entries, part 87.
1181. gbpat88.seq - Patent sequence entries, part 88.
1182. gbpat89.seq - Patent sequence entries, part 89.
1183. gbpat9.seq - Patent sequence entries, part 9.
1184. gbpat90.seq - Patent sequence entries, part 90.
1185. gbpat91.seq - Patent sequence entries, part 91.
1186. gbpat92.seq - Patent sequence entries, part 92.
1187. gbpat93.seq - Patent sequence entries, part 93.
1188. gbpat94.seq - Patent sequence entries, part 94.
1189. gbpat95.seq - Patent sequence entries, part 95.
1190. gbpat96.seq - Patent sequence entries, part 96.
1191. gbpat97.seq - Patent sequence entries, part 97.
1192. gbpat98.seq - Patent sequence entries, part 98.
1193. gbpat99.seq - Patent sequence entries, part 99.
1194. gbphg1.seq - Phage sequence entries, part 1.
1195. gbpln1.seq - Plant sequence entries (including fungi and algae), part 1.
1196. gbpln10.seq - Plant sequence entries (including fungi and algae), part 10.
1197. gbpln11.seq - Plant sequence entries (including fungi and algae), part 11.
1198. gbpln12.seq - Plant sequence entries (including fungi and algae), part 12.
1199. gbpln13.seq - Plant sequence entries (including fungi and algae), part 13.
1200. gbpln14.seq - Plant sequence entries (including fungi and algae), part 14.
1201. gbpln15.seq - Plant sequence entries (including fungi and algae), part 15.
1202. gbpln16.seq - Plant sequence entries (including fungi and algae), part 16.
1203. gbpln17.seq - Plant sequence entries (including fungi and algae), part 17.
1204. gbpln18.seq - Plant sequence entries (including fungi and algae), part 18.
1205. gbpln19.seq - Plant sequence entries (including fungi and algae), part 19.
1206. gbpln2.seq - Plant sequence entries (including fungi and algae), part 2.
1207. gbpln20.seq - Plant sequence entries (including fungi and algae), part 20.
1208. gbpln21.seq - Plant sequence entries (including fungi and algae), part 21.
1209. gbpln22.seq - Plant sequence entries (including fungi and algae), part 22.
1210. gbpln23.seq - Plant sequence entries (including fungi and algae), part 23.
1211. gbpln24.seq - Plant sequence entries (including fungi and algae), part 24.
1212. gbpln25.seq - Plant sequence entries (including fungi and algae), part 25.
1213. gbpln26.seq - Plant sequence entries (including fungi and algae), part 26.
1214. gbpln27.seq - Plant sequence entries (including fungi and algae), part 27.
1215. gbpln28.seq - Plant sequence entries (including fungi and algae), part 28.
1216. gbpln29.seq - Plant sequence entries (including fungi and algae), part 29.
1217. gbpln3.seq - Plant sequence entries (including fungi and algae), part 3.
1218. gbpln30.seq - Plant sequence entries (including fungi and algae), part 30.
1219. gbpln31.seq - Plant sequence entries (including fungi and algae), part 31.
1220. gbpln32.seq - Plant sequence entries (including fungi and algae), part 32.
1221. gbpln33.seq - Plant sequence entries (including fungi and algae), part 33.
1222. gbpln34.seq - Plant sequence entries (including fungi and algae), part 34.
1223. gbpln35.seq - Plant sequence entries (including fungi and algae), part 35.
1224. gbpln36.seq - Plant sequence entries (including fungi and algae), part 36.
1225. gbpln37.seq - Plant sequence entries (including fungi and algae), part 37.
1226. gbpln38.seq - Plant sequence entries (including fungi and algae), part 38.
1227. gbpln39.seq - Plant sequence entries (including fungi and algae), part 39.
1228. gbpln4.seq - Plant sequence entries (including fungi and algae), part 4.
1229. gbpln5.seq - Plant sequence entries (including fungi and algae), part 5.
1230. gbpln6.seq - Plant sequence entries (including fungi and algae), part 6.
1231. gbpln7.seq - Plant sequence entries (including fungi and algae), part 7.
1232. gbpln8.seq - Plant sequence entries (including fungi and algae), part 8.
1233. gbpln9.seq - Plant sequence entries (including fungi and algae), part 9.
1234. gbpri1.seq - Primate sequence entries, part 1.
1235. gbpri10.seq - Primate sequence entries, part 10.
1236. gbpri11.seq - Primate sequence entries, part 11.
1237. gbpri12.seq - Primate sequence entries, part 12.
1238. gbpri13.seq - Primate sequence entries, part 13.
1239. gbpri14.seq - Primate sequence entries, part 14.
1240. gbpri15.seq - Primate sequence entries, part 15.
1241. gbpri16.seq - Primate sequence entries, part 16.
1242. gbpri17.seq - Primate sequence entries, part 17.
1243. gbpri18.seq - Primate sequence entries, part 18.
1244. gbpri19.seq - Primate sequence entries, part 19.
1245. gbpri2.seq - Primate sequence entries, part 2.
1246. gbpri20.seq - Primate sequence entries, part 20.
1247. gbpri21.seq - Primate sequence entries, part 21.
1248. gbpri22.seq - Primate sequence entries, part 22.
1249. gbpri23.seq - Primate sequence entries, part 23.
1250. gbpri24.seq - Primate sequence entries, part 24.
1251. gbpri25.seq - Primate sequence entries, part 25.
1252. gbpri26.seq - Primate sequence entries, part 26.
1253. gbpri27.seq - Primate sequence entries, part 27.
1254. gbpri28.seq - Primate sequence entries, part 28.
1255. gbpri29.seq - Primate sequence entries, part 29.
1256. gbpri3.seq - Primate sequence entries, part 3.
1257. gbpri30.seq - Primate sequence entries, part 30.
1258. gbpri31.seq - Primate sequence entries, part 31.
1259. gbpri32.seq - Primate sequence entries, part 32.
1260. gbpri33.seq - Primate sequence entries, part 33.
1261. gbpri34.seq - Primate sequence entries, part 34.
1262. gbpri35.seq - Primate sequence entries, part 35.
1263. gbpri36.seq - Primate sequence entries, part 36.
1264. gbpri37.seq - Primate sequence entries, part 37.
1265. gbpri38.seq - Primate sequence entries, part 38.
1266. gbpri39.seq - Primate sequence entries, part 39.
1267. gbpri4.seq - Primate sequence entries, part 4.
1268. gbpri40.seq - Primate sequence entries, part 40.
1269. gbpri41.seq - Primate sequence entries, part 41.
1270. gbpri5.seq - Primate sequence entries, part 5.
1271. gbpri6.seq - Primate sequence entries, part 6.
1272. gbpri7.seq - Primate sequence entries, part 7.
1273. gbpri8.seq - Primate sequence entries, part 8.
1274. gbpri9.seq - Primate sequence entries, part 9.
1275. gbrel.txt - Release notes (this document).
1276. gbrod1.seq - Rodent sequence entries, part 1.
1277. gbrod10.seq - Rodent sequence entries, part 10.
1278. gbrod11.seq - Rodent sequence entries, part 11.
1279. gbrod12.seq - Rodent sequence entries, part 12.
1280. gbrod13.seq - Rodent sequence entries, part 13.
1281. gbrod14.seq - Rodent sequence entries, part 14.
1282. gbrod15.seq - Rodent sequence entries, part 15.
1283. gbrod16.seq - Rodent sequence entries, part 16.
1284. gbrod17.seq - Rodent sequence entries, part 17.
1285. gbrod18.seq - Rodent sequence entries, part 18.
1286. gbrod19.seq - Rodent sequence entries, part 19.
1287. gbrod2.seq - Rodent sequence entries, part 2.
1288. gbrod20.seq - Rodent sequence entries, part 20.
1289. gbrod21.seq - Rodent sequence entries, part 21.
1290. gbrod22.seq - Rodent sequence entries, part 22.
1291. gbrod23.seq - Rodent sequence entries, part 23.
1292. gbrod24.seq - Rodent sequence entries, part 24.
1293. gbrod25.seq - Rodent sequence entries, part 25.
1294. gbrod26.seq - Rodent sequence entries, part 26.
1295. gbrod27.seq - Rodent sequence entries, part 27.
1296. gbrod28.seq - Rodent sequence entries, part 28.
1297. gbrod3.seq - Rodent sequence entries, part 3.
1298. gbrod4.seq - Rodent sequence entries, part 4.
1299. gbrod5.seq - Rodent sequence entries, part 5.
1300. gbrod6.seq - Rodent sequence entries, part 6.
1301. gbrod7.seq - Rodent sequence entries, part 7.
1302. gbrod8.seq - Rodent sequence entries, part 8.
1303. gbrod9.seq - Rodent sequence entries, part 9.
1304. gbsdr1.txt - Short directory of the data bank, part 1.
1305. gbsdr2.txt - Short directory of the data bank, part 2.
1306. gbsdr3.txt - Short directory of the data bank, part 3.
1307. gbsec.idx - Index of the entries according to secondary accession number.
1308. gbsts1.seq - STS (sequence tagged site) sequence entries, part 1.
1309. gbsts10.seq - STS (sequence tagged site) sequence entries, part 10.
1310. gbsts11.seq - STS (sequence tagged site) sequence entries, part 11.
1311. gbsts12.seq - STS (sequence tagged site) sequence entries, part 12.
1312. gbsts13.seq - STS (sequence tagged site) sequence entries, part 13.
1313. gbsts14.seq - STS (sequence tagged site) sequence entries, part 14.
1314. gbsts15.seq - STS (sequence tagged site) sequence entries, part 15.
1315. gbsts16.seq - STS (sequence tagged site) sequence entries, part 16.
1316. gbsts17.seq - STS (sequence tagged site) sequence entries, part 17.
1317. gbsts18.seq - STS (sequence tagged site) sequence entries, part 18.
1318. gbsts19.seq - STS (sequence tagged site) sequence entries, part 19.
1319. gbsts2.seq - STS (sequence tagged site) sequence entries, part 2.
1320. gbsts20.seq - STS (sequence tagged site) sequence entries, part 20.
1321. gbsts3.seq - STS (sequence tagged site) sequence entries, part 3.
1322. gbsts4.seq - STS (sequence tagged site) sequence entries, part 4.
1323. gbsts5.seq - STS (sequence tagged site) sequence entries, part 5.
1324. gbsts6.seq - STS (sequence tagged site) sequence entries, part 6.
1325. gbsts7.seq - STS (sequence tagged site) sequence entries, part 7.
1326. gbsts8.seq - STS (sequence tagged site) sequence entries, part 8.
1327. gbsts9.seq - STS (sequence tagged site) sequence entries, part 9.
1328. gbsyn1.seq - Synthetic and chimeric sequence entries, part 1.
1329. gbsyn2.seq - Synthetic and chimeric sequence entries, part 2.
1330. gbtsa1.seq - TSA (transcriptome shotgun assembly) sequence entries, part 1.
1331. gbtsa2.seq - TSA (transcriptome shotgun assembly) sequence entries, part 2.
1332. gbtsa3.seq - TSA (transcriptome shotgun assembly) sequence entries, part 3.
1333. gbtsa4.seq - TSA (transcriptome shotgun assembly) sequence entries, part 4.
1334. gbuna1.seq - Unannotated sequence entries, part 1.
1335. gbvrl1.seq - Viral sequence entries, part 1.
1336. gbvrl10.seq - Viral sequence entries, part 10.
1337. gbvrl11.seq - Viral sequence entries, part 11.
1338. gbvrl12.seq - Viral sequence entries, part 12.
1339. gbvrl13.seq - Viral sequence entries, part 13.
1340. gbvrl14.seq - Viral sequence entries, part 14.
1341. gbvrl2.seq - Viral sequence entries, part 2.
1342. gbvrl3.seq - Viral sequence entries, part 3.
1343. gbvrl4.seq - Viral sequence entries, part 4.
1344. gbvrl5.seq - Viral sequence entries, part 5.
1345. gbvrl6.seq - Viral sequence entries, part 6.
1346. gbvrl7.seq - Viral sequence entries, part 7.
1347. gbvrl8.seq - Viral sequence entries, part 8.
1348. gbvrl9.seq - Viral sequence entries, part 9.
1349. gbvrt1.seq - Other vertebrate sequence entries, part 1.
1350. gbvrt10.seq - Other vertebrate sequence entries, part 10.
1351. gbvrt11.seq - Other vertebrate sequence entries, part 11.
1352. gbvrt12.seq - Other vertebrate sequence entries, part 12.
1353. gbvrt13.seq - Other vertebrate sequence entries, part 13.
1354. gbvrt14.seq - Other vertebrate sequence entries, part 14.
1355. gbvrt15.seq - Other vertebrate sequence entries, part 15.
1356. gbvrt16.seq - Other vertebrate sequence entries, part 16.
1357. gbvrt17.seq - Other vertebrate sequence entries, part 17.
1358. gbvrt18.seq - Other vertebrate sequence entries, part 18.
1359. gbvrt19.seq - Other vertebrate sequence entries, part 19.
1360. gbvrt2.seq - Other vertebrate sequence entries, part 2.
1361. gbvrt20.seq - Other vertebrate sequence entries, part 20.
1362. gbvrt21.seq - Other vertebrate sequence entries, part 21.
1363. gbvrt3.seq - Other vertebrate sequence entries, part 3.
1364. gbvrt4.seq - Other vertebrate sequence entries, part 4.
1365. gbvrt5.seq - Other vertebrate sequence entries, part 5.
1366. gbvrt6.seq - Other vertebrate sequence entries, part 6.
1367. gbvrt7.seq - Other vertebrate sequence entries, part 7.
1368. gbvrt8.seq - Other vertebrate sequence entries, part 8.
1369. gbvrt9.seq - Other vertebrate sequence entries, part 9.
Sequences in the CON division data files (gbcon*.seq) are constructed from
other "traditional" sequence records, and are represented in a unique way.
CON records do not contain any sequence data; instead, they utilize a CONTIG
linetype with a join() statement which describes how component sequences
can be assembled to form the larger constructed sequence. Records in the CON
division do not contribute to GenBank Release statistics (Sections 2.2.6,
2.2.7, and 2.2.8), or to the overall release statistics presented in the header
of these release notes. The GenBank README describes the CON division of GenBank
in more detail:
ftp://ftp.ncbi.nih.gov/genbank/README.genbank
2.2.5 File Sizes
Uncompressed, the Release 176.0 flatfiles require roughly 432 GB (sequence
files only) or 463 GB (including the 'short directory', 'index' and the
*.txt files). The following table contains the approximate sizes of the
individual files in this release. Since minor changes to some of the files
might have occurred after these release notes were written, these sizes should
not be used to determine file integrity; they are provided as an aid to
planning only.
File Size File Name
1212039501 gbacc1.idx
2138374557 gbacc2.idx
685074693 gbacc3.idx
184716878 gbaut1.idx
232152382 gbaut10.idx
184783669 gbaut11.idx
195926181 gbaut12.idx
185037674 gbaut13.idx
220384757 gbaut14.idx
185445895 gbaut15.idx
183999140 gbaut16.idx
183790356 gbaut17.idx
244753878 gbaut18.idx
221356255 gbaut19.idx
184246562 gbaut2.idx
203683190 gbaut20.idx
185313391 gbaut21.idx
243438512 gbaut22.idx
187481139 gbaut23.idx
188269976 gbaut24.idx
184180698 gbaut25.idx
203347691 gbaut26.idx
216262083 gbaut27.idx
235808990 gbaut28.idx
252912056 gbaut29.idx
223092467 gbaut3.idx
184609059 gbaut30.idx
200686487 gbaut31.idx
185070837 gbaut32.idx
184288179 gbaut33.idx
183946843 gbaut34.idx
185299658 gbaut35.idx
185591688 gbaut36.idx
183827495 gbaut37.idx
184056992 gbaut38.idx
184117801 gbaut39.idx
183801081 gbaut4.idx
183724050 gbaut40.idx
184449566 gbaut41.idx
183704852 gbaut42.idx
185644201 gbaut43.idx
192969220 gbaut44.idx
184993796 gbaut45.idx
183592550 gbaut46.idx
184430699 gbaut47.idx
183911635 gbaut48.idx
251460347 gbaut49.idx
185496315 gbaut5.idx
185431640 gbaut50.idx
208513988 gbaut51.idx
184227231 gbaut52.idx
234051911 gbaut53.idx
186831918 gbaut54.idx
184847600 gbaut55.idx
205511960 gbaut56.idx
212313757 gbaut57.idx
184327372 gbaut58.idx
239228335 gbaut59.idx
229903512 gbaut6.idx
183850235 gbaut60.idx
184406146 gbaut61.idx
187922522 gbaut62.idx
184155968 gbaut63.idx
233224111 gbaut64.idx
186243377 gbaut65.idx
189067291 gbaut66.idx
209886656 gbaut67.idx
184232677 gbaut68.idx
183967857 gbaut69.idx
184187802 gbaut7.idx
200843954 gbaut70.idx
184050583 gbaut71.idx
241365600 gbaut72.idx
17967981 gbaut73.idx
183933418 gbaut8.idx
184700790 gbaut9.idx
245670591 gbbct1.seq
248123670 gbbct10.seq
244750080 gbbct11.seq
241977514 gbbct12.seq
249771899 gbbct13.seq
247999286 gbbct14.seq
242198659 gbbct15.seq
248740904 gbbct16.seq
243881228 gbbct17.seq
245715534 gbbct18.seq
248488835 gbbct19.seq
247484815 gbbct2.seq
191119418 gbbct20.seq
242657530 gbbct21.seq
241399542 gbbct22.seq
248563779 gbbct23.seq
247789210 gbbct24.seq
249346306 gbbct25.seq
248639430 gbbct26.seq
249065207 gbbct27.seq
247524096 gbbct28.seq
241329307 gbbct29.seq
245603222 gbbct3.seq
243888345 gbbct30.seq
242633740 gbbct31.seq
248968843 gbbct32.seq
134089691 gbbct33.seq
243063254 gbbct34.seq
243966990 gbbct35.seq
249684613 gbbct36.seq
234899120 gbbct37.seq
248443721 gbbct38.seq
240816898 gbbct39.seq
249997356 gbbct4.seq
238041797 gbbct40.seq
6879337 gbbct41.seq
14080134 gbbct42.seq
23146310 gbbct43.seq
45015987 gbbct44.seq
87581120 gbbct45.seq
169879280 gbbct46.seq
250000161 gbbct47.seq
249710895 gbbct48.seq
248546111 gbbct49.seq
139185712 gbbct5.seq
249999879 gbbct50.seq
249996273 gbbct51.seq
29021606 gbbct52.seq
249970296 gbbct6.seq
249683044 gbbct7.seq
249947167 gbbct8.seq
244413809 gbbct9.seq
13797186 gbchg.txt
249999108 gbcon1.seq
249996809 gbcon10.seq
249999100 gbcon100.seq
249999520 gbcon101.seq
249998411 gbcon102.seq
249997075 gbcon103.seq
155891156 gbcon104.seq
249996638 gbcon105.seq
249996958 gbcon106.seq
249972536 gbcon107.seq
249997665 gbcon108.seq
171931188 gbcon109.seq
249999912 gbcon11.seq
249998061 gbcon110.seq
249998923 gbcon111.seq
232920966 gbcon112.seq
249999612 gbcon113.seq
249998231 gbcon114.seq
249995786 gbcon115.seq
249803909 gbcon116.seq
159570146 gbcon117.seq
249999431 gbcon118.seq
161452708 gbcon119.seq
249463543 gbcon12.seq
249997137 gbcon120.seq
249999295 gbcon121.seq
249998694 gbcon122.seq
213552174 gbcon123.seq
249827639 gbcon124.seq
249754165 gbcon125.seq
249838418 gbcon126.seq
249997980 gbcon127.seq
249998793 gbcon128.seq
249997952 gbcon129.seq
244428576 gbcon13.seq
249996395 gbcon130.seq
249889369 gbcon131.seq
249991024 gbcon132.seq
210609774 gbcon133.seq
21448773 gbcon134.seq
249966154 gbcon14.seq
10102741 gbcon15.seq
248542697 gbcon16.seq
249999916 gbcon17.seq
133204833 gbcon18.seq
249994929 gbcon19.seq
248163547 gbcon2.seq
249990984 gbcon20.seq
107608268 gbcon21.seq
249999025 gbcon22.seq
86918240 gbcon23.seq
249999233 gbcon24.seq
249934985 gbcon25.seq
249999433 gbcon26.seq
250000254 gbcon27.seq
229991779 gbcon28.seq
249999538 gbcon29.seq
249795687 gbcon3.seq
249997411 gbcon30.seq
249997078 gbcon31.seq
249998450 gbcon32.seq
249122679 gbcon33.seq
155402969 gbcon34.seq
249999242 gbcon35.seq
249999877 gbcon36.seq
249996021 gbcon37.seq
249996176 gbcon38.seq
249994385 gbcon39.seq
249995768 gbcon4.seq
249994875 gbcon40.seq
60694182 gbcon41.seq
249994823 gbcon42.seq
249996670 gbcon43.seq
249994771 gbcon44.seq
249997476 gbcon45.seq
249999646 gbcon46.seq
42680484 gbcon47.seq
249996840 gbcon48.seq
249999181 gbcon49.seq
249999265 gbcon5.seq
249998673 gbcon50.seq
249995483 gbcon51.seq
249998214 gbcon52.seq
35989951 gbcon53.seq
249996602 gbcon54.seq
249994877 gbcon55.seq
249998850 gbcon56.seq
249996087 gbcon57.seq
249996119 gbcon58.seq
152231870 gbcon59.seq
249999243 gbcon6.seq
249999222 gbcon60.seq
249997773 gbcon61.seq
249999490 gbcon62.seq
249996614 gbcon63.seq
183250056 gbcon64.seq
249997380 gbcon65.seq
249998571 gbcon66.seq
250000027 gbcon67.seq
249997824 gbcon68.seq
245105048 gbcon69.seq
249997968 gbcon7.seq
249999617 gbcon70.seq
250000161 gbcon71.seq
249996007 gbcon72.seq
249996901 gbcon73.seq
249999372 gbcon74.seq
96213867 gbcon75.seq
249997855 gbcon76.seq
249995058 gbcon77.seq
249998739 gbcon78.seq
249998960 gbcon79.seq
50228775 gbcon8.seq
249999030 gbcon80.seq
62263749 gbcon81.seq
249999554 gbcon82.seq
249999004 gbcon83.seq
249998188 gbcon84.seq
249997952 gbcon85.seq
249996309 gbcon86.seq
21279195 gbcon87.seq
249996512 gbcon88.seq
249998457 gbcon89.seq
249997328 gbcon9.seq
249997420 gbcon90.seq
249998172 gbcon91.seq
197510298 gbcon92.seq
249995742 gbcon93.seq
249996706 gbcon94.seq
249996588 gbcon95.seq
249998180 gbcon96.seq
249997251 gbcon97.seq
132829813 gbcon98.seq
249997306 gbcon99.seq
9621 gbdel.txt
249997574 gbenv1.seq
249998090 gbenv10.seq
51297682 gbenv11.seq
249999962 gbenv12.seq
249999319 gbenv13.seq
249998136 gbenv14.seq
250000224 gbenv15.seq
249998935 gbenv16.seq
119804009 gbenv17.seq
249999199 gbenv18.seq
249999809 gbenv19.seq
249998328 gbenv2.seq
249999468 gbenv20.seq
249999032 gbenv21.seq
249999964 gbenv22.seq
133786599 gbenv23.seq
249998574 gbenv3.seq
249998728 gbenv4.seq
249997829 gbenv5.seq
105111490 gbenv6.seq
249997444 gbenv7.seq
249997509 gbenv8.seq
249999528 gbenv9.seq
524289938 gbest1.seq
524289129 gbest10.seq
524288827 gbest100.seq
524289421 gbest101.seq
524289999 gbest102.seq
524289849 gbest103.seq
204150169 gbest104.seq
524288770 gbest105.seq
524290759 gbest106.seq
473042568 gbest107.seq
386135878 gbest108.seq
385846596 gbest109.seq
524290137 gbest11.seq
380621933 gbest110.seq
381151606 gbest111.seq
382129415 gbest112.seq
378549441 gbest113.seq
389933108 gbest114.seq
385773183 gbest115.seq
382306233 gbest116.seq
381827520 gbest117.seq
411811305 gbest118.seq
524289133 gbest119.seq
524288800 gbest12.seq
524289047 gbest120.seq
524290262 gbest121.seq
524289249 gbest122.seq
524289954 gbest123.seq
372463900 gbest124.seq
524290783 gbest125.seq
524291690 gbest126.seq
524289522 gbest127.seq
524289883 gbest128.seq
524290204 gbest129.seq
22342 gbest13.seq
524290218 gbest130.seq
524291440 gbest131.seq
524290513 gbest132.seq
524289135 gbest133.seq
524289790 gbest134.seq
524289596 gbest135.seq
524289264 gbest136.seq
103081713 gbest137.seq
524289559 gbest138.seq
524289250 gbest139.seq
524291113 gbest14.seq
524289861 gbest140.seq
524289260 gbest141.seq
524291314 gbest142.seq
524291603 gbest143.seq
524290800 gbest144.seq
524291067 gbest145.seq
524292026 gbest146.seq
524289260 gbest147.seq
524290577 gbest148.seq
524289860 gbest149.seq
524291510 gbest15.seq
524292652 gbest150.seq
333490057 gbest151.seq
524290269 gbest152.seq
524291417 gbest153.seq
524289916 gbest154.seq
524289777 gbest155.seq
524292203 gbest156.seq
524288938 gbest157.seq
524291396 gbest158.seq
524291295 gbest159.seq
524288867 gbest16.seq
460674420 gbest160.seq
524290580 gbest161.seq
524288779 gbest162.seq
524289055 gbest163.seq
131230499 gbest164.seq
524290067 gbest165.seq
524289973 gbest166.seq
524289568 gbest167.seq
524289580 gbest168.seq
524289245 gbest169.seq
524291439 gbest17.seq
524290272 gbest170.seq
524290651 gbest171.seq
524294976 gbest172.seq
524290265 gbest173.seq
524291296 gbest174.seq
524292555 gbest175.seq
524289113 gbest176.seq
479914387 gbest177.seq
524290804 gbest178.seq
524289248 gbest179.seq
524289818 gbest18.seq
524289692 gbest180.seq
524290645 gbest181.seq
524291099 gbest182.seq
524290040 gbest183.seq
524288857 gbest184.seq
524292376 gbest185.seq
524291760 gbest186.seq
524289759 gbest187.seq
524290639 gbest188.seq
524290593 gbest189.seq
524290536 gbest19.seq
309363375 gbest190.seq
524288989 gbest191.seq
524290812 gbest192.seq
524291962 gbest193.seq
524288906 gbest194.seq
524290670 gbest195.seq
524291049 gbest196.seq
524288992 gbest197.seq
524289365 gbest198.seq
524290990 gbest199.seq
524289670 gbest2.seq
524289987 gbest20.seq
524289574 gbest200.seq
524291034 gbest201.seq
524289180 gbest202.seq
486699462 gbest203.seq
524289793 gbest204.seq
524291274 gbest205.seq
524289858 gbest206.seq
524288796 gbest207.seq
524288816 gbest208.seq
524290107 gbest209.seq
524290109 gbest21.seq
524290055 gbest210.seq
524290290 gbest211.seq
524289329 gbest212.seq
524289593 gbest213.seq
309654161 gbest214.seq
524292552 gbest215.seq
524289115 gbest216.seq
524289485 gbest217.seq
524289718 gbest218.seq
524289046 gbest219.seq
458586977 gbest22.seq
524290722 gbest220.seq
524292250 gbest221.seq
524288956 gbest222.seq
524290406 gbest223.seq
524292074 gbest224.seq
524291031 gbest225.seq
524292710 gbest226.seq
524289683 gbest227.seq
273943063 gbest228.seq
524289420 gbest229.seq
508910479 gbest23.seq
524292395 gbest230.seq
524290417 gbest231.seq
524289606 gbest232.seq
524291439 gbest233.seq
524292194 gbest234.seq
524290755 gbest235.seq
524289408 gbest236.seq
524290262 gbest237.seq
524289238 gbest238.seq
524288906 gbest239.seq
524289751 gbest24.seq
524291006 gbest240.seq
524292320 gbest241.seq
352454711 gbest242.seq
524288930 gbest243.seq
524290806 gbest244.seq
524289697 gbest245.seq
524290137 gbest246.seq
523701754 gbest247.seq
524291099 gbest248.seq
524291054 gbest249.seq
172475236 gbest25.seq
524290785 gbest250.seq
524290321 gbest251.seq
524289728 gbest252.seq
524289410 gbest253.seq
38021859 gbest254.seq
524291118 gbest255.seq
524290788 gbest256.seq
524288828 gbest257.seq
524289501 gbest258.seq
524288946 gbest259.seq
524289955 gbest26.seq
524288770 gbest260.seq
524289901 gbest261.seq
524288896 gbest262.seq
524291806 gbest263.seq
524291367 gbest264.seq
210148673 gbest265.seq
524290786 gbest266.seq
524291002 gbest267.seq
524290466 gbest268.seq
524290445 gbest269.seq
524289615 gbest27.seq
524290839 gbest270.seq
524290399 gbest271.seq
524289501 gbest272.seq
524289079 gbest273.seq
524290193 gbest274.seq
524291161 gbest275.seq
524293062 gbest276.seq
209933305 gbest277.seq
524288908 gbest278.seq
524289456 gbest279.seq
524289176 gbest28.seq
524290231 gbest280.seq
524290596 gbest281.seq
524291100 gbest282.seq
524291137 gbest283.seq
524290469 gbest284.seq
524290117 gbest285.seq
524290638 gbest286.seq
524293038 gbest287.seq
389953891 gbest288.seq
524289650 gbest289.seq
524293000 gbest29.seq
524288753 gbest290.seq
524289360 gbest291.seq
524290701 gbest292.seq
524290517 gbest293.seq
524291251 gbest294.seq
524289277 gbest295.seq
475288063 gbest296.seq
481138862 gbest297.seq
482707852 gbest298.seq
480206237 gbest299.seq
513078155 gbest3.seq
524288998 gbest30.seq
232417715 gbest300.seq
524291195 gbest301.seq
524289016 gbest302.seq
524292882 gbest303.seq
524288918 gbest304.seq
524289094 gbest305.seq
524289055 gbest306.seq
524288937 gbest307.seq
524289779 gbest308.seq
524290894 gbest309.seq
524290528 gbest31.seq
524290366 gbest310.seq
524289884 gbest311.seq
524288790 gbest312.seq
369697960 gbest313.seq
524290108 gbest314.seq
524291502 gbest315.seq
524290823 gbest316.seq
524289059 gbest317.seq
524290647 gbest318.seq
524289370 gbest319.seq
524291595 gbest32.seq
524289147 gbest320.seq
524289169 gbest321.seq
524289911 gbest322.seq
524291582 gbest323.seq
524289644 gbest324.seq
524291690 gbest325.seq
524289427 gbest326.seq
177002900 gbest327.seq
524289632 gbest328.seq
524289775 gbest329.seq
524291914 gbest33.seq
524289023 gbest330.seq
524289950 gbest331.seq
524289080 gbest332.seq
524292899 gbest333.seq
524291008 gbest334.seq
524288886 gbest335.seq
524289740 gbest336.seq
524290820 gbest337.seq
524289202 gbest338.seq
524292663 gbest339.seq
489396485 gbest34.seq
524289918 gbest340.seq
75623983 gbest341.seq
524289784 gbest342.seq
524289839 gbest343.seq
524291167 gbest344.seq
524289860 gbest345.seq
524293081 gbest346.seq
524295399 gbest347.seq
524289712 gbest348.seq
524291557 gbest349.seq
490148636 gbest35.seq
524289020 gbest350.seq
524291302 gbest351.seq
524290454 gbest352.seq
524290233 gbest353.seq
524290387 gbest354.seq
64960382 gbest355.seq
524289865 gbest356.seq
524289848 gbest357.seq
524290574 gbest358.seq
524289636 gbest359.seq
492360986 gbest36.seq
524291327 gbest360.seq
524289040 gbest361.seq
524289676 gbest362.seq
524289088 gbest363.seq
524288943 gbest364.seq
524291268 gbest365.seq
496423171 gbest366.seq
524291476 gbest367.seq
524289911 gbest368.seq
524288920 gbest369.seq
491776102 gbest37.seq
524289748 gbest370.seq
524290941 gbest371.seq
524290253 gbest372.seq
524289634 gbest373.seq
524289188 gbest374.seq
524290040 gbest375.seq
524290024 gbest376.seq
524288847 gbest377.seq
524291221 gbest378.seq
217956350 gbest379.seq
484490978 gbest38.seq
524290514 gbest380.seq
524291566 gbest381.seq
524292165 gbest382.seq
524289143 gbest383.seq
524289757 gbest384.seq
524289162 gbest385.seq
524290630 gbest386.seq
524289556 gbest387.seq
524292032 gbest388.seq
524291323 gbest389.seq
498677754 gbest39.seq
524289780 gbest390.seq
524290127 gbest391.seq
404906929 gbest392.seq
524290937 gbest393.seq
524289635 gbest394.seq
524290869 gbest395.seq
524289536 gbest396.seq
524289340 gbest397.seq
524290578 gbest398.seq
524290559 gbest399.seq
524288825 gbest4.seq
15069105 gbest40.seq
524295881 gbest400.seq
524294919 gbest401.seq
524290208 gbest402.seq
524288866 gbest403.seq
524290124 gbest404.seq
524290734 gbest405.seq
524288756 gbest406.seq
7754830 gbest407.seq
524289114 gbest408.seq
524290410 gbest409.seq
524291909 gbest41.seq
436698525 gbest410.seq
524291574 gbest42.seq
524290258 gbest43.seq
524290672 gbest44.seq
524289208 gbest45.seq
524289552 gbest46.seq
524290291 gbest47.seq
524290961 gbest48.seq
524291608 gbest49.seq
524289349 gbest5.seq
524290143 gbest50.seq
524292621 gbest51.seq
524288915 gbest52.seq
524291314 gbest53.seq
106783385 gbest54.seq
524290432 gbest55.seq
524291809 gbest56.seq
524291769 gbest57.seq
524291164 gbest58.seq
524290726 gbest59.seq
524290073 gbest6.seq
524290390 gbest60.seq
524290052 gbest61.seq
524290540 gbest62.seq
524291580 gbest63.seq
524290677 gbest64.seq
524291002 gbest65.seq
524292449 gbest66.seq
308965079 gbest67.seq
524289299 gbest68.seq
524290553 gbest69.seq
524289659 gbest7.seq
524289830 gbest70.seq
524288795 gbest71.seq
524290131 gbest72.seq
524290042 gbest73.seq
524292050 gbest74.seq
524290960 gbest75.seq
524289320 gbest76.seq
524290963 gbest77.seq
524291977 gbest78.seq
524290847 gbest79.seq
524289851 gbest8.seq
57003414 gbest80.seq
524289274 gbest81.seq
524290057 gbest82.seq
524289984 gbest83.seq
524289713 gbest84.seq
524288950 gbest85.seq
524291145 gbest86.seq
524289268 gbest87.seq
524290307 gbest88.seq
524291199 gbest89.seq
524288743 gbest9.seq
524291357 gbest90.seq
524289824 gbest91.seq
295200103 gbest92.seq
524289874 gbest93.seq
524291735 gbest94.seq
524289514 gbest95.seq
524292802 gbest96.seq
524288740 gbest97.seq
524289764 gbest98.seq
524288854 gbest99.seq
117705283 gbgen.idx
524289318 gbgss1.seq
524289829 gbgss10.seq
232426352 gbgss100.seq
524289633 gbgss101.seq
524290841 gbgss102.seq
524290586 gbgss103.seq
524289874 gbgss104.seq
524289868 gbgss105.seq
524291007 gbgss106.seq
524289679 gbgss107.seq
524289938 gbgss108.seq
524290450 gbgss109.seq
524291447 gbgss11.seq
524290926 gbgss110.seq
454166716 gbgss111.seq
524291536 gbgss112.seq
524289236 gbgss113.seq
524289672 gbgss114.seq
524289546 gbgss115.seq
524288806 gbgss116.seq
524290577 gbgss117.seq
524291239 gbgss118.seq
524291059 gbgss119.seq
524291952 gbgss12.seq
524290020 gbgss120.seq
117129703 gbgss121.seq
249997645 gbgss122.seq
249999908 gbgss123.seq
250000029 gbgss124.seq
250000145 gbgss125.seq
27249884 gbgss126.seq
249999305 gbgss127.seq
249998059 gbgss128.seq
249997528 gbgss129.seq
524291338 gbgss13.seq
249999266 gbgss130.seq
223609630 gbgss131.seq
249998415 gbgss132.seq
249998761 gbgss133.seq
249998880 gbgss134.seq
250000102 gbgss135.seq
171437117 gbgss136.seq
249997088 gbgss137.seq
249998903 gbgss138.seq
249999973 gbgss139.seq
524289955 gbgss14.seq
249999609 gbgss140.seq
249996931 gbgss141.seq
76888892 gbgss142.seq
250000188 gbgss143.seq
2879209 gbgss144.seq
79504033 gbgss145.seq
249999490 gbgss146.seq
249999462 gbgss147.seq
78034667 gbgss148.seq
249998484 gbgss149.seq
524289002 gbgss15.seq
249999450 gbgss150.seq
249997767 gbgss151.seq
114935056 gbgss152.seq
249999232 gbgss153.seq
249999334 gbgss154.seq
249999609 gbgss155.seq
246354232 gbgss156.seq
249999268 gbgss157.seq
249997406 gbgss158.seq
249999513 gbgss159.seq
524290847 gbgss16.seq
243477113 gbgss160.seq
249999336 gbgss161.seq
249997687 gbgss162.seq
249997948 gbgss163.seq
249998567 gbgss164.seq
116382755 gbgss165.seq
249998013 gbgss166.seq
250000031 gbgss167.seq
249999974 gbgss168.seq
249998152 gbgss169.seq
524290352 gbgss17.seq
94058788 gbgss170.seq
249998892 gbgss171.seq
249999850 gbgss172.seq
249998239 gbgss173.seq
249999582 gbgss174.seq
162386018 gbgss175.seq
249999236 gbgss176.seq
249997993 gbgss177.seq
249998006 gbgss178.seq
249999353 gbgss179.seq
524289873 gbgss18.seq
95926739 gbgss180.seq
249999461 gbgss181.seq
249999350 gbgss182.seq
249998406 gbgss183.seq
249998619 gbgss184.seq
102413401 gbgss185.seq
249999853 gbgss186.seq
249998411 gbgss187.seq
249998272 gbgss188.seq
42310440 gbgss189.seq
150844342 gbgss19.seq
52091543 gbgss190.seq
249998242 gbgss191.seq
249999494 gbgss192.seq
249997995 gbgss193.seq
249997330 gbgss194.seq
249997758 gbgss195.seq
102524929 gbgss196.seq
249998520 gbgss197.seq
249998861 gbgss198.seq
250000017 gbgss199.seq
524289346 gbgss2.seq
524291095 gbgss20.seq
249998774 gbgss200.seq
249998008 gbgss201.seq
118508169 gbgss202.seq
249999561 gbgss203.seq
249998751 gbgss204.seq
249999599 gbgss205.seq
249999028 gbgss206.seq
249999713 gbgss207.seq
16550600 gbgss208.seq
524290032 gbgss21.seq
524290163 gbgss22.seq
524288857 gbgss23.seq
524291258 gbgss24.seq
524289556 gbgss25.seq
524289145 gbgss26.seq
524289442 gbgss27.seq
524291007 gbgss28.seq
524288897 gbgss29.seq
524288877 gbgss3.seq
427929738 gbgss30.seq
524290679 gbgss31.seq
524289719 gbgss32.seq
524290073 gbgss33.seq
524290035 gbgss34.seq
524290617 gbgss35.seq
524290519 gbgss36.seq
524291111 gbgss37.seq
468913984 gbgss38.seq
524291365 gbgss39.seq
524291607 gbgss4.seq
524290157 gbgss40.seq
451323994 gbgss41.seq
524289132 gbgss42.seq
524289503 gbgss43.seq
524289913 gbgss44.seq
524289025 gbgss45.seq
156684744 gbgss46.seq
524291699 gbgss47.seq
524289024 gbgss48.seq
524290162 gbgss49.seq
115085286 gbgss5.seq
524289623 gbgss50.seq
524288948 gbgss51.seq
524289795 gbgss52.seq
524290040 gbgss53.seq
524289429 gbgss54.seq
524289348 gbgss55.seq
524289100 gbgss56.seq
403163907 gbgss57.seq
524288884 gbgss58.seq
524288774 gbgss59.seq
524289571 gbgss6.seq
524290996 gbgss60.seq
415173955 gbgss61.seq
524290837 gbgss62.seq
524291167 gbgss63.seq
524290183 gbgss64.seq
524289232 gbgss65.seq
524291124 gbgss66.seq
524289384 gbgss67.seq
524289855 gbgss68.seq
524291301 gbgss69.seq
524290775 gbgss7.seq
524291174 gbgss70.seq
524290481 gbgss71.seq
524291345 gbgss72.seq
411220153 gbgss73.seq
524290382 gbgss74.seq
524289713 gbgss75.seq
524289948 gbgss76.seq
524289654 gbgss77.seq
524289210 gbgss78.seq
524288753 gbgss79.seq
524290111 gbgss8.seq
524289236 gbgss80.seq
524291317 gbgss81.seq
524291578 gbgss82.seq
524290104 gbgss83.seq
524289027 gbgss84.seq
228405899 gbgss85.seq
524291900 gbgss86.seq
524289498 gbgss87.seq
524290597 gbgss88.seq
524290444 gbgss89.seq
524289011 gbgss9.seq
524291387 gbgss90.seq
524291827 gbgss91.seq
524291391 gbgss92.seq
524292075 gbgss93.seq
524290593 gbgss94.seq
524290404 gbgss95.seq
524290950 gbgss96.seq
524291640 gbgss97.seq
524290861 gbgss98.seq
524290859 gbgss99.seq
249987886 gbhtc1.seq
250000236 gbhtc10.seq
249999429 gbhtc11.seq
62041312 gbhtc12.seq
249998738 gbhtc13.seq
250000070 gbhtc14.seq
45452470 gbhtc15.seq
250000065 gbhtc2.seq
249994635 gbhtc3.seq
249991467 gbhtc4.seq
249991812 gbhtc5.seq
249986737 gbhtc6.seq
249998746 gbhtc7.seq
71970158 gbhtc8.seq
249999112 gbhtc9.seq
249955003 gbhtg1.seq
249805779 gbhtg10.seq
249822300 gbhtg100.seq
249997530 gbhtg101.seq
249708148 gbhtg102.seq
249907507 gbhtg103.seq
8546022 gbhtg104.seq
249845479 gbhtg105.seq
249901160 gbhtg106.seq
249824476 gbhtg107.seq
249802264 gbhtg108.seq
249963805 gbhtg109.seq
1643239 gbhtg11.seq
249960019 gbhtg110.seq
249885605 gbhtg111.seq
249878947 gbhtg112.seq
37217905 gbhtg113.seq
249650832 gbhtg114.seq
249841874 gbhtg115.seq
249742094 gbhtg116.seq
249789613 gbhtg117.seq
249920125 gbhtg118.seq
94350576 gbhtg119.seq
249672647 gbhtg12.seq
249781124 gbhtg120.seq
249757442 gbhtg121.seq
249916794 gbhtg122.seq
249966194 gbhtg123.seq
249934769 gbhtg124.seq
179078157 gbhtg125.seq
249841354 gbhtg126.seq
249806237 gbhtg127.seq
249761566 gbhtg128.seq
249988758 gbhtg129.seq
249784449 gbhtg13.seq
249832439 gbhtg130.seq
249931468 gbhtg131.seq
249762414 gbhtg132.seq
249986947 gbhtg133.seq
224224334 gbhtg134.seq
249745013 gbhtg14.seq
249734004 gbhtg15.seq
249827719 gbhtg16.seq
249994801 gbhtg17.seq
249729949 gbhtg18.seq
249937791 gbhtg19.seq
249980815 gbhtg2.seq
249948717 gbhtg20.seq
242641861 gbhtg21.seq
249850438 gbhtg22.seq
249856880 gbhtg23.seq
249768370 gbhtg24.seq
249893769 gbhtg25.seq
249812036 gbhtg26.seq
249795252 gbhtg27.seq
249875918 gbhtg28.seq
249712630 gbhtg29.seq
249996540 gbhtg3.seq
249951807 gbhtg30.seq
228801790 gbhtg31.seq
249774678 gbhtg32.seq
249884086 gbhtg33.seq
249784340 gbhtg34.seq
249937825 gbhtg35.seq
249968828 gbhtg36.seq
249861011 gbhtg37.seq
249830883 gbhtg38.seq
249954288 gbhtg39.seq
249897858 gbhtg4.seq
249844429 gbhtg40.seq
229615032 gbhtg41.seq
249817421 gbhtg42.seq
249802595 gbhtg43.seq
249916541 gbhtg44.seq
249943584 gbhtg45.seq
249929616 gbhtg46.seq
249093490 gbhtg47.seq
249730060 gbhtg48.seq
249736080 gbhtg49.seq
249988734 gbhtg5.seq
249953039 gbhtg50.seq
237517017 gbhtg51.seq
249907517 gbhtg52.seq
249801609 gbhtg53.seq
249806448 gbhtg54.seq
249922873 gbhtg55.seq
249982822 gbhtg56.seq
20953212 gbhtg57.seq
249812041 gbhtg58.seq
249750409 gbhtg59.seq
249769696 gbhtg6.seq
249991093 gbhtg60.seq
249987028 gbhtg61.seq
228965169 gbhtg62.seq
249982315 gbhtg63.seq
249759657 gbhtg64.seq
249955237 gbhtg65.seq
249957388 gbhtg66.seq
249870689 gbhtg67.seq
17370772 gbhtg68.seq
249960711 gbhtg69.seq
249851846 gbhtg7.seq
249973586 gbhtg70.seq
249953795 gbhtg71.seq
249734627 gbhtg72.seq
228481278 gbhtg73.seq
249706143 gbhtg74.seq
249875367 gbhtg75.seq
249952151 gbhtg76.seq
249833953 gbhtg77.seq
248594530 gbhtg78.seq
249883648 gbhtg79.seq
249791419 gbhtg8.seq
249970880 gbhtg80.seq
249969001 gbhtg81.seq
249935794 gbhtg82.seq
218843248 gbhtg83.seq
249923525 gbhtg84.seq
249839027 gbhtg85.seq
249998852 gbhtg86.seq
249818654 gbhtg87.seq
228814963 gbhtg88.seq
249861755 gbhtg89.seq
249971598 gbhtg9.seq
249912443 gbhtg90.seq
249909501 gbhtg91.seq
249763936 gbhtg92.seq
226827115 gbhtg93.seq
249743597 gbhtg94.seq
249991173 gbhtg95.seq
249787524 gbhtg96.seq
249807405 gbhtg97.seq
174594762 gbhtg98.seq
249786185 gbhtg99.seq
249997962 gbinv1.seq
249996360 gbinv10.seq
149432942 gbinv11.seq
249997873 gbinv12.seq
249995723 gbinv13.seq
249998688 gbinv14.seq
248931968 gbinv15.seq
249859557 gbinv16.seq
249997037 gbinv17.seq
249998818 gbinv18.seq
230033385 gbinv19.seq
249948121 gbinv2.seq
234976751 gbinv3.seq
249977700 gbinv4.seq
249998772 gbinv5.seq
248438094 gbinv6.seq
249999898 gbinv7.seq
249998328 gbinv8.seq
249999834 gbinv9.seq
140511691 gbjou1.idx
189792971 gbjou2.idx
200114955 gbjou3.idx
278360364 gbjou4.idx
302400075 gbjou5.idx
290498972 gbjou6.idx
278323732 gbjou7.idx
63022283 gbjou8.idx
234186900 gbkey1.idx
177515755 gbkey2.idx
205024355 gbkey3.idx
93654330 gbkey4.idx
249762679 gbmam1.seq
249999047 gbmam2.seq
249992910 gbmam3.seq
249982040 gbmam4.seq
249998475 gbmam5.seq
27338031 gbmam6.seq
57086939 gbnew.txt
249999607 gbpat1.seq
249999823 gbpat10.seq
121007762 gbpat100.seq
178894637 gbpat11.seq
249999672 gbpat12.seq
249999538 gbpat13.seq
249999424 gbpat14.seq
249999886 gbpat15.seq
249998683 gbpat16.seq
65949086 gbpat17.seq
249999229 gbpat18.seq
249999331 gbpat19.seq
249999577 gbpat2.seq
249999934 gbpat20.seq
249999820 gbpat21.seq
185120412 gbpat22.seq
249999216 gbpat23.seq
249801003 gbpat24.seq
249999212 gbpat25.seq
249999415 gbpat26.seq
71196419 gbpat27.seq
249998718 gbpat28.seq
249999268 gbpat29.seq
249999257 gbpat3.seq
250000223 gbpat30.seq
249999115 gbpat31.seq
249999447 gbpat32.seq
180146081 gbpat33.seq
249999629 gbpat34.seq
249999186 gbpat35.seq
249998629 gbpat36.seq
249998668 gbpat37.seq
133704201 gbpat38.seq
249997426 gbpat39.seq
249998049 gbpat4.seq
249978230 gbpat40.seq
249999855 gbpat41.seq
249991829 gbpat42.seq
249996158 gbpat43.seq
249998922 gbpat44.seq
249998898 gbpat45.seq
249987712 gbpat46.seq
173724157 gbpat47.seq
249998836 gbpat48.seq
249998527 gbpat49.seq
71794649 gbpat5.seq
249999531 gbpat50.seq
250000197 gbpat51.seq
222934467 gbpat52.seq
249999057 gbpat53.seq
249999815 gbpat54.seq
249999501 gbpat55.seq
164408612 gbpat56.seq
249999861 gbpat57.seq
249998453 gbpat58.seq
250000087 gbpat59.seq
249999527 gbpat6.seq
249999934 gbpat60.seq
134319183 gbpat61.seq
249996200 gbpat62.seq
249999978 gbpat63.seq
249999690 gbpat64.seq
249998210 gbpat65.seq
249999753 gbpat66.seq
194993779 gbpat67.seq
249999879 gbpat68.seq
249999564 gbpat69.seq
249999610 gbpat7.seq
249929730 gbpat70.seq
249824878 gbpat71.seq
249998855 gbpat72.seq
192932886 gbpat73.seq
249999533 gbpat74.seq
249999632 gbpat75.seq
249996399 gbpat76.seq
80124350 gbpat77.seq
249999760 gbpat78.seq
249999532 gbpat79.seq
249999950 gbpat8.seq
249998866 gbpat80.seq
250000101 gbpat81.seq
249868821 gbpat82.seq
66835451 gbpat83.seq
249999440 gbpat84.seq
249999844 gbpat85.seq
249999887 gbpat86.seq
249999094 gbpat87.seq
53428721 gbpat88.seq
250000217 gbpat89.seq
249999812 gbpat9.seq
249999792 gbpat90.seq
250000111 gbpat91.seq
89154176 gbpat92.seq
250000237 gbpat93.seq
249968113 gbpat94.seq
249999171 gbpat95.seq
250000135 gbpat96.seq
249998788 gbpat97.seq
249999223 gbpat98.seq
249999916 gbpat99.seq
100081275 gbphg1.seq
249990458 gbpln1.seq
249874841 gbpln10.seq
249843986 gbpln11.seq
86756412 gbpln12.seq
249997942 gbpln13.seq
249991131 gbpln14.seq
249997558 gbpln15.seq
247890135 gbpln16.seq
249869217 gbpln17.seq
248567866 gbpln18.seq
249999374 gbpln19.seq
249762109 gbpln2.seq
23471291 gbpln20.seq
249999395 gbpln21.seq
94408891 gbpln22.seq
249997930 gbpln23.seq
249998469 gbpln24.seq
249475243 gbpln25.seq
249999227 gbpln26.seq
249999241 gbpln27.seq
249998489 gbpln28.seq
250000053 gbpln29.seq
249910471 gbpln3.seq
113959754 gbpln30.seq
249997660 gbpln31.seq
249999167 gbpln32.seq
249999983 gbpln33.seq
249998098 gbpln34.seq
250000160 gbpln35.seq
249999295 gbpln36.seq
249998093 gbpln37.seq
249998091 gbpln38.seq
189183663 gbpln39.seq
249984146 gbpln4.seq
249934646 gbpln5.seq
249999910 gbpln6.seq
207431353 gbpln7.seq
249999059 gbpln8.seq
250000149 gbpln9.seq
148930929 gbpri1.seq
249971272 gbpri10.seq
122739633 gbpri11.seq
249966221 gbpri12.seq
249900325 gbpri13.seq
249891160 gbpri14.seq
249940560 gbpri15.seq
249938249 gbpri16.seq
249810264 gbpri17.seq
249976417 gbpri18.seq
249887769 gbpri19.seq
249903752 gbpri2.seq
249847140 gbpri20.seq
250000019 gbpri21.seq
40085445 gbpri22.seq
180070843 gbpri23.seq
249999615 gbpri24.seq
204683615 gbpri25.seq
249997366 gbpri26.seq
249857977 gbpri27.seq
249826325 gbpri28.seq
249868667 gbpri29.seq
249835727 gbpri3.seq
249834770 gbpri30.seq
249973214 gbpri31.seq
249810577 gbpri32.seq
249996477 gbpri33.seq
84902435 gbpri34.seq
249995363 gbpri35.seq
249998838 gbpri36.seq
249999938 gbpri37.seq
249997977 gbpri38.seq
249864036 gbpri39.seq
249852603 gbpri4.seq
249998424 gbpri40.seq
83023515 gbpri41.seq
249857147 gbpri5.seq
249927876 gbpri6.seq
249970365 gbpri7.seq
249940127 gbpri8.seq
249762812 gbpri9.seq
266918 gbrel.txt
249822986 gbrod1.seq
249989786 gbrod10.seq
46839377 gbrod11.seq
249991565 gbrod12.seq
249841479 gbrod13.seq
249954260 gbrod14.seq
249880505 gbrod15.seq
249871577 gbrod16.seq
249889843 gbrod17.seq
249996632 gbrod18.seq
73998904 gbrod19.seq
249861907 gbrod2.seq
249989891 gbrod20.seq
249896347 gbrod21.seq
226408925 gbrod22.seq
249999205 gbrod23.seq
249995212 gbrod24.seq
249845828 gbrod25.seq
249920451 gbrod26.seq
249998976 gbrod27.seq
232720809 gbrod28.seq
249806731 gbrod3.seq
249692718 gbrod4.seq
249720980 gbrod5.seq
249771141 gbrod6.seq
249749088 gbrod7.seq
249823470 gbrod8.seq
249821141 gbrod9.seq
2392598170 gbsdr1.txt
5194003226 gbsdr2.txt
2215546092 gbsdr3.txt
139396322 gbsec.idx
249997067 gbsts1.seq
249997362 gbsts10.seq
212226966 gbsts11.seq
249996524 gbsts12.seq
249997606 gbsts13.seq
249999051 gbsts14.seq
249999111 gbsts15.seq
26927942 gbsts16.seq
249999803 gbsts17.seq
249996731 gbsts18.seq
249998395 gbsts19.seq
249999395 gbsts2.seq
118376647 gbsts20.seq
249999978 gbsts3.seq
249996919 gbsts4.seq
35348302 gbsts5.seq
249997222 gbsts6.seq
249997899 gbsts7.seq
249998458 gbsts8.seq
249997188 gbsts9.seq
249996107 gbsyn1.seq
237996797 gbsyn2.seq
249999667 gbtsa1.seq
249999339 gbtsa2.seq
249999490 gbtsa3.seq
64796976 gbtsa4.seq
447342 gbuna1.seq
249995228 gbvrl1.seq
249999567 gbvrl10.seq
249998158 gbvrl11.seq
249997328 gbvrl12.seq
249994055 gbvrl13.seq
8401225 gbvrl14.seq
249998631 gbvrl2.seq
249980621 gbvrl3.seq
249995774 gbvrl4.seq
129727401 gbvrl5.seq
249999019 gbvrl6.seq
249998617 gbvrl7.seq
249998882 gbvrl8.seq
249999352 gbvrl9.seq
249946935 gbvrt1.seq
249937966 gbvrt10.seq
249901467 gbvrt11.seq
176017506 gbvrt12.seq
249996787 gbvrt13.seq
249970943 gbvrt14.seq
249968268 gbvrt15.seq
249999207 gbvrt16.seq
249936574 gbvrt17.seq
249994705 gbvrt18.seq
249996836 gbvrt19.seq
249998299 gbvrt2.seq
249998489 gbvrt20.seq
5968191 gbvrt21.seq
249732185 gbvrt3.seq
250000033 gbvrt4.seq
32198250 gbvrt5.seq
249994506 gbvrt6.seq
249996361 gbvrt7.seq
249866392 gbvrt8.seq
249935581 gbvrt9.seq
2.2.6 Per-Division Statistics
The following table provides a per-division breakdown of the number of
sequence entries and the total number of bases of DNA/RNA in each non-CON
and non-WGS sequence data file.
CON division records, which are constructed from other sequence records,
are not represented here because their inclusion would essentially be a
form of double-counting.
Sequences from Whole Genome Shotgun (WGS) sequencing projects are not
represented here because all WGS project data are made available on
a per-project basis:
ftp://ftp.ncbi.nih.gov/ncbi-asn1/wgs
ftp://ftp.ncbi.nih.gov/genbank/wgs
rather than being incorporated within the GenBank release distribution.
Division Entries Bases
BCT1 49148 92018229
BCT10 63 117808727
BCT11 16194 106586525
BCT12 53626 85214234
BCT13 94 110535707
BCT14 151 96466604
BCT15 60 109557523
BCT16 52 112042350
BCT17 46 111873353
BCT18 57 113989395
BCT19 67 112830215
BCT2 66 110398637
BCT20 33 84321683
BCT21 40 106881287
BCT22 58 105533856
BCT23 74 111675186
BCT24 55 110045255
BCT25 63 106845562
BCT26 55 111297421
BCT27 53 109574112
BCT28 46 109006305
BCT29 93 106054418
BCT3 52 110808721
BCT30 58 107600868
BCT31 65 108501758
BCT32 123 108842042
BCT33 33 58347023
BCT34 202 108966790
BCT35 44 107367804
BCT36 40 107342165
BCT37 63 101786877
BCT38 39 108180076
BCT39 240 99386121
BCT4 6962 107528604
BCT40 425 97248713
BCT41 1589 2511823
BCT42 3176 5212972
BCT43 6347 7901820
BCT44 12648 15076097
BCT45 25632 27763281
BCT46 50581 54060759
BCT47 76764 77537012
BCT48 69366 77953706
BCT49 6364 103159927
BCT5 31496 51172220
BCT50 41023 90327816
BCT51 66284 83511039
BCT52 5435 10048831
BCT6 39038 86911160
BCT7 6441 98374785
BCT8 14427 93307049
BCT9 5215 103446803
ENV1 94016 71993411
ENV10 83450 87086413
ENV11 18515 16329797
ENV12 85320 81225477
ENV13 121332 42815225
ENV14 90405 79077763
ENV15 99426 69095291
ENV16 74905 83850664
ENV17 30928 41718771
ENV18 71622 83926090
ENV19 104395 66885705
ENV2 89777 69742528
ENV20 134865 32298248
ENV21 134298 31619995
ENV22 129267 46145450
ENV23 57391 35265653
ENV3 82962 82501364
ENV4 89270 79380087
ENV5 89769 83784978
ENV6 49938 19527065
ENV7 131840 31330706
ENV8 87067 71137416
ENV9 94207 73643373
EST1 162724 63096884
EST10 171880 70582810
EST100 232935 143233718
EST101 220765 113977286
EST102 209234 102676423
EST103 159199 103542096
EST104 61130 52090083
EST105 169151 111428783
EST106 170796 113795521
EST107 109418 61171439
EST108 64275 23526839
EST109 64370 22877425
EST11 168409 73440432
EST110 64753 23939116
EST111 64658 27037633
EST112 64808 22296359
EST113 66147 26264658
EST114 64107 27879331
EST115 64719 26699610
EST116 64901 26779967
EST117 64712 25492255
EST118 61286 40301964
EST119 181963 87493399
EST12 196386 87636257
EST120 174469 83395774
EST121 155602 95802347
EST122 149844 94744934
EST123 158934 99786104
EST124 101087 55815533
EST125 208520 103371289
EST126 155574 79218396
EST127 162866 86222063
EST128 162039 92541442
EST129 159422 85142477
EST13 8 2589
EST130 179324 108505252
EST131 213508 115093820
EST132 168383 101583402
EST133 172644 77897866
EST134 149499 84303183
EST135 153316 92636649
EST136 161429 94379674
EST137 33350 16979897
EST138 194364 110939769
EST139 234802 103443444
EST14 223626 113057838
EST140 155572 91938632
EST141 172899 95671933
EST142 149697 86741438
EST143 137559 88808091
EST144 155888 92039940
EST145 150161 77228517
EST146 125342 65473926
EST147 124892 63621432
EST148 130482 55771823
EST149 130013 51510044
EST15 169699 108255181
EST150 131525 51650645
EST151 86798 34233673
EST152 176494 87013917
EST153 185617 86776780
EST154 161027 112271249
EST155 221075 132247582
EST156 217630 123305904
EST157 170099 92019292
EST158 144039 113985113
EST159 147554 91897663
EST16 187978 112299805
EST160 145741 82559434
EST161 146161 77506106
EST162 158841 94735352
EST163 164727 92366999
EST164 40964 24376962
EST165 160331 98864210
EST166 188323 105785005
EST167 154078 86591149
EST168 130942 68417922
EST169 181365 99804276
EST17 196187 121482986
EST170 173500 103578145
EST171 151560 91621255
EST172 172696 88022392
EST173 133867 86066443
EST174 193612 111475265
EST175 156174 90068939
EST176 116574 70160717
EST177 166921 95672830
EST178 172764 104574732
EST179 183343 131897037
EST18 194464 119197377
EST180 160611 114770076
EST181 190853 90873976
EST182 153009 117052612
EST183 150956 90657534
EST184 149553 78135957
EST185 173528 81158735
EST186 152455 84134040
EST187 160659 102517930
EST188 152409 90982675
EST189 163380 102620017
EST19 171013 115179520
EST190 116462 66325771
EST191 134084 90867492
EST192 135053 90288036
EST193 146439 90479029
EST194 134811 90482425
EST195 122871 86699536
EST196 186494 97659621
EST197 177646 98057967
EST198 177175 98301525
EST199 174402 97828338
EST2 164869 62097811
EST20 208076 108131322
EST200 172101 96114709
EST201 176279 98440418
EST202 177586 96643959
EST203 165052 90672426
EST204 196157 105673690
EST205 209868 113545409
EST206 181303 107563712
EST207 194026 111481621
EST208 199617 122166719
EST209 194817 119207671
EST21 226471 109685146
EST210 191297 119456375
EST211 186595 117646627
EST212 226356 143406159
EST213 235527 115249139
EST214 77274 54733308
EST215 149226 95358407
EST216 192773 105387941
EST217 185842 121352772
EST218 106932 90523954
EST219 139633 115137762
EST22 171445 47206817
EST220 166080 96122970
EST221 149224 107861059
EST222 223654 105167585
EST223 144609 103378888
EST224 140362 99265022
EST225 140901 100231612
EST226 123980 88773970
EST227 147304 92391975
EST228 69470 46155851
EST229 135166 98769079
EST23 126919 36585377
EST230 141148 100798514
EST231 125733 85875773
EST232 135055 88282775
EST233 161346 119072315
EST234 143522 120007707
EST235 137085 112303577
EST236 163548 99984765
EST237 179666 113725571
EST238 151967 100596033
EST239 182320 121375040
EST24 132478 44189561
EST240 162637 111385930
EST241 143822 108361226
EST242 116337 69867099
EST243 227901 96738507
EST244 171159 117840244
EST245 169062 77103339
EST246 225313 44746650
EST247 218223 83938514
EST248 174729 132827999
EST249 170089 103574233
EST25 71624 29878081
EST250 173916 111218178
EST251 167520 114037259
EST252 174608 118275329
EST253 193257 102871530
EST254 14218 6698772
EST255 183333 105293315
EST256 168840 122585166
EST257 217014 105411829
EST258 281368 123208671
EST259 212919 123684962
EST26 220618 102545937
EST260 186869 37229594
EST261 261962 126153156
EST262 163856 95515358
EST263 164632 107128921
EST264 165184 108572443
EST265 74022 50030641
EST266 202067 106078644
EST267 164646 110921978
EST268 195335 114085610
EST269 207006 44185618
EST27 225224 112771786
EST270 201972 53202487
EST271 189412 108052110
EST272 182001 120120639
EST273 178418 116826352
EST274 182523 104785194
EST275 144836 97967666
EST276 184000 82017238
EST277 47261 29278744
EST278 137868 89042970
EST279 167368 97070328
EST28 187117 84903358
EST280 275099 28790423
EST281 258415 39091372
EST282 154032 107891686
EST283 166889 116016158
EST284 158748 99140596
EST285 210088 70304220
EST286 236936 65207152
EST287 163961 106617916
EST288 149039 87974645
EST289 150760 83121902
EST29 161848 70576020
EST290 186069 123649376
EST291 189192 114628686
EST292 156855 112665748
EST293 196193 112488531
EST294 181447 102138609
EST295 184182 107578074
EST296 191803 65554788
EST297 190629 70751001
EST298 190467 72064844
EST299 192300 73103029
EST3 163315 58402707
EST30 176117 70912671
EST300 81416 46048239
EST301 185346 129585509
EST302 162529 95337421
EST303 160281 92398467
EST304 128773 102732254
EST305 154745 100683889
EST306 176938 99870162
EST307 160692 99580817
EST308 173565 100973793
EST309 162367 108383271
EST31 157659 67453539
EST310 146882 103700326
EST311 169480 107853533
EST312 166546 126883554
EST313 131527 111436281
EST314 177475 109761222
EST315 144105 83218921
EST316 145581 99566019
EST317 144373 97459873
EST318 140469 92412907
EST319 149380 103082636
EST32 170967 74872365
EST320 153757 103340172
EST321 185522 110215892
EST322 162711 93403037
EST323 164412 96216717
EST324 154528 110582598
EST325 164806 100077019
EST326 145894 92531277
EST327 46987 29931965
EST328 166053 102381561
EST329 201557 107414192
EST33 182061 75136237
EST330 125533 68471125
EST331 127852 84657971
EST332 138404 86996741
EST333 134287 85691819
EST334 154313 85250458
EST335 224172 87766513
EST336 176654 88784798
EST337 175037 106276807
EST338 167505 99952068
EST339 130844 86830439
EST34 97740 27684975
EST340 137429 91442518
EST341 19624 12781061
EST342 163271 103594800
EST343 182497 119943066
EST344 162705 84224999
EST345 188346 116844957
EST346 149617 109203416
EST347 73955 44616880
EST348 121901 66915696
EST349 135540 80875257
EST35 98478 30010703
EST350 145144 68762283
EST351 161710 33079570
EST352 161640 33193325
EST353 178931 90684657
EST354 221073 124889883
EST355 32072 14679450
EST356 258970 118376078
EST357 209544 107527949
EST358 150273 97014941
EST359 138604 88683334
EST36 98727 31262874
EST360 150125 88951039
EST361 190167 117787313
EST362 239565 75053012
EST363 193367 118416685
EST364 143300 67511130
EST365 209526 109264504
EST366 196361 108789886
EST367 165093 111080337
EST368 198166 114633492
EST369 174191 26686641
EST37 97557 29687431
EST370 164424 13709580
EST371 159838 24529022
EST372 161760 32916920
EST373 161922 32531807
EST374 153738 74522960
EST375 175266 102183224
EST376 175741 119895032
EST377 171242 100084053
EST378 162671 113281442
EST379 55948 39123566
EST38 99962 30242763
EST380 142839 100978595
EST381 151687 99485353
EST382 180406 130100131
EST383 152870 95302731
EST384 175114 68863910
EST385 171678 27876659
EST386 165969 110511003
EST387 155024 103372086
EST388 185367 119737585
EST389 140807 80697083
EST39 102684 30680740
EST390 201134 122042482
EST391 149890 104675948
EST392 105504 72084989
EST393 142198 92806154
EST394 158182 94630740
EST395 177307 97358452
EST396 161445 96838724
EST397 169767 100178208
EST398 177985 108934127
EST399 102099 73415199
EST4 174472 68804432
EST40 3241 853046
EST400 79931 61157469
EST401 111687 71848195
EST402 135131 81031413
EST403 130708 84091540
EST404 142200 86736468
EST405 159411 96558167
EST406 182439 96693741
EST407 2287 845415
EST408 164704 59654088
EST409 172142 60490789
EST41 101463 54476588
EST410 138588 54124424
EST42 124280 52815471
EST43 168983 87927303
EST44 171187 69831769
EST45 173311 94231578
EST46 174239 89773434
EST47 165746 82102543
EST48 167058 95687745
EST49 160140 88015306
EST5 172021 67418015
EST50 164161 95865841
EST51 163831 90879094
EST52 154657 75078753
EST53 156296 94312760
EST54 34648 13919149
EST55 171474 68769616
EST56 160402 78090935
EST57 176327 102561259
EST58 158202 98330629
EST59 162879 105529639
EST6 180188 70820522
EST60 168571 102650923
EST61 167710 108675254
EST62 177963 78077927
EST63 173737 101364576
EST64 151705 79450749
EST65 155781 89031190
EST66 159373 95541283
EST67 95875 56370410
EST68 148863 88192172
EST69 195742 108380735
EST7 170982 73718356
EST70 214859 113175075
EST71 201936 115385102
EST72 211262 125003777
EST73 195006 113276591
EST74 153897 78406177
EST75 138107 64991778
EST76 147499 78415053
EST77 159350 108708825
EST78 150262 79645118
EST79 147922 84723860
EST8 183704 73247753
EST80 15280 16563044
EST81 191116 95525522
EST82 219724 128954924
EST83 211874 112015068
EST84 221821 104262898
EST85 202543 89636177
EST86 151971 92674717
EST87 153409 80867317
EST88 169661 90334797
EST89 155689 87100202
EST9 171026 74513822
EST90 156482 97779503
EST91 155641 107030956
EST92 90016 46915370
EST93 156003 119442687
EST94 144315 106421285
EST95 149560 108869731
EST96 142347 85536394
EST97 165443 93218240
EST98 179711 113490281
EST99 259358 157485165
GSS1 205737 90151884
GSS10 136885 66983898
GSS100 85167 43034540
GSS101 195380 123718914
GSS102 171367 114186705
GSS103 201451 118373225
GSS104 207406 136080694
GSS105 215150 141912749
GSS106 213762 143292141
GSS107 209587 148457681
GSS108 210968 146704110
GSS109 209200 149200727
GSS11 139072 74912075
GSS110 206541 150947875
GSS111 157133 116817970
GSS112 135293 85662155
GSS113 173616 81341134
GSS114 192704 87081854
GSS115 176117 150096605
GSS116 183148 128728726
GSS117 188364 152081657
GSS118 193546 131442501
GSS119 192486 134670951
GSS12 150162 71374253
GSS120 199464 111167010
GSS121 34412 18165570
GSS122 87169 64174921
GSS123 83557 62796712
GSS124 103734 48638247
GSS125 68568 58500765
GSS126 7284 6736323
GSS127 68689 57997997
GSS128 69318 56731875
GSS129 69558 56262131
GSS13 147467 72169409
GSS130 71310 56073507
GSS131 68163 51560938
GSS132 75915 58209304
GSS133 87237 74898207
GSS134 81830 44609896
GSS135 93317 45114492
GSS136 56347 44171450
GSS137 77438 61353715
GSS138 69655 58699702
GSS139 67711 62707076
GSS14 180923 90785082
GSS140 61743 53255060
GSS141 95658 42987107
GSS142 21258 4959164
GSS143 112672 70634090
GSS144 1088 802743
GSS145 23226 28867035
GSS146 106621 69122590
GSS147 85469 35209982
GSS148 37301 23210969
GSS149 120690 73137800
GSS15 165652 96400095
GSS150 118608 75280747
GSS151 105255 63705478
GSS152 47447 23060733
GSS153 83102 54651503
GSS154 95630 61300959
GSS155 107319 78552403
GSS156 106422 76714273
GSS157 106058 79947480
GSS158 103996 80016519
GSS159 76376 51040365
GSS16 174865 93148028
GSS160 104570 63290945
GSS161 109868 66415305
GSS162 106323 59388992
GSS163 68303 37395892
GSS164 70149 39050322
GSS165 36527 17435683
GSS166 85481 46023427
GSS167 97119 55907586
GSS168 94982 49597190
GSS169 96286 55922591
GSS17 196831 125921881
GSS170 42132 23615490
GSS171 114638 43642267
GSS172 117085 39368203
GSS173 108676 55514545
GSS174 101471 78372335
GSS175 64751 40741499
GSS176 95891 36542252
GSS177 95417 37268709
GSS178 96671 35161518
GSS179 94285 39167432
GSS18 194793 105974188
GSS180 37736 17626556
GSS181 103939 66277823
GSS182 94551 61190929
GSS183 95128 60357048
GSS184 94773 60868501
GSS185 33134 33073975
GSS186 83875 28283031
GSS187 84362 27389237
GSS188 85085 25924821
GSS189 14410 4314220
GSS19 58045 24853934
GSS190 16547 7508221
GSS191 92719 59684106
GSS192 89364 47776813
GSS193 91531 63450540
GSS194 91165 63958093
GSS195 90953 64327876
GSS196 37422 26092587
GSS197 89852 65233772
GSS198 88611 68592611
GSS199 83719 79618297
GSS2 186321 94757946
GSS20 186922 105326832
GSS200 84139 79031087
GSS201 86536 72315437
GSS202 45040 27003629
GSS203 94787 57200301
GSS204 93517 59081345
GSS205 93900 58510429
GSS206 94714 57307691
GSS207 94279 58016121
GSS208 6701 3333254
GSS21 169127 115836885
GSS22 174106 99568504
GSS23 192369 138527305
GSS24 201031 123494695
GSS25 175471 104277105
GSS26 197043 119487367
GSS27 167868 161574100
GSS28 190632 134457391
GSS29 167162 120551392
GSS3 178459 89335065
GSS30 158800 105369328
GSS31 201674 130043762
GSS32 222403 119401032
GSS33 224622 116464431
GSS34 225847 114839942
GSS35 216729 131101779
GSS36 202761 159857212
GSS37 199140 148663094
GSS38 188814 55216002
GSS39 186759 106542424
GSS4 169841 86595170
GSS40 191190 139152629
GSS41 150052 134697697
GSS42 187635 102192144
GSS43 178165 125540299
GSS44 189664 127763132
GSS45 194848 125935406
GSS46 53376 50174995
GSS47 175399 104013141
GSS48 170991 105338684
GSS49 171834 105224139
GSS5 37544 23513994
GSS50 186393 124599840
GSS51 190586 117318760
GSS52 190213 125116072
GSS53 189040 119447859
GSS54 189612 132326093
GSS55 193162 112115566
GSS56 194518 124517461
GSS57 139558 107273760
GSS58 176476 141454714
GSS59 164986 113500649
GSS6 165210 86460572
GSS60 164779 113943857
GSS61 148738 100740798
GSS62 160469 131406651
GSS63 173051 144906954
GSS64 187621 121769642
GSS65 207367 129089824
GSS66 196667 113307985
GSS67 251745 131206229
GSS68 157528 105633382
GSS69 165274 126099813
GSS7 168976 80580102
GSS70 165920 126999039
GSS71 168036 126901294
GSS72 184858 114659919
GSS73 154005 113583665
GSS74 204050 129689291
GSS75 174727 115109534
GSS76 204933 135093898
GSS77 228102 77375903
GSS78 187368 115295212
GSS79 125986 93231840
GSS8 169076 90832874
GSS80 148425 119507303
GSS81 143035 118701477
GSS82 146611 118593717
GSS83 146502 121757582
GSS84 147311 120547377
GSS85 66440 55012182
GSS86 153116 125189617
GSS87 151211 119797798
GSS88 147162 116207624
GSS89 146029 118503255
GSS9 138817 67702797
GSS90 148214 123451695
GSS91 151860 123280019
GSS92 150921 121305940
GSS93 149487 124131451
GSS94 149686 123749265
GSS95 150354 122457806
GSS96 147975 115583377
GSS97 145074 117966648
GSS98 148906 124769077
GSS99 184838 128727291
HTC1 25164 27215418
HTC10 66366 67961839
HTC11 72870 78424760
HTC12 16675 24334708
HTC13 66997 61008541
HTC14 68364 69532114
HTC15 16464 11348270
HTC2 16164 36511526
HTC3 16149 36817210
HTC4 16389 35869962
HTC5 16090 40460616
HTC6 16183 37869996
HTC7 55417 33894056
HTC8 28886 15587525
HTC9 63034 76335598
HTG1 1318 188816333
HTG10 1297 186286287
HTG100 989 189369425
HTG101 999 189436314
HTG102 985 189268172
HTG103 1165 190534513
HTG104 45 6486254
HTG105 1104 189778741
HTG106 1050 189717830
HTG107 1398 191353958
HTG108 1322 190855228
HTG109 1617 191199847
HTG11 9 1245953
HTG110 1427 190989779
HTG111 1156 190064332
HTG112 1096 185358626
HTG113 181 28050854
HTG114 1626 188270914
HTG115 965 193198557
HTG116 744 168699171
HTG117 1086 193685337
HTG118 1104 193096002
HTG119 389 73095019
HTG12 1451 183812880
HTG120 1031 189752857
HTG121 1054 192606152
HTG122 1162 192088464
HTG123 1080 193006346
HTG124 1076 193046837
HTG125 790 138239998
HTG126 1123 192622328
HTG127 1100 192363970
HTG128 1081 192362656
HTG129 1104 192504106
HTG13 875 191643337
HTG130 1095 191833947
HTG131 1090 192385357
HTG132 1111 191988432
HTG133 1102 192429074
HTG134 1174 169889217
HTG14 749 192036027
HTG15 744 191917504
HTG16 783 191989084
HTG17 801 191954200
HTG18 770 191929668
HTG19 2068 170862462
HTG2 2464 186026155
HTG20 1090 187538943
HTG21 918 184004410
HTG22 785 191723345
HTG23 927 190139494
HTG24 905 190329607
HTG25 811 191303618
HTG26 783 191820779
HTG27 875 190997903
HTG28 894 190457192
HTG29 939 189851508
HTG3 2515 185243126
HTG30 909 190928838
HTG31 854 174101360
HTG32 874 191066131
HTG33 966 189474468
HTG34 884 190904567
HTG35 865 191312650
HTG36 827 191776650
HTG37 947 189899154
HTG38 947 190393059
HTG39 941 190296155
HTG4 2554 188374841
HTG40 1046 189090463
HTG41 1110 171512170
HTG42 1259 187973591
HTG43 1170 187963033
HTG44 1150 188040452
HTG45 1118 191360634
HTG46 1268 190655764
HTG47 1173 190417266
HTG48 1130 191136241
HTG49 1047 191184853
HTG5 1283 185543864
HTG50 1021 189706498
HTG51 1061 180599364
HTG52 970 190185455
HTG53 1106 190075891
HTG54 1045 190185372
HTG55 1014 189796669
HTG56 968 189203108
HTG57 91 15924971
HTG58 1010 189317741
HTG59 1030 189833096
HTG6 1273 185016781
HTG60 1068 187461057
HTG61 1129 188223655
HTG62 1007 173722493
HTG63 1086 189656608
HTG64 1065 189383724
HTG65 1169 188862683
HTG66 1177 187550196
HTG67 1284 184407283
HTG68 97 12761644
HTG69 1221 185314622
HTG7 1275 185326125
HTG70 1239 184674446
HTG71 1244 184625496
HTG72 1182 187515281
HTG73 1035 172736796
HTG74 1118 188209046
HTG75 1101 190799857
HTG76 1138 190891634
HTG77 1183 190789061
HTG78 1114 189327131
HTG79 1171 190202159
HTG8 1459 184483094
HTG80 1115 190051467
HTG81 1214 189986795
HTG82 1120 189732461
HTG83 968 166272508
HTG84 1228 188397022
HTG85 1255 187734415
HTG86 1142 190059532
HTG87 1149 189665619
HTG88 1013 173976713
HTG89 1180 189887411
HTG9 1200 186953640
HTG90 1111 190192844
HTG91 1145 190180515
HTG92 1100 190382070
HTG93 1048 173155450
HTG94 1056 190694692
HTG95 1156 191053520
HTG96 1042 189124502
HTG97 1069 189520674
HTG98 711 132601479
HTG99 1018 189690522
INV1 93989 48680980
INV10 84010 65519297
INV11 49954 39252276
INV12 85725 64914779
INV13 81342 66581538
INV14 78905 68311266
INV15 40078 110603167
INV16 194 155317154
INV17 14695 138643785
INV18 60867 82353597
INV19 33105 100531624
INV2 2565 183534304
INV3 943 156364409
INV4 27836 118546700
INV5 79419 70948815
INV6 33011 115130966
INV7 47101 98952619
INV8 80674 71868273
INV9 58199 88289429
MAM1 13623 164037144
MAM2 32259 138114822
MAM3 46733 104312667
MAM4 50184 124318168
MAM5 73658 85938910
MAM6 8325 8193543
PAT1 222542 70116699
PAT10 124682 102523661
PAT100 112152 27225046
PAT11 98153 63996194
PAT12 142114 62836086
PAT13 105848 59928277
PAT14 103676 50109932
PAT15 121036 53290601
PAT16 113148 61261981
PAT17 39214 16261019
PAT18 146866 52619866
PAT19 153883 78031245
PAT2 194544 84644913
PAT20 105042 118278487
PAT21 133347 95768472
PAT22 84481 78926117
PAT23 123527 103393927
PAT24 119451 105667859
PAT25 145592 86690945
PAT26 174983 64276296
PAT27 71447 1786175
PAT28 102177 77405969
PAT29 93964 87637529
PAT3 171983 95894231
PAT30 119942 61669438
PAT31 96656 78962764
PAT32 128375 55040494
PAT33 92196 51106640
PAT34 111301 78150375
PAT35 137080 29095092
PAT36 158181 23471538
PAT37 114369 49613938
PAT38 46523 54621373
PAT39 95733 83195520
PAT4 153734 106065135
PAT40 100231 70980964
PAT41 135176 39277514
PAT42 144261 34182437
PAT43 123895 64386570
PAT44 104278 81425449
PAT45 92446 75340640
PAT46 113547 66184235
PAT47 66387 55532785
PAT48 135193 108001084
PAT49 167081 97032398
PAT5 57197 23951583
PAT50 116397 127555645
PAT51 196343 76722711
PAT52 80302 127991032
PAT53 27631 180872621
PAT54 185408 93066629
PAT55 274454 6861350
PAT56 129252 31622558
PAT57 161341 77814723
PAT58 92879 89471453
PAT59 106732 74664021
PAT6 170642 91909261
PAT60 122357 64030175
PAT61 67061 30224535
PAT62 70750 109767817
PAT63 87947 82758966
PAT64 92971 78937516
PAT65 93166 72441718
PAT66 93384 75085880
PAT67 112153 48285117
PAT68 342935 8573375
PAT69 188806 88519660
PAT7 154826 88273196
PAT70 111348 132068608
PAT71 3850 194703659
PAT72 131292 110979300
PAT73 158599 54826034
PAT74 224735 34113732
PAT75 250080 15844368
PAT76 180675 63676831
PAT77 51648 26046194
PAT78 114199 110462365
PAT79 137682 83273933
PAT8 131192 97055113
PAT80 164183 99340286
PAT81 158860 103334939
PAT82 137435 114993907
PAT83 42311 27994848
PAT84 193712 81686401
PAT85 150217 108404022
PAT86 356051 11379688
PAT87 257211 57607099
PAT88 40710 13964759
PAT89 202631 47409644
PAT9 129613 101122792
PAT90 275102 9861629
PAT91 220269 46461635
PAT92 110616 2978567
PAT93 268989 21637646
PAT94 187202 59837899
PAT95 108389 107019705
PAT96 117383 73116638
PAT97 77689 96311354
PAT98 118950 85202933
PAT99 163525 74527057
PHG1 5363 40452514
PLN1 59804 93364096
PLN10 37336 72681714
PLN11 14007 137402093
PLN12 7656 36828642
PLN13 17557 144703865
PLN14 17615 146179384
PLN15 17549 146270813
PLN16 24619 128984358
PLN17 5392 151426908
PLN18 1226 169562290
PLN19 16395 151239728
PLN2 16356 144412385
PLN20 4814 5823069
PLN21 67503 69912529
PLN22 28356 30620223
PLN23 77098 76452098
PLN24 65526 81580660
PLN25 39104 117764751
PLN26 18507 135742720
PLN27 19852 132636876
PLN28 77844 74956962
PLN29 97682 59824678
PLN3 1419 182604570
PLN30 34835 32387860
PLN31 81207 71185693
PLN32 78038 76122110
PLN33 79908 71302903
PLN34 81190 77289697
PLN35 115891 46478178
PLN36 79280 72807277
PLN37 55813 90080260
PLN38 77478 78062069
PLN39 40340 70428280
PLN4 2007 188599123
PLN5 1670 194708406
PLN6 35288 123006440
PLN7 63755 63638075
PLN8 73388 76167382
PLN9 37150 49271920
PRI1 23014 59643101
PRI10 1283 179502291
PRI11 732 88970773
PRI12 1278 179183454
PRI13 1451 177770436
PRI14 1589 180081042
PRI15 1591 181937113
PRI16 1285 191746891
PRI17 1137 193610661
PRI18 1099 194297097
PRI19 1166 193629064
PRI2 17014 151274413
PRI20 1799 191861558
PRI21 15630 166427374
PRI22 6669 19206113
PRI23 32480 85310138
PRI24 61488 78329475
PRI25 30555 68504911
PRI26 6600 167095553
PRI27 2122 180829620
PRI28 1603 181698932
PRI29 2030 181862205
PRI3 1435 175593492
PRI30 1976 180732170
PRI31 13056 154652338
PRI32 1294 183076240
PRI33 43859 104099795
PRI34 21440 27298328
PRI35 32191 63614159
PRI36 20231 117529774
PRI37 18724 146977579
PRI38 67519 86984177
PRI39 50078 92096937
PRI4 1287 185811923
PRI40 68637 83792683
PRI41 13043 43967943
PRI5 1316 183668713
PRI6 1182 180380835
PRI7 1252 181244002
PRI8 1216 178011927
PRI9 1358 174572560
ROD1 30749 143039664
ROD10 982 181164585
ROD11 181 34081812
ROD12 1035 185585439
ROD13 940 182764372
ROD14 1040 189291758
ROD15 950 180424726
ROD16 967 181998130
ROD17 991 185852021
ROD18 1182 190536728
ROD19 16083 29970562
ROD2 912 175167648
ROD20 20161 149008857
ROD21 1137 182391172
ROD22 1061 165449321
ROD23 13731 162744731
ROD24 38681 69963859
ROD25 21915 104187136
ROD26 1515 187906430
ROD27 138199 36651240
ROD28 72023 68780277
ROD3 908 173314599
ROD4 901 173772469
ROD5 920 174094119
ROD6 975 178488779
ROD7 964 179278742
ROD8 981 181335837
ROD9 995 182023198
STS1 85094 36490082
STS10 57907 44419978
STS11 49213 37736560
STS12 57924 43637361
STS13 64506 42956287
STS14 93353 34059192
STS15 103301 26255390
STS16 11029 2984455
STS17 102990 27289864
STS18 87041 34345221
STS19 98875 33238671
STS2 84891 49890353
STS20 46296 15807224
STS3 67658 26336196
STS4 75516 36979982
STS5 7621 4471902
STS6 54177 31603126
STS7 54081 31788805
STS8 54235 31906296
STS9 55657 37682948
SYN1 45241 72328883
SYN2 46668 66953902
TSA1 124870 38005979
TSA2 119107 44240810
TSA3 116344 43365065
TSA4 28296 13180123
UNA1 222 120289
VRL1 71396 66599928
VRL10 58893 72707571
VRL11 58310 72491712
VRL12 60075 69864807
VRL13 65755 73213117
VRL14 3223 1630785
VRL2 72763 63972935
VRL3 71098 63786491
VRL4 69687 68947371
VRL5 31384 36343148
VRL6 48999 77926064
VRL7 63421 71626197
VRL8 68852 66783459
VRL9 61944 73088092
VRT1 30599 142049323
VRT10 1258 189459052
VRT11 8280 178040584
VRT12 3944 129551696
VRT13 13242 170744397
VRT14 5213 183187100
VRT15 3964 186226450
VRT16 41675 128352330
VRT17 79283 69726466
VRT18 78290 66474467
VRT19 78654 67013731
VRT2 20089 164703287
VRT20 68470 89206349
VRT21 1991 1502436
VRT3 53637 105512909
VRT4 28713 142313930
VRT5 10230 8775500
VRT6 73023 66849393
VRT7 31735 63526450
VRT8 30250 116886248
VRT9 1207 190097961
2.2.7 Selected Per-Organism Statistics
The following table provides the number of entries and bases of DNA/RNA for
the twenty most sequenced organisms in Release 176.0, excluding chloroplast and
mitochondrial sequences, metagenomic sequences, Whole Genome Shotgun sequences,
and 'constructed' CON-division sequences:
Entries Bases Species
14085418 14602149313 Homo sapiens
7862188 8847002990 Mus musculus
1999711 6281285883 Rattus norvegicus
2179593 5352760300 Bos taurus
3889893 5033561304 Zea mays
3217154 4776701267 Sus scrofa
1696465 3054657865 Danio rerio
228209 1352880670 Strongylocentrotus purpuratus
1241595 1195506571 Oryza sativa Japonica Group
1753640 1185627167 Nicotiana tabacum
1423616 1146930228 Xenopus (Silurana) tropicalis
1205099 1042841876 Drosophila melanogaster
213529 999379905 Pan troglodytes
2284063 991848716 Arabidopsis thaliana
1435080 932160671 Canis lupus familiaris
655949 911073261 Vitis vinifera
808152 888087954 Gallus gallus
1840572 864854845 Glycine max
79205 813082013 Macaca mulatta
1216189 748184685 Ciona intestinalis
2.2.8 Growth of GenBank
The following table lists the number of bases and the number of sequence
records in each release of GenBank, beginning with Release 3 in 1982.
CON-division records are not represented in these statistics: because they
are constructed from the non-CON records in the database, their inclusion
here would be a form of double-counting.
From 1982 to the present, the number of bases in GenBank has doubled
approximately every 18 months.
Release Date Base Pairs Entries
3 Dec 1982 680338 606
14 Nov 1983 2274029 2427
20 May 1984 3002088 3665
24 Sep 1984 3323270 4135
25 Oct 1984 3368765 4175
26 Nov 1984 3689752 4393
32 May 1985 4211931 4954
36 Sep 1985 5204420 5700
40 Feb 1986 5925429 6642
42 May 1986 6765476 7416
44 Aug 1986 8442357 8823
46 Nov 1986 9615371 9978
48 Feb 1987 10961380 10913
50 May 1987 13048473 12534
52 Aug 1987 14855145 14020
53 Sep 1987 15514776 14584
54 Dec 1987 16752872 15465
55 Mar 1988 19156002 17047
56 Jun 1988 20795279 18226
57 Sep 1988 22019698 19044
57.1 Oct 1988 23800000 20579
58 Dec 1988 24690876 21248
59 Mar 1989 26382491 22479
60 Jun 1989 31808784 26317
61 Sep 1989 34762585 28791
62 Dec 1989 37183950 31229
63 Mar 1990 40127752 33377
64 Jun 1990 42495893 35100
65 Sep 1990 49179285 39533
66 Dec 1990 51306092 41057
67 Mar 1991 55169276 43903
68 Jun 1991 65868799 51418
69 Sep 1991 71947426 55627
70 Dec 1991 77337678 58952
71 Mar 1992 83894652 65100
72 Jun 1992 92160761 71280
73 Sep 1992 101008486 78608
74 Dec 1992 120242234 97084
75 Feb 1993 126212259 106684
76 Apr 1993 129968355 111911
77 Jun 1993 138904393 120134
78 Aug 1993 147215633 131328
79 Oct 1993 157152442 143492
80 Dec 1993 163802597 150744
81 Feb 1994 173261500 162946
82 Apr 1994 180589455 169896
83 Jun 1994 191393939 182753
84 Aug 1994 201815802 196703
85 Oct 1994 217102462 215273
86 Dec 1994 230485928 237775
87 Feb 1995 248499214 269478
88 Apr 1995 286094556 352414
89 Jun 1995 318624568 425211
90 Aug 1995 353713490 492483
91 Oct 1995 384939485 555694
92 Dec 1995 425860958 620765
93 Feb 1996 463758833 685693
94 Apr 1996 499127741 744295
95 Jun 1996 551750920 835487
96 Aug 1996 602072354 920588
97 Oct 1996 651972984 1021211
98 Dec 1996 730552938 1114581
99 Feb 1997 786898138 1192505
100 Apr 1997 842864309 1274747
101 Jun 1997 966993087 1491069
102 Aug 1997 1053474516 1610848
103 Oct 1997 1160300687 1765847
104 Dec 1997 1258290513 1891953
105 Feb 1998 1372368913 2042325
106 Apr 1998 1502542306 2209232
107 Jun 1998 1622041465 2355928
108 Aug 1998 1797137713 2532359
109 Oct 1998 2008761784 2837897
110 Dec 1998 2162067871 3043729
111 Apr 1999 2569578208 3525418
112 Jun 1999 2974791993 4028171
113 Aug 1999 3400237391 4610118
114 Oct 1999 3841163011 4864570
115 Dec 1999 4653932745 5354511
116 Feb 2000 5805414935 5691170
117 Apr 2000 7376080723 6215002
118 Jun 2000 8604221980 7077491
119 Aug 2000 9545724824 8214339
120 Oct 2000 10335692655 9102634
121 Dec 2000 11101066288 10106023
122 Feb 2001 11720120326 10896781
123 Apr 2001 12418544023 11545572
124 Jun 2001 12973707065 12243766
125 Aug 2001 13543364296 12813516
126 Oct 2001 14396883064 13602262
127 Dec 2001 15849921438 14976310
128 Feb 2002 17089143893 15465325
129 Apr 2002 19072679701 16769983
130 Jun 2002 20648748345 17471130
131 Aug 2002 22616937182 18197119
132 Oct 2002 26525934656 19808101
133 Dec 2002 28507990166 22318883
134 Feb 2003 29358082791 23035823
135 Apr 2003 31099264455 24027936
136 Jun 2003 32528249295 25592865
137 Aug 2003 33865022251 27213748
138 Oct 2003 35599621471 29819397
139 Dec 2003 36553368485 30968418
140 Feb 2004 37893844733 32549400
141 Apr 2004 38989342565 33676218
142 Jun 2004 40325321348 35532003
143 Aug 2004 41808045653 37343937
144 Oct 2004 43194602655 38941263
145 Dec 2004 44575745176 40604319
146 Feb 2005 46849831226 42734478
147 Apr 2005 48235738567 44202133
148 Jun 2005 49398852122 45236251
149 Aug 2005 51674486881 46947388
150 Oct 2005 53655236500 49152445
151 Dec 2005 56037734462 52016762
152 Feb 2006 59750386305 54584635
153 Apr 2006 61582143971 56620500
154 Jun 2006 63412609711 58890345
155 Aug 2006 65369091950 61132599
156 Oct 2006 66925938907 62765195
157 Dec 2006 69019290705 64893747
158 Feb 2007 71292211453 67218344
159 Apr 2007 75742041056 71802595
160 Jun 2007 77248690945 73078143
161 Aug 2007 79525559650 76146236
162 Oct 2007 81563399765 77632813
163 Dec 2007 83874179730 80388382
164 Feb 2008 85759586764 82853685
165 Apr 2008 89172350468 85500730
166 Jun 2008 92008611867 88554578
167 Aug 2008 95033791652 92748599
168 Oct 2008 97381682336 96400790
169 Dec 2008 99116431942 98868465
170 Feb 2009 101467270308 101815678
171 Apr 2009 102980268709 103335421
172 Jun 2009 105277306080 106073709
173 Aug 2009 106533156756 108431692
174 Oct 2009 108560236506 110946879
175 Dec 2009 110118557163 112910950
176 Feb 2010 112326229652 116461672
The following table lists the number of bases and the number of sequence
records for WGS sequences processed at GenBank, beginning with Release 129.0
in April of 2002. Please note that WGS data are not distributed in conjunction
with GenBank releases. Rather, per-project data files are continuously
available in the WGS areas of the NCBI FTP site:
ftp://ftp.ncbi.nih.gov/ncbi-asn1/wgs
ftp://ftp.ncbi.nih.gov/genbank/wgs
Release Date Base Pairs Entries
129 Apr 2002 692266338 172768
130 Jun 2002 3267608441 397502
131 Aug 2002 3848375582 427771
132 Oct 2002 3892435593 434224
133 Dec 2002 6702372564 597042
134 Feb 2003 6705740844 597345
135 Apr 2003 6897080355 596818
136 Jun 2003 6992663962 607155
137 Aug 2003 7144761762 593801
138 Oct 2003 8662242833 1683437
139 Dec 2003 14523454868 2547094
140 Feb 2004 22804145885 3188754
141 Apr 2004 24758556215 4112532
142 Jun 2004 25592758366 4353890
143 Aug 2004 28128611847 4427773
144 Oct 2004 30871590379 5285276
145 Dec 2004 35009256228 5410415
146 Feb 2005 38076300084 6111782
147 Apr 2005 39523208346 6685746
148 Jun 2005 46767232565 8711924
149 Aug 2005 53346605784 10276161
150 Oct 2005 56162807647 11169448
151 Dec 2005 59638900034 12088491
152 Feb 2006 63183065091 12465546
153 Apr 2006 67488612571 13573144
154 Jun 2006 78858635822 17733973
155 Aug 2006 80369977826 17960667
156 Oct 2006 81127502509 18500772
157 Dec 2006 81611376856 18540918
158 Feb 2007 86043478524 19421576
159 Apr 2007 93022691867 23446831
160 Jun 2007 97102606459 23718400
161 Aug 2007 101964323738 25384475
162 Oct 2007 102003045298 25354041
163 Dec 2007 106505691578 26177471
164 Feb 2008 108635736141 27439206
165 Apr 2008 110500961400 26931049
166 Jun 2008 113639291344 39163548
167 Aug 2008 118593509342 40214247
168 Oct 2008 136085973423 46108952
169 Dec 2008 141374971004 48394838
170 Feb 2009 143797800446 49036947
171 Apr 2009 144522542010 48948309
172 Jun 2009 145959997864 49063546
173 Aug 2009 148165117763 48443067
174 Oct 2009 149348923035 48119301
175 Dec 2009 158317168385 54076973
176 Feb 2010 163991858015 57134273
3. FILE FORMATS
The flat file examples included in this section, while not always from the
current release, are usually fairly recent. Any differences compared to the
actual records are the result of updates to the entries involved.
3.1 File Header Information
With the exception of the index files and the lists of new, changed, and
deleted accession numbers, each of the files of a GenBank release begins
with the same header, except for the first line, which contains the file
name, and the sixth line, which contains the title of the file. The first
line of the file contains the file name in character positions 1 to 9 and
the full database name (Genetic Sequence Data Bank, aka 'GenBank') starting
in column 22. The brief names of the files in this release are listed in
Section 2.2.
The second line contains the date of the current release in the form
`day month year', beginning in position 27. The fourth line contains
the current GenBank release number. The release number appears in
positions 48 to 52 and consists of three numbers separated by a decimal
point. The number to the left of the decimal is the major release
number. The digit to the right of the decimal indicates the version of
the major release; it is zero for the first version. The sixth line
contains a title for the file. The eighth line lists the number of
entries (loci), number of bases (or base pairs), and number of reports
of sequences (equal to number of entries in this case). These numbers are
right-justified at fixed positions. The number of entries appears in
positions 1 to 8, the number of bases in positions 16 to 26, and the
number of reports in positions 40 to 47. The third, fifth, seventh, and
ninth lines are blank.
1 10 20 30 40 50 60 70 79
---------+---------+---------+---------+---------+---------+---------+---------
GBBCT1.SEQ Genetic Sequence Data Bank
February 15 2010
NCBI-GenBank Flat File Release 176.0
Bacterial Sequences (Part 1)
25389 loci, 102437874 bases, from 25389 reported sequences
---------+---------+---------+---------+---------+---------+---------+---------
1 10 20 30 40 50 60 70 79
Example 1. Sample File Header
3.2 Directory Files
3.2.1 Short Directory File
The short directory file contains brief descriptions of all of the
sequence entries contained in this release. These descriptions are in
fifteen groups, one group for each of the fifteen sequence entry
data files. The first record at the beginning of a group of entries
contains the name of the group in uppercase characters, beginning in
position 21. The organism groups are PRIMATE, RODENT, OTHER MAMMAL,
OTHER VERTEBRATE, INVERTEBRATE, PLANT, BACTERIAL, STRUCTURAL RNA, VIRAL,
PHAGE, SYNTHETIC, UNANNOTATED, EXPRESSED SEQUENCE TAG, PATENT, or
SEQUENCE TAGGED SITE. The second record is blank.
Each record in the short directory contains the sequence entry name
(LOCUS) in the first 12 positions, followed by a brief definition of
the sequence beginning in column 13. The definition is truncated (at
the end of a word) to leave room at the right margin for at least one
space, the sequence length, and the letters `bp'. The length of the
sequence is printed right-justified to column 77, followed by the
letters `bp' in columns 78 and 79. The next-to-last record for a group
has `ZZZZZZZZZZ' in its first ten positions (where the entry name
would normally appear). The last record is a blank line. An example of
the short directory file format, showing the descriptions of the last
entries in the Other Vertebrate sequence data file and the first
entries of the Invertebrate sequence data file, is reproduced below:
1 10 20 30 40 50 60 70 79
---------+---------+---------+---------+---------+---------+---------+---------
ZEFWNT1G3 B.rerio wnt-1 gene (exon 3) for wnt-1 protein. 266bp
ZEFWNT1G4 B.rerio wnt-1 gene (exon 4) for wnt-1 protein. 647bp
ZEFZF54 Zebrafish homeotic gene ZF-54. 246bp
ZEFZFEN Zebrafish engrailed-like homeobox sequence. 327bp
ZZZZZZZZZZ
INVERTEBRATE
AAHAV33A Acanthocheilonema viteae pepsin-inhibitor-like-protein 1048bp
ACAAC01 Acanthamoeba castelani gene encoding actin I. 1571bp
ACAACTPH Acanthamoeba castellanii actophorin mRNA, complete cds. 671bp
ACAMHCA A.castellanii non-muscle myosin heavy chain gene, partial 5894bp
---------+---------+---------+---------+---------+---------+---------+---------
1 10 20 30 40 50 60 70 79
Example 2. Short Directory File
3.3 Index Files
There are six files containing indices to the entries in this release:
Accession number index file (Accession and Version)
Secondary accession number index file
Keyword phrase index file
Author name index file
Journal citation index file
Gene name index file
The index keys (accession numbers, keywords, authors, journals, and
gene symbols.) of an index are sorted alphabetically. Following each
index key, the identifiers of the sequence entries containing that key
are listed (LOCUS name, division abbreviation, and primary accession
number). The division abbreviations are:
1. PRI - primate sequences
2. ROD - rodent sequences
3. MAM - other mammalian sequences
4. VRT - other vertebrate sequences
5. INV - invertebrate sequences
6. PLN - plant, fungal, and algal sequences
7. BCT - bacterial sequences
8. VRL - viral sequences
9. PHG - bacteriophage sequences
10. SYN - synthetic sequences
11. UNA - unannotated sequences
12. EST - EST sequences (expressed sequence tags)
13. PAT - patent sequences
14. STS - STS sequences (sequence tagged sites)
15. GSS - GSS sequences (genome survey sequences)
16. HTG - HTGS sequences (high throughput genomic sequences)
17. HTC - HTC sequences (high throughput cDNA sequences)
18. ENV - Environmental sampling sequences
19. CON - Constructed sequences
A line-oriented, TAB-delimited format is utilized for the gbaut.idx,
gbgen.idx, gbjou.idx, gbkey.idx, and gbsec.idx indexes. Each index
key is presented on its own line, and is followed by a
LOCUS/Division/Accession triplet for every record containing the key:
Indexed-Term
LOCUS-name1 Div-code1 Accession1
LOCUS-name2 Div-code2 Accession2
LOCUS-name3 Div-code3 Accession3
....
Here is an example of the format, in which TAB characters are displayed
as ^I, and carriage-returns/newlines as $ :
(H+,K+)-ATPASE BETA-SUBUNIT$
^IRATHKATPB^IROD^IM55655$
^IMUSATP4B1^IROD^IM64685$
^IMUSATP4B2^IROD^IM64686$
^IMUSATP4B3^IROD^IM64687$
^IMUSATP4B4^IROD^IM64688$
^IDOGATPASEB^IMAM^IM76486$
When viewed by a file browser such as 'less' or 'more' :
(H+,K+)-ATPASE BETA-SUBUNIT
RATHKATPB ROD M55655
MUSATP4B1 ROD M64685
MUSATP4B2 ROD M64686
MUSATP4B3 ROD M64687
MUSATP4B4 ROD M64688
DOGATPASEB MAM M76486
Note that the index keys can be distinguished from LOCUS/DIV/ACCESSION
by the fact that they do not start with a TAB character. So one can
extract just the terms via simple text-processing:
perl -ne 'print unless /^\s+/' < gbkey.idx > terms.gbkey
The format of the primary accession number index file is slightly
different, with each indexed key (Accession.Version) present on
the same line as the LOCUS/Division/Accession triplet:
Accession1.Version1 Locus-name1 Div-code1 Accession1
Accession2.Version2 Locus-name2 Div-code2 Accession2
....
Here is an example of the format, in which TAB characters are displayed
as ^I, and carriage-returns/newlines as $ :
AC000102.1^IAC000102^IPRI^IAC000102$
AC000103.1^IAC000103^IPLN^IAC000103$
AC000104.1^IF19P19^IPLN^IAC000104$
AC000105.40^IAC000105^IPRI^IAC000105$
AC000106.1^IF7G19^IPLN^IAC000106$
AC000107.1^IAC000107^IPLN^IAC000107$
AC000108.1^IAC000108^IBCT^IAC000108$
AC000109.1^IHSAC000109^IPRI^IAC000109$
AC000110.1^IHSAC000110^IPRI^IAC000110$
When viewed by a file browser such as 'less' or 'more' :
AC000102.1 AC000102 PRI AC000102
AC000103.1 AC000103 PLN AC000103
AC000104.1 F19P19 PLN AC000104
AC000105.40 AC000105 PRI AC000105
AC000106.1 F7G19 PLN AC000106
AC000107.1 AC000107 PLN AC000107
AC000108.1 AC000108 BCT AC000108
AC000109.1 HSAC000109 PRI AC000109
AC000110.1 HSAC000110 PRI AC000110
3.3.1 Accession Number Index File - gbacc.idx
Accession numbers are unique six character or eight-character alphanumeric
identifiers of GenBank database entries. The six-character accession
number format consists of a single uppercase letter, followed by 5 digits.
The eight-character accession number format consists of two uppercase
letters, followed by 6 digits. Accessions provide an unchanging identifier
for the data with which they are associated, and we encourage you to cite
accession numbers whenever you refer to data from GenBank.
GenBank entries can have both 'primary' and 'secondary' accessions
associated with them (see Section 3.5.6). Only primary accessions are present
in the gbacc.idx index.
3.3.2 Keyword Phrase Index File - gbkey.idx
Keyword phrases consist of names for gene products and other
characteristics of sequence entries.
3.3.3 Author Name Index File - gbaut*.idx
The author name index files list all of the author names that appear
in the references within sequence records.
3.3.4 Journal Citation Index File - gbjou.idx
The journal citation index file lists all of the citations that appear
in the references within sequence records.. All citations are truncated
to 80 characters.
3.3.5 Gene Name Index - gbgen.idx
The /gene qualifiers of many GenBank entries contain values other than
official gene symbols, such as the product or the standard name of the gene.
Hence, NCBI has chosen to build an index (gbgen.idx) more like a keyword index
for this field, using both the GenBank /gene qualifier and the 'Gene.locus'
fields from the NCBI internal database as keys.
3.4 Sequence Entry Files
GenBank releases contain one or more sequence entry data files, one
for each "division" of GenBank.
3.4.1 File Organization
Each of these files has the same format and consists of two parts:
header information (described in section 3.1) and sequence entries for
that division (described in the following sections).
3.4.2 Entry Organization
In the second portion of a sequence entry file (containing the
sequence entries for that division), each record (line) consists of
two parts. The first part is found in positions 1 to 10 and may
contain:
1. A keyword, beginning in column 1 of the record (e.g., REFERENCE is
a keyword).
2. A subkeyword beginning in column 3, with columns 1 and 2 blank
(e.g., AUTHORS is a subkeyword of REFERENCE). Or a subkeyword beginning
in column 4, with columns 1, 2, and 3 blank (e.g., PUBMED is a
subkeyword of REFERENCE).
3. Blank characters, indicating that this record is a continuation of
the information under the keyword or subkeyword above it.
4. A code, beginning in column 6, indicating the nature of an entry
(feature key) in the FEATURES table; these codes are described in
Section 3.4.12.1 below.
5. A number, ending in column 9 of the record. This number occurs in
the portion of the entry describing the actual nucleotide sequence and
designates the numbering of sequence positions.
6. Two slashes (//) in positions 1 and 2, marking the end of an entry.
The second part of each sequence entry record contains the information
appropriate to its keyword, in positions 13 to 80 for keywords and
positions 11 to 80 for the sequence.
The following is a brief description of each entry field. Detailed
information about each field may be found in Sections 3.4.4 to 3.4.15.
LOCUS - A short mnemonic name for the entry, chosen to suggest the
sequence's definition. Mandatory keyword/exactly one record.
DEFINITION - A concise description of the sequence. Mandatory
keyword/one or more records.
ACCESSION - The primary accession number is a unique, unchanging
identifier assigned to each GenBank sequence record. (Please use this
identifier when citing information from GenBank.) Mandatory keyword/one
or more records.
VERSION - A compound identifier consisting of the primary
accession number and a numeric version number associated with the
current version of the sequence data in the record. This is followed
by an integer key (a "GI") assigned to the sequence by NCBI.
Mandatory keyword/exactly one record.
NID - An alternative method of presenting the NCBI GI
identifier (described above).
NOTE: The NID linetype is obsolete and was removed from the
GenBank flatfile format in December 1999.
PROJECT - The identifier of a project (such as a Genome
Sequencing Project) to which a GenBank sequence record belongs.
Optional keyword/one or more records.
NOTE: The PROJECT linetype is obsolete and was removed from the
GenBank flatfile format after Release 171.0 in April 2009.
DBLINK - Provides cross-references to resources that
support the existence a sequence record, such as the Project Database
and the NCBI Trace Assembly Archive. Optional keyword/one or
more records.
KEYWORDS - Short phrases describing gene products and other
information about an entry. Mandatory keyword in all annotated
entries/one or more records.
SEGMENT - Information on the order in which this entry appears in a
series of discontinuous sequences from the same molecule. Optional
keyword (only in segmented entries)/exactly one record.
SOURCE - Common name of the organism or the name most frequently used
in the literature. Mandatory keyword in all annotated entries/one or
more records/includes one subkeyword.
ORGANISM - Formal scientific name of the organism (first line)
and taxonomic classification levels (second and subsequent lines).
Mandatory subkeyword in all annotated entries/two or more records.
In the event that the organism name exceeds 68 characters (80 - 13 + 1)
in length, it will be line-wrapped and continue on a second line,
prior to the taxonomic classification. Unfortunately, very long
organism names were not anticipated when the fixed-length GenBank
flatfile format was defined in the 1980s. The possibility of linewraps
makes the job of flatfile parsers more difficult : essentially, one
cannot be sure that the second line is truly a classification/lineage
unless it consists of multiple tokens, delimited by semi-colons.
The long-term solution to this problem is to introduce an additional
subkeyword, probably 'LINEAGE' . This might occur sometime in 2009
or 2010.
REFERENCE - Citations for all articles containing data reported
in this entry. Includes seven subkeywords and may repeat. Mandatory
keyword/one or more records.
AUTHORS - Lists the authors of the citation. Optional
subkeyword/one or more records.
CONSRTM - Lists the collective names of consortiums associated
with the citation (eg, International Human Genome Sequencing Consortium),
rather than individual author names. Optional subkeyword/one or more records.
TITLE - Full title of citation. Optional subkeyword (present
in all but unpublished citations)/one or more records.
JOURNAL - Lists the journal name, volume, year, and page
numbers of the citation. Mandatory subkeyword/one or more records.
MEDLINE - Provides the Medline unique identifier for a
citation. Optional subkeyword/one record.
NOTE: The MEDLINE linetype is obsolete and was removed
from the GenBank flatfile format in April 2005.
PUBMED - Provides the PubMed unique identifier for a
citation. Optional subkeyword/one record.
REMARK - Specifies the relevance of a citation to an
entry. Optional subkeyword/one or more records.
COMMENT - Cross-references to other sequence entries, comparisons to
other collections, notes of changes in LOCUS names, and other remarks.
Optional keyword/one or more records/may include blank records.
FEATURES - Table containing information on portions of the
sequence that code for proteins and RNA molecules and information on
experimentally determined sites of biological significance. Optional
keyword/one or more records.
BASE COUNT - Summary of the number of occurrences of each basepair
code (a, c, t, g, and other) in the sequence. Optional keyword/exactly
one record.
NOTE: The BASE COUNT linetype is obsolete and was removed
from the GenBank flatfile format in October 2003.
CONTIG - This linetype provides information about how individual sequence
records can be combined to form larger-scale biological objects, such as
chromosomes or complete genomes. Rather than presenting actual sequence
data, a special join() statement on the CONTIG line provides the accession
numbers and basepair ranges of the underlying records which comprise the
object.
As of August 2005, the 2L chromosome arm of Drosophila melanogaster
(accession number AE014134) provided a good example of CONTIG use.
ORIGIN - Specification of how the first base of the reported sequence
is operationally located within the genome. Where possible, this
includes its location within a larger genetic map. Mandatory
keyword/exactly one record.
- The ORIGIN line is followed by sequence data (multiple records).
// - Entry termination symbol. Mandatory at the end of an
entry/exactly one record.
3.4.3 Sample Sequence Data File
An example of a complete sequence entry file follows. (This example
has only two entries.) Note that in this example, as throughout the
data bank, numbers in square brackets indicate items in the REFERENCE
list. For example, in ACARR58S, [1] refers to the paper by Mackay, et
al.
1 10 20 30 40 50 60 70 79
---------+---------+---------+---------+---------+---------+---------+---------
GBSMP.SEQ Genetic Sequence Data Bank
December 15 1992
GenBank Flat File Release 74.0
Structural RNA Sequences
2 loci, 236 bases, from 2 reported sequences
LOCUS AAURRA 118 bp ss-rRNA RNA 16-JUN-1986
DEFINITION A.auricula-judae (mushroom) 5S ribosomal RNA.
ACCESSION K03160
VERSION K03160.1 GI:173593
KEYWORDS 5S ribosomal RNA; ribosomal RNA.
SOURCE A.auricula-judae (mushroom) ribosomal RNA.
ORGANISM Auricularia auricula-judae
Eukaryota; Fungi; Eumycota; Basidiomycotina; Phragmobasidiomycetes;
Heterobasidiomycetidae; Auriculariales; Auriculariaceae.
REFERENCE 1 (bases 1 to 118)
AUTHORS Huysmans,E., Dams,E., Vandenberghe,A. and De Wachter,R.
TITLE The nucleotide sequences of the 5S rRNAs of four mushrooms and
their use in studying the phylogenetic position of basidiomycetes
among the eukaryotes
JOURNAL Nucleic Acids Res. 11, 2871-2880 (1983)
FEATURES Location/Qualifiers
rRNA 1..118
/note="5S ribosomal RNA"
BASE COUNT 27 a 34 c 34 g 23 t
ORIGIN 5' end of mature rRNA.
1 atccacggcc ataggactct gaaagcactg catcccgtcc gatctgcaaa gttaaccaga
61 gtaccgccca gttagtacca cggtggggga ccacgcggga atcctgggtg ctgtggtt
//
LOCUS ABCRRAA 118 bp ss-rRNA RNA 15-SEP-1990
DEFINITION Acetobacter sp. (strain MB 58) 5S ribosomal RNA, complete sequence.
ACCESSION M34766
VERSION M34766.1 GI:173603
KEYWORDS 5S ribosomal RNA.
SOURCE Acetobacter sp. (strain MB 58) rRNA.
ORGANISM Acetobacter sp.
Prokaryotae; Gracilicutes; Scotobacteria; Aerobic rods and cocci;
Azotobacteraceae.
REFERENCE 1 (bases 1 to 118)
AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
sequencing
JOURNAL J. Gen. Microbiol. 136, 441-446 (1990)
FEATURES Location/Qualifiers
rRNA 1..118
/note="5S ribosomal RNA"
BASE COUNT 27 a 40 c 32 g 17 t 2 others
ORIGIN
1 gatctggtgg ccatggcggg agcaaatcag ccgatcccat cccgaactcg gccgtcaaat
61 gccccagcgc ccatgatact ctgcctcaag gcacggaaaa gtcggtcgcc gccagayy
//
---------+---------+---------+---------+---------+---------+---------+---------
1 10 20 30 40 50 60 70 79
Example 9. Sample Sequence Data File
3.4.4 LOCUS Format
The items of information contained in the LOCUS record are always
found in fixed positions. The locus name (or entry name), which is
always sixteen characters or less, begins in position 13. The locus name
is designed to help group entries with similar sequences: the first
three characters usually designate the organism; the fourth and fifth
characters can be used to show other group designations, such as gene
product; for segmented entries the last character is one of a series
of sequential integers.
The number of bases or base pairs in the sequence ends in position 40.
The letters `bp' are in positions 42 to 43. Positions 45 to 47 provide
the number of strands of the sequence. Positions 48 to 53 indicate the
type of molecule sequenced. Topology of the molecule is indicated in
positions 56 to 63.
GenBank sequence entries are divided among many different
'divisions'. Each entry's division is specified by a three-letter code
in positions 65 to 67. See Section 3.3 for an explanation of division
codes.
Positions 69 to 79 of the record contain the date the entry was
entered or underwent any substantial revisions, such as the addition
of newly published data, in the form dd-MMM-yyyy.
The detailed format for the LOCUS line format is as follows:
Positions Contents
--------- --------
01-05 'LOCUS'
06-12 spaces
13-28 Locus name
29-29 space
30-40 Length of sequence, right-justified
41-41 space
42-43 bp
44-44 space
45-47 spaces, ss- (single-stranded), ds- (double-stranded), or
ms- (mixed-stranded)
48-53 NA, DNA, RNA, tRNA (transfer RNA), rRNA (ribosomal RNA),
mRNA (messenger RNA), uRNA (small nuclear RNA), snRNA,
snoRNA. Left justified.
54-55 space
56-63 'linear' followed by two spaces, or 'circular'
64-64 space
65-67 The division code (see Section 3.3)
68-68 space
69-79 Date, in the form dd-MMM-yyyy (e.g., 15-MAR-1991)
Although each of these data values can be found at column-specific
positions, we encourage those who parse the contents of the LOCUS
line to use a token-based approach. This will prevent the need for
software changes if the spacing of the data values ever has to be
modified.
3.4.5 DEFINITION Format
The DEFINITION record gives a brief description of the sequence,
proceeding from general to specific. It starts with the common name of
the source organism, then gives the criteria by which this sequence is
distinguished from the remainder of the source genome, such as the
gene name and what it codes for, or the protein name and mRNA, or some
description of the sequence's function (if the sequence is
non-coding). If the sequence has a coding region, the description may
be followed by a completeness qualifier, such as cds (complete coding
sequence). There is no limit on the number of lines that may be part
of the DEFINITION. The last line must end with a period.
3.4.5.1 DEFINITION Format for NLM Entries
The DEFINITION line for entries derived from journal-scanning at the NLM is
an automatically generated descriptive summary that accompanies each DNA and
protein sequence. It contains information derived from fields in a database
that summarize the most important attributes of the sequence. The DEFINITION
lines are designed to supplement the accession number and the sequence itself
as a means of uniquely and completely specifying DNA and protein sequences. The
following are examples of NLM DEFINITION lines:
NADP-specific isocitrate dehydrogenase [swine, mRNA, 1 gene, 1585 nt]
94 kda fiber cell beaded-filament structural protein [rats, lens, mRNA
Partial, 1 gene, 1873 nt]
inhibin alpha {promoter and exons} [mice, Genomic, 1 gene, 1102 nt, segment
1 of 2]
cefEF, cefG=acetyl coenzyme A:deacetylcephalosporin C o-acetyltransferase
[Acremonium chrysogenum, Genomic, 2 genes, 2639 nt]
myogenic factor 3, qmf3=helix-loop-helix protein [Japanese quails,
embryo, Peptide Partial, 246 aa]
The first part of the definition line contains information describing
the genes and proteins represented by the molecular sequences. This can
be gene locus names, protein names and descriptions that replace or augment
actual names. Gene and gene product are linked by "=". Any special
identifying terms are presented within brackets, such as: {promoter},
{N-terminal}, {EC 2.13.2.4}, {alternatively spliced}, or {3' region}.
The second part of the definition line is delimited by square brackets, '[]',
and provides details about the molecule type and length. The biological
source, i.e., genus and species or common name as cited by the author.
Developmental stage, tissue type and strain are included if available.
The molecule types include: Genomic, mRNA, Peptide. and Other Genomic
Material. Genomic molecules are assumed to be partial sequence unless
"Complete" is specified, whereas mRNA and peptide molecules are assumed
to be complete unless "Partial" is noted.
3.4.6 ACCESSION Format
This field contains a series of six-character and/or eight-character
identifiers called 'accession numbers'. The six-character accession
number format consists of a single uppercase letter, followed by 5 digits.
The eight-character accession number format consists of two uppercase
letters, followed by 6 digits. The 'primary', or first, of the accession
numbers occupies positions 13 to 18 (6-character format) or positions
13 to 20 (8-character format). Subsequent 'secondary' accession numbers
(if present) are separated from the primary, and from each other, by a
single space. In some cases, multiple lines of secondary accession
numbers might be present, starting at position 13.
The primary accession number of a GenBank entry provides a stable identifier
for the biological object that the entry represents. Accessions do not change
when the underlying sequence data or associated features change.
Secondary accession numbers arise for a number of reasons. For example, a
single accession number may initially be assigned to a sequence described in
a publication. If it is later discovered that the sequence must be entered
into the database as multiple entries, each entry would receive a new primary
accession number, and the original accession number would appear as a secondary
accession number on each of the new entries. In the event that a large number
of continuous secondary accession numbers exist, a range can be employed:
SecAccession1-SecAccession2
In such cases, the alphabetic prefix letters of the initial and terminal
accession numbers within the range *MUST* be identical. For example:
AE000111-AE000510O
^^ ^^
Additionally, the value of the numeric portion of the initial secondary
within the range must be less than the value of the numeric portion of the
terminal secondary.
3.4.7.1 VERSION Format
This line contains two types of identifiers for a GenBank database entry:
a compound accession number and an NCBI GI identifier.
LOCUS AF181452 1294 bp DNA PLN 12-OCT-1999
DEFINITION Hordeum vulgare dehydrin (Dhn2) gene, complete cds.
ACCESSION AF181452
VERSION AF181452.1 GI:6017929
^^^^^^^^^^ ^^^^^^^^^^
Compound NCBI GI
Accession Identifier
Number
A compound accession number consists of two parts: a stable, unchanging
primary-accession number portion (see Section 3.4.6 for a description of
accession numbers), and a sequentially increasing numeric version number.
The accession and version numbers are separated by a period. The initial
version number assigned to a new sequence is one. Compound accessions are
often referred to as "Accession.Version" .
An accession number allows one to retrieve the same biological object in the
database, regardless of any changes that are made to the entry over time. But
those changes can include changes to the sequence data itself, which is of
fundamental importance to many database users. So a numeric version number is
associated with the sequence data in every database entry. If an entry (for
example, AF181452) undergoes two sequence changes, its compound accession
number on the VERSION line would start as AF181452.1 . After the first sequence
change this would become: AF181452.2 . And after the second change: AF181452.3 .
The NCBI GI identifier of the VERSION line also serves as a method for
identifying the sequence data that has existed for a database entry over
time. GI identifiers are numeric values of one or more digits. Since they
are integer keys, they are less human-friendly than the Accession.Version
system described above. Returning to our example for AF181452, it was
initially assigned GI 6017929. If the sequence changes, a new integer GI will
be assigned, perhaps 7345003 . And after the second sequence change, perhaps
the GI would become 10456892 .
Why are both these methods for identifying the version of the sequence
associated with a database entry in use? For two reasons:
- Some data sources processed by NCBI for incorporation into its Entrez
sequence retrieval system do not version their own sequences.
- GIs provide a uniform, integer identifier system for every sequence
NCBI has processed. Some products and systems derived from (or reliant
upon) NCBI products and services prefer to use these integer identifiers
because they can all be processed in the same manner.
GenBank Releases contain only the most recent versions of all sequences
in the database. However, older versions can be obtained via GI-based or
Accession.Version-based queries with NCBI's web-Entrez and network-Entrez
applications. A sequence revision history web page is also available:
http://0-www-ncbi-nlm-nih-gov.brum.beds.ac.uk/htbin-post/Entrez/girevhist
NOTE: All the version numbers for the compound Accession.Version identifier
system were initialized to a value of one in February 1999, when the
system was first introduced.
3.4.7.2 DBLINK Format
This line contains cross-references to other underlying resources that
support the existence of a GenBank sequence record. For example:
LOCUS AAWZ01000001 40763 bp DNA linear VRT 15-FEB-2007
DEFINITION Anolis carolinensis cont1.0, whole genome shotgun sequence.
ACCESSION AAWZ01000001 AAWZ01000000
VERSION AAWZ01000001.1 GI:125802685
DBLINK Project:18787
A DBLINK cross-reference consists of two data fields delimited by a colon.
The first field provides the cross-reference type ("Project"), while the
second contains the actual cross-reference identifier ("18787").
The second field can consist of multiple comma-separated identifiers,
if a sequence record has multiple DBLINK cross-references of a given type.
For example:
DBLINK Project:100,200,300
DBLINK cross-references of type 'Project' are identifiers within the
'Entrez:Project' database at the NCBI:
http://0-www-ncbi-nlm-nih-gov.brum.beds.ac.uk/entrez/query.fcgi?db=genomeprj
At the above URL, a search for 18787 would provide information about the
sequencing projects (underway or completed) for Anolis carolinensis, the
centers performing the sequencing and annotation, information about the
organism, etc. For a more detailed overview of Entrez's Project database:
http://0-www-ncbi-nlm-nih-gov.brum.beds.ac.uk/genomes/static/gprj_help.html#introduction
As of April 2009, the supported DBLINK cross-reference types are "Project"
and "Trace Assembly Archive" .
3.4.8 KEYWORDS Format
The KEYWORDS field does not appear in unannotated entries, but is
required in all annotated entries. Keywords are separated by
semicolons; a "keyword" may be a single word or a phrase consisting of
several words. Each line in the keywords field ends in a semicolon;
the last line ends with a period. If no keywords are included in the
entry, the KEYWORDS record contains only a period.
3.4.9 SEGMENT Format
The SEGMENT keyword is used when two (or more) entries of known
relative orientation are separated by a short (<10 kb) stretch of DNA.
It is limited to one line of the form `n of m', where `n' is the
segment number of the current entry and `m' is the total number of
segments.
3.4.10 SOURCE Format
The SOURCE field consists of two parts. The first part is found after
the SOURCE keyword and contains free-format information including an
abbreviated form of the organism name followed by a molecule type;
multiple lines are allowed, but the last line must end with a period.
The second part consists of information found after the ORGANISM
subkeyword. The formal scientific name for the source organism (genus
and species, where appropriate) is found on the same line as ORGANISM.
The records following the ORGANISM line list the taxonomic
classification levels, separated by semicolons and ending with a
period.
3.4.11 REFERENCE Format
The REFERENCE field consists of five parts: the keyword REFERENCE, and
the subkeywords AUTHORS, TITLE (optional), JOURNAL, MEDLINE (optional),
PUBMED (optional), and REMARK (optional).
The REFERENCE line contains the number of the particular reference and
(in parentheses) the range of bases in the sequence entry reported in
this citation. Additional prose notes may also be found within the
parentheses. The numbering of the references does not reflect
publication dates or priorities.
The AUTHORS line lists the authors in the order in which they appear
in the cited article. Last names are separated from initials by a
comma (no space); there is no comma before the final `and'. The list
of authors ends with a period. The TITLE line is an optional field,
although it appears in the majority of entries. It does not appear in
unpublished sequence data entries that have been deposited directly
into the GenBank data bank, the EMBL Nucleotide Sequence Data Library,
or the DNA Data Bank of Japan. The TITLE field does not end with a
period.
The JOURNAL line gives the appropriate literature citation for the
sequence in the entry. The word `Unpublished' will appear after the
JOURNAL subkeyword if the data did not appear in the scientific
literature, but was directly deposited into the data bank. For
published sequences the JOURNAL line gives the Thesis, Journal, or
Book citation, including the year of publication, the specific
citation, or In press.
For Book citations, the JOURNAL line is specially-formatted, and
includes:
editor name(s)
book title
page number(s)
publisher-name/publisher-location
year
For example:
LOCUS AY277550 1440 bp DNA linear BCT 17-JUN-2003
DEFINITION Stenotrophomonas maltophilia strain CSC13-6 16S ribosomal RNA gene,
partial sequence.
ACCESSION AY277550
....
REFERENCE 1 (bases 1 to 1440)
AUTHORS Gonzalez,J.M., Laiz,L. and Saiz-Jimenez,C.
TITLE Classifying bacterial isolates from hypogean environments:
Application of a novel fluorimetric method dor the estimation of
G+C mol% content in microorganisms by thermal denaturation
temperature
JOURNAL (in) Saiz-Jimenez,C. (Ed.);
MOLECULAR BIOLOGY AND CULTURAL HERITAGE: 47-54;
A.A. Balkema, The Netherlands (2003)
The presence of "(in)" signals the fact that the reference is for a book
rather than a journal article. A semi-colon signals the end of the editor
names. The next semi-colon signals the end of the page numbers, and the
colon that immediately *precedes* the page numbers signals the end of the
book title. The publisher name and location are a free-form text string.
Finally, the year appears at the very end of the JOURNAL line, enclosed in
parentheses.
The MEDLINE line provides the National Library of Medicine's Medline
unique identifier for a citation (if known). Medline UIs are 8 digit
numbers.
The PUBMED line provides the PubMed unique identifier for a citation
(if known). PUBMED ids are numeric, and are record identifiers for article
abstracts in the PubMed database :
http://0-www-ncbi-nlm-nih-gov.brum.beds.ac.uk/entrez/query.fcgi?db=PubMed
Citations in PubMed that do not fall within Medline's scope will have only
a PUBMED identifier. Similarly, citations that *are* in Medline's scope but
which have not yet been assigned Medline UIs will have only a PUBMED identifier.
If a citation is present in both the PubMed and Medline databases, both a
MEDLINE and a PUBMED line will be present.
The REMARK line is a textual comment that specifies the relevance
of the citation to the entry.
3.4.12 FEATURES Format
GenBank releases use a feature table format designed jointly by
GenBank, the EMBL Nucleotide Sequence Data Library, and the DNA Data
Bank of Japan. This format is in use by all three databases. The
most complete and accurate Feature Table documentation can be found
on the Web at:
http://0-www-ncbi-nlm-nih-gov.brum.beds.ac.uk/projects/collab/FT/index.html
Any discrepancy between the abbreviated feature table description
of these release notes and the complete documentation on the Web
should be resolved in favor of the version at the above URL.
The Feature Table specification is also available as a printed
document: `The DDBJ/EMBL/GenBank Feature Table: Definition'. Contact
GenBank at the address shown on the first page of these Release Notes
if you would like a copy.
The feature table contains information about genes and gene products,
as well as regions of biological significance reported in the
sequence. The feature table contains information on regions of the
sequence that code for proteins and RNA molecules. It also enumerates
differences between different reports of the same sequence, and
provides cross-references to other data collections, as described in
more detail below.
The first line of the feature table is a header that includes the
keyword `FEATURES' and the column header `Location/Qualifier.' Each
feature consists of a descriptor line containing a feature key and a
location (see sections below for details). If the location does not
fit on this line, a continuation line may follow. If further
information about the feature is required, one or more lines
containing feature qualifiers may follow the descriptor line.
The feature key begins in column 6 and may be no more than 15
characters in length. The location begins in column 22. Feature
qualifiers begin on subsequent lines at column 22. Location,
qualifier, and continuation lines may extend from column 22 to 80.
Feature tables are required, due to the mandatory presence of the
source feature. The sections below provide a brief introduction to
the feature table format.
3.4.12.1 Feature Key Names
The first column of the feature descriptor line contains the feature
key. It starts at column 6 and can continue to column 20. The list of
valid feature keys is shown below.
Remember, the most definitive documentation for the feature table can
be found at:
http://0-www-ncbi-nlm-nih-gov.brum.beds.ac.uk/projects/collab/FT/index.html
allele Obsolete; see variation feature key
attenuator Sequence related to transcription termination
C_region Span of the C immunological feature
CAAT_signal `CAAT box' in eukaryotic promoters
CDS Sequence coding for amino acids in protein (includes
stop codon)
conflict Independent sequence determinations differ
D-loop Displacement loop
D_segment Span of the D immunological feature
enhancer Cis-acting enhancer of promoter function
exon Region that codes for part of spliced mRNA
gene Region that defines a functional gene, possibly
including upstream (promotor, enhancer, etc)
and downstream control elements, and for which
a name has been assigned.
GC_signal `GC box' in eukaryotic promoters
iDNA Intervening DNA eliminated by recombination
intron Transcribed region excised by mRNA splicing
J_region Span of the J immunological feature
LTR Long terminal repeat
mat_peptide Mature peptide coding region (does not include stop codon)
misc_binding Miscellaneous binding site
misc_difference Miscellaneous difference feature
misc_feature Region of biological significance that cannot be described
by any other feature
misc_recomb Miscellaneous recombination feature
misc_RNA Miscellaneous transcript feature not defined by other RNA keys
misc_signal Miscellaneous signal
misc_structure Miscellaneous DNA or RNA structure
modified_base The indicated base is a modified nucleotide
mRNA Messenger RNA
mutation Obsolete: see variation feature key
N_region Span of the N immunological feature
old_sequence Presented sequence revises a previous version
polyA_signal Signal for cleavage & polyadenylation
polyA_site Site at which polyadenine is added to mRNA
precursor_RNA Any RNA species that is not yet the mature RNA product
prim_transcript Primary (unprocessed) transcript
primer Primer binding region used with PCR
primer_bind Non-covalent primer binding site
promoter A region involved in transcription initiation
protein_bind Non-covalent protein binding site on DNA or RNA
RBS Ribosome binding site
rep_origin Replication origin for duplex DNA
repeat_region Sequence containing repeated subsequences
repeat_unit One repeated unit of a repeat_region
rRNA Ribosomal RNA
S_region Span of the S immunological feature
satellite Satellite repeated sequence
scRNA Small cytoplasmic RNA
sig_peptide Signal peptide coding region
snRNA Small nuclear RNA
source Biological source of the sequence data represented by
a GenBank record. Mandatory feature, one or more per record.
For organisms that have been incorporated within the
NCBI taxonomy database, an associated /db_xref="taxon:NNNN"
qualifier will be present (where NNNNN is the numeric
identifier assigned to the organism within the NCBI taxonomy
database).
stem_loop Hair-pin loop structure in DNA or RNA
STS Sequence Tagged Site; operationally unique sequence that
identifies the combination of primer spans used in a PCR assay
TATA_signal `TATA box' in eukaryotic promoters
terminator Sequence causing transcription termination
transit_peptide Transit peptide coding region
transposon Transposable element (TN)
tRNA Transfer RNA
unsure Authors are unsure about the sequence in this region
V_region Span of the V immunological feature
variation A related population contains stable mutation
- (hyphen) Placeholder
-10_signal `Pribnow box' in prokaryotic promoters
-35_signal `-35 box' in prokaryotic promoters
3'clip 3'-most region of a precursor transcript removed in processing
3'UTR 3' untranslated region (trailer)
5'clip 5'-most region of a precursor transcript removed in processing
5'UTR 5' untranslated region (leader)
3.4.12.2 Feature Location
The second column of the feature descriptor line designates the
location of the feature in the sequence. The location descriptor
begins at position 22. Several conventions are used to indicate
sequence location.
Base numbers in location descriptors refer to numbering in the entry,
which is not necessarily the same as the numbering scheme used in the
published report. The first base in the presented sequence is numbered
base 1. Sequences are presented in the 5' to 3' direction.
Location descriptors can be one of the following:
1. A single base;
2. A contiguous span of bases;
3. A site between two bases;
4. A single base chosen from a range of bases;
5. A single base chosen from among two or more specified bases;
6. A joining of sequence spans;
7. A reference to an entry other than the one to which the feature
belongs (i.e., a remote entry), followed by a location descriptor
referring to the remote sequence;
A site between two residues, such as an endonuclease cleavage site, is
indicated by listing the two bases separated by a carat (e.g., 23^24).
A single residue chosen from a range of residues is indicated by the
number of the first and last bases in the range separated by a single
period (e.g., 23.79). The symbols < and > indicate that the end point
of the range is beyond the specified base number.
A contiguous span of bases is indicated by the number of the first and
last bases in the range separated by two periods (e.g., 23..79). The
symbols < and > indicate that the end point of the range is beyond the
specified base number. Starting and ending positions can be indicated
by base number or by one of the operators described below.
Operators are prefixes that specify what must be done to the indicated
sequence to locate the feature. The following are the operators
available, along with their most common format and a description.
complement (location): The feature is complementary to the location
indicated. Complementary strands are read 5' to 3'.
join (location, location, .. location): The indicated elements should
be placed end to end to form one contiguous sequence.
order (location, location, .. location): The elements are found in the
specified order in the 5 to 3 direction, but nothing is implied about
the rationality of joining them.
3.4.12.3 Feature Qualifiers
Qualifiers provide additional information about features. They take
the form of a slash (/) followed by a qualifier name and, if
applicable, an equal sign (=) and a qualifier value. Feature
qualifiers begin at column 22.
Qualifiers convey many types of information. Their values can,
therefore, take several forms:
1. Free text;
2. Controlled vocabulary or enumerated values;
3. Citations or reference numbers;
4. Sequences;
5. Feature labels.
Text qualifier values must be enclosed in double quotation marks. The
text can consist of any printable characters (ASCII values 32-126
decimal). If the text string includes double quotation marks, each set
must be `escaped' by placing a double quotation mark in front of it
(e.g., /note="This is an example of ""escaped"" quotation marks").
Some qualifiers require values selected from a limited set of choices.
For example, the `/direction' qualifier has only three values `left,'
`right,' or `both.' These are called controlled vocabulary qualifier
values. Controlled qualifier values are not case sensitive; they can
be entered in any combination of upper- and lowercase without changing
their meaning.
Citation or published reference numbers for the entry should be
enclosed in square brackets ([]) to distinguish them from other
numbers.
A literal sequence of bases (e.g., "atgcatt") should be enclosed in
quotation marks. Literal sequences are distinguished from free text by
context. Qualifiers that take free text as their values do not take
literal sequences, and vice versa.
The `/label=' qualifier takes a feature label as its qualifier.
Although feature labels are optional, they allow unambiguous
references to the feature. The feature label identifies a feature
within an entry; when combined with the accession number and the name
of the data bank from which it came, it is a unique tag for that
feature. Feature labels must be unique within an entry, but can be the
same as a feature label in another entry. Feature labels are not case
sensitive; they can be entered in any combination of upper-and
lowercase without changing their meaning.
The following is a partial list of feature qualifiers.
/anticodon Location of the anticodon of tRNA and the amino acid
for which it codes
/bound_moiety Moiety bound
/citation Reference to a citation providing the claim of or
evidence for a feature
/codon Specifies a codon that is different from any found in the
reference genetic code
/codon_start Indicates the first base of the first complete codon
in a CDS (as 1 or 2 or 3)
/cons_splice Identifies intron splice sites that do not conform to
the 5'-GT... AG-3' splice site consensus
/db_xref A database cross-reference; pointer to related information
in another database. A description of all cross-references
can be found at:
http://0-www-ncbi-nlm-nih-gov.brum.beds.ac.uk/collab/db_xref.html
/direction Direction of DNA replication
/EC_number Enzyme Commission number for the enzyme product of the
sequence
/evidence Value indicating the nature of supporting evidence
/frequency Frequency of the occurrence of a feature
/function Function attributed to a sequence
/gene Symbol of the gene corresponding to a sequence region (usable
with all features)
/label A label used to permanently identify a feature
/map Map position of the feature in free-format text
/mod_base Abbreviation for a modified nucleotide base
/note Any comment or additional information
/number A number indicating the order of genetic elements
(e.g., exons or introns) in the 5 to 3 direction
/organism Name of the organism that is the source of the
sequence data in the record.
/partial Differentiates between complete regions and partial ones
/phenotype Phenotype conferred by the feature
/product Name of a product encoded by a coding region (CDS)
feature
/pseudo Indicates that this feature is a non-functional
version of the element named by the feature key
/rpt_family Type of repeated sequence; Alu or Kpn, for example
/rpt_type Organization of repeated sequence
/rpt_unit Identity of repeat unit that constitutes a repeat_region
/standard_name Accepted standard name for this feature
/transl_except Translational exception: single codon, the translation
of which does not conform to the reference genetic code
/translation Amino acid translation of a coding region
/type Name of a strain if different from that in the SOURCE field
/usedin Indicates that feature is used in a compound feature
in another entry
3.4.12.4 Cross-Reference Information
One type of information in the feature table lists cross-references to
the annual compilation of transfer RNA sequences in Nucleic Acids
Research, which has kindly been sent to us on CD-ROM by Dr. Sprinzl.
Each tRNA entry of the feature table contains a /note= qualifier that
includes a reference such as `(NAR: 1234)' to identify code 1234 in
the NAR compilation. When such a cross-reference appears in an entry
that contains a gene coding for a transfer RNA molecule, it refers to
the code in the tRNA gene compilation. Similar cross-references in
entries containing mature transfer RNA sequences refer to the
companion compilation of tRNA sequences published by D.H. Gauss and M.
Sprinzl in Nucleic Acids Research.
3.4.12.5 Feature Table Examples
In the first example a number of key names, feature locations, and
qualifiers are illustrated, taken from different sequences. The first
table entry is a coding region consisting of a simple span of bases
and including a /gene qualifier. In the second table entry, an NAR
cross-reference is given (see the previous section for a discussion of
these cross-references). The third and fourth table entries use the
symbols `<`and `>' to indicate that the beginning or end of the
feature is beyond the range of the presented sequence. In the fifth
table entry, the symbol `^' indicates that the feature is between
bases.
1 10 20 30 40 50 60 70 79
---------+---------+---------+---------+---------+---------+---------+---------
CDS 5..1261
/product="alpha-1-antitrypsin precursor"
/map="14q32.1"
/gene="PI"
tRNA 1..87
/note="Leu-tRNA-CAA (NAR: 1057)"
/anticodon=(pos:35..37,aa:Leu)
mRNA 1..>66
/note="alpha-1-acid glycoprotein mRNA"
transposon <1..267
/note="insertion element IS5"
misc_recomb 105^106
/note="B.subtilis DNA end/IS5 DNA start"
conflict 258
/replace="t"
/citation=[2]
---------+---------+---------+---------+---------+---------+---------+---------
1 10 20 30 40 50 60 70 79
Example 10. Feature Table Entries
The next example shows the representation for a CDS that spans more
than one entry.
1 10 20 30 40 50 60 70 79
---------+---------+---------+---------+---------+---------+---------+---------
LOCUS HUMPGAMM1 3688 bp ds-DNA PRI 15-OCT-1990
DEFINITION Human phosphoglycerate mutase (muscle specific isozyme) (PGAM-M)
gene, 5' end.
ACCESSION M55673 M25818 M27095
KEYWORDS phosphoglycerate mutase.
SEGMENT 1 of 2
.
.
.
FEATURES Location/Qualifiers
CAAT_signal 1751..1755
/gene="PGAM-M"
TATA_signal 1791..1799
/gene="PGAM-M"
exon 1820..2274
/number=1
/EC_number="5.4.2.1"
/gene="PGAM-M"
intron 2275..2377
/number=1
/gene="PGAM2"
exon 2378..2558
/number=2
/gene="PGAM-M"
.
.
.
//
LOCUS HUMPGAMM2 677 bp ds-DNA PRI 15-OCT-1990
DEFINITION Human phosphoglycerate mutase (muscle specific isozyme) (PGAM-M),
exon 3.
ACCESSION M55674 M25818 M27096
KEYWORDS phosphoglycerate mutase.
SEGMENT 2 of 2
.
.
.
FEATURES Location/Qualifiers
exon 255..457
/number=3
/gene="PGAM-M"
intron order(M55673:2559..>3688,<1..254)
/number=2
/gene="PGAM-M"
mRNA join(M55673:1820..2274,M55673:2378..2558,255..457)
/gene="PGAM-M"
CDS join(M55673:1861..2274,M55673:2378..2558,255..421)
/note="muscle-specific isozyme"
/gene="PGAM2"
/product="phosphoglycerate mutase"
/codon_start=1
/translation="MATHRLVMVRHGESTWNQENRFCGWFDAELSEKGTEEAKRGAKA
IKDAKMEFDICYTSVLKRAIRTLWAILDGTDQMWLPVVRTWRLNERHYGGLTGLNKAE
TAAKHGEEQVKIWRRSFDIPPPPMDEKHPYYNSISKERRYAGLKPGELPTCESLKDTI
ARALPFWNEEIVPQIKAGKRVLIAAHGNSLRGIVKHLEGMSDQAIMELNLPTGIPIVY
ELNKELKPTKPMQFLGDEETVRKAMEAVAAQGKAK"
.
.
.
//
---------+---------+---------+---------+---------+---------+---------+---------
1 10 20 30 40 50 60 70 79
Example 11. Joining Sequences
3.4.13 ORIGIN Format
The ORIGIN record may be left blank, may appear as `Unreported.' or
may give a local pointer to the sequence start, usually involving an
experimentally determined restriction cleavage site or the genetic
locus (if available). The ORIGIN record ends in a period if it
contains data, but does not include the period if the record is left
empty (in contrast to the KEYWORDS field which contains a period
rather than being left blank).
3.4.14 SEQUENCE Format
The nucleotide sequence for an entry is found in the records following
the ORIGIN record. The sequence is reported in the 5' to 3' direction.
There are sixty bases per record, listed in groups of ten bases
followed by a blank, starting at position 11 of each record. The
number of the first nucleotide in the record is given in columns 4 to
9 (right justified) of the record.
3.4.15 CONTIG Format
As an alternative to SEQUENCE, a CONTIG record can be present
following the ORIGIN record. A join() statement utilizing a syntax
similar to that of feature locations (see the Feature Table specification
mentioned in Section 3.4.12) provides the accession numbers and basepair
ranges of other GenBank sequences which contribute to a large-scale
biological object, such as a chromosome or complete genome. Here is
an example of the use of CONTIG :
CONTIG join(AE003590.3:1..305900,AE003589.4:61..306076,
AE003588.3:61..308447,AE003587.4:61..314549,AE003586.3:61..306696,
AE003585.5:61..343161,AE003584.5:61..346734,AE003583.3:101..303641,
[ lines removed for brevity ]
AE003782.4:61..298116,AE003783.3:16..111706,AE002603.3:61..143856)
However, the CONTIG join() statement can also utilize a special operator
which is *not* part of the syntax for feature locations:
gap() : Gap of unknown length.
gap(X) : Gap with an estimated integer length of X bases.
To be represented as a run of n's of length X
in the sequence that can be constructed from
the CONTIG line join() statement .
gap(unkX) : Gap of unknown length, which is to be represented
as an integer number (X) of n's in the sequence that
can be constructed from the CONTIG line join()
statement.
The value of this gap operator consists of the
literal characters 'unk', followed by an integer.
Here is an example of a CONTIG line join() that utilizes the gap() operator:
CONTIG join(complement(AADE01002756.1:1..10234),gap(1206),
AADE01006160.1:1..1963,gap(323),AADE01002525.1:1..11915,gap(1633),
AADE01005641.1:1..2377)
The first and last elements of the join() statement may be a gap() operator.
But if so, then those gaps should represent telomeres, centromeres, etc.
Consecutive gap() operators are illegal.
4. ALTERNATE RELEASES
NCBI is supplying sequence data in the GenBank flat file format to
maintain compatibility with existing software which require that
particular format. Although we have made every effort to ensure
that these data are presented in the traditional flat file format,
if you encounter any problems in using these data with software which
is based upon the flat file format, please contact us at:
info@ncbi.nlm.nih.gov
The flat file is just one of many possible report formats that can be
generated from the richer representation supported by the ASN.1 form of the
data. Developers of new software tools should consider using the ASN.1 form
directly to take advantage of those features. Documentation and a Software
Developer's Toolkit for ASN.1 are available through NCBI. You may call NCBI
at (301)496-2475, or subscribe to a developers' electronic newsgroup by
sending your name, address, affiliation, and e-mail address to:
bits-request@ncbi.nlm.nih.gov
The Software Developer's Toolkit and PostScript documentation for UNIX,
VMS, Ultrix, AIX, MacOS, DOS, and Microsoft Windows systems is available
in a compressed UNIX tar file by anonymous ftp from 'ftp.ncbi.nih.gov',
in the toolbox/ncbi_tools directory. The file is 'ncbi.tar.Z'.
5. KNOWN PROBLEMS OF THE GENBANK DATABASE
5.1 Incorrect Gene Symbols in Entries and Index
The /gene qualifier for many GenBank entries contains values other than the
official gene symbol, such as the product or the standard name of the gene. The
gene symbol index (gbgen.idx) is created from the data in the /gene qualifier
and therefore may contain data other than official gene symbols.
6. GENBANK ADMINISTRATION
The National Center for Biotechnology Information (NCBI), National Library
of Medicine, National Institutes of Health, is responsible for the production
and distribution of the NIH GenBank Sequence Database. NCBI distributes
GenBank sequence data by anonymous FTP, e-mail servers and other
network services. For more information, you may contact NCBI at the
e-mail address: info@ncbi.nlm.nih.gov or by phone: 301-496-2475.
6.1 Registered Trademark Notice
GenBank (R) is a registered trademark of the U.S. Department of Health
and Human Services for the Genetic Sequence Data Bank.
6.2 Citing GenBank
If you have used GenBank in your research, we would appreciate it if
you would include a reference to GenBank in all publications related
to that research.
When citing data in GenBank, it is appropriate to give the sequence
name, primary accession number, and the publication in which the
sequence first appeared. If the data are unpublished, we urge you to
contact the group which submitted the data to GenBank to see if there
is a recent publication or if they have determined any revisions or
extensions of the data.
It is also appropriate to list a reference for GenBank itself. The
following publication, which describes the GenBank database, should
be cited:
Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Wheeler DL.
GenBank. Nucleic Acids Res. 36(Database issue), D25-30 (2008)
The following statement is an example of how you may cite GenBank
data. It cites the sequence, its primary accession number, the group
who determined the sequence, and GenBank. The numbers in parentheses
refer to the GenBank citation above and to the REFERENCE in the
GenBank sequence entry.
`We scanned the GenBank (1) database for sequence similarities and
found one sequence (2), GenBank accession number J01016, which showed
significant similarity...'
(1) Benson, D.A. et al, Nucleic Acids Res. 36(Database issue), D25-30 (2008)
(2) Nellen, W. and Gallwitz, D. J. Mol. Biol. 159, 1-18 (1982)
6.3 GenBank Distribution Formats and Media
Complete flat file releases of the GenBank database are available via
NCBI's anonymous ftp server:
ftp://ftp.ncbi.nih.gov
Each release is cumulative, incorporating all previous GenBank data.
No retrieval software is provided. GenBank distribution via CD-ROM
ceased as of GenBank Release 106.0 (April, 1998).
A mirror of the GenBank FTP site at the NCBI is available at the
University of Indiana:
ftp://bio-mirror.net/biomirror/genbank/
6.4 Other Methods of Accessing GenBank Data
Entrez is a molecular biology database system that presents an integrated
view of DNA and protein sequence data, 3D structure data, complete genomes,
and associated MEDLINE entries. The system is produced by the National
Center for Biotechnology Information (NCBI), and is available only via
the Internet (using the Web-Entrez and Network-Entrez applications).
Accessing Entrez is easy: if you have a World Wide Web browser, such as
Netscape or Internet-Explorer, simply point your browser to:
http://0-www-ncbi-nlm-nih-gov.brum.beds.ac.uk/
The Web version of Entrez has all the capabilities of the network version,
but with the visual style of the World Wide Web. If you prefer the "look and
feel" of Network-Entrez, you may download Network-Entrez from the NCBI's
FTP server:
ftp://ftp.ncbi.nih.gov/
Versions are available for PC/Windows, Macintosh and several Unix variants.
For information about Network-Entrez, Web-Entrez or any other NCBI
services, you may contact NCBI by e-mail at info@ncbi.nlm.nih.gov or by
phone at 301-496-2475.
6.5 Request for Corrections and Comments
We welcome your suggestions for improvements to GenBank. We are
especially interested to learn of errors or inconsistencies in the
data. BankIt or Sequin can be used to submit revisions to previous
submissions. In addition, suggestions and corrections can be sent by
electronic mail to: update@ncbi.nlm.nih.gov. Please be certain to
indicate the GenBank release number (e.g., Release 176.0) and the
primary accession number of the entry to which your comments apply; it
is helpful if you also give the entry name and the current contents of
any data field for which you are recommending a change.
6.6 Credits and Acknowledgments
Credits -
GenBank Release Coordination
Mark Cavanaugh
GenBank Submission Coordination
Ilene Mizrachi
GenBank Annotation Staff
Yiming Bao, Michael Baxter, Lori Black, Larissa Brown, Vincent Calhoun,
Larry Chlumsky, Karen Clark, Michel Eschenbrenner, Irene Fang,
Michael Fetchko, Linda Frisse, Andrea Gocke, Anjanette Johnston,
Mark Landree, Jason Lowry, Richard McVeigh, Ilene Mizrachi,
DeAnne Olsen Cravaritis, Chris O'Sullivan, Leigh Riley, Gert Roosen,
Susan Schafer, Beverly Underwood, Melissa Wright, and Linda Yankie
Data Management and Preparation
Serge Bazhin, Colleen Bollin, Mark Cavanaugh, Yoon Choi, Ilya Dondoshansky,
WonHee Jang, Jonathan Kans, Michael Kimelman, Jim Ostell, Carolyn Shenmen,
Denis Sinyakov, Karl Sirotkin, Vladimir Soussov, Elena Starchenko,
Hanzhen Sun, Tatiana Tatusova, Aleksey Vysokolov, Lukas Wagner, Jane Weisemann,
Eugene Yaschenko, Sergey Zhdanov
Database Administration
Slava Khotomliansky, Igor Lozitskiy, Ben Slade, Constantin Vasilyev
User Support
Peter Cooper, Wayne Matten, Scott McGinnis, Steve Pechous, Monica Romiti,
Eric Sayers, Tao Tao, Majda Valjavec-Gratian
Project Direction
David Lipman
Acknowledgments -
Contractor support for GenBank production and distribution has been
provided by Management Systems Designers, Inc., ComputerCraft Corporation,
and The KEVRIC Company, Inc.
6.7 Disclaimer
The United States Government makes no representations or warranties
regarding the content or accuracy of the information. The United States
Government also makes no representations or warranties of merchantability
or fitness for a particular purpose or that the use of the sequences will
not infringe any patent, copyright, trademark, or other rights. The
United States Government accepts no responsibility for any consequence
of the receipt or use of the information.
For additional information about GenBank releases, please contact
NCBI by e-mail at info@ncbi.nlm.nih.gov, by phone at (301) 496-2475,
or by mail at:
GenBank
National Library of Medicine
Bldg. 38A Rm. 8N-809
8600 Rockville Pike
Bethesda, MD 20894
FAX: (301) 480-9241