The RefSeq collection comprises data types of different origins, so standard categories and identifiers are needed to store each one. The most important categories are:
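Each category is recognizable from the prefix of the accession. The following is a minimal Python sketch of how a prefix could be mapped to a molecule type. The table covers only a subset of the prefixes documented by NCBI, and the names `PREFIX_TO_MOLECULE`, `ACCESSION_PATTERN` and `classify_accession` are hypothetical helpers introduced for illustration, not part of any NCBI tool. The pattern is a simplification that assumes a two-letter prefix, an underscore, an alphanumeric body and an optional version suffix.

```python
import re

# Illustrative subset of RefSeq accession prefixes (not exhaustive).
# The molecule-type labels are paraphrased for this sketch.
PREFIX_TO_MOLECULE = {
    "NC_": "genomic",  # complete genomic molecule
    "NG_": "genomic",  # incomplete genomic region
    "NT_": "genomic",  # contig or scaffold
    "NW_": "genomic",  # contig or scaffold, mainly whole-genome shotgun
    "NM_": "mRNA",     # protein-coding transcript
    "NR_": "RNA",      # non-protein-coding transcript
    "NP_": "protein",  # protein
    "XM_": "mRNA",     # predicted (model) protein-coding transcript
    "XR_": "RNA",      # predicted (model) non-coding transcript
    "XP_": "protein",  # predicted (model) protein
    "WP_": "protein",  # non-redundant protein
}

# Assumed, simplified shape: two letters, underscore, body, optional ".version".
ACCESSION_PATTERN = re.compile(r"^([A-Z]{2}_)([A-Z0-9]+)(?:\.(\d+))?$")


def classify_accession(accession):
    """Return (prefix, molecule_type, version) for a RefSeq-style accession.

    molecule_type is "unknown" for prefixes missing from the table above;
    version is None when the accession carries no ".N" suffix.
    """
    match = ACCESSION_PATTERN.match(accession.strip())
    if match is None:
        raise ValueError(f"not a RefSeq-style accession: {accession!r}")
    prefix, _body, version = match.groups()
    molecule_type = PREFIX_TO_MOLECULE.get(prefix, "unknown")
    return prefix, molecule_type, int(version) if version else None
```

With this sketch, `classify_accession("NM_000546.6")` returns `("NM_", "mRNA", 6)`. A string without the underscore-delimited prefix, such as a UniProt-style identifier, raises `ValueError`.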
RefSeq accession categories and molecule types

According to RefSeq release 213 (July 2022), the numbers of species represented in the database, counted by distinct taxonomic IDs, are as follows: