NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE47933 Query DataSets for GSE47933
Status Public on Aug 28, 2013
Title Impact of library preparation on downstream analysis and interpretation of RNA-seq data: comparison between Illumina PolyA and NuGEN Ovation protocol
Organism Homo sapiens
Experiment type Expression profiling by high throughput sequencing
Summary Objectives: The sequencing by the PolyA selection is the most common approach for library preparation. With limited amount or degraded RNA, alternative protocols such as the NuGEN have been developed. However, it is not yet clear how the different library preparations affect the downstream analyses of the broad applications of RNA sequencing. Methods and Materials: Eight human mammary epithelial cell (HMEC) lines with high quality RNA were sequenced by Illumina’s mRNA-Seq PolyA selection and NuGEN ENCORE library preparation. The following analyses and comparisons were conducted: 1) the numbers of genes captured by each protocol; 2) the impact of protocols on differentially expressed gene detection between biological replicates; 3) expressed single nucleotide variant (SNV) detection; 4) non-coding RNAs, particularly lincRNA detection; and 5) intragenic gene expression. Results: Sequences from the NuGEN protocol had lower (75%) alignment rate than the PolyA (over 90%). The NuGEN protocol detected fewer genes (12-20% less) with a significant portion of reads mapped to non-coding regions. A large number of genes were differentially detected between the two protocols. About 17-20% of the differentially expressed genes between biological replicates were commonly detected between the two protocols. Significantly higher numbers of SNVs (5-6 times) were detected in the NuGEN samples, which were largely from intragenic and intergenic regions. The NuGEN captured fewer exons (25% less) and had higher base level coverage variance. While 6.3% of reads were mapped to intragenic regions in the PolyA samples, the percentages were much higher (20-25%) for the NuGEN samples. The NuGEN protocol did not detect more known non-coding RNAs such as lincRNAs, but targeted small and “novel” lincRNAs. Conclusion: Different library preparations can have significant impacts on downstream analysis and interpretation of RNA-seq data. The NuGEN provides an alternative for limited or degraded RNA but it has limitations for some RNA-seq applications.
 
Overall design Eight human mammary epithelial cell (HMEC) lines with high quality RNA were sequenced by Illumina’s mRNA-Seq PolyA selection and NuGEN ENCORE library preparation. The goal was to look the impacts of protocols on results and intepretation.
 
Contributor(s) Sun Z, Asmann YW, Nair A, Zhang Y, Wang L, Kalari KR, Bhagwate AV, Baker TR, Carr JM, Kocher JA, Perez EA, Thompson EA
Citation(s) 23977132
Submission date Jun 13, 2013
Last update date May 15, 2019
Contact name Zhifu Sun
Organization name Mayo Clinic
Department HSR-BSI
Street address 200 1st ST SW
City Rochester
State/province MN
ZIP/Postal code 55905
Country USA
 
Platforms (2)
GPL10999 Illumina Genome Analyzer IIx (Homo sapiens)
GPL11154 Illumina HiSeq 2000 (Homo sapiens)
Samples (16)
GSM1162717 DHF168_PolyA
GSM1162718 BSO19B_PolyA
GSM1162719 BSO28_PolyA
Relations
BioProject PRJNA208380
SRA SRP026013

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE47933_raw.gene.count.txt.gz 783.5 Kb (ftp)(http) TXT
SRA Run SelectorHelp
Raw data are available in SRA
Processed data are available on Series record

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap