NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE35571 Query DataSets for GSE35571
Status Public on Nov 12, 2013
Title Gene expression data from 131 human subjects in Detroit, Michigan
Organism Homo sapiens
Experiment type Expression profiling by array
Summary Background
Complex diseases are often difficult to diagnose, treat and study due to the multi-factorial nature of the underlying etiology. Large data sets are now widely available that can be used to define novel, mechanistically distinct disease subtypes (endotypes) in a completely data-driven manner. However, significant challenges exist with regard to how to segregate individuals into suitable subtypes of the disease and understand the distinct biological mechanisms of each when the goal is to maximize the discovery potential of these data sets.

Results
A multi-step decision tree-based method is described for defining endotypes based on gene expression, clinical covariates, and disease indicators using childhood asthma as a case study. We attempted to use alternative approaches such as the Student’s t-test, single data domain clustering and the Modk-prototypes algorithm, which incorporates multiple data domains into a single analysis and none performed as well as the novel multi-step decision tree method. This new method gave the best segregation of asthmatics and non-asthmatics, and it provides easy access to all genes and clinical covariates that distinguish the groups.

Conclusions
The multi-step decision tree method described here will lead to better understanding of complex disease in general by allowing purely data-driven disease endotypes to facilitate the discovery of new mechanisms underlying these diseases. This application should be considered a complement to ongoing efforts to better define and diagnose known endotypes. When coupled with existing methods developed to determine the genetics of gene expression, these methods provide a mechanism for linking genetics and exposomics data and thereby accounting for both major determinants of disease.
 
Overall design We collected peripheral blood samples from 131 children for RNA extraction and hybridization of target cDNA onto Affymetrix Human U133 Plus 2.0 whole genome array.
 
Contributor(s) Williams-DeVane CR, Reif DM, Hubal EC, Bushe PR, Hudgens EE, Gallagher JE, Edwards SW
Citation(s) 24188919, 25643280
Submission date Feb 06, 2012
Last update date Mar 25, 2019
Contact name Rebecca Catherine Fry
E-mail(s) rfry@unc.edu
Organization name UNC-Chapel Hill
Department Environmental Sciences and Engineering
Street address 1213 MHRC
City Chapel Hill
ZIP/Postal code 27599
Country USA
 
Platforms (1)
GPL570 [HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array
Samples (131)
GSM870902 child-1
GSM870903 child-2
GSM870904 child-5
Relations
BioProject PRJNA152221

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE35571_RAW.tar 596.9 Mb (http)(custom) TAR (of CEL)
Raw data provided as supplementary file
Processed data included within Sample table

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap