NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE17041 Query DataSets for GSE17041
Status Public on Jul 10, 2009
Title Statistical identification of gene association by CID in application of constructing ER regulatory network
Organism Homo sapiens
Experiment type Expression profiling by array
Summary A variety of high-throughput techniques are now available for constructing comprehensive gene regulatory networks in systems biology. In this study, we report a new statistical approach for facilitating in silico inference of regulatory network structure. The new measure of association, coefficient of intrinsic dependence (CID), is model-free and can be applied to both continuous and categorical distributions. When given two variables X and Y, CID answers whether Y is dependent on X by examining the conditional distribution of Y given X. In this paper, we apply CID to analyze the regulatory relationships between transcription factors (TFs) (X) and their downstream genes (Y) based on clinical data. More specifically, we use estrogen receptor alpha (ERalpha) as the variable X, and the analyses are based on 48 clinical breast cancer gene expression arrays (48A). RESULTS: The analytical utility of CID was evaluated in comparison with four commonly used statistical methods, Galton-Pearson's correlation coefficient (GPCC), Student's t-test (STT), coefficient of determination (CoD), and mutual information (MI). When being compared to GPCC, CoD, and MI, CID reveals its preferential ability to discover the regulatory association where distribution of the mRNA expression levels on X and Y does not fit linear models. On the other hand, when CID is used to measure the association of a continuous variable (Y) against a discrete variable (X), it shows similar performance as compared to STT, and appears to outperform CoD and MI. In addition, this study established a two-layer transcriptional regulatory network to exemplify the usage of CID, in combination with GPCC, in deciphering gene networks based on gene expression profiles from patient arrays. CONCLUSION: CID is shown to provide useful information for identifying associations between genes and transcription factors of interest in patient arrays. When coupled with the relationships detected by GPCC, the association predicted by CID are applicable to the construction of transcriptional regulatory networks. This study shows how information from different data sources and learning algorithms can be integrated to investigate whether relevant regulatory mechanisms identified in cell models can also be partially re-identified in clinical samples of breast cancers. AVAILABILITY: the implementation of CID in R codes can be freely downloaded from (http://homepage.ntu.edu.tw/~lyliu/BC/).
 
Overall design Total 48 clinical arrays (48A) used in this study can be found in GSE9309. We designed the experiments using a given breast cancer population with clear status of estrogen receptor alpha (ER), which were confirmed by immunochemical staining (If ³10% immunopositive stain is found at tumor section, we designate it as ER(+). Otherwise, it is ER(-). ) in this study. 48A consist of 36A with positive in ER status and of 12A with negative in ER status.
 
Citation(s) 19292896
Submission date Jul 10, 2009
Last update date Jan 02, 2014
Contact name Fon-Jou Hsieh
E-mail(s) fjhsieh@ntu.edu.tw
Organization name National Taiwan University
Department Department of obstetrics and gynecology
Street address No. 7, Chung-Shan South Rd
City Taipei
ZIP/Postal code 100
Country Taiwan
 
Platforms (1)
GPL887 Agilent-012097 Human 1A Microarray (V2) G4110B (Feature Number version)
Samples (48)
GSM237139 breast tumor part_IDC_rep1_1261
GSM237140 breast tumor part_IDC_rep1_1329
GSM237141 breast tumor part_IDC_rep1_1331
Relations
BioProject PRJNA117843

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE17041_RAW.tar 208.3 Mb (http)(custom) TAR (of TXT)
Processed data included within Sample table

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap