|
|
GEO help: Mouse over screen elements for information. |
|
Status |
Public on Jul 19, 2012 |
Title |
Chromatin Interactions by 5C from ENCODE/University of Washington |
Project |
ENCODE
|
Organism |
Homo sapiens |
Experiment type |
Other
|
Summary |
This data was generated by ENCODE. If you have questions about the data, contact the submitting laboratory directly (Richard Sandstrom mailto:sull@u.washington.edu). If you have questions about the Genome Browser track associated with this data, contact ENCODE (mailto:genome@soe.ucsc.edu).
This track contains chromatin interaction data from the University of Washington ENCODE group generated using 5C (Chromatin Conformation Capture Carbon Copy). The 5C method is used here to define short and long-range range interactions between transcription start sites (TSS) and DNaseI hypersensitive sites (DHS) or other genomic features. The 5C method is summarized below. Transcription factors bind to promoter-associated proteins, bringing the associated DNA sequences in close proximity to each other. Cross linking the DNA and proteins immobilizes these interactions and thus maintains their close proximity. Cleavage of the sample with restriction endonuclease followed by ligation results in hybrid molecules where a fragment with a regulatory element is physically associated with a fragment containing a TSS. The interactions are then detected by oligonucleotide-dependent, ligation-mediated assays, where one set of primers is complementary to the end of fragments with a TSS and the second set of primers are complementary to fragments with a feature. Primers are designed to the forward strand of the feature and the reverse strand of the TSS so that ligation only occurs between TSS and feature, not between different features. Specific interactions are detected by massively parallel sequencing. The data in this track comprises two different experiment types focusing on targeted regions: Gene-targeted project: Analysis of DNase I hypersensitive sites reveals many genes where there are multiple sites restricted to the cell type where a protein is observed to be expressed. These sites potentially identify regulatory sites for the gene. This set of experiments attempts to observe interactions between these DHS sites and transcription starts in 25 regions selected based on genes expressed in GM06990 (B-lymphocyte), BJ (foreskin fibroblast), HepG2 (liver cancer cell line), or SK-N-SH_RA (neuroblastoma cell line, SKNSH, differentiated with retinoic acid). Myc project: Genome wide association studies have identified SNPs linked to prostate, colon, and breast cancer in the gene desert region upstream of the myc gene. 5C of HindIII fragments interacting with those containing refSeq txStarts in this region were performed in 5 cell types: GM12878 (B-lymphocyte), CaCo2 (colon cancer cell line), LNCaP (prostate cancer cell line), MCF7 (breast cancer cell line), and K562 (erythroleukemia cell line).
For data usage terms and conditions, please refer to http://www.genome.gov/27528022 and http://www.genome.gov/Pages/Research/ENCODE/ENCODEDataReleasePolicyFinal2008.pdf
|
|
|
Overall design |
Cells were grown according to the approved ENCODE cell culture protocols. The isolated nuclei were formaldehyde cross-linked. The DNA isolated from the nuclei was cleaved with restriction enzyme, ligated, and cross-links removed to create a 3C library (Dekker et al., 2002). Primers complementary to the TSS and feature were added, annealed and ligated to produce a 5C library (Dostie et al., 2006). The DNA fragments generated in the ligation mediated-reactions were partially digested with DNaseI, end-repaired and ligated to adapters, before sequencing. The sequencing reads generated were mapped to the predicted ligation products. The number of sequences mapping to predicted junction fragments were tabulated from sequencing runs. The number of times a sequence was detected for a given interaction between a TSS and feature indicates the relative strength of the interation. Gene-targeted project: Forward primers were designed to HindIII sites in a 230-415 kb sequence centered on the DNase I hypersensitive sites of interest. Reverse primers were designed to HindIII sites for all transcription starts extending 1 Mb on either side of the region targeted by the forward primer set. Matrix files are labeled by the coordinates of the region covered by the forward primer set. These experiments were done in a multiplex manner with the forward and reverse primers for all 25 regions mixed together in a single reaction. Two replicates were performed for 4 cell lines for 25 regions. High-throughput sequencing was performed on an ABI SOLiD instrument collecting 50 bp reads. The interaction files provided map all the reads in the output sequence without a mismatch threshold. Myc project: Forward primers were designed to HindIII fragments of 4.29 Mb section of human chromosome 8 centered on the gene desert 5~R of the myc gene. Reverse primers were designed to all HindIII fragments containing refseq txStarts in a 7.6 Mb region extending > 2 Mb on either side of the forward primer set. High-throughput sequencing was performed on an ABI SOLiD instrument collecting 50 bp reads. The interaction files provided map all the reads in the output sequence without a mismatch threshold. Data were verified by sequencing biological replicates displaying correlation coefficient > 0.9.
|
Web link |
http://genome.ucsc.edu/cgi-bin/hgTrackUi?db=hg19&g=wgEncodeUw5C
|
|
|
Contributor(s) |
Sandstrom R |
Citation missing |
Has this study been published? Please login to update or notify GEO. |
BioProject |
PRJNA63443 |
Submission date |
Jul 19, 2012 |
Last update date |
May 15, 2019 |
Contact name |
ENCODE DCC |
E-mail(s) |
encode-help@lists.stanford.edu
|
Organization name |
ENCODE DCC
|
Street address |
300 Pasteur Dr
|
City |
Stanford |
State/province |
CA |
ZIP/Postal code |
94305-5120 |
Country |
USA |
|
|
Platforms (1) |
GPL9052 |
Illumina Genome Analyzer (Homo sapiens) |
|
Samples (9)
|
|
Relations |
SRA |
SRP014479 |
Supplementary file |
Size |
Download |
File type/resource |
GSE39505_RAW.tar |
1.6 Mb |
(http)(custom) |
TAR (of TAR) |
GSE39505_wgEncodeUw5cGeneTargetedPrimers.txt.gz |
130.1 Kb |
(ftp)(http) |
TXT |
GSE39505_wgEncodeUw5cMycPrimers.txt.gz |
39.7 Kb |
(ftp)(http) |
TXT |
GSE39505_wgEncodeUw5cPrimers_readme.txt |
882 b |
(ftp)(http) |
TXT |
SRA Run Selector |
Raw data are available in SRA |
Processed data provided as supplementary file |
|
|
|
|
|