Description
This track shows the coverage track for DNAse I Hypersensitivity sites for cell types from
the hg38 DNaseI Hypersensitive Site Master List (125 cell types) from ENCODE/Analysis
Display Conventions and Configuration
Coverage tracks represent the basepair coverage from lifting of hg38 coordinates to the Repeat Browser consensus. Peaks indicate that this region is hypersensitive at most instances of this repeat type across the genome.
Use in combination with the corresponding meta-summit and instance mapping tracks.
Methods
wget http://hgdownload.soe.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeAwgDnaseMasterSites/wgEncodeAwgDnaseMasterSites.bed.gz
gunzip *.gz
wget http://hgdownload.soe.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeAwgDnaseMasterSites/wgEncodeAwgDnaseMasterSources.tab
awk -v OFS="\t" '{print $1,$2,$3,$8,$5}' wgEncodeAwgDnaseMasterSites.bed > encode_DNAse_5.bed
liftOver -multiple encode_DNAse_5.bed ../lift/hg19_to_hg38reps.over.chain encode_DNAse_5_hg38reps.bed encode_DNAse_5_hg38reps.unmapped
awk -v OFS="\t" '{split($4, a, ","); for(i in a) { if (a[i] ~/^[0-9]+$/ ){print $1,$2,$3,a[i],$5,"+"}}}' encode_DNAse_5_hg38reps.bed | sort -k 4b,4 > encode_DNAse_6_hg38reps.bed
sort -k1b,1 wgEncodeAwgDnaseMasterSources.tab > DNAse_key.tab
join -1 1 -2 4 DNAse_key.tab encode_DNAse_6_hg38reps.bed | awk -v OFS="\t" '{print $3,$4,$5,$2,$6,"+"}' > encode_DNAse_6_hg38reps_named.bed
mv encode_DNAse_6_hg38reps_named.bed ~/hive/jferna10/RepeatBrowserHub/hg38reps/wgEncodeAwgDnaseMasterSites
awk -v OFS="\t" '{print $1,$2,$3,$4,$5,$6 >> "encode_DNAse_6_" $4 ".bed"}' encode_DNAse_6_hg38reps_named.bed
ls *.bed | cut -f 1 -d"."| while read i; do bedSort $i.bed $i.bed ; bedtools genomecov -bg -split -i $i.bed -g ../hg38reps.sizes > temp.bg; bedGraphToBigWig temp.bg ../hg38reps.sizes $i.bw; done
ls *.bed | cut -f 1 -d"."| while read i; do bedToBigBed $i.bed ../hg38reps.sizes $i.bb -type=bed6; done
References
DNaseI Hypersensitive Site Master List (125 cell types) from ENCODE/Analysis
Thurman RE, Rynes E, Humbert R, Vierstra J, Maurano MT, Haugen E, Sheffield NC, Stergachis AB, Wang
H, Vernot B et al.
The accessible chromatin landscape of the human genome.
Nature. 2012 Sep 6;489(7414):75-82.
PMID: 22955617; PMC: PMC3721348
John S, Sabo PJ, Thurman RE, Sung MH, Biddie SC, Johnson TA, Hager GL, Stamatoyannopoulos JA.
Chromatin accessibility pre-determines glucocorticoid receptor binding patterns.
Nat Genet. 2011 Mar;43(3):264-8.
PMID: 21258342
See also the references and credit sections in the related
ENCODE Uniform DnaseI HS,
ENCODE UW DnaseI HS
and
ENCODE Duke DnaseI HS
tracks.
Email max@soe.ucsc.edu or jferna10@ucsc.edu
|