BIG/UHN Data Description
BIG/UHN represents the integrated CpG Island (CGI) clone sequences generated by Beijing Institute of Genomics (BIG), Chinese Academy of Sciences and downloaded from UHN Human CpG Island Microarray Database (UHN) (Heisler et al., 2005; Schumacher et al., 2006).
In order to study the global CGI distribution in human genome and alteration of DNA methylation patterns in promoter CGIs, BIG has generated approximately 20,000 CGI clone sequences by large-scale sequencing of the CGI library from the Sanger Institute (Cross et al., 1994). The number continues to increase with development of the new sequencing technologies. In addition, 17,132 CGI clone sequences downloaded from UHN Human CpG Island Microarray Database were also integrated. Taken together, currently a total of 34,738 CGI clones have been included in MethyCancer database. By mapping the clones onto human genome (NCBI Build36) using BLAT and BLAST, all clones were clustered into 18,240 genomic loci.