Skip to main content

Genomic sweeping for hypermethylated genes.

Publication ,  Journal Article
Goh, L; Murphy, SK; Muhkerjee, S; Furey, TS
Published in: Bioinformatics
February 1, 2007

MOTIVATION: Genes silenced by the aberrent methylation of nearby CpG islands can contribute to the onset or progression of cancer and represent potential biomarkers for diagnosis and prognosis. Relatively few have thus far been validated as hypermethylated in cancer among over 14,000 candidates with promoter region CpG islands. A descriptive set of genes known to be unmethylated in cancer does not exist. This lack of a negative set and a large number of candidates necessitated the development of a new approach to identify novel genes hypermethylated in cancer. RESULTS: We developed a general method, cluster_boost, that in an imbalanced data setting predicts new minority class members given limited known samples and a large set of unlabeled samples. Synthetic datasets modeled after the hypermethylated genes data show that cluster_boost can successfully identify minority samples within unlabeled data. Using genome sequence features, cluster_boost predicted candidate hypermethylated genes among 14,000 genes of unknown status. In primary ovarian cancers, we determined the methylation status for 15 genes with different levels of support for being hypermethlyated. Results indicate cluster_boost can accurately identify novel genes hypermethylated in cancer. AVAILABILITY: Software and datasets are freely available at http://labs.genome.duke.edu/FureyLab/cluster_boost.php. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Duke Scholars

Published In

Bioinformatics

DOI

EISSN

1367-4811

Publication Date

February 1, 2007

Volume

23

Issue

3

Start / End Page

281 / 288

Location

England

Related Subject Headings

  • Sequence Analysis, DNA
  • Ovarian Neoplasms
  • Molecular Sequence Data
  • Humans
  • Genetic Testing
  • Female
  • DNA, Neoplasm
  • DNA Methylation
  • CpG Islands
  • Chromosome Mapping
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Goh, L., Murphy, S. K., Muhkerjee, S., & Furey, T. S. (2007). Genomic sweeping for hypermethylated genes. Bioinformatics, 23(3), 281–288. https://doi.org/10.1093/bioinformatics/btl620
Goh, Liang, Susan K. Murphy, Sayan Muhkerjee, and Terrence S. Furey. “Genomic sweeping for hypermethylated genes.Bioinformatics 23, no. 3 (February 1, 2007): 281–88. https://doi.org/10.1093/bioinformatics/btl620.
Goh L, Murphy SK, Muhkerjee S, Furey TS. Genomic sweeping for hypermethylated genes. Bioinformatics. 2007 Feb 1;23(3):281–8.
Goh, Liang, et al. “Genomic sweeping for hypermethylated genes.Bioinformatics, vol. 23, no. 3, Feb. 2007, pp. 281–88. Pubmed, doi:10.1093/bioinformatics/btl620.
Goh L, Murphy SK, Muhkerjee S, Furey TS. Genomic sweeping for hypermethylated genes. Bioinformatics. 2007 Feb 1;23(3):281–288.

Published In

Bioinformatics

DOI

EISSN

1367-4811

Publication Date

February 1, 2007

Volume

23

Issue

3

Start / End Page

281 / 288

Location

England

Related Subject Headings

  • Sequence Analysis, DNA
  • Ovarian Neoplasms
  • Molecular Sequence Data
  • Humans
  • Genetic Testing
  • Female
  • DNA, Neoplasm
  • DNA Methylation
  • CpG Islands
  • Chromosome Mapping