
Identification of key concepts in biomedical literature using a modified Markov heuristic

Publication, Journal Article
Majoros, WH; Subramanian, GM; Yandell, MD
Published in: Bioinformatics
February 12, 2003

Motivation: The recent explosion of interest in mining the biomedical literature for associations between defined entities such as genes, diseases and drugs has made apparent the need for robust methods of identifying occurrences of these entities in biomedical text. Such concept-based indexing is strongly dependent on the availability of a comprehensive ontology or lexicon of biomedical terms. However, such ontologies are very difficult and expensive to construct, and often require extensive manual curation to render them suitable for use by automatic indexing programs. Furthermore, the use of statistically salient noun phrases as surrogates for curated terminology is not without difficulties, due to the lack of high-quality part-of-speech taggers specific to medical nomenclature.

Results: We describe a method of improving the quality of automatically extracted noun phrases by employing prior knowledge during the HMM training procedure for the tagger. This enhancement, when combined with appropriate training data, can greatly improve the quality and relevance of the extracted phrases, thereby enabling greater accuracy in downstream literature mining tasks.

Contact: bmajoros@tigr.org

Published In

Bioinformatics

DOI

10.1093/bioinformatics/btg010

EISSN

1367-4811

ISSN

1367-4803

Publication Date

February 12, 2003

Volume

19

Issue

3

Start / End Page

402 / 407

Publisher

Oxford University Press (OUP)

Related Subject Headings

  • Bioinformatics
  • 08 Information and Computing Sciences
  • 06 Biological Sciences
  • 01 Mathematical Sciences
 

Citation

APA: Majoros, W. H., Subramanian, G. M., & Yandell, M. D. (2003). Identification of key concepts in biomedical literature using a modified Markov heuristic. Bioinformatics, 19(3), 402–407. https://doi.org/10.1093/bioinformatics/btg010
Chicago: Majoros, W. H., G. M. Subramanian, and M. D. Yandell. “Identification of key concepts in biomedical literature using a modified Markov heuristic.” Bioinformatics 19, no. 3 (February 12, 2003): 402–7. https://doi.org/10.1093/bioinformatics/btg010.
ICMJE: Majoros WH, Subramanian GM, Yandell MD. Identification of key concepts in biomedical literature using a modified Markov heuristic. Bioinformatics. 2003 Feb 12;19(3):402–7.
MLA: Majoros, W. H., et al. “Identification of key concepts in biomedical literature using a modified Markov heuristic.” Bioinformatics, vol. 19, no. 3, Oxford University Press (OUP), Feb. 2003, pp. 402–07. Crossref, doi:10.1093/bioinformatics/btg010.
NLM: Majoros WH, Subramanian GM, Yandell MD. Identification of key concepts in biomedical literature using a modified Markov heuristic. Bioinformatics. Oxford University Press (OUP); 2003 Feb 12;19(3):402–407.
