

Text Categorization in the Database of Genotypes and Phenotypes

Posted Tue, Jul 23, 2013

Published today in Biomedical Informatics Insights is a new original research article by Mindy K. Ross, Ko-Wei Lin, Karen Truong, Abhishek Kumar and Mike Conway.  Read more about this paper below:

Title

Text Categorization of Heart, Lung, and Blood Studies in the Database of Genotypes and Phenotypes (dbGaP) Utilizing n-grams and Metadata Features

Abstract

The database of Genotypes and Phenotypes (dbGaP) allows researchers to understand phenotypic contribution to genetic conditions, generate new hypotheses, confirm previous study results, and identify control populations. However, effective use of the database is hindered by suboptimal study retrieval. Our objective is to evaluate text classification techniques to improve study retrieval in the context of the dbGaP database. We utilized standard machine learning algorithms (naive Bayes, support vector machines, and the C4.5 decision tree) trained on dbGaP study text and incorporated n-gram features and study metadata to identify heart, lung, and blood studies. We used the χ² feature selection algorithm to identify features that contributed most to classification performance and experimented with dbGaP-associated PubMed papers as a proxy for topicality. Classifier performance was favorable in comparison to keyword-based search results. It was determined that text categorization is a useful complement to document retrieval techniques in the dbGaP.
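
For readers unfamiliar with the techniques named in the abstract, here is a minimal Python sketch of word n-gram extraction followed by χ² feature scoring against a binary label. It is an illustration only, not the authors' pipeline: the function names `ngrams` and `chi2_scores`, the whitespace tokenization, and the four toy study descriptions are all assumptions made for this example, and no classifier is trained.

```python
from collections import Counter


def ngrams(text, n_max=2):
    """Return the set of lowercased word n-grams (n = 1..n_max) in a text."""
    tokens = text.lower().split()  # naive whitespace tokenization (assumption)
    feats = set()
    for n in range(1, n_max + 1):
        for i in range(len(tokens) - n + 1):
            feats.add(" ".join(tokens[i:i + n]))
    return feats


def chi2_scores(docs, labels, n_max=2):
    """Chi-square statistic of each n-gram's presence vs. a binary label."""
    total = len(docs)
    pos = sum(labels)
    neg = total - pos
    with_any = Counter()  # documents containing the feature
    with_pos = Counter()  # positive documents containing the feature
    for text, label in zip(docs, labels):
        for feat in ngrams(text, n_max):
            with_any[feat] += 1
            if label:
                with_pos[feat] += 1
    scores = {}
    for feat, n_feat in with_any.items():
        a = with_pos[feat]  # present, positive
        b = n_feat - a      # present, negative
        c = pos - a         # absent, positive
        d = neg - b         # absent, negative
        denom = (a + b) * (c + d) * (a + c) * (b + d)
        scores[feat] = total * (a * d - b * c) ** 2 / denom if denom else 0.0
    return scores


# Toy data, invented for illustration; 1 = heart, lung, or blood study.
docs = [
    "asthma lung function cohort",
    "heart failure cohort",
    "breast cancer cohort",
    "colon cancer genetics",
]
labels = [1, 1, 0, 0]

scores = chi2_scores(docs, labels)
top = sorted(scores, key=scores.get, reverse=True)[:3]
print(top)
```

On this toy data the unigram "cancer" receives the highest score because it appears in every negative example and in no positive one. In a real setting the top-scoring features would be passed to a classifier such as naive Bayes or a support vector machine.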

