A Machine Learning Approach for the Curation of Biomedical Literature

机译：一种用于生物医学文献管理的机器学习方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In the field of the biomedical sciences there exists a vast repository of information located within large quantities of research papers. Very often, researchers need to spend considerable amounts of time reading through entire papers before being able to determine whether or not they should be curated (archived). In this paper, we present an automated text classification system for the classification of biomedical papers. This classification is based on whether there is experimental evidence for the expression of molecular gene products for specified genes within a given paper. The system performs preprocessing and data cleaning, followed by feature extraction from the raw text. It subsequently classifies the paper using the extracted features with a Naive Bayes Classifier. Our approach has made it possible to classify (and curate) biomedical papers automatically, thus potentially saving considerable time and resources. The system proved to be highly accurate, and won honourable mention in the KDD Cup 2002 task 1.

机译：在生物医学领域，存在大量研究论文中的大量信息库。很多时候，研究人员需要花费大量时间阅读整篇论文，然后才能确定是否应该对其进行整理（存档）。在本文中，我们提出了一种用于生物医学论文分类的自动文本分类系统。该分类基于给定论文中是否有针对特定基因的分子基因产物表达的实验证据。该系统执行预处理和数据清理，然后从原始文本中提取特征。随后，它通过Naive Bayes分类器使用提取的特征对论文进行分类。我们的方法使自动分类（和管理）生物医学论文成为可能，从而潜在地节省了大量时间和资源。该系统被证明是高度准确的，并在2002年KDD Cup任务1中获得了荣誉奖。

著录项

来源
《Advances in Information Retrieval》|2003年|p.597-604|共8页
会议地点
作者
Min Shi; David S. Edwin; Rakesh Menon; Lixiang Shen; Jonathan Y.K. Lim; Han Tong Loh; S. Sathiya Keerthi; Chong Jin Ong;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Comparing a knowledge-driven approach to a supervised machine learning approach in large-scale extraction of drug-side effect relationships from free-text biomedical literature [J] . Rong Xu, QuanQiu Wang BMC Bioinformatics . 2015,第SUPPLEMENTa5期

机译：从大规模的自由文本生物医学文献中比较知识驱动的方法与有监督的机器学习方法以大规模提取药物副作用的关系
2. Impact of Automatic Query Generation and Quality Recognition Using Deep Learning to Curate Evidence From Biomedical Literature: Empirical Study [J] . Muhammad Afzal, Maqbool Hussain, Khalid Mahmood Malik, JMIR Medical Informatics . 2019,第4期

机译：自动查询生成和质量识别的影响利用深度学习策划生物医学文学的证据：实证研究
3. An approach to self-triage of routine skin conditions using machine learning and curated medical knowledge [J] . Papier A. The Journal of investigative dermatology. . 2019,第5Suppla1期

机译：一种使用机器学习和策划医学知识进行常规皮肤状况的自动分类方法
4. A Machine Learning Approach for the Curation of Biomedical Literature [C] . Min Shi, David S. Edwin, Rakesh Menon, European Conference on Information Retrieval Research . 2003

机译：生物医学文献策择机器学习方法
5. Exploring machine learning and text mining in information extraction using gene expression profiles and biomedical literature. [D] . Ghaffari, Noushin. 2006

机译：使用基因表达谱和生物医学文献探索信息提取中的机器学习和文本挖掘。
6. Comparing a knowledge-driven approach to a supervised machine learning approach in large-scale extraction of drug-side effect relationships from free-text biomedical literature [O] . Rong Xu, QuanQiu Wang 2015

机译：从大规模文本医学生物医学文献中大规模提取药物副作用关系时将知识驱动方法与有监督的机器学习方法进行比较
7. A machine learning approach for the curation of biomedical literature [O] . S. Sathiya Keerthi, Chong Jin Ong, Keng Boon Siah, 2002

机译：用于生物医学文献管理的机器学习方法

A Machine Learning Approach for the Curation of Biomedical Literature

摘要

著录项

相似文献

相关主题

期刊订阅