Automatically classifying sentences in full-text biomedical articles into Introduction, Methods, Results and Discussion

Agarwal Shashank; Yu Hong

首页> 外文期刊>Bioinformatics >Automatically classifying sentences in full-text biomedical articles into Introduction, Methods, Results and Discussion

【24h】

Automatically classifying sentences in full-text biomedical articles into Introduction, Methods, Results and Discussion

机译：将全文生物医学文章中的句子自动分类为简介，方法，结果和讨论

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Biomedical texts can be typically represented by four rhetorical categories: Introduction, Methods, Results and Discussion (IMRAD). Classifying sentences into these categories can benefit many other text-mining tasks. Although many studies have applied different approaches for automatically classifying sentences in MEDLINE abstracts into the IMRAD categories, few have explored the classification of sentences that appear in full-text biomedical articles. We first evaluated whether sentences in full-text biomedical articles could be reliably annotated into the IMRAD format and then explored different approaches for automatically classifying these sentences into the IMRAD categories. Our results show an overall annotation agreement of 82.14% with a Kappa score of 0.756. The best classification system is a multinomial naive Bayes classifier trained on manually annotated data that achieved 91.95% accuracy and an average F-score of 91.55%, which is significantly higher than baseline systems. A web version of this system is available online at-http://wood.ims.uwm.edu/full_text_classifier/.

机译：生物医学文本通常可以用四个修辞学类别表示：简介，方法，结果和讨论（IMRAD）。将句子分类为这些类别可以使许多其他文本挖掘任务受益。尽管许多研究采用了不同的方法来将MEDLINE摘要中的句子自动分类为IMRAD类别，但是很少有人探索全文生物医学文章中出现的句子分类。我们首先评估全文生物医学文章中的句子是否可以可靠地注释为IMRAD格式，然后探索了将这些句子自动分类为IMRAD类别的不同方法。我们的结果显示总体注释一致性为82.14％，Kappa分数为0.756。最好的分类系统是经过人工注释的数据训练的多项式朴素贝叶斯分类器，其准确度达到91.95％，平均F评分为91.55％，这明显高于基线系统。该系统的Web版本可从以下网址在线获得：http：//wood.ims.uwm.edu/full_text_classifier/。

著录项

来源
《Bioinformatics》 |2009年第23期|共7页
作者
Agarwal Shashank; Yu Hong;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类生物工程学（生物技术）;
关键词
MEDLINE database; text-mining task; sentence classification; IMRAD category;

机译：MEDLINE数据库;文本挖掘任务;句子分类;IMRAD分类;
入库时间 2022-08-18 11:10:53

相似文献

外文文献
中文文献
专利

1. Automatically classifying sentences in full-text biomedical articles into Introduction, Methods, Results and Discussion [J] . Agarwal Shashank, Yu Hong Bioinformatics . 2009,第23期

机译：将全文生物医学文章中的句子自动分类为简介，方法，结果和讨论
2. Free Model of Sentence Classifier for Automatic Extraction of Topic Sentences [J] . M.L. Khodra, D.H. Widyantoro, E.A. Aziz, Journal of ICT Research and Applications . 2011,第1期

机译：用于自动提取主题句的句子分类器免费模型
3. Free Model of Sentence Classifier for Automatic Extraction of Topic Sentences [J] . M.L. Khodra, D.H. Widyantoro, E.A. Aziz, ITB Journal of Information and Communication Technology . 2011,第1期

机译：用于自动提取主题句的句子分类器免费模型
4. Naive Bayes and SVM classifiers for classifying Databank Accession Number sentences from online biomedical articles [C] . Jongwoo Kim, rnDaniel X. Le, rnGeorge R. Thoma Document recognition and retrieval XVII . 2010

机译：朴素贝叶斯和SVM分类器用于对在线生物医学文章中的数据库登录号句子进行分类
5. Exploration of Classifying Sentence Bias in News Articles with Machine Learning Models [D] . Bellows, Martha R. 2018

机译：机器学习模型新闻文章中对句子偏见的探索
6. Automatically Classifying Sentences in Full-Text Biomedical Articles into Introduction Methods Results and Discussion [O] . Shashank Agarwal, Hong Yu 2009

机译：将全文生物医学文章中的句子自动分类为简介方法结果和讨论
7. Automatically Classifying Sentences in Full-Text Biomedical Articles into Introduction, Methods, Results and Discussion [O] . Agarwal, Shashank, Yu, Hong 2009

机译：将全文生物医学文章中的句子自动分类为简介，方法，结果和讨论

Automatically classifying sentences in full-text biomedical articles into Introduction, Methods, Results and Discussion

摘要

著录项

相似文献

相关主题

期刊订阅