Automatically determining cause of death from verbal autopsy narratives

Serena Jeblee; Mireille Gomes; Prabhat Jha; Frank Rudzicz; Graeme Hirst

Abstract

A verbal autopsy (VA) is a post-hoc written interview report of the symptoms preceding a person’s death in cases where no official cause of death (CoD) was determined by a physician. Current leading automated VA coding methods primarily use structured data from VAs to assign a CoD category. We present a method to automatically determine CoD categories from VA free-text narratives alone. After preprocessing and spelling correction, our method extracts word frequency counts from the narratives and uses them as input to four different machine learning classifiers: naïve Bayes, random forest, support vector machines, and a neural network. For individual CoD classification, our best classifier achieves a sensitivity of 0.770 for adult deaths for 15 CoD categories (as compared to the current best reported sensitivity of 0.57), and 0.662 with 48 WHO categories. When predicting the CoD distribution at the population level, our best classifier achieves 0.962 cause-specific mortality fraction accuracy for 15 categories and 0.908 for 48 categories, which is on par with leading CoD distribution estimation methods. Our narrative-based machine learning classifier performs as well as classifiers based on structured data at the individual level. Moreover, our method demonstrates that VA narratives provide important information that can be used by a machine learning system for automated CoD classification. Unlike the structured questionnaire-based methods, this method can be applied to any verbal autopsy dataset, regardless of the collection process or country of origin.
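The pipeline summarized in the abstract (word frequency counts fed to a classifier, then evaluated at the population level) can be illustrated with a minimal sketch. This is not the authors' implementation: the tokenizer, the add-one smoothing, the toy narratives, and all function and class names are assumptions made for illustration. It uses only a naïve Bayes classifier, one of the four model families named above, and it scores the predicted cause distribution with the commonly used definition of cause-specific mortality fraction (CSMF) accuracy, 1 - Σ|true_j - pred_j| / (2(1 - min_j true_j)), which is assumed here to match the metric reported in the abstract.

```python
import math
import re
from collections import Counter, defaultdict


def word_counts(narrative):
    """Lowercase the text and count alphabetic tokens (a simple bag of words)."""
    return Counter(re.findall(r"[a-z]+", narrative.lower()))


class NaiveBayes:
    """Multinomial naive Bayes over word counts with add-one smoothing (illustrative)."""

    def fit(self, narratives, labels):
        self.class_docs = Counter(labels)           # narratives per cause
        self.word_totals = defaultdict(Counter)     # word counts per cause
        for text, label in zip(narratives, labels):
            self.word_totals[label].update(word_counts(text))
        self.vocab = {w for c in self.word_totals.values() for w in c}
        return self

    def predict(self, narrative):
        counts = word_counts(narrative)
        n_docs = sum(self.class_docs.values())
        best_label, best_score = None, -math.inf
        for label, doc_count in self.class_docs.items():
            total = sum(self.word_totals[label].values()) + len(self.vocab)
            score = math.log(doc_count / n_docs)    # log prior
            for word, n in counts.items():
                if word in self.vocab:              # unseen words are ignored
                    p = (self.word_totals[label][word] + 1) / total
                    score += n * math.log(p)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def csmf_accuracy(true_labels, predicted_labels):
    """CSMF accuracy: 1 - sum|true_j - pred_j| / (2 * (1 - min_j true_j))."""
    n = len(true_labels)
    true_counts = Counter(true_labels)
    pred_counts = Counter(predicted_labels)
    causes = set(true_counts) | set(pred_counts)
    true_frac = {c: true_counts[c] / n for c in causes}
    pred_frac = {c: pred_counts[c] / n for c in causes}
    min_true = min(true_frac.values())
    if min_true == 1.0:
        raise ValueError("CSMF accuracy needs at least two causes")
    error = sum(abs(true_frac[c] - pred_frac[c]) for c in causes)
    return 1.0 - error / (2.0 * (1.0 - min_true))


# Toy narratives, invented for illustration only.
train_texts = [
    "fever and cough for two weeks with chest pain",
    "cough and breathlessness fever",
    "hit by a car on the road head injury",
    "fell from a roof and had a head injury",
]
train_labels = ["respiratory", "respiratory", "injury", "injury"]
model = NaiveBayes().fit(train_texts, train_labels)
```

Calling `model.predict("high fever with cough")` on this toy data returns `"respiratory"`, and `csmf_accuracy(["A", "A", "B", "B"], ["A", "A", "A", "B"])` evaluates to 0.5. Individual-level sensitivity and population-level CSMF accuracy measure different things: a classifier can misassign individual deaths and still recover the cause distribution well if its errors cancel out.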


Bibliographic details

Source
BMC Medical Informatics and Decision Making | 2019, Issue 1 | 13 pages
Authors
Serena Jeblee; Mireille Gomes; Prabhat Jha; Frank Rudzicz; Graeme Hirst
Original format
PDF
Chinese Library Classification
Medicine and health
Keywords
Cause of death; Computer-coded verbal autopsy (CCVA); Physician-certified verbal autopsy (PCVA); Machine learning; Natural language processing; Tariff method; Verbal autopsy


Similar literature


1. Comparison of machine learning algorithms applied to symptoms to determine infectious causes of death in children: national survey of 18,000 verbal autopsies in the Million Death Study in India [J]. Idicula-Thomas Susan, Gawde Ulka, Jha Prabhat. BMC Public Health. 2021, Issue 1.
2. Applying a Public Health Ethics Framework to Consider Scaled-Up Verbal Autopsy and Verbal Autopsy with Immediate Disclosure of Cause of Death in Rural Nepal [J]. Joanna Morrison, Edward Fottrell, Bharat Budhatokhi. Public Health Ethics. 2018, Issue 3.
3. Comparison of physician-certified verbal autopsy with computer-coded verbal autopsy for cause of death assignment in hospitalized patients in low- and middle-income countries: systematic review [J]. Jordana Leitao, Nikita Desai, Lukasz Aleksandrowicz. BMC Medicine. 2014, Issue 1.
4. Can Character Embeddings Improve Cause-of-Death Classification for Verbal Autopsy Narratives? [C]. Zhaodong Yan, Serena Jeblee, Graeme Hirst. SIGBioMed workshop on biomedical natural language processing; Annual meeting of the Association for Computational Linguistics. 2019.
5. Sex and Age at Death Estimation from the Os Pubis: Validation of Two Methods on a Modern Autopsy Sample [D]. Curtis, Ashley E. 2017.
6. Automatically determining cause of death from verbal autopsy narratives [O]. Serena Jeblee, Mireille Gomes, Prabhat Jha. 2019.
7. A semantically annotated verbal autopsy corpus for automatic analysis of cause of death [O]. Danso S, Atwell ES, Johnson O. 2013.
