Crowdsourcing Without a Crowd: Reliable Online Species Identification Using Bayesian Models to Minimize Crowd Size

Siddharthan Advaith; Lambin Christopher; Robinson Anne-Marie; Sharma Nirwan; Comont Richard; OMahony Elaine; Mellish Chris; Van der Wal Rene

首页> 外文期刊>ACM transactions on intelligent systems >Crowdsourcing Without a Crowd: Reliable Online Species Identification Using Bayesian Models to Minimize Crowd Size

【24h】

Crowdsourcing Without a Crowd: Reliable Online Species Identification Using Bayesian Models to Minimize Crowd Size

机译：没有人群的众包：使用贝叶斯模型进行可靠的在线物种识别以最小化人群规模

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present an incremental Bayesian model that resolves key issues of crowd size and data quality for consensus labeling. We evaluate our method using data collected from a real-world citizen science program, BEEWATCH, which invites members of the public in the United Kingdom to classify (label) photographs of bumblebees as one of 22 possible species. The biological recording domain poses two key and hitherto unaddressed challenges for consensus models of crowdsourcing: (1) the large number of potential species makes classification difficult, and (2) this is compounded by limited crowd availability, stemming from both the inherent difficulty of the task and the lack of relevant skills among the general public. We demonstrate that consensus labels can be reliably found in such circumstances with very small crowd sizes of around three to five users (i.e., through group sourcing). Our incremental Bayesian model, which minimizes crowd size by re-evaluating the quality of the consensus label following each species identification solicited from the crowd, is competitive with a Bayesian approach that uses a larger but fixed crowd size and outperforms majority voting. These results have important ecological applicability: biological recording programs such as BEEWATCH can sustain themselves when resources such as taxonomic experts to confirm identifications by photo submitters are scarce (as is typically the case), and feedback can be provided to submitters in a timely fashion. More generally, our model provides benefits to any crowdsourced consensus labeling task where there is a cost (financial or otherwise) associated with soliciting a label.

机译：我们提出了一种增量贝叶斯模型，该模型解决了共识标签的人群规模和数据质量的关键问题。我们使用从现实世界公民科学计划BEEWATCH收集的数据评估我们的方法，该计划邀请英国公众将大黄蜂的照片分类（标记）为22种可能的物种之一。生物记录领域对众包共识模型提出了两个关键的，迄今尚未解决的挑战：（1）大量潜在物种使分类变得困难，（2）人群可利用性有限，这是由于两者固有的困难所致。任务和普通民众缺乏相关技能。我们证明，在这种情况下，只有三到五个用户的很小规模的人群（即通过小组采购）可以可靠地找到共识标签。我们的增量贝叶斯模型通过在人群中寻求每个物种识别后通过重新评估共识标签的质量来最小化人群规模，与使用较大但固定的人群规模并且胜过多数投票的贝叶斯方法具有竞争性。这些结果具有重要的生态适用性：当诸如分类专家等资源不足以确认照片提交者确认身份的生物记录程序（如通常）时，BEEWATCH等生物记录程序就可以维持自身生存，并且可以及时向提交者提供反馈。更笼统地说，我们的模型可以为任何众包共识标签任务带来好处，因为在这种情况下，征集标签会产生成本（财务或其他方面的费用）。

著录项

来源
《ACM transactions on intelligent systems》 |2016年第4期|45.1-45.20|共20页
作者
Siddharthan Advaith; Lambin Christopher; Robinson Anne-Marie; Sharma Nirwan; Comont Richard; OMahony Elaine; Mellish Chris; Van der Wal Rene;
展开▼
作者单位

Univ Aberdeen, Comp Sci, Aberdeen AB24 3UE, Scotland;

Univ Aberdeen, Aberdeen Ctr Environm Sustainabil, Aberdeen AB24 3UU, Scotland;

Univ Aberdeen, Aberdeen Ctr Environm Sustainabil, Aberdeen AB24 3UU, Scotland;

Univ Aberdeen, Comp Sci, Aberdeen AB24 3UE, Scotland;

Univ Stirling, Bumblebee Conservat Trust, Cottrell Bldg, Stirling FK9 4LA, Scotland;

Univ Stirling, Bumblebee Conservat Trust, Cottrell Bldg, Stirling FK9 4LA, Scotland;

Univ Aberdeen, Comp Sci, Aberdeen AB24 3UE, Scotland;

Univ Aberdeen, Aberdeen Ctr Environm Sustainabil, Aberdeen AB24 3UU, Scotland;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Algorithms; Performance; Human Factors; Crowdsourcing; citizen science; consensus model; Bayesian reasoning; bumblebee identification; biological recording;

机译：算法;性能;人为因素;众包;公民科学;共识模型;贝叶斯推理;大黄蜂识别;生物记录;

相似文献

外文文献
中文文献
专利

1. Wisdom of the Crowds: Crowd-Based Development of a Logo for a Conference Using a Crowdsourcing Contest [J] . Ong Jason J., Bilardi Jade E., Tucker Joseph D. Sexually transmitted diseases . 2017,第10期

机译：人群的智慧：基于人群的开发，用于会议使用众群竞赛的徽标
2. How to work a crowd: Developing crowd capital through crowdsourcing [J] . Prpic John, Shukla Prashant P., Kietzmann Jan H., Business Horizons. . 2015,第1期

机译：如何工作人群：通过众包开发众筹
3. The Paradox of Interaction: Communication Network Centralization, Shared Task Experience, and the Wisdom of Crowds in Online Crowdsourcing Communities [J] . Bei Yan, Lian Jian, Ruqin Ren, Communication research . 2021,第6期

机译：互动的悖论：通信网络集中，共享任务经验以及在线众包社区中的人群智慧
4. Crowd Work with or without Crowdsourcing Platforms [C] . Xin Yan, Xianghua Ding, Ning Gu IEEE International Conference on Computer Supported Cooperative Work in Design . 2016

机译：人群在有或没有众包平台工作
5. Using Bayesian Cognitive Models in Wisdom of the Crowd Applications [D] . Danileiko, Irina. 2018

机译：在人群应用中使用贝叶斯认知模型
6. Wisdom of the crowds: Crowd-based development of a logo for a conference using a crowdsourcing contest [O] . Jason J. Ong, Jade E. Bilardi, Joseph D. Tucker -1

机译：人群的智慧：使用众包竞赛以人群为基础开发会议徽标
7. Crowdsourcing without a crowd : Reliable online species identification using Bayesian models to minimize crowd size [O] . Siddharthan, Advaith, Lambin, Christopher, Robinson, Anne-Marie, 2016

机译：没有人群的众包：使用贝叶斯模型进行可靠的在线物种识别，以最小化人群

Crowdsourcing Without a Crowd: Reliable Online Species Identification Using Bayesian Models to Minimize Crowd Size

摘要

著录项

相似文献

相关主题

期刊订阅