Journal: Malaysian Journal of Computer Science

A Framework for Massive Twitter Data Extraction and Analysis

Abstract

Social networks emerged as communication and socialization tools. The vast amount of data these networks generate has led to a growing need for automatic knowledge extraction. The popularity of these services makes them ideal for trend discovery. In particular, Twitter offers an open environment where people all around the world share information and opinions, and it has become a real-time repository of knowledge that researchers and applications can exploit. We propose an open framework to automatically collect and analyze data from Twitter’s public streams. The framework is customizable and extensible, so researchers can use it to test new techniques. It is complemented by a language-agnostic sentiment analysis module, which provides a set of tools for performing sentiment analysis on the collected tweets. The capabilities of the platform are illustrated with two case studies in Spanish: one related to a high-impact event (the Boston terror attack) and another related to regular political activity on Twitter.
