首页> 外文会议>IEEE International Congress on Big Data >CouchFS: A High-Performance File System for Large Data Sets
【24h】

CouchFS: A High-Performance File System for Large Data Sets

机译:CouchFS:用于大型数据集的高性能文件系统

获取原文

摘要

Numerous file systems have been implemented to meet the needs in today's big data era, however many of them require specific configurations or frameworks for data processing. This paper presents CouchFS, a POSIX-compliant distributed file system for large data sets. We build CouchFS on top of CouchDB, which grants us flexibility to handle semistructured data. Since a database has similar behaviors as a file system, and CouchDB provides a high customizable MapReduce view for indexing, CouchFS is able to achieve high-performance searching for both text and supported binary objects. This work compares search of Wikipedia data using CouchDB, PostgreSQL and Spotlight on HFS+ file system. We show our design of CouchFS and discuss future approaches to improve this file system.
机译:已经实现了许多文件系统来满足当今大数据时代的需求,但是其中许多文件系统需要用于数据处理的特定配置或框架。本文介绍了CouchFS,这是一种适用于大型数据集的POSIX兼容的分布式文件系统。我们在CouchDB之上构建CouchFS,这使我们能够灵活地处理半结构化数据。由于数据库具有与文件系统类似的行为,并且CouchDB提供了高度可自定义的MapReduce视图用于建立索引,因此CouchFS能够对文本和受支持的二进制对象进行高性能搜索。这项工作比较了在HFS +文件系统上使用CouchDB,PostgreSQL和Spotlight对Wikipedia数据的搜索。我们展示了CouchFS的设计,并讨论了改进此文件系统的未来方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号