
An improved HDFS for small file


Abstract

Hadoop is an open-source distributed computing platform, and HDFS is Hadoop's distributed file system. HDFS offers powerful data storage capacity and is therefore well suited to cloud storage systems. However, HDFS was originally designed for streaming access to large files, and because the NameNode keeps the metadata of every file in memory, it stores massive numbers of small files inefficiently. To solve this problem, the HDFS file storage process is improved: files are examined before being uploaded to the HDFS cluster, and if a file is small it is merged with other small files, with its index information stored in an index file in the form of key-value pairs. Simulation shows that the improved HDFS consumes less NameNode memory than both the original HDFS and Hadoop Archives (HAR files), and can thus improve access efficiency.
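The merge-and-index step can be pictured with a short sketch. The Java code below is only an illustration of the idea described in the abstract, not the paper's implementation: the 4 MB size threshold, the file names merged.dat and merged.idx, and the plain-text key-value index format are all assumptions made here for clarity.

import java.io.BufferedWriter;
import java.io.IOException;
import java.io.OutputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.nio.file.StandardOpenOption;
import java.util.LinkedHashMap;
import java.util.Map;

public class SmallFileMerger {
    // Hypothetical threshold: files below 4 MB are treated as "small".
    private static final long SMALL_FILE_THRESHOLD = 4L * 1024 * 1024;

    public static void main(String[] args) throws IOException {
        Path mergedFile = Paths.get("merged.dat"); // container holding the merged small files
        Path indexFile  = Paths.get("merged.idx"); // key-value index: file name -> offset,length
        Map<String, long[]> index = new LinkedHashMap<>();

        long offset = 0;
        try (OutputStream out = Files.newOutputStream(mergedFile,
                StandardOpenOption.CREATE, StandardOpenOption.TRUNCATE_EXISTING)) {
            for (String arg : args) {
                Path file = Paths.get(arg);
                long len = Files.size(file);
                if (len >= SMALL_FILE_THRESHOLD) {
                    // A large file would be uploaded to HDFS unchanged.
                    System.out.println(file + ": large file, upload directly");
                    continue;
                }
                // Small file: append its bytes to the merged container and
                // record where they start and how long they are.
                Files.copy(file, out);
                index.put(file.getFileName().toString(), new long[] { offset, len });
                offset += len;
            }
        }

        // Persist the index as simple key-value lines: name=offset,length.
        // Reading a small file back then needs one index lookup plus one seek
        // into the merged container, instead of a per-file NameNode entry.
        try (BufferedWriter w = Files.newBufferedWriter(indexFile)) {
            for (Map.Entry<String, long[]> e : index.entrySet()) {
                w.write(e.getKey() + "=" + e.getValue()[0] + "," + e.getValue()[1]);
                w.newLine();
            }
        }
    }
}

Uploading the merged container and its index to the cluster replaces many small HDFS files with a single large one, which is consistent with the NameNode memory saving the abstract reports.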

