Diamond Sketch: Accurate Per-flow Measurement for Big Streaming Data

Yang Tong; Gao Siang; Sun Zhouyi; Wang Yufei; Shen Yulong; Li Xiaoming

首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >Diamond Sketch: Accurate Per-flow Measurement for Big Streaming Data

【24h】

Diamond Sketch: Accurate Per-flow Measurement for Big Streaming Data

机译：Diamond Sketch：大流量数据的精确每流测量

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Per-flow measurement is a critical issue in computer networks, and one of its key tasks is to count the number of packets in each flow (for big streaming data). The literature has demonstrated that sketch is the most memory-efficient data structure for the counting task, and is widely used in distributed systems. Existing sketches often use many counters that are of the same size to record the number of packets in a flow, thus the counters are forced to be large enough to accommodate the size of the largest flow. Unfortunately, as most flows are small (i.e., mice flows) and only a very few flows are large (i.e., elephant flows), many counters represent very small values, which is a waste of memory. Sketches are often stored in fast but expensive memory (e.g., SRAM), thus it is critical to achieve high memory efficiency. To address this issue, we propose a novel sketch, namely the Diamond sketch. The Diamond sketch is composed of atom sketches, and each atom sketch uses small counters. The key idea of Diamond is to dynamically assign an appropriate number of atom sketches to each flow on demand, thus optimizing memory efficiency. Experimental results show that the Diamond sketch outperforms the best of the five typical sketches by up to 508.3 times in terms of relative error while keeping comparable speed. We made the source code of all the six sketches available on GitHub [1] .

机译：每流测量是计算机网络中的一个关键问题，它的关键任务之一是计算每个流中的数据包数量（对于大型流数据）。文献已经证明，草图是用于计数任务的内存效率最高的数据结构，并且已广泛用于分布式系统中。现有的草图通常使用许多大小相同的计数器来记录流中的数据包数量，因此迫使计数器足够大以容纳最大流的大小。不幸的是，由于大多数流量很小（即，老鼠流量），而只有很少的流量很大（即，大象流量），所以许多计数器代表的值很小，这浪费了内存。草图通常存储在快速但昂贵的内存中（例如SRAM），因此实现高存储效率至关重要。为了解决这个问题，我们提出了一种新颖的草图，即钻石草图。 Diamond草图由原子草图组成，每个原子草图都使用小计数器。 Diamond的关键思想是根据需要为每个流动态分配适当数量的原子草图，从而优化内存效率。实验结果表明，Diamond草图在保持可比速度的同时，相对误差方面比五个典型草图中的最佳表现高出508.3倍。我们在GitHub [1]上提供了所有六个草图的源代码。

著录项

来源
《IEEE Transactions on Parallel and Distributed Systems》 |2019年第12期|2650-2662|共13页
作者
Yang Tong; Gao Siang; Sun Zhouyi; Wang Yufei; Shen Yulong; Li Xiaoming;
展开▼
作者单位

Peking Univ Sch EECS 5 Yiheyuan Rd Beijing 100871 Peoples R China;

Xidian Univ Sch Comp Sci & Technol Xian 710071 Shaanxi Peoples R China;

Peking Univ Comp Sci & Technol Beijing Peoples R China|Peking Univ Inst Network Comp & Informat Syst NCIS Beijing Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Diamond; Random access memory; Hash functions; Data structures; Mice; Size measurement; Atomic measurements; Sketch; data streams; accuracy; distributed monitoring;

机译：钻石;随机存取存储器;哈希函数;数据结构;老鼠;尺寸测量;原子测量;草图;数据流;准确性;分布式监控;

相似文献

外文文献
中文文献
专利

1. FID-sketch: an accurate sketch to store frequencies in data streams [J] . Yang Tong, Zhang Haowei, Wang Hao, World Wide Web . 2019,第6期

机译：FID草图：将频率存储在数据流中的准确草图
2. DAP-Sketch: An accurate and effective network measurement sketch with Deterministic Admission Policy [J] . Wang Rui, Du Hongchao, Shen Zhaoyan, Computer networks . 2021,第Jula20期

机译：DAP画面：具有确定性入学政策的准确和有效的网络测量草图
3. Data Streaming Algorithms for Accurate and Efficient Measurement of Traffic and Flow Matrices [J] . Qi (George) Zhao, Abhishek Kumar, Jia Wang, Performance evaluation review . 2005,第1期

机译：数据流算法，用于流量和流量矩阵的准确，高效测量
4. Diamond sketch: Accurate per-flow measurement for real IP streams [C] . Tong Yang, Siang Gao, Zhouyi Sun, IEEE Conference on Computer Communications Workshops . 2018

机译：菱形草图：针对真实IP流的准确每流测量
5. Streaming and Sketch Algorithms for Large Data NLP. [D] . Goyal, Amit. 2013

机译：大数据NLP的流和草图算法。
6. Measurement system with real time data converter for conversion of I2S data stream to UDP protocol data [O] . Zoltan Vizvari, Attila Toth, Zoltan Sari, 2020

机译：具有实时数据转换器的测量系统用于将I2S数据流转换为UDP协议数据
7. Sketch?-metric: Comparing Data Streams via Sketching [O] . Emmanuelle Anceaume, Yann Busnel 2015

机译：sketch？-metric：通过草图比较数据流
8. Combining Conditioned Laser Altimeter Data and GPS Altitude Data to Obtain Accurate Aircraft Sensor Height Measurements [R] . Grimmett, T. K. 2003

机译：结合条件激光测高仪数据和Gps高度数据，获得精确的飞机传感器高度测量

Diamond Sketch: Accurate Per-flow Measurement for Big Streaming Data

摘要

著录项

相似文献

相关主题

期刊订阅