qcAffin: A Hardware Topology Aware Interrupt Affinitizing and Balancing Scheme for Multi-Core and Multi-Queue Packet Processing Systems

Nen-Fu Huang; Wen-Yen Tsai

首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >qcAffin: A Hardware Topology Aware Interrupt Affinitizing and Balancing Scheme for Multi-Core and Multi-Queue Packet Processing Systems

【24h】

qcAffin: A Hardware Topology Aware Interrupt Affinitizing and Balancing Scheme for Multi-Core and Multi-Queue Packet Processing Systems

机译：qcAffin：用于多核和多队列数据包处理系统的硬件拓扑感知中断仿制和平衡方案

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Interrupt affinitization of multi-queue network interface cards is a fundamental composition that defines how packets from individual queue are processed by which CPU-cores on multi-core platforms. In this paper, we propose to attain an optimal queue-to-core affinitization for packet processing systems based on a numerical cost model derived from hardware topology and runtime system workloads. Static architectural characteristics comprising the memory hierarchy and topology of hardware components are first analyzed to calculate static interrupt affinitization costs. Then we attempt dynamic interrupt affinitization to balance workloads on CPU-cores and improve overall performance. Classical networking applications ranging from bridging, routing, access control list (ACL) matching to deep packet inspection (DPI) with different frame sizes are extensively experimented to compare the performance of the proposed scheme and other existing approaches. As demonstrated in the comparison result, achieves the similar performance of the best affinitization approach and outperforms the Linux default affinitizer by averages of 102, 278, 248 and 131 percent on 1G NICs for the four applications. On 10G NICs, dramatic boosts of 1,424 and 1,343 percent are measured for the bridging and routing applications, respectively. Moreover, the effectiveness of dynamic interrupt balancing is justified by a maximum of 150 percent higher system utilization and 1.2 Mpps more throughput compared to the fixed affinitization approach in a simulated setup of unbalanced traffic load.

机译：多队列网络接口卡的中断亲缘关系是一个基本组成，它定义了多核平台上的CPU内核如何处理来自单个队列的数据包。在本文中，我们建议基于从硬件拓扑和运行时系统工作负载得出的数值成本模型，为数据包处理系统获得最佳的队列到核心亲缘关系。首先分析包括存储器层次结构和硬件组件拓扑的静态体系结构特征，以计算静态中断关联化成本。然后，我们尝试进行动态中断关联，以平衡CPU内核上的工作负载并提高整体性能。从桥接，路由，访问控制列表（ACL）匹配到具有不同帧大小的深层数据包检查（DPI）的经典网络应用都已进行了广泛的实验，以比较所提出的方案和其他现有方法的性能。如比较结果所示，在四个应用程序的1G NIC上，其最佳亲和化方法的性能均相似，并且优于Linux默认亲和器，分别为102％，278％，248％和131％。在10G NIC上，桥接和路由应用分别实现了1,424％和1,343％的大幅提升。此外，在不平衡流量负载的模拟设置中，与固定亲权化方法相比，系统中断利用率最多提高150％，吞吐量最多提高1.2 Mpps，证明了动态中断平衡的有效性。

著录项

来源
《IEEE Transactions on Parallel and Distributed Systems》 |2016年第6期|1783-1795|共13页
作者
Nen-Fu Huang; Wen-Yen Tsai;
展开▼
作者单位

National Tsing Hua University, Hsinchu, Taiwan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Interrupts affinitization; interrupts affinitization; interrupts balancing; multi-core computing;

机译：中断关联;中断关联;中断平衡;多核计算;

相似文献

外文文献
中文文献
专利

1. Protocol-Aware Packet Scheduling Algorithm for Multi-Protocol Processing in Multi-Core MPL Architecture [J] . Runzi ZHANG, Jinlin WANG, Yiqiang SHENG, IEICE transactions on information and systems . 2017,第12期

机译：多核MPL体系结构中用于多协议处理的协议感知数据包调度算法
2. A topology-aware load balancing algorithm for clustered hierarchical multi-core machines [J] . Laercio L. Pilla, Christiane P. Ribeiro, Pierre Coucheney, Future generation computer systems . 2014,第jana期

机译：集群分层多核计算机的拓扑感知负载均衡算法
3. A Cost-Effective Load-Balancing Policy for Tile-Based, Massive Multi-Core Packet Processors [J] . ENRIC MUSOLL ACM Transactions on Embedded Computing Systems . 2010,第3期

机译：基于图块的大规模多核分组处理器的一种经济有效的负载均衡策略
4. A port-configuration assisted NIC IRQ affinitization scheme for multi-core packet forwarding applications [C] . Wen-Yen Tsai, Nen-Fu Huang, Hsien-Wei Hung GLOBECOM;IEEE Global Communications Conference;GC12 AHSN;Ad hoc and sensor networking symposium;GC12 CISS;Communication and information system security symposium;GC12 CogRN;Communication theory symposium;Cognitive radio and networks symposium;GC12 CT;Communications software, services and multimedia symposium;Communications QoS, reliability and modelling symposium;GC12 CSSM;Symposium on selected areas in communications;GC12 CQRM;Optical networks and systems symposium;GC12 SAC;Next generation networking and internet symposium;GC12 ONS;GC12 SAC;GC12 NGNI;GC12 WN;Wireless networking symposium;Wireless communications symposium . 2012

机译：用于多核数据包转发应用程序的端口配置辅助NIC IRQ关联化方案
5. Hardware-Assisted Security Mechanisms on Arm-Based Multi-Core Processors [D] . Wan, Shengye. 2020

机译：基于ARM的多核处理器的硬件辅助安全机制
6. Learning-Directed Dynamic Voltage and Frequency Scaling Scheme with Adjustable Performance for Single-Core and Multi-Core Embedded and Mobile Systems [O] . Yen-Lin Chen, Ming-Feng Chang, Chao-Wei Yu, 2018

机译：具有学习性能的学习型动态电压和频率缩放方案适用于单核和多核嵌入式和移动系统
7. Protocol-Aware Packet Scheduling Algorithm for Multi-Protocol Processing in Multi-Core MPL Architecture [O] . Runzi ZHANG, Jinlin WANG, Yiqiang SHENG, 2017

机译：多核MPL体系结构中多协议处理的协议感知分组调度算法

qcAffin: A Hardware Topology Aware Interrupt Affinitizing and Balancing Scheme for Multi-Core and Multi-Queue Packet Processing Systems

摘要

著录项

相似文献

相关主题

期刊订阅