面向通讯同步的多处理器阵列重构

吴亚兰; 武继刚; 姜文超; 刘竹松

首页> 中文期刊> 《计算机科学》 >面向通讯同步的多处理器阵列重构

面向通讯同步的多处理器阵列重构

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

从多处理器阵列中获取所需大小并且同步通讯性能优良的子阵列,是高性能拓扑重构的核心问题之一.基于不同的逻辑列剔除策略提出了3种面向通讯同步的拓扑重构算法:基于分治思想剔除逻辑列的重构算法(SCA_01),该算法能够使被优化的逻辑列相对均匀地分布在物理阵列中;优先剔除长逻辑列的贪心重构算法(SCA_02),该算法能够使被优化的逻辑列的长链接总数最少;基于分治与长链接数的混成重构算法(SCA_03),该算法将某一区域内的最长逻辑列剔除,且尽可能将剩余逻辑列均匀分布在物理阵列中.同时,对逻辑阵列的最大通讯延时给出了下界的求解算法.实验结果表明,3种算法在故障率小于1%、逻辑列的剔除率超过20%时,算法重构出的逻辑阵列的通讯延时特别接近计算出的性能下界.在多数情况下SCA_01优于SCA_02和SCA_03,而后两者的性能相近.在小阵列上且故障率与剔除率较小时,SCA_02具有性能优势,但在大阵列上SCA_03具有优势.在32×32的阵列上,SCA_01构造的阵列产生的通讯延时较SCA_02和SCA_03产生的延时平均减少25%,并且运行速度也提升了19.4%.%Reconfiguring VLSI arrays to get a logical array with given size and synchronous communication is one of the key problems in reconfigurable topology for high performance computing.This paper presented three algorithms based on three different strategies of excluding logical co-lumns.The first algorithm,named SCA_01,can make the logical co-lumns in the uniform distribution in the host array,based on the divide-and-conquer for excluding logical columns.The second algorithm,named SCA_02,can minimize the number of the long interconnects of the logical array,based on the excluding the long logical column in priority.The third algorithm,named SCA_03,keeps the logical columns distributed in uniform way based on the hybrid strategy from excluding the long logical column and divide-and-conquer.In addition,this paper contributed an algorithm to calculate the lower bound of the communication delay for the given logical array.Simulation results show that,the communication delay of the logical array reconstructed by three algorithms is close to the lower bound proposed in this paper,when fault rate is less than 1% and the exclusion rate of logical columns is over 20%.Algorithm SCA_01 is superior to SCA_02 and SCA_03 in most cases,while SCA_02 and SCA_03 have nearly the same performance.But SCA_02 is better on smaller arrays and SCA_03 is better on large arrays,when the fault rate and exclusion rate are relatively small.The communication delay generated by SCA_01 is less than that of SCA_02 and SCA_03 by 25% on 32×32 host arrays.Moreover,SCA_01 is faster than the other two algorithms,and the running time is saved by 19.4%.It is concluded that SCA_01 is one of the relatively desirable algorithms to generate the logical arrays with minimum communication delay for high performance computing.

著录项

来源
《计算机科学》 |2017年第7期|47-56|共10页
作者
吴亚兰; 武继刚; 姜文超; 刘竹松;
展开▼
作者单位

广东工业大学计算机学院广州510006;

广东工业大学计算机学院广州510006;

广东工业大学计算机学院广州510006;

广东工业大学计算机学院广州510006;

展开▼
原文格式 PDF
正文语种 chi
中图分类总体结构、系统结构;
关键词
VLSI阵列; 拓扑重构; 容错; 分治; 算法;

相似文献

中文文献
外文文献
专利

1. 阵列多通道同步采集系统与多处理器结构的数据采集方法实现 [J] . 阎振华 ,黄建国 ,何成兵 . 测控技术 . 2007,第009期
2. 可重构阵列的同步性能优化算法 [J] . 张元瑞 ,武继刚 ,段新明 . 计算机科学 . 2012,第003期
3. 一种面向密码算法的轻量级可重构阵列 [J] . 张宇帆 . 现代计算机（专业版） . 2021,第010期
4. 一种面向粗粒度可重构阵列的硬件木马检测算法的设计与实现 [J] . 严迎建 ,刘敏 ,邱钊洋 . 电子与信息学报 . 2019,第005期
5. 面向对数与指数函数的可重构阵列结构 [J] . 吕青 ,蒋林 ,邓军勇 . 微电子学与计算机 . 2016,第10期
6. 片上网络多处理器阵列的高效拓扑重构算法 [C] . WANG Chao ,王超 ,WU Ji-Gang . 2013全国高性能计算学术年会 . 2013
7. 片上网络多处理器阵列的拓扑重构 [A] . 王超 . 2014

面向通讯同步的多处理器阵列重构

摘要

著录项

相似文献

相关主题

期刊订阅