首页> 外文学位 >Systematic exploration of trade-offs between application throughput and hardware resource requirements in DSP systems.

【24h】

Systematic exploration of trade-offs between application throughput and hardware resource requirements in DSP systems.

机译：系统研究DSP系统中应用程序吞吐量和硬件资源需求之间的折衷。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Dataflow has been used extensively as an efficient model-of-computation to analyze performance and resource requirements in implementing DSP algorithms on various target architectures. Although various software synthesis techniques have been widely studied in recent years, there is a distinct lack of efficient synthesis techniques in the literature for systematically mapping dataflow models into efficient hardware implementations. In this thesis, we explore three different aspects that contribute to the development of a powerful dataflow-based hardware synthesis framework: (1) Systematic generation of 1D/2D FFT implementation on field programmable gate arrays (FPGAs). The fast Fourier transform (FFT) is one of the most widely-used and important signal processing functions. However, FFT computation generally becomes a major bottleneck for overall system performance due to its high computational requirements. We propose a systematic approach for synthesizing FPGA implementations of one- and two-dimensional (1D and 2D) FFT computations, and rigorously exploring trade-offs between cost (in terms of FPGA resource requirements) and performance (in terms of throughput). Our approach provides an efficient hardware synthesis framework that can be customized to specific design constraints. In our FFT synthesis approach, we apply two orthogonal techniques in FPGA implementation to realize data-parallelism and parallel processing in FFT computation, respectively. These techniques can be applied to various 1D FFT algorithms, including Radix-2 and Radix-4 algorithms, and extended naturally and efficiently to 2D FFT implementation. (2) Buffer optimization under self-timed execution. Self-timed execution is known to provide the maximum achievable throughput when mapping DSP dataflow graphs into hardware under certain technical constraints. Throughput-constrained buffer minimization under self-timed execution is a key question in efficient hardware synthesis for practical design scenarios. Previous approaches to this problem have suffered from high worst case complexity or loose buffer bounds, which lead to inefficient resource utilization. In this thesis, we integrate a novel constraint into traditional self-timed execution to obtain a modified form of self-timed execution, which we call MSTE (Modified Self-Timed Execution). We show that MSTE greatly improves the efficiency with which we can accurately analyze and optimize hardware configurations of dataflow graphs, and furthermore, the additional execution constraints imposed in MSTE result in relatively minor performance overhead. Based on MSTE, we explore novel methods for self-timed analysis and associated techniques for buffer optimization subject to given throughput constraints. (3) Hardware synthesis technique for parameterized dataflow model. Parameterized dataflow modeling approaches allow for dynamic capabilities without excessively compromising the key properties of the existing static dataflow model---compile-time predictability and potential for rigorous optimizations. We develop a novel PSDF-based FPGA architecture framework using National Instrument's LabVIEW FPGA, a recently-introduced commercial platform for reconfigurable hardware implementation. This framework develops novel connections among model-based DSP system design, FPGA implementation, and next generation wireless communication systems.

机译：数据流已广泛用作一种有效的计算模型，以分析在各种目标体系结构上实现DSP算法时的性能和资源需求。尽管近年来已经广泛研究了各种软件综合技术，但是文献中仍然缺乏有效的综合技术，无法将数据流模型系统地映射到有效的硬件实现中。在本文中，我们探讨了有助于开发基于数据流的强大硬件综合框架的三个不同方面：（1）在现场可编程门阵列（FPGA）上系统生成1D / 2D FFT实现。快速傅立叶变换（FFT）是最广泛使用且最重要的信号处理功能之一。但是，由于FFT计算量大，因此通常成为整个系统性能的主要瓶颈。我们提出一种系统化的方法，用于合成一维和二维（一维和二维）FFT计算的FPGA实现，并严格探索成本（就FPGA资源需求而言）和性能（就吞吐量而言）之间的取舍。我们的方法提供了可以针对特定设计约束进行定制的高效硬件综合框架。在我们的FFT综合方法中，我们在FPGA实现中应用了两种正交技术，分别在FFT计算中实现数据并行和并行处理。这些技术可以应用于包括Radix-2和Radix-4算法在内的各种1D FFT算法，并且自然而有效地扩展到2D FFT实现。（2）自定时执行下的缓冲区优化。在某些技术约束下，将DSP数据流图映射到硬件时，自定时执行可提供最大的吞吐量。自定时执行下的吞吐量受限的缓冲区最小化是针对实际设计方案进行有效硬件综合的关键问题。解决该问题的先前方法遭受了最坏情况下的高复杂性或缓冲区边界的松动，这导致资源利用效率低下。在本文中，我们将一种新颖的约束条件整合到传统的自定时执行中，以获得一种修改后的自定时执行形式，我们称之为MSTE（修改后的自定时执行）。我们表明，MSTE大大提高了我们可以准确分析和优化数据流图的硬件配置的效率，此外，MSTE中施加的其他执行约束导致性能开销相对较小。在MSTE的基础上，我们探索了在给定吞吐量约束下用于自定时分析的新方法以及用于缓冲区优化的相关技术。（3）参数化数据流模型的硬件综合技术。参数化数据流建模方法可实现动态功能，而不会过度损害现有静态数据流模型的关键特性-编译时的可预测性和进行严格优化的潜力。我们使用National Instruments的LabVIEW FPGA开发了基于PSDF的新颖FPGA架构框架，这是最近引入的用于可重配置硬件实现的商业平台。该框架在基于模型的DSP系统设计，FPGA实现和下一代无线通信系统之间建立了新颖的联系。

著录项

作者
Kee, Hojin.;
展开▼
作者单位

University of Maryland, College Park.;

展开▼
授予单位 University of Maryland, College Park.;
学科 Engineering Electronics and Electrical.
学位 Ph.D.
年度 2010
页码 129 p.
总页数 129
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Modeling and Exploiting Spatial Locality Trade-Offs in Wavelet-Based Applications under Varying Resource Requirements [J] . BERT GEELEN, VISSARION FERENTINOS, FRANCKY CATTHOOR, ACM Transactions on Embedded Computing Systems . 2010,第3期

机译：资源需求变化下基于小波的应用程序中空间局部权衡的建模与开发
2. Systematic methodology for exploration of performance Energy trade-offs in network applications using Dynamic Data Type refinement [J] . Mamagkakis S, Bartzas A, Pouiklis G, Journal of systems architecture . 2007,第7期

机译：利用动态数据类型细化探索性能的网络方法在网络应用中的能量权衡
3. High-Speed Hardware Architectures for ARIA with Composite Field Arithmetic and Area-Throughput Trade-Offs [J] . Sang-Woo Lee, Sang-Jae Moon, Jeong-Nyeo Kim ETRI journal . 2008,第5期

机译：具有复合场算法和面积通量折衷的ARIA高速硬件架构
4. Systematic cycle budget versus system power trade-off: a new perspective on system exploration of real-time data-dominated applications [C] . Brockmeyer, E., Vandecappelle, . 2000

机译：系统周期预算与系统功耗之间的权衡：对实时数据为主的应用程序进行系统探索的新视角
5. Hardware software co-design of a resource efficient coarse grained reconfigurable architecture for high throughput media processing applications. [D] . Purohit, Sohan S. 2011

机译：用于高吞吐量媒体处理应用程序的资源有效的粗粒度可重新配置体系结构的硬件软件协同设计。
6. Software and Hardware Requirements and Trade-Offs in Operating Systems for Wearables: A Tool to Improve Devices’ Performance [O] . Vicente J. P. Amorim, Mateus C. Silva, Ricardo A. R. Oliveira 2019

机译：可穿戴设备操作系统中的软件和硬件要求以及取舍：提高设备性能的工具
7. Systematic exploitation of data parallelism in hardware synthesis of DSP applications [O] . Mainak Sen, Shuvra S. Bhattacharyya 2004

机译：在Dsp应用的硬件综合中系统地利用数据并行性
8. Economic/Trade-Off Analysis: Common Hardware-Software Computer Resources for the Advanced Attack Helicopter (YAH-64) Fire Control System [R] . Brachman, R. J., Clineff, G. 1980

机译：经济/权衡分析：高级攻击直升机（YaH-64）火控系统的通用硬件 - 软件计算机资源

Systematic exploration of trade-offs between application throughput and hardware resource requirements in DSP systems.

摘要

著录项

相似文献

相关主题

期刊订阅