A Low-Power Reconfigurable Data-Flow Driven DSP System: Motivation and Background
Marlene Wan, Hui Zhang, Martin Benes, Jan Rabaey
Berkeley Wireless Research Center, EECS Department, University of California, Berkeley
ABSTRACT - Reconfigurable architectures have emerged as a promising implementation platform for providing high-flexibility, high-performance, and low-power solutions for future wireless embedded devices. We discuss in detail a reconfigurable data-flow driven architecture, including its computation model, communication mechanism, and implementation. We also describe a set of software tools developed to perform automatic mapping from algorithms to the architecture and to evaluate the performance and energy of the resulting mapping. Finally, we present results on digital signal processing and wireless communication algorithms that show the energy efficiency of the system and the effectiveness of the tools. Our system shows more than one order of magnitude of improvement in energy efficiency when compared to low-power programmable processors.
The effectiveness of the system is demonstrated by mapping wireless communication and signal processing algorithms to the architecture.

Figure 1. Reconfigurable Digital Signal Processor Design Flow: algorithm optimization, architecture selection (ASIC? programmable DSP? reconfigurable?), and implementation optimization, supported by mapping and estimation based on a hardware component library and an architecture description; data-flow kernels* map onto the reconfigurable architecture while high-level control runs on a microprocessor. (*Kernel: computation-intensive operations within an algorithm, often corresponding to data-flow computations in nested loops.)
In our realization of the architecture, the data-flow driven satellites are medium to fine grained according to the definition of [5]. The functionality of the satellites is divided into three categories: source, computation, and memory. To support adaptive computations without reconfiguration, such as changing the vector length or the number of taps for the computation satellites, we have developed a minimum-overhead mechanism for passing data structures (scalar, vector, and matrix). Each computation satellite needs to be configured for the data structures it consumes and produces (e.g., vectors to scalar for a MAC, shown in Figure 2). The source satellites generate tokens indicating the end of the data structure in parallel with the corresponding data.
Figure 2. Data-flow Driven Operations of the Satellites (e.g., a MAC consumes n-element vectors and produces one scalar; the end-of-vector token is sent in parallel with the last data element).

In order to support dedicated links between satellites without reconfiguration overhead and global control, data steering elements are embedded in the reconfigurable network. In general, data steering elements fall into three categories: static (data goes in a fixed direction between reconfiguration periods), statically scheduled (data goes in directions instructed by programs configured at reconfiguration time), and dynamically determined (data is annotated with the direction). Only the first two are supported by our realization of the architecture template, because dynamic data steering imposes too large an energy overhead for the granularity of the computational satellites.

Currently, the data-flow driven computation is implemented using globally asynchronous, locally synchronous clocking. A general handshaking scheme has been developed, and a library of satellites has been designed using the scheme [6]. Address generators, input ports (with data from the microprocessor), and FPGAs can serve as sources and are in charge of generating end-of-data-structure tokens. Implementation issues for low-power reconfigurable interconnection networks are addressed in detail in [7].
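To make the token-passing mechanism described above concrete, the following minimal C++ sketch shows how a MAC-style computation satellite could consume two tagged token streams and emit one scalar per end-of-vector token. The Token structure and class names are illustrative only and are not the actual hardware or library interface.

// Minimal sketch (not the actual satellite interface): tokens carry an
// end-of-vector flag in parallel with the data, as described above.
#include <cstdint>
#include <iostream>

struct Token {
    int32_t data;
    bool    end_of_vector;   // asserted by the source satellite with the last element
};

// A MAC satellite configured for "vector x vector -> scalar": it accumulates
// products until an end-of-vector token arrives, then emits one scalar result.
class MacSatellite {
    int64_t acc = 0;
public:
    bool consume(const Token& a, const Token& b, int64_t& result) {
        acc += static_cast<int64_t>(a.data) * b.data;
        if (a.end_of_vector) {          // vector boundary: flush the accumulator
            result = acc;
            acc = 0;
            return true;                // one scalar token produced
        }
        return false;
    }
};

int main() {
    MacSatellite mac;
    int32_t x[4] = {1, 2, 3, 4}, y[4] = {5, 6, 7, 8};
    int64_t out = 0;
    for (int i = 0; i < 4; ++i) {
        Token a{x[i], i == 3}, b{y[i], i == 3};   // the source tags the last element
        if (mac.consume(a, b, out))
            std::cout << "dot product = " << out << "\n";   // prints 70
    }
}

Because the vector length is encoded in the token stream rather than in the satellite configuration, the same configured satellite can process vectors of different lengths without reconfiguration.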
Performance Models
Power, delay, and area models have been developed for an extensive library of satellite modules designed at the University of California, Berkeley. Latency and analytical models of the effective switching capacitance (C_eff) of the modules are derived from circuit-level simulation. In this section we introduce only the energy models, since the characterization of area and timing is well understood. Because our models are used for high-level architecture selection, some degree of inaccuracy is acceptable; therefore, a white-noise signal distribution is assumed instead of statistical signal modeling when obtaining C_eff.
Energy_SAT = C_eff * V_dd^2    (Eq. 1)
Another important contributor to power consumption in our architecture is the reconfigurable interconnect. A methodology exists [7] to optimize domain-specific reconfigurable interconnect architectures such that their energy and performance are close to ASIC implementations. Therefore, ASIC-based interconnect power estimation is used for the reconfigurable interconnect base cost: the average wire length (and thus switching capacitance, C_ave) between satellites is predicted based on the area of the modules needed for an application. A preliminary dynamic switching element has also been designed and its power model characterized [6]. The energy for a satellite-to-satellite link is therefore as follows:
Energy_link = (C_ave + M * C_switch) * V_dd^2    (Eq. 2)
The parameter M specifies the number of dynamic switches required on the particular link, which is known at synthesis time.
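As a concrete illustration of these models, the short sketch below evaluates the satellite energy of Eq. 1 and a link cost of the form (C_ave + M * C_switch) * V_dd^2 for Eq. 2; all capacitance and voltage values are placeholders rather than characterized library data.

// Illustrative use of the energy models (Eq. 1 and Eq. 2). All numeric values
// below are placeholders, not characterized library data.
#include <cstdio>

// Eq. 1: energy per satellite operation.
double satellite_energy(double c_eff_farads, double vdd_volts) {
    return c_eff_farads * vdd_volts * vdd_volts;
}

// Eq. 2: energy per satellite-to-satellite transfer over a link with M dynamic switches.
double link_energy(double c_ave_farads, int m_switches, double c_switch_farads,
                   double vdd_volts) {
    return (c_ave_farads + m_switches * c_switch_farads) * vdd_volts * vdd_volts;
}

int main() {
    const double vdd = 1.0;            // placeholder supply voltage
    const double c_eff_mac = 2.0e-12;  // placeholder C_eff for a MAC satellite (2 pF)
    const double c_ave = 0.5e-12;      // placeholder average wire capacitance
    const double c_switch = 0.1e-12;   // placeholder per-switch capacitance
    std::printf("E_sat  = %.3e J\n", satellite_energy(c_eff_mac, vdd));
    std::printf("E_link = %.3e J\n", link_energy(c_ave, /*M=*/2, c_switch, vdd));
}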
Simulation Tool
Based on the realization of the architecture template, a simulation environment has been developed to provide an application-specific simulator in a style similar to [8]. Since computation is mapped to clusters of satellites, an object-oriented intermediate form based on the concepts of modules (heterogeneous satellites) and queues (links between satellites) is created. A mapped kernel is constructed by
building a netlist using the module and queue library (Figure 3). In order to facilitate verification and performance feedback, wrappers are placed around all modules and queues so that modules can be modeled as concurrent processes and queues as synchronized objects. Energy and time stamps are also associated with each module and queue so that performance data can be collected. An application-specific simulator is automatically instantiated once a netlist is specified.
Figure 3. An Intermediate Form Specification for a Computation Kernel

Currently, the intermediate form is implemented in C++ using the Solaris thread library [9] (other common thread libraries can be switched in easily). Common satellite processors (such as the MAC/multiply processor, ALU processor, memory and address generator, etc.) and data-steering modules have been incorporated into our module library.
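The following sketch conveys the flavor of the module/queue intermediate form, using the standard C++ thread facilities in place of the Solaris thread library; the class names, wrapper structure, and energy bookkeeping are illustrative rather than the tool's actual API.

// Sketch of the module/queue intermediate form: modules as concurrent processes,
// queues as synchronized objects, each carrying energy stamps. Names are illustrative.
#include <condition_variable>
#include <deque>
#include <iostream>
#include <mutex>
#include <thread>

// A queue models a link between satellites: a synchronized object that also
// accumulates an energy stamp per transfer (an Eq. 2 cost would be charged here).
template <typename T>
class LinkQueue {
    std::deque<T> buf;
    std::mutex mtx;
    std::condition_variable cv;
public:
    double energy = 0.0;                   // accumulated link energy
    void put(const T& v, double e_link) {
        std::lock_guard<std::mutex> lk(mtx);
        buf.push_back(v);
        energy += e_link;
        cv.notify_one();
    }
    T get() {
        std::unique_lock<std::mutex> lk(mtx);
        cv.wait(lk, [&] { return !buf.empty(); });
        T v = buf.front();
        buf.pop_front();
        return v;
    }
};

// A module models a satellite: a concurrent process that consumes from input
// queues, performs its operation, and charges its own energy stamp (Eq. 1).
struct AdderModule {
    LinkQueue<int>& in_a;
    LinkQueue<int>& in_b;
    LinkQueue<int>& out;
    double energy;
    void run(int n_tokens, double e_op, double e_link) {
        for (int i = 0; i < n_tokens; ++i) {
            int s = in_a.get() + in_b.get();
            energy += e_op;
            out.put(s, e_link);
        }
    }
};

int main() {
    LinkQueue<int> a, b, y;
    AdderModule add{a, b, y, 0.0};
    std::thread t([&] { add.run(3, /*e_op=*/1e-12, /*e_link=*/2e-13); });
    for (int i = 0; i < 3; ++i) { a.put(i, 2e-13); b.put(10 * i, 2e-13); }
    for (int i = 0; i < 3; ++i) std::cout << y.get() << " ";   // prints 0 11 22
    t.join();
    std::cout << "\nmodule energy = " << add.energy << " J\n";
}

Instantiating a simulator then amounts to building such a netlist of module and queue objects and starting one thread per module.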
Synthesis Tool
To relieve the burden of manually mapping algorithms to the architecture, we provide a synthesis tool that translates an algorithm (specified in a subset of C) into a direct-mapped implementation on the architecture. The output is the computation specified in the intermediate form, from which the kernel performance and energy can be collected dynamically. In addition, for algorithms with nested loops of constant loop length, energy and performance information is also analyzed statically to avoid the overhead of simulation. The algorithm is compiled into the Stanford University Intermediate Format (SUIF) and then converted into a hierarchical control/data flow graph (CDFG [10]) representation. The current conversion from SUIF to CDFG exposes all scalar dependencies and preserves all WAW, RAW, and WAR dependencies in array accesses. The current synthesis tool allocates arrays of the same name to a particular memory and each
operation node in the CDFG to a hardware unit (Figure 4 shows an example of a mapped kernel). This assignment of operations gives rate-optimal execution of each computational node in the CDFG. The address generator program for each memory is generated based on the sequence of address expressions and the corresponding loop iterations (an end of loop indicates an end of a vector). By merging the corresponding fan-outs of each memory read node and computational node in the CDFG, a dedicated data steering element is generated for each output port. As shown in the example of Figure 4, while all other links are static, the output of memory Y1 (y1 has 4 read nodes in the algorithm) is statically scheduled by a program: data from memory Y1 is first broadcast to the MAC satellites, but after an end-of-vector (corresponding to Loop1) the direction of the data is changed to the multiplier.

Figure 4. Direct Mapping from C to Data-flow Driven Implementation
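For illustration, a hypothetical kernel in the style of the tool's C-subset input is shown below (it is not the kernel of Figure 4): each array would map to a memory with an address generator, the inner loop body would map to a MAC satellite, and the end of each inner loop would produce an end-of-vector token on the corresponding link.

/* Hypothetical kernel in the style of the C-subset input (not the kernel of
 * Figure 4). Arrays map to memories with address generators; the inner loop
 * maps to a MAC satellite; the end of each inner loop becomes an end-of-vector
 * token on the corresponding link. */
#define N 16
#define M 4

void kernel(const int x[M][N], const int y1[N], int out[M])
{
    for (int i = 0; i < M; i++) {        /* Loop1: one scalar result per pass   */
        int acc = 0;
        for (int j = 0; j < N; j++)      /* Loop2: vector consumed by the MAC   */
            acc += x[i][j] * y1[j];      /* y1 is re-read on every pass, so its */
        out[i] = acc;                    /* memory output port is broadcast     */
    }
}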
Static performance estimation for loops with constant loop length is also provided so that the overhead of simulation can be avoided. For a hierarchical CDFG, the total energy can be computed by performing a tree search on the graph in O(E) time:
TotalEnergy = IterationNum * Σ_comp Energy_comp
In the equation, comp is either a satellite (Eq. 1), a link between satellites (Eq. 2), or another CDFG hierarchy.
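A minimal sketch of this static estimate is shown below, assuming a simple tree representation of the hierarchical CDFG; the node structure and the numeric energies are illustrative only.

// Sketch of the static energy estimate for a hierarchical CDFG: each node is a
// satellite (Eq. 1), a link (Eq. 2), or a nested loop hierarchy, and the total is
// IterationNum times the sum over its components. The structure is illustrative.
#include <iostream>
#include <memory>
#include <vector>

struct CdfgNode {
    double self_energy = 0.0;                       // Eq. 1 or Eq. 2 cost for a leaf
    int iteration_num = 1;                          // loop count for a hierarchy node
    std::vector<std::unique_ptr<CdfgNode>> children;
};

// One pass over the tree, linear in the number of components.
double total_energy(const CdfgNode& n) {
    double sum = n.self_energy;
    for (const auto& c : n.children)
        sum += total_energy(*c);
    return n.iteration_num * sum;
}

int main() {
    // Hypothetical kernel: an inner loop of 16 MAC operations plus a link,
    // nested in an outer loop of 4 iterations with one multiply per pass.
    auto inner = std::make_unique<CdfgNode>();
    inner->iteration_num = 16;
    auto mac = std::make_unique<CdfgNode>();  mac->self_energy = 2.0e-12;
    auto link = std::make_unique<CdfgNode>(); link->self_energy = 0.7e-12;
    inner->children.push_back(std::move(mac));
    inner->children.push_back(std::move(link));

    CdfgNode outer;
    outer.iteration_num = 4;
    auto mul = std::make_unique<CdfgNode>();  mul->self_energy = 1.5e-12;
    outer.children.push_back(std::move(inner));
    outer.children.push_back(std::move(mul));

    std::cout << "estimated kernel energy = " << total_energy(outer) << " J\n";
}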
Since the synthesis tool performs a direct mapping of the CDFG to hardware, the latency of the implementation is characterized by the longest path and the iteration period bound of the CDFG. The longest path is calculated by performing a topological sort of the CDFG (O(E)), and the iteration period bound is calculated in O(V E log E) [12][13].
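The sketch below illustrates the longest-path part of this computation with a standard topological-sort traversal over a small hypothetical DAG; the graph, node delays, and function names are not taken from the tool.

// Longest path through a DAG (node-weighted) via topological sort, O(V + E).
// The adjacency structure and delays below are illustrative only.
#include <algorithm>
#include <iostream>
#include <queue>
#include <vector>

double longest_path(const std::vector<std::vector<int>>& adj,
                    const std::vector<double>& delay) {
    const int n = static_cast<int>(adj.size());
    std::vector<int> indeg(n, 0);
    for (const auto& outs : adj)
        for (int v : outs) ++indeg[v];

    std::queue<int> ready;
    std::vector<double> dist(n);
    for (int u = 0; u < n; ++u) {
        dist[u] = delay[u];                       // path consisting of the node alone
        if (indeg[u] == 0) ready.push(u);
    }
    double best = 0.0;
    while (!ready.empty()) {
        int u = ready.front(); ready.pop();
        best = std::max(best, dist[u]);
        for (int v : adj[u]) {                    // relax successors in topological order
            dist[v] = std::max(dist[v], dist[u] + delay[v]);
            if (--indeg[v] == 0) ready.push(v);
        }
    }
    return best;
}

int main() {
    // Tiny example: 0 -> 1 -> 3 and 0 -> 2 -> 3, with node 2 slower than node 1.
    std::vector<std::vector<int>> adj = {{1, 2}, {3}, {3}, {}};
    std::vector<double> delay = {1.0, 2.0, 5.0, 1.0};
    std::cout << "longest path = " << longest_path(adj, delay) << "\n";   // prints 7
}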
CASE STUDIES
We demonstrate the low-energy nature of the system and the effectiveness of the tools' performance feedback by using the performance information in several architecture selection processes. All energy and performance models of the satellite modules and interconnects are based on physical implementations designed in a 0.25 μm technology. The reconfigurable interconnect is characterized in detail in [7], and the preliminary overhead of the steering elements is included as well.
Energy on ARM8 (2.5 V):
  Dot_product          11550 μJ
  FIR                   5690 μJ
  VectorSumScalarMul    4800 μJ
  Compute_Code          1550 μJ
  IIR                    390 μJ
CONCLUSION
We have presented a low-power reconfigurable data-flow driven digital signal processing system, described the architecture concept in detail, and shown the energy efficiency of the architecture in the case studies. The examples in the case studies also illustrate how the tools introduced in this paper allow rapid architecture selection and serve as the basis for future optimizations. Building on the ideas introduced in this paper, future work will include algorithm-level transformations (loop transformation and parallelism), implementation optimizations, and more application mappings in adaptive filtering for the wireless communication domain.
ACKNOWLEDGEMENTS
The authors would like to acknowledge DARPA's support of the Pleiades project (DABT-63-96-C-0026) and all the Pleiades members for their input on the research topics discussed in this article.
REFERENCES
[1] G. R. Goslin, "A Guide to Using Field Programmable Gate Arrays for Application-Specific Digital Signal Processing Performance," Proceedings of SPIE, vol. 2914, pp. 321-331.
[2] A. Abnous et al., "Evaluation of a Low-Power Reconfigurable DSP Architecture," Proceedings of the Reconfigurable Architectures Workshop, Orlando, FL, USA, March 1998.
[3] M. Goel and N. R. Shanbhag, "Low-Power Reconfigurable Signal Processing via Dynamic Algorithm Transformations (DAT)," Proceedings of the Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, November 1998.
[4] A. Abnous and J. Rabaey, "Ultra-Low-Power Domain-Specific Multimedia Processors," Proceedings of the IEEE VLSI Signal Processing Workshop, San Francisco, CA, USA, October 1996.
[5] P. Lieverse, E. F. Deprettere, A. C. J. Kienhuis, and E. A. de Kock, "A Clustering Approach to Explore Grain-Sizes in the Definition of Weakly Programmable Processing Elements," 1997 IEEE Workshop on Signal Processing Systems: Design and Implementation, pp. 107-120, De Montfort University, Leicester, UK, November 3-5, 1997.
[6] M. Benes, Master's Thesis, University of California at Berkeley, 1999.
[7] H. Zhang, M. Wan, V. George, and J. Rabaey, "Interconnect Architecture Exploration for Low-Energy Reconfigurable Single-Chip DSPs," Proceedings of WVLSI, Orlando, FL, USA, April 1999.
[8] B. Kienhuis, E. Deprettere, K. Vissers, and P. van der Wolf, "An Approach for Quantitative Analysis of Application-Specific Dataflow Architectures," Proc. 11th Int. Conf. on Application-Specific Systems, Architectures and Processors, Zurich, Switzerland, July 14-16, 1997.
[9] SunSoft Press, Solaris Multithreaded Programming Guide.
[10] J. Rabaey, C. Chu, P. Hoang, and M. Potkonjak, "Fast Prototyping of Datapath-Intensive Architectures," IEEE Design & Test of Computers, vol. 8, no. 2, pp. 40-51, June 1991.
[11] D. Messerschmitt, "Breaking the Recursive Bottleneck," in Performance Limits in Communication Theory and Practice, Kluwer Academic Publishers, 1988.
[12] S.-H. Huang and J. M. Rabaey, "An Integrated Framework for Optimizing Transformations," Proceedings of VLSI Signal Processing IX, pp. 263-272.
[13] C. Leiserson and F. Rose, "Optimizing Synchronous Circuitry by Retiming," Third Caltech Conf. on VLSI, March 1983.
[14] D. Lidsky and J. Rabaey, "Early Power Exploration - a World Wide Web Application," Proceedings of the Design Automation Conference, Las Vegas, NV, June 1996.
[15] N. Zhang, "Implementation Issues in a Wideband Receiver Using Multiuser Detection," Master's Thesis, University of California at Berkeley, 1998.
[16] W. Lee et al., "A 1-V Programmable DSP for Wireless Communications," IEEE Journal of Solid-State Circuits, vol. 32, no. 11, pp. 1766-1776, November 1997.
[17] Gerson and M. Jasiuk, "Vector Sum Excited Linear Prediction (VSELP) Speech Coding at 8 kbps," Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, pp. 461-464, April 1990.
[18] M. Wan, Y. Ichikawa, D. Lidsky, and J. Rabaey, "An Energy Conscious Methodology for Early Design Exploration of Heterogeneous DSPs," Proceedings of the Custom Integrated Circuits Conference, Santa Clara, CA, USA, May 1998.