Imp 22

This document discusses architectural features of programmable digital signal processing devices. It describes basic features like arithmetic, logical, and multiply-accumulate instructions. It also discusses computational building blocks like multipliers and shifters. Memory architectures like Von Neumann and Harvard are covered. Addressing modes like immediate, register, direct, indirect, circular and bit-reversed are explained. The document provides details about multipliers, barrel shifters, multiply-accumulate units, and arithmetic logic units.

Uploaded by

Panku Rangaree


Subject Name: Digital Signal Processing Algorithms & Architecture

Subject Code: 10EC751

Prepared By: S. Shikky Marice, Prashanth, Shivlila

Department: Electronics and Communication Engineering

Date: 24.8.2014
Unit-02

Architectures for Programmable Digital Signal Processing Devices
Basic Architectural Features

A programmable DSP device should provide instructions similar to those of a
conventional microprocessor. The instruction set of a typical DSP device should
include the following:
a. Arithmetic operations such as ADD, SUBTRACT, MULTIPLY, etc.
b. Logical operations such as AND, OR, NOT, XOR, etc.
c. Multiply and Accumulate (MAC) operation
d. Signal scaling operation
In addition to the above provisions, the architecture should also include:
a. On-chip registers to store intermediate results
b. On-chip memories to store signal samples (RAM)
c. On-chip memories to store filter coefficients (ROM)
DSP Computational Building Blocks

•Multipliers
The advent of single-chip multipliers paved the way for
implementing DSP functions on a VLSI chip. Parallel multipliers
have now replaced the traditional shift-and-add multipliers.
A parallel multiplier takes a single processor cycle to fetch and
execute the instruction and to store the result. Parallel multipliers
are also called array multipliers. The key features to be considered
for a multiplier are:
a. Accuracy
b. Dynamic range
c. Speed
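•Parallel multipliers:
Consider the multiplication of two unsigned numbers A and B. Let A be
represented using m bits as (A_{m-1} A_{m-2} ... A_1 A_0) and B be
represented using n bits as (B_{n-1} B_{n-2} ... B_1 B_0), so that
$$A = \sum_{i=0}^{m-1} A_i 2^i, \qquad B = \sum_{j=0}^{n-1} B_j 2^j$$
Then the product of these two numbers is given by
$$P = A \cdot B = \sum_{i=0}^{m-1} \sum_{j=0}^{n-1} A_i B_j 2^{i+j}$$
A multiplier built on this structure is called a Braun multiplier.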
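As a minimal sketch of the parallel-multiplier idea, the product of two unsigned numbers can be formed by summing every one-bit product A_i AND B_j, weighted by 2^(i+j). The function name `partial_product_multiply` is an illustrative assumption, and the software loops only model what an array multiplier computes in parallel hardware.

```python
def partial_product_multiply(a, b, m, n):
    """Multiply unsigned a (m bits) by unsigned b (n bits) by summing
    the one-bit partial products A_i AND B_j, each weighted by 2**(i + j)."""
    if not (0 <= a < (1 << m) and 0 <= b < (1 << n)):
        raise ValueError("operands must fit in m and n bits respectively")
    product = 0
    for i in range(m):
        a_i = (a >> i) & 1          # bit i of A
        for j in range(n):
            b_j = (b >> j) & 1      # bit j of B
            product += (a_i & b_j) << (i + j)
    return product


if __name__ == "__main__":
    # 13 * 11 with 4-bit operands
    print(partial_product_multiply(13, 11, 4, 4))
```

An array multiplier has one AND gate per (i, j) pair and a grid of adders, so all m x n partial products are available at once instead of being visited in a loop.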
•Multipliers for signed numbers:
Consider two signed numbers A and B,
•Bus Widths
Consider the multiplication of two n-bit numbers X and Y. The product Z can
be at most 2n bits long. To perform the whole operation in a single
execution cycle, we require two buses of width n bits each to fetch the operands
X and Y, and a bus of width 2n bits to store the result Z to memory. Although
this performs the operation faster, it is expensive and therefore not an
efficient implementation.

There are two alternatives that solve this problem:

a. Use the n-bit operand bus and save Z at two successive memory locations.
This stores the exact value of Z in memory, but takes two cycles to store
the result.
b. Discard the lower n bits of the result Z and store only the higher-order n bits
in memory. This is not suitable for applications where an accurate result is
required.
A third alternative can be used for applications where speed is not a major
concern: latches are used for the inputs and outputs, so that a single bus suffices
to fetch the operands and to store the result.
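The two storage alternatives (a) and (b) can be illustrated with a short sketch. The helper name `split_product` is an assumption made for illustration; it splits the 2n-bit product of two unsigned n-bit numbers into a high word and a low word.

```python
def split_product(x, y, n):
    """Return (high_word, low_word) of the 2n-bit product of two
    unsigned n-bit numbers x and y."""
    mask = (1 << n) - 1
    z = (x & mask) * (y & mask)     # full product, at most 2n bits
    return z >> n, z & mask


if __name__ == "__main__":
    high, low = split_product(0xFFFF, 0xFFFF, 16)
    # Alternative (a): store both words, exact result, two store cycles.
    exact = (high << 16) | low
    # Alternative (b): keep only the high word, one store cycle, lower bits lost.
    truncated = high << 16
    print(hex(exact), hex(truncated), exact - truncated)
```

The difference `exact - truncated` is the low word, which is the accuracy given up by alternative (b).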
Shifters
Shifters are used to scale operands or results either down or up. The
following scenarios show why a shifter is needed:

a. While adding N numbers, each n bits long, the sum can grow up to
n + log2 N bits long. If the accumulator is n bits long, an overflow error
can occur. This can be overcome by using a shifter to scale down each
operand by log2 N bits.
b. Similarly, the product of two n-bit numbers can grow up to 2n bits long.
Generally the lower n bits are neglected and the sign bit is shifted to
preserve the sign of the product.
c. Finally, when adding two floating-point numbers, one of the operands has
to be shifted appropriately to make the exponents of the two numbers equal.
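Scenario (a) can be sketched as follows, assuming unsigned samples. Each sample is shifted right by ceil(log2 N) bits before accumulation, so the sum of N samples still fits in n bits. The function name `scaled_sum` is hypothetical.

```python
def scaled_sum(samples, n_bits):
    """Sum unsigned n-bit samples in an n-bit accumulator by first
    scaling each sample down by ceil(log2(N)) bits."""
    if not samples:
        raise ValueError("need at least one sample")
    shift = (len(samples) - 1).bit_length()   # ceil(log2(N)) for N >= 1
    limit = 1 << n_bits
    acc = 0
    for s in samples:
        if not 0 <= s < limit:
            raise ValueError("sample does not fit in n_bits")
        acc += s >> shift                      # scale down before adding
    return acc, shift


if __name__ == "__main__":
    acc, shift = scaled_sum([0xFFFF] * 64, 16)
    print(acc, shift, acc < (1 << 16))
```

The price of avoiding overflow this way is precision: the low `shift` bits of every sample are discarded before the addition.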
Barrel Shifters
For an input of length n, log2 n control lines are required to select the shift
amount. One additional control line is required to indicate the direction of the shift.
•A barrel shifter is to be designed with 16 inputs for left shifts from 0 to 15 bits.
How many control lines are required to implement the shifter?
As 16 bits are used to represent the input, log2 16 = 4 control inputs
are required. Since only left shifts are needed, no direction line is required.

•It is required to find the sum of 64 16-bit numbers. How many bits should the
accumulator have so that the sum can be computed without overflow or loss
of accuracy?
The sum of 64 16-bit numbers can grow up to (16 + log2 64) = 22 bits long. Hence
the accumulator should be 22 bits long to avoid overflow.
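The two worked examples can be checked with a small sketch. All three function names are illustrative assumptions. The barrel shifter is modelled here as log2(width) stages, where stage k shifts by 2^k positions when control bit k is set; this is one common way to picture why log2 n control lines suffice.

```python
def control_lines(n_inputs, bidirectional=False):
    """Control lines for a barrel shifter: ceil(log2(n)) for the shift
    amount, plus one for the direction if both directions are supported."""
    lines = (n_inputs - 1).bit_length()
    return lines + 1 if bidirectional else lines


def accumulator_bits(count, word_bits):
    """Accumulator width needed to sum `count` words of `word_bits` bits
    without overflow: word_bits + ceil(log2(count))."""
    return word_bits + (count - 1).bit_length()


def barrel_shift_left(value, amount, width=16):
    """Left shift modelled as log2(width) stages; stage k shifts by 2**k
    when bit k of `amount` (control line k) is set."""
    mask = (1 << width) - 1
    for k in range((width - 1).bit_length()):
        if (amount >> k) & 1:
            value = (value << (1 << k)) & mask
    return value


if __name__ == "__main__":
    print(control_lines(16))            # shift-amount lines for 16 inputs
    print(accumulator_bits(64, 16))     # bits for the sum of 64 16-bit numbers
    print(hex(barrel_shift_left(0x00FF, 4)))
```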
Multiply and Accumulate Unit
Overflow and Underflow

While designing a MAC unit, attention has to be paid to the word sizes
encountered at the input of the multiplier and to the sizes of the add/subtract
unit and the accumulator, as there is a possibility of overflow and underflow.
Overflow/underflow can be avoided by using any of the following methods:
a. Using shifters at the input and the output of the MAC
b. Providing guard bits in the accumulator
c. Using saturation logic
Saturation logic
Overflow occurs if the result goes beyond the most positive number the
accumulator can handle, and underflow occurs if it goes below the most negative
number. The error can be contained by loading the accumulator with the most
positive number it can handle at the time of overflow, and with the most negative
number it can handle at the time of underflow. This method is called saturation
logic.
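A minimal sketch of saturation logic, assuming a two's-complement accumulator of `bits` bits. The function names are illustrative; `wrapping_add` is included only to contrast saturation with what an unprotected accumulator would hold.

```python
def saturating_add(acc, value, bits=16):
    """Add and clamp the result to the two's-complement range of `bits` bits."""
    most_positive = (1 << (bits - 1)) - 1
    most_negative = -(1 << (bits - 1))
    return max(most_negative, min(most_positive, acc + value))


def wrapping_add(acc, value, bits=16):
    """Add with two's-complement wraparound (no overflow protection)."""
    half = 1 << (bits - 1)
    return ((acc + value + half) % (1 << bits)) - half


if __name__ == "__main__":
    print(saturating_add(32000, 1000))    # clamps at the most positive value
    print(wrapping_add(32000, 1000))      # wraps around to a negative value
    print(saturating_add(-32000, -1000))  # clamps at the most negative value
```

Saturation does not recover the true result, but the clamped value is much closer to it than the wrapped value, which even has the wrong sign.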

Arithmetic and Logic Unit

The arithmetic logic unit (ALU) carries out the additional arithmetic and logic
operations required for a DSP:
• add, subtract, increment, decrement, negate
• AND, OR, NOT, XOR, compare
• shift, multiply (uncommon in general microprocessors)
with additional features common to general microprocessors:
1. status flags for sign, zero, carry and overflow
2. overflow management via saturation logic
3. register files for storing intermediate results
Figure: Arithmetic Logic Unit of a DSP

Bus Architecture and Memory

Bus architecture and memory play a significant role in dictating the cost, speed
and size of DSPs.
Common architectures include the von Neumann and Harvard architectures.

Von Neumann Architecture
• program and data reside in the same memory
• a single bus is used to access both

Implications:
• slows down program execution, since the processor has to wait for data even
after the instruction is made available

Harvard Architecture
• program and data reside in separate memories with two independent buses
Implications:
• faster program execution because of simultaneous memory
access capability
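The speed difference can be illustrated with a deliberately simple cycle count. This is a toy model under stated assumptions, not a description of any real device: every memory access takes one cycle, there is no caching or pipelining, and the function names are hypothetical.

```python
def von_neumann_cycles(instructions, data_accesses_per_instruction=1):
    """One shared bus: the instruction fetch and each data access
    must take turns, so their cycle counts add up."""
    return instructions * (1 + data_accesses_per_instruction)


def harvard_cycles(instructions, data_accesses_per_instruction=1):
    """Separate program and data buses: the instruction fetch overlaps
    with data access, so the longer of the two sets the pace."""
    return instructions * max(1, data_accesses_per_instruction)


if __name__ == "__main__":
    n = 100
    print(von_neumann_cycles(n), harvard_cycles(n))
```

Under these assumptions, a program whose instructions each read one data word needs twice as many memory cycles on the single-bus machine.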
On-Chip Memory
• on-chip = on-processor
• helps run DSP algorithms faster than when memory is off-chip
• dedicated address and data buses are available
• speed: on-chip memories should match the speed of ALU operations
• size: the more chip area the memory takes, the less area is available for
other DSP functions

Data Addressing Capabilities

• an efficient way of accessing data (signal samples and filter coefficients) can
significantly improve implementation performance
• flexible ways to access data help in writing efficient programs
• data addressing modes enhance DSP implementations
DSP Addressing Modes

• Immediate
•Register
• Direct
•Indirect

Special Addressing Modes:

• Circular
•Bit-reversed
Immediate Addressing Mode:
• operand is explicitly known in value
• capability to include data as part of the instruction
Instruction     Operation
ADD #imm        #imm + A → A
• #imm: value represented by imm (a fixed number, such as a filter coefficient,
that is known ahead of time)
• A: accumulator register

Register Addressing Mode

• operand is always in processor register reg
• capability to reference data through its register
Instruction     Operation
ADD reg         reg + A → A
• reg: processor register that provides the operand
• A: accumulator register
Direct Addressing Mode
• operand is always in memory location mem
• capability to reference data by giving its memory location directly
Instruction     Operation
ADD mem         mem + A → A
• mem: specified memory location that provides the operand (e.g., memory could
hold an input signal value)
• A: accumulator register

Indirect Addressing Mode

• operand memory location is variable
• operand address is given by the value of register addrreg
• operand is accessed using the pointer addrreg
Instruction     Operation
ADD *addrreg    *addrreg + A → A
• addrreg: needs to be loaded with the operand's memory location before use
• *addrreg: contents of the memory location that addrreg points to
• A: accumulator register
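The four basic modes differ only in where the ADD instruction finds its operand. The sketch below models that with a dictionary of registers and a list standing in for memory; the function name `add`, the mode strings, and the register names `r1` and `ar0` are all illustrative assumptions, not part of any real instruction set.

```python
def add(acc, mode, operand, registers, memory):
    """Return acc + operand value, where the operand is located
    according to the addressing mode."""
    if mode == "immediate":
        value = operand                       # the value is in the instruction
    elif mode == "register":
        value = registers[operand]            # the value is in a register
    elif mode == "direct":
        value = memory[operand]               # the instruction gives the address
    elif mode == "indirect":
        value = memory[registers[operand]]    # a register holds the address
    else:
        raise ValueError("unknown addressing mode: " + mode)
    return acc + value


if __name__ == "__main__":
    registers = {"r1": 7, "ar0": 2}
    memory = [10, 20, 30, 40]
    for mode, operand in [("immediate", 5), ("register", "r1"),
                          ("direct", 3), ("indirect", "ar0")]:
        print(mode, add(1, mode, operand, registers, memory))
```

Indirect addressing is the only mode here in which the operand location can change at run time without changing the instruction, because updating `ar0` redirects the access.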
Special Addressing Modes

Circular Addressing Mode: a circular buffer allows one to handle a continuous
stream of incoming data samples; once the end of the buffer is reached,
samples are wrapped around and added to the beginning again. This is useful for
implementing real-time digital signal processing, where the input stream is
effectively continuous.

Bit-Reversed Addressing Mode: the address generation unit can be provided with
the capability of producing bit-reversed indices. This is useful for implementing
radix-2 FFT (fast Fourier transform) algorithms, where either the input or the
output is in bit-reversed order.
Circular Addressing:
Circular addressing avoids constantly testing for the need to wrap.
Suppose we consider eight registers to store an incoming data stream.

Reference Index                          Address

0 = 0 mod 8 =  8 mod 8 = 16 mod 8        000 = 0
1 = 1 mod 8 =  9 mod 8 = 17 mod 8        001 = 1
2 = 2 mod 8 = 10 mod 8 = 18 mod 8        010 = 2
3 = 3 mod 8 = 11 mod 8 = 19 mod 8        011 = 3
4 = 4 mod 8 = 12 mod 8 = 20 mod 8        100 = 4
5 = 5 mod 8 = 13 mod 8 = 21 mod 8        101 = 5
6 = 6 mod 8 = 14 mod 8 = 22 mod 8        110 = 6
7 = 7 mod 8 = 15 mod 8 = 23 mod 8        111 = 7
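The modulo mapping in the table can be sketched in software as follows. The class and function names are assumptions for illustration; in a DSP the wrap is done by the address generation hardware, so no explicit modulo or end-of-buffer test appears in the program.

```python
class CircularBuffer:
    """Fixed-size buffer whose write index wraps around modulo its size."""

    def __init__(self, size=8):
        self.data = [0] * size
        self.index = 0

    def write(self, sample):
        self.data[self.index] = sample
        # Wrap to the start once the end of the buffer is reached.
        self.index = (self.index + 1) % len(self.data)


def circular_address(reference_index, size=8):
    """Map a running reference index onto a buffer address."""
    return reference_index % size


if __name__ == "__main__":
    buf = CircularBuffer(8)
    for sample in range(10):      # samples 8 and 9 overwrite addresses 0 and 1
        buf.write(sample)
    print(buf.data, buf.index)
    print([circular_address(r) for r in (0, 8, 16, 9, 23)])
```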
Bit-Reversed Addressing:

Input Index Output Index

000 = 0 000 = 0
001 = 1 100 = 4
010 = 2 010 = 2
011 = 3 110 = 6
100 = 4 001 = 1
101 = 5 101 = 5
110 = 6 011 = 3
111 = 7 111 = 7
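The index mapping in the table can be reproduced with a short sketch. The function name `bit_reverse` is an illustrative assumption; a DSP address generation unit would produce these indices in hardware rather than with a loop.

```python
def bit_reverse(index, bits):
    """Reverse the lowest `bits` bits of index."""
    result = 0
    for _ in range(bits):
        result = (result << 1) | (index & 1)   # move the lowest bit across
        index >>= 1
    return result


if __name__ == "__main__":
    for i in range(8):
        r = bit_reverse(i, 3)
        print(format(i, "03b"), "=", i, "  ", format(r, "03b"), "=", r)
```

Applying the function twice returns the original index, which is why the same addressing mode serves whether the FFT input or the FFT output is in bit-reversed order.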
Speed Issues
Fast execution of algorithms is the most important requirement
of a DSP architecture:
• high-speed instruction operation
• large throughput
• facilitated by advances in VLSI technology and design
innovations
Hardware Architecture
• dedicated hardware support for multiplications, scaling, loops and repeats, and
special addressing modes is essential for fast DSP implementations
• Harvard architecture significantly improves program execution time compared
to von Neumann
• on-chip memories aid speed of program execution considerably

Parallelism
Parallelism means the provision of multiple function units, which may operate in
parallel to increase throughput:
• multiple memories
• different ALUs for data and address computations
• advantage: algorithms can perform more than one operation at a time, increasing speed
• disadvantage: complex hardware is required to control the units and to make sure
instructions and data can be fetched simultaneously
Pipelining
• architectural feature in which an instruction is broken into a number of steps
• a separate unit performs each step at the same time, usually working on different
stages of data
• advantage: if repeated use of the instruction is required, then after an initial
latency the output throughput becomes one instruction per unit time
• disadvantages: pipeline latency, and having to break instructions up into
equally-timed units

Pipelining example:
Five steps:
Step 1: instruction fetch
Step 2: instruction decode
Step 3: operand fetch
Step 4: execute
Step 5: save
Pipelining for speeding up the execution of an instruction

Time slot  Step 1  Step 2  Step 3  Step 4  Step 5  Result
T0         Inst 1
T1         Inst 2  Inst 1
T2         Inst 3  Inst 2  Inst 1
T3         Inst 4  Inst 3  Inst 2  Inst 1
T4         Inst 5  Inst 4  Inst 3  Inst 2  Inst 1  Inst 1 complete
T5         Inst 6  Inst 5  Inst 4  Inst 3  Inst 2  Inst 2 complete
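The occupancy table for a five-step pipeline can be generated with a short sketch, assuming every step takes exactly one time slot and there are no stalls. The function names are hypothetical.

```python
def pipeline_schedule(n_instructions, n_stages=5):
    """Return one row per time slot; row[s] is the number of the
    instruction occupying stage s (0-based), or None if the stage is idle."""
    table = []
    for t in range(n_instructions + n_stages - 1):
        row = []
        for s in range(n_stages):
            inst = t - s + 1    # instruction k enters stage 0 at slot k - 1
            row.append(inst if 1 <= inst <= n_instructions else None)
        table.append(row)
    return table


def completion_slot(inst, n_stages=5):
    """Time slot (0-based) in which instruction `inst` (1-based) finishes."""
    return inst + n_stages - 2


if __name__ == "__main__":
    for t, row in enumerate(pipeline_schedule(6)):
        print("T%d" % t, row)
    print(completion_slot(1), completion_slot(2))
```

Under these assumptions six instructions finish in 10 time slots instead of the 30 that strictly sequential execution of five steps each would need, and after the initial latency one instruction completes per slot.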
Consider an 8-tap FIR filter:
$$y(n) = \sum_{i=0}^{7} h(i)\, x(n-i)$$
The filter can be implemented in many ways, depending on the number of
multipliers and accumulators available.
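One output sample of the FIR sum can be computed with a single multiply-accumulate loop, which is what a one-MAC implementation does tap by tap. This is a minimal sketch; the function name `fir_output` is an assumption, and samples before the start of the input are treated as zero.

```python
def fir_output(h, x, n):
    """Compute y(n) = sum over i of h(i) * x(n - i), one MAC per tap.
    Samples outside the range of x are treated as zero."""
    acc = 0
    for i in range(len(h)):
        if 0 <= n - i < len(x):
            acc += h[i] * x[n - i]    # multiply and accumulate
    return acc


if __name__ == "__main__":
    h = [1] * 8                  # 8 taps, all ones: a running sum
    x = list(range(1, 11))       # input samples 1 .. 10
    print([fir_output(h, x, n) for n in range(len(x))])
```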

1. Implementation using a single MAC unit
[Figure: input x(n) and its delayed versions x(n-1) ... x(n-7), delay elements
labelled 8T, a multiplexer, and a MAC unit containing the multiplier]
2. Pipelined implementation of an 8-tap FIR filter using eight MACs
3. Parallel implementation using two MAC units

Type of implementation                   Maximum sample rate   Maximum throughput
1 MAC                                    1/(8T)                1 sample in 8T units of time
Pipelined (8 multipliers and 8 adders)   1/T                   1 sample in T units of time
2 MAC                                    1/(4T)                1 sample in 4T units of time
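The sample periods in the comparison follow from dividing the tap count among the available MAC units. The sketch below assumes each multiply-accumulate takes time T and that the MAC units work fully in parallel; the function name `sample_period` is hypothetical.

```python
def sample_period(taps, mac_units, t=1.0):
    """Time between output samples when `taps` multiply-accumulates are
    shared among `mac_units` units, each taking time t per operation."""
    passes = -(-taps // mac_units)    # ceiling division
    return passes * t


if __name__ == "__main__":
    for units in (1, 2, 8):
        period = sample_period(8, units)
        print(units, "MAC unit(s): period", period, "T, rate", 1 / period, "/T")
```

With eight units the period is T, which matches the pipelined case with eight multipliers and eight adders once the pipeline has filled.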
