0% found this document useful (0 votes)

60 views6 pages

Simple Vector Processor Modeled With VHDL

This document describes a project to model a simple vector processor using VHDL. It includes definitions of the instruction set architecture, which includes both scalar and vector instructions. The processor architecture is then described, consisting of main memory, a scalar processor, vector registers, and functional units. The methodology used VHDL and software tools to capture, synthesize, and simulate the design. The project demonstrated the functionality of a basic vector processor and future work could expand the architecture.

Uploaded by

duzngvt123

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

60 views6 pages

Simple Vector Processor Modeled With VHDL

Uploaded by

duzngvt123

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Simple Vector Processor Modeled with VHDL

Osvaldo Espinosa Sosa 1 , Luis Villa Vargas2 and Oscar Camacho Nieto 1
1

Centro de Investigacin en Computacin-IPN, Av. Juan de Dios Batz, esquina con Miguel Otn de Mendizbal, Mxico, D.F., 07738. Mxico [email protected], [email protected] 2 Instiuto Mexicano del Petrleo, Eje Central Lzaro Crdenas No. 152 Col. San Bartolo Atepehuacan, Delegacin Gustavo A Madero, Mxico, D.F. [email protected]
Abstract. Vector architectures have proven to be the best choice if we want to achieve high performance when executing numerical applications and multimedia. In this paper we describe a project whose main goal is to obtain a detailed description of a traditional vector processor using VHDL Hardware Description Language. This work will be very useful for direct application on computer architecture and digital systems courses at universities. Research on power consumption and improved architectures can be benefited also.

Keywords: Vector Processor, Hardware Description Language, VHDL.

1 Introduction
Vector processors are architectures that exploit parallelism at data level. They can operate on numeric arrays of data called vectors. One instruction can add or multiply two vectors containing the same number of data in order to obtain one vector containing results [1]. Vector instructions have several interesting properties such as: Each vector instruction represents several scalar instructions, this permit to reduce fetch and decode processes and the bandwidth required by these steps. Vector operations imply independence among results computation, this way no data dependence need to be checked. Dependences arise between two vector instructions not among vector elements. Instructions that access memory locations have regular patterns that permit design memory schemes with high grade of interleaving in order to obtain high bandwidth with low latencies. Because vector instructions operate in a predetermined way, control hazards are avoided.

It is clear that vector code can run faster than scalar code. This characteristic allows vector processors to execute numerical applications in a more efficient way that scalar computers. Computer architecture courses include vector processing as a topic. Study

of these architectures are important at graduate and undergraduate levels. Hardware Description Languages such as VHDL allow to obtain a precise description of digital circuits facilitating comprehension process. Modern software tools include capture, synthesis and simulation facilities. Students can be benefited using the mentioned software and its advantages.

2 Instruction Set Architecture

Design process begins with definitions about the instructions that processor can execute. Instructions can be divided into scalar and vector: Instructions Arithmetic Memory Branch Control scalar add,sub ld, st bc,bz clrc, setc Vector ADDV, SUBV, MULV LV, SV VEQ

According to Load-Store architectures the syntax proposed is presented below: add rd, rf1,rf2 sub rd, rf1,rf2 ld rd,dir st rd,dir bc offset bz offset clrc setc add v1,v2,v3 subv v1,v2,v3 mulv v1,v2,v3 lv v1,r1 sv v1,r1 veq v1,v2 Adds registers rf1 and rf2. Result will be stored in rd. Substracts rf2 from rf1. Result will be stored in rd. Load rd with content of memory location dir. Stores in memory location dir the content of rd. If carry flag is set, PC=PC+offset If zero flag is set PC=PC+offset Clears carry flag Sets carry flag. Adds vectors v2 and v3, result will be stored in v1. Substracts v3 from v2, result will be stored in v1 Multiply v2 and v3, result will be stored in v1. Load vector pointed by r1 in v1. Store vector v1 at location pointed by r1. Compares v1and v2, result will be stored in a mask.

Arithmetic instructions have the following format: 15 Opcode 12 11 Register destiny 87 Source Register 1 43 0 Source Register 2

Memory operations are coded as follows:

15 Opcode

12 11 Register

87 Offset

Branch Instructions and compare correspond to :

15 Opcode

12 11 Register 1

87 Register 2

43 Offset

Other instructions such as control can only use de opcode field of format instruction.

3 Processor Architecture
Basically the vector processor has two sections [2]. It has one scalar processor and one vector processor as showed in figure 1.

Main Memory
Instructions (scalar + vector) + Data

Scalar data

Vector data

...
Instr

... . Control Unit Vector Reg.

Scalar Reg.

Fig. 1. Basic architecture of vector processor

3.1 Memory system The first component is the memory system, and is normally the most expensive component in real vector computers because desired characteristics. It is desirable that memory system delivers one data each clock cycle in order to process vectors and generate results as soon as possible, so we need high bandwidth if we want to saturate the occupancy of functional units [3]. An interleaved memory scheme is used to obtain maximum bandwidth reducing the latency observed by memory accesses. Interleaved memory is shown in figure 2.

Fig. 2. Interleaved memory system

3.2 Register file The register file in our design of vector processor is a component that includes eight vector registers of eight elements each. We are using short registers because we only want to show the mechanics of using this type of structures, real machine implementations use 64 or 128 elements per vector register. Resource restrictions on programmable devices affect also at implementation time. Vector registers interact with load/store unit to transfer data from memory to registers and from registers to memory. It is important to consider the number of read and write ports in order to diminish possible conflicts called structural hazards. At least we must include one write port and two read ports. Figure 3 shows the register file.

Fig. 3. Vector register file

3.3 Functional units Figure 4 shows how functional units and register file are connected.

Fig. 4. Register file and functional units

Functional units in vector processors must be deeply pipelined if we want to process so much data as memory can deliver. Functional units must generate results at the same rate of memory transactions, of course one result is produced every clock cycle. We are adding several functional units of integer type as shown in figure 4.

4 Methodology
This project has been captured using software tools such as ISE WebPack 7.0 from Xilinx company. The first step was to capture VHDL modules corresponding to each processor section, after that, it is necessary to perform synthesis of modules. Once we have finished, we can enter to simulation process and then validate the correctness of circuitry. If all steps run successfully we could implement the design in a programmable logic device such as FPGA (Field Programmable Gate Array) [4].

Conclusions and future work

In this paper we describe a simple architecture for vector processing. This project has been captured and simulated, showing correct functionality. Hardware Description Languages allow us to design circuits in a fast way, reducing development times. The advantages of VHDL language permit to students to understand and modify the architecture in order to achieve the goals proposed in computer architecture courses. Future work will include description of advanced architectural techniques such as out of order execution, decoupling etc. Acknowledgments. We would like to thank to Instituto Politcnico Nacional (IPN) for its economical support for the development of this research.

References
[1] J.L. Hennessy and D.A. Patterson. Computer Architecture: a quantitative approach. Apendix G. Morgan Kauffman . 2003. [2] F.Pardo Carpio. Arquitecturas avanzadas. Apuntes de la Universidad de Valencia, Espaa. 2002. [3] J.L. Hennesy and D.A. Patterson. Computer Organization and Design. Ed. Morgan Kauffman [4] F. Pardo, J. A. Boluda. VHDL Lenguaje para sntesis y modelado de circuitos Ed. Alfaomega

02-General Purpose Processors
No ratings yet
02-General Purpose Processors
37 pages
Group IV June 4,2013 EE 3303 Engr - Kenneth F. Fajilan
100% (2)
Group IV June 4,2013 EE 3303 Engr - Kenneth F. Fajilan
170 pages
Computer Architecture Simd Vector Gpu
No ratings yet
Computer Architecture Simd Vector Gpu
16 pages
SIMD
No ratings yet
SIMD
44 pages
7-VECTOR PROCESSING-04-Jan-2020Material - I - 04-Jan-2020 - VECTOR - PROCESSING PDF
No ratings yet
7-VECTOR PROCESSING-04-Jan-2020Material - I - 04-Jan-2020 - VECTOR - PROCESSING PDF
31 pages
FALLSEM2021-22 CSE4001 ETH VL2021220104078 Reference Material I 26-Aug-2021 Module2-SIMD-VectorProcessors
No ratings yet
FALLSEM2021-22 CSE4001 ETH VL2021220104078 Reference Material I 26-Aug-2021 Module2-SIMD-VectorProcessors
16 pages
Vector
No ratings yet
Vector
38 pages
CS6461 - Computer Architecture Fall 2016 - Vector Operations
No ratings yet
CS6461 - Computer Architecture Fall 2016 - Vector Operations
47 pages
Unit Iii Data-Level Parallelism in Vector, Simd, and Gpu Architectures
No ratings yet
Unit Iii Data-Level Parallelism in Vector, Simd, and Gpu Architectures
26 pages
Flynn's Taxonomy: Data-Level Parallelism in Vector, SIMD, and GPU Architectures
No ratings yet
Flynn's Taxonomy: Data-Level Parallelism in Vector, SIMD, and GPU Architectures
28 pages
Chapter 04
No ratings yet
Chapter 04
47 pages
19 Computer Architecture Vector Processor
No ratings yet
19 Computer Architecture Vector Processor
20 pages
Vector
No ratings yet
Vector
42 pages
XX-BSC Compact Vector Processing
No ratings yet
XX-BSC Compact Vector Processing
49 pages
Onur Digitaldesign 2020 Lecture19 Simd Beforelecture
No ratings yet
Onur Digitaldesign 2020 Lecture19 Simd Beforelecture
64 pages
1 Vector Processing: Solutions
No ratings yet
1 Vector Processing: Solutions
16 pages
Unit 3-4
No ratings yet
Unit 3-4
76 pages
CS7103 - MultiCore Architecture Ppts Unit-II
No ratings yet
CS7103 - MultiCore Architecture Ppts Unit-II
43 pages
Guc 315 61 38694 2023-11-23T11 50 52
No ratings yet
Guc 315 61 38694 2023-11-23T11 50 52
33 pages
Why Vector Processing: Deep Pipeline More Parallelism
No ratings yet
Why Vector Processing: Deep Pipeline More Parallelism
7 pages
Organisasi & Arsitektur Komputer
No ratings yet
Organisasi & Arsitektur Komputer
3 pages
Vector Processor
No ratings yet
Vector Processor
13 pages
Lec. 12: Vector Computers: EECS 252 Graduate Computer Architecture
No ratings yet
Lec. 12: Vector Computers: EECS 252 Graduate Computer Architecture
31 pages
VLIW ARCHITECTURE and Pipeline
No ratings yet
VLIW ARCHITECTURE and Pipeline
5 pages
UNIT-V-Pipeline and Array Processing and Multi Processors
No ratings yet
UNIT-V-Pipeline and Array Processing and Multi Processors
51 pages
Ca Part 3
No ratings yet
Ca Part 3
20 pages
Module 1.6
No ratings yet
Module 1.6
53 pages
7TH - Unit 4-21ec74h6 - Ca
No ratings yet
7TH - Unit 4-21ec74h6 - Ca
67 pages
Unit 2
No ratings yet
Unit 2
43 pages
Unit Iii - Aca
No ratings yet
Unit Iii - Aca
13 pages
Onur 447 Spring15 Lecture14 Simd Afterlecture
No ratings yet
Onur 447 Spring15 Lecture14 Simd Afterlecture
60 pages
1699039628chapter 94
No ratings yet
1699039628chapter 94
4 pages
An Instructional Processor Design Using VHDL and An Fpga
No ratings yet
An Instructional Processor Design Using VHDL and An Fpga
10 pages
Computer Architecture AllClasses-Outline-199-294
No ratings yet
Computer Architecture AllClasses-Outline-199-294
96 pages
17.40 Vector - RISCV 20190611 Vectors
No ratings yet
17.40 Vector - RISCV 20190611 Vectors
26 pages
CSE 820 Graduate Computer Architecture Vectors and Multiprocessor Introduction
No ratings yet
CSE 820 Graduate Computer Architecture Vectors and Multiprocessor Introduction
39 pages
EE6304 Lecture13 Processors
No ratings yet
EE6304 Lecture13 Processors
69 pages
CH 04. Data-Level Parallelism in Vector, SIMD, and GPU Architectures
No ratings yet
CH 04. Data-Level Parallelism in Vector, SIMD, and GPU Architectures
50 pages
FPGA Design Final
No ratings yet
FPGA Design Final
4 pages
Avr A & A: Rchitecture Ssembly
No ratings yet
Avr A & A: Rchitecture Ssembly
45 pages
CRAY-1 Brochure 1975
No ratings yet
CRAY-1 Brochure 1975
15 pages
Module 4 Chapter 2
No ratings yet
Module 4 Chapter 2
42 pages
Data-Level Parallelism in Vector, SIMD, and GPU Architectures
No ratings yet
Data-Level Parallelism in Vector, SIMD, and GPU Architectures
58 pages
Data-Level Parallelism Vector and GPU
No ratings yet
Data-Level Parallelism Vector and GPU
6 pages
Anization
No ratings yet
Anization
72 pages
COA Unit V B
No ratings yet
COA Unit V B
5 pages
CA Classes-201-205
No ratings yet
CA Classes-201-205
5 pages
Architecture Chapter4 E5 2012
No ratings yet
Architecture Chapter4 E5 2012
92 pages
Lab 7 PDF
No ratings yet
Lab 7 PDF
9 pages
COE4590 14 Vector
No ratings yet
COE4590 14 Vector
14 pages
Module 5 Coa
No ratings yet
Module 5 Coa
11 pages
Chapter 3 General-Purpose Processors: Software
No ratings yet
Chapter 3 General-Purpose Processors: Software
44 pages
Chapter04 ProcessorDesign PDF
No ratings yet
Chapter04 ProcessorDesign PDF
39 pages
Module 1: Basic Structure of Computers 1.1 Basic Operational Concepts
No ratings yet
Module 1: Basic Structure of Computers 1.1 Basic Operational Concepts
34 pages
Code Generation
No ratings yet
Code Generation
30 pages
Chapter 8
No ratings yet
Chapter 8
59 pages
Getting Started
No ratings yet
Getting Started
14 pages
Vector Processor
No ratings yet
Vector Processor
83 pages
The Von Neumann Computer Model
No ratings yet
The Von Neumann Computer Model
11 pages
l22 Vector
No ratings yet
l22 Vector
32 pages
Mastering C: Advanced Techniques and Tricks
From Everand
Mastering C: Advanced Techniques and Tricks
Ted Norice
No ratings yet
Ku Band VSAT Blockupconverters
No ratings yet
Ku Band VSAT Blockupconverters
5 pages
2.voice Operated Induction Motor Speed Control Through RF Communication
100% (1)
2.voice Operated Induction Motor Speed Control Through RF Communication
2 pages
PM4H A
No ratings yet
PM4H A
14 pages
WUBQ-159ACN Datasheet Ver.1.2 20190322
No ratings yet
WUBQ-159ACN Datasheet Ver.1.2 20190322
5 pages
Name: Joaquim Minelson Cuanga Mingas N.20220161 Lcc1M
No ratings yet
Name: Joaquim Minelson Cuanga Mingas N.20220161 Lcc1M
17 pages
Silergy Sy7711 2
No ratings yet
Silergy Sy7711 2
9 pages
TSX 303A Spellman X Ray
No ratings yet
TSX 303A Spellman X Ray
46 pages
Clap Switching PDF
No ratings yet
Clap Switching PDF
13 pages
Manual Servo Driver2
No ratings yet
Manual Servo Driver2
66 pages
Linkswitch II Family Datasheet 1512164
No ratings yet
Linkswitch II Family Datasheet 1512164
19 pages
d000523 Doseuse Remplisseuse
No ratings yet
d000523 Doseuse Remplisseuse
2 pages
EI2301 IE Notes Full
No ratings yet
EI2301 IE Notes Full
103 pages
Lipi PB2 PDF
100% (1)
Lipi PB2 PDF
4 pages
Variable Speed Drive For Converter Fed Synchronous Machine - PCS 8000 Variable-Speed Converter - (Converters For Pumped Storage Plants - ) - ABB
No ratings yet
Variable Speed Drive For Converter Fed Synchronous Machine - PCS 8000 Variable-Speed Converter - (Converters For Pumped Storage Plants - ) - ABB
3 pages
Term Paper
No ratings yet
Term Paper
9 pages
Ezbatteryreconditioning PDF
No ratings yet
Ezbatteryreconditioning PDF
5 pages
Telemecanique Catalog
No ratings yet
Telemecanique Catalog
75 pages
Manual CO12e 4 - Manual - CO12e - 19.09
No ratings yet
Manual CO12e 4 - Manual - CO12e - 19.09
112 pages
晶片系統設計流程與工具 Soc Design Flow & Tools: 熊博安 (Pao-Ann Hsiung) 國立中正大學資訊工程研究所 (National Chung Cheng University, Csie)
No ratings yet
晶片系統設計流程與工具 Soc Design Flow & Tools: 熊博安 (Pao-Ann Hsiung) 國立中正大學資訊工程研究所 (National Chung Cheng University, Csie)
12 pages
T4 - Series - Handbook Light Curtains
No ratings yet
T4 - Series - Handbook Light Curtains
21 pages
Entirety Brazil SEI ANATEL 6600480 Act
No ratings yet
Entirety Brazil SEI ANATEL 6600480 Act
5 pages
Multigate Device: Industry Need
No ratings yet
Multigate Device: Industry Need
4 pages
Characteristics of NTC Thermistor.
No ratings yet
Characteristics of NTC Thermistor.
4 pages
Low Voltage Distribution Fuse Boards (Feeder Pillars)
0% (1)
Low Voltage Distribution Fuse Boards (Feeder Pillars)
22 pages
I T Syllabus Pondicherry University
No ratings yet
I T Syllabus Pondicherry University
3 pages
Service Manual: Multimedia Projector Model No. PLC-XU56
No ratings yet
Service Manual: Multimedia Projector Model No. PLC-XU56
116 pages
Sony DCR-SX33E Service Manual
No ratings yet
Sony DCR-SX33E Service Manual
31 pages
Electromagnetic Field Interaction With Transmission Lines
100% (2)
Electromagnetic Field Interaction With Transmission Lines
280 pages
Proxar-Iin Ac KK 06 en 01 2023
No ratings yet
Proxar-Iin Ac KK 06 en 01 2023
6 pages

Simple Vector Processor Modeled With VHDL

Uploaded by

Simple Vector Processor Modeled With VHDL

Uploaded by

Simple Vector Processor Modeled with VHDL

Keywords: Vector Processor, Hardware Description Language, VHDL.

2 Instruction Set Architecture

Memory operations are coded as follows:

Branch Instructions and compare correspond to :

... . Control Unit Vector Reg.

Fig. 1. Basic architecture of vector processor

Fig. 2. Interleaved memory system

Fig. 3. Vector register file

Fig. 4. Register file and functional units

Conclusions and future work

You might also like