CCE 131
Lecture 1
Introduction-1
Servers
Servers are the modern form of what were once much larger computers and are usually accessed only via a network. They are oriented to carrying sizable workloads, which may consist of either single complex applications (usually a scientific or engineering application) or many small jobs, such as would occur in building a large web server.
Supercomputers
Supercomputers consist of tens of thousands of processors and many terabytes of memory, and cost tens to hundreds of millions of dollars. They are usually used for high-end scientific and engineering calculations, such as weather forecasting, oil exploration, protein structure determination, and other large-scale problems.
Embedded computers
Embedded computers are the largest class of computers and span the widest range of applications and performance. They include the microprocessors found in your car, the computers in a television set, and the networks of processors that control a modern airplane or cargo ship. Embedded computing systems are designed to run one application or one set of related applications that are normally integrated with the hardware and delivered as a single system.
Application software
A typical application, such as a word processor or a large database system, may
consist of millions of lines of code and rely on sophisticated software libraries that
implement complex functions in support of the application.
Systems software
Systems software sits between the hardware and the application software.
There are many types of systems software, but two types of systems software are central to every computer system today: an
operating system and a compiler.
An operating system interfaces between a user’s program and the hardware and provides a variety of services and supervisory
functions. Among the most important functions are:
❑ Handling basic input and output operations
❑ Allocating storage and memory
❑ Providing for protected sharing of the computer among multiple
applications using it simultaneously
Compilers
Compilers perform another vital function: the translation of a program written in a high-level language, such as C, C++, Java, or Visual Basic, into instructions that the hardware can execute. Given the sophistication of modern programming languages and the simplicity of the instructions executed by the hardware, the translation from a high-level language program to hardware instructions is complex.
The embedded system build process is usually done on the host PC using cross-compilation tools, because the target hardware does not have enough resources to run the tools that generate its binary image. The process of compiling code on one system (the host) so that the resulting binary runs on another system (the target) is known as cross-compilation.
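To make the idea concrete, here is a minimal sketch: the same C source is built on the host PC twice, once with the native compiler and once with a cross compiler (the ARM toolchain prefix below is an assumption; the actual command depends on the target board).

/* hello.c - the same source, compiled on the host PC for two machines.
   Native build (binary runs on the host):              gcc -o hello hello.c
   Cross build (binary runs on the embedded ARM target,
   assuming an ARM GNU toolchain is installed):         arm-none-eabi-gcc -o hello.elf hello.c */
int main(void) {
    return 0;   /* trivial program; the point is which machine can run the binary */
}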
The five classic components
The five classic components of a computer are input, output, memory, datapath, and control, with the last two sometimes
combined and called the processor. This organization is independent of hardware technology: you can place every piece of
every computer, past and present, into one of these five categories.
Instruction set architecture (also called architecture): an abstract interface between the hardware and the lowest-level software that encompasses all the information necessary to write a machine language program that will run correctly, including instructions, registers, memory access, I/O, and so on.
Source: https://geteducationskills.com/computer-architecture/
Technologies for Building Processors and Memory
The integrated circuit (IC) combined dozens to hundreds of transistors into a single chip.
If you were running a program on two different desktop computers, you’d say that the faster one is the desktop computer
that gets the job done first.
As an individual computer user, you are interested in reducing response time (the time between the start and completion of a task), also referred to as execution time.
Datacenter managers often care about increasing throughput or bandwidth (the total amount of work done in a given
time).
In most cases, we will need different performance metrics as well as different sets of applications to benchmark personal
mobile devices, which are more focused on response time, versus servers, which are more focused on throughput.
A task
To maximize performance, we want to minimize response time or execution time for some task. Thus, we can relate performance and execution time for a computer X:
Performance_X = 1 / Execution time_X
This means that for two computers X and Y, if the performance of X is greater than the performance of Y, we have
Performance_X > Performance_Y
1 / Execution time_X > 1 / Execution time_Y
Execution time_Y > Execution time_X
That is, the execution time on Y is longer than that on X, if X is faster than Y.
We will use the phrase "X is n times faster than Y", or equivalently "X is n times as fast as Y", to mean
Performance_X / Performance_Y = n
If X is n times as fast as Y, then the execution time on Y is n times as long as it is on X:
Performance_X / Performance_Y = Execution time_Y / Execution time_X = n
Example
If computer A runs a program in 10 seconds and computer B runs the same program in 15 seconds, how much faster is A
than B?
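Working this out with the relation above:
Performance_A / Performance_B = Execution time_B / Execution time_A = 15 / 10 = 1.5
So A is 1.5 times as fast as B for this program.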
System CPU time: the CPU time spent in the operating system performing tasks on behalf of the program.
A simple formula relates the most basic metrics (clock cycles and clock cycle time) to CPU time:
CPU execution time for a program = CPU clock cycles for a program x Clock cycle time
Alternatively, because clock rate and clock cycle time are inverses,
CPU execution time for a program = CPU clock cycles for a program / Clock rate
This formula makes it clear that the hardware designer can improve performance by reducing the number of clock cycles required for a program or the length of the clock cycle.
Example
Our favorite program runs in 10 seconds on computer A, which has a 2 GHz clock. We are trying to help a computer designer
build a computer, B, which will run this program in 6 seconds. The designer has determined that a substantial increase in the
clock rate is possible, but this increase will affect the rest of the CPU design, causing computer B to require 1.2 times as
many clock cycles as computer A for this program. What clock rate should we tell the designer to target?
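Working from the CPU time formula above:
CPU clock cycles_A = CPU time_A x Clock rate_A = 10 seconds x 2 x 10^9 cycles/second = 20 x 10^9 cycles
CPU clock cycles_B = 1.2 x 20 x 10^9 = 24 x 10^9 cycles
Clock rate_B = CPU clock cycles_B / CPU time_B = 24 x 10^9 cycles / 6 seconds = 4 x 10^9 cycles/second
So computer B must run at 4 GHz, twice the clock rate of computer A, to finish the program in 6 seconds.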
One way to think about execution time is that it equals the number of instructions executed multiplied by the average time per instruction. Therefore, the number of clock cycles required for a program can be written as
CPU clock cycles = Instructions for a program x Average clock cycles per instruction
The term clock cycles per instruction, which is the average number of clock cycles each instruction takes to
execute, is often abbreviated as CPI. Since different instructions may take different amounts of time depending on
what they do, CPI is an average of all the instructions executed in the program. CPI provides one way of comparing
two different implementations of the identical instruction set architecture, since the number of instructions executed
for a program will, of course, be the same.
Example
Suppose we have two implementations of the same instruction set architecture. Computer A has a clock cycle time of 250
ps and a CPI of 2.0 for some program, and computer B has a clock cycle time of 500 ps and a CPI of 1.2 for the same
program. Which computer is faster for this program and by how much?
We know that each computer executes the same number of instructions for the program; let’s call this number I. First, find
the number of processor clock cycles for each computer:
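Substituting the given values:
CPU clock cycles_A = I x 2.0 and CPU clock cycles_B = I x 1.2
CPU time_A = CPU clock cycles_A x Clock cycle time_A = I x 2.0 x 250 ps = 500 x I ps
CPU time_B = I x 1.2 x 500 ps = 600 x I ps
Computer A is faster for this program; it is (600 x I ps) / (500 x I ps) = 1.2 times as fast as computer B.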
The Classic CPU Performance Equation
We can now write this basic performance equation in terms of instruction count (the number of instructions executed by the program), CPI, and clock cycle time:
CPU time = Instruction count x CPI x Clock cycle time
or, since the clock rate is the inverse of clock cycle time,
CPU time = Instruction count x CPI / Clock rate
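As a small sketch of how the equation is applied (the instruction count, CPI, and clock rate below are assumed example values, not taken from the lecture):

#include <stdio.h>

int main(void) {
    double instruction_count = 10e9;   /* 10 billion instructions (assumed) */
    double cpi = 2.0;                  /* average clock cycles per instruction (assumed) */
    double clock_rate = 2e9;           /* 2 GHz, so clock cycle time = 1 / clock_rate */

    /* CPU time = Instruction count x CPI x Clock cycle time
                = Instruction count x CPI / Clock rate        */
    double cpu_time = instruction_count * cpi / clock_rate;
    printf("CPU time = %.1f seconds\n", cpu_time);   /* prints 10.0 */
    return 0;
}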
Which code sequence executes the most instructions? Which will be faster? What is the CPI for each sequence?
Both clock rate and power increased rapidly for decades and then flattened off recently. The reason they grew together is
that they are correlated, and the reason for their recent slowing is that we have run into the practical power limit for
cooling commodity microprocessors.
For CMOS, the primary source of energy consumption is so-called dynamic energy, that is, the energy consumed when transistors switch states from 0 to 1 and vice versa. The dynamic energy depends on the capacitive loading of each transistor and the voltage applied:
Energy ∝ Capacitive load x Voltage^2
This is the energy of a full pulse (0 -> 1 -> 0 or 1 -> 0 -> 1); the energy of a single transition is half of that. The power required per transistor is the energy of a transition multiplied by the frequency of transitions:
Power ∝ 1/2 x Capacitive load x Voltage^2 x Frequency switched
Frequency switched is a function of the clock rate. The capacitive load per transistor is a function of both the
number of transistors connected to an output (called the fanout) and the technology, which determines the
capacitance of both wires and transistors.
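As a quick numerical illustration of the quadratic dependence on voltage (the 15% figure is only an example value): if a new process lowers the supply voltage by 15% while the capacitive load per transistor stays the same, the energy of each transition falls to about 0.85^2 ≈ 0.72 of its old value, a reduction of roughly 28%.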
Multiprocessors
To reduce confusion between the words processor and microprocessor, companies refer to processors as “cores,” and such
microprocessors are generically called multicore microprocessors. Hence, a “quadcore” microprocessor is a chip that
contains four processors or four cores.
Today, for programmers to get significant improvement in response time, they need to rewrite their programs to take
advantage of multiple processors. Moreover, to get the historic benefit of running faster on new microprocessors,
programmers will have to continue to improve the performance of their code as the number of cores increases.
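A minimal sketch of what such a rewrite can look like (example code, assuming a POSIX system with pthreads; compile with gcc -pthread): two threads each sum half of an array, so the work can proceed on two cores at once.

#include <pthread.h>
#include <stdio.h>

#define N 1000000
static double data[N];

struct part { int lo, hi; double sum; };   /* the slice of the array each thread works on */

static void *partial_sum(void *arg) {
    struct part *p = arg;
    p->sum = 0.0;
    for (int i = p->lo; i < p->hi; i++)
        p->sum += data[i];
    return NULL;
}

int main(void) {
    for (int i = 0; i < N; i++) data[i] = 1.0;

    struct part a = { 0, N / 2, 0.0 };     /* first half  */
    struct part b = { N / 2, N, 0.0 };     /* second half */
    pthread_t ta, tb;

    pthread_create(&ta, NULL, partial_sum, &a);   /* may run on one core */
    pthread_create(&tb, NULL, partial_sum, &b);   /* may run on another core */
    pthread_join(ta, NULL);
    pthread_join(tb, NULL);

    printf("total = %.0f\n", a.sum + b.sum);
    return 0;
}

The sequential version would be a single loop over all N elements; the parallel version divides the loop range between the threads, which is the kind of restructuring the paragraph above refers to.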
Amdahl’s Law
A rule stating that the performance enhancement possible with a given improvement is limited by the amount that the
improved feature is used.
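Expressed as a formula, using the terms of the definition above:
Execution time after improvement = (Execution time affected by improvement / Amount of improvement) + Execution time unaffected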
A simple design problem illustrates it well. Suppose a program runs in 100 seconds on a computer, with multiply
operations responsible for 80 seconds of this time. How much do I have to improve the speed of multiplication if I want
my program to run five times faster?
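Working through it with Amdahl's Law: running five times faster means finishing in 100 / 5 = 20 seconds. The 20 seconds not spent in multiplication cannot be reduced by a faster multiplier, so we would need
20 seconds = 80 seconds / n + 20 seconds, which requires 80 / n = 0.
No finite improvement n in the speed of multiplication can make this program run five times faster, because the unimproved 20 seconds already equal the entire target time.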