0% found this document useful (0 votes)
2 views50 pages

CH08 COA11e

Chapter 8 of 'Computer Organization and Architecture' discusses Input/Output (I/O) systems, detailing the types of external devices, I/O functions, and various I/O techniques such as programmed I/O, interrupt-driven I/O, and direct memory access (DMA). It also covers I/O commands, mapping techniques, and design issues related to I/O operations, including device identification and the drawbacks of programmed and interrupt-driven I/O. The chapter highlights the importance of efficient data transfer methods in modern computing environments, particularly in relation to network traffic and performance issues.

Uploaded by

Jaden Sooklal
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2 views50 pages

CH08 COA11e

Chapter 8 of 'Computer Organization and Architecture' discusses Input/Output (I/O) systems, detailing the types of external devices, I/O functions, and various I/O techniques such as programmed I/O, interrupt-driven I/O, and direct memory access (DMA). It also covers I/O commands, mapping techniques, and design issues related to I/O operations, including device identification and the drawbacks of programmed and interrupt-driven I/O. The chapter highlights the importance of efficient data transfer methods in modern computing environments, particularly in relation to network traffic and performance issues.

Uploaded by

Jaden Sooklal
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 50

Computer Organization and Architecture

Designing for Performance


11th Edition

Chapter 8
Input/Output

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Figure 8.1
Generic Model of an I/O Module

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
External Devices
Three
• Provide a means of
exchanging data between
the external environment
categories:
and the computer • Human readable
• – Suitable for communicating with the
Attach to the computer by a
computer user
link to an I/O module – Video display terminals (VDTs), printers
– The link is used to exchange
control, status, and data • Machine readable
between the I/O module and
– Suitable for communicating with
the external device equipment
– Magnetic disk and tape systems,
• Peripheral device sensors and actuators
– An external device connected
to an I/O module • Communication
– Suitable for communicating with remote
devices such as a terminal, a machine
readable device, or another computer

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Figure 8.2
Block Diagram of an External Device

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Most common means of
computer/user interaction

Keyboard/Monitor User provides input through the


keyboard
The monitor displays data provided
International Reference Alphabet by the computer
(IRA)
• Basic unit of exchange is the character • Keyboard Codes
– Associated with each character is a code
– Each character in this code is • When the user depresses a key it generates
represented by a unique 7-bit binary an electronic signal that is interpreted by the
code transducer in the keyboard and translated into
 the bit pattern of the corresponding IRA code
128 different characters can be represented
• This bit pattern is transmitted to the I/O module
• Characters are of two types:
in the computer
– Printable
 Alphabetic, numeric, and special characters
• On output, IRA code characters are transmitted
that can be printed on paper or displayed on to an external device from the I/O module
a screen
• The transducer interprets the code and sends
– Control the required electronic signals to the output
 Have to do with controlling the printing or device either to display the indicated character
displaying of characters or perform the requested control function
 Example is carriage return
 Other control characters are concerned with
communications procedures
Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
I/O Functions
The major functions for an I/O
module fall into the following
categories:
Control and timing
• Coordinates the flow of traffic between internal resources and external
devices
Processor communication
• Involves command decoding, data, status reporting, address recognition

Device communication
• Involves commands, status information, and data

Data buffering
• Performs the needed buffering operation to balance device and memory
speeds
Error detection
• Detects and reports transmission errors

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Figure 8.3
Block Diagram of an I/O Module

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Programmed I/O

Three techniques are possible for I/O operations:


• Programmed I/O
– Data are exchanged between the processor and the I/O module
– Processor executes a program that gives it direct control of the I/O operation
– When the processor issues a command it must wait until the I/O operation is
complete
– If the processor is faster than the I/O module this is wasteful of processor time
• Interrupt-driven I/O
– Processor issues an I/O command, continues to execute other instructions, and
is interrupted by the I/O module when the latter has completed its work
• Direct memory access (DMA)
– The I/O module and main memory exchange data directly without processor
involvement

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Table 8.1
I/O Techniques

No Interrupts Use of Interrupts

I/O-to-memory transfer through


Programmed I/O Interrupt-driven I/O
processor

Direct I/O-to-memory transfer Direct memory access (DMA)

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
I/O Commands

• There are four types of I/O commands that an I/O module may
receive when it is addressed by a processor:
1) Control
– used to activate a peripheral and tell it what to do

2) Test
– used to test various status conditions associated with an I/O module and its
peripherals

3) Read
– causes the I/O module to obtain an item of data from the peripheral and place it in an
internal buffer

4) Write
– causes the I/O module to take an item of data from the data bus and subsequently
transmit that data item to the peripheral

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Figure 8.4
Three Techniques for Input of a Block of Data

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
I/O Instructions
With programmed I/O there is a close correspondence between the I/O-
related instructions that the processor fetches from memory and the I/O
commands that the processor issues to an I/O module to execute the
instructions
• The form of the instruction depends on the way in which external devices are addressed

Each I/O device connected through I/O modules is given a unique identifier
or address
• When the processor issues an I/O command, the command contains the address of the desired
device
• Thus, each I/O module must interpret the address lines to determine if the command is for itself

Memory-mapped I/O

• There is a single address space for memory locations and I/O devices
• A single read line and a single write line are needed on the bus

Isolated I/O

• Alternatively, the bus may be equipped with memory read and write plus input and output
command lines.
• the command line specifies whether the address refers to a memory location or an I/O device.
Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
I/O Mapping Summary
• Memory mapped I/O
– Devices and memory share an address space
– I/O looks just like memory read/write
– No special commands for I/O
▪ Large selection of memory access commands available

• Isolated I/O
– Separate address spaces
– Need I/O or memory select lines
– Special commands for I/O
▪ Limited set

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Figure 8.5
Memory-Mapped and Isolated I/O

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Interrupt-Driven I/O
The problem with programmed I/O is that the
processor has to wait a long time for the I/O
module to be ready for either reception or
transmission of data

An alternative is for the processor to issue an


I/O command to a module and then go on to do
some other useful work

The I/O module will then interrupt the processor


to request service when it is ready to exchange
data with the processor

The processor executes the data transfer and


resumes its former processing

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Figure 8.6
Simple Interrupt Processing

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Figure 8.7
Changes in Memory and Registers for an
Interrupt

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Design Issues
• Because there
will be multiple
I/O modules how
does the
Two design processor
determine which
issues arise device issued
in the interrupt?
implementin • If multiple
g interrupt interrupts have
I/O: occurred how
does the
processor decide
which one to
process?
Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Device Identification
Four general categories of techniques are in common use:
• Multiple interrupt lines
– Between the processor and the I/O modules
– Most straightforward approach to the problem
– Consequently even if multiple lines are used, it is likely that each line will have multiple
I/O modules attached to it
• Software poll
– When the processor detects an interrupt it branches to an interrupt-service routine
whose job is to poll each I/O module to determine which module caused the interrupt
– Time consuming
• Daisy chain (hardware poll, vectored)
– The interrupt acknowledge line is daisy chained through the modules
– Vector – address of the I/O module or some other unique identifier
– Vectored interrupt – processor uses the vector as a pointer to the appropriate device-
service routine, avoiding the need to execute a general interrupt-service routine first
• Bus arbitration (vectored)
– An I/O module must first gain control of the bus before it can raise the interrupt request
line
– When the processor detects the interrupt it responds on the interrupt acknowledge line
– Then the requesting Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
module places its vector on the data lines
Figure 8.8
Use of the 82C59A Interrupt Controller

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Figure 8.9
The Intel 8255A Programmable Peripheral
Interface

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Figure 8.10
The Intel 8255A Control Word

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Figure 8.11
Keyboard/Display Interface to 8255A

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Drawbacks of Programmed and Interrupt-
Driven I/O

• Both forms of I/O suffer from two inherent drawbacks:


1) The I/O transfer rate is limited by the speed with which the
processor can test and service a device

2) The processor is tied up in managing an I/O transfer; a number


of instructions must be executed for each I/O transfer

• When large volumes of data are to be moved a more


efficient technique is direct memory access (DMA)

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Figure 8.12
Typical DMA Block Diagram

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Figure 8.13
DMA and Interrupt Breakpoints during an
Instruction Cycle

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Figure 8.14
Alternative DMA Configurations

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Figure 8.15
8237 DMA Usage of System Bus

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Fly-By DMA Controller

Data does not pass 8237 contains four


through and is not DMA channels
stored in DMA chip • Can be
• DMA can only programmed
transfer data Can do memory to independently
between an I/O memory via register • Any one of the
port and a channels may be
memory address active at any
• Not between two moment
I/O ports or two • These channels
memory locations are numbered 0,
1, 2, and 3

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Table 8.2
Intel 8237A Registers
Bit Command Status Mode Single Mask All Mask

D0 Memory- to- Channel 0 has Clear/set chan-


memory E/D reached TC nel 0 mask bit
Select channel
Channel select
mask bit
D1 Channel 0 Channel 1 has Clear/set chan-
address hold E/D reached TC nel 1 mask bit
D2 Controller E/D Channel 2 has Verify/write/read Clear/set Clear/set chan-
reached TC transfer mask bit nel 2 mask bit

D3 Normal/com- Channel 3 has Clear/set chan-


pressed timing reached TC nel 3 mask bit

D4 Fixed/rotating Channel 0 request Auto- initialization


priority E/D

D5 Late/extended Channel 0 request Address increment/


write selection decrement select

D6 DREQ sense Channel 0 request


active high/low

D7 DACK sense Channel 0 request Demand/single/


active high/low block/cascade mode
select

E/D = enable/disable
TC = terminal count
Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Direct Cache Access (DCA)

• DMA is not able to scale to meet the increased demand due


to dramatic increases in data rates for network I/O
• Demand is coming primarily from the widespread deployment
of 10-Gbps and 100-Gbps Ethernet switches to handle
massive amounts of data transfer to and from database
servers and other high-performance systems
• Another source of traffic comes from Wi-Fi in the gigabit range
• Network Wi-Fi devices that handle 3.2 Gbps and 6.76 Gbps
are becoming widely available and producing demand on
enterprise systems

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Figure 8.16
Xeon E5-2600/4600 Chip Architecture

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Cache-Related Performance Issues (1 of 2)
Network traffic is transmitted in the form of a sequence of protocol blocks called packets or
protocol data units

The lowest, or link, level protocol is typically Ethernet, so that each arriving and departing
block of data consists of an Ethernet packet containing as payload the higher-level protocol
packet

The higher-level protocols are usually the Internet Protocol (IP), operating on top of
Ethernet and the Transmission Control Protocol (TCP), operating on top of IP

The Ethernet payload consists of a block of data with a TCP header and an IP header

For outgoing data, Ethernet packets are formed in a peripheral component, such as in I/O
controller or network interface controller (NIC)

For incoming traffic, the I/O controller strips off the Ethernet information and delivers the TCP/IP
packet to the host CPU

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Cache-Related Performance Issues (2 of 2)

In a DMA scheme, when an


application wishes to
transmit data, it places that
data in an application-
assigned buffer in main
memory
For both • The core transfers this to a system
buffer in main memory and creates
outgoing and the necessary TCP and IP headers,
which are also buffered in system
incoming traffic memory
• The packet is then picked up via
the core, main DMA for transfer via the NIC
• This activity engages not only main
memory, and memory but also the cache
• Similar transfers between system
cache are all and application buffers are
required for incoming traffic
involved
Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Packet Traffic Steps:

Incoming • Outgoing
• Packet arrives
• Packet transfer requested
• DMA
• Packet created
• NIC interrupts host
• Output operation invoked
• Retrieve descriptors and
headers • DMA transfer
• Cache miss occurs • NIC signals completion
• Header is processed • Driver frees buffer
• Payload transferred

© 2018 Pearson Education, Inc., Hoboken, NJ. All rights reserved.


Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Direct Cache Access Strategies

Simplest strategy was implemented as a prototype on a number of


Intel Xeon processors between 2006 and 2010
The DCA function in the memory
This enables the core to prefetch
This form of DCA applies only to controller sends a prefetch hint to
the data packet from the system
incoming network traffic the core as soon as the data is
buffer
available in system memory

Much more substantial gains can be realized by avoiding the system


buffer in main memory altogether
For incoming
The packet and
packets, the core Implemented in
packet descriptor It has no need to
reads the data from Intel’s Xeon
information are access that data in
the buffer and Cache injection processor line,
accessed only once the system buffer
transfers the packet referred to as Direct
in the system buffer again
payload to an Data I/O
by the core
application buffer

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Figure 8.17
Comparison of DMA and DDIO

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Evolution of the I/O Function
1. The CPU directly controls a 4. The I/O module is given direct
peripheral device. access to memory via DMA. It can
now move a block of data to or
2. A controller or I/O module from memory without involving the
is added. The CPU uses CPU, except at the beginning and
programmed I/O without end of the transfer.
interrupts.
5. The I/O module is enhanced to
3. Same configuration as in become a processor in its own
step 2 is used, but now right, with a specialized instruction
interrupts are employed. set tailored for I/O
The CPU need not spend
time waiting for an I/O 6. The I/O module has a local
operation to be performed, memory of its own and is, in fact, a
thus increasing efficiency. computer in its own right. With this
architecture a large set of I/O
devices can be controlled with
minimal CPU involvement.

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Figure 8.18
I/O Channel Architecture

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
External
Interconnection
Standards

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Universal Serial Bus (USB)
• Widely used for peripheral connections
• Is the default interface for slower speed devices
• Commonly used high-speed I/O
• Has gone through multiple generations
– USB 1.0
▪ Defined a Low Speed data rate of 1.5 Mbps and a Full Speed rate of 12 Mbps
– USB 2.0
▪ Provides a data rate of 480 Mbps
– USB 3.0
▪ Higher speed bus called SuperSpeed in parallel with the USB 2.0 bus
▪ Signaling speed of SuperSpeed is 5 Gbps, but due to signaling overhead the usable data rate is up to
4 Gbps
– USB 3.1
▪ Includes a faster transfer mode called SuperSpeed+
▪ This transfer mode achieves a signaling rate of 10 Gbps and a theoretical usable data rate of 9.7 Gbps

• Is controlled by a root host controller which attaches to devices to create a local


network with a hierarchical tree topology
Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
FireWire Serial Bus
• Was developed as an alternative to small computer system interface (SCSI) to
be used on smaller systems, such as personal computers, workstations, and
servers
• Objective was to meet the increasing demands for high I/O rates while avoiding
the bulky and expensive I/O channel technologies developed for mainframe and
supercomputer systems
• IEEE standard 1394, for a High Performance Serial Bus
• Uses a daisy chain configuration, with up to 63 devices connected off a single
port
• 1022 FireWire buses can be interconnected using bridges
• Provides for hot plugging which makes it possible to connect and disconnect
peripherals without having to power the computer system down or reconfigure
the system
• Provides for automatic configuration
• No terminations and the system automatically performs a configuration function
to assign addresses
Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
SCSI

• Small Computer System Interface


• A once common standard for connecting peripheral devices
to small and medium-sized computers
• Has lost popularity to USB and FireWire in smaller systems
• High-speed versions remain popular for mass memory
support on enterprise systems
• Physical organization is a shared bus, which can support
up to 16 or 32 devices, depending on the generation of the
standard
– The bus provides for parallel transmission rather than serial, with a bus
width of 16 bits on earlier generations and 32 bits on later generations
– Speeds range from 5 Mbps on the original SCSI-1 specification to 160 Mbps
on SCSI-3 U3 Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Thunderbolt
• Most recent and fastest
peripheral connection
technology to become available
for general-purpose use
• Developed by Intel with
collaboration from Apple • Provides up to 10 Gbps
• The technology combines data, throughput in each direction
video, audio, and power into a and up to 10 Watts of power to
connected peripherals
single high-speed connection
for peripherals such as hard
drives, RAID arrays, video-
capture boxes, and network
interfaces

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
InfiniBand

• I/O specification aimed at the high-end server market


• First version was released in early 2001
• Heavily relied on by IBM zEnterprise series of mainframes
• Standard describes an architecture and specifications for data flow
among processors and intelligent I/O devices
• Has become a popular interface for storage area networking and
other large storage configurations
• Enables servers, remote storage, and other network devices to be
attached in a central fabric of switches and links
• The switch-based architecture can connect up to 64,000 servers,
storage systems, and networking devices

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
PCI Express and SATA
PCI Express • SATA
• High-speed bus system for • Serial Advanced Technology
connecting peripherals of a Attachment
wide variety of types and • An interface for disk storage
speeds systems
• Provides data rates of up to 6
Gbps, with a maximum per
device of 300 Mbps
• Widely used in desktop
computers and in industrial
and embedded applications

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Ethernet
• Predominant wired
networking technology
• Has moved from bus-based to
• Has evolved to support data switch-based
rates up to 100 Gbps and – Data rate has periodically
distances from a few meters increased by an order of
to tens of km magnitude
• Has become essential for – There is a central switch
supporting personal with all of the devices
computers, workstations, connected directly to the
servers, and massive data switch
storage devices in
organizations large and small • Ethernet systems are currently
available at speeds up to 100
• Began as an experimental Gbps
bus-based 3-Mbps system

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Wi-Fi
• Is the predominant wireless • As the technology of
Internet access technology antennas, wireless
transmission techniques, and
• Now connects computers, wireless protocol design has
tablets, smart phones, and evolved, the IEEE 802.11
other electronic devices such committee has been able to
as video cameras TVs and introduce standards for new
thermostats versions of Wi-Fi at higher
• speeds
In the enterprise has become
an essential means of • Current version is 802.11ac
enhancing worker productivity (2014) with a maximum data
and network effectiveness rate of 3.2 Gbps
• Public hotspots have expanded
dramatically to provide free
Internet access in most public
places
Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Summary
•Input/
Chapter 8 Output
• Direct Cache Access
• External devices
• I/O channels and
• I/O modules processors
• Programmed I/O • External
interconnection
• Interrupt-driven I/O standards
• Direct memory
access

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Copyright

This work is protected by United States copyright laws and is provided solely
for the use of instructions in teaching their courses and assessing student
learning. dissemination or sale of any part of this work (including on the
World Wide Web) will destroy the integrity of the work and is not permit-
ted. The work and materials from it should never be made available to
students except by instructors using the accompanying text in their
classes. All recipients of this work are expected to abide by these
restrictions and to honor the intended pedagogical purposes and the needs of
other instructors who rely on these materials.

Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved

You might also like