Lecture 2.3 Information Coding Scheme


2.3 Information Coding Schemes

Learning Outcomes
1. Identify the coding systems used in data processing
2. Differentiate between the coding systems

Introduction
The combinations of 0s and 1s that represent characters are defined by a set of patterns called a coding scheme.
These characters include numbers, uppercase and lowercase letters of the alphabet, punctuation marks, and others, such as the letters of the Greek alphabet.
Coding schemes make it possible for humans to interact with a digital computer that processes only bits.
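
As a rough illustration of how a coding scheme turns characters into bits, the following Python sketch prints the bit pattern behind each character of a short string. The helper name `to_bits` and the default width of 8 bits are assumptions made for this example, not part of the lecture.

```python
def to_bits(text, width=8):
    """Return the code of each character as a zero-padded bit string."""
    # ord() gives the numeric code of a character; format() renders it in binary.
    return [format(ord(ch), f"0{width}b") for ch in text]


bits = to_bits("Hi!")
print(bits)  # ['01001000', '01101001', '00100001']
```

The computer stores and processes only the bit strings; the coding scheme is the agreement that lets them be shown to a person as `H`, `i` and `!`.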

Three popular coding systems are used to represent data:

ASCII (American Standard Code for Information Interchange)
EBCDIC (Extended Binary Coded Decimal Interchange Code)
Unicode (a uniform code capable of representing all the world's languages)

ASCII
The American Standard Code for Information Interchange (ASCII) is widely used in computers of all types.
ASCII codes are of two types: ASCII-7 and ASCII-8.
ASCII-7 is the standard 7-bit ASCII code. In ASCII-7, the first 3 bits are the zone bits and the next 4 bits are the digit bits.
ASCII-8 is an extended version of ASCII-7. It is an 8-bit code with 4 zone bits and 4 digit bits.
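
The zone/digit split described above can be sketched in Python as follows. The function name `split_zone_digit` and its `total_bits` parameter are hypothetical names chosen for this example; the split itself just separates the low 4 bits (digit) from the remaining high bits (zone).

```python
def split_zone_digit(code, total_bits=7):
    """Split a character code into (zone bits, digit bits) as bit strings."""
    if not 0 <= code < 2 ** total_bits:
        raise ValueError(f"code {code} does not fit in {total_bits} bits")
    zone_width = total_bits - 4      # 3 zone bits for ASCII-7, 4 for ASCII-8
    zone = code >> 4                 # everything above the low 4 bits
    digit = code & 0b1111            # the low 4 bits
    return format(zone, f"0{zone_width}b"), format(digit, "04b")


print(split_zone_digit(ord("A")))      # ('100', '0001')
print(split_zone_digit(ord("A"), 8))   # ('0100', '0001')
print(split_zone_digit(ord("5")))      # ('011', '0101')
```

For the digit characters the digit bits equal the digit's own binary value: `5` has digit bits `0101`.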
The ASCII-8 code represents 256 symbols.
Codes 0 to 31 represent control characters (non-printable), which are used for actions such as carriage return (CR) and bell (BEL).
Codes 48 to 57 stand for the digits 0-9.
Codes 65 to 90 stand for the uppercase letters A-Z.
Codes 97 to 122 stand for the lowercase letters a-z.
Codes 128 to 255 are the extended ASCII codes.
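
The code ranges listed above can be checked with a small Python sketch. The function name `classify_code` and the category labels are assumptions for illustration; codes the slide does not list (for example punctuation and the space character) are simply reported as "other".

```python
def classify_code(code):
    """Classify an ASCII-8 code (0-255) using the ranges from the slide."""
    if not 0 <= code <= 255:
        raise ValueError("ASCII-8 codes run from 0 to 255")
    if code <= 31:
        return "control"
    if 48 <= code <= 57:
        return "digit"
    if 65 <= code <= 90:
        return "uppercase"
    if 97 <= code <= 122:
        return "lowercase"
    if code >= 128:
        return "extended"
    return "other"  # ranges not listed on the slide, e.g. punctuation


for ch in ["\r", "7", "Q", "q", "!"]:
    print(ord(ch), classify_code(ord(ch)))
```

Carriage return (code 13) falls in the control range, while `7`, `Q` and `q` fall in the digit, uppercase and lowercase ranges respectively.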

EBCDIC
The Extended Binary Coded Decimal Interchange Code (EBCDIC) uses 8 bits (4 bits for zone, 4 bits for digit) to represent a symbol in the data.
EBCDIC allows 2^8 = 256 combinations of bits, so 256 unique symbols can be represented using the EBCDIC code.
It represents decimal digits (0-9), lowercase letters (a-z), uppercase letters (A-Z), special characters, and control characters (e.g. for cursor movement and printer vertical spacing), so the set covers both printable and non-printable symbols.
EBCDIC codes are used mainly in mainframe computers.
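
To see the 4-bit zone and 4-bit digit of an EBCDIC symbol, the sketch below uses Python's built-in `cp037` codec. This is an assumption: code page 037 is only one of several EBCDIC variants, and the lecture does not say which one it has in mind. The helper name `ebcdic_bits` is hypothetical.

```python
def ebcdic_bits(ch):
    """Return (zone bits, digit bits) of a character in EBCDIC code page 037."""
    (byte,) = ch.encode("cp037")     # exactly one byte per character
    zone = byte >> 4                 # high 4 bits
    digit = byte & 0x0F              # low 4 bits
    return format(zone, "04b"), format(digit, "04b")


print(ebcdic_bits("A"))  # ('1100', '0001')
print(ebcdic_bits("a"))  # ('1000', '0001')
print(ebcdic_bits("0"))  # ('1111', '0000')
print(2 ** 8)            # 256 possible bit combinations
```

Note that the same character gets a different bit pattern than in ASCII: `A` is `1100 0001` here but `0100 0001` in ASCII-8.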

Unicode
Unicode is a universal character encoding standard for the representation of text, including letters, numbers and symbols, in multilingual environments.
In its fixed-width UTF-32 form, Unicode uses 32 bits to represent a symbol in the data.
32 bits allow 2^32 = 4,294,967,296 (about 4.3 billion) combinations, although Unicode assigns code points only in the range U+0000 to U+10FFFF.
Unicode codes can uniquely represent characters and symbols from the world's written languages, such as Chinese and Japanese.
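
A minimal Python sketch of the 32-bit representation follows. It assumes the UTF-32 encoding form (big-endian, via Python's `utf-32-be` codec); the helper name `utf32_bits` is hypothetical. It also computes the number of 32-bit combinations directly.

```python
def utf32_bits(ch):
    """Return the 32-bit pattern of one character in UTF-32 (big-endian)."""
    data = ch.encode("utf-32-be")            # always 4 bytes per character
    return format(int.from_bytes(data, "big"), "032b")


print(2 ** 32)  # 4294967296 possible 32-bit combinations

for ch in "A中あ":
    # ord() gives the Unicode code point of the character.
    print(f"U+{ord(ch):04X}", utf32_bits(ch))
```

The Latin letter `A` keeps its ASCII code (U+0041), while the Chinese and Japanese characters get code points well beyond the 256 values available to an 8-bit scheme.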

ASCII | EBCDIC | UNICODE
Uses 7 bits to represent a character | Uses 8 bits to represent a character | Uses 16 bits to represent a character (16-bit form)
Can represent 128 characters (2^7 = 128) | Can represent 256 characters (2^8 = 256) | Can represent more than 65,000 characters
Primarily used on PCs and servers | Primarily used on mainframes and high-end servers | Implemented in several operating systems (Windows, Mac OS, Linux)
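
The character counts in the comparison above follow from the number of bits per character: n bits give 2^n distinct patterns. A short Python sketch, where the function name `capacity` and the labels are assumptions for illustration:

```python
def capacity(bits):
    """Number of distinct symbols that `bits` bits can distinguish."""
    return 2 ** bits


for name, bits in [("ASCII", 7), ("EBCDIC", 8), ("Unicode (16-bit form)", 16)]:
    print(f"{name}: {bits} bits -> {capacity(bits)} characters")
```

With 16 bits the result is 65,536, which is the "more than 65,000 characters" figure quoted for the 16-bit form of Unicode.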

Examples of coding systems:

     | CODING SYSTEM
DATA | ASCII   | EBCDIC   | UNICODE
1    | 0000001 | 00000001 | 0000000000000001
4    | 0000100 | 00000100 | 0000000000000100
9    | 0001001 | 00001001 | 0000000000001001
13   | 0001101 | 00001101 | 0000000000001101

Table 1
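
The bit patterns in Table 1 are the binary form of each value, zero-padded to 7, 8 and 16 bits. A minimal Python sketch that reproduces a row, assuming that reading of the table; the function name `encode_value` is hypothetical:

```python
def encode_value(value):
    """Return the value as 7-, 8- and 16-bit zero-padded bit strings."""
    return {
        "ASCII": format(value, "07b"),
        "EBCDIC": format(value, "08b"),
        "UNICODE": format(value, "016b"),
    }


row = encode_value(13)
print(row["ASCII"])    # 0001101
print(row["EBCDIC"])   # 00001101
print(row["UNICODE"])  # 0000000000001101
```

The value is the same in every column; only the number of leading zeros changes with the width of the code.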
