Digital Logic Design Week 2 Lecture 2
Digital Logic Design Week 2 Lecture 2
1
Why do we need alphanumeric codes?
Addresses 8 bit locations
000
001
010
011
100
101
110
2
Why do we need alphanumeric codes?
Addresses 8 bit locations
000
001
010
89 Dec2Bin 011
100
0101 1001
101
110
3
Why do we need alphanumeric codes?
Addresses 8 bit locations
000
001
010
Hi BCD 011
100
101
110
4
American Standard Code for Information
Interchange
Printing characters (94):
26 uppercase letters
E.g. : A,C
26 lower case letters
E.g. : a,c
10 numbers
E.g. : 1,2
32 special characters
E.g. : @,$
5
Why do we need alphanumeric codes?
Addresses 8 bit locations
000
001
010
01001000
100
101
110
6
American Standard Code for Information
Interchange
Printing characters (94):
26 uppercase letters
E.g. : A,C
26 lower case letters
E.g. : a,c
10 numbers
E.g. : 1,2
32 special characters
E.g. : @,$
7
American Standard Code for Information Interchange
There are three kinds of control characters:
1.Format effectors
HT- If the computer finds this character in the memory (in binary form)
It will provide a horizontal tab. Addresses 8 bit locations
000
001
010
0100 1000(H)
H i 0000 1001(HT)
011
0110 1001(i)
100
101
110
Fig: RAM
2/6/2023
8
8
American Standard Code for Information Interchange
9
American Standard Code for Information Interchange
ST
X
10
Problem with ASCII
Problem 1
• ASCII is well suited for English language
• It can represent a total of 256 characters
• But what about this case?
Problem 2
• You can develop your own coding scheme (mapping) for your own language
ASCII
BSCII (in your computer)
A= 0100 0001
ক= 0100 0001
11
Problem with ASCII (Cont.)
12
Unicode
Grapheme
A single unit of human writing system.
“J” , “ক”, “大”
Codepoint
Values assigned to the Graphemes (just like they did for ASCII)
“J” = 74 “ক”= 2453 “大”= 12233
13
UTF 8
“J” = 74 (100 1010)
“ক”= 2453 (1001 1001 1001)
“大”= 12233 (101111 11001001)
14
UTF 32
UTF 32
• “J” = 74 (100 1010)
• “ক”= 2453 (1001 1001 1001)
• “大”= 12233 (101111 11001001)
UTF 32 does not use variable number of bits.
Each character(grapheme) is converted into codepoints as usual (see above)
Then 4 bytes are assigned for each character:
J= 00000000 00000000 00000000 01001010
15
More Information:
https://en.wikipedia.org/wiki/Unicode
16
THANK YOU!
Reference:
Digital Logic and Computer Design. M. Morris Mano (1979).
17