0% found this document useful (0 votes)
1 views

Digital Logic Design Week 2 Lecture 2

The document discusses the necessity of alphanumeric codes, specifically focusing on the American Standard Code for Information Interchange (ASCII) and its limitations, particularly for non-English languages. It introduces Unicode as a solution to represent a wider range of characters and explains UTF-8 and UTF-32 encoding methods. The document concludes with a reference to further information on Unicode and thanks the reader.

Uploaded by

sajib25yhoo.com
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
1 views

Digital Logic Design Week 2 Lecture 2

The document discusses the necessity of alphanumeric codes, specifically focusing on the American Standard Code for Information Interchange (ASCII) and its limitations, particularly for non-English languages. It introduces Unicode as a solution to represent a wider range of characters and explains UTF-8 and UTF-32 encoding methods. The document concludes with a reference to further information on Unicode and thanks the reader.

Uploaded by

sajib25yhoo.com
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 17

CSE 1325: Digital Logic Design

Sidratul Tanzila Tasmi


Department of Computer Science and Engineering
United International University

1
Why do we need alphanumeric codes?
Addresses 8 bit locations

000

001

010

011

100
101
110

2
Why do we need alphanumeric codes?
Addresses 8 bit locations

000

001

010

89 Dec2Bin 011

100
0101 1001

101
110

3
Why do we need alphanumeric codes?
Addresses 8 bit locations

000

001

010

Hi BCD 011

100
101
110

4
American Standard Code for Information
Interchange
Printing characters (94):
26 uppercase letters
E.g. : A,C
26 lower case letters
E.g. : a,c
10 numbers
E.g. : 1,2
32 special characters
E.g. : @,$

5
Why do we need alphanumeric codes?
Addresses 8 bit locations

000

001

010
01001000

Hi BCD 011 0110 1001

100
101
110

6
American Standard Code for Information
Interchange
Printing characters (94):
26 uppercase letters
E.g. : A,C
26 lower case letters
E.g. : a,c
10 numbers
E.g. : 1,2
32 special characters
E.g. : @,$

7
American Standard Code for Information Interchange
There are three kinds of control characters:
1.Format effectors
HT- If the computer finds this character in the memory (in binary form)
It will provide a horizontal tab. Addresses 8 bit locations

000
001
010
0100 1000(H)

H i 0000 1001(HT)

011
0110 1001(i)

100
101
110
Fig: RAM
2/6/2023
8

8
American Standard Code for Information Interchange

There are three kinds of control characters:


2.Information effectors
FS – file separator. Helps in separating multiple files
1000 0100
0000 1100
1010 1101
FS (0001 1100)
1000 0100
0000 1100
1010 1101

9
American Standard Code for Information Interchange

There are three kinds of control characters:


3.Communication Control Characters
STX – Start of text Note: The corresponding
binary
ETX – End of text numbers are being sent
Wants to say
“Hi!”

ST
X

10
Problem with ASCII
Problem 1
• ASCII is well suited for English language
• It can represent a total of 256 characters
• But what about this case?

Problem 2
• You can develop your own coding scheme (mapping) for your own language

ASCII
BSCII (in your computer)
A= 0100 0001
ক= 0100 0001

11
Problem with ASCII (Cont.)

You type an amazing But your friend’s computer


poem in Bangla . Your uses ASCII.
computer encodes it in
He will get something
the RAM using BSCII.
gibberish.
For example ক will be
You send it to your friend misrepresented as A.

12
Unicode

Grapheme
A single unit of human writing system.
“J” , “ক”, “大”

Codepoint
Values assigned to the Graphemes (just like they did for ASCII)
“J” = 74 “ক”= 2453 “大”= 12233

13
UTF 8
“J” = 74 (100 1010)
“ক”= 2453 (1001 1001 1001)
“大”= 12233 (101111 11001001)

No. of bits Representation


7 0xxx xxxx (J=0100 1010)
11 110x xxxx 10xx xxxx (ক= ‘cant be rep’)
16 1110 xxxx 10xx xxxx 10xx xxxx ক= 1110 0000 10 100110 10 011001
21 11110xxx 10xx xxxx 10xx xxxx 10xx xxxx

14
UTF 32
UTF 32
• “J” = 74 (100 1010)
• “ক”= 2453 (1001 1001 1001)
• “大”= 12233 (101111 11001001)
UTF 32 does not use variable number of bits.
Each character(grapheme) is converted into codepoints as usual (see above)
Then 4 bytes are assigned for each character:
J= 00000000 00000000 00000000 01001010

15
More Information:

https://en.wikipedia.org/wiki/Unicode

16
THANK YOU!
Reference:
Digital Logic and Computer Design. M. Morris Mano (1979).

17

You might also like