Presentation - 12 Character Sets
Presentation - 12 Character Sets
Science
Character sets
teachcomputerscience.co
m
2
Lesson Objectives
teachcomputerscience.co
m
1.
Content
teachcomputerscience.co
m
4
What is a character?
▪ A character or symbol that is present on the keyboard.
▪ It has a specific character code that consists of numbers.
teachcomputerscience.co
m
5
teachcomputerscience.co
m
6
Character set
▪ A complete set of all the characters is called a character
set.
▪ Different languages are represented using different
character sets.
▪ These character sets are unique to meet the global
standards.
teachcomputerscience.co
m
7
ASCII
▪ The ASCII (American Standard Code for Information
Interchange) character set is a 7-bit set of codes that can
represent 128 different characters.
▪ This consists of upper-case letters, lower-case letters,
digits, punctuation marks, special characters and control
characters.
▪ ASCII code is used for English only.
teachcomputerscience.co
m
8
ASCII
* 042 00101010 2A
4 052 00110100 34
teachcomputerscience.co
m
Hex Char He Char Hex Char Hex Char Hex Char Hex Char
10
Complete x
20 <space 31 1 42 B 53 S 64 d 75 u
list of >
ASCII 21
22
!
“
32
33
2
3
43
44
C
D
54
55
T
U
65
66
e
f
76
77
v
w
codes 23 # 34 4 45 E 56 V 67 g 78 x
24 $ 35 5 46 F 57 W 68 h 79 y
25 % 36 6 47 G 58 X 69 i 7A z
26 & 37 7 48 H 59 Y 6A j 7B {
27 ‘ 38 8 49 I 5A Z 6B k 7C |
28 ( 39 9 4A J 5B [ 6C l 7D }
29 ) 3A : 4B K 5C \ 6D m 7E ~
2A * 3B ; 4C L 5D ] 6E n 7F <del
ete>
2B + 3C < 4D M 5E ^ 6F o
2C , 3D = 4E N 5F _ 70 p
2D - 3E > 4F O 60 ` 71 q
2E . 3F ? 50 P 61 a 72 r
2F / 40 @ 51 Q 62 b 73 s
teachcomputerscience.co
30 0 41 A 52 R 63 c 74 t m
11
Extended ASCII
▪ Extended ASCII code consists of an 8-bit character set,
and hence 256 different characters can be encoded.
▪ Characters used in European languages can also be
represented in this coding.
teachcomputerscience.co
m
12
Unicode
▪ Unicode is the industrial standard for encoding characters
in most of the world’s writing system.
▪ Initially, this was a 16-bit system that permitted over 65
000 characters.
▪ The number of bits has now been extended up to 32
permitting coding of several billions of characters.
teachcomputerscience.co
m
13
Unicode
▪ This system uses 8 to 32 bits per character.
▪ Because of a higher number of bits per character in
Unicode, the files occupy a higher memory space too.
▪ Facebook and Google also use the Unicode system as
users communicate in different languages.
▪ The ASCII codes for the characters and symbols remained
unchanged in Unicode.
teachcomputerscience.co
m
14
Unicode
▪ The codes for characters from other languages were
added to the list.
▪ Unicode allocates character codes for languages all over
the world.
▪ Several code pages are used to represent Unicode.
teachcomputerscience.co
m
15
Unicode
▪ Microsoft word provides an
option for users to select
letters from other languages
such as Thai, Greek and
Latin.
▪ A user can also type the
specific character in a
document.
▪ For example: to enter the
character “฿”, its unicode
(OE3F) is typed and then, teachcomputerscience.co
m
Let’s review some 16
concepts
teachcomputerscience.co
m
2.
Activity
teachcomputerscience.co
m
18
Activity-1
Duration: 10 minutes
Similarly, use the ASCII code table given in this article to find out
the hexadecimal representation of ASCII codes for the url:
www.facebook.com
teachcomputerscience.co
m
19
Activity-1
Duration: 10 minutes
Extended
ASCII Unicode
ASCII
Number of
bits
teachcomputerscience.co
m
3.
End of topic questions
teachcomputerscience.co
m
21