
Unicode is an international character-encoding system

designed to support the electronic interchange,
processing, and display of written text in the diverse
languages of the modern and classical world. The Unicode
Standard includes letters, digits, diacritics, punctuation
marks, and technical symbols for all the world's principal
written languages, as well as emoji and other symbols,
using a uniform encoding scheme. The standard is
maintained by the Unicode Consortium. The first version
of Unicode was introduced in 1991; the most recent
version contains more than 100,000 characters.
Numerous encoding systems (including ASCII) predate
Unicode. With Unicode (unlike earlier systems), the
unique number assigned to each character remains the
same on any system that supports Unicode. ASCII alone
was not enough to cover all the world's languages, so the
Unicode Consortium introduced this encoding scheme to
overcome that limitation.
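The "unique number for each character" idea can be seen directly in Python, where `ord` and `chr` map between characters and their Unicode code points (a small illustrative sketch, not part of the original text):

```python
# Every character has one fixed code point, the same on any
# system that supports Unicode.
print(ord("A"))      # 65, identical to its ASCII value
print(ord("€"))      # 8364, i.e. U+20AC
print(chr(0x1F600))  # U+1F600 is the emoji 😀
```

The code point is an abstract number; how it is stored in bytes is decided separately by the encoding forms (UTF-8, UTF-16, UTF-32) described below.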

#INTERNAL STORAGE ENCODING
OF CHARACTERS

We know that a computer understands only
binary language (0 and 1). It cannot directly
understand or store alphabets, other numbers,
pictures, symbols, etc. Therefore, we use certain
coding schemes so that it can represent each of
them correctly. We call these codes
alphanumeric codes. The Unicode standard includes
roughly 100,000 characters to represent the
characters of different languages. While ASCII
uses only 1 byte per character, Unicode can use
up to 4 bytes to represent a character. Hence, it
provides a very wide range of encoding. It has
three main forms, namely UTF-8, UTF-16, and UTF-32.
Among them, UTF-8 is used the most; it is also the
default encoding for many programming languages.
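Since a computer stores every character as binary, it can help to see the actual bits. This small Python sketch (an addition for illustration) prints the UTF-8 byte patterns of two characters:

```python
# A character is ultimately stored as bits; here are the UTF-8
# byte patterns for a 1-byte and a 3-byte character.
for ch in ("A", "中"):
    bits = " ".join(f"{b:08b}" for b in ch.encode("utf-8"))
    print(ch, "->", bits)
```

`'A'` fits in a single byte (`01000001`), while `'中'` needs three UTF-8 bytes.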
#UCS (UNIVERSAL CHARACTER
SET)

It is a very common acronym in the Unicode
scheme. It stands for Universal Character Set.
Furthermore, it is the encoding scheme for
storing Unicode text.

UCS-2: It uses two bytes to store each character.

UCS-4: It uses four bytes to store each character.
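Python has no codec named `ucs-2`, but for characters in the Basic Multilingual Plane UTF-16 behaves like UCS-2, and UTF-32 is equivalent to UCS-4; a rough sketch under that assumption:

```python
# UCS-2 ≈ UTF-16 for BMP characters: 2 bytes each.
# UCS-4 ≈ UTF-32: always 4 bytes each.
text = "abc"
print(len(text.encode("utf-16-le")))  # 6 bytes, 2 per character
print(len(text.encode("utf-32-le")))  # 12 bytes, 4 per character
```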

#UTF (UNICODE TRANSFORMATION
FORMAT)

UTF is the most important part of this
encoding scheme. It stands for Unicode
Transformation Format. Moreover, it defines
how the code represents Unicode. It has several
forms, as follows:

UTF-7
This scheme was designed to represent the ASCII
standard, since ASCII uses 7-bit encoding. It
represents ASCII characters in emails and
messages that use this standard.

UTF-8
It is the most commonly used form of encoding.
Furthermore, it can use up to 4 bytes to
represent a character. It uses:

1 byte to represent English letters and symbols.

2 bytes to represent additional Latin and Middle
Eastern letters and symbols.

3 bytes to represent most Asian letters and symbols.

4 bytes for other additional characters.

Moreover, it is compatible with the ASCII standard.

Its uses are as follows:

Many protocols use this scheme.

It is the default standard for XML files.

Some Unix and Linux file systems use it in some
files.

It is used for the internal processing of some
applications.

It is widely used in web development today.

It can also represent emoji, which are today a very
important feature of most apps.
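The 1-to-4-byte pattern described above can be checked directly in Python (an illustrative sketch added here, not part of the original notes):

```python
# UTF-8 is variable-width: 1 to 4 bytes depending on the character.
samples = {"A": 1, "é": 2, "中": 3, "😀": 4}
for ch, expected in samples.items():
    n = len(ch.encode("utf-8"))
    print(ch, "uses", n, "byte(s)")
    assert n == expected
```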

UTF-16

It is an extension of UCS-2 encoding. It uses
2 bytes to represent each of the 65,536 characters
of the Basic Multilingual Plane, and it also supports
4-byte surrogate pairs for additional characters.
Furthermore, it is used for internal processing,
as in Java, Microsoft Windows, etc.
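The 2-byte/4-byte split in UTF-16 is easy to demonstrate (a small added sketch):

```python
# UTF-16 stores BMP characters in 2 bytes; characters beyond
# U+FFFF, such as most emoji, take a 4-byte surrogate pair.
print(len("A".encode("utf-16-le")))   # 2 bytes
print(len("😀".encode("utf-16-le")))  # 4 bytes (surrogate pair)
```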

UTF-32
It is a multibyte encoding scheme. It uses a
fixed 4 bytes to represent every character.

↪️IMPORTANCE OF UNICODE

As it is a universal standard, it allows
writing a single application for various
platforms. This means that we can develop an
application once and run it on various
platforms in different languages. Hence we
don't have to write the code for the same
application again and again, and the
development cost is reduced.

Moreover, a common standard helps prevent data
corruption when text moves between systems, since
Unicode is a common encoding standard for many
different languages and characters. We can also use
it to convert from one coding scheme to another:
since Unicode is a superset of all common encoding
schemes, we can convert a code into Unicode and
then convert it into another coding standard. It is
preferred by many programming languages; for
example, XML tools and applications use this
standard only.
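The "convert via Unicode" step looks like this in Python, where a `str` is the Unicode intermediate (an added sketch; the byte values are standard Latin-1 and UTF-8 encodings of "café"):

```python
# Converting between encodings goes through Unicode:
# decode the source bytes to a str, then encode to the target.
latin1_bytes = "café".encode("latin-1")  # b'caf\xe9'
text = latin1_bytes.decode("latin-1")    # back to Unicode text
utf8_bytes = text.encode("utf-8")        # b'caf\xc3\xa9'
print(utf8_bytes)
```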

***ADVANTAGES OF UNICODE
-It is a global standard for encoding.

-It has support for mixed-script computing
environments.

-The encoding is space-efficient and hence
saves memory.

-It is a common scheme for web development.

-It increases the interoperability of data
across platforms.

-It saves time and the development cost of applications.

Unicode was originally designed as a 16-bit system;
it has since grown beyond 16 bits and can support
many more characters than ASCII.

***Disadvantage
Because it has more characters, Unicode
uses a lot more space. It takes at least 2 bytes
to store each character in UTF-16, and Unicode
in general uses more bytes to enumerate its
vastly larger range of alphabetic symbols.

The first 128 characters are the same as in the
ASCII system, making it compatible with ASCII.

There are 6,400 characters set aside for the user
or software (the Private Use Area).

There are still characters which have not been
defined yet, future-proofing the system.

It can store characters from more than one
language.

It can store characters from languages with more
than 250 characters.
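The ASCII compatibility of the first 128 code points means that plain ASCII text produces identical bytes under both encodings, as this short added check shows:

```python
# The first 128 code points coincide with ASCII, so ASCII text
# encodes to the same bytes in ASCII and in UTF-8.
text = "Hello, Unicode!"
assert text.encode("ascii") == text.encode("utf-8")
print(text.encode("utf-8"))
```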
