Encoding Schemes
Encoding Schemes
SCHEMES
WHAT IS ENCODING?
• Converting data (numbers,
alphabets, symbols, spaces,
graphics etc) into binary
codes.
A Cipher Is A Method Of
• The mechanism of converting Converting Regular Text Into A
data into an equivalent cipher Coded Form To Hide Its Meaning
And Protect It From Unauthorized
using specific code is called Access. This Process Is Called
as ENCODING. Encryption.
WHY ENCODING?
• Platform independency
• Internationalization
• Security
• Effective communication
POPULAR ENCODING SCHEMES
• ASCII
• ISCII
• UNICODE
ASCII
Before the existence of
ASCII, every computer
manufacturer used their own
way of character encoding,
making it impossible to
communicate when you use
two machines from different
manufacturers.
ASCII (AMERICAN STANDARD CODE FOR
INFORMATION INTERCHANGE
• Began in 1960.
• Standard common way to encode and represent keys of
keyboard understood by every computer.
• Uses 7-bit code to represent any character.
• It represents total of 27 = 128 characters.
• It can represent (encode) character set of English language
only.
ASCII
So for the understanding purpose, you can divide the
ASCII table into three main parts:
•0 to 31 and 127: Control characters (used for keys like
Indian languages.
ISCII
• All the characters resemble
very closely to each other in
terms of phonetics.
Therefore, only a certain
number of common character
set was sufficient for
covering several Indian
languages.
AS
The Development of ISCII
ISCII was developed by the National Informatics Centre (NIC) in India.
1 1986
Initial development of ISCII began.
2 1989
The first version of ISCII was released.
3 1990s
Further revisions and enhancements to ISCII were made.
Features of ISCII
ISCII was designed to be comprehensive and flexible.
ISCII defines a character ISCII uses a code page to ISCII supports a wide
set that includes all the map each character to a range of Indian
letters, digits, unique numerical value. languages, including
punctuation marks, and Hindi, Bengali, Marathi,
symbols used in Indian and Tamil.
languages.
Evolution and Successor
Standards
As technology advanced, newer standards emerged to address the limitations of ISCII.
The Unicode standard became the preferred character encoding system for
representing a wide range of languages.
ISCII
Limited character set, primarily for Indian scripts.
Unicode
Comprehensive character set, supporting thousands of languages.
The Birth of Unicode
In the early days of
computing, different
languages were
represented using
different encoding
schemes. These schemes
AS
were incompatible,
meaning text created
using one scheme could
The Challenge of Encoding
Every language uses its own unique set of characters.
Web Browsers
Displaying web pages with text in different languages.
Operating Systems
Handling text input and output for users.
Mobile Devices
Sending and receiving messages, using apps, and browsing the web.
Software Applications
Creating and editing documents, spreadsheets, presentations.
Unicode in Action
You use Unicode every day, even if you don't realize it.
Whenever you type, send a message, or browse the web, Unicode is working behind the
Expansion Customization
Adding new characters for Developing new features to meet
emerging languages and scripts. the needs of specific industries
and applications.
Security
Addressing security concerns related to character encoding and text processing.