Data Representation

Uploaded by

Recep Can Kazanç

Data Representation in Computers

UniCode
• Bit means “binary digit” and is the smallest unit of computerized data.
• A bit is a base-2 digit, i.e. it has a value of either 0 or 1.
• A byte is an amount of memory, a certain collection of bits, originally variable in size
but now almost always eight bits.

Some example bytes could be 00000001 or 11111111 or 01010011.

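The calculation of the decimal equivalent of the binary value 00000001:
1×2^0 = 1

UniCode

• The calculation of the decimal equivalent of the binary value 11111111:
1×2^7 + 1×2^6 + 1×2^5 + 1×2^4 + 1×2^3 + 1×2^2 + 1×2^1 + 1×2^0 = 128 + 64 + 32 + 16 + 8 + 4 + 2 + 1 = 255

• The calculation of the decimal equivalent of the binary value 01010011:
1×2^6 + 1×2^4 + 1×2^1 + 1×2^0 = 64 + 16 + 2 + 1 = 83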
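The positional calculation for the three example bytes can be sketched in Python. The function name binary_to_decimal is only illustrative; Python's built-in int(bits, 2) does the same conversion.

```python
def binary_to_decimal(bits: str) -> int:
    """Sum bit * 2**position, where position 0 is the rightmost bit."""
    total = 0
    for position, bit in enumerate(reversed(bits)):
        total += int(bit) * 2 ** position
    return total


for pattern in ("00000001", "11111111", "01010011"):
    # The hand-rolled sum and the built-in conversion should agree.
    print(pattern, "->", binary_to_decimal(pattern), int(pattern, 2))
```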

ASCII

• ASCII stands for American Standard Code for Information Interchange.

• It is a standard for assigning numerical values to the set of letters in the Roman
alphabet and typographic characters.
• The ASCII character set can be represented by 7 bits. This gives 2^7 = 128 different
values (characters).
• As ASCII uses only 7 of the 8 bits available in a byte, the first bit is always 0:
0xxxxxxx.

• The first 32 characters (codes 0 to 31) are control characters.
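A small Python sketch of these points, using the built-in ord() to look up ASCII values. The helper name ascii_bits is an assumption made for this example.

```python
def ascii_bits(ch: str) -> str:
    """Return the 8-bit pattern of an ASCII character (leading bit is 0)."""
    code = ord(ch)
    if code >= 128:
        raise ValueError(f"{ch!r} is not an ASCII character")
    return format(code, "08b")


for ch in ("A", "S", "a", "0"):
    print(repr(ch), ord(ch), ascii_bits(ch))

# Codes 0 to 31 are the control characters, e.g. tab (9) and line feed (10).
print(ord("\t"), ord("\n"))
```

Note that 'S' has the value 83, so its byte is 01010011, one of the example bytes shown earlier.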

UniCode
• ASCII was an American-developed standard, so it only defined unaccented characters.
There was an ‘e’, but no ‘é’ or ‘Í’.
• This meant that languages which required accented characters couldn’t be
represented in ASCII.
• Unicode started out using 16-bit characters instead of 8-bit characters. 16 bits means
you have 2^16 = 65,536 distinct values available.
• This made it possible to represent many different characters from many different
alphabets

UniCode
• UTF-8 encodes a character with one to four bytes; an ASCII character takes a single byte.
• UTF-16 encodes a character with two bytes, or four bytes for code points above U+FFFF.

• The same binary data is interpreted differently by different encodings.
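As an illustration of the last point, the two bytes C3 A9 (chosen for this example) decode to one character in UTF-8 but to two characters in Latin-1. This is a minimal sketch using only the standard bytes.decode() and str.encode() methods.

```python
data = bytes([0xC3, 0xA9])

as_utf8 = data.decode("utf-8")       # one character: é
as_latin1 = data.decode("latin-1")   # two characters: Ã©

print(len(as_utf8), as_utf8)
print(len(as_latin1), as_latin1)

# The same character needs a different number of bytes per encoding.
print("é".encode("utf-8"))       # 2 bytes
print("é".encode("utf-16-be"))   # 2 bytes, different values
print("e".encode("utf-8"))       # 1 byte, same as ASCII
```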

UniCode
• A string of ASCII text is also valid UTF-8 text.
• UTF-8 uses the following rules:
• If the code point is < 128, it’s represented by the corresponding byte value.
• If the code point is >= 128, it’s turned into a sequence of two, three, or four bytes,
where each byte of the sequence is between 128 and 255.
• Latin-1, also known as ISO-8859-1, is a similar encoding.
• Unicode code points 0–255 are identical to the Latin-1 values, so converting to this
encoding simply requires converting code points to byte values; if a code point
larger than 255 is encountered, the string can’t be encoded into Latin-1.

• One-character Unicode strings can also be created with the chr() built-in function,
which takes an integer and returns a Unicode string of length 1 that contains the
corresponding code point.
• The reverse operation is the built-in ord() function, which takes a one-character
Unicode string and returns the code point value.
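The rules on this slide can be tried out directly in Python. The sample characters and the helper name fits_latin1 are assumptions chosen for this sketch.

```python
def fits_latin1(text: str) -> bool:
    """True if every code point is at most 255, so Latin-1 can encode it."""
    try:
        text.encode("latin-1")
        return True
    except UnicodeEncodeError:
        return False


for ch in ("e", "\u00e9", "\u20ac", "\U0001F600"):
    code_point = ord(ch)            # character -> code point
    assert chr(code_point) == ch    # code point -> character
    utf8 = ch.encode("utf-8")
    print(f"U+{code_point:04X}", len(utf8), "byte(s) in UTF-8,",
          "Latin-1 ok" if fits_latin1(ch) else "not encodable in Latin-1")
```

Code points below 128 come out as a single byte; the others use two, three, or four bytes, each with a value of 128 or more.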

Detecting Encodings
• Chardet library:
• Character encoding auto-detection in Python.

• Latin-1 is also known as ISO-8859-1
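A hedged sketch of how chardet might be used. It assumes the third-party chardet package is installed (pip install chardet) and falls back to None when it is not. The wrapper name guess_encoding and the sample text are made up for this example, and the detector's guess is a statistical estimate, not a guarantee.

```python
try:
    import chardet  # third-party package, may not be installed
except ImportError:
    chardet = None


def guess_encoding(data: bytes):
    """Return chardet's result dict, or None if chardet is unavailable."""
    if chardet is None:
        return None
    # chardet.detect() returns a dict with keys such as
    # 'encoding' and 'confidence'.
    return chardet.detect(data)


sample = "Les caractères accentués posent problème.".encode("latin-1")
print(guess_encoding(sample))
```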

Integer representation
• Unsigned Integers:
• Unsigned integers can represent zero and positive integers, but not negative integers.
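With n bits, an unsigned integer can take the values 0 to 2^n - 1. A minimal Python sketch, with function names chosen only for this example:

```python
def unsigned_range(n_bits: int) -> tuple:
    """Smallest and largest value of an n-bit unsigned integer."""
    return 0, 2 ** n_bits - 1


def to_unsigned_bits(value: int, n_bits: int) -> str:
    """Return the n-bit pattern of a value, rejecting anything out of range."""
    low, high = unsigned_range(n_bits)
    if not low <= value <= high:
        raise ValueError(f"{value} does not fit in {n_bits} unsigned bits")
    return format(value, f"0{n_bits}b")


print(unsigned_range(8))         # (0, 255)
print(to_unsigned_bits(65, 8))   # 01000001
```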

Integer representation
• Signed Integers:
• Signed integers can represent zero, positive integers, as well as negative integers.
• Three representation types:
➢ Sign-Magnitude representation
➢ 1's Complement representation
➢ 2's Complement representation: the modern method
• In 2's complement, the most significant bit (msb) is the sign bit, with a value of 0
representing positive integers and 1 representing negative integers.
• The remaining n-1 bits represent the magnitude of the integer, as follows:
• for positive integers, the absolute value of the integer is equal to "the magnitude of
the (n-1)-bit binary pattern".
• for negative integers, the absolute value of the integer is equal to "the magnitude of
the complement of the (n-1)-bit binary pattern plus one" (hence called 2's
complement).

Integer representation
• 2's Complement representation
• Example 1: Suppose that n=8 and the binary representation is 0 100 0001B.
Sign bit is 0 ⇒ positive
Absolute value is 100 0001B = 65D
Hence, the integer is +65D

• Example 2: Suppose that n=8 and the binary representation is 1 000 0001B.
Sign bit is 1 ⇒ negative
Absolute value is the complement of 000 0001B plus 1, i.e., 111 1110B + 1B = 127D
Hence, the integer is -127D

• Example 3: Suppose that n=8 and the binary representation is 0 000 0000B.
Sign bit is 0 ⇒ positive
Absolute value is 000 0000B = 0D
Hence, the integer is +0D

• Example 4: Suppose that n=8 and the binary representation is 1 111 1111B.
Sign bit is 1 ⇒ negative
Absolute value is the complement of 111 1111B plus 1, i.e., 000 0000B + 1B = 1D
Hence, the integer is -1D
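The decoding rule used in these examples can be sketched in Python as follows. The function name decode_twos_complement is an assumption for this example; the cross-check uses the standard int.from_bytes(..., signed=True).

```python
def decode_twos_complement(bits: str) -> int:
    """Decode an n-bit 2's complement pattern, following the rule above."""
    bits = bits.replace(" ", "")
    n = len(bits)
    sign_bit, rest = bits[0], bits[1:]
    pattern = int(rest, 2) if rest else 0
    if sign_bit == "0":
        return pattern                        # positive: magnitude as is
    mask = (1 << (n - 1)) - 1                 # keeps the low n-1 bits
    return -(((~pattern) & mask) + 1)         # negative: complement plus one


for example in ("0 100 0001", "1 000 0001", "0 000 0000", "1 111 1111"):
    value = decode_twos_complement(example)
    # Cross-check against Python's own signed interpretation of the byte.
    raw = bytes([int(example.replace(" ", ""), 2)])
    assert value == int.from_bytes(raw, "big", signed=True)
    print(example, "->", value)
```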
Floating Point representation
Normalized form

1 1000 0001 011 0000 0000 0000 0000 0000

• S = 1 (the sign bit: 1 means negative, 0 means positive)
• E = 1000 0001
• F = 011 0000 0000 0000 0000 0000

• N = (-1)^S × 1.F × 2^(E-127)

Fraction part: 1.011 0000 0000 0000 0000 0000B
The "1" at the beginning is implicit.
Fraction: 1 + 1×2^-2 + 1×2^-3 = 1.375D.
Exponent part: 1000 0001B = 129D, so the actual exponent is 129 - 127 = 2.
So the number is -1.375×2^2 = -5.5D.

We need to represent both positive and negative exponents. With an 8-bit E, ranging from
0 to 255, the excess-127 scheme can provide actual exponents of -127 to 128.
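The same steps can be sketched in Python with shifts and masks. The function names are illustrative, and normalized_value applies only the normalized-form formula N = (-1)^S × 1.F × 2^(E-127).

```python
def split_fields(word: int) -> tuple:
    """Split a 32-bit pattern into sign (1 bit), exponent (8), fraction (23)."""
    sign = (word >> 31) & 0x1
    exponent = (word >> 23) & 0xFF
    fraction = word & 0x7FFFFF
    return sign, exponent, fraction


def normalized_value(word: int) -> float:
    """Evaluate (-1)^S * 1.F * 2^(E-127) for a normalized pattern."""
    s, e, f = split_fields(word)
    significand = 1 + f / 2 ** 23      # implicit leading 1 plus the fraction
    return (-1) ** s * significand * 2.0 ** (e - 127)


word = int("1 1000 0001 011 0000 0000 0000 0000 0000".replace(" ", ""), 2)
print(split_fields(word))      # S, E, F as integers
print(normalized_value(word))  # -5.5
```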

Floating Point representation
De-Normalized form: Because the normalized form has an implicit leading 1 for the
fraction, it cannot represent the number zero!

• For E=0, the numbers are in the de-normalized form.
• An implicit leading 0 (instead of 1) is used for the fraction, and the actual exponent
is always -126. Hence, the number zero can be represented with E=0 and F=0
(because 0.0×2^-126=0).

We can also represent very small positive and negative numbers in de-normalized form with E=0.

For example, suppose S=1, E=0, and F=011 0000 0000 0000 0000 0000.
The actual fraction is 0.011B = 1×2^-2 + 1×2^-3 = 0.375D.
Since S=1, it is a negative number.
With E=0, the actual exponent is -126.
Hence the number is -0.375×2^-126 = -4.4×10^-39, which is an extremely small negative
number (close to zero).
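A short Python check of this example, evaluating the de-normalized formula by hand and comparing it with the value the standard struct module produces for the same 32 bits. The function name denormalized_value is an assumption for this sketch.

```python
import struct


def denormalized_value(sign: int, fraction_bits: str) -> float:
    """Evaluate (-1)^S * 0.F * 2^-126 for a pattern whose E field is 0."""
    f = int(fraction_bits.replace(" ", ""), 2)
    return (-1) ** sign * (f / 2 ** 23) * 2.0 ** -126


fraction = "011 0000 0000 0000 0000 0000"
by_formula = denormalized_value(1, fraction)

# Build the full word (S=1, E=0, F as above) and let struct decode it.
word = (1 << 31) | int(fraction.replace(" ", ""), 2)
by_struct = struct.unpack(">f", struct.pack(">I", word))[0]

print(by_formula, by_struct)
```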

Floating Point representation

In summary:
For 1 ≤ E ≤ 254, N = (-1)^S × 1.F × 2^(E-127).
• These numbers are in the so-called normalized form.
• The sign bit represents the sign of the number.
• The fractional part (1.F) is normalized with an implicit leading 1.
• The exponent is biased by (in excess of) 127, so as to represent both positive and
negative exponents.
• The range of the actual exponent is -126 to +127.

For E = 0, N = (-1)^S × 0.F × 2^(-126).
• These numbers are in the so-called denormalized form.
• The factor 2^-126 is a very small number.
• Denormalized form is needed to represent zero (with F=0 and E=0).
• It can also represent very small positive and negative numbers close to zero.

For E = 255, N represents special values, such as ±INF (positive and negative infinity) and
NaN (not a number).
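The three cases in this summary can be combined into one decoder. This is a sketch, not a reference implementation; the name decode_single is made up, and the split of E = 255 into infinity (F = 0) and NaN (F ≠ 0) follows the IEEE 754 convention, which the slide does not spell out.

```python
import math


def decode_single(word: int) -> float:
    """Decode a 32-bit single-precision pattern case by case."""
    s = (word >> 31) & 0x1
    e = (word >> 23) & 0xFF
    f = word & 0x7FFFFF
    sign = -1.0 if s else 1.0
    if e == 255:                       # special values
        return sign * math.inf if f == 0 else math.nan
    if e == 0:                         # denormalized: implicit leading 0
        return sign * (f / 2 ** 23) * 2.0 ** -126
    return sign * (1 + f / 2 ** 23) * 2.0 ** (e - 127)   # normalized


for word in (0xC0B00000, 0x00000000, 0x7F800000, 0xFF800000, 0x7FC00000):
    print(f"{word:08X}", decode_single(word))
```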
Floating Point representation

Example:
0 10000000 110 0000 0000 0000 0000 0000

• Sign bit S = 0 ⇒ positive number

• E = 1000 0000B = 128D (in normalized form)
• Fraction is 1.11B (with an implicit leading 1) = 1 + 1×2^-1 + 1×2^-2 = 1.75D
• The number is +1.75 × 2^(128-127) = +3.5D

Floating Point representation

Example:
1 01111110 100 0000 0000 0000 0000 0000

• Sign bit S = 1 ⇒ negative number

• E = 01111110B = 126D (in normalized form)
• Fraction is 1.1B (with an implicit leading 1) = 1 + 2^-1 = 1.5D
• The number is -1.5 × 2^(126-127) = -0.75D

Floating Point representation

Example:
1 00000000 000 0000 0000 0000 0000 0001

• Sign bit S = 1 ⇒ negative number
• E = 0 (in de-normalized form)

• Fraction is 0.000 0000 0000 0000 0000 0001B (with an implicit leading 0) = 1×2^-23
• The number is -2^-23 × 2^(-126) = -2^(-149) ≈ -1.4×10^-45
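The three worked examples can be checked by letting Python reinterpret each 32-bit pattern as a float with the standard struct module. The helper name bits_to_float is an assumption for this sketch.

```python
import struct


def bits_to_float(pattern: str) -> float:
    """Reinterpret a 32-bit pattern (spaces allowed) as a single-precision float."""
    bits = pattern.replace(" ", "")
    if len(bits) != 32:
        raise ValueError("expected exactly 32 bits")
    word = int(bits, 2)
    # Pack as a big-endian unsigned int, then unpack the same bytes as a float.
    return struct.unpack(">f", struct.pack(">I", word))[0]


print(bits_to_float("0 10000000 110 0000 0000 0000 0000 0000"))  # 3.5
print(bits_to_float("1 01111110 100 0000 0000 0000 0000 0000"))  # -0.75
print(bits_to_float("1 00000000 000 0000 0000 0000 0000 0001"))
```

The last pattern is the negative de-normalized number with the smallest magnitude, -2^-149.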

64-bit Double-Precision Floating-Point Numbers

• The most significant bit is the sign bit (S), with 0 for positive numbers and 1
for negative numbers.
• The following 11 bits represent the exponent (E).
• The remaining 52 bits represent the fraction (F).

• Normalized form: For 1 ≤ E ≤ 2046, N = (-1)^S × 1.F × 2^(E-1023).

• Denormalized form: For E = 0, N = (-1)^S × 0.F × 2^(-1022).
• For E = 2047, N represents special values, such as ±INF (infinity) and NaN (not
a number).
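The double-precision rules can be sketched the same way as the 32-bit case. Python's own float is a 64-bit double on common platforms, so the decoder can be checked by round-tripping values through the standard struct module. The function names are illustrative, and treating E = 2047 with F = 0 as infinity and F ≠ 0 as NaN follows the IEEE 754 convention.

```python
import math
import struct


def float_to_word(x: float) -> int:
    """Return the 64-bit pattern of a Python float as an integer."""
    return struct.unpack(">Q", struct.pack(">d", x))[0]


def decode_double(word: int) -> float:
    """Decode sign (1 bit), exponent (11 bits) and fraction (52 bits)."""
    s = (word >> 63) & 0x1
    e = (word >> 52) & 0x7FF
    f = word & ((1 << 52) - 1)
    sign = -1.0 if s else 1.0
    if e == 2047:                      # special values
        return sign * math.inf if f == 0 else math.nan
    if e == 0:                         # denormalized form
        return sign * (f / 2 ** 52) * 2.0 ** -1022
    return sign * (1 + f / 2 ** 52) * 2.0 ** (e - 1023)   # normalized form


for x in (1.0, -5.5, 3.5, 0.0, 5e-324):
    print(x, f"{float_to_word(x):016X}", decode_double(float_to_word(x)))
```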

Floating-Point Numbers Representations
