0% found this document useful (0 votes)

39 views7 pages

Chapter One Data Representation

The document discusses data representation in computers, including binary and hexadecimal numbering systems, binary data organization (bits, nibbles, bytes, words, double words), signed and unsigned numbering systems, and arithmetic operations on binary values. It provides an overview of number systems used in computers, explaining how binary and decimal numbers work and how to convert between them. Key topics covered include the basic units of data (bits and bytes), common data formats, and numeric representation.

Uploaded by

Birhanu Atnafu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

39 views7 pages

Chapter One Data Representation

Uploaded by

Birhanu Atnafu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 7

Chapter One Data Representation

Probably the biggest stumbling block most beginners encounter when attempting to learn assembly language is the common use of the binary and hexadecimal numbering systems. Many programmers think that hexadecimal (or hex) numbers represent absolute proof that God never intended anyone to work in assembly language. While it is true that hexadecimal numbers are a little different from what you may be used to, their advantages outweigh their disadvantages by a large margin. Nevertheless, understanding these numbering systems is important because their use simplifies other complex topics including boolean algebra and logic design, signed numeric representation, character codes, and packed data.

1.0 Chapter Overview

This chapter discusses several important concepts including the binary and hexadecimal numbering systems, binary data organization (bits, nibbles, bytes, words, and double words), signed and unsigned numbering systems, arithmetic, logical, shift, and rotate operations on binary values, bit fields and packed data, and the ASCII character set. This is basic material and the remainder of this text depends upon your understanding of these concepts. If you are already familiar with these terms from other courses or study, you should at least skim this material before proceeding to the next chapter. If you are unfamiliar with this material, or only vaguely familiar with it, you should study it carefully before proceeding. All of the material in this chapter is important! Do not skip over any material.

1.1 Numbering Systems

Most modern computer systems do not represent numeric values using the decimal system. Instead, they typically use a binary or two's complement numbering system. To understand the limitations of computer arithmetic, you must understand how computers represent numbers.

1.1.1 A Review of the Decimal System

You've been using the decimal (base 10) numbering system for so long that you probably take it for granted. When you see a number like "123", you don't think about the value 123; rather, you generate a mental image of how many items this value represents. In reality, however, the number 123 represents ("**" represents exponentiation):
1*10**2 + 2 * 10**1 + 3*10**0

or
100+20+3

Each digit appearing to the left of the decimal point represents a value between zero and nine times an increasing power of ten. Digits appearing to the right of the decimal point represent a

value between zero and nine times an increasing negative power of ten. For example, the value 123.456 means:
1*10**2 + 2*10**1 + 3*10**0 + 4*10**-1 + 5*10**-2 + 6*10**-3

or
100 + 20 + 3 + 0.4 + 0.05 + 0.006

1.1.2 The Binary Numbering System

Most modern computer systems (including the IBM PC) operate using binary logic. The computer represents values using two voltage levels (usually 0v and +5v). With two such levels we can represent exactly two different values. These could be any two different values, but by convention we use the values zero and one. These two values, coincidentally, correspond to the two digits used by the binary numbering system. Since there is a correspondence between the logic levels used by the 80x86 and the two digits used in the binary numbering system, it should come as no surprise that the IBM PC employs the binary numbering system. The binary numbering system works just like the decimal numbering system, with two exceptions: binary only allows the digits 0 and 1 (rather than 0-9), and binary uses powers of two rather than powers of ten. Therefore, it is very easy to convert a binary number to decimal. For each "1" in the binary string, add in 2**n where "n" is the zero-based position of the binary digit. For example, the binary value 11001010 represents:
1*2**7 + 1*2**6 + 0*2**5 + 0*2**4 + 1*2**3 + 0*2**2 + 1*2**1 + 0*2**0 = 128 + 64 + 8 + 2 =202 (base 10)

To convert decimal to binary is slightly more difficult. You must find those powers of two which, when added together, produce the decimal result. The easiest method is to work from the a large power of two down to 2**0. Consider the decimal value 1359:

2**10=1024, 2**11=2048. So 1024 is the largest power of two less than 1359. Subtract 1024 from 1359 and begin the binary value on the left with a "1" digit. Binary = "1", Decimal result is 1359 - 1024 = 335. The next lower power of two (2**9= 512) is greater than the result from above, so add a "0" to the end of the binary string. Binary = "10", Decimal result is still 335. The next lower power of two is 256 (2**8). Subtract this from 335 and add a "1" digit to the end of the binary number. Binary = "101", Decimal result is 79. 128 (2**7) is greater than 79, so tack a "0" to the end of the binary string. Binary = "1010", Decimal result remains 79. The next lower power of two (2**6 = 64) is less than79, so subtract 64 and append a "1" to the end of the binary string. Binary = "10101", Decimal result is 15.

15 is less than the next power of two (2**5 = 32) so simply add a "0" to the end of the binary string. Binary = "101010", Decimal result is still 15. 16 (2**4) is greater than the remainder so far, so append a "0" to the end of the binary string. Binary = "1010100", Decimal result is 15. 2**3(eight) is less than 15, so stick another "1" digit on the end of the binary string. Binary = "10101001", Decimal result is 7. 2**2 is less than seven, so subtract four from seven and append another one to the binary string. Binary = "101010011", decimal result is 3. 2**1 is less than three, so append a one to the end of the binary string and subtract two from the decimal value. Binary = "1010100111", Decimal result is now 1. Finally, the decimal result is one, which is2**0, so add a final "1" to the end of the binary string. The final binary result is "10101001111"

Binary numbers, although they have little importance in high level languages, appear everywhere in assembly language programs.

1.1.3 Binary Formats

In the purest sense, every binary number contains an infinite number of digits (or bits which is short for binary digits). For example, we can represent the number five by: 101 00000101 0000000000101 ... 000000000000101 Any number of leading zero bits may precede the binary number without changing its value. We will adopt the convention ignoring any leading zeros. For example, 101 (binary) represents the number five. Since the 80x86 works with groups of eight bits, we'll find it much easier to zero extend all binary numbers to some multiple of four or eight bits. Therefore, following this convention, we'd represent the number five as 0101 (binary) or 00000101 (binary). In the United States, most people separate every three digits with a comma to make larger numbers easier to read. For example, 1,023,435,208 is much easier to read and comprehend than 1023435208. We'll adopt a similar convention in this text for binary numbers. We will separate each group of four binary bits with a space. For example, the binary value 1010111110110010 will be written 1010 1111 1011 0010. We often pack several values together into the same binary number. One form of the 80x86 MOV instruction (see appendix D) uses the binary encoding 1011 0rrr dddd dddd to pack three items into 16 bits: a five-bit operation code (10110), a three-bit register field (rrr), and an eightbit immediate value (dddd dddd). For convenience, we'll assign a numeric value to each bit position. We'll number each bit as follows: 1) The rightmost bit in a binary number is bit position zero.

2) Each bit to the left is given the next successive bit number. An eight-bit binary value uses bits zero through seven:
X7 X6 X5 X4 X3 X2 X1 X0

A 16-bit binary value uses bit positions zero through fifteen:

X15 X14 X13 X12 X11 X10 X9 X8 X7 X6 X5 X4 X3 X2 X1 X0

Bit zero is usually referred to as the low order ( L.O.) bit. The left-most bit is typically called the high order ( H.O.) bit. We'll refer to the intermediate bits by their respective bit numbers.

1.2 Data Organization

In pure mathematics a value may take an arbitrary number of bits. Computers, on the other hand, generally work with some specific number of bits. Common collections are single bits, groups of four bits (called nibbles), groups of eight bits (called bytes), groups of 16 bits (called words), and more. The sizes are not arbitrary. There is a good reason for these particular values. This section will describe the bit groups commonly used on the Intel 80x86 chips.

1.2.1 Bits
The smallest "unit" of data on a binary computer is a single bit. Since a single bit is capable of representing only two different values (typically zero or one) you may get the impression that there are a very small number of items you can represent with a single bit. Not true! There are an infinite number of items you can represent with a single bit. With a single bit, you can represent any two distinct items. Examples include zero or one, true or false, on or off, male or female, and right or wrong. However, you are not limited to representing binary data types (that is, those objects which have only two distinct values). You could use a single bit to represent the numbers 723 and 1,245. Or perhaps 6,254 and 5. You could also use a single bit to represent the colors red and blue. You could even represent two unrelated objects with a single bit,. For example, you could represent the color red and the number 3,256 with a single bit. You can represent any two different values with a single bit. However, you can represent only two different values with a single bit. To confuse things even more, different bits can represent different things. For example, one bit might be used to represent the values zero and one, while an adjacent bit might be used to represent the values true and false. How can you tell by looking at the bits? The answer, of course, is that you can't. But this illustrates the whole idea behind computer data structures: data is what you define it to be. If you use a bit to represent a boolean (true/false) value then that bit (by your definition) represents true or false. For the bit to have any true meaning, you must be consistent. That is, if you're using a bit to represent true or false at one point in your program, you shouldn't use the true/false value stored in that bit to represent red or blue later.

Since most items you'll be trying to model require more than two different values, single bit values aren't the most popular data type you'll use. However, since everything else consists of groups of bits, bits will play an important role in your programs. Of course, there are several data types that require two distinct values, so it would seem that bits are important by themselves. However, you will soon see that individual bits are difficult to manipulate, so we'll often use other data types to represent boolean values.

1.2.2 Nibbles
A nibble is a collection of four bits. It wouldn't be a particularly interesting data structure except for two items: BCD (binary coded decimal) numbers and hexadecimal numbers. It takes four bits to represent a single BCD or hexadecimal digit. With a nibble, we can represent up to 16 distinct values. In the case of hexadecimal numbers, the values 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, A, B, C, D, E, and F are represented with four bits (see "The Hexadecimal Numbering System" on page 17). BCD uses ten different digits (0, 1, 2, 3, 4, 5, 6, 7, 8, 9) and requires four bits. In fact, any sixteen distinct values can be represented with a nibble, but hexadecimal and BCD digits are the primary items we can represent with a single nibble.

1.2.3 Bytes
Without question, the most important data structure used by the 80x86 microprocessor is the byte. A byte consists of eight bits and is the smallest addressable datum (data item) on the 80x86 microprocessor. Main memory and I/O addresses on the 80x86 are all byte addresses. This means that the smallest item that can be individually accessed by an 80x86 program is an eightbit value. To access anything smaller requires that you read the byte containing the data and mask out the unwanted bits. The bits in a byte are normally numbered from zero to seven using the convention shown below:

Bit 0 is the low order bit or least significant bit, bit 7 is the high order bit or most significant bit of the byte. We'll refer to all other bits by their number. Note that a byte also contains exactly two nibbles:

Bits 0..3 comprise the low order nibble, bits 4..7 form the high order nibble. Since a byte contains exactly two nibbles, byte values require two hexadecimal digits. Since a byte contains eight bits, it can represent 2**8, or 256, different values. Generally, we'll use a byte to represent numeric values in the range 0..255, signed numbers in the range 128..+127 (see "Signed and Unsigned Numbers" on page 23), ASCII/IBM character codes, and

other special data types requiring no more than 256 different values. Many data types have fewer than 256 items so eight bits is usually sufficient. Since the 80x86 is a byte addressable machine, it turns out to be more efficient to manipulate a whole byte than an individual bit or nibble. For this reason, most programmers use a whole byte to represent data types that require no more than 256 items, even if fewer than eight bits would suffice. For example, we'll often represent the boolean values true and false by 00000001 and 00000000 (respectively). Probably the most important use for a byte is holding a character code. Characters typed at the keyboard, displayed on the screen, and printed on the printer all have numeric values. To allow it to communicate with the rest of the world, the IBM PC uses a variant of the ASCII character set (see "The ASCII Character Set" on page 28). There are 128 defined codes in the ASCII character set. IBM uses the remaining 128 possible values for extended character codes including European characters, graphic symbols, Greek letters, and math symbols. See Appendix A for the character/code assignments.

1.2.4 Words
A word is a group of 16 bits. We'll number the bits in a word starting from zero on up to fifteen. The bit numbering appears below:

Like the byte, bit 0 is the low order bit and bit 15 is the high order bit. When referencing the other bits in a word use their bit position number. Notice that a word contains exactly two bytes. Bits 0 through 7 form the low order byte, bits 8 through 15 form the high order byte:

Naturally, a word may be further broken down into four nibbles as shown below:

Nibble zero is the low order nibble in the word and nibble three is the high order nibble of the word. The other two nibbles are "nibble one" or "nibble two". With 16 bits, you can represent 2**16 (65,536) different values. These could be the values in the range 0..65,535 (or, as is usually the case, -32,768..+32,767) or any other data type with no more

than 65,536 values. The three major uses for words are integer values, offsets, and segment values. Words can represent integer values in the range 0..65,535 or -32,768..32,767. Unsigned numeric values are represented by the binary value corresponding to the bits in the word. Signed numeric values use the two's complement form for numeric values (see "Signed and Unsigned Numbers" on page 23). Segment values, which are always 16 bits long, constitute the paragraph address of a code, data, extra, or stack segment in memory.

1.2.5 Double Words

A double word is exactly what its name implies, a pair of words. Therefore, a double word quantity is 32 bits long as shown below:

Naturally, this double word can be divided into a high order word and a low order word, or four different bytes, or eight different nibbles:

Double words can represent all kinds of different things. First and foremost on the list is a segmented address. Another common item represented with a double word is a 32-bit integer value (which allows unsigned numbers in the range 0..4,294,967,295 or signed numbers in the range -2,147,483,648..2,147,483,647). 32-bit floating point values also fit into a double word. Most of the time, we'll use double words to hold segmented addresses.

Data Migration Roadmap Bir 14112017
100% (1)
Data Migration Roadmap Bir 14112017
14 pages
Session 6 Data Representation
No ratings yet
Session 6 Data Representation
8 pages
Data Rep
No ratings yet
Data Rep
10 pages
Lecture 6 ICT Number System Decimal Binary 01042022 121104pm 20042022 091746am
No ratings yet
Lecture 6 ICT Number System Decimal Binary 01042022 121104pm 20042022 091746am
29 pages
Lecture-5
No ratings yet
Lecture-5
32 pages
Computer Fundamentals - Lecture_5- Numbring System
No ratings yet
Computer Fundamentals - Lecture_5- Numbring System
48 pages
CSC 204 Session 2
No ratings yet
CSC 204 Session 2
17 pages
DCN 157 - Introduction To IT (Lecture 3)
No ratings yet
DCN 157 - Introduction To IT (Lecture 3)
31 pages
0 Notes1 Integers
No ratings yet
0 Notes1 Integers
26 pages
CSC218 Sequential Programming Note
No ratings yet
CSC218 Sequential Programming Note
112 pages
Introduction To Microprocessor Systems: Basic of Digital Electronics
0% (1)
Introduction To Microprocessor Systems: Basic of Digital Electronics
27 pages
Assembly For Begginers
100% (1)
Assembly For Begginers
68 pages
Chapter 2: Data Representation
No ratings yet
Chapter 2: Data Representation
32 pages
1007ICT Introduction To Computer Systems & Networks: Data Representation
No ratings yet
1007ICT Introduction To Computer Systems & Networks: Data Representation
24 pages
Data Representation
No ratings yet
Data Representation
49 pages
CH 02
No ratings yet
CH 02
105 pages
Lecture02-Data Representation 2
No ratings yet
Lecture02-Data Representation 2
39 pages
Computer Architecture
No ratings yet
Computer Architecture
31 pages
Number_System
No ratings yet
Number_System
71 pages
Bahasa Rakitan (Assembler Language) : BY: Fakultas Ilmu Komputer Universitas Sriwijaya
No ratings yet
Bahasa Rakitan (Assembler Language) : BY: Fakultas Ilmu Komputer Universitas Sriwijaya
21 pages
Module2 Fc
No ratings yet
Module2 Fc
100 pages
EE2007C Chap1 201516
No ratings yet
EE2007C Chap1 201516
57 pages
Computer Organization Notes of Lesson-converted
No ratings yet
Computer Organization Notes of Lesson-converted
179 pages
Chapter 2
No ratings yet
Chapter 2
87 pages
A Quick Start Guide To CS/COE 0447: Digital Computer
No ratings yet
A Quick Start Guide To CS/COE 0447: Digital Computer
17 pages
Lecture 2
No ratings yet
Lecture 2
46 pages
Bee Unit 3 Cse Gates, Boolean, Codes
No ratings yet
Bee Unit 3 Cse Gates, Boolean, Codes
187 pages
CS Revision Booklet 1
No ratings yet
CS Revision Booklet 1
19 pages
Chapter 3 Data Representation and Computer Arithmetic
No ratings yet
Chapter 3 Data Representation and Computer Arithmetic
13 pages
Number System & Program Design
No ratings yet
Number System & Program Design
33 pages
Bilgisayar
No ratings yet
Bilgisayar
33 pages
Binary Number
No ratings yet
Binary Number
10 pages
Number System
No ratings yet
Number System
16 pages
Computer Science Coursebook-9-24
No ratings yet
Computer Science Coursebook-9-24
16 pages
Number Systems
No ratings yet
Number Systems
36 pages
Data - RepresentationPart 2 File Organization L1&2
No ratings yet
Data - RepresentationPart 2 File Organization L1&2
29 pages
Lecture_1_Number_System
No ratings yet
Lecture_1_Number_System
28 pages
The Number System CS
No ratings yet
The Number System CS
19 pages
Lect 1
No ratings yet
Lect 1
38 pages
CHAPTERS-WPS Office
No ratings yet
CHAPTERS-WPS Office
107 pages
1. Data Representation (1)
No ratings yet
1. Data Representation (1)
14 pages
Assembler
No ratings yet
Assembler
9 pages
Cambridge Igcse Computer Science Study and Revision Guide Sample Pages 9781398318489
No ratings yet
Cambridge Igcse Computer Science Study and Revision Guide Sample Pages 9781398318489
23 pages
4CS015Lecture1HCK_0d5aab72-6ebe-4738-b8e7-443d4aa72526_90180_
No ratings yet
4CS015Lecture1HCK_0d5aab72-6ebe-4738-b8e7-443d4aa72526_90180_
57 pages
Computer Sceince GCSE Notes
No ratings yet
Computer Sceince GCSE Notes
249 pages
Lect 2
No ratings yet
Lect 2
39 pages
caie-igcse-computer-science-2210-theory-v5
No ratings yet
caie-igcse-computer-science-2210-theory-v5
20 pages
CTPart 1
No ratings yet
CTPart 1
4 pages
BINARY NUMBER SYSTEM Unit1
No ratings yet
BINARY NUMBER SYSTEM Unit1
3 pages
01-IntroDigitalCircuits
No ratings yet
01-IntroDigitalCircuits
46 pages
Lecture02-Data Representation 2
No ratings yet
Lecture02-Data Representation 2
38 pages
02 Number Systems (1)
No ratings yet
02 Number Systems (1)
57 pages
L5 FCS Number Systesm
No ratings yet
L5 FCS Number Systesm
42 pages
Number Systems
No ratings yet
Number Systems
45 pages
DLD 2022 2
No ratings yet
DLD 2022 2
17 pages
Chapter 0 - Introduction To Computing
No ratings yet
Chapter 0 - Introduction To Computing
43 pages
Numerical Analysis Chapter 2
No ratings yet
Numerical Analysis Chapter 2
7 pages
1 Numbering Systems
No ratings yet
1 Numbering Systems
22 pages
Principles of Digital Electronics
From Everand
Principles of Digital Electronics
Sapana Rane
No ratings yet
Basic Math Notes
From Everand
Basic Math Notes
Ernest Bywater
5/5 (2)
Essential Computer Hardware: The Illustrated Guide to Understanding Computer Systems
From Everand
Essential Computer Hardware: The Illustrated Guide to Understanding Computer Systems
Kevin Wilson
No ratings yet
Unlocking the Future of Finance with Singular Wallet
No ratings yet
Unlocking the Future of Finance with Singular Wallet
2 pages
Delivering Ultimate Mobile Wallet
No ratings yet
Delivering Ultimate Mobile Wallet
2 pages
Time Schedule2345456
No ratings yet
Time Schedule2345456
3 pages
M-IS MANUAL FOR INTL OPERATIONS
No ratings yet
M-IS MANUAL FOR INTL OPERATIONS
39 pages
Lean Canavas Model
No ratings yet
Lean Canavas Model
4 pages
India Mobile Wallet Market (2018-2023)
No ratings yet
India Mobile Wallet Market (2018-2023)
3 pages
CPD Toolkit 20th October
100% (1)
CPD Toolkit 20th October
78 pages
Eoi FG
No ratings yet
Eoi FG
17 pages
Definitions and Usage of Were: Past Form Verb
No ratings yet
Definitions and Usage of Were: Past Form Verb
1 page
Freedom of Information: Standard Consent Form For Disclosure of Personal Information
No ratings yet
Freedom of Information: Standard Consent Form For Disclosure of Personal Information
2 pages
Scoping Study On Chinese Relations With Sudan : Nour Eldin A. Maglad
No ratings yet
Scoping Study On Chinese Relations With Sudan : Nour Eldin A. Maglad
29 pages
Risk Managemet Guidelinesv01
No ratings yet
Risk Managemet Guidelinesv01
16 pages
IT Technical Support Officer
0% (1)
IT Technical Support Officer
2 pages
System Retirement Parameters-30102018
No ratings yet
System Retirement Parameters-30102018
2 pages
Project Transition Document
No ratings yet
Project Transition Document
7 pages
Deployment Plan
No ratings yet
Deployment Plan
8 pages
Here Is A Basic Format For Developing and Using An Action Item List. You Can Modify The Format Accordingly
No ratings yet
Here Is A Basic Format For Developing and Using An Action Item List. You Can Modify The Format Accordingly
1 page
Training Plan
0% (1)
Training Plan
6 pages
TeamDecisions PDF
No ratings yet
TeamDecisions PDF
3 pages