0% found this document useful (0 votes)
5 views

Lecture2 Basic Fall13

Copyright
© © All Rights Reserved
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views

Lecture2 Basic Fall13

Copyright
© © All Rights Reserved
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 29

Digital Data

Patrice Koehl
Computer Science
UC Davis
Digital Data

• Binary and Hexadecimal numbers

• ASCII code and UNICODE

• Sampling and Quantitizing


– Example: sound
Digital Data

• Binary and Hexadecimal numbers

• ASCII code and UNICODE

• Sampling and Quantitizing


– Example: sound
Number representation
We are used to counting in base 10:
1000 100 10 1

103 102 101 100


….. thousands hundreds tens units

Example:

1 7 3 2 digits
1000 100 10 1

1x1000+7x100+3x10+2x1 = 1732
Number representation
Computers use a different system: base 2:
1024 512 256 128 64 32 16 8 4 2 1

210 29 28 27 26 25 24 23 22 21 20

Example:

1 1 0 1 1 0 0 0 1 0 0 bits
1024 512 256 128 64 32 16 8 4 2 1
1x1024+1x512+0x256+1x128+1x64+0x32+ 0x16+ 0x8 + 1x4 + 0x2 + 0x1 = 1732
Number representation
Base 10 Base 2
0 0
1 1
2 10
3 11
4 100
5 101
6 110
… …
253 11111101
254 11111110
255 11111111
… …
Conversion
From base 2 to base 10:
1 1 1 0 1 0 1 0 1 0 0
1024 512 256 128 64 32 16 8 4 2 1
1x1024+1x512+1x256+0x128+1x64+0x32+ 1x16+ 0x8 + 1x4 + 0x2 + 0x1 = 1876

From base 10 to base 2:


1877 %2 = 938 Remainder 1
938 %2 = 469 Remainder 0
469 %2 = 234 Remainder 1
234 %2 = 117 Remainder 0
117 %2 = 58 Remainder 1
58 %2 = 29 Remainder 0
29 %2 = 14 Remainder 1
14 %2 = 7 Remainder 0
7 %2 = 3 Remainder 1
3 %2 = 1 Remainder 1
1 %2 = 0 Remainder 1
1877 (base10) = 11101010101 (base 2)
Facts about Binary Numbers
-Each “digit” of a binary number (each 0 or 1) is called a bit

-1 byte = 8 bits
-1 KB = 1 kilobyte = 210 bytes = 1024 bytes (≈1 thousand bytes)
-1 MB = 1 Megabyte = 220 bytes = 1,048,580 bytes (≈ 1 million bytes)
-1 GB = 1 Gigabyte = 230 bytes = 1,073,741,824 bytes (≈1 billion bytes)
-1 TB = 1 Tetabyte = 240 bytes = 1,099,511,627,776 bytes (≈ 1 trillion bytes)

-A byte can represent numbers up to 255: 11111111 (base 2) = 255 (base 10)
-The largest number represented by a binary number of size N is 2N - 1
Big Data: Volume

Byte Kilobyte Megabyte Gigabyte Terabyte Petabyte Exabyte Zettabyte Yottabyte


KB MB GB TB PB EB ZB YB
1000 bytes 1000 KB 1000 MB 1000 GB 1000 TB 1000 PB 1000 ZB 1000YB
Big Data: Volume
One page One song One movie 6 million 55 storeys Data Data NSA
of text books of DVD up to in 2011 data center
2003
30KB 5 MB 5 GB 1 TB 1 PB 1.8 ZB 1 YB
5 EB

Byte Kilobyte Megabyte Gigabyte Terabyte Petabyte Exabyte Zettabyte Yottabyte


KB MB GB TB PB EB ZB YB
1000 bytes 1000 KB 1000 MB 1000 GB 1000 TB 1000 PB 1000 ZB 1000YB
Big Data: Volume
One page One song One movie 6 million 55 storeys Data Data NSA
of text books of DVD up to in 2011 data center
2003
30KB 5 MB 5 GB 1 TB 1 PB 1.8 ZB 1 YB
5 EB

Byte Kilobyte Megabyte Gigabyte Terabyte Petabyte Exabyte Zettabyte Yottabyte


KB MB GB TB PB EB ZB YB
1000 bytes 1000 KB 1000 MB 1000 GB 1000 TB 1000 PB 1000 ZB 1000YB
1s 20 mins 11 days 30 years 300 30 million 30 billion ….
centuries years years
Hexadecimal numbers
While base 10 and base 2 are the most common bases used
to represent numbers, others are also possible:
base 16 is another popular one, corresponding to
hexadecimal numbers

256 16 1

162 161 160

The “digits” are: 0 1 2 3 4 5 6 7 8 9 A B C D E F


Example:
2 A F
256 16 1

2x256 + 10*16 + 15x1 = 687


Hexadecimal numbers
Everything we have learned in base 10 should be studied
again in other bases !!
Example: multiplication table in base 16:
Base 10 Base 2 Base 16
0 0000 0
1 0001 1
2 0010 2
3 0011 3
4 0100 4
5 0101 5
6 0110 6
7 0111 7
8 1000 8
9 1001 9
10 1010 A
11 1011 B
12 1100 C
13 1101 D
14 1110 E
15 1111 F
Conversion: From base 2 to base 16, and back
This is in fact easy!!

-From base 2 to base 16:


Example: 11011000100

Step 1: break into groups of 4 (starting from the right):

110 1100 0100

Step 2: pad with 0, if needed:

0110 1100 0100

Step 3: convert each group of 4, using table:

6 C 4

Step 4: regroup:

6C4

11011000100 (base 2) = 6C4 (base 16)


Conversion: From base 2 to base 16, and back

From base 16 to base 2:

Example: 4FD

Step 1: split:

4 F D

Step 2: convert each “digit”, using table:

0100 1111 1101

Step 3: Remove leading 0, if needed

100 1111 1101

Step 4: regroup:

10011111101

4FD (base 16) = 10011111101 (base 2)


Digital Data

• Binary and Hexadecimal numbers

• ASCII code and UNICODE

• Sampling and Quantitizing


– Example: sound
ASCII
American Standard Code for Information Interchange

So far, we have seen how computers can handle numbers.


What about letters / characters?
The ASCII code was designed for that: it assigns a number to
each character:

A-Z: 65-90
a-z: 97-122
0-9: 48-57
UNICODE

ASCII only contains 127 characters (though an extended


version exists with 257 characters).
This is by far not enough as it is too restrictive to the
English language.

UNICODE was developed to alleviate this problem:


the latest version, UNICODE 5.1.0 contains more than
100,000 characters, covering most existing languages.

For more information, see:


http://www.unicode.org/versions/Unicode5.1.0/
Digital Data

• Binary and Hexadecimal numbers

• ASCII code and UNICODE

• Sampling and Quantitizing


– Example: sound
Digital Sound
Sound is produced by the vibration of a media like air or water. Audio
refers to the sound within the range of human hearing.
Naturally, a sound signal is analog, i.e. continuous in both time and
amplitude.

To store and process sound information in a computer or to transmit it


through a computer network, we must first convert the analog signal to
digital form using an analog-to-digital converter ( ADC ); the conversion
involves two steps: (1) sampling, and (2) quantization.
Sampling
Sampling is the process of examining the value of a continuous function
at regular intervals.

Sampling usually occurs at uniform intervals, which are referred to as sampling


intervals. The reciprocal of sampling interval is referred to as the sampling
frequency or sampling rate.
If the sampling is done in time domain, the unit of sampling interval is second and
the unit of sampling rate is Hz, which means cycles per second.
Sampling
Note that choosing the sampling rate is not innocent:

A higher sampling rate usually allows for a better representation of the original sound
wave. However, when the sampling rate is set to twice the highest frequency in the
signal, the original sound wave can be reconstructed without loss from the samples.
This is known as the Nyquist theorem.
Quantization
Quantization is the process of limiting the value of a sample of a continuous
function to one of a predetermined number of allowed values,
which can then be represented by a finite number of bits.
Quantization
The number of bits used to store each intensity defines the accuracy of
the digital sound:

Adding one bit makes the sample twice as accurate


Audio Sound
Sampling:

The human ear can hear sound up to 20,000 Hz: a sampling rate of
40,000 Hz is therefore sufficient. The standard for digital audio is
44,100 Hz.
Quantization:
The current standard for the digital representation of audio sound is to use
16 bits (i.e 65536 levels, half positive and half negative)

How much space do we need to store one minute of music?


- 60 seconds
- 44,100 samples
-16 bits (2 bytes) per sample
- 2 channels (stereo)

S = 60x44100x2x2 = 10,534,000 bytes ≈ 10 MB !!


1 hour of music would be more than 600 MB !
Analog Recording

www.atpm.com/6.02/digitalaudio.shtml
DIGITAL RECORDING

DAC

Advantages of digital recording:


-Faithful
- can make multiple identical copies

-Can be processed
- compression (MP3) www.atpm.com/6.02/digitalaudio.shtml

You might also like