Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 9
Character Representation
• Any text-based data is stored by the computer
in the form of bits(a series of 1s and 0s) • The combinations of 0s and 1s used to represent data are defined by patterns called coding schemes – BCD – ASCII – Extended ASCII – EBCDIC – Unicode BCD (Binary Coded Decimal) • BCD uses 6 bits and can represent 26 =64 characters Example • 709310 = ? (in BCD) 7 0 9 3
0111 0000 1001 0011
ASCII (American Standard Code for Information Exchange) • Uses 7 bits and can represent 27 =128 characters • Starts from (ANSI) AMERICAN NATIONAL STANDARD INSTITUTE • Assigns standard numeric values to letters, numerals, punctuation marks, and other characters used in computers • Every character is a unique ASCII code. • The ASCII code for an uppercase A is 1000001. Extended ASCII
• Uses 8 bits and can represent 28 =256 characters
• Extended version of ASCII • Uses 8 bits for each character • Introduced by IBM in 1981 for use in its first PC • Extended ASCII represents the uppercase letter A as 01000001. • Does not include enough code combinations to support all written languages. EBCDIC (Extended Binary Coded Decimal Interchange Code ) • Extended BCD Interchange Code (pronounced ebb’-se-dick) • 8-bit code • Developed by IBM • Rarely used today • IBM mainframes only Unicode • Unicode is a Universal Encoding System (UES) • Uses sixteen bits and provides codes or 65,000 characters. • Can support all the written languages • Most common character-encoding system on the World Wide Web • Unicode assigns code to every character • The code is an integer value. Example • You can refer character map to see all the code for characters. • For example the code point of a (Latin small letter) is 0061 or U+0061. 9