Modern Cryptography
Modern Cryptography
Modern Cryptography
Human being from ages had two inherent needs − (a) to communicate and share information
and (b) to communicate selectively. These two needs gave rise to the art of coding the
messages in such a way that only the intended people could have access to the information.
Unauthorized people could not extract any information, even if the scrambled messages fell in
their hand.
The art and science of concealing the messages to introduce secrecy in information security is
recognized as cryptography.
The word ‘cryptography’ was coined by combining two Greek words, ‘Krypto’ meaning hidden
and ‘graphene’ meaning writing.
History of Cryptography
The art of cryptography is considered to be born along with the art of writing. As civilizations
evolved, human beings got organized in tribes, groups, and kingdoms. This led to the
emergence of ideas such as power, battles, supremacy, and politics. These ideas further fueled
the natural need of people to communicate secretly with selective recipient which in turn
ensured the continuous evolution of cryptography as well.
The first known evidence of cryptography can be traced to the use of ‘hieroglyph’. Some 4000
years ago, the Egyptians used to communicate by messages written in hieroglyph. This code
was the secret known only to the scribes who used to transmit messages on behalf of the kings.
One such hieroglyph is shown below.
1
Later, the scholars moved on to using simple mono-alphabetic substitution ciphers during 500 to
600 BC. This involved replacing alphabets of message with other alphabets with some secret
rule. This rule became a key to retrieve the message back from the garbled message.
The earlier Roman method of cryptography, popularly known as the Caesar Shift Cipher, relies
on shifting the letters of a message by an agreed number (three was a common choice), the
recipient of this message would then shift the letters back by the same number and obtain the
original message.
Steganography
Steganography is similar but adds another dimension to Cryptography. In this method, people
not only want to protect the secrecy of an information by concealing it, but they also want to
make sure any unauthorized person gets no evidence that the information even exists. For
example, invisible watermarking.
2
1.2 Evolution of Cryptography
It is during and after the European Renaissance, various Italian and Papal states led the rapid
proliferation of cryptographic techniques. Various analysis and attack techniques were
researched in this era to break the secret codes.
Improved coding techniques such as Vigenere Coding came into existence in the
15th century, which offered moving letters in the message with a number of variable
places instead of moving them the same number of places.
Only after the 19th century, cryptography evolved from the ad hoc approaches to
encryption to the more sophisticated art and science of information security.
In the early 20th century, the invention of mechanical and electromechanical machines,
such as the Enigma rotor machine, provided more advanced and efficient means of
coding the information.
During the period of World War II, both cryptography and cryptanalysis became
excessively mathematical.
With the advances taking place in this field, government organizations, military units, and some
corporate houses started adopting the applications of cryptography. They used cryptography to
guard their secrets from others. Now, the arrival of computers and the Internet has brought
effective cryptography within the reach of common people.
3
1.3 Cryptography Defined/Brief History
If you want to keep information secret, you have two possible strategies: hide the existence of
the information, or make the information unintelligible. Cryptography is the art and science of
keeping information secure from unintended audiences, of encrypting it. Conversely,
cryptanalysis is the art and science of breaking encoded data. The branch of mathematics
encompassing both cryptography and cryptanalysis is cryptology.
Modern cryptography uses sophisticated mathematical equations (algorithms) and secret keys
to encrypt and decrypt data.
Today, cryptography is used to provide secrecy and integrity to our data, and both
authentication and anonymity to our communications.
The increase in demand for cryptography was driven by industry interest (e.g., financial services
required secure electronic transactions and businesses needed to secure trade secrets stored
on computers), and individual interest (e.g., secure wireless communications). Digital
communications were obvious candidates for encryption.
4
1.4 Principles of Modern Cryptography
Modern cryptographers emphasize that security should not depend on the secrecy of the
encryption method (or algorithm), only the secrecy of the keys. The secret keys must not be
revealed when plaintext and ciphertext are compared, and no person should have knowledge of
the key. Modern algorithms are based on mathematically difficult problems - for example, prime
number factorization, discrete logarithms, etc. There is no mathematical proof that these
problems are in fact are hard, just empirical evidence.
There are three major characteristics that separate modern cryptography from the classical
approach.
5
Classic Cryptography Modern Cryptography
Context of Cryptography
Cryptography
Cryptanalysis
6
What is Cryptography?
Cryptography is the art and science of making a cryptosystem that is capable of providing
information security.
Cryptography deals with the actual securing of digital data. It refers to the design of
mechanisms based on mathematical algorithms that provide fundamental information security
services. You can think of cryptography as the establishment of a large toolkit containing
different techniques in security applications.
What is Cryptanalysis?
The art and science of breaking the cipher text is known as cryptanalysis.
Cryptanalysis is the sister branch of cryptography and they both co-exist. The cryptographic
process results in the cipher text for transmission or storage. It involves the study of
cryptographic mechanism with the intention to break them. Cryptanalysis is also used during the
design of the new cryptographic techniques to test their security strengths.
Note − Cryptography concerns with the design of cryptosystems, while cryptanalysis studies the
breaking of cryptosystems.
A model for much of what we will be discussing is captured, in very general terms, in
Figure 1.1. A message is to be transferred from one party to another across some sort of
Internet service.
Some secret information shared by the two principals and, it is hoped, unknown to the
opponent. An example is an encryption key used in conjunction with the transformation to
scramble the message before transmission and unscramble it on reception.
7
Figure 1.1 Model for Network Security
8
Figure 1.2 Network Access Security Model
Another type of unwanted access is the placement in a computer system of logic that
exploits vulnerabilities in the system and that can affect application programs as well as utility
programs, such as editors and compilers. Programs can present two kinds of threats:
• Information access threats: Intercept or modify data on behalf of users who should not have
access to that data.
• Service threats: Exploit service flaws in computers to inhibit use by legitimate users.
Viruses and worms are two examples of software attacks. Such attacks can be
introduced into a system by means of a disk that contains the unwanted logic concealed
in otherwise useful software.
The security mechanisms needed to cope with unwanted access fall into two
broad categories (see Figure 1.2).The first category might be termed a gatekeeper
function. It includes password-based login procedures that are designed to deny access
to all but authorized users and screening logic that is designed to detect and reject
worms, viruses, and other similar attacks. Once either an unwanted user or unwanted
software gains access,
The second line of defense consists of a variety of internal controls that monitor
activity and analyze stored information in an attempt to detect the presence of unwanted
intruders.
9
1.6. CLASSICAL ENCRYPTION TECHNIQUES
Symmetric encryption is a form of cryptosystem in which encryption and decryption are
performed using the same key. It is also known as conventional encryption.
• Symmetric encryption transforms plaintext into cipher text using a secret key and
an encryption algorithm. Using the same key and a decryption algorithm, the
plaintext is recovered from the cipher text.
• The two types of attack on an encryption algorithm are cryptanalysis, based on
properties of the encryption algorithm, and brute-force, which involves trying all
possible keys.
• Traditional (pre computer) symmetric ciphers use substitution and/or
transposition techniques. Substitution techniques map plaintext elements
(characters, bits) into cipher text elements. Transposition techniques
systematically transpose the positions of plaintext elements.
• Rotor machines are sophisticated pre computer hardware devices that use
substitution techniques.
• Steganography is a technique for hiding a secret message within a larger one in
such a way that others cannot discern the presence or contents of the hidden
message.
An original message is known as the plaintext, while the coded message is called
the cipher text. The process of converting from plaintext to cipher text is known as
enciphering or encryption; restoring the plaintext from the cipher text is deciphering or
decryption. The many schemes used for encryption constitute the area of study known
as cryptography.
Such a scheme is known as a cryptographic system or a cipher. Techniques
used for deciphering a message without any knowledge of the enciphering details fall
into the area of cryptanalysis. Cryptanalysis is what the layperson calls “breaking the
code. ”The areas of cryptography and cryptanalysis together are called cryptology.
10
1.7 SYMMETRIC CIPHER MODEL
A symmetric encryption scheme has five ingredients (Figure 1.3):
•Plaintext: This is the original intelligible message or data that is fed into the algorithm as
input.
•Encryption algorithm: The encryption algorithm performs various substitutions and
transformations on the plaintext.
•Secret key: The secret key is also input to the encryption algorithm. The key is a
value independent of the plaintext and of the algorithm. The algorithm will produce
a different output depending on the specific key being used at the time. The exact
substitutions and transformations performed by the algorithm
depend on the key.
• Cipher text: This is the scrambled message produced as output. It depends on the
plaintext and the secret key. For a given message, two different keys will produce two
different cipher texts. The cipher text is an apparently random stream of data and, as it
stands, is unintelligible.
• Decryption algorithm: This is essentially the encryption algorithm run in reverse. It
takes the cipher text and the secret key and produces the original plaintext.
11
PANIMALAR ENGINEERING COLLEGE
2. Sender and receiver must have obtained copies of the secret key in a secure fashion
and must keep the key secure. If someone can discover the key and knows the
algorithm, all communication using this key is readable.
With the message X and the encryption key K as input, the encryption algorithm
forms the cipher text Y=[Y1,Y2,…….YN] .We can write this as Y=E(K,X)This notation
indicates that is produced by using encryption algorithm E as a function of the plaintext
X , with the specific function determined by the value of the key K .
The intended receiver, in possession of the key, is able to invert the transformation:
X=D(K,Y)
12
An opponent, observing Y but not having access K to X or , may attempt to recover X or
K or both X and K . It is assumed that the opponent knows the encryption (E) and
decryption (D) algorithms. If the opponent is interested in only this particular message,
then the focus of the effort is to recover X by generating a plaintext estimate X. Often,
however, the opponent is interested in being able to read future messages as well, in
which case an attempt is made to recover K by generating an estimate K .
Cryptography
Cryptographic systems are characterized along three independent dimensions:
1. The type of operations used for transforming plaintext to cipher text. All
encryption algorithms are based on two general principles: substitution, in which each
element in the plaintext (bit, letter, group of bits or letters) is mapped into another
element, and transposition, in which elements in the plaintext are rearranged. The
fundamental requirement is that no information be lost (that is, that all operations are
reversible). Most systems, referred to as product systems, involve multiple stages of
substitutions and transpositions.
2. The number of keys used. If both sender and receiver use the same key, the system
is referred to as symmetric, single-key, secret-key, or conventional encryption. If the
sender and receiver use different keys, the system is referred to as asymmetric, two-
key, or public-key encryption.
3. The way in which the plaintext is processed. A block cipher processes the input one
block of elements at a time, producing an output block for each input block. A stream
cipher processes the input elements continuously, producing output one element at a
time, as it goes along. Cryptanalysis and Brute-Force Attack
Typically, the objective of attacking an encryption system is to recover the key in use
rather than simply to recover the plaintext of a single cipher text. There are two general
approaches to attacking a conventional encryption scheme:
• Cryptanalysis: Cryptanalytic attacks rely on the nature of the algorithm plus perhaps
some knowledge of the general characteristics of the plaintext or even some sample
plaintext– cipher text pairs. This type of attack exploits the characteristics of the
algorithm to attempt to deduce a specific plaintext or to deduce the key being used.
13
• Brute-force attack: The attacker tries every possible key on a piece of cipher text until
an intelligible translation into plaintext is obtained. On average, half of all possible keys
must be tried to achieve success.
Table 1.1 summarizes the various types of cryptanalytic attacks based on the amount of
information known to the cryptanalyst. The most difficult problem is presented when all
that is available is the cipher text only.
A brute-force attack involves trying every possible key until an intelligible translation of
the cipher text into plaintext is obtained.
14
1.8 SERVICES
X.800 divides these services into five categories and fourteen specific services (Table 1.2)
15
1.9 MECHANISMS
Table 1.3 lists the security mechanisms defined in X.800. The mechanisms are divided into
those that are implemented in a specific protocol layer, such as TCP or an application-layer
protocol, and those that are not specific to any particular protocol layer or security service
16
1.10 ATTACKS
The security attacks can be classified into two types passive attacks and active attacks.
A passive attack attempts to learn or make use of information from the system but does
not affect system resources. An active attack attempts to alter system resources or
affect their operation.
Passive Attacks
Two types of passive attacks are the release of message contents and traffic analysis.
The release of message contents is easily understood (Figure 1.5a).A telephone
conversation, an electronic mail message, and a transferred file may contain sensitive
or confidential information. We would like to prevent an opponent from learning the
contents of these transmissions.
A second type of passive attack, traffic analysis, is subtler (Figure 1.5b). Suppose that
we had a way of masking the contents of messages or other information traffic so that
opponents, even if they captured the message, could not extract the information from
the message. The common technique for masking contents is encryption. If we had
encryption protection in place, an opponent might still be able to observe the pattern of
these messages.
17
Passive attacks are very difficult to detect, because they do not involve any alteration
of the data. Typically, the message traffic is sent and received in an apparently normal
fashion, and neither the sender nor receiver is aware that a third party has read the
messages or observed the traffic pattern.
Active Attacks
Active attacks involve some modification of the data stream or the creation of a false
stream and can be subdivided into four categories: masquerade, replay, modification of
messages, and denial of service.
18
A masquerade takes place when one entity pretends to be a different entity (Figure 1.6
a). A masquerade attack usually includes one of the other forms of active attack. For
example, authentication sequences can be captured and replayed after a valid
authentication sequence has taken place, thus enabling an authorized entity with few
privileges to obtain extra privileges by impersonating an entity that has those privileges.
Replay involves the passive capture of a data unit and its subsequent retransmission to
produce an unauthorized effect (Figure 1.6 b).
Active attacks present the opposite characteristics of passive attacks. Whereas passive
attacks are difficult to detect, measures are available to prevent their success.
19
1.11 THE OSI SECURITY ARCHITECTURE
• Security attack: Any action that compromises the security of information owned by an
organization.
Table 1.4 provides definitions taken from RFC 2828, Internet Security Glossary.
Threat
A potential for violation of security, which exists when there is a circumstance,
capability, action, or event that could breach security and cause harm. That is, a threat
is a possible danger that might exploit vulnerability.
Attack
An assault on system security that derives from an intelligent threat; that is, an
intelligent act that is a deliberate attempt (especially in the sense of a method or
technique) to evade security services and violate the security policy of a system.
Table 1.4 Threats and Attacks (RFC 2828)
20
Part –A
1. Compare active attacks and passive attacks.
2. Define Steganography.
3. Define cryptanalysis.
4. Why Random numbers are used in Network Security?
5. List the four categories of security threats.
6. Define cryptography
7. Explain why Modular arithmetic has been used in cryptography.
8. Classify the basic functions used in encryption algorithms.
9. Describe security mechanism
10. Generalize why network need security.
11. Specify the four categories of security threats.
12. Define integrity and non repudiation.
13. Differentiate symmetric and asymmetric encryption?
14. Define cryptanalysis?
15. Define security mechanism
16. Define steganography
17. Why network need security?
18. Define confidentiality and authentication
19. Specify the basic task for defining a security service.
20. Define network security
21. Define computer security
22. List-out the types of attack in ceaser cipher
Part – B & C
1. Generalize the security services classifications and security mechanisms in detail.
2. List the different types of attacks and explain in detail.
3. Describe in detail about the types of cryptanalytic attack.
4. What is Steganography? Briefly examine any three techniques used.
5. With a neat block diagram, explain the network security model and the important parameters
associated with it.
6. Differentiate active and passive security attacks. Categorize these attacks and explain one
examples of each.
7. Discuss the following
a) Message Integrity
b) Denial of Service
c) Availability
d) Authentication
8. Experiment how the ITU-T3 Recommendation X.800, Security Architecture for OSI, defines such
a systematic approach.(15)
21
UNIT II: SYMMETRIC TECHNIQUES
Definition – Substitution ciphers – Transposition ciphers - Stream and block ciphers - A5, RC4 -
Characteristics of good ciphers - Data Encryption Standard (DES) – International Data
Encryption Algorithm – Advanced Encryption Standard – Block cipher modes of operation –
Confidentiality using symmetric encryption.
Note that the alphabet is wrapped around, so that the letter following Z is A. We can
define the transformation by listing all possibilities, as follows:
plain: abc defg h i j k l m n o p q r s t u v w x y z
cipher: D E F G H I J K L M N O P Q R S T U V W X Y Z A B C
Let us assign a numerical equivalent to each letter:
1When letters are involved; the following conventions are used in this book. Plaintext is
always in lowercase; cipher text is in uppercase; key values are in italicized lowercase.
Let us assign a numerical equivalent to each letter:
22
Then the algorithm can be expressed as follows. For each plaintext letter, substitute
the cipher text letter:
C = E(3, p) = (p + 3) mod 26
A shift may be of any amount, so that the general Caesar algorithm is
C = E(k, p) = (p + k) mod 26
where takes on a value in the range 1 to 25.The decryption algorithm is simply
p = D(k, C) = (C - k) mod 26
If it is known that a given cipher text is a Caesar cipher, then a brute-force cryptanalysis
is easily performed: simply try all the 25 possible keys. Three important characteristics
of this problem enabled us to use a brute force cryptanalysis:
1. The encryption and decryption algorithms are known.
2. There are only 25 keys to try.
3. The language of the plaintext is known and easily recognizable.
23
Mono alphabetic Ciphers
With only 25 possible keys, the Caesar cipher is far from secure. A dramatic
increase in the key space can be achieved by allowing an arbitrary substitution. A
permutation of a finite set of elements is an ordered sequence of all the elements of ,
with each element appearing exactly once. For example, if S = {a,b,c} , there are six
permutations of :
abc, acb, bac, bca, cab, cba
In general, there are n! permutations of a set of elements, because the first
element can be chosen in one of n ways, the second in n-1 ways, the third in n-2 ways,
and so on.
Recall the assignment for the Caesar cipher:
plain: a b c d e f g h I j kl m n o p q r s t u v w x y z
cipher: D E F G H I J K L M N O P Q R S T U V W X Y Z A B C
If, instead, the “cipher” line can be any permutation of the 26 alphabetic
characters, then there are 26! or greater than 4*10 26 possible keys. This is 10 orders of
magnitude greater than the key space for DES and would seem to eliminate brute-force
techniques for cryptanalysis. Such an approach is referred to as a mono alphabetic
substitution cipher, because a single cipher alphabet (mapping from plain alphabet to
cipher alphabet) is used per message.
The cipher text to be solved is
UZQSOVUOHXMOPVGPOZPEVSGZWSZOPFPESXUDBMET
SXAIZ
VUEPHZHMDZSHZOWSFPAPPDTSVPQUZWYMXUZUHSX
EPYEPOPDZSZUFPOMBZWPFUPZHMDJUDTMOHMQ
As a first step, the relative frequency of the letters can be determined and compared to
a standard frequency distribution for English, such as is shown in Figure 1.12. If the
message were long enough, this technique alone might be sufficient, but because this is
a relatively short message,
24
We cannot expect an exact match. In any case, the relative frequencies of the letters in the
cipher text (in percentages) are as follows:
That cipher letters P and Z are the equivalents of plain letters e and t, but it is not
certain which is which. The letters S,U,O, M, and H are all of relatively high frequency
and probably correspond to plain letters from the set {a, h, i, n, o, r, s}. The letters with
the lowest frequencies (namely ,A, B,G,Y, I, J) are likely included in the set {b, j, k, q, v,
x, z}.
25
A powerful tool is to look at the frequency of two-letter combinations, known as
diagrams. The most common such diagram is the. In our cipher text, the most common
diagram is ZW, which appears three times. So we make the correspondence of Z with t
and W with h. Then, by our earlier hypothesis, we can equate P with e. Now notice that
the sequence ZWP appears in the cipher text, and we can translate that sequence as
“the.”This is the most frequent trigram (three- letter combination). Next, notice the
sequence ZWSZ in the first line. We do not know that these four letters form a complete
word, but if they do, it is of the form that. If so, Sequates with a.
So far, then, we have
Only four letters have been identified, but already we have quite a bit of the message.
Continued analysis of frequencies plus trial and error should easily yield a solution from
this point. The complete plaintext, with spaces added between words, follows:
it was disclosed yesterday that several informal
but direct contacts have been made with political
representatives of the Viet Cong in Moscow
Mono alphabetic ciphers are easy to break because they reflect the frequency data of
the original alphabet. A countermeasure is to provide multiple substitutes, known as
homophones, for a single letter.
Play fair Cipher
The best-known multiple-letter encryption cipher is the Play fair, which treats
diagrams in the plaintext as single units and translates these units into cipher text
diagrams. The Play fair algorithm is based on the use of a 5 × 5 matrix of letters
constructed using a keyword. Here is an example, solved by Lord Peter Wimsey in
26
Dorothy Sayers‟s Have His Carcase
In this case, the keyword is monarchy. The matrix is constructed by filling in the letters
of the keyword (minus duplicates) from left to right and from top to bottom, and then
filling in the remainder of the matrix with the remaining letters in alphabetic order. The
letters I and J count as one letter. Plaintext is encrypted two letters at a time, according
to the following rules:
1. Repeating plaintext letters that are in the same pair are separated with a filler letter,
such as
x, so that balloon would be treated as ba lx lo on.
2. Two plaintext letters that fall in the same row of the matrix are each replaced by the
letter to the right, with the first element of the row circularly following the last. For
example, ar is encrypted as RM.
3. Two plaintext letters that fall in the same column are each replaced by the letter
beneath, with the top element of the column circularly following the last. For example,
mu is encrypted as CM.
4. Otherwise, each plaintext letter in a pair is replaced by the letter that lies in its own
row and the column occupied by the other plaintext letter. Thus, hs becomes BP and ea
becomes IM (or JM, as the en cipher er wishes).
The Playfair cipher is a great advance over simple mono alphabetic ciphers. For one
thing, whereas there are only 26 letters, there are 26 × 26 = 676 diagrams, so that
identification of individual diagrams is more difficult. Furthermore, the relative
frequencies of individual letters exhibit a much greater range than that of diagrams,
making frequency analysis much more difficult. For these reasons, the Play fair cipher
was for a long time considered unbreakable. It was used as the standard field system
27
by the British Army in World War I and still enjoyed considerable use by the U.S. Army
and other Allied forces during World War II.
Hill Cipher
Another interesting multi letter cipher is the Hill cipher, developed by the
mathematician Lester Hill in 1929. Define the inverse M-1of a square matrix M by the
equation M(M-1)= M-1M=I, where I is the identity matrix. I is a square matrix that is all
diagonal from upper left to lower right. The inverse of a matrix does not always exist,
but when it does, it satisfies the preceding equation. For example,
To explain how the inverse of a matrix is computed, we begin by with the concept
of determinant. For any square matrix (m × m), the determinant equals the sum of all
the products that can be formed by taking exactly one element from each row and
exactly one element from each column, with certain of the product terms preceded by a
minus sign. For a 2 × 2 matrix,
The determinant is k11k22 -k12k21. For a 3×3 matrix, the value of the determinant is
.k11k22k33 + k21k32k13 + k31k12k23 - k31k22k13 - k21k12k33 - k11k32k23. If a square matrix A has a
nonzero determinant, then the inverse of the matrix is computed as [A -1]ij=(det A)-1 (-1)i+j
(Dij)where (Dij ) is the sub determinant formed by deleting the jth row and the ith column
of A, det(A) is the determinant of A, and (det A)-1 is the multiplicative inverse of (det A)
mod 26.
28
Continuing our example,
or
C = PK mod 26
Where C and P are row vectors of length 3 representing the plaintext and cipher text, and K is
a3
×3 matrix representing the encryption key. Operations are performed mod 26.For
example, consider the plaintext “pay more money” and use the encryption Key
29
As with Play fair, the strength of the Hill cipher is that it completely hides single-
letter frequencies. Indeed, with Hill, the use of a larger matrix hides more frequency
information. Thus, a 3 ×3 Hill cipher hides not only single-letter but also two-letter
frequency information.
Consider this example. Suppose that the plaintext “hill cipher” is encrypted using
a Hill cipher to yield the cipher text HCRZSSXNSP.Thus, we know that (78) Kmod26=
(72)11 11) K mod26=(17 25); and so on. Using the first two plaintext–cipher text pairs,
we have
30
Poly alphabetic Ciphers
Another way to improve on the simple mono alphabetic technique is to use
different mono alphabetic substitutions as one proceeds through the plaintext message.
The general name for this approach is poly alphabetic substitution cipher. All these
techniques have the following features in common:
VIGEN`ERE CIPHER The best known, and one of the simplest, poly alphabetic ciphers
is the Vigenère cipher. In this scheme, the set of related mono alphabetic substitution
rules consists of the 26 Caesar ciphers with shifts of 0 through 25. Each cipher is
denoted by a key letter, which is the cipher text letter that substitutes for the plaintext
letter a. Thus, a Caesar cipher with a shift of 3 is denoted by the key value .
Express the Vigenère cipher in the following manner. Assume a sequence of
plaintext letters and a key consisting of the sequence of letters , where typically < .The
sequence of cipher text letters is calculated as follows
Thus, the first letter of the key is added to the first letter of the plaintext, mod 26, the
second letters are added, and so on through the first letters of the plaintext. For the next
letters of the plaintext, the key letters are repeated. This process continues until all of
the plaintext sequence is encrypted. A general equation of the encryption process is
31
To encrypt a message, a key is needed that is as long as the message. Usually,
the key is a repeating keyword. For example, if the keyword is deceptive, the message
“we are discovered save you” is encrypted as
key: deceptivedeceptivedeceptive
plaintext:
wearediscoveredsaveyourself
ciphertext: ZICVTWQNGRZGVTWAVZHCQYGLMGJ
All the techniques examined so far involve the substitution of a cipher text symbol
for a plaintext symbol. A very different kind of mapping is achieved by performing some
sort of permutation on the plaintext letters. This technique is referred to as a
transposition cipher. The simplest such cipher is the rail fence technique, in which the
plaintext is written down as a sequence of diagonals and then read off as a sequence of
rows. For example, to encipher the message “meet me after the toga party” with a rail
fence of depth 2, we write the following:
mematrhtgpryetefet
eoaat
is
This sort of thing would be trivial to cryptanalyze. A more complex scheme is to
write the message in a rectangle, row by row, and read the message off, column by
column, but permute the order of the columns. The order of the columns then becomes
the key to the algorithm. For example,
32
Thus, in this example, the key is 4312567.To encrypt, start with the column that
is labeled 1, in this case column 3.Write down all the letters in that column. Proceed to
column 4, which is labeled 2, then column 2, then column 1, then columns 5, 6, and 7.A
pure transposition cipher is easily recognized because it has the same letter
frequencies as the original plaintext.
To visualize the result of this double transposition, designate the letters in the
original plaintext message by the numbers designating their position. Thus, with 28
letters in the message, the original sequence of letters is
01 02 03 04 05 06 07 08 09 10 11 12 13 14
15 16 17 18 19 20 21 22 23 24 25 26 27 28
After the first transposition, we have
03 10 17 24 04 11 18 25 02 09 16 23 01 08
15 22 05 12 19 26 06 13 20 27 07 14 21 28
MEMATRHTGPRYETEFETEOAAT
message is used to convey the hidden message.
33
34
2. 3 Block Ciphers
• A block cipher is an encryption/decryption scheme in which a block of plaintext is treated as a
wholeand used to produce a cipher text block of equal length.
• A stream cipher is one that encrypts a digital data stream one bit or one byte at a time. Examples
of classical stream ciphers are the auto keyed Vigenère cipher and the Vernam cipher.
• A block cipher is one in which a block of plaintext is treated as a whole and used to produce a cipher
text block of equal length
• Many block ciphers have a Feistel structure. Such a structure consists of a number of identical
roundsof processing. In each round, a substitution is performed on one half of the data being
processed, followed by a permutation that interchanges the two halves. The original key is expanded so
that a different key is used for each round.
.
Block Cipher Principles
Substitution (S-box)
A binary word is replaced by some other binary word
Whole substitution function forms the key
If use n bit words, The key space is 2n!
Permutation (P-box)
A binary word has its bits reordered (permuted)
The re-ordering forms the key
If we use n bit words,
The key space is n! (Less secure than substitution)
Substitution-permutation Network:
Shannon combined these two primitives
He called these mixing transformations
34
A special form of product ciphers where
o S-boxes -Provide confusion of input bits
o P-boxes -Provide diffusion across s-box inputs
Confusion and Diffusion:
More practically Shannon suggested combining elements to obtain:
Diffusion
The statistical structure of plaintext is dissipated into long range statistics of cipher text. This is
achieved by having each plaintext digit affect the value of many ciphertext digits;
Generally this is equivalent to having each cipher text digit be affected by many plaintext digits
Confusion
Seeks to make the relationship between the statistics of the cipher text and the value of the
encryption key as complex as possible, again to thwart attempts to discover the key.
Thus, even if the attacker can get some handle on the statistics of the cipher text, the way in
which the key was used to produce that cipher text is so complex as to make it difficult to
deduce the key.
36
2.4 Stream Cipher
A stream cipher is a cipher that encrypts (and decrypts) with the flow — the data flow, that is.
Unlike block ciphers, which require the formation of blocks prior to encryption, stream ciphers
encrypt data in long, pseudorandom streams. Basically, this means you can process one bit of
data at a time instead of waiting for a data block to form.
In their book “Domain Specific High-Level Synthesis for Cryptographic Workloads,” Ayesha Khalid,
Goutam Paul and Anupam Chattopadhyay describe stream ciphers as finite state machines, or
FSMs, and stateful ciphers. The reason for the first description (FSM) is because these ciphers
take in plaintext bits of data one by one and spit out ciphertext symbols in the same way. The
reason they’re also called stateful ciphers is because they rely on their internal state for their
keystream functions.
But did you know there are actually two types of stream ciphers?
Synchronous stream ciphers (aka key auto-key, or KAK) — These types of ciphers generate
keystreams independently of any previous plaintext or ciphertexts.
So, how does a stream cipher actually work in a more technical sense? As mentioned earlier, a
stream cipher encrypts data one bit at a time instead of in blocks. But a key part of this process is
generating a stream of pseudorandom bits based on an encryption key and a seed, aka a nonce
(a unique randomly generated number — “nonce” = number-only-used-once). Together, they
create a keystream (that stream of pseudorandom bits we just mentioned) that gets XORed with
your plaintext input, which encrypts it and results in your ciphertext output.
37
This rinse-and-repeat process happens over and over again with each bit of plaintext data.
However, it’s important to not use the same exact key-nonce combination again because it can
result in a duplicate keystream.
Feeling a bit confused? Here’s a visual of the process to provide a little clarity:
Stream cipher example: This is a basic illustration to showcase the basic operations of a stream
cipher.
To better understand this, let’s walk through an example. Let’s consider an alternative phrase to
the example we used earlier: “For the Horde!” (See? I haven’t forgotten you guys.) The binary
version of this plaintext message would look like this:
010001100110111101110010001000000111010001101000011001010010000001001000011011
1101110010011001000110010100100001
38
So, if we use a stream cipher to encrypt the message, we’ll do so one bit at a time. So, the
plaintext data would be broken down into individual bits, which would look something like this:
Block cipher vs stream cipher graphic: This image illustrates a message being divided up into
individual bits for encryption.
Yeah, take a moment to let your eyes adjust after reading that, uh, colorful graphic.
So, the next step here would be to generate the sequence of pseudorandom bits (i.e., the
keystream). This is typically based on an internal state.
Once you have your keystream, you XOR the pseudorandom bits with the binary of your plaintext
message. Thinking back to the XOR table from earlier, this means that if the first bit of the plaintext
message was 1 and the first bit for the keystream was a 0, then the output would be a 1. If the
second bit of the plaintext message was another 1 and the first bit of the keystream was also a 1,
then the output would also be a 0. Get it? Yeah, that’s basically how a stream cipher works.
Stream algorithms are faster and more efficient than block ciphers because they’re encrypting only
one bit of data at a time into individual symbols rather than entire blocks. So, they’re better suited
for devices that have fewer resources. Also, as a result of this single-bit-of-data approach, it
means that if there’s an error in one symbol, it’ll be less likely to affect the next. However, some
stream ciphers are vulnerable to bit-flipping attacks and key reuse attacks.
A very basic example of a stream cipher is the Caesar cipher, which is a cipher that that
39
substitutes one character with another individually. But that’s a really archaic and outdated
example of a cipher. So, what sorts of stream ciphers can we find in use today?
A5/1 is a stream cipher used to provide over-the-air communication privacy in the GSM cellular telephone
standard. It is one of several implementations of the A5 security protocol. It was initially kept secret, but
became public knowledge through leaks and reverse engineering. A number of serious weaknesses in the
cipher have been identified.
A5/1 is used in Europe and the United States. A5/2 was a deliberate weakening of the algorithm for certain
export regions.[1] A5/1 was developed in 1987, when GSM was not yet considered for use outside Europe,
and A5/2 was developed in 1989. Though both were initially kept secret, the general design was leaked in
1994 and the algorithms were entirely reverse engineered in 1999 by Marc Briceno from a GSM telephone.
In 2000, around 130 million GSM customers relied on A5/1 to protect the confidentiality of their voice
communications.[citation needed]
Security researcher Ross Anderson reported in 1994 that "there was a terrific row between the NATO
signal intelligence agencies in the mid-1980s over whether GSM encryption should be strong or not. The
Germans said it should be, as they shared a long border with the Warsaw Pact; but the other countries
didn't feel this way, and the algorithm as now fielded is a French design.
A GSM transmission is organised as sequences of bursts. In a typical channel and in one direction, one
burst is sent every 4.615 milliseconds and contains 114 bits available for information. A5/1 is used to
produce for each burst a 114 bit sequence of keystream which is XORed with the 114 bits prior to
modulation. A5/1 is initialised using a 64-bit key together with a publicly known 22-bit frame number. Older
fielded GSM implementations using Comp128v1 for key generation, had 10 of the key bits fixed at zero,
resulting in an effective key length of 54 bits. This weakness was rectified with the introduction of
Comp128v3 which yields proper 64 bits keys. When operating in GPRS / EDGE mode, higher bandwidth
40
radio modulation allows for larger 348 bits frames, and A5/3 is then used in a stream cipher mode to
maintain confidentiality.
The A5/1 stream cipher uses three LFSRs. A register is clocked if its clocking bit (orange) agrees with the
clocking bit of one or both of the other two registers.
A5/1 is based around a combination of three linear-feedback shift registers (LFSRs) with irregular clocking.
The bits are indexed with the least significant bit (LSB) as 0.
The registers are clocked in a stop/go fashion using a majority rule. Each register has an associated
clocking bit. At each cycle, the clocking bit of all three registers is examined and the majority bit is
determined. A register is clocked if the clocking bit agrees with the majority bit. Hence at each step at least
two or three registers are clocked, and each register steps with probability 3/4.
RC4 is a stream cipher and variable-length key algorithm. This algorithm encrypts one byte at a time (or
larger units at a time). A key input is a pseudorandom bit generator that produces a stream 8-bit number
that is unpredictable without knowledge of input key, The output of the generator is called key-stream, is
combined one byte at a time with the plaintext stream cipher using X-OR operation.
Key-Generation Algorithm – A variable-length key from 1 to 256 bytes is used to initialize a 256-byte state
vector S, with elements S[0] to S[255]. For encryption and decryption, a byte k is generated from S by
selecting one of the 255 entries in a systematic fashion, then the entries in S are permuted again.
41
Key-Scheduling Algorithm: Initialization: The entries of S are set equal to the values from 0 to 255 in
ascending order, a temporary vector T, is created. If the length of the key k is 256 bytes, then k is assigned
to T. Otherwise, for a key with length(k-len) bytes, the first k-len elements of T as copied from K, and then K
is repeated as many times as necessary to fill T.
we use T to produce the initial permutation of S. Starting with S[0] to S[255], and for each S[i] algorithm
swap it with another byte in S according to a scheme dictated by T[i], but S will still contain values from 0 to
255 :
Pseudo random generation algorithm (Stream Generation): Once the vector S is initialized, the input key
will not be used. In this step, for each S[i] algorithm swap it with another byte in S according to a scheme
dictated by the current configuration of S. After reaching S[255] the process continues, starting from S[0]
again
Symmetric key algorithm: RC4 is a symmetric key encryption algorithm, which means that the same key is
used for encryption and decryption.
Stream cipher algorithm: RC4 is a stream cipher algorithm, which means that it encrypts and decrypts data
one byte at a time. It generates a key stream of pseudorandom bits that are XORed with the plaintext to
produce the ciphertext.
42
Variable key size: RC4 supports variable key sizes, from 40 bits to 2048 bits, making it flexible for different
security requirements.
Fast and efficient: RC4 is a fast and efficient encryption algorithm that is suitable for low-power devices and
applications that require high-speed data transmission.
Widely used: RC4 has been widely used in various applications, including wireless networks, secure
sockets layer (SSL), virtual private networks (VPN), and file encryption.
Vulnerabilities: RC4 has several vulnerabilities, including a bias in the first few bytes of the keystream,
which can be exploited to recover the key. As a result, RC4 is no longer recommended for use in new
applications
Advantages:
Fast and efficient: RC4 is a very fast and efficient encryption algorithm, which makes it suitable for use in
applications where speed and efficiency are critical.
Simple to implement: RC4 is a relatively simple algorithm to implement, which means that it can be easily
implemented in software or hardware.
Variable key size: RC4 supports variable key sizes, which makes it flexible and adaptable for different
security requirements.
Widely used: RC4 has been widely used in various applications, including wireless networks, secure
sockets layer (SSL), virtual private networks (VPN), and file encryption.
Disadvantages:
Vulnerabilities: RC4 has several known vulnerabilities that make it unsuitable for new applications. For
example, there is a bias in the first few bytes of the keystream, which can be exploited to recover the key.
Security weaknesses: RC4 has some inherent weaknesses in its design, which make it less secure than
other encryption algorithms, such as AES or ChaCha20.
Limited key length: The maximum key length for RC4 is 2048 bits, which may not be sufficient for some
applications that require stronger encryption.
Not recommended for new applications: Due to its vulnerabilities and weaknesses, RC4 is no longer
recommended for use in new applications. Other more secure stream cipher algorithms, such as AES-CTR
or ChaCha20, should be used instead.
43
2.6 Modes of Operation
Block ciphers encrypt fixed size blocks
Eg. DES encrypts 64-bit blocks, with 56-bit key
Need way to use in practice, given usually have arbitrary amount of information to
encrypt
Four were defined for DES in ANSI standard ANSI X3.106-1983 Modes of Use
Have block and stream modes
44
Cipher Block Chaining (CBC)
Message is broken into blocks
But these are linked together in the encryption operation
Each previous cipher blocks is chained with current plaintext block (hence name)
Use Initial Vector (IV) to startprocess
Ci= DESK1(Pi XOR Ci-1)
C-1 = IV
Uses: bulk data encryption, authentication
45
Message Padding:
At end of message, handle possible last short block which is not as large as block size
of cipher.
Pad either with known non-data value (eg nulls)
Or pad last block with count of pad size
Eg. [ b1 b2 b3 0 0 0 0 5]
Means have 3 data bytes, then 5 bytes pad + count
46
Output Feed Back (OFB)
Message is treated as a stream of bits
Output of cipher is added to message
Output is then feedback (hence name)
Feedback is independent of message
Can be computed in advance
Ci = PiXOR Oi
Oi = DESK1(Oi-1)
O-1 = IV
Uses: stream encryption on noisy channels
Advantages and Limitations of OFB
Used when error feedback a problem or where need to encryptions before message is
available
More vulnerable to message stream modification
But feedback is from the output of cipher and is independent of message
A variation of a vernam cipher
Hence must never reuse the same sequence (key + IV)
Sender and receiver must remain in sync, and some recovery method is needed to
ensure this occurs
Originally specified with m-bit feedback in the standards
Subsequent research has shown that only full block feedback (i.e., CFB-64 or CFB-
128) should ever be used
Subsequent research has shown that only full block feedback (i.e., CFB-64 or CFB-
128) should ever be used
47
Counter (CTR)
A “new” mode, though proposed early on
Similar to OFB but encrypts counter value rather than any feedback value
Must have a different key & counter value for every plaintext block (never reused)
Ci = Pi XOR Oi
Oi = DESK1(i)
Uses: high-speed network encryptions
48
2.7 SIMPLIFIED DES
Encryption takes an 8-bit block plaintext, a 10 –bit key and produces an
8-bit block of cipher text
Decryption takes the 8-bit block of cipher text, the same 10-bit key and
produces the original 8-bit block of plaintext
The encryption algorithm involves five functions
IP – an initial permutation
fk - a complex function, which involves both permutation and substitution
-1
IP – inverse initialpermutation
Cipher text = IP
-1
(fK2(SW(fK1(IP(Plaintext)))))
Plaintext = IP
-1
(fK1(SW(fK2(IP(Cipher text)))))
49
ES Key Generation:
50
The key is first subjected to a permutation (P10).
Divide the output of P10 in to two halves and perform circular left shift one bit position (LS-1) on the two
halves and then passes through a permutation function (P8) that produces an 8-bit output for the first
subkey (K1).
The output of the shift operation is given as input into another shift
(LS-2)(left shift by 2 bits on the two halves of the data) and another instance of (P8) to produce the
second subkey (K2)
K1 = P8(Shift(P10(key)))
K2 = P8(Shift(Shift(P10(key)
The first and fourth bits are treated as a 2 bit number that specify a row of the s-box and the second and
third bits specify a column of the S-boxes.
Example
Plain text –
10111101Key –
1010000010
Key Generation
Applying circular left shift by 1 bit on two 5 bits part, we get: 00001 11000
51
Steps for S-DES encryption
Inverse operation for this cipher text will give plain text.
52
2.8 Data Encryption Standard (DES)
The most widely used encryption scheme is based on the Data Encryption Standard (DES)adopted in 1977 by the
National Bureau of Standards, now the National Institute of Standards and Technology (NIST), as Federal
Information Processing Standard 46 (FIPS PUB 46).
The algorithm itself is referred to as the Data Encryption Algorithm (DEA).
For DES, data are encrypted in 64-bit blocks using a 56-bit key.
The same steps, with the same key, are used to reverse the encryption.
DES Encryption:
The basic process consists of:
An initial permutation (IP)
16 rounds of a complex key dependent calculation f
A final permutation, being the inverse of IP
DES key schedule: (Operation on key)
The bits are numbered from 1 to 64, every 8 bit is ignored.
64 bit key is used as an input to the algorithm.
Forms sub keys used in eachround
Consists of:
Initial permutation of the key (PC1) which selects 56-bits in two 28-bit halves C0 and D0
16 stages consisting of:
At each round, the two halves are separately subjected to circular shift or
rotation of 1 or 2 bits.
These shifted values serve as input to the next round.
They also serve as input to permuted choice 2 (PC2), which produces a 48 bit
output that serves as input to the function F (Ri-1 , Ki )
General depiction of DES encryption algorithm
53
Initial Permutation IP:
This is the first step of the data computation
IP reorders the input data bits& it changes the even bits to LH half, odd bits to RH half
The initial permutation and its inverse are defined by tables. The tables are to be interpreted as follows.
The input to a table consists of 64 bits numbered left to right from 1 to 64. The 64 entries in the
permutation table contain a permutation of the numbers from 1 to 64. Each entry in the permutation table
indicates the position of a numbered input bit in the output, which also consists of 64 bits.
Note that the bit numbering for DES reflects IBM mainframe practice, and is the opposite of what we now
mostly use - so be careful! Numbers from Bit 1 (leftmost, most significant) to bit 32/48/64 etc (rightmost,
least significant).
For example, a 64-bit plaintext value of “675a6967 5e5a6b5a” (written in left & right halves) after
permuting with IP becomes “ffb2194d 004df6fb”. Note that example values are specified using
hexadecimal.
Li = Ri–1
Substitution Boxes S:
Substitution has eight S-boxes, each of which accepts 6 bits as input and produces 4 bitsas
output.
Outer bits 1 & 6 (row bits) select one row of 4. Thefirst and last bits of the input box Si form a
2 bit binary number to select one of four substitutions defined by the four rows inthe table Si.
Inner bits 2-5 (column bits) are substituted. Themiddle four bits select one of the 16
columns.
The substitution consists of a set of eight S-boxes, each of which accepts 6 bits as input and
produces 4 bits as output. The first and last bits of the input to box Si form a 2-bit binary
number to select one of four substitutions defined by the four rows in the table forSi. The
middle four bits select one of the sixteen columns. The decimal value in the cell selected by
the row and column is then converted to its 4-bit representation to produce
the output. For example, in S1, for input 011001, the row is 01 (row 1) and the column
is1100 (column 12). The value in row 1, column 12 is 9, so the output is 1001.
The example lists 8 6-bit values (ie 18 in hex is 011000 in binary, 09 hex is 001001 binary,
12 hex is 010010 binary, 3d hex is 111101 binary etc), each of which is replacedfollowing
the process detailed above using the appropriate S-box. ie
S1(011000) lookup row 00 col 1100 in S1 to get 5
S2(001001) lookup row 01 col 0100 in S2 to get 15 = f in hex
S3(010010) lookup row 00 col 1001 in S3 to get 13 = d in hex
S4(111101) lookup row 11 col 1110 in S4 to get 2 etc
55
Calculation of F(R,K):
The R input is first expanded to 48 bits by using expansion table E that defines a permutation plus an
expansion that involves duplication of 16 of the R bits . The resulting 48 bits are XORed with key Ki . This
48-bit result passes through a substitution function comprising 8 S-boxes which each map 6 input bits to 4
output bits, producing a 32-bit output, which is then permuted by permutation P .
56
1. Key transformation: 56-bit key is divided into two halves. Each of 28 bits are circularly shifted one or
two positions based on the round. After appropriate shifts 48 bits of 56 bits are selected as key for each
round.
2. Expansion permutation: In this step Right Plain text is expanded from 32bits to 48 bits. 48 bit key is
XOR with 48 bit Right Plain text and resulting output is given to next step.
3. Substitution Boxes S: S-box substitution is a process that accepts 48-bit input from XOR operation and
produces 32 bit output.
4. Permutation: The output of s-box consists of 32 bits. These 32 bits are permuted using p-box.
5. XOR and swap: All the above operations are performed only on 32 bits Right Plain text. Now Left Plain
text is XOR with p-box output. The result of XOR operation becomes the new right half. The old right half
becomes the new left half.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32
Right Half i-1
31 2 3 4 5 4 5 6 7 8 9 8 9 1111111111111122222222222222223331
2 012323456767890101234545678989012
Round Key i
O
+ 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48
input symbol input symbol input symbol input symbol input symbol input symbol input symbol input symbol
control
control
control
control
control
control
control
control
S1 S2 S3 S4 S5 S6 S7 S8
output symbol output symbol output symbol output symbol output symbol output symbol output symbol output symbol
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32
12 3 45 6 78 9 11111111112222222222333
01234567890123456789012
16 7 20 21 29 12 28 17 1 15 23 26 5 18 31 10 2 8 24 14 32 27 3 9 19 13 30 6 22 11 4 25
57
Left Half i-1
O
+ 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32
Right Half i
12 3 45 6 78 9 11111111112222222222333
01234567890123456789012
DES Decryption:
With Feistel design, decryption uses the same algorithm as encryption, except that the
application of sub keys is reversed (SK16 … SK1)
Avalanche Effect:
In particular, a change in one bit of the plaintext or one bit of the key should
produce a change in many bits of the ciphertext.
If the change were small, this might produce a way to reduce the size of the plaintext or
keyspace to be searched.
Byte Substitution:
A simple substitution of each byte
AES defines 16x16 matrix of byte values containing a permutation of all 256 8-bit values
Each individual byte of state is mapped into a new byte in the following way:
row (left most 4-bits) & column (right most 4-bits)
Eg. Byte {95} is replaced by row 9 col 5 byte
which has the value {2A}
Shift Rows:
A circular byte shift ineach
1st row is unchanged
2nd row does 1 byte circular shift to left
3rd row does 2 byte circular shift to left
4th row does 3 byte circular shift to left
Decrypt does shifts to right
Mix Columns:
Each column is processed separately
Each byte is mapped into a new value that is a function of all 4 bytes in the column.
Effectively a matrix multiplication in GF(28) using prime poly m(x) =x8+x4+x3+x+1
Can express each col as 4 equations
To derive each new byte in col
In GF(28) addition is bitwise XOR operation and that multiplication can
be performed according to the rule.
Decryption requires use of inverse matrix
With larger coefficients, hence a little harder
61
Add Round Key:
Lastly is the Add Round Key stage, in which the 128 bits of state are bitwise XOR ed with the
128 bits of the round key.
The first matrix is state and the 2nd matrix is the round key.
Inverse for decryption is identical since XOR is own inverse, just with correct round key.
AES Round
62
AES Key Expansion
Takes 4 word (16-byte) key and expands into array of 44 words (156 bytes)
63
AES Decryption
64
Before examining some of these more recent topics, we concentrate in this chapter on the
use of symmetric encryption to provide confidentiality.
We begin with a discussion of the location of encryption logic; the main choice
here is between what are known as link encryption and end-to-end encryption. Next, we look
at the use of encryption to counter traffic analysis attacks. Then we discuss the difficult problem
of key distribution. Finally, we discuss the principles underlying an important toolin providing a
confidentiality facility: random number generation.
There are a large number of locations at which an attack can occur. Furthermore, for wide
area communications, many of these locations are not under the physical control of the end user.
64
Even in the case of local area networks, in which physical security measures are possible,
there is always the threat of the disgruntled employee.
The most powerful and most common approach to securing the points of vulnerability
highlighted in the preceding section is encryption. If encryption is to be used to counter these
attacks, then we need to decide what to encrypt and where the encryption gear should be
located. There are two fundamental alternatives: link encryption and end-to-end encryption.
Basic Approaches
Link to Link Encryption:
With link encryption, each vulnerable communications link is equipped on both ends with
an encryption device. Thus, all traffic over all communications links is secured. One of its
disadvantages is that the message must be decrypted each time it enters a switch because the
switch must read the address (logical connection number) in the packet header in order to route
the frame. Thus, the message is vulnerable at each switch. If working with a public network, the
user has no control over the security of the nodes.
Several implications of link encryption should be noted. For this strategy to be effective, all
the potential links in a path from source to destination must use link encryption. Each pair of
nodes that share a link should share a unique key, with a different key used on each link.
65
End-To-End Encryption
With end-to-end encryption, the encryption process is carried out at the two end systems.
The source host or terminal encrypts the data. The data in encrypted form are then transmitted
unaltered across the network to the destination terminal or host. The destination shares a key
with the source and so is able to decrypt the data. This plan seems to secure the transmission
against attacks on the network links or switches. Thus, end-to-end encryption relieves the end
user of concerns about the degree of security of networks and links that support the
communication. There is, however, still a weak spot.
Consider the following situation. A host connects to a frame relay or ATM network, sets
up a logical connection to another host, and is prepared to transfer data to that other host by
using end-to-end encryption. Data are transmitted over such a network in the form of packets that
consist of a header and some user data. What part of each packet will the host encrypt? Suppose
that the host encrypts the entire packet, including the header. This will not work because,
remember, only the other host can perform the decryption. The frame relay or ATM switch will
receive an encrypted packet and be unable to read the header. Therefore, it will not be able to
route the packet. It follows that the host may encrypt only the user data portion of the packet and
must leave the header in the clear.
Thus, with end-to-end encryption, the user data are secure. However, the traffic pattern is
not, because packet headers are transmitted in the clear. On the other hand, end-to-end encryption
does provide a degree of authentication. If two end systems share an encryption key, then a recipient
is assured that any message that it receives comes from the alleged sender, because only that
sender shares the relevant key. Such authentication is not inherent in a link encryption scheme.
To achieve greater security, both link and end-to-end encryptions a r e needed, as is shown in
Figure. When both forms of encryption are employed, the host encrypts the user data portion of a
packet using an end-to-end encryption key. The entire packet is then encrypted using a link
encryption key. As the packet traverses the network, each switch decrypts the packet,
using a link encryption key to read the header, and then encrypts the entire packet again for
sending it out on the next link. Now the entire packet is secure except for the time that the packet
is actually in the memory of a packet switch, at which time the packet header is in the clear.
66
Link Encryption End-to-End Encryption
Link encryption encrypts all the data along a end-to-end encryption, the headers,
specific communication path. Not only is the addresses, routing, and trailer information are
user information encrypted, but the header, not encrypted, enabling attackers to learn
trailers, addresses, and routing data that more about a captured packet and where it
are is
part of the packets are also encrypted. headed.
All data are encrypted, including headers, Headers, addresses, and routing
information
addresses, and routing information.
are not encrypted, and therefore not
protected.
It works at a lower layer in the OSI model. It works at Network layer.
All of the information is encrypted, and the The packets do not need to be decrypted and
packets must be decrypted at each hop so then encrypted again at each hop, because
the router, or other intermediate device, the headers and trailers are not encrypted.
knows where to send the packet next.
67
Logical Placement of End-to-End Encryption Function
With link encryption, the encryption function is performed at a low level of the
communications hierarchy i.e. physical or link layers.
For end-to-end encryption, several choices are possible for the logical placement of the
encryption function. At the lowest practical level, the encryption function could be performed at
the network layer.
With network-layer encryption, Each end system can engage in an encrypted exchange
with another end system if the two share a secret key. All the user processes and applications
within each end system would employ the same encryption scheme with the same key to reach a
particular target end system.
Figure illustrates the issues involved. In this example, an electronic mail gateway is used to
interconnect an internetwork that uses a TCP/IP-based architecture. In such a configuration, there
is no end-to-end protocol below the application layer. The transport and network connections from
each end system terminate at the mail gateway, which sets up new transport and network
connections to link to the other end system. Even if both end systems use TCP/IP or OSI, there are
plenty of instances in actual configurations in which mail gateways sit between otherwise isolated
internetworks. Thus, for applications like electronic mail that have a store-and-forward capability,
the only place to achieve end-to-end encryption is at the application layer
68
With application-level encryption (Figure a), only the user data portion of a TCP segment is
encrypted. The TCP, IP, network-level, and link-level headers and link-level trailer are in the clear.
By contrast, if encryption is performed at the TCP level (Figure b), then, on a single end-to-end
connection, the user data and the TCP header are encrypted. The IP header remains in the clear
because it is needed by routers to route the IP datagram from source to destination. Note,
however, that if a message passes through a gateway, the TCP connection is terminated and a
new transport connection is opened for the next hop. Furthermore, the gateway is treated as
a destination by the underlying IP. Thus, the encrypted portions of the data unit are decrypted at
the gateway. If the next hop is over a TCP/IP network, then the user data and TCP header are
encrypted again before transmission. However, in the gateway itself the data unit is buffered
entirely in the clear. Finally, for link-level encryption (Figure c), the entire data unit except for the
link header and trailer is encrypted on each link, but the entire data unit is in the clear at each
router and gateway.
69
Traffic Confidentiality
The following types of information that can be derived from a traffic analysis attack:
Identities of partners
How frequently the partners are communicating
Message pattern, message length, or quantity of messages that suggest important
information is being exchanged
The events that correlate with special conversations between particular partners
Another concern related to traffic is the use of traffic patterns to create a covert
channel. Typically, the channel is used to transfer information in a way that violates a security
policy. For example, an employee may wish to communicate information to an outsider in a way
that is not detected by management and that requires simple eavesdropping on the part of the
outsider.
With the use of link encryption, network-layer headers (e.g., frame or cell header) are
encrypted, reducing the opportunity for traffic analysis. However, it is still possible in those
circumstances for an attacker to assess the amount of traffic on a network and to observe the
amount of traffic entering and leaving each end system. An effective countermeasure to this
attack is traffic padding, illustrated in Figure.
Traffic padding produces ciphertext output continuously, even in the absence of plaintext. A
continuous random data stream is generated. When plaintext is available, it is encrypted and
transmitted. When input plaintext is not present, random data are encrypted and transmitted. This
makes it impossible for an attacker to distinguish between true data flow and padding and
therefore impossible to deduce the amount of traffic.
70
One technique that might prove useful is to pad out data units to a uniform length at either
the transport or application level. In addition, null messages can be inserted randomly into the
stream. These tactics deny an opponent knowledge about the amount of data exchanged
between end users and obscure the underlying traffic pattern.
Key Distribution
For symmetric encryption to work, the two parties to an exchange must share the same
key, and that key must be protected from access by others. Furthermore, frequent key changes
are usually desirable to limit the amount of data compromised if an attacker learns the key.
Therefore, the term that refers to the means of delivering a key to two parties who wish to
exchange data, without allowing others to see the key. For two parties A and B, key distribution
can be achieved in a number of ways, as follows:
1. A can select a key and physically deliver it to B.
2. A third party can select the key and physically deliver it to A and B.
3. If A and B have previously and recently used a key, one party can transmit the new keyto
the other, encrypted using the old key.
4. If A and B each has an encrypted connection to a third party C, C can deliver a key onthe
encrypted links to A and B.
Physical delivery (1 & 2) is simplest - but only applicable when there is personal
contact between recipient and key issuer. This is fine for link encryption where devices & keys
occur in pairs, but does not scale as number of parties who wish to communicate grows. 3 is
mostly based on 1 or 2 occurring first.
A third party, whom all parties trust, can be used as a trusted intermediary to mediate the
establishment of secure communications between them (4). Must trust intermediary not to abuse
the knowledge of all session keys. As number of parties grow, some variant of 4 is only practical
solution to the huge growth in number of keys potentially needed.
The use of a key distribution center is based on the use of a hierarchy of keys. At a
minimum, two levels of keys are used.
Communication between end systems is encrypted using a temporary key, often referred toas
a session key.
Typically, the session key is used for the duration of a logical connection and thendiscarded
71
master key is shared by the key distribution center and an end system or user and used to
encrypt the session key.
Let us assume that user A wishes to establish a logical connection with B and requires a
one-time session key to protect the data transmitted over the connection. A has a master key, Ka,
known only to itself and the KDC; similarly, B shares the master key K b with the KDC. The
following steps occur:
1. A issues a request to the KDC for a session key to protect a logical connection to B. The
message includes the identity of A and B and a unique identifier, N 1, for this transaction,
which we refer to as a nonce. The nonce may be a timestamp, a counter, or a random
number; the minimum requirement is that it differs with each request. Also, to prevent
masquerade, it should be difficult for an opponent to guess the nonce. Thus, a random
number is a good choice for a nonce.
72
2. The KDC responds with a message encrypted using Ka Thus, A is the only one who can
successfully read the message, and A knows that it originated at the KDC. The message
includes two items intended for A:
The one-time session key, Ks, to be used for the session
The original request message, including the nonce, to enable A to match thisresponse
with the appropriate request
Thus, A can verify that its original request was not altered before reception by the KDCand,
because of the nonce, that this is not a replay of some previous request.
These last two items are encrypted with Kb (the master key that the KDC shares with B).They
are to be sent to B to establish the connection and prove A's identity.
3. A stores the session key for use in the upcoming session and forwards to B the information
that originated at the KDC for B, namely, E(Kb, [Ks || IDA]). Because this information is
encrypted with Kb, it is protected from eavesdropping. B now knows the session key (Ks),
knows that the other party is A (from IDA), and knows that the information originated at the
KDC (because it is encrypted using Kb).
At this point, a session key has been securely delivered to A and B, and they may begintheir
protected exchange. However, two additional steps are desirable:
4. Using the newly minted session key for encryption, B sends a nonce, N2, to A.
5. Also using Ks, A responds with f(N2), where f is a function that performs some transformation
on N2 (e.g., adding one).
These steps assure B that the original message it received (step 3) was not a replay.
Note that the actual key distribution involves only steps 1 through 3 but that steps 4 and 5, aswell
as 3, perform an authentication function.
73
Major Issues with KDC:
For very large networks, a hierarchy of KDCs can be established. For communication
among entities within the same local domain, the local KDC is responsible for key distribution.
Itwo entities in different domains desire a shared key, then the corresponding local KDCs can
communicate through a (hierarchy of) global KDC(s)
To balance security & effort, a new session key should be used for each new connection-
oriented session. For a connectionless protocol, a new session key is used for a certain fixed
period only or for a certain number of transactions.
An automated key distribution approach provides the flexibility and dynamic characteristics
needed to allow a number of terminal users to access a number of hosts and for the hosts to
exchange data with each other, provided they trust the system to act on theirbehalf.
The use of a key distribution center imposes the requirement that the KDC be trusted and
be protected from subversion. This requirement can be avoided if key distribution is fully
decentralized.
In addition to separating master keys from session keys, may wish to define different types
of session keys on the basis of use.
74
Part - A
1. Assess the following cipher text using brute force attack:
CMTMROOEOORW (Hint: Algorithm-Rail fence).
2. Compare Substitution and Transposition techniques.
3. Give examples for substitution cipher.
4. Compare Block cipher and Stream cipher.
5. Give examples for transposition cipher.
6. Show how to convert the given text “VALLIAMMAI” into cipher text using Rail fence Technique.
7. Plan how many keys are required by two people to communicate via a cipher.
8. Compare Substitution and Transposition techniques.
9. Define Diffusion & Confusion.
10. Specify the design criteria of block cipher.
11. Define Reversible mapping.
12. List the five modes of operation of block cipher.
13. List the function of state array.
14. Differentiate symmetric and asymmetric encryption
15. Show how to convert the given text “CRYPTOGRAPHY” into cipher text using Rail fence Technique
Part – B & C
75
UNIT III: ASYMMETRIC TECHNIQUES
Principles of Public Key Cryptosystems – The RSA Algorithm – Key Management – Diffie
Hellman Key Exchange – Elliptic Curve Cryptography – Prime fields and binary fields,
Applications, Practical considerations. Cryptography in Embedded Hardware.
“Public-Key Cryptography”, shows that a public-key encryption scheme has six ingredients:
• Plaintext: the readable message /data fed into the algorithm as input.
• Encryption algorithm: performs various transformations on the plaintext.
• Public and private keys: a pair of keys selected so that if one is used for encryption, the
other is used for decryption. The exact transformations performed by the algorithm
depend on the public or private key that is provided as input.
• Ciphertext: the scrambled message produced as output. It depends on the plaintext and
the key. For a given message, two different keys will produce two different ciphertexts.
• Decryption algorithm: accepts the ciphertext and matching key and produces the
original plaintext.
Consider the following analogy using padlocked boxes: traditional schemes involve the
sender putting a message in a box and locking it, sending that to the receiver, and
somehow securely also sending them the key to unlock the box. The radical advance in
public key schemes was to turn this around, the receiver sends an unlocked box (their
public key) to the sender, who puts the message in the box and locks it (easy - and
having locked it cannot get at the message), and sends the locked box to the receiver
who can unlock it (also easy), having the (private) key. An attacker would have to pick
the lock on the box (hard).
77
Symmetric vs Public-Key
78
shown here). There is some source A that produces a message in plaintext X The M
elements of X are letters in some finite alphabet. The message is intended for
destination B. B generates a related pair of keys: a public key, PUb, and a private key,
PRb. PRb is known only to B, whereas PUb is publicly available and therefore accessible
by A. With the message X and the encryption key PUb as input, A forms the ciphertext Y
= E(PUb, X) The intended receiver, in possession of the matching private key, is able
to invert the transformation: X = D(PRb, Y) An adversary, observing Y and having access
to PUb, but not having access to PRb or X, must attempt to recover X and/or PRb. This
provides confidentiality. Can also use a public-key encryption to provide authentication: Y
= E(PRa, X); X = D(PUa, Y) To provide both the authentication function and confidentiality
have a double use of the public-key scheme (as shown here): Z = E(PUb, E(PRa, X))
X = D(PUa, D(PRb, Z)) In this case, separate key pairs are used for each of these
purposes. The receiver owns and creates secrecy keys, sender owns and creates
authentication keys.
79
Security of Public Key Schemes
Public key schemes are no more or less secure than private key schemes - in both cases
the size of the key determines the security. As with symmetric encryption, a public-key
encryption scheme is vulnerable to a brute-force attack. The countermeasure is the same:
Use large keys. However, there is a tradeoff to be considered. Public-key systems
depend on the use of some sort of invertible mathematical function. The complexity of
calculating these functions may not scale linearly with the number of bits in the key but
grow more rapidly than that. Thus, the key size must be large enough to make brute-force
attack impractical but small enough for practical encryption and decryption. In practice,
the key sizes that have been proposed do make brute-force attack impractical but
result in encryption/decryption speeds that are too slow for general-purpose use. Instead,
as was mentioned earlier, public-key encryption is currently confined to key management
and signature applications. Another form of attack is to find some way to compute the
private key given the public key. To date, it has not been mathematically proven that
this form of attack is infeasible for a particular public-key algorithm.
Note also that you can't compare key sizes - a 64-bit private key scheme has very roughly
similar security to a 512-bit RSA - both could be broken given sufficient resources. But
with public key schemes at least there is usually a firmer theoretical basis for
determining the security since its based on well-known and well studied number theory
problems.
80
3.2 RSA Algorithm:
RSA is the best known, and by far the most widely used general public key
encryption algorithm, and was first published by Rivest, Shamir & Adleman of MIT in
1978 [RIVE78]. The Rivest-Shamir-Adleman (RSA) scheme has since that time
reigned supreme as the most widely accepted and implemented general-purpose
approach to public-key encryption. It is based on exponentiation in a finite (Galois)
field over integers modulo a prime, using large integers (eg. 1024 bits). Its security is
due to the cost of factoring large numbers.
Introduced by Rivest, Shamir & Adleman of MIT in 1977
RSA algorithm:
The ingredients of RSA algorithm are as follows:
1. p, q, two prime numbers private, chosen
2. n= pq public, calculated
3. e, with gcd (e , ø(n))=1 , Where 1<e<ø(n) public, chosen
81
RSA Encryption and decryption:
Encryption:
Plaintext:M<n Ciphertext:
C=Me mod N
Decryption: Ciphertext: C
RSA example:
1. Select primes: p=17 & q=11 2.
Compute n = pq =17×11=187
3. Compute ø(n)=(p–1)(q-1)=16×10=160
4. Select e : gcd (e,160)=1; choose e=7
5. Determine d:
82
Encryption and decryption:
Both encryption and decryption in RSA involves raising an integer to an integer
power mod n (exponentiation)
Another consideration is the efficiency of exponentiation
RSA makes use of large exponents and fast modular exponentiation algorithm is
used to compute ab mod n.
Variable c is not needed. It is included for explanatory purposes. The value of c is
the value of the exponent.
Key generation:
Users of RSA must:
o Determine two primesat random - p, q
o Select either e or d and compute the other
Primes p , q must not be easily derived from modulus n=p . q
o Means must be sufficiently large
Exponents e, d are inverses, so use inverse algorithm to compute the other
(Extended Euclid‟s algorithm)
RSA security:
Three approaches to attacking RSA:
o Brute force key search (trying all possible private keys)
o Mathematical attacks (factoring the product of two primes)
o Timing attacks (depends on running of decryption algorithm)
Factoring problem:
o Factor n into its two prime factors p and q. Calculate of ø(n) and find d
o Determine ø(n) directly and compute d
o Determine d directly, without first determining ø(n)
Have seen slow improvements over the years
o As of aug-99 best is 512 bit with GNFS
Biggest improvement comes from improved algorithm
o “quadratic sieve” to “generalized number field sieve”
Ensure p, q of similar size and matching other constraints
The threat to larger key sizes is twofold: the continuous increase in
computing power and the continuing refinement of factoring problems.
83
Timing attack:
Attacker can determine a private key by keeping track of how long a computer
takes to decipher the message.
o Attack may be from completely unexpected direction
o And it may be a ciphertext only attack
Exploit timing variations in operations using fast modular exponentiation algorithm
o Eg. Multiplying by small vs large number
If the observed time to execute the decryption algorithm is always slow
when the particular iteration is slow with a bit 1, then it is assumed to be 1.
If the observed time for the entire algorithm is fast, then this bit is assumed to be 0.
Countermeasures:
o Constant exponentiation time ensure that all exponentiations take the
same amount of time before returning a result.
o Randomdelay better performance can be achieved byadding a random
delay to the exponentiation algorithm to confuse the timing attack.
o Blinding multiply the ciphertext by random number before
performing exponentiation. This process prevents the attacker from
knowing what ciphertext bits are being processes inside the computer
and therefore prevents the bit by bit analysis essential to the timing
attack.
84
In the Diffie-Hellman key exchange algorithm, there are two publicly known numbers: a
prime number q and an integer a that is a primitive root of q. The prime q and primitive root a
can be common to all using some instance of the D-H scheme. Note that the primitive root a is a
number whose powers successively generate all the elements mod q. Users Alice and Bob
choose random secrets x's, and then "protect" them using exponentiation to create their public
y's. For an attacker monitoring the exchange of the y's to recover either of the x's, they'd need
to solve the discrete logarithm problem, which is hard.
The actual key exchange for either party consists of raising the others "public key' to
power of their private key. The resulting number (or as much of as is necessary) is used as the
key for a block cipher or other private key scheme. For an attacker to obtain the same value
they need at least one of the secret numbers, which means solving a discrete log, which is
computationally infeasible given large enough numbers. Note that if Alice and Bob subsequently
communicate, they will have the same key as before, unless they choose new public-keys.
The simplest, and original, implementation of the protocol uses the multiplicative group
of integers modulo p, where p is prime and g is primitive root mod p. Here is an
example of the protocol, with non-secret values and secret values:
85
1. Alice and Bob agree to use a prime number p=23 and base g=5.
2. Alice chooses a secret integer a=6, then sends Bob A = ga mod p
o A = 56 mod 23
o A = 15,625 mod 23
o A=8
6. Alice and Bob now share a secret: s = 2. This is because 6*15 is the same as
15*6. So somebody who had known both these private integers might also have
calculated s as follows:
o s = 56*15 mod
23 o s = 515*6
mod 23 o s = 590
mod 23
o s=
807,793,566,946,316,088,741,610,050,849,573,099,185,363,389,551,639,55
6,884,765,625 mod 23
o s =2
Both Alice and Bob have arrived at the same value, because (ga)b and (gb)a are equal
mod p. Note that only a, b and gab = gba mod p are kept secret. All the other values – p,
g, ga mod p, and gb mod p – are sent in the clear. Once Alice and Bob compute the
shared secret they can use it as an encryption key, known only to them, for sending
messages across the same open communications channel. Of course, much larger
values of a, b, and p would be needed to make this example secure, since it is easy to try
all the possible values of gab mod
23. There are only 23 possible integers as the result of mod 23. If p were a prime of at
least 300 digits, and a and b were at least 100 digits long, then even the best
86
algorithms known today could not find a given only g, p, gb mod p and ga mod p, even
using all of mankind's computing power. The problem is known as the discrete
logarithm problem. Note that g need not be large at all, and in practice is usually either
2 or 5.
Both Alice and Bob are now in possession of the group element gab, which can serve
as the shared secret key. The values of (gb)a and (ga)b are the same because groups
are power associative.
87
point at infinity or the zero point. Now, consider the set of points E(a, b) consisting of all of
the points (x, y) that satisfy this equation together with the element O. Using a different
value of the pair (a, b) results in a different set E(a, b). See text for detailed rules of
addition and relation to zero point O. Can derive an algebraic interpretation of addition,
based on computing gradient of tangent and then solving for intersection with curve. There
is also an algebraic description of additions over elliptic curves
88
With the preceding list of rules, it can be shown that the set E(a, b) is an abelian group.
ECC Diffie-Hellman
89
ECC Encryption/Decryption
ECC Security
The security of ECC depends on how difficult it is to determine k given kP and P. This is
referred to as the elliptic curve logarithm problem. The fastest known technique for taking the
elliptic curve logarithm is known as the Pollard rho method. Compared to factoring integers or
polynomials, can use much smaller numbers for equivalent levels of security.
relies on elliptic curve logarithm problem
fastest method is “Pollard rho method”
compared to factoring, can use much smaller key sizes than with RSA etc
for equivalent key lengths computations are roughly equivalent
hence for similar security ECC offers significant computational advantages
90
PART A
1. Give the applications of the public key crypto systems.
2. Explain any one attacking technique in RSA.
3. Discover the Difference between public key and conventional encryption.
4. Illustrate the purpose of Diffie Hellman key exchange.
5. Define the principle elements of a public key crypto system
6. List four general characteristics of schema for the distribution of the public key.
7. Show what requirements must a public key crypto system to fulfil security.
8. Evaluate the formula for encryption and decryption using RSA algorithm
9. Generalize elliptic curve cryptography.
10. Define avalanche effect.
11. Differentiate unconditionally secured and computationally secured
12. What are the design parameters of Feistel cipher network?
13. Give the five modes of operation of Block cipher.
14. State advantages of counter mode.
15. Specify the basic task for defining a security service
16. What is the difference between link and end to end encryption?
17. List the evaluation criteria defined by NIST for AES?
18. List the schemes for the distribution of public keys
19. List out the attacks to RSA.
Part B & C
1. Evaluate using Diffie-Hellman key exchange technique. Users A and B use a common prime q=11
and a primitive root alpha=7.
(i) If user A has private key XA=3.What is A’s public key YA?
(ii)If user B has private key XB=6. What is B’s public key YB?
(iii) What is the shared secret key? Also write the algorithm.
2. Describe RSA Algorithm.
3. Estimate the encryption and decryption values for the RSA algorithm parameters. P=7, Q=11, E=17,
M=8.
4. Summarize the role of discrete log in the Diffie-Hellman key exchange in exchanging the secret key
among two users.
5. What are elliptic curves? Describe how the elliptic curves are useful for Cryptography?
6. Describe the key management of public key encryption in detail.
7. Apply the mathematical foundations of RSA algorithm. Perform encryption decryption for the following
data. P=17, q=7, e=5, n=119, message=”6”. Use Extended Euclid’s algorithm to find the private key.
8. Explain briefly about Diffie-Hellman key exchange algorithm with its pros and cons.
9. Explain public key cryptography and when is it preferred.
10. (i) User A and B use Diffie-Hellman key exchange a common prime q=71 and a primitive root
a=7.Calculate the following. If user A has private key XA=5, what is A’s public key YA. If user A has
private key XB=12, what is B’s public key YB and what is shared secret key?
11. Consider the elliptic curve E11 (1, 6); that is the curve is defined by y2=x3+x+6 with a modules of
P=11. Calculate all the points in E11 (1, 6). Start by calculation the right hand side of the
equation of all the values of n?
91
UNIT IV : MESSAGE AUTHENTICATION
Authentication requirements – Authentication functions – Message
Authentication Codes (MAC) – Hash functions – Security of hash functions and
MACs.
With public-key techniques, can use a digital signature which can only have been
created by key owner to validate the integrity of the message contents. To provide both
confidentiality and authentication, A can encrypt M first using its private key, which
provides the digital signature, and then using B's public key, which provides
confidentiality. The disadvantage of this approach is that the public-key algorithm, which
is complex, must be exercised four times rather than two in each communication.
Message authentication is a procedure to verify that received messages come from the
alleged source and have not been altered. - Also sequencing and timeliness may be
authenticated. Authentication mechanism:
93
Password
Smart cards
Biometrics
Certification
Authentication Requirements
* In the context of communication across a network the following attacks can be identified:
3. Masquerade:
• insertion of messages into the network from a fraudulent source
• fraudulent acknowledgement of a message receipt
4. Content modification
• insertion, deletion, transposition, modification
5. Sequence modification
• modification to a sequence of messages between parties, i-e- reordering
6. Timing modification
• delay or replay of messages
7. Repudiation
• Measures against attacks 1 and 2 involve achieving confidentiality.
• Measures against attacks 3, 4,5 and 6 are generally regarded as
messageauthentication.
• Mechanisms dealing specifically with attack 7 use digital signatures.
Authentication Functions
1. Message encryption: the cipher text of the whole message is the authenticator
2. Message authentication code (MAC): a public function and a secret key produces a fixed
length value that is the authenticator
94
3. Hash function: a public function that maps the message into a fixed-length hash-value
serving as authenticator
95
Authentication functions – MACs
Using a MAC involves the use of a secret key to generate a small fixed size block of data,
a cryptographic checksum.
Communicating parties A and B share a secret key K. The MAC is calculated as MAC
= C,(M). The algorithm C is public.
Note that a MAC does not provide a digital signature because both the sender and
receiver have the same secret key.
Hash function accepts variable size message M as input and produces a fixed-size
hash codeH(M) as output. H(M) is often called a massage digest.
A hash code is a function of all the bits of the message, thus providing an error detection
capability.
* The hash-function itself is not secret; some means is required to protect the hash value.
96
A hash value is generated by a one-way function H of the form
97
3. H(x) is easy to compute for any given x, making hardware and software
implementations possible
4. One-way property: for any given code A, it is computationally infeasible to find x such that
h
=H(x).
5. Weak collision resistance: for any given block x, it is computationally infeasible to
find block y ≠x s.t. H(y) = H(x).
6. Strong collision resistance: it is computationally infeasible to find a pair (x,y) .t. H{x) = H(y).
• The first three properties are needed for practical use of the hash-function
• The fourth property is needed if the authentication technique involves the use of a
secret value S, The opponent could learn $ from S| |M = HC), if he obtains C = H(S|
|M).
• The weak collision property guarantees that an alternative message yielding the
same code cannot be found. This prevents forgery when an encrypted hash code is
used . The strong collision property refers to how resistant the hash function is to a
class of attacks known as the birthday attack.
Just as with symmetric and public-key encryption, we can group attacks on hash
functions and MACs into two categories: brute-force attacks and cryptanalysis.
In recent years, there has been increased interest in developing a MAC derived from a
cryptographic hash function, because they generally execute faster in software than
symmetric block ciphers, and because code for cryptographic hash functions is widely
98
available. A hash function such as SHA was not designed for use as a MAC and cannot
be used directly for that purpose because it does not rely on a secret key. There have
been a number of proposals for the incorporation of a secret key into an existing hash
algorithm, originally by just pre-pending a key to the message. Problems were found
with these earlier, simpler proposals, but they resulted in the development of HMAC.
99
Part - A
Part – B & C
1. Here hash functions are used? What characteristics are needed in secure hash
function?
2. Write about the security of hash functions and MACs.
3. Discuss the classification of authentication function in detail.
4. How Hash function algorithm is designed? Explain their features and properties.
5. Explain in detail message authentication code and its requirements.
6. Illustrate the security of hash functions and MACs.
7. Explain the different approaches to message authentication.
8. Describe about the class of message authentication function.
9. Briefly explain about the Authentication applications with suitable example.
10. Enumerate the properties of Hash Function.
11. Evaluate the authentication protocol and list its limitations.
100
UNIT V: HASH AND DIGITAL SIGNATURES 9
MD5 Message Digest Algorithm – Secure Hash Algorithm (SHA) –RIPMED160 - HMAC
– Digital Signatures – Authentication Protocols – Digital Signature Standard (DSS).
6. For example, if the message is 448 bit long, it is padded by 512 bits to a length of 960
bits.
7. Thus the number of padding bits is in the range 1 to 512.
8. The padding consists of a single 1 bit followed by the necessary number of 0 bits.
2. This field contains the length of the original message mod 264
101
The outcome of the first 2 steps yields a message that is an integer multiple of 512 bits.
1. A 128 bits buffer is used to hold intermediate value and final results of
the hash function.
2. The buffer can be represented as four 32 bit registers (A, B,
C, D) A= 67452301
B = FCDAB89
C= 8BADCFE
D= 0324576
3. The values are stored in little-endian format, which is the least significant byte of a
word in the low address byte position.
Word A= 01 23 45 67
102
Word B= 89 AB CD EF
Word C= FE DC BA 98
Word D= 76 45 32 10
5. T[i] has the value equal to the integer part of 232 * abs(sin(i)), where I is in radians.
6. Output of the fourth round is added to the input to the first round
(CVq) to produce CVq+1
Step 5: Output:
1. After all L512 blocks have been processed, the output from the Lth
stage is the128 bit message digest.
2. We can summarize the behavior of
MD5 as follows: CV0= IV
CVq+1= SUM 32[CVq, RFI, (Yq, RFH (Yq, RFG(Yq, RFF( Yq,
CVq))))] MD= CVL-1
103
Where
SUM32 = addition mod 232 performed separately on each word of the pair of inputs
104
MD5 Compression Function:
Where
X[k] M [q *16 + k] = kth 32 bit word in the qth 512 bit block of the
message. T[i] ith 32 bit word in matrix T
+ Addition modulo 232
105
1. One of the four primitive logical functions is used for each of the four rounds of the
algorithm.
2. Each primitive function takes three 32 bit words as input and produces a 32 bit word
output.
3. Each performs a set of bitwise logical operations.
The logical operators (AND, OR, NOT, XOR) are represented by the symbols.
1. The array of 32 bit words X[0…15] holds the value of the current 512 bit input
block beingprocessed.
2. Within a round, each of the 16 words of X[i] is used exactly once.
3. In the first round, the words are used in their original order. The following
permutations are defined for rounds 2 through 4:
106
STRENGTH OFMD5
107
Secure Hash Algorithm:
The Secure Hash Algorithm was developed by the National Security Agency
(NSA) and given to the National Institute of Standards and Technology (NIST). The
original version, often referred to as SHA or SHA-0, was published in 1993 as a
Federal Information Processing Standard (FTPS 180).SHA contained a weakness that
was later uncovered by the NSA, which led to the a revised standards document
(FIPS 180-t) that was released in 1995. This revised document describes the
improved version, SHA-1, Which is now the hash algorithm recommended by NIST.
SHA-1 Logic:
This algorithm takes as input a maximum length of less than 264 bits.
1. The message is padded so that its length is congruent to 448 mod 512 (length mod 512
= 448)
2. Padding is always added, even if the message is already of the desired length.
3. Thus the number of padding bits is in the range 1 to 512.
4. The padding consists of a single 1 bit followed by the necessary number of 0 bits.
108
Step 3: Initialize MD buffer:
3. Each round takes as input the current 512 bit block being processed (Yq)
and
the 160 bit buffer value ABCDE and updates the contents of the buffer.
STEP NO HEXADECIMAL
0≤ t ≤19 Kt = 5A817999
20 ≤ t ≤ 39 Kt = 6ED9EBA1
40 ≤ t ≤ 59 Kt =8F1BBCDC
60 ≤ t ≤ 79 Kt =CA62C1D6
109
6. Output of the fourth round is added to the input to the first round (CVq) to
produce CVq+1
Step 5: Output:
1. After all L 512 blocks have been processed, the output from the Lth stage is the
160 bit message digest.
2. We can summarize the behavior of SHA-1as
follows: CV0= IV
CVq+1= SUM32[CVq, ABCDEq]
MD= CVL
Where
SUM32 = addition mod 232 performed separately on each word of the pair of inputs
110
SHA-1 Compression Function:
Each round has 20 steps of the form:
A, B, C, D, E = E + f(t, B, C, D) + S5(A) + W t + Kt), A, S30(B), C, D
Where
A, B, C, D, E refers to the 5 words of the
buffer t
step number 0 to 79
111
1. Each primitive function takes three 32 bit words as input and produces a 32 bit word
output.
2. Each function performs a set of bitwise logical operations
1. The 32 bit values of the W t is derived from the 512 bit message
112
2. The first 16 values of W t are taken directly from the 16 words of the current block.
3. The remaining values are defined as follows:
For the remaining 64 steps, the value of W t consists of the circular left shift by one
bit of theXOR of four of the preceding values of W t.
113
HMAC
114
The above Figure illustrates the overall operation of HMAC:
M is the message input to HMAC (including the padding specified in the embedded
hash function)
Note that the XOR with ipad results in flipping one-half of the bits of K. Similarly, the
XOR with opad results in flipping one-half of the bits of K, but a different set of bits. In
115
effect, pseudo randomly generated two keys from K. HMAC should execute in
approximately the same time as the embedded hash function for long messages. HMAC
adds three executions of the hash compression function (for Si, So, and the block
produced from the inner hash). A more efficient implementation is possible by pre
computing the internal hash function on (K+ XOR opad) and (K+ XOR ipad) and
inserting the results into the hash processing at start & end. With this implementation,
only one additional instance of the compression function is added to the
processing normally produced by the hash function. This is especially worthwhile if most
of the messages for which a MAC is computed are short.
HMAC Security
The appeal of HMAC is that its designers have been able to prove an exact relationship
between the strength of the embedded hash function and the strength of HMAC. The
security of a MAC function is generally expressed in terms of the probability of
successful forgery with a given amount of time spent by the forger and a given number
of message-MAC pairs created with the same key. Have two classes of attacks: brute
force attack on key used which has work of order 2^n; or a birthday attack which
requires work of order 2^(n/2) - but which requires the attacker to observe 2^n blocks of
messages using the same key - very unlikely. For a hash code length of 128 bits, this
requires 264 observed blocks (272 bits) generated using the same key. On a 1-Gbps link,
one would need to observe a continuous stream of messages with no change in key for
about 150,000 years in order to succeed. So even MD5 is still secure for use in HMAC
given these constraints.
116
DIGITAL SIGNATURE
“Digital Signature “is the best solution for authenticity in various fields.
The most important development from the work on public-key cryptography is the digital
signature. Message authentication protects two parties who exchange messages from
any third party. However, it does not protect the two parties against each other either
fraudulently creating, or denying creation, of a message. A digital signature is analogous
to the handwritten signature, and provides a set of security capabilities that would be
difficult to implement in any other way. It must have the following properties:
• It must verify the author and the date and time of the signature
117
The above figure is a generic model of the process of making and using digital
signatures. Bob can sign a message using a digital signature generation algorithm. The
inputs to the algorithm are the message and Bob's private key. Any other user, say
Alice, can verify the signature using a verification algorithm, whose inputs are the
message, the signature, and Bob's public key.
Here A denotes the user whose signature is being attacked and C denotes the attacker.
118
• Directed chosen message attack: Similar to the generic attack, except that the list of
messages is chosen after C knows A's public key but before signatures are seen.
• Total break: C determines A's private key. • Universal forgery: C finds an efficient
signing algorithm that provides an equivalent way of constructing signatures on arbitrary
messages.
• Existential forgery: C forges a signature for at least one message. C has no control
over the message. Consequently this forgery may only be a minor nuisance to A.
A variety of approaches has been proposed for the digital signature function. A secure
hash function, embedded in a scheme such as that shown in figure, provides a basis for
satisfying these requirements. However care must be taken in the design of the details
of the scheme. These approaches fall into two categories: direct and arbitrated.
The term direct digital signature refers to a digital signature scheme that involves only
the communicating parties (source, destination). It is assumed that the destination
knows the public key of the source. Direct Digital Signatures involve the direct
application of public-key algorithms involving only the communicating parties. A digital
signature may be formed by encrypting the entire message with the sender’s private
key, or by encrypting a hash code of the message with the sender’s private key.
Confidentiality can be provided by further encrypting the entire message plus signature
using either public or private key schemes. It is important to perform the signature
function first and then an outer confidentiality function, since in case of dispute, some
third party must view the message and its signature. But these approaches are
dependent on the security of the sender’s private-key. Will have problems if it is
lost/stolen and signatures forged. The universally accepted technique for dealing with
these threats is the use of a digital certificate and certificate authorities.
119
DSA is the US Govt approved signature scheme, which is designed to provide strong
signatures without allowing easy use for encryption. The National Institute of Standards
and Technology (NIST) published Federal Information Processing Standard FIPS 186,
known as the Digital Signature Standard (DSS). The DSS makes use of the Secure
Hash Algorithm (SHA) and presents a new digital signature technique, the Digital
Signature Algorithm (DSA). The DSS was originally proposed in 1991 and revised in
1993 in response to public feedback concerning the security of the scheme. There was
a further minor revision in 1996. In 2000, an expanded version of the standard was
issued as FIPS 186-2. This latest version also incorporates digital signature algorithms
based on RSA and on elliptic curve cryptography. In this section, we discuss the original
DSS algorithm. The DSS uses an algorithm that is designed to provide only the digital
signature function. Unlike RSA, it cannot be used for encryption or key exchange.
Nevertheless, it is a public-key technique.
The above figure contrasts the DSS approach for generating digital signatures to that
used with RSA. In the RSA approach, the message to be signed is input to a hash
function that produces a secure hash code of fixed length. This hash code is then
120
encrypted using the sender's private key to form the signature. Both the message and
the signature are then transmitted. The recipient takes the message and produces a
hash code. The recipient also decrypts the signature using the sender's public key. If
the calculated hash code matches the decrypted signature, the signature is accepted as
valid. Because only the sender knows the private key, only the sender could have
produced a valid signature. The DSS approach also makes use of a hash function. The
hash code is provided as input to a signature function along with a random number k
generated for this particular signature. The signature function also depends on the
sender's private key (PR a) and a set of parameters known to a group of communicating
principals. We can consider this set to constitute a global public key (PUG). The result is
a signature consisting of two components, labeled s and r. At the receiving end, the
hash code of the incoming message is generated. This plus the signature is input to a
verification function. The verification function also depends on the global public key as
well as the sender's public key (PUa), which is paired with the sender's private key. The
output of the verification function is a value that is equal to the signature component r if
the signature is valid. The signature function is such that only the sender, with
knowledge of the private key, could have produced the valid signature.
The DSA is based on the difficulty of computing discrete logarithms and is based on
schemes originally presented by ElGamal and Schnorr. The DSA signature scheme has
advantages, being both smaller (320 vs 1024bit) and faster (much of the computation
is done modulo a 160 bit number), over RSA. Unlike RSA, it cannot be used for
encryption or key exchange. Nevertheless, it is a public-key technique
DSA typically uses a common set of global parameters (p,q,g) for a community of
clients, as shown. A 160-bit prime number q is chosen. Next, a prime number p is
selected with a length between 512 and 1024 bits such that q divides (p – 1). Finally, g
is chosen to be of the form h(p– 1)/q
mod p where h is an integer between 1 and (p – 1)
with the restriction that g must be greater than 1. Thus, the global public key
components of DSA have the same for as in the Schnorr signature scheme.
121
Then each DSA uses chooses a random private key x, and computes their public key as
shown. The calculation of the public key y given x is relatively straightforward.
However, given the public key y, it is computationally infeasible to determine x, which
is the discrete logarithm of y to base g, mod p.
To create a signature, a user calculates two quantities, r and s, that are functions of the
public key components (p,q,g), the user’s private key (x), the hash code of the message
H(M), and an additional integer k that should be generated randomly or pseudo-
randomly and be unique for each signing. This is similar to ElGamal signatures, with the
use of a per message temporary signature key k, but doing calculations first mod p, then
mod q to reduce the size of the result. The signature (r,s) is then sent with the message
to the recipient. Note that computing r only involves calculation mod p and does not
depend on message, hence can be done in advance. Similarly with randomly choosing
k’s and computing their inverses.
At the receiving end, verification is performed using the formulas shown. The receiver
generates a quantity v that is a function of the public key components, the sender’s
public key, and the hash of the incoming message. If this quantity matches the r
component of the signature, then the signature is validated. Note that the difficulty of
computing discrete logs is why it is infeasible for an opponent to recover k from r, or x
from s. Note also that nearly all the calculations are mod q, and hence are much faster
save for the last step.
The structure of this function is such that the receiver can recover r using the incoming
message and signature, the public key of the user, and the global public key. It is
certainly not obvious that such a scheme would work. A proof is provided in Stallings
appendix K.
122
The above figure depicts the functions of signing and verifying. The structure of the
algorithm, as revealed here is quite interesting. Note that the test at the end is on the
value r, which does not depend on the message at all. Instead, r is a function of k and
the three global public-key components. The multiplicative inverse of k (mod q) is
passed to a function that also has as inputs the message hash code and the user's
private key. The structure of this function is such that the receiver can recover r using
the incoming message and signature, the public key of the user, and the global public
key.
To sign a message M, user A first computes the hash m = H(M), such that m is an
integer in the range 0 <= m <= q – 1. A then forms a digital signature as shown.
The basic idea with El Gamal signatures is to again choose a temporary random signing
key, protect it, then use it solve the specified equation on the hash of the message to
create the signature (in 2 pieces). Verification consists of confirming the validation
equation that relates the signature to the (hash of the) message (see text for proof).
Again note that El Gamal encryption involves 1 modulo exponentiation and
multiplications (vs 1 exponentiation for RSA).
Here is an example of creating and verifying an ElGamal signature from the text using
the prime field GF(19); that is, q = 19. It has primitive roots {2, 3, 10, 13, 14, 15}, We
choose a = 10. Alice generates a key pair as shown, which is = {19, 10, 4}. Alice can
sign a message with hash m =
14 as shown to compute the signature pair (3,4). Any user B can verify the signature by
computing confirming the validation equation as shown.
124
Schnorr signature scheme
The first part of this scheme is the generation of a private/public key pair, which consists
of the following steps:
3. Choose a random integer s with 0 < s < q. This is the user's private key.
A user with public key s and private key v generates a signature as follows:
1. Choose a random integer r with 0 < r < q and compute x = ar mod p. This is
independent of any message M, hence can be pre-computed.
3. Compute y = (r + se) mod q. The signature consists of the pair (e, y).
125
Any other user can verify the signature as follows:
126
PART-A
PART-B & C
1. Explain Hash Functions.
2. How Hash function algorithm ids designed? Explain their features and properties.
3. Explain Digital Signature Standard?
4. Explain the steps in DSA
5. What are digital signatures? Explain DSA algorithm to generate the same.
6. Explain the concepts of Digital signature algorithm with key generation and verification in detail.
7. Explain the steps in MD5 message digest algorithm.
8. How MD5 method provide security to the system? Explain with suitable diagram.
9. With a neat diagram, explain the MD5 processing of a single 512 bit block.
10. Explain the secure hash algorithm to generate message digest in detail.
11. Illustrate about the SHA algorithm and explain
12. Compare the performance of RIPEMD-160 algorithm and SHA-1 algorithm.
13. Write notes on birthday attack .
14. Explain ElGamal Public key Cryptosystems with an example.
15. Write down the steps involved in
i) Elgamal Digital Signature Scheme
ii) Schnorr Digital Signature Schemeusedfor authenticating a person.
127