0% found this document useful (0 votes)

33 views44 pages

25 Years of Cryptographic Hardware Design: City University of Istanbul & University of California Santa Barbara

The document summarizes 25 years of advances in cryptographic hardware design for implementing public-key algorithms like RSA and Diffie-Hellman. It describes the initial naive algorithms from 1978-1985, followed by Peter Montgomery's breakthrough Montgomery multiplication algorithm in 1985 which significantly improved efficiency by replacing costly divisions with additions. It then discusses subsequent optimizations like advanced Karatsuba algorithms, various Montgomery multiplication methods, and arithmetic techniques for finite fields that further improved hardware performance.

Uploaded by

ALEX SAGAR

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PS, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

33 views44 pages

25 Years of Cryptographic Hardware Design: City University of Istanbul & University of California Santa Barbara

Uploaded by

ALEX SAGAR

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PS, PDF, TXT or read online on Scribd

You are on page 1/ 44

25 Years of Cryptographic Hardware Design

C
etin Kaya Koc
City University of Istanbul &
University of California Santa Barbara
[email protected]
http://cryptocode.net
[email protected]

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

25 Years of Cryptographic Hardware Design

1975-1977: Invention of Public-Key Cryptography
Diffie-Hellman & RSA Algorithms
Publication Dates: Nov 1976 & Feb 1978
First hardware implementation:
R. L. Rivest. A Description of a Single-Chip Implementation of the RSA
Cipher. Lambda, vol. 1, pages 14-18, 1980.
In 1984, I was a graduate student at UCSBs ECE Department
My interest started with Rivests hardware paper

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Essential Milestones
This talk gives a brief summary of advanced algorithms for creating better
hardware realizations of public-key cryptographic algorithms: DiffieHellman, RSA, elliptic curve cryptography
Essential milestones:

Naive algorithms, 1978-1985

Montgomery algorithm, 1985
Advanced Karatsuba algorithms, 1994
Advanced Montgomery algorithms, 1996
Montgomery algorithm in GF (2k ), 1998
Unified arithmetic, 2002
Spectral arithmetic, 2006

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

RSA Computation
The RSA algorithm uses modular exponentiation for encryption
C := M e

(mod n)

and decryption
M : Cd

(mod n)

The computation of M e mod n is performed using exponentiation

heuristics
Modular exponentiation requires implementation of three basic modular
arithmetic operations: addition, subtraction, and multiplication

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Diffie-Hellman Computation
Similarly, the Diffie-Hellman key exchange algorithm executes the steps
RA := g a

(mod p)

:= g b

(mod p)

b
:= RA
= g ab

(mod p)

a
RA
:= RB
= g ba

(mod p)

between two parties, Alice & Bob

These computations are also modular exponentiations, requiring modular
addition, subtraction, and multiplication operations

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

NIST Digital Signature Algorithm

The signature computation on M and k is the pair (r, s)
r := (g k mod p) mod q
s := (M + xr)k 1 mod q
The signature verification
w := s1 mod q
u1 := M w mod q
u2 := rw mod q
v := (g u1 y u2 mod p) mod q
Check if r

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

v
5

Ellliptic Curve Cryptography

Elliptic curves defined over GF (p) or GF (2k ) are used in cryptography
The arithmetic of GF (p) is the usual mod p arithmetic
The arithmetic of GF (2k ) is similar to that of GF (p), however, there
are some differences
Elliptic curves over GF (2k ) are more popular due to the space and
time-efficient algorithms for doing arithmetic in GF (2k )
Elliptic curve cryptosystems based on discrete logarithms seem to provide
similar amount of security to that of RSA, but with relatively shorter key
sizes

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Computations of Cryptographic Functions

It is interesting to note that all public-key cryptographic algorithms are
based on number-theoretic and algebraic finite structures, such as groups,
rings, and fields
In fact, most of them need modular arithmetic, i.e., the arithmetic of
integers in finite rings or fields
The challenge is however that the sizes of operands are large, starting
from about 160 bits up to 16,000 bits
Therefore, the algorithmic development of cryptographic hardware design
is essentially based on (exact) computer arithmetic with very large
integers
Since exponentiations & multiplications are most time/energy/space
consuming computations, we will only study those in our talk

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Computing Exponentiations
Given the integer e, the computation of M e or eP is an exponentiation
operation
The objective is to use as few multiplications (or elliptic curve additions)
as possible for a given integer e
This problem is related to addition chains
An addition chain yields an algorithm for computing M e or eP given the
integer e
M 1 M 2 M 3 M 5 M 10 M 11 M 22 M 44 M 55
P 2P 3P 5P 10P 11P 22P 44P 55P

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Computing Exponentiations
Finding the shortest addition chain is an NP-complete problem
Lower bound: log2 e + log2 H(e) 2.13 (Sch
onhage)
Upper bound: log2 e + H(e) 1, where H(e) is the Hamming weight
of e (the binary method, the SX method, Knuth)
It turns out the oldest known algorithm for computing exponentiation is
not too far in efficiency to the best algorithm
Heuristics, m-ary, adaptive m-ary, sliding windows, power tree methods
offer only slight improvements

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Computing Modular Multiplication - Naive Algorithms

Given a, b < n, compute P = a b mod n
Multiply and reduce:
Multiply: P = a b (2k-bit number)
Reduce: P = P mod n (k-bit number)
Reductions are essentially integer divisions
However, multiply and reduce steps can be interleaved, but offering only
slight improvements

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Interleaved Multiply & Reduce - Naive Algorithms

P = a b = a

k1
X

bi2i =

i=0

k1
X

(a bi)2i

i=0

= 2( 2(2(0 + a bk1) + a bk2) + ) + a b0

1.
2.
2a.
2b.
3.

P := 0
for i = k 1 downto 0
P := 2P + a bi
P := P mod n
return P

Unfortunately, Step 2b is highly time consuming (a full division for every

bit of the operands)

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Montgomery Multiplication - 1985

Attempts to create good hardware to compute the RSA functions (sign,
verify, encrypt, decrypt) in acceptable time have essentially failed because
of the excessive requirements of the naive algorithms
This includes Rivests hardware proposal and all other implementations
until the Montgomery multiplication algorithm came about
Peter Montgomery discovered a method to replace Step 2b with a step
similar to Step 2a: an addition instead of a division
It is brilliant and efficient
Montgomerys algorithm changed cryptographic design in a way very
much like the FFT algorithm changed the digital signal processing

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Montgomery Multiplication
Montgomerys method maps the integers {0, 1, 2, . . . , n 1} to the same
set with the map x
= x r (mod n) using the integer r = 2k
It then works in this set (numbers with the bar sign) and performs the
multiplication
MonPro(
a, b) = a
b 2k

(mod n)

The above operation turns out to be significantly simpler than the

standard modular multiplication a b (mod n) because the division by
n in Step 2b (reduction) is avoided
Transformation to and back from the bar domain is also quite easily
done, i.e., x
= MonPro(x, r 2) and x = MonPro(
x, 1)

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Montgomery Multiplication
In order to compute u = MonPro(a, b) = a b 2k
the steps below

(mod n), we use

1. u := 0
2. for i = 0 to k 1
2a.
u := u + ai b
2b.
if u0 is 1 then u := u + n
3.
u := u/2
Now, Step 2b is only an addition!
And, it is is done about half of the time!
We remain in the Montgomery (bar) domain of integers until the final
step of the exponentiation, and then use the conversion routine to go
back to the no bar domain

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Karatsuba-Ofman Multiplication
Algorithms Textbooks offer a few asymptotically faster multiplication
algorithms: Karatsuba-Ofman, Toom-Cook, Winograd, and DFT-based
algorithms
These algorithms are all good: they help you to multiply faster
But, they are no help in modular multiplication, i.e., they do not
multiply-and-reduce (Montgomerys method is special)
They also have large overhead, and start being faster only after a few
thousand bits
However, there has been significant algorithmic developments to bring
down their break-even point to a few hundred bits

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Advanced Montgomery Multiplication

On the other hand, Montgomery algorithms also improved
They can be made fit into specific archiectures, by changing the way
they scan the bits of the multiplicand, the multiplier, and the product
Separated Operand Scanning (SOS): First computes t = a b and then
interleaves the computations of m = t n mod r and u = (t + m n)/r.
Squaring can be optimized.
SOS requires 2s + 2 words of space
Finely Integrated Product Scanning (FIPS): Interleaves computation of
a b and m n by scanning the words of m
It uses the same space to keep m and u, reducing the temporary space
to s + 3 words

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Advanced Montgomery Multiplication

Finely Integrated Operand Scanning (FIOS): The computation of a b
and m n is performed in a single loop
FIOS also requires s + 3 words of space
Coarsely Integrated Hybrid Scanning (CIHS): The computation of a b is
split into 2 loops, and the second loop is interleaved with the computation
of m n
CIHS also requires s + 3 words of space
Coarsely Integrated Operand Scanning(CIOS): Improves the SOS method
by integrating the multiplication and reduction steps. It alternates
between iterations of the outer loops for multiplication and reduction
CIOS also requires s + 3 words of space

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Advanced Montgomery Multiplication

All methods require 2s2 + s multiplications
Add, Read/Write and Space requirements are below
SOS
FIPS
FIOS
CIHS
CIOS

Add

Read/Write

Space

4s2 +4s+2

8s2 +13s+5

2s+2

6s2 +2s+2

14s2 +16s+3

s+3

5s2 +3s+2

10s2 +9s+3

s+3

4s2 +4s+2

9.5s2+11.5s+3

s+3

4s2 +4s+2

8s2 +12s+3

s+3

Depending on the availability of functional units (multipliers, adders,

registers), one method can outperform another and thus should be
selected

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Montgomery Multiplication in GF (2k )

It turns out that the Montgomery multiplication can also be performed
in the finite field GF (2k ) if the polynomial basis representations of the
field elements are employed
It imitates the the Montgomery multiplication in GF (p) by taking
the modulus the irreducible polynomial p(x) generating the field of 2k
elements
It is not as fast as the normal basis, but it has some advantages

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Montgomery Multiplication in GF (2k )

In order to compute
u(x) = MonPro(a(x), b(x)) = a(x) b(x) xk mod p(x) ,
we use the steps below
1. u(x) := 0
2. for i = 0 to k 1
2a.
u(x) := u(x) + ai b(x) mod 2
2b.
if u0 is 1 then u(x) := u(x) + p(x) mod 2
3.
u := u/2
Now Steps 2a and 2b use mod 2 additions (XOR gates)

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Unified Arithmetic
One advantage of the Montgomery multiplication in GF (2k ) is that a
single arithmetic unit can be used to handle both kinds of fields: GF (p)
and GF (2k ): This is called unified arithmetic (or, dual-field arithmetic)
Advantages of the unified arithmetic are low manufacturing cost,
compatibility, parallelism, and scalability
Furthermore, unified arithmetic is impartial: it does not favor one prime
against another or one irreducible polynomial against another
The building block of the unified architecture is the unified full adder: a
1-bit adder that handles both GF (p) and GF (2k )

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Unified Full Adder

a
b
c
FSEL

UnivAdder

S
Cout

a
b
FSEL

Cout

(a) Universal Adder

(b) Synthesized circuit by Mentor

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Scalability
Scalability is an important concept: it allows to make small changes
in the hardware to handle larger operands without a complete redesign
(such as switching from 1024-bit RSA keys to 1536-bit RSA keys)

PE 1

PE 2

PE 3

PE k

Buffer

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Dependency Graph of Montgomery Multiplication

(0)

, p

(0)

(1)

, p

(0)
c
b

(2)

(0)

, p

(2)

, p

(0)
(1)

b
b

(3)

(1)

, p

(3)

, p

a
(0)

(2)

c
b

(4)

(0)

(4)

, p

(2)

(0)

, p

(2)

, p

(0)

b
p

(e+1)

,
(e+1)

(e)

(e-1)

c
b
p

(e+1)

(e)

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Pipelined Montgomery Multiplication

An example of pipeline computation for 7 bit operands

where w=1

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Pipelined Architecture with Fewer Units

Pipeline stalls when fewer

processing
i units
i are available
il bl
m=7, w=1, k=3

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

General Pipelined Architecture

Reg-a
k

k
a

k
i+1

i+t-1

(j)

t
c

Reg-c

Reg-p

Reg-b

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

PUt

Spectral Arithmetic
We use FFT-based arithmetic to implement modular multiplication
However, we are interested in performing the reduction inside the spectral
(frequency) domain
We utilize finite ring and field arithmetic (avoid real or complex arithmetic
because of the roundoff errors in using floating-point or fixed-point
arithmetic)
We also want to bring down the break-even point of efficiency for
FFT-based multiplication
Furthermore, we utilize the properties of the DFT and Montgomery
algorithm to perform modular multiplication

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Spectral Arithmetic

Convolution

DFT

Modular
Multiplication

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Modular
Reduction

DFT Inverse

DFT over a Finite Ring: Definition

Let be a primitive d-th root of unity in Zq and, let x(t) and X(t) be
polynomials of degree d 1 having entries in Zq . The DFT map over Zq is
an invertib le set map sending x(t) to X(t) given by
Xi = DF Td (x(t)) :=

d1
X

xj ij mod q,

j=0

with the inverse

xi = IDF Td (X(t)) := d1

d1
X

Xj ij mod q,

j=0

for i, j = 0, 1, . . . , d 1.

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

DFT over a Finite Ring: Existence

We write
x(t)

DFT

X(t)

and say x(t) and X(t) are transform pairs; x(t) is called a time polynomial
and sometimes X(t) is named as the spectrum of x(t).
(Convention) In the literature, DFT over a finite ring spectrum is also
called as Number Theoretical Transform (NTT)
(Existence) In order to have a DFT map over Zq :
The multiplicative inverse of DFT length d must exist in Zq which
requires that gcd(d, q) = 1.
d has to divide p 1 for every prime p divisor of q

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

DFT over a Finite Ring: Efficiency

In order to have simple arithmetic
q should be chosen as
a Mersenne number q = 2v 1, or
a Fermat number q = 2v + 1
The principal root of unity should be selected as a power of 2 to
simplify the multiplications with roots of unity

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Properties of DFT
Under certain conditions, the Fourier transform preserves some properties
of the time sequences, e.g., linearity and convolution.
The existence conditions of these properties differ when working in finite
ring spectrums
Let and be operations on time and spectral domains respectively.
We write
DFT

and say and are transform pairs on x(t) and sometimes declare that
the map DF Td respects the operation on point x(t) if following
equation is satisfied
(x(t)) = IDF Td DF Td (x(t))

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Time-Frequency Dictionary
Time and frequency shifts correspond to circular shifts Let
x(t) = x0 + x1t + . . . + xd1td1
and
X(t) = X0 + X1t + . . . + Xd1td1
be a transform pair.
The one-term right circular shift is defined as x(t) 1
x1 + x2t + . . . + xd2td1 + x0td1
l DFT
X(t) (t)
where stands for component-wise multiplication and
(t) = 1 + 1t + . . . + (d1) td1

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Time-Frequency Dictionary
Sum of sequence and first value: The sum of the coefficients of a time
polynomial equals to the zeroth coefficient of its spectral polynomial.
Conversely the sum of the spectrum coefficients equals to d1 times the
zeroth coefficient of the time polynomial

x0 = d1

d1
X

Xi i

and X0 =

d1
X

xi i

i=0

sum equals to X0
(x0, x1, , xd-1)

DFT

(X0, X1, , Xd-1)

sum multiplied by d-1

equals to x0

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Time-Frequency Dictionary
Left and right logical shifts: By using the previous properties, it is
possible to perform logical left and right digit shifts x(t) 1 as follows:
(x(t) x0)/t = x1 + . . . + xd1td2
l DFT
(X(t) x0 (t)) (t)
where
x0(t) = x0 + x0t + x0 t2 + . . . + x0td1
The right shifts are similar, where one then uses the
(t) = 1 + 1t + . . . + (d1)td1
polynomial instead of (t)

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

A Time Simulation for Spectral Modular Multiplication

We would like to compute 8592 49 (mod 1337).
Signal x(t) representing 859 = x(4) in base 4.
35
30
25
20
15
10
5

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

A Time Simulation for SMP

Convolving x(t) with itself, we find x2(t) = 8592 = 737881.
25

20
15
10

12
10

9
7

5
0
1

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

A Time Simulation for SMP

The modulus m = 1337 is represented as m = 1 + 2t + 3t2 + t4 + t5.We
add 3m to the sum to anhilate the least significant b bits of the least digit.
30
26
25
20
15

19
17

12
10

9
7

5
0
0
1

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

A Time Simulation for SMP

Carry goes to the next digit.
30
26
25
21
19

15
10

9
7

3
0
1

Carry added from the

eliminated coefficient

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

A Time Simulation for SMP

We then shift the digits.
30
26
25
21
19

15
10

9
7

5
0
1

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

A Time Simulation for SMP

After 9 iterations, we find the result: 914 8592 49

(mod 1337).

35
30
25
20
15
10

0
1

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Unending Quest for Efficiency

Conclusions?
Challenges remain: Make faster but low-area and low-energy hardware
for cryptography
Platforms are diverse: Huge SSL and IPSec boxes versus tiny Bluetooth
earphones, cellphones and PDAs
New challenges: We need to build countermeasures in order to circumvent
attacks by adversaries to obtain hardware-hidden secrets
Questions?
Email: [email protected]

EN EL CINVESTAV
25 ANOS
DE LA COMPUTACION

Design of A High-Performance Iterative Barrett Modular Multiplier For Crypto Systems
No ratings yet
Design of A High-Performance Iterative Barrett Modular Multiplier For Crypto Systems
14 pages
06 Rsa
No ratings yet
06 Rsa
104 pages
Lecture 12
No ratings yet
Lecture 12
55 pages
Class 11 Maths CH - Sets Notes
50% (2)
Class 11 Maths CH - Sets Notes
17 pages
Comp3355 L4a (PKC)
No ratings yet
Comp3355 L4a (PKC)
53 pages
Course2 - Week4 Adder Subtractor
No ratings yet
Course2 - Week4 Adder Subtractor
76 pages
RNS in Cryptography
No ratings yet
RNS in Cryptography
85 pages
Math For Programmers PDF
100% (1)
Math For Programmers PDF
127 pages
Cryptography Techniques Unit 1
No ratings yet
Cryptography Techniques Unit 1
44 pages
TO Cryptography: Stanoyevitch
No ratings yet
TO Cryptography: Stanoyevitch
7 pages
Exploring The Design Space For FPGA Base
No ratings yet
Exploring The Design Space For FPGA Base
9 pages
614722802 (2)
No ratings yet
614722802 (2)
7 pages
06 Quantum Algorithmic Foundations Slides
No ratings yet
06 Quantum Algorithmic Foundations Slides
30 pages
Masters Project: Efficient Encryption On Limited Devices
No ratings yet
Masters Project: Efficient Encryption On Limited Devices
37 pages
Mca-Cup-0 5 9 PDF
No ratings yet
Mca-Cup-0 5 9 PDF
239 pages
Final Project Report
No ratings yet
Final Project Report
44 pages
Public-Key Cryptography RSA Attacks Against RSA: Système Et Sécurité
No ratings yet
Public-Key Cryptography RSA Attacks Against RSA: Système Et Sécurité
37 pages
CO1508 - Week 05
No ratings yet
CO1508 - Week 05
28 pages
6515 Transcripts DC1
No ratings yet
6515 Transcripts DC1
23 pages
Design A Scalable RSA and ECC Crypto-Processor
No ratings yet
Design A Scalable RSA and ECC Crypto-Processor
4 pages
By Dr. Rana Aljanabi
No ratings yet
By Dr. Rana Aljanabi
31 pages
Parsonson S.L. Pure Mathematics (Volumes 1 & 2)
92% (13)
Parsonson S.L. Pure Mathematics (Volumes 1 & 2)
719 pages
0862 Lower Secondary Mathematics Stage 7 Scheme of Work - tcm143-595643
100% (5)
0862 Lower Secondary Mathematics Stage 7 Scheme of Work - tcm143-595643
101 pages
Embedment of Montgomery Algorithm On Elliptic Curve Cryptography Over RSA Public Key Cryptography
No ratings yet
Embedment of Montgomery Algorithm On Elliptic Curve Cryptography Over RSA Public Key Cryptography
7 pages
DICD Fall 2024 Lecture 09 Arithmetic Circuits
No ratings yet
DICD Fall 2024 Lecture 09 Arithmetic Circuits
52 pages
Hardware RSA Accelerator: Group 3: Ariel Anders, Timur Balbekov, Neil Forrester May 15, 2013
No ratings yet
Hardware RSA Accelerator: Group 3: Ariel Anders, Timur Balbekov, Neil Forrester May 15, 2013
15 pages
Nwe3-Main Ref 7 PDF
No ratings yet
Nwe3-Main Ref 7 PDF
12 pages
IEEE Pappers Cryto
No ratings yet
IEEE Pappers Cryto
3 pages
9th MathsEM Final (3 To 303)
No ratings yet
9th MathsEM Final (3 To 303)
301 pages
Crypto3 4
No ratings yet
Crypto3 4
43 pages
Ecs726p Week03 P
No ratings yet
Ecs726p Week03 P
63 pages
IT3030E CA Chap4 Arithmetics
No ratings yet
IT3030E CA Chap4 Arithmetics
64 pages
Cracking Des and Rsa Enc
No ratings yet
Cracking Des and Rsa Enc
39 pages
Fundamentals of Mathematics
No ratings yet
Fundamentals of Mathematics
5 pages
Paper 3
No ratings yet
Paper 3
10 pages
IKV 2 Main
No ratings yet
IKV 2 Main
97 pages
Chapter 4 PDF
No ratings yet
Chapter 4 PDF
49 pages
Asynchronous vs. Synchronous Design of RSA: Abstract
No ratings yet
Asynchronous vs. Synchronous Design of RSA: Abstract
6 pages
CBSE XI Text Books
No ratings yet
CBSE XI Text Books
471 pages
Lec 6
No ratings yet
Lec 6
38 pages
Electronic Codebook Book (ECB) : C Des (P)
No ratings yet
Electronic Codebook Book (ECB) : C Des (P)
43 pages
Cryptography and Network Security
No ratings yet
Cryptography and Network Security
27 pages
RSA
No ratings yet
RSA
8 pages
Computer Organisation and Architecture:Multiplier Design
No ratings yet
Computer Organisation and Architecture:Multiplier Design
6 pages
Basic Algorithms in Number Theory
No ratings yet
Basic Algorithms in Number Theory
44 pages
Design and Implementation of VLSI Systems (EN1600) : Lecture 30: Array Subsystems (DRAM/ROM)
No ratings yet
Design and Implementation of VLSI Systems (EN1600) : Lecture 30: Array Subsystems (DRAM/ROM)
16 pages
File Encryption and Decryption System Based On RSA Algorithm
No ratings yet
File Encryption and Decryption System Based On RSA Algorithm
4 pages
CENG413 - Lec05
No ratings yet
CENG413 - Lec05
20 pages
Floating Point Multipliers: Simulation & Synthesis Using VHDL
No ratings yet
Floating Point Multipliers: Simulation & Synthesis Using VHDL
40 pages
Number Theory
No ratings yet
Number Theory
29 pages
15-853:algorithms in The Real World: Cryptography 3 and 4
No ratings yet
15-853:algorithms in The Real World: Cryptography 3 and 4
43 pages
Hardware Complexity of Modular Multiplication and Exponentiation
No ratings yet
Hardware Complexity of Modular Multiplication and Exponentiation
12 pages
Math 322 Notes 2 Introduction To Functions
No ratings yet
Math 322 Notes 2 Introduction To Functions
7 pages
Lect 13
No ratings yet
Lect 13
41 pages
Math7 - Q1 - Week 9
No ratings yet
Math7 - Q1 - Week 9
11 pages
9 Computer Arithmetics
No ratings yet
9 Computer Arithmetics
47 pages
MATHS Home Work Class VII
No ratings yet
MATHS Home Work Class VII
4 pages
Resonance Kota Class 7 Mathematics Book
100% (2)
Resonance Kota Class 7 Mathematics Book
368 pages
Modern Computer Arithmetic: Richard P. Brent and Paul Zimmermann Version 0.5.9 of 7 October 2010
No ratings yet
Modern Computer Arithmetic: Richard P. Brent and Paul Zimmermann Version 0.5.9 of 7 October 2010
25 pages
Fast Architectures For FPGA-Based Implementation Encryption Algorithm
No ratings yet
Fast Architectures For FPGA-Based Implementation Encryption Algorithm
8 pages
China Western Mathematical Olympiad 2010 63
No ratings yet
China Western Mathematical Olympiad 2010 63
2 pages
Montgomery Modular Multiplier Architecture
No ratings yet
Montgomery Modular Multiplier Architecture
28 pages
The Chinese Remainder Theorem and Its Application in A High-Speed RSA Crypto Chip
No ratings yet
The Chinese Remainder Theorem and Its Application in A High-Speed RSA Crypto Chip
10 pages
Maths Practice Question Bank (MetaBrain Quiz)
No ratings yet
Maths Practice Question Bank (MetaBrain Quiz)
14 pages
Math 7 Unit Test Quarter 2
No ratings yet
Math 7 Unit Test Quarter 2
2 pages
An Embedded Processor For Encryption and Decryption: of of
No ratings yet
An Embedded Processor For Encryption and Decryption: of of
4 pages
FPGA Implementation of RSA Encryption System: Sushanta Kumar Sahu Manoranjan Pradhan
No ratings yet
FPGA Implementation of RSA Encryption System: Sushanta Kumar Sahu Manoranjan Pradhan
3 pages
CryptoAnalysin Security of Differential Attacks & Propagation
No ratings yet
CryptoAnalysin Security of Differential Attacks & Propagation
8 pages
Byju Maths Tips
100% (2)
Byju Maths Tips
12 pages
Mathematics 7 Q2 W8
No ratings yet
Mathematics 7 Q2 W8
15 pages
Whole Numbers Booklet 1
No ratings yet
Whole Numbers Booklet 1
31 pages
Digital Logic Design: P.V.P.Siddhartha Institute of Technology (Autonomous), I B.Tech. Syllabus Under PVP14 Regulations
No ratings yet
Digital Logic Design: P.V.P.Siddhartha Institute of Technology (Autonomous), I B.Tech. Syllabus Under PVP14 Regulations
3 pages
Scaling of MOS Circuits
No ratings yet
Scaling of MOS Circuits
59 pages
Edc PPT1
No ratings yet
Edc PPT1
109 pages
STD Ix Project Work
No ratings yet
STD Ix Project Work
5 pages
Number System
No ratings yet
Number System
4 pages
Mic 07
No ratings yet
Mic 07
14 pages
GRP 21
No ratings yet
GRP 21
18 pages
First Quarter Examination MATH 7
No ratings yet
First Quarter Examination MATH 7
3 pages
NCERT Solutions For Class 7 Maths Chapter 9
No ratings yet
NCERT Solutions For Class 7 Maths Chapter 9
15 pages
A Childrens Guide To Python Programming
100% (1)
A Childrens Guide To Python Programming
10 pages
Harvard Algorithm Course Notes
No ratings yet
Harvard Algorithm Course Notes
6 pages
L12 - Machine Minimization
No ratings yet
L12 - Machine Minimization
41 pages
Design of Circular Apertures For Narrow Beamwidth and Low Sidelobes-fvQ
No ratings yet
Design of Circular Apertures For Narrow Beamwidth and Low Sidelobes-fvQ
6 pages
EE 434 ASIC & Digital Systems: Dae Hyun Kim Eecs Washington State University Spring 2018
No ratings yet
EE 434 ASIC & Digital Systems: Dae Hyun Kim Eecs Washington State University Spring 2018
12 pages
Tartar 9
No ratings yet
Tartar 9
3 pages
Pre-Algebra Final Review
No ratings yet
Pre-Algebra Final Review
15 pages
2022 Fermat Contest: The Centre For Education in Mathematics and Computing Cemc - Uwaterloo.ca
No ratings yet
2022 Fermat Contest: The Centre For Education in Mathematics and Computing Cemc - Uwaterloo.ca
306 pages
Complex Number Paper 3
No ratings yet
Complex Number Paper 3
4 pages
Chapter One (Multiplier)
No ratings yet
Chapter One (Multiplier)
8 pages
Dual-Field Multiplier Architecture For Cryptographic Applications
No ratings yet
Dual-Field Multiplier Architecture For Cryptographic Applications
5 pages
Design and Implementation of VLSI Systems (EN0160) : Lecture 28: Datapath Subsystems 4/4
No ratings yet
Design and Implementation of VLSI Systems (EN0160) : Lecture 28: Datapath Subsystems 4/4
9 pages
Datapath Subsystems
No ratings yet
Datapath Subsystems
9 pages
Numeracy Matrix g10
No ratings yet
Numeracy Matrix g10
1 page
3121 SMK Fomra Institute of Technology
No ratings yet
3121 SMK Fomra Institute of Technology
2 pages
Subtitle
No ratings yet
Subtitle
3 pages
Subtitle
No ratings yet
Subtitle
4 pages
Sequential Circuit Analysis
No ratings yet
Sequential Circuit Analysis
15 pages
Section1 3
No ratings yet
Section1 3
2 pages
Validation Rule
No ratings yet
Validation Rule
1 page
Mathematical Functions
From Everand
Mathematical Functions
Oliver Linton
No ratings yet
Hidden Line Removal: Unveiling the Invisible: Secrets of Computer Vision
From Everand
Hidden Line Removal: Unveiling the Invisible: Secrets of Computer Vision
Fouad Sabry
No ratings yet

25 Years of Cryptographic Hardware Design: City University of Istanbul & University of California Santa Barbara

Uploaded by

25 Years of Cryptographic Hardware Design: City University of Istanbul & University of California Santa Barbara

Uploaded by

25 Years of Cryptographic Hardware Design

25 Years of Cryptographic Hardware Design

Naive algorithms, 1978-1985

The computation of M e mod n is performed using exponentiation

between two parties, Alice & Bob

NIST Digital Signature Algorithm

Ellliptic Curve Cryptography

Computations of Cryptographic Functions

Computing Modular Multiplication - Naive Algorithms

Interleaved Multiply & Reduce - Naive Algorithms

= 2( 2(2(0 + a bk1) + a bk2) + ) + a b0

Unfortunately, Step 2b is highly time consuming (a full division for every

Montgomery Multiplication - 1985

The above operation turns out to be significantly simpler than the

(mod n), we use

Advanced Montgomery Multiplication

Advanced Montgomery Multiplication

Advanced Montgomery Multiplication

Depending on the availability of functional units (multipliers, adders,

Montgomery Multiplication in GF (2k )

Montgomery Multiplication in GF (2k )

Unified Full Adder

(a) Universal Adder

(b) Synthesized circuit by Mentor

Dependency Graph of Montgomery Multiplication

Pipelined Montgomery Multiplication

An example of pipeline computation for 7 bit operands

Pipelined Architecture with Fewer Units

Pipeline stalls when fewer

General Pipelined Architecture

DFT over a Finite Ring: Definition

with the inverse

DFT over a Finite Ring: Existence

DFT over a Finite Ring: Efficiency

(X0, X1, , Xd-1)

sum multiplied by d-1

A Time Simulation for Spectral Modular Multiplication

A Time Simulation for SMP

A Time Simulation for SMP

A Time Simulation for SMP

Carry added from the

A Time Simulation for SMP

A Time Simulation for SMP

Unending Quest for Efficiency

You might also like