
Introduction to Scientific Computing, Part I

C. David Sherrill
School of Chemistry and Biochemistry
Georgia Institute of Technology
Outline

• Requirements of scientific computing

• Some definitions

• Computer architectures

• Benchmarking

• Benchmarks for Quantum Chemistry


21st Century Computing:

• Very complex programs (100s or 1000s of developers)

• Graphical — ease of use critical

• Not much math

Examples: graphical operating systems (Windows); word processors; spreadsheets; large databases; graphics programs (Photoshop); Web browsers; games
Scientific Computing:

• Complex programs (10^6 lines, perhaps 1-30 developers)

• No time to develop graphical interface

• Much math — floating point very important! “Computationally expensive.”

The needs of scientific computing can be vastly different from those of a user-friendly graphical program. Java makes great applets but is horribly slow for computations.
Some definitions

Bandwidth: The rate at which data can flow.

Bus: A physical data pathway connecting, e.g., the CPU to a graphics card or other device.

Cache: A small storage area for frequently accessed data which provides faster data access.

CPU: Central Processing Unit, or “processor”; the brain of the computer, which does most of the computational work.

Disk: A magnetic storage device, typically a hard disk drive, used to store data which won’t fit in memory. Much slower access than memory.

Floating point: Data representing a real number, or operations (such as multiplication) on such data. Longer and costlier than the integer equivalents.

Instruction: An elementary, low-level command that the CPU understands. Each CPU has an “instruction set” that it can interpret.

Integer: Data representing an integer, or operations (such as multiplication) on such data.

Kernel: The core of the operating system. Linux is actually an OS kernel; much of the support software comes from the GNU project (Free Software Foundation).

Memory: Typically refers to random access memory (RAM) (or “physical memory”). “Virtual memory” simulates additional RAM by using disk space.

OS: Operating system. The kernel plus essential support software (e.g., file utility programs, system management tools, graphical user interface).

Register: One of the handful of memory locations on the CPU itself.

Swap space: (Also paging space.) Disk space used to store data which won’t fit in physical memory, i.e., virtual memory.

Virtual memory: Maps logical memory addresses to physical memory addresses. Program memory is divided into pages, some of which may actually reside on disk.

Word: A fixed number of bytes appropriate
for a given data type. Often 4 bytes (32
bits) for an integer and 8 bytes (64 bits)
for a floating point number.
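
As a quick check of these sizes, here is a minimal sketch (mine, not from the slides) using the Fortran STORAGE_SIZE intrinsic (Fortran 2008), which reports the storage size of a variable in bits:

      PROGRAM WORDS
      INTEGER I
      REAL*8 X
      PRINT *, 'integer bits:', STORAGE_SIZE(I)   ! typically 32
      PRINT *, 'real*8 bits: ', STORAGE_SIZE(X)   ! 64
      END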
RISC vs CISC
CISC (Complex Instruction Set Computers): Older machines, many current and most older PCs. Advantages: a program requires fewer instructions (fits in less memory, requires fewer memory fetches). Disadvantages: compilers can’t figure out how to take advantage of the complex instructions (including Pentium MMX and SSE instructions!).
RISC (Reduced Instruction Set Computers): Modern workstations (e.g., IBM, Compaq), beginning in the mid-1980s. PCs have been moving this way since the 1990s (IBM/Motorola/Apple PowerPC). With more memory now and faster access due to caches and pipelines, the advantages of CISC are not as great. Easier to pipeline due to the smaller set of instructions.
Pipelines
Instructions typically take more than one clock cycle to execute.
In pipelining, after launching one instruction, you immediately
launch another on the next clock tick, without waiting for
the first one to complete. Possible for CISC, easier for RISC
(uniform instruction length, simple addressing modes).
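
As a rough sketch of what this means for loops (my illustration, not from the slides): a loop whose iterations are independent lets the pipeline launch a new operation every clock tick, while a loop that carries a dependency from one iteration to the next must wait for each result:

      PROGRAM PIPE
      REAL*8 A(1000), B(1000), C(1000), S
      DO 5 I = 1, 1000
         B(I) = DBLE(I)
         C(I) = 2.0D0
5     CONTINUE
C     Independent iterations: the multiply for I+1 can be
C     launched before the multiply for I has completed.
      DO 10 I = 1, 1000
         A(I) = B(I) * C(I)
10    CONTINUE
C     Dependent iterations: each add needs the previous S,
C     so the pipeline stalls waiting for it.
      S = 0.0D0
      DO 20 I = 1, 1000
         S = S + A(I)
20    CONTINUE
      PRINT *, S
      END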
Parallel RISC

Superscalar: Multiple pipelines (e.g., floating point and integer pipelines). Depends on the compiler to give a good mix of instructions and on a branch processor to oversee their scheduling. Examples: IBM RS/6000, DEC Alpha, even Intel Pentium.

Superpipeline: Break up the stages of an instruction into smaller, simpler, faster stages, making the pipeline go faster.

Long Instruction Word: Like superscalar, but depends entirely on the compiler to make the instruction stream parallel; puts RISC floating point and integer operations together into one big instruction.
Memory and Caches
Advances in memory technology have not kept up with advances
in processor speed. Hence, accessing memory substantially
delays computations. Caches store frequently accessed data in
a small, expensive, high-speed memory area. When we fetch a
new memory element, that element and the memory around it
are transferred to a cache line of the cache.
Direct Mapped Cache: If the computer has a 32K cache, then in the direct mapped scheme, memory location 0 is mapped to cache location 0, as are memory locations 32K, 64K, 96K, etc. Problem: if we alternate between data elements whose addresses are a multiple of 32K apart, they map to the same cache location and we’ll never find the next item in the cache! (Cache miss; cache thrashing.)
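
In other words, the cache location is simply the memory address modulo the cache size; a minimal sketch of the arithmetic (mine, not from the slides):

      PROGRAM DMAP
      INTEGER CSIZE, A1, A2
      CSIZE = 32768                  ! 32K direct mapped cache
      A1 = 0
      A2 = 2*CSIZE                   ! memory location 64K
      PRINT *, MOD(A1, CSIZE)        ! cache location 0
      PRINT *, MOD(A2, CSIZE)        ! also 0: the two collide
      END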
Fully Associative Cache: Each cache line can map to any
memory location. Performs well, very expensive.
Set Associative Cache: Two or four (or more) direct mapped
caches side by side; less likely to miss if we hop back and forth
between two areas in memory.
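
The loop below (from the original slide) streams through two arrays in lockstep. Presumably the point is cache behavior: if A(I) and B(I) happen to map to the same cache line in a direct mapped cache, each iteration evicts the line the other needs, producing a miss on every access.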
      REAL*4 A(1024), B(1024)
C     Each array occupies 4 KB (1024 x 4-byte reals); the loop
C     touches A(I) and B(I) together on every iteration.
      DO 10 I=1,1024
         A(I) = A(I) * B(I)
10    CONTINUE
      END
Memory Pages
In the virtual memory scheme, memory is divided into pages.
Each program addresses its memory as a block from 0 to N,
even though this may be distributed nonsequentially in physical
memory. A page table translates virtual memory locations to
physical memory locations. Simple for the program, bad for
performance.
A translation lookaside buffer (TLB) is a special cache for page
tables, speeding up virtual to physical translation.
A TLB miss results when the translation is not in the TLB; a page fault results when the requested page is not in the list of valid pages (page invalid or on disk). On a fault, the TLB is refreshed and the new page is created or loaded from disk (swapped in). (A cache miss is distinct: it is handled by the memory hardware and does not involve the page tables.)
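
A minimal sketch of the translation arithmetic, assuming a 4 KB page size (an assumption; the slides do not give one). The page number is the high bits of the virtual address and must be translated through the page table (or TLB); the offset within the page passes through unchanged:

      PROGRAM PAGES
      INTEGER*8 ADDR, PGNUM, OFFSET
      ADDR = 74565                   ! virtual address (0x12345)
      PGNUM = ADDR / 4096            ! page number: looked up in the page table
      OFFSET = MOD(ADDR, 4096)       ! offset within the 4 KB page
      PRINT *, 'page', PGNUM, 'offset', OFFSET
      END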
Benchmarking

User time: The time spent by the CPU on the user’s computation.

System time: The time spent by the system in tasks required by the computation; typically I/O time.

Wall time: The actual time elapsed by a “clock on the wall.” This is the most relevant time and the most useful benchmark, assuming the machine is not busy with other things. (A timing sketch follows this list.)

Standard Benchmarks: e.g., the SPEC benchmarks.

User Benchmarks: Benchmarks of your own codes; the most relevant.
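
A minimal timing sketch (mine, not from the slides) using the standard Fortran intrinsics CPU_TIME (processor time, roughly user plus system time) and SYSTEM_CLOCK (wall time). At the shell, the Unix time command reports all three directly:

      PROGRAM TIMING
      REAL*8 A(100000), T1, T2
      INTEGER*8 C1, C2, RATE
      CALL CPU_TIME(T1)              ! processor time, start
      CALL SYSTEM_CLOCK(C1, RATE)    ! wall-clock ticks, start
      DO 10 I = 1, 100000
         A(I) = DBLE(I) * 1.000001D0 ! placeholder workload
10    CONTINUE
      CALL CPU_TIME(T2)
      CALL SYSTEM_CLOCK(C2)
      PRINT *, 'CPU time (s): ', T2 - T1
      PRINT *, 'Wall time (s):', DBLE(C2 - C1) / DBLE(RATE)
      END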
Suggested Reading
“High Performance Computing,” Kevin Dowd (O’Reilly, Sebastopol, CA, 1993).

“C++ and C Efficiency,” David Spuler (Prentice Hall, New York, 1992).
