Chapter 1-Introduction to Programming I
Chapter 1-Introduction to Programming I
CHAPTER ONE
INTRODUCTION TO PROGRAMMING
Programming Language
A language is a systematic set of rules for communicating ideas. With a natural language, like English, the
communication is between people and the language is used in both spoken and written. A programming
language is a language for communication between a person and computer. The content of communication
is known as a program. Programs are set of instructions, which enable a computer to perform a required
operation.
Several hundred programming languages are in use today, each varying in their specific syntax, but
showing similarities in purpose. Input/output instructions, arithmetic instructions, logic instructions,
control instructions, data movement instructions and specification instructions are such common purpose
instructions. Together with the improvement of hardware and processing speed and memory size increase
computer languages changed to languages that were easier for humans to understand. So generations of
programming languages are developed to correspond with the generations of computer hardware.
Programming languages fit into five generations: First generation languages, second-generation languages,
third generation languages, fourth-generation languages and fifth-generation languages. These generations
of programming languages can also be categorized into two broad categories: -low and high-level
languages. Low-level languages are machine-dependent; that is, they are designed to be run on a particular
computer. In contrast, high-level languages (for example, COBOL and BASIC) are machine-independent
and can be run on a variety of computers.
The first two generations were low-level and the rest high-level of programming languages. The higher-
level languages do not provide us with greater programming capabilities, but they do provide a more
sophisticated program/computer interaction with simple instructions. In short, the higher the level of the
language, the easier it is to understand and use. For example, in a fourth-generation language you need
only instruct the computer system what to do, not necessarily how to do it.
When programming in one of the first three generations of languages, you have to tell the computer what to
do and how to do it. What comprises a new generation is less clear; therefore, languages after the fourth
generation are referred to as very high-level languages.
With each new level, fewer instructions are needed to tell the computer to perform a particular task. A
program written in a second-generation language that computes the total sales for each sales representative,
and then lists those over quota, may require 100 or more instructions; the same program in a fourth-
generation language may have fewer than 10 instructions.
The ease with which the later generations can be used is certainly appealing, but the earlier languages also
have their advantages. All generations of languages are in use today.
First generation (Machine languages, 1940’s): Difficult to program in; they are dependent on machine
languages of the type of computer being used. However, machine language allows the programmer to
interact directly with the hardware, and it can be executed by the computer without the need for a
translator.
Machine language is the ultimate low-level language. To communicate with the first generation computer
programmers had to write programs in machine language: - the 0’s and 1’s of binary code. Programming
in 0s and 1s needed reducing statements such as add, subtract and divide into a series of 0’s and 1’s.
Programs written in this way are machine-dependent. There is no universal machine language as different
hardware designs dictate different data and memory locations, and hence different bit strings. Programmer
must remember exact memory addresses used to store data and instructions. They execute very quickly and
use memory very efficiently, as no conversion from source code is necessary. Writing the programs is
difficult, tedious, time consuming and hard to debug. A high level of understanding of hardware specifics
is required. All programs written using other languages are translated into machine language before they
can be executed.
End users who wanted applications have to work with specialized programmers who could understand,
think and work directly in the machine language of a particular computer. So programming in machine is
slow and labor intensive process.
Second generation (Assembly languages, appeared in the early 1950’s): Use symbolic names for
operations and storage locations. Assembly language is easier to use than machine language but it is still
difficult to understand. A systems program called an assembler translates it into machine language.
Different computer architectures have their own machine and assembly languages, which means that
programs written in these languages are not portable to other, incompatible systems. Many programmers
still use assembly languages because they still give them close control over the hardware.
Instead of using 0’s and 1’s programmers substituted language like acronyms and
words such as add, sub(subtract) in programming statements.
Costly in terms of programmer time
Difficult to read and debug, and even difficult to learn
Highly used in system software development
Programs are easier to create and to debug than machine language, but are still prone to errors and
tedious and execute faster than high-level languages.
Third generation (High-level languages, emerged from 1950’s - 1970’s): Use English-like instructions
like print and mathematicians were able to define variables with statements such as Z=A+B. Many high-
level languages are portable. Each has syntax rules that must be followed. Such languages are much easier
to use than assembly language but they can take considerable time to learn. Two types of systems
programs exist for translating them into machine language: interpreters and compilers.
In High-level languages, instructions closely resemble human language and mathematical notation.
Programmers do not require detailed knowledge of hardware specifics as these languages extensively use
symbolic representations of memory addresses and library functions. The use of common words (reserved
words) within instructions makes them easier to learn. Assembly language code segments may be added to
most high-level programs, where faster execution speed is required by the programmer. There are two
types of language-translator programs used to translate into machine code, Interpreters and Compilers.
Fourth-generation (Since late 70’s): Have simple, English-like syntax rules; commonly used to
access databases. As we mentioned earlier, the third-generation programming languages discussed
in the previous section are all procedural languages because the programmer must list each step
and must use logical control structures to indicate the order in which instructions are to be executed.
Fourth-generation languages (4GLs), on the other hand, are nonprocedural languages. These
languages can be compared to the way in which you might instruct someone to cook a meal. The
nonprocedural method is simply to state the needed output: fix a meal of chicken, rice, and salad.
Using the procedural method, on the other hand, involves specifying each step - from preparing the
shopping list to washing the dishes.
Obviously, the nonprocedural method is easier to write, but you have less control over how each task is
actually performed. For example, the dishes might be washed by hand or in a dishwasher. When using
nonprocedural languages, the methods used and the order in which each task is carried out are left to the
language itself; the user does not have any control over it. In addition, 4GLs sacrifice computer efficiency
in order to make programs easier to write; hence they require more computer power and processing time.
As the power and speed of hardware have increased and its cost has decreased, the use of 4GLs has spread.
Because fourth-generation languages have a minimum number of syntax rules, people who have not been
trained as programmers can use such languages to write application programs, as they need them. This
saves time and fees professional programmers for more complex tasks. There are several categories of
4GLs languages. The most common once are: query languages, report generators, and application
generators and graphic languages.
Query languages allow the user to retrieve information from databases by following simple syntax rules.
For example, you might ask the database to locate all customer accounts that are more than 90 days
overdue. Example of query languages is SQL, which has become a de facto standard.
Report generators produce customized reports using data stored in a database. The user specifies the data
to be in the report, how the report should be formatted, and whether any subtotals and totals are needed.
For example, you might ask the system to create a list, arranged by account number, of all the company
customers located in Addis Ababa.
The user can access the database when using query languages and report generators, but the database
cannot be altered with these tools. With application generators, however, the user writes programs to allow
data to be entered into the database. The program prompts the user to enter the needed data. It also checks
the data for validity.
Application generators contain preprogrammed modules that can generate entire applications, greatly
seeding development. A user can specify what needs to be done, and the application generator will create
the appropriate code for input, validation, update, processing, and reporting.
Graphic languages that retrieve data from files or databases and display them in graphic format. Users
can ask for data and specify how they are to be charted. Some graphic software can perform arithmetic or
logic operations on data as well. SAS, Harvard Graphics, and Lotus Freelance Graphics are popular
graphics tools.
Fifth-generation (1990’s): Used in artificial intelligence and expert systems; also used for accessing
databases. Fifth-generation languages (5GLs) are also nonprocedural languages and are also commonly
used to query databases. Because these languages are still in their infancy, only a few are currently
commercially available. They are closely tied to artificial intelligence and expert systems.
Fifth-generation languages are the “natural” languages whose instructions closely resemble human speech
"Get me Debela’s sales figures for the 1997 financial year" is a typical instruction. Very powerful
hardware and software are required to execute such programs because of the complexity involved in
interpreting commands entered in human language. This means computers can in the future have the ability
to think for themselves and draw their own inferences using programmed information in large databases.
Complex processes like understanding speech would appear to be trivial using these fast inferences and
would make the software seem highly intelligent.
True problem situations, either real or anticipated, that require corrective action
Unexploited Opportunities to improve a situation despite the absence of complaints
Directives to change a situation regardless of whether anyone has complained about the
current situation
Problem solving is the process of transforming the description of a problem into the solution of that
problem by using our knowledge of the problem domain and by relying on our ability to select and use
appropriate problem-solving strategies, techniques, and tools.
Problem Solving: is basic intellectual process that has been refined and systemized for the various
challenges people face.
Top down design: is a systematic approach based on the concept that the structure of the problem should
determine the structure of the solution and what should be done in lower level. This approach will try to
disintegrate a larger problem into more smaller and manageable problems to narrow the problem domain.
Bottom up design: is the reverse process where the lowest level component are built first and the system
builds up from the bottom until the whole process is finally completed.
Software Engineering
"Software Engineering" has been used to describe "building of software systems which are so large or so
complex that they are built by a team or teams of engineers".
Detail system specifications for the functions identified in the analysis: managerial,
organizational and technological components of the system solution (input, output, user
interface, database design, processing, manual procedures, controls: input, processes, output
and procedural controls; security, documentation, training, organizational change)
Implementation
During implementation, we translate (code) each step of the algorithm in the design specifications
into a statement in the particular language selected, and end up with a computer program.
Maintenance
Changes in hardware, software, documentation, or procedures to correct errors, meet new requirements, or
improve processing efficiency.
– Fix bugs found by the customer
– Make modifications
– Add new features
Documentation: Written descriptions of, specifications, development, and actual code of program.
Documentation is valuable because:
1. It is probable that you may come back to this program sometime in the future in order to use it again,
or you may want to use part of it in developing the solution for another similar problem.
2. If you have developed this program for a client's use, he or she will need some information so that in
your absence the program can be used.
3. If you are working for a company and have developed this program as an addition to the company's
software library, some other programmer may be assigned to maintain it or to make additions to it.
4. It is possible that sometime after you have developed the program, you may discover some errors in it
and you may be required to correct them.
The significance of proper program documentation in the software life cycle cannot be overemphasized.
Program documentation consists of
Algorithm development
An algorithm is procedure for solving a problem in terms of
the action to execute (what to do) and
the order in which these actions are executed(done)
An algorithm needs to be
Precise and unambiguous (no ambiguity in any instruction and in the order of execution)
Simple
Correct
Finite (has to have an end)
Produce expected output
Efficient: in time, memory and other resources
An algorithm can be expressed in many ways. Some of these methods are narrative, flowchart and
pseudo-code:
Narrative: often used to narrate the algorithm, can be understood by any user who may not have any
knowledge of computer programming. Too wordy, too unambiguous and can be interpreted in different
ways.
Example: Accept salary of the employee. Calculate bonus as 10% of salary and add it to salary. Accept
service year of employee. If the service year is greater than 10, give additional 100 birr as bonus. Display
the bonus of the employee.
Flowcharts: a diagram consisting of labeled symbols, together with arrows connecting one symbols to
another.
On-page Connector: used to connect two points without drawing a flow line
Example: A certain company plan to give a 10% bonus to each of its employees at the end of every year.
If an employee has been working 10/more years at the company, she/he is to get an additional birr 100.
Draw a flowchart of an algorithm to calculate and print the bonus for a given employee.
Solution:
Analysis: the problem is to compute bonus for employee. To do so, the salary and number of service years
of the employee must be known. Let the salary be denoted by SAL, service be denoted by YEAR and
bonus by BONUS. To compute the bonus, we multiply the salary by 10% and assign the result to BONUS.
Thus, BONUS = SAL * 0.10. If the employee has served at least 10 years, birr 100 is added to the
computed bonus of the employee and the result is displayed. Otherwise the originally computed bonus is
displayed.
START
READ
SAL,
YEAR
Bonus= SAL*.1
YEAR>=10 Bonus=
Bonus+100
Yes
No dsfadfdaffgsgfsgfg
WRITE gfffffffbbtjryjBonu
BONUS
STOP
Start
Read NAME,
BALANCE, RATE
Interest=BALANCE *RATE
Display NAME,
INTEREST
Stop
3) A program that identifies a larger and smaller number from two numbers
Start
Read A, B
Yes Big=B
A<B?
Small=A
No
Big=A
Small=B
Stop
Yes
MARK>80 ? GRADE-A
No
Yes GRADE-B
MARK>60 ?
No
Yes GRADE-C
MARK>50 ?
No
Yes GRADE-D
MARK>40 ?
No
GRADE-F
Increment Count
No
Count>10 ?
yes
Display NAME, GRADE Stop
Pseudo-code:
It is much similar to real code. We use verbs to write pseudo-code. Capitalize important words that
show actions.
Eg 1) A pseudo-code to calculate interest rate
ACCEPT Name, Principal, Rate
Interest=Principal X Rate
DISPLAY Name, Interest
2) A pseudo-code to calculate bonus
ACCEPT Name, Salary
Bonus=SalaryX0.1
ACCEPT Serviceyear
IF Serviceyear>10 Then
Bonus+100
ENDIF
DISPLAY Bonus
3) A pseudo-code that calculates grade
ACCEPT Mark, Name
IF Mark>80 Then
GradeA
ELSE IF Mark>=70 Then
GradeB
ELSE IF Mark>=60 Then
GradeC
ELSE IF Mark>=50 Then
GradeD
ELSE
GradeF
ENDIF
DISPLAY Grade, Name
Most algorithms incorporate three programming conventions
Sequence
Decision and
Repetition
Sequence
STOCK=STOCK+ QUANTITY
Decision
IF HOURS>40
THEN
PERFORM OVERTIME
ELSE
PERFORM REGULAR
ENDIF
Repetition
Worksheet 1