Chapter 1 Updated
Chapter 1 Updated
CHAPTER ONE
INTRODUCTION OF STATISTICS
Definition of Statistics
Statistics is the science of data. It enables the collection, organization, presentation, analysis, and
interpretation of numerical data.
Dr. A.L. Bowely defined “Statistics are numerical statement of facts in any department of enquiry placed
in relation to each other”.
R.A. Fisher defined, “The science of statistics is essentially a branch of applied mathematics and may be
regarded as mathematics applied to observational data”.
According to Croxton and Cowden, “Statistics may be defined as the science of collection, presentation,
analysis and interpretation of numerical data”.
1. Descriptive Statistics
2. Inferential Statistics.
Descriptive Statistics
Inferential Statistics.
Inferential statistics involves methods of using information from a sample to draw conclusions
about the population.
1
Manju, Associate professor, CSE, IIUC
Functions of statistics:
Limitations of Statistics:
1. Statistics deals with aggregate of items and not with individual item or measurement.
2. Statistics deals only with quantitative characteristics.
3. Statistical laws hold good only for the averages.
4. It plays only an auxiliary rule in summarizing a fact.
5. Statistics can be misused.
Characteristics of statistics:
1. Statistics should deal with aggregate of individual rather than with individual alone.
2
Manju, Associate professor, CSE, IIUC
Importance of statistics:
1. Statistics of wealth and manpower are important for development and planning.
3. Statistics helps the planner to estimate the revenue income and expenditure of the country.
Statistics is defined as the science of collecting, organizing, and interpreting numerical facts, which we call
data. It is very important for a student of computer sciences. As Computer sciences also deals with
organization and interpretation of numerical facts. In fact most of the principles of computer sciences are
based on concepts of statistics.
The computer can process large amount of data quickly and accurately. For proc essing the large
amount of data some of the important statistical packages that have been used are – SPSS, SAS,
STATA, S-plus and MINITAB .
Population with example:
The totality of all elements under the study or discussion is called population. The “population” in
statistics includes all members of a defined group that we are studying or collecting information.
Example: If we measure the heights and weights of a group of person then it is called population.
Finite population: A Population is called finite population if its elements are countable. Example:
Number of students in a university.
Infinite population: A population is called infinite population if its elements are not countable.
3
Manju, Associate professor, CSE, IIUC
A representative part of population is called sample Or a part of the population is called sample.
Example: If we measure the height and weight of IIUC student, then that of the CSE/EEE/ETE department
student are sample.
# The totality of all elements under the study or A part of the population is called sample.
discussion is called population.
# A population includes each element from the set A sample consists only of observations drawn
of observations that can be made. from the population.
Any phenomenon which varies from individual to individual is called variable. Variables are represented
by symbols (e.g., x, y, or z).
For example Age, weight, height, sex, income and expenses, country of birth, capital expenditure, class
grades, eye colour and vehicle type are examples of variables.
4
Manju, Associate professor, CSE, IIUC
Qualitative variable:
A variable which cannot be expressed as numerically is called qualitative (categorical) variable.
Examples: Hair color, gender, field of study, college attended, political affiliation, status of disease
infection, sex, business type, eye colour, religion and brand.
Quantitative variable:
Examples: Height, age, crop yield, GPA, salary, temperature, area, air pollution index (measured in parts
per million), etc.
Discrete variable:
A variable is called discrete variable if it can take only isolated (Whole) values. Examples of discrete
variables include the number of registered cars, number of business locations, and number of children in a
family, all of which measured as whole units (i.e. ,1, 2, 3 cars).
Continuous variable:
A variable is called continuous variable if it can take any values between certain limits. i.e. Observations
can take any value between a certain set of real numbers. Examples of continuous variables include
height, time, age, and temperature.
N.B: Every continuous variable are discrete but every discrete variable are not continuous.
5
Manju, Associate professor, CSE, IIUC
Skin colour, hair colour, religion, gender, merit, GPA, age, weight, height, income, temperature
education, character are the example of etc are the example of quantitative variable.
qualitative variable.
Constant:
Variable Constant
A variable is always subject to change. A constant will not change, ever.
It is denoted by X,Y,Z,U,V It is denoted by a, b ,c or d
Variable are qualitative and quantitative Constant has no such classification
age, weight, height, salary etc are variable Total number of days in a week, Number of
finger in a hand or leg are constant
Statistical Data
1. Primary data;
2. Secondary data;
Primary data: Data collected by the investigator himself/ herself for a specific purpose.
Example: Data collected by different govt., public, private organizations, research bodies, research
scholars, NGO’s for their official records and research purpose from the field directly are primary data.
Secondary data: When an investigator uses the data which has already been collected by others, such data
are called secondary data.
This data can be obtained from journals, report, Internet, Books/ Magazines, Newspapers, Office statistics
the government statistics service, the office of national statistics, centre for applied social surveys.
6
Manju, Associate professor, CSE, IIUC
1. Through interview
2. Through questionnaire
3. Through schedule
4. Through local agent.
5. Through observations
6. Through experimentation.
Primary Data
2. Suitability of the primary data will be positive because it has been systematically collected.
Secondary Data
1. Secondary data lacks originality. The investigator makes use of the data collected by other agencies.
Questionnaire:
In any survey the information are collected according to some predetermined question. A set of question
for any survey constitutes the questionnaire.
Or, A set of printed or written questions with a choice of answers, devised for the purposes of a survey or
statistical study.
7
Manju, Associate professor, CSE, IIUC
SAMPLE QUESTIONNAIRE:
18. What is ( or was ) your mother’s occupation: 19. Your father and mother monthly
(a) Housewife (b) Business (c) Service (d) teacher income:………………..
and (e) Others.
20. Economic status of your family:(a) Poor, (b) 21.Behind Choosing IIUC: (a) Islamic (b) Tuition
lower middle class (c) middle class, (d) higher middle (c) Safety and distance (d) Faculty (e)Good
class, (e)rich private university(f) Scholarship (g) others.
8
Manju, Associate professor, CSE, IIUC
22. How you got information about IIUC: 23.While attending university, I live: (a) In a hall
(a) Advertisement, (b) Faculty (c) (b) At home/ Family (c) Mess (d) Relative/friends
friends/relatives, (d) internet (e) others
24. Approximately, what was your grade average in 25. Do you discuss your grades with your
your final year of IIUC:…………..out of 4.00. guardian? (a) Yes (b)No
26.Compared to my friends at university, I am, on an 27. How often did you miss classes: (a) Always
academic basis, performing:(a) Better (b) Same (b) Sometimes (C) Never (d) Once every week?
(c)Worse
28. Usually I study: (a) Library (b) Class (c) my 29. Generally, I study…………….hours daily.
room (d) If other please Specify:………….
30.Are you satisfied in IIUC academic system: 31. Do you seem that tuition fee is higher in
(a) yes (b) No. IIUC? (a) yes (b) No.
32. What's your dream Career? (a) Subject related
job(b) Teacher (c) Banker (d) service (e) others.
THANK-YOU
Types of Questionnaire:
1. Unstructured question.
2. Structured question.
Unstructured question: Unstructured question are open-ended, that respondents can answer in their own
words.
Structured question: A question in which the respondents is given specific limited alternative responses
and ask to choose one to his/her own view point.
1. Classification
2. Tabulation
3. Frequency distribution
Classification: Classification is the first statistical technique to condense the raw data. It is the process of
arranging data in different groups or classes according to their affinities.
Tabulation: Tabulation is a logical and systematic arrangement of statistical data in raws and columns.
Frequency:
The number of times that a given value occurs into each group / class is known as frequency.
For example, if four students have a score of 80 in mathematics, and then the score of 80 is said to have a
frequency of 4. The frequency of a data value is often represented by f.
9
Manju, Associate professor, CSE, IIUC
Frequency distribution
1. Class limits/Range: The class limits are the lowest and highest values that can be included in the class.
4. Class frequency/Tally: The number of observations corresponding to the particular class is known as
class frequency of that class.
6. Cumulative frequency: The number of observation for less then a certain upper limit of the class, if
cumulation is done from top of the table.
33, 41, 21, 25, 36, 38, 35, 36, 35, 37, 42, 30, 35, 37, 36, 38, 30, 54, 40, 48, 15, 28, 51, 42, 25, 41, 30, 27,
42, 36, 28, 26, 37, 54, 44, 31, 36, 40, 36, 22, 30, 31, 19, 48, 16, 42, 32, 21, 22, 40, 43, 42, 39, 38, 37, 33,
49, 47, 46, 48.
Assignment-2: Monthly income (in Lac Tk.) of 30 firms in a certain area is given below:
20,19,10,13, 8,15,9,16,18,8,16,17,12,11,10,19,18,17,14,15,12,13,14,12,12.5,18.5,20,15,18,16.
9,20,19,10,12,15,9,16,18,8,16,17,12,11,10,19,18,17,14,15,12,13,14,12,12,18,20,15,18,15.
Example 6
The number of calls from motorists per day for roadside service was recorded for the month of December
2003. The results were as follows:
10
Manju, Associate professor, CSE, IIUC
In addition to presentation of statistical data through tabular form, one can present the same through some
visual aids refer to graphs and diagrams.
1. Histogram;
2. Bar diagram;
3. Frequency polygon
4. Pie diagram;
5. Scatter diagram;
6. Line diagram;
7. Ogive;
8. Steam and leaf plot.
9. Box plot
Histogram:
*A histogram consists of tabular frequencies, shown as adjacent rectangles, erected over discrete
intervals (bins), with an area equal to the frequency of the observations in the interval.
Bar chart:
A bar chart is a chart with rectangular bars with lengths proportional to the values that they represent. The
bars can be plotted vertically or horizontally.
Frequency polygon
11
Manju, Associate professor, CSE, IIUC
Pie chart:
Scatter diagram:
12
Manju, Associate professor, CSE, IIUC
Line chart:
A line chart is a two-dimensional scatter plot of ordered observations where the observations are connected
following their order.
Box plot
13
Manju, Associate professor, CSE, IIUC
Diagram
Graph
14