0% found this document useful (0 votes)
13 views

Data an overview lecture 2

The document provides an overview of various types of data, including Time Series Data, Spatial-Series Data, Cross-Sectional Data, Spatio-Temporal Data, Longitudinal or Panel Data, and Frequency Type Data. It presents multiple datasets with examples from GDP growth, population statistics, rainfall, COVID cases, crop prices, and student marks. Each type of data retains individual identities and is characterized by specific observation patterns over time or across regions.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
13 views

Data an overview lecture 2

The document provides an overview of various types of data, including Time Series Data, Spatial-Series Data, Cross-Sectional Data, Spatio-Temporal Data, Longitudinal or Panel Data, and Frequency Type Data. It presents multiple datasets with examples from GDP growth, population statistics, rainfall, COVID cases, crop prices, and student marks. Each type of data retains individual identities and is characterized by specific observation patterns over time or across regions.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 26

Data: An Overview

Lecture 2
Consider the following dataset:

Year 1961 1962 1963 1964 1965 1967 1968 1969


GDP
growth of 3.722742533 2.931127735 5.994353262 7.452950123 -2.63577011 -0.05532877 7.825963031 3.387929176
India (in
%)
Now, consider this dataset:

Year 1953-57 1958-62 1963-67 1968-72 1973-77 1978-82 1983-87 1988-92 1993-97
Population 398,577, 445,954, 500,114,3 623,524,2 696,828,3 780,242,0 870,452,1 964,279,1 1,059,633
of India 992 579 46 19 85 84 65 29 ,675
• Observations are provided against points of time or against time
periods for the same individual (or item or region).
• All the individual identities are intact.
• This type of data is known as Time Series Data.
Some more examples
of time series data.
Intraday Trading Chart of BRITANNIA INDUSTRIES LIMITED
for Friday January 10th, 2025
Here is another dataset for you:
State Assam Arunachal Nagaland Manipur Mizoram Tripura Meghalaya
Pradesh
Total
rainfall in
mm (from
1/12/2023 17.2 20 32.2 30.3 69.6 44.3 25
to
31/12/202
3)
Another Dataset:

Country United United Japan Italy Germany France Canada


States Kingdom
Total 110,533,193 24,863,166 33,803,572 26,692,251 38,793,736 40,138,560 4,891,249
covid
cases (As
of 15/1/23
• Observations of one (or more) feature(s) for different regions for a
single time point are provided.
• Individual identity is retained.
• This type of data is known as Spatial-Series Data.
Another Dataset:
Crops Gram Jute Paddy Potato Mustard Wheat
(Autumn)
State
average price
for West
Bengal 5784 5993 1798 1596 5407 1969
(2021-22)
(Rs/Quintal)
• Observations of one (or more) feature(s) for different items for a
single time point are provided.
• Individual identity is retained.
• This type of data is known as Cross-Sectional Data.
Another dataset:

State West Bengal Bihar Odisha Madhya Uttar Jharkhand


Pradesh Pradesh
1951 26,300,670 29,085,900 14,646,100 18,615,700 60,274,800 9,697,300
1961 34,926,000 34,841,490 17,549,500 23,218,950 70,144,160 11,606,504
1971 44,312,017 42,126,800 21,944,625 30,017,180 83,849,775 14,227,493
1981 54,580,650 52,303,000 26,370,270 38,169,500 105,113,300 17,612,000
• Observations are provided for different regions over multiple
points or periods of time.
• Individual identity is intact.
• This type of data is called Spatio-Temporal data.
• Observations are provided for different items over multiple points
or periods of time.
• Individual identity is intact.
• This type of data is called Longitudinal or Panel data.
• In all of the previous cases, all the individual identities are
retained.
• An umbrella term for such data is Non-Frequency Data.
Now consider a dataset you previously
encountered:
Marks of 60 students in class 10 of ABC school in Maths (out of 100)
66 55 45 51 58 42 58 54 60 59 62 39 55 61 45

60 63 57 45 53 59 53 48 51 68 63 50 47 47 58

47 48 53 55 52 45 53 55 50 58 64 48 56 51 52

41 58 57 57 69 53 61 48 48 44 55 57 59 76 44
Marks 36-40 41-45 46-50 51-55 56-60 61-65 66-70 71-75 76-80
Represe
ntative
of the 38 43 48 53 58 63 68 73 78
group
Number
of 5 8 10 16 15 6 3 0 1
students
Number of Savings Accounts opened in a rural
bank over 30 working days

7 5 10 8 10
8 13 13 9 10
9 7 11 6 8
14 3 10 7 9
11 5 8 13 5
12 6 9 4 11
No. of
Savings
account 3 4 5 6 7 8 9 10 11 12 13 14
s opened
No. of
working 1 1 3 2 3 4 4 4 3 1 3 1
days
•Frequency Type Data

You might also like