Chapter 2 Arranging and Collecting Data
Chapter 2 Arranging and Collecting Data
2) A food delivery app has asked for your feedback on the quality of the food. You have
written two paragraphs to describe the food. This is an example of :
a) Qualitative data
b Quantitative data
c) Both qualitative and quantitative data
d) None of the above
Ans. A
3) You need to predict what the temperature will be for next Friday. Which algorithm will
you use?
a) Clustering
b) Regression
c) Anomaly detection
d) Binary classification
Ans. B
4) You need to predict if you car tyre will last for next 1000 km. Which algorithm will you
use?
a) Clustering
b) Regression
c) Anomaly detection
d) Binary classification
Ans. D
5) Which of the following questions are the benefits of big data processing?
Ans. D
6) The analysis of large amount of data to see what patterns or other useful information
can be found is known as
a) Data Analysis
b) Information Analytics
c) Big data Analytics
d) Data Analytics
Ans. C
a) Collects data
b) Spreads data
c) Organizes data
d) Analyses data
Ans. B
a) Experiment
b) Survey
c) Both a and b
d) None of the above
Ans. C
9) The advantage of secondary data are low cost, speed, availability, and flexibility
a) True
b) False
Ans. A
a) Survey
b) Information
c) Observational
d) Experimental
Ans. C
Standard Questions
1) What is the difference between multivariate and univariate data? Give some
examples.
Ans :- The difference between multivariate and univariate data are as followed :-
For ex :- Consider an example of umbrellas. You may notice that umbrellas are in much
demand during the rainy season. Hence the sale of umbrellas gets increase during the
rain. So here two variables are noticed which are “rain and umbrella”.
As compared to the univariate type of data, such data type are more complex, compared
and can be in relation with multiple circumstances.
b) Univariate data :- In this data type there is involvement of only one/single variable. In
comparison to Multivariate data, this data type does not involve any comparison or
related at multiple circumstances. For ex:- The weight of the students in univariate data
Student 1 45
Student 2 56
Student 3 60
Ans :- There are two common sources which are involved in the collection of data which
are : –
Data Sources
Primary and Secondary
a) Primary :- This are the sources which are been for making the collection of data for
analysing purpose. For ex :- data collected by conducting different types of surveys,
interviews, questionnaire, and by creating feedback forms. In order to collect the primary
data, following methods are been used :-
i) Physical interview
b) Secondary :- The data which is been collected by these source is already been used
for other used, and the same data for analysing purpose. Secondary sources of data
includes Transactional data, Satellite data etc. There are some methods which are
mostly used for collecting the secondary data are :-
a) Volume :- Volume is mostly referred with the data size. On the basis of the data size,
it can be very easily determined that whatever the data is available whether it is big or
not. The data sets which are excess than the terabytes and petabytes can be termed as
Big Data.
b) Variety :- As we are aware that Big Data are usually collected from various sources or
ranges which may include transactional databases, sensor data etc. this includes
different types of images, pictures, audio, video etc. Therefore different variety data is an
essential characteristic of the Big Data.
c) Velocity :- Velocity is a term which is related with the rate at which the data is been
generated. The data is been generated in a very faster speed resulting in creating of
high volumes. For ex :- Social media is the platform where the huge amount of data can
be generated for every / per minute.
d) Value:- The amount of useful information in large volumes of data is called value of
the Big Data
e) Veracity:- Veracity of data is a measurement of how trustworthy the data is. We can
trust data if it is accurate , obtained from verified sources and up-to-date as per
requirements.