Unit - I: Topic - 1
Unit - I: Topic - 1
The purpose of learning this Session on Introduction of Data Science and Basic Data Analytics using R is to
1. Understand data science and its characteristics that are fueling big data analytics.
2. Educate the importance of R tool in data analysis and Familiarize students with Basic R syntax.
At the end of this class hour on Introduction of Data Science and Basic Data Analytics using R, my students will be able to:
Implement Operate
Design
Conceive (What Students (What Students
Time Topic (What Students would
(What Students Absorb) would would demonstrate /
design/solve)
Implement / test) prove)
Introduction of Data Science Role of data science in analytics
10 About Driving Data Deluge
Basic / Key Learning Material: (Descriptive/ Concept Map/ Mind-Map etc., to be prepared by faculty member)
Quiz Choose all Correct / Multiple Choice / True or False / Match the following Answer 1 Answer 2 Answer 3 Answer 4
Which of the following is one of the key data science skills? c) Data
b) Machine d) All of the
1. a) Statistics Visualizati
Learning Mentioned
on
If I execute the expression x <- 4 in R, what is the class of the object `x' as
2. determined by the `class ()' function? NUMERIC INTEGER REAL COMPLEX
All of the
7. Numbers in R are generally treated as _______ precision real numbers. Single Double real
mentioned
8. R objects can have attributes, which are like ________ for the object. metadata features Expression None
Topic_2: R Graphical User Interfaces – Data Import and Export – Attribute and Data Types
Session _2:
Recap / Introduction:
Introduction of Data Science and Big
05 Oral Questioning /
Data Analytics using R
Discussion
Sub-topic – 1 (Lecture/Demonstration)
R graphical Interfaces, Data Import and
10 Window panes of
Export
R Studio
Sub-topic – 2 (Lecture/Demonstration)
Attributes and Data Types
10 NOIR Attributes
Sub-topic – 2 (Participative / Verify)
Activity
10 Code for an
Example
Conclusion &Summary
Review Activity
05 Short questions
and answers
Basic / Key Learning Material: (Descriptive/ Concept Map/ Mind-Map etc., to be prepared by faculty member)
PPT
Answ
Quiz Choose all Correct / Multiple Choice / True or False / Match the following Answer 1 Answer 2 Answer 4
er 3
d) All of
c)
a) the
The __________ function returns a list of all the formal arguments of a function b) funct() formal
1. formals()
() mentioned
b) c) d)
You can check to see whether an R object is NULL with the _________ function. a) is.null()
2. is.nullobj() null() empty()
c)t.co
Which function in R language is used to find out whether the means of 2 groups are
4. a)t.tests () b)t.equals() mapre d)None
equal to each other or not?
()
Answer: 12. In f(3), y is 2, so y^2 is 4. When evaluating g(3), y is the globally scoped y (5) instead of the y that is locally scoped to f,
so g(3) evaluates to 3 + 5 or 8. The rest is just 4 + 8, or 12.
6. If I have a data.frame df <- data.frame(a = c(1, 2, 3), b = c(4, 5, 6), c(7, 8, 9))...
2. Write short notes on Arrays and Matrices in R programming with suitable example.
Recap / Introduction:
Introduction of Statistics
05 Oral Questioning /
Discussion
Sub-topic – 1 (Lecture/Demonstration)
Activity
Functions on view, summary, basic
20 Basic Function
statistics ,correlation
commands using
R
Sub-topic – 2 (Lecture/Demonstration)
Summary statistics on Example that
20 Mean Vs Median demonstrates
Standard Deviation summary
Quartiles , Min/Max statistics
Correlations between variables
Conclusion &Summary
Review Activity
05 Short questions
and answers
PPT,PDF
Choose all Correct /
Multiple Choice / True or
Quiz Answer 1 Answer 2 Answer 3 Answer 4
False / Match the
following
In R, the t.test () function produces a variety of t-tests. T-test is the most common test in statistics and used to determine whether
the means of two groups are equal to each other.
Session _4:
The purpose of learning this Session on Exploratory data analysis is to learn about graphical representation of data
At the end of this class hour on Exploratory data analysis: Visualizing before analysis, my students will be able to:
Sub-topic – 1
Able to know
(Lecture/Demonstartion)
15 about the functions
Ggplot2() package detail
in ggplot()
package.
Sub-topic – 2
(Lecture/Demonstration) Understand the
10 Comparison of Base graphics and differences and
ggplot2() specialities of
ggplot package
Session _5:
The purpose of this session would be to learn to visualize data corresponding to single variable and multiple variables
A company’s sales plot shows that there are small peaks at the
Seasonality
11. end of the year and large peaks at the middle of the year. This Peak Effect Quarterly effect Year effect
effect
effect is called ------------.
12. The R command used to plot a continuous histogram is ----------- glot(data) barplot(data) rug(data) glot(density[data])
Approximation
In dirty data cleaning, the wrong data can generally be replaced based on
13. Ones Zeros NULL
with -------------------- nearest
neighbour
Session _6:
Session Learning Rationale :
The purpose of learning this Session on Data Exploration Versus presentation with respect to Data analytics.
Session Learning Outcomes:
At the end of this class hour on Data Exploration Versus presentation students will be able to:
1. Know what the user need to know
2. Ability to present data using R studio
Operate
Design Implement
Conceive (What Students
(What Students (What Students
Time Topic (What Students would
would would Implement
Absorb) demonstrate /
design/solve) / test)
prove)
Recap / Introduction:
Refreshing the various plot functions
05 Oral Questioning /
Discussion
Sub-topic – 2 (Lecture/Demonstration)
Using an suitable example discuss on
20 Data Presentation and Exploration
Conclusion &Summary
Review Activity
05 Short questions
and answers
Basic / Key Learning Material: (Descriptive/ Concept Map/ Mind-Map etc., to be prepared by faculty member)
PPT,PDF
. _________ is a category of
d) All of the
applications and
mentioned
technologies for presenting c) EIS
2 a) Data warehouse b) MIS
and analyzing corporate
and external data
............................. is the
process of finding a model
that describes and
A) Data D) Data
4 distinguishes data classes C) Data discrimination
Characterization B) Data Classification selection
or concepts.
__________ is a
nonparametric hypothesis
c) Wilcoxon
5. test that checks whether a) Student’s test b) Welch’s test
rank sum test
two populations are
identically disturbed.