0% found this document useful (0 votes)
3 views4 pages

Data Analytics and Visualization - Module-Wise Question Organization

The document outlines the question structure for a Data Analytics and Visualization course across six modules, detailing specific questions from past exam papers from December 2023 to May 2024. Each module covers different aspects of data analytics, including the data analytics lifecycle, regression techniques, time series analysis, text analytics, and data visualization in R and Python. The total marks per paper is 80, with a compulsory question worth 20 marks and other questions allowing students to choose from a selection.

Uploaded by

vg.pranaybhoir
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views4 pages

Data Analytics and Visualization - Module-Wise Question Organization

The document outlines the question structure for a Data Analytics and Visualization course across six modules, detailing specific questions from past exam papers from December 2023 to May 2024. Each module covers different aspects of data analytics, including the data analytics lifecycle, regression techniques, time series analysis, text analytics, and data visualization in R and Python. The total marks per paper is 80, with a compulsory question worth 20 marks and other questions allowing students to choose from a selection.

Uploaded by

vg.pranaybhoir
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

Data Analytics and Visualization - Module-wise Question

Organization
Module 1: Introduction to Data Analytics

Questions from December 2023 Paper:


Q1(a): Why is data analytics lifecycle essential? [4 marks]
Q2(a): Explain the data analytics lifecycle. [10 marks]

Questions from May 2023 Paper:


Q1(a): What is an analytic sandbox, and why is it important? [5 marks]

Q3(a): List and explain the main phases of the Data Analytics Lifecycle. [10 marks]

Questions from December 2024 Paper:


Q1(a): List and explain different phases in data analytics lifecycle. [5 marks]

Questions from May 2024 Paper:


Q1(A): List and explain different key roles for successful data analytics? [5 marks]
Q2(A): Explain different phases in data analytics lifecycle. [10 marks]

Module 2: Linear Regression and Logistic Regression

Questions from December 2023 Paper:


Q1(b): The regression lines of a sample are given. Find sample means and coefficient of correlation.
[4 marks]

Q1(c): Differentiate between linear regression and logistic regression. [4 marks]


Q2(b): Find two lines of regression from the given data and estimate ages. [10 marks]
Q4(b): Fit a regression equation to estimate damage for given weight, distance data. [10 marks]

Q5(a): From given results, obtain two regression equations and estimate yield/rainfall. [10 marks]
Q5(b): What is stepwise regression? State and explain different types. [10 marks]

Questions from May 2023 Paper:


Q1(d): What is regression? What is simple linear regression? [5 marks]
Q3(b): Describe how logistic regression can be used as a classifier. [10 marks]

Q6(a): Evaluating the Residuals in Linear regression. [5 marks]

Questions from December 2024 Paper:


Q2(a): Calculating the regression equation of y on x and x on y from given data. [10 marks]

Questions from May 2024 Paper:


Q1(B): What is Stepwise regression? Explain its types. [5 marks]

Q4(A): Fit lines of regression of y on x and x on y for bacterial cell data. [10 marks]
Q5(B): What is Logistic Regression? Similarities and differences with linear regression. [10 marks]

Q6(B): Generalized Linear Model [5 marks]

Module 3: Time Series Analysis

Questions from December 2023 Paper:


Q3(a): Explain Box-Jenkins intervention analysis. [10 marks]

Questions from May 2023 Paper:


Q1(b): Why use autocorrelation instead of autocovariance when examining stationary time series? [5
marks]
Q5(a): How does the ARMA model differ from the ARIMA model? [10 marks]

Q6(b): Box-Jenkins Methodology [5 marks]

Questions from December 2024 Paper:


Q1(e): What is time series analysis? Explain its components. [5 marks]
Q1(f): What is Pandas? Explain features of Pandas. [5 marks]

Q6(a): Time series analysis [5 marks]

Questions from May 2024 Paper:


Q1(E): Explain components of time series? [5 marks]

Q2(B): Explain ARIMA model in detail. Also state its Pros and Cons. [10 marks]

Q6(A): Box-Jenkins Methodology [5 marks]

Module 4: Text Analytics

Questions from December 2023 Paper:


Q1(e): Explain term frequency (TF), document frequency (DF), and inverse document frequency (IDF).
[4 marks]
Q3(b): What is text mining? Enlist and explain the seven practice areas of text analytics. [10 marks]

Questions from May 2023 Paper:


Q2(b): List and explain methods that can be used for sentiment analysis. [10 marks]
Q4(b): List and explain the steps in the Text Analysis. [10 marks]

Q5(b): Explain with suitable example how the Term Frequency and Inverse Document Frequency are
used in information retrieval. [10 marks]

Questions from December 2024 Paper:


Q1(d): Explain in brief steps of text analysis. [5 marks]
Q4(b): Explain seven practice areas of text analytics. [10 marks]

Questions from May 2024 Paper:


Q1(C): Explain Term Frequency-Inverse Document Frequency (TF-IDF) with a suitable example. [5
marks]

Q3(A): Explain in detail seven practice areas of text analytics. [10 marks]

Q5(A): Enlist and explain the steps of text analysis. [10 marks]

Module 5: Data Analytics and Visualization with R

Questions from December 2023 Paper:


Q4(a): Explain different types of data visualizations in R programming language. [10 marks]

Questions from May 2023 Paper:


Q6(d): Data import and Export in R [5 marks]

Questions from December 2024 Paper:


Q5(a): Explain which analysis model is used to predict forecast monthly average temperature
considering historical climate data. [10 marks]

Q5(b): Explain following data visualization libraries in Python: Box plot, Violin plot, scatter plot. [10
marks]

Questions from May 2024 Paper:


Q3(B): Explain different types of data visualisation in Python programming language. [10 marks]
Q4(B): How Exploratory Data Analysis (EDA) is performed in R? [10 marks]

Q6(D): Data import and export in R [5 marks]

Module 6: Data Analytics and Visualization with Python

Questions from December 2023 Paper:


Q1(d): What is Pandas? State and explain key features of Pandas. [4 marks]
Q6: Write short notes on (any 2): [20 marks]
a. Time series analysis

b. Exploratory data analysis


c. Regression plot

d. Generalized linear model (GLM)

Questions from May 2023 Paper:


Q1(c): Difference between Pandas and NumPy. [5 marks]

Q2(a): Explain in detail how dirty data can be detected in the data exploration phase with
visualizations. [10 marks]

Q4(a): Statistical method for promotional offer analysis. [10 marks]

Q6(c): Seaborn Library. [5 marks]

Questions from December 2024 Paper:


Q6: Write short notes on: [20 marks]
a. Box-Jenkins Methodology

b) Residue Regression

c) Seaborn Library

d) Generalized Linear Model

Questions from May 2024 Paper:


Q1(D): Difference between Matplotlib and Seaborn library. [5 marks]

Q6(C): Pandas library [5 marks]

Summary
Total marks per paper: 80

Compulsory question (Q1): 20 marks

Other questions: 20 marks each (attempt any 3 out of remaining 5)

Short notes questions: Usually 20 marks (4×5 or 2×10)

You might also like