Business Analytics QB

This question bank covers two business analytics courses: Python programming and exploratory data analysis for business. Each course has six modules; topics include Python fundamentals, data visualization with R, statistical learning, linear regression, tree-based methods, principal components analysis, and support vector machines. Each module is divided into sections of short- and long-answer questions worth 3, 7, and 10 marks.

Uploaded by

Arjun Singh A

Introduction to Python, Data and Control Systems


Module-1: Introduction to Python Program (8 Hours)
Section A: 3 Marks Questions
● Define the term "debugging" in the context of programming.
● Explain the significance of computer hardware architecture in programming.
● Differentiate between a word and a sentence in programming.
Section B: 7 Marks Questions
● Describe the process of installing Python on a computer system.
● Discuss the role of comments in Python programming with an example.
● Explain the concept of flow of execution in Python programs.
Section C: 10 Marks Questions
● Elaborate on the motivation and creativity aspects in Python programming.
● Analyze the importance of terminology in programming languages.
● Discuss the learning journey of a programmer in Python.

Module-2: Variables, Expressions, and Statements (8 Hours)


Section A: 3 Marks Questions
● Define and give examples of each data type: int, float, Boolean, string, and list.
● Explain the concept of precedence of operators in Python.
● What is the purpose of modules in Python programming?
Section B: 7 Marks Questions
● Discuss the flow of execution in a Python program with examples.
● Describe the concept of a function and its use in Python.
● How are parameters and arguments different in Python functions?
Section C: 10 Marks Questions
● Create a Python program that uses variables, expressions, and statements to perform a
specific task.
● Explain the role of functions in modularising code with a real-world example.
● Discuss the significance of modules in large-scale Python programming projects.
Module-3: Control Flow, Loops, and Conditionals (8 Hours)
Section A: 3 Marks Questions
● Define Boolean values and operators in Python.
● Differentiate between conditional, alternative, and chained conditional statements.
● What is the purpose of the "break" and "continue" statements in Python loops?
Section B: 7 Marks Questions
● Explain the concept of iteration in Python with examples of "while" and "for" loops.
● Discuss the use of conditional statements (if-else) in Python programs.
● How can you use the "while" loop to create an infinite loop in Python?
Section C: 10 Marks Questions
● Develop a Python program that uses conditional statements and loops to solve
problems.
● Discuss the advantages and disadvantages of using "for" and "while" loops in Python.
● Explain the concept of a chained conditional in Python with a practical example.
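
As an illustration of the Module-3 topics, here is a minimal sketch of a chained conditional combined with "break" and "continue" in a "for" loop. The function names, the grade boundaries, and the sentinel value -1 are assumptions chosen for the example; they do not come from the syllabus.

```python
def classify_score(score):
    """Map a numeric score to a grade label using a chained conditional."""
    if score >= 90:
        return "A"
    elif score >= 75:
        return "B"
    elif score >= 50:
        return "C"
    else:
        return "F"


def grades_until_sentinel(scores, sentinel=-1):
    """Grade scores in order, skipping invalid ones and stopping at the sentinel."""
    grades = []
    for score in scores:
        if score == sentinel:
            break       # leave the loop entirely
        if score < 0 or score > 100:
            continue    # skip this item and move on to the next one
        grades.append(classify_score(score))
    return grades
```

Only the first branch whose condition is true runs in the chained conditional, which is why the order of the tests matters.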
Module-4: Functions, Arrays, and Fruitful Functions (8 Hours)
Section A: 3 Marks Questions
● Define and provide examples of fruitful functions in Python.
● Differentiate between local and global scope in Python functions.
● Explain the concept of recursion with a simple Python example.
Section B: 7 Marks Questions
● Discuss the importance of return values in Python functions.
● Explore the usage of string slices and immutability in Python.
● How can you access elements of an array in Python?
Section C: 10 Marks Questions
● Develop a Python program that uses recursion to solve a specific problem.
● Explain the concept of array mutability and its implications in Python.
● Discuss the role of fruitful functions in writing modular and reusable code.
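
A minimal sketch of a recursive, fruitful function of the kind Module-4 asks about. Factorial is an assumed example problem; the questions do not name a specific one.

```python
def factorial(n):
    """Return n! for a non-negative integer n, computed recursively."""
    if n < 0:
        raise ValueError("n must be non-negative")
    if n == 0:
        return 1                     # base case stops the recursion
    return n * factorial(n - 1)      # recursive case shrinks the problem
```

The function is "fruitful" because it returns a value to the caller instead of only printing it, so the result can be reused in other expressions.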
Module-5: Lists, Tuples, Dictionaries (10 Hours)
Section A: 3 Marks Questions
● Provide examples of list operations in Python.
● Explain the concept of tuple assignment in Python.
● What are the key operations and methods associated with dictionaries in
Python?
Section B: 7 Marks Questions
● Discuss the mutability of lists in Python with examples.
● Explain the aliasing and cloning of lists in Python.
● How can you use list comprehension in Python to create a new list?
Section C: 10 Marks Questions
● Develop a Python program that extensively uses lists, tuples, and dictionaries.
● Explain why Python has no tuple comprehension and how a generator expression passed to tuple() builds a tuple instead, with a practical example.
● Discuss the advantages and use cases of dictionaries in Python programming.
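
The Module-5 questions on aliasing, cloning, and list comprehension can be illustrated with a short sketch. The variable names and values are assumptions made for the example.

```python
original = [1, 2, 3]

alias = original       # aliasing: both names refer to the same list object
clone = original[:]    # cloning: a slice creates a new, independent list

alias.append(4)        # visible through `original`, because it is the same object
clone.append(99)       # does not affect `original`

# List comprehension: build a new list from an existing one.
squares = [x * x for x in original]
```

After these statements, `original` contains the appended 4 while `clone` does not, which shows why mutating a list through an alias can have effects elsewhere in a program.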
Module-6: Files, Exception, Modules, Packages (8 Hours)
Section A: 3 Marks Questions
● Explain the process of reading and writing text files in Python.
● Discuss the significance of error handling and exceptions in Python
programming.
● What are modules and how are they used in Python?
Section B: 7 Marks Questions
● Develop a Python program that reads data from a file and handles exceptions.
● Discuss the role of various built-in modules like datetime, time, and math in
Python.
● Explore the concept of packages in Python with examples.
Section C: 10 Marks Questions
● Create a Python program that utilizes multiple modules to perform a complex
task.
● Discuss the importance of proper error handling in large-scale Python projects.
● Explore the usage of Python packages in real-world applications.
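
One possible shape for the Module-6 task of reading a file while handling exceptions is sketched below. It assumes a text file with one number per line; the function names and the choice to skip bad lines rather than stop are illustrative, not prescribed by the syllabus.

```python
import math


def parse_numbers(lines):
    """Convert an iterable of text lines to floats, skipping blank or invalid lines."""
    values = []
    for line_no, line in enumerate(lines, start=1):
        text = line.strip()
        if not text:
            continue
        try:
            values.append(float(text))
        except ValueError:
            print(f"line {line_no}: skipping non-numeric value {text!r}")
    return values


def read_numbers(path):
    """Read numbers from a file; return an empty list if the file is missing."""
    try:
        with open(path, "r", encoding="utf-8") as handle:
            return parse_numbers(handle)
    except FileNotFoundError:
        print(f"{path}: file not found")
        return []


def root_mean_square(values):
    """Use the built-in math module to summarise the values that were read."""
    if not values:
        raise ValueError("values must not be empty")
    return math.sqrt(sum(v * v for v in values) / len(values))
```

Catching specific exception types (`FileNotFoundError`, `ValueError`) rather than a bare `except` keeps unrelated bugs from being silently hidden.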
EXPLORATORY DATA ANALYSIS FOR BUSINESS
Module-1: Introduction to Data Mining and Exploratory Data Analysis
3 Marks Questions:
● Define Exploratory Data Analysis (EDA) and its significance in data mining.
● List and briefly explain three real-life applications of classification problems.
● Explain the concept of Euclidean Distance in the context of numerical summarization.
● What are the key tools used for displaying relationships between two variables in
EDA?
● Provide a brief overview of R scripts and mention a specific library used for
visualization.
7 Marks Questions:
● Discuss the nature of problems addressed by data mining in real-life scenarios,
using two examples.
● Explain the role of Mahalanobis Distance in exploratory data analysis, with an
illustration.
● Elaborate on the measures of similarity and dissimilarity in the context of data
analysis.
● Compare and contrast tools used for displaying single variables and tools for
displaying more than two variables.
● Create a sample R script using the ggplot2 library to visualize a dataset with
multiple variables.
10 Marks Questions:
● Illustrate the steps involved in the exploratory data analysis process using a real-world
example.
● Discuss the importance of numerical summarization in the context of data mining.
● Critically evaluate the role of visualization tools in exploring and understanding
complex datasets.
● Develop an R Markdown document that includes both code and textual explanations
for a data visualization task.
● Explain the significance of exploratory data analysis in making informed business
decisions.
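
For the Euclidean Distance question in this module, a minimal sketch follows. The course questions ask for R; Python, the language of the first course, is used here only to show the arithmetic, and the function name is an assumption.

```python
import math


def euclidean_distance(p, q):
    """Return the straight-line distance between two equal-length numeric points."""
    if len(p) != len(q):
        raise ValueError("points must have the same number of coordinates")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))
```

For example, `euclidean_distance((0, 0), (3, 4))` evaluates to 5.0, the length of the hypotenuse of a 3-4-5 right triangle.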
Module-2: Statistical Learning, Model Selection, and Cross Validation
3 Marks Questions:
● Define prediction accuracy and explain its importance in statistical learning.
● Differentiate between training error and test error in the context of model complexity.
● Briefly explain the concept of overfitting and the bias-variance trade-off.
● What is cross-validation, and how does it address the limitations of a holdout sample?
● Provide an example of K-fold cross-validation and its advantages.
7 Marks Questions:
● Discuss the role of model complexity in prediction error, providing examples.
● Explain the steps involved in the three-way split of training, validation, and test data.
● Compare and contrast random subsampling and K-fold cross-validation.
● Provide a detailed explanation of leave-one-out cross-validation with a practical
example.
● Critically evaluate the effectiveness of cross-validation in model selection.
10 Marks Questions:
● Elaborate on the challenges associated with overfitting and how cross-validation
mitigates these challenges.
● Discuss the practical implications of using cross-validation in real-world business
scenarios.
● Develop an R script that implements K-fold cross-validation for a given dataset.
● Evaluate the strengths and weaknesses of different model selection techniques.
● Explain how cross-validation contributes to the generalization ability of a predictive
model.
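
The index-splitting logic behind K-fold cross-validation can be sketched as follows. The question in this module asks for an R script; this version uses Python, the language of the first course, and covers only the partitioning step, not model fitting. The function name and the fixed default seed are assumptions.

```python
import random


def k_fold_indices(n_samples, k, seed=0):
    """Return k (train_indices, test_indices) pairs that partition range(n_samples)."""
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)          # shuffle once, reproducibly
    folds = [indices[i::k] for i in range(k)]     # k disjoint, near-equal folds
    splits = []
    for i in range(k):
        test_idx = folds[i]
        train_idx = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        splits.append((train_idx, test_idx))
    return splits
```

Each observation appears in exactly one test fold, so every data point is used for testing once and for training k - 1 times. Setting `k` equal to `n_samples` gives leave-one-out cross-validation.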
Module-3: Linear Regression and Variable Selection
3 Marks Questions:
● Define linear regression and its applications in data analysis.
● Explain the concept of point estimation in the context of linear regression.
● Provide an example of a linear model and discuss its theoretical justification.
● What are the key components of the Frequentist approach to parameter estimation?
● Summarize the main steps involved in variable selection for the linear model.
7 Marks Questions:
● Discuss the expectations and variances associated with linear methods in regression.
● Provide a practical example of linear regression and interpret the results.
● Compare and contrast parameter estimation in linear regression with other regression
methods.
● Develop an R script for implementing linear regression and interpreting the output.
● Critically evaluate the assumptions of linear regression in the context of data analysis.
10 Marks Questions:
● Elaborate on the significance of variance in the context of linear regression.
● Discuss the limitations of linear regression and when it might be inappropriate to use.
● Create an R script that includes both linear regression and variable selection steps.
● Evaluate the impact of outliers on linear regression and propose strategies to address
them.
● Explain the concept of bias-variance trade-off in the context of linear regression.
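
As an illustration of point estimation in a linear model, here is a minimal sketch of ordinary least squares for one predictor. The questions ask for R; Python is used here for consistency with the first course, and the function name is an assumption. It implements the closed-form estimates slope = Sxy / Sxx and intercept = mean(y) - slope * mean(x).

```python
def fit_simple_linear_regression(x, y):
    """Return (intercept, slope) of the least-squares line through the points (x, y)."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length, at least 2")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("x values must not all be identical")
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope
```

For data that lie exactly on the line y = 1 + 2x, the function recovers an intercept of 1 and a slope of 2.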
Module-4: Regression Shrinkage Methods and Tree-based Methods
3 Marks Questions:
● Define Ridge Regression and its purpose in data analysis.
● Differentiate between Ridge Regression and Lasso Regression.
● Explain the concept of coefficient shrinkage in the context of regression shrinkage
methods.
● What is the impurity function, and how is it used in tree-based methods?
● Briefly discuss the advantages of the tree-structured approach in regression.
7 Marks Questions:
● Compare the squared loss for Ridge Regression and traditional regression methods.
● Discuss the process of constructing a tree in tree-based methods, emphasizing
impurity.
● Explain the concept of bagging and its role in improving the performance of
tree-based models.
● Provide examples of situations where pruning in tree-based methods is beneficial.
● Develop an R script that implements Ridge Regression and includes a bagging step.
10 Marks Questions:
● Elaborate on the types of problems that Ridge Regression aims to address.
● Critically evaluate the trade-offs involved in using Ridge Regression compared to
traditional regression.
● Discuss the steps involved in constructing a decision tree and the considerations for
pruning.
● Create an R script that demonstrates the process of constructing a decision tree with
pruning.
● Evaluate the strengths and weaknesses of bagging and random forests in the context
of regression.
Module-5: Principal Components Analysis and Classification
3 Marks Questions:
● Define Singular Value Decomposition (SVD) and its role in Principal Components
Analysis (PCA).
● Explain the concept of principal components and their interpretation in PCA.
● Define the classification error rate and discuss its significance in classification
problems.
● What is the Bayes Classification Rule, and how does it relate to classification error
rate?
● Briefly describe the assumptions of logistic regression in classification.
7 Marks Questions:
● Discuss the geometric interpretation of principal components in PCA.
● Elaborate on the process of acquiring data for Principal Components Analysis.
● Compare and contrast logistic regression with linear regression in the context of
classification.
● Provide an example of binary classification and discuss the logistic regression fitting
process.
● Critically evaluate the role of principal components in reducing dimensionality.
10 Marks Questions:
● Explain the steps involved in Principal Components Analysis (PCA) with a practical
example.
● Discuss the challenges associated with classification error rate and strategies to
address them.
● Develop an R script that implements logistic regression for binary classification.
● Evaluate the performance of logistic regression in a multiclass classification scenario
(K ≥ 3).
● Compare and contrast discriminant analysis with logistic regression for classification.
Module-6: Support Vector Machines
3 Marks Questions:
● Provide an overview of Support Vector Machines (SVM) and their applications.
● Define the Support Vector Classifier and its use when data is linearly separable.
● Explain the role of kernel functions in handling data that is not linearly separable.
7 Marks Questions:
● Discuss the considerations when data is not linearly separable and how SVM
addresses them.
● Elaborate on the process of constructing a multiclass SVM.
● Compare the performance of SVM with linear and non-linear kernels.
10 Marks Questions:
● Evaluate the advantages and limitations of Support Vector Machines in real-world
scenarios.
● Develop an R script that implements a Support Vector Classifier for a given dataset.
● Discuss the importance of selecting appropriate kernel functions in SVM.
● Critically evaluate the transition from bagging to random forests and the advantages
it brings to the model.
