Question Bank-DA
Question Bank-DA
UNIT-I
Short Questions
Long Questions
1. Explain briefly about Design Data Architecture and manage the data for analysis in data
management with neat sketch.
2. Discuss briefly about Understand various sources of Data and also explain data like
Sensors/Signals/GPS etc.
3. Discuss briefly about Data Quality (noise, outliers, missing values, duplicate data) with
examples.
4. Market researchers have used four experimental designs for generating primary data.
Describe them in detail?
5. Discuss the techniques for handling missing values in datasets. Compare different imputation
methods and their impact on the results of data analysis.
6. How can outliers be detected and managed in a dataset? Discuss the methods for identifying
and handling outliers in different types of data.
7. Explore the implications of duplicate data in large datasets. How can data deduplication be
effectively implemented in data management processes?
8. Explain briefly about data preprocessing in data management.
9. Explain about data quality?
10. Analyze a case study where data management played a crucial role in the success of a business
analytics project. Discuss the data architecture, sources of data, data quality challenges, and
how they were managed.
UNIT-II
Short Questions
Long Questions
1. Define Data Analytics. Why data analytics is important in real world? And also explain tools and
environment in data analytics.
2. Describe the various tools and environments used in data analytics, highlighting their specific
applications in business.
3. Explain the different types of databases and how they manage different types of data. Provide
examples of each.
4. Illustrate Data modeling techniques with examples.
5. Demonstrate Missing Imputation methods in detail with examples.
6. What are the key steps in developing a business model, and how does data analytics play a role
in each step?
7. Examine the different types of variables used in data analysis. How do they influence the choice
of data modeling techniques?
8. Discuss the process and importance of feature selection in data modeling.
9. Explain how predictive analytics can be applied in a business context to forecast future trends.
Provide a case study example.
10. Analyze the need for business modeling and how it integrates with data analytics to solve
complex business problems.
UNIT-III
Short Questions
Long Questions
1. Explain the concept of regression analysis, discussing its importance and applications in business
decision-making.
2. Discuss the BLUE (Best Linear Unbiased Estimator) property in detail, explaining the assumptions
required for it to hold in linear regression models.
3. Describe the Least Squares Estimation method, including its mathematical foundation and
application in regression analysis.
4. Examine the process of variable rationalization in model building. How does it influence the
quality and interpretability of a regression model?
5. Demonstrate linear regression with suitable example.
6. Provide a detailed explanation of logistic regression, including its underlying theory and
differences from linear regression.
7. Illustrate Least Square Estimation method with example.
8. Discuss the various model fit statistics used in logistic regression, explaining how they help in
assessing the performance of the model.
9. Explore the applications of logistic regression in various business domains, providing examples
of how it is used to solve real-world problems.
10. Describe a case study where regression or logistic regression was used to build a predictive
model in a business context. Discuss the steps taken, challenges faced, and outcomes achieved.
UNIT-IV
Short Questions
Long Questions
1. Discuss the differences between regression and segmentation, highlighting their respective
applications in data analysis.
2. Explain the concepts of supervised and unsupervised learning. Provide examples of each and
discuss their importance in the context of data segmentation.
3. Describe the process of building a decision tree for regression and classification tasks. Include a
discussion on the key steps involved, such as splitting criteria, stopping rules, and model
evaluation.
4. Analyze the problem of over fitting in decision tree models. How can pruning and other
techniques be used to mitigate this issue?
5. Discuss the advantages and disadvantages of using multiple decision trees, such as in Random
Forests or Gradient Boosting Machines, compared to a single decision tree.
6. Explain the ARIMA model in detail, including its components (AR, I, MA) and the process of
fitting an ARIMA model to a time series dataset.
7. Discuss the various measures of forecast accuracy used in time series analysis, such as MAE,
RMSE, and MAPE. Provide examples of how they are calculated and interpreted.
8. Describe the STL (Seasonal-Trend Decomposition using LOESS) approach in time series
forecasting. How does it help in understanding and modeling complex time series data?
9. Explain how features such as 'height' and 'average energy' can be extracted from a time series
model and used for further analysis or prediction. Provide examples of applications where this
might be useful.
10. Analyze a case study where time series methods like ARIMA or STL were used for forecasting in
a business context. Discuss the challenges faced, the modeling approach taken, and the
outcomes achieved.
UNIT-V
Short Questions
Long Questions
1. Discuss the role and importance of data visualization in business analytics, providing examples
of how it can influence decision-making.
2. Explain pixel-oriented visualization techniques in detail, including their advantages and
limitations. Provide examples of when and how they should be used.
3. Describe geometric projection visualization techniques and their application in visualizing multi-
dimensional datasets. Include a discussion on common methods like PCA and MDS.
4. Explore icon-based visualization techniques, explaining how they work and their effectiveness in
representing complex data. Provide examples of commonly used icon-based visualizations.
5. Analyze hierarchical visualization techniques, discussing how they are used to represent data
with inherent hierarchical structures. Provide examples such as tree maps and dendrograms.
6. How can pixel-oriented visualization techniques be applied to large datasets? Discuss the
challenges and solutions related to scalability and interpretability.
7. Examine the strengths and weaknesses of geometric projection techniques compared to other
visualization methods when dealing with high-dimensional data.
8. Discuss the use of hierarchical visualization techniques in understanding and analyzing
organizational structures or taxonomies. Provide examples from different industries.
9. How can complex data and relationships be effectively visualized to ensure clarity and
actionable insights? Discuss the principles and best practices of visualizing complex data.
10. Describe a case study where advanced visualization techniques (such as pixel-oriented,
geometric projection, or hierarchical) were used to solve a business problem. Analyze the
visualization process, the challenges faced, and the impact on decision-making.