The document outlines important questions for a Business Analytics course, covering topics such as data definitions, data science, data analytics, R programming, data visualization, and regression analysis. It includes specific questions related to data types, analytics roles in business, R features, data importing, and predictive analytics. The questions are categorized into units with varying marks, emphasizing the need for thorough preparation in these areas.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0 ratings0% found this document useful (0 votes)
3 views8 pages
Ba Important Questions
The document outlines important questions for a Business Analytics course, covering topics such as data definitions, data science, data analytics, R programming, data visualization, and regression analysis. It includes specific questions related to data types, analytics roles in business, R features, data importing, and predictive analytics. The questions are categorized into units with varying marks, emphasizing the need for thorough preparation in these areas.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 8
Latika Bajetha/SRCC/6th Sem/Business Analytics
These questions are important, and you should prepare
them well. In theory question paper, one short answer compulsory question with internal choice may be asked in the form of MCQ or true/ false question with reasons.
Questions: Unit 1 (15 marks)
1. Define data. What are the different forms of data? 2. Data Science (Scope , significance) 3. What is data analytics? Its role in modern business world(same as question 4) 4. Role of Data Analytics with respect to various business verticals. ( HR , Marketing, Finance, Sales, R&D , supply chain management). 5. Differentiate between data analytics and data analysis. 6. What are the four types of analytics? Explain with examples. Descriptive Diagnostic Predictive Prescriptive 7. What do you understand by Data Analytics? How is It (associated / different) with Data Science and Big Data Analytics 8. Role of Data Analytics with respect to various business verticals. ( HR , Marketing, Finance, Sales, R&D , supply chain management). 9. Explain the 5 V’s of Big Data and How is Big Data different from traditional data. Application of Big Data in different industries (retail/ healthcare/banking / social media etc)
Latika Bajetha/SRCC/6th Sem/Business Analytics
Latika Bajetha/SRCC/6th Sem/Business Analytics
10. Challenges in Data Analytics (Discuss issues related to
data quality, integration, ethics privacy and how to overcome those challenges)
Questions: Unit 3 (10 marks)
1. What is R? Describe some of the features. Advantages of R for data Analysis 2. What is data transformation? Explain any 3 ways in which data can be transformed in R 3. Describe the steps for installing R and RStudio on a computer. And Explain the interface of RStudio briefly. 4. What are packages in R? Why are they important? How do you install and load a package in R? Give examples. 5. Differentiate between a package and a library in R. 6. Name five commonly used R packages and their uses. 7. What are the basic syntax rules in R? 8. Discuss different functions used for importing spreadsheet data in R. 9. Explain the use of if, else, and ifelse in R with examples. 10. What are control flow statements? Why are they important? And Write an R program that uses nested if statements. 11. Describe the types of loops in R: for, while, and repeat.Compare for and while loops in R. Write a program using a loop to print even numbers between 1 and 20. 12. What is a function in R? How is it defined and called?Differentiate between user-defined and built-in functions in R.
Latika Bajetha/SRCC/6th Sem/Business Analytics
Latika Bajetha/SRCC/6th Sem/Business Analytics
13. What is the apply family in R? List different apply
functions.Compare apply(), lapply(), and sapply() with examples.When should one use tapply() or mapply()? 14. Discuss how missing values may be handled in R? Is it possible to replace the missing values? 15. What is a constant in R? Explain the various types of constants in R. 16. What is a data structure. Describe all the 6 data structures in R. (same as question 17) 17. Short notes on data structure 18. What are control structures in R? Discuss the broad categories of control structures in R 19. What is a variable in R? How does it differ from a constant?
Questions: Unit 4 (10 marks)
1. What is data importing? Explain the various ways in which data of different formats may be loaded into R (Excel or CSV etc . Syntax ). What common issues can arise when importing data, and how can they be resolved 2. What are the different formats from which data can be imported into R?Explain the process of importing a CSV file into R with an example. 3. What are the key functions used to read Excel and text files in R? 4. Histograms ( When is it used and what is its function),Bar Charts (How can you create horizontal and vertical bar charts in R?What are grouped and stacked bar charts) Box Plots When is it used and what is its function What are outliers in a box plot? How are they represented? Line Graphs (when it can be used How do you plot multiple lines on the same graph in R?Scatter Plots How does it
Latika Bajetha/SRCC/6th Sem/Business Analytics
Latika Bajetha/SRCC/6th Sem/Business Analytics
help in analyzing relationships? What is the significance of
adding a trend line (regression line) to a scatter plot? 5. What are the common measures of dispersion in statistics? 6. Why is standard deviation a better measure of spread than range? 7. What is covariance? What does a positive or negative covariance indicate?How do you calculate covariance in R? 8. How do you compute the following in R and what do they imply Mean Median mode Median absolute deviation Mean absolute deviation Skewness Kurtosis Range variance Standard deviation the interquartile range 9. How is correlation different from covariance and Write the R code to calculate correlation between two variables. And explain What is the range of correlation values and how do you interpret them? 10. What is the coefficient of determination? How is it related to correlation?How do you interpret R² in the context of regression analysis? Why is R² important in understanding model accuracy? 11. What is data visualization? Describe the various applications of data visualization in different sector and Describe, in detail, the various types of tools and packages
Latika Bajetha/SRCC/6th Sem/Business Analytics
Latika Bajetha/SRCC/6th Sem/Business Analytics
that can be used to create powerful visualizations of data in
R 12. *ggplot2 is the most versatile and powerful data visualization package in R, offering a wide range of customization options and statistical graphics." In the light of this statement, describe the various functions in the ggplot library that can be used to create plots and graphs in R. 13. What are base R graphics? Describe some of the base R functions that can be used to create graphical representations of data in R 14. Describe various aesthetics that can be used in ggplot2 to customise line graph, scatter plot , boxplot , bar charts, histogram created in R. 15. Mention any three functions in the ggplot R package that can be used to create powerful visualizations
Questions: Unit 5 (25 marks)
1. Write the steps to estimate a simple linear regression model in R . What are the assumptions of simple linear regression? State the equation of a simple linear regression model and explain each term and the various types of plots that can be constructed in Ras a diagnostic tool for simple linear regression models. 2. How is the line of best fit determined in simple linear regression? 3. What are confidence and prediction intervals in the context of regression? How are they different from each other Using a suitable example, explain how the confidence and prediction intervals of a basic linear regression model can be generated using R script code.
Latika Bajetha/SRCC/6th Sem/Business Analytics
Latika Bajetha/SRCC/6th Sem/Business Analytics
4. Give an example of how prediction intervals are used in
business forecasting. And What factors affect the width of a confidence or prediction interval? 5. What is multiple linear regression? How does it differ from simple linear regression? Explain, using a suitable example, how multiple linear regression can be performed using the scripting language R. What are the assumptions of multiple linear regression? 6. Explain how you would interpret the regression equation with three independent variables. 7. What does an R-squared value indicate in a multiple regression model? 8. How do you interpret the slope coefficient in a regression model and What does the intercept in a regression equation represent? 9. How do you interpret regression coefficients when there are multiple predictors?What is the meaning of a negative regression coefficient and If a coefficient is not statistically significant, how should it be interpreted? 10. What is heteroscedasticity? Why is it a problem in regression analysis? Name and explain two methods to test for heteroscedasticity and also How can we detect and correct heteroscedasticity using R script code and What are the consequences of ignoring heteroscedasticity? 11. Define multicollinearity. How does it affect regression results? What are the causes of multicollinearity? What are the ways to deal with multicollinearity in a regression model? Explain the various methods that can be employed to detect potential multicollinearity in a regression model. 12. How can multicollinearity be detected and resolved using R script code? Explain the Variance Inflation Factor (VIF). Why
Latika Bajetha/SRCC/6th Sem/Business Analytics
Latika Bajetha/SRCC/6th Sem/Business Analytics
is multicollinearity more problematic in multiple linear
regression than in simple linear regression? 13. What do you understand by textual data analysis? Discuss some of the major applications and its broad categories of methods and techniques 14. What is text mining? Discuss the steps involved in performing text mining in R .Discuss the various techniques involved 15. What is sentiment analysis? Discuss the benefits and challenges associated with it and write its steps 16. What do you understand by text categorization? Describe the main approaches to text categorization 17. What is a confusion matrix? Describe the structure and interpretation of its components. 18. What is TF-IDF? Discuss the real-world applications of TF- IDF analysis. Also explain process of generating a TF-IDF Matrix in R. 19. What is data clustering? Discuss the steps and techniques involved in data clustering 20. What do you understand by visualization and reporting of textual data? Discuss the benefit challenges associated with textual data visualization and reporting 21. Write short notes on stemming and lemmatization tokenization confusion matrix topic modelling data pre-processing Document-Term Matrix 22. What is predictive data analytics? Discuss its features using suitable examples and describe the scope of predictive
Latika Bajetha/SRCC/6th Sem/Business Analytics
Latika Bajetha/SRCC/6th Sem/Business Analytics
analytics in the business world .What are predictive tools and
techniques? Discuss some of the key tools and techniques. 23. Discuss the role of R in facilitating predictive analytics 24. What is a Residual vs Fitted plot?