Business Analytics ASSIGNMENT Questions
Business Analytics ASSIGNMENT Questions
ASSIGNMENT:
NAME: REVA
SECTION: B
Data Analytics goes beyond just looking at numbers - it’s about turning data into a
story that explains what’s happening in a business, process, or system, and what
actions can be taken to improve outcomes.
1. Data Collection – Gathering data from various sources like databases, logs,
sensors, web platforms, etc.
2. Data Cleaning – Removing errors, inconsistencies, and duplicates to
improve data quality.
3. Data Exploration and Transformation – Understanding the structure of
the data, transforming it into usable formats, and performing exploratory
data analysis (EDA).
4. Data Modeling and Analysis – Applying statistical or machine learning
techniques to identify patterns, correlations, or trends.
5. Data Visualization and Interpretation – Presenting results using charts,
graphs, dashboards, or reports to make them understandable and actionable.
In essence, data analytics is one of the tools or skill sets that data scientists
use regularly.
Big Data Analytics requires advanced tools and frameworks such as:
Conclusion:
Data Analytics is the foundation—it helps make sense of data and extract insights to support
decisions.
It is a core function within Data Science, which is a broader, more holistic field focused on not
just analysis, but also prediction, automation, and innovation.
Big Data Analytics applies data analytics techniques to extremely large and complex datasets
that cannot be handled with traditional methods.
All three are closely interconnected and often work hand-in-hand in modern data-driven
environments. Whether it’s a business analyzing customer behavior, a healthcare provider
predicting disease outbreaks, or a social media platform recommending content—data
analytics is at the heart of making data useful.
Ques 2. “R is used extensively for data analysis”. In the light of this statement,
discuss some of the advantages of using R for data analysis.
Ans 2. The statement “R is used extensively for data analysis” is very accurate
—R has long been one of the most popular tools in the field of data
analysis and statistics. Originally developed by statisticians for
statisticians, R has evolved into a powerful programming language and
environment specifically designed for data manipulation, statistical modeling,
and visualization.
Let’s explore some of the key advantages of using R for data analysis :
R was built from the ground up for statistics and data analysis. It provides a vast
number of functions for statistical techniques like:
This makes it ideal for anyone working with data in a research, academic,
or business setting.
You can even embed R in web applications (using shiny) or use it in reports
(via RMarkdown) that combine code, text, and visuals.
With tools like R Markdown, knitr, and bookdown, R makes it easy to create
documents that include code, analysis, and narrative text—all in one
place. This is essential for ensuring reproducibility, which is a core
principle in both academic and industry data science projects.
8. Versatility in Applications:
Machine learning,
Data mining,
Text analysis and natural language processing,
Bioinformatics,
Finance and economics,
Marketing analytics, and more.
Conclusion:
R remains a top choice for data analysis because of its specialized features, extensive libraries, and
visualization capabilities. Its design makes complex statistical tasks much easier, and its ecosystem
supports everything from small-scale data wrangling to large-scale predictive modeling.
If your focus is statistical modeling, visualization, or reproducible research, R is often the go-to
language for professionals and academics alike.
Ques 3. What is data visualization. Describe the various types of tools and
packages that can be used to create powerful visualization of data in R.
Ans 3. What is Data Visualization?
It helps in:
In short, data visualization transforms raw data into visuals that tell a
story.
Data Visualization in R:
Here are some of the most popular and powerful tools and packages in R
for data visualization:
1. Ggplot2:
Bar charts
Line graphs
Histograms
Scatter plots
Boxplots
Density plots
Faceted/multi-panel plots
In R:-
CopyEdit
library(ggplot2)
ggplot(data = mtcars, aes(x = mpg, y = hp)) +
geom_point() +
theme_minimal()
2. Lattice:
In R:-
CopyEdit
library(lattice)
xyplot(mpg ~ wt | cyl, data = mtcars)
3. Plotly:
In R:-
CopyEdit
library(plotly)
p <- ggplot(mtcars, aes(x = wt, y = mpg)) + geom_point()
ggplotly(p)
In R:-
CopyEdit
plot(mtcars$mpg, mtcars$hp, main="MPG vs HP")
5. Highcharter:
In R:-
CopyEdit
library(highcharter)
highchart() %>%
hc_add_series(data = c(3, 5, 7, 9), type = "line")
6. Leaflet:
In R:-
CopyEdit
library(leaflet)
leaflet() %>%
addTiles() %>%
addMarkers(lng = -0.09, lat = 51.505, popup = "London")
7. Esquisse:
In R:-
CopyEdit
# Install and load
# esquisse::esquisser() launches the GUI
8. RShiny + RMarkdown:
While not packages specifically for plotting, they allow you to present your
visualizations interactively:
1. Data-Driven Forecasting,
Example:
An e-commerce platform might use predictive analytics to forecast which
products a customer is likely to buy based on their previous browsing and
purchase history.
These models "learn" from the data and improve over time.
Example:
A bank might use a logistic regression model to predict the likelihood of a
customer defaulting on a loan.
Many predictive analytics systems can process incoming data in near real-
time, enabling businesses to respond instantly.
Example:
A fraud detection system in a bank analyzes transactions in real-time to
predict if a purchase might be fraudulent.
Predictive analytics is widely used for risk modeling, where it assigns scores
or probabilities to assess risk levels.
Example:
In insurance, predictive analytics can assess which applicants are high-risk
based on demographic and behavioral data.
5. Behavior Prediction,
Example:
Netflix uses predictive models to recommend shows and movies you’re
likely to watch based on your viewing history and ratings.
7. Automated Decision-Making,
Dynamic pricing,
Personalized marketing,
Inventory management.
Example:
Amazon adjusts product prices based on predicted demand and
competitor activity.
Industry Application
Retail: Predict customer churn, optimize pricing & inventory
Healthcare: Forecast disease outbreaks, hospital readmission
risks
Finance: Credit scoring, fraud detection
Marketing: Targeted ad campaigns, customer segmentation
Manufacturing: Predictive maintenance, supply chain optimization
Transportation: Route optimization, demand forecasting
Conclusion:
Predictive Data Analytics is not just about knowing what happened, but about knowing what’s coming.
With access to quality data and the right models, it empowers organizations to plan ahead, reduce
risks, improve customer satisfaction, and stay competitive.
Ques 5. What is Textual Data Analytics? Describe the major application of
textual data analysis.
Textual Data Analysis, also known as Text Analytics or Text Mining, is the process
of extracting useful insights, patterns, and meaningful information from
unstructured text data. This type of analysis helps computers understand human
language and turn messy text into structured, analyzable data.
Since a large portion of data generated today is textual (emails, social media posts,
customer reviews, chat logs, etc.), text analysis has become a critical area in data
science and artificial intelligence.
1. Text Preprocessing:
o Removing stop words (like "the", "is", "and")
o Tokenization (breaking text into words or phrases)
o Stemming/Lemmatization (reducing words to root form)
o Removing punctuation, numbers, and irrelevant symbols
2. Text Representation:
o Bag-of-Words (BoW): Represents text as a set of word frequencies.
o TF-IDF (Term Frequency-Inverse Document Frequency): Weights
words based on importance.
o Word Embeddings (Word2Vec, GloVe): Converts words into
vectors capturing semantic meaning.
3. Natural Language Processing (NLP):
o Part-of-Speech tagging
o Named Entity Recognition (NER)
o Syntax and grammar parsing
4. Text Classification & Clustering:
o Grouping similar documents or classifying texts into categories
(e.g., spam vs. not spam)
5. Sentiment Analysis:
o Detecting the emotional tone behind the text (positive,
negative, neutral)
1. Sentiment Analysis,
Example:
A company analyzes social media posts to gauge how people feel about a
new product launch.
2. Spam Detection,
Example:
Email services use text analysis to detect keywords and patterns in spam
emails.
Example:
An e-commerce platform uses text analysis to find common complaints in
product reviews.
Example:
AI chatbots in banking apps analyze user messages to offer help with
account-related questions.
5. Topic Modeling,
Example:
A news agency uses topic modeling to categorize thousands of news articles
automatically.
6. Healthcare and Medical Records Analysis,
Example:
Hospitals use text analysis to identify trends in symptoms and diagnoses
from patient records.
Example:
A law firm uses text mining to search thousands of documents for relevant
case precedents.
8. Fraud Detection,
Example:
Insurance companies use text analysis to flag suspicious language in claim
descriptions.
Conclusion:
Textual Data Analysis transforms unstructured text into actionable insights. In an age where text is
one of the most common forms of data, its applications span across industries—from marketing and
healthcare to finance and law. By leveraging text analysis, organizations can better understand human
communication, automate decision-making, and improve customer experience.