ADS SEM8 Com BH Sample Notes
ADS SEM8 Com BH Sample Notes
pp
weather, holidays, or economic cycles.
3. Residual Component: The third step is to estimate the
residual component of the time series, which represents the
A
random or unpredictable variation in the data that is not
explained by the trend or seasonal components. This can be
s
done by subtracting the estimated trend and seasonal
components from the original time series. The residual
er
component captures the unexplained variation in the time
series and is usually the least important component for
at
forecasting.
● Once the time series has been decomposed into its components,
he
●
long-term patterns and changes in the data, while the seasonal
component can be used to identify seasonal effects and patterns.
ra
pp
● The ARIMA model is a generalisation of the simpler ARMA
(Autoregressive Moving Average) model, which assumes that the
time series is stationary (i.e., the statistical properties of the series
A
do not change over time).
● However, many time series in real-world applications are
non-stationary, meaning that the statistical properties change over
s
time.
●
er
ARIMA models can handle non-stationary time series by
incorporating differencing, which removes the trend or seasonality
at
component from the series. The ARIMA model also includes
autoregression and moving average components, which capture
he
p, d, and q.
● The parameter p represents the autoregression order, which is the
ra
pp
● Here's a flowchart for building an ARIMA model:
1. Examine the time series data to determine if it is stationary. If
it is not stationary, apply differencing to make it stationary.
A
2. Determine the order of differencing (d) required to make the
series stationary. This can be done by examining the
s
autocorrelation function (ACF) and partial autocorrelation
function (PACF) plots of the differenced series.
3.
er
Determine the order of the autoregressive (p) and moving
average (q) terms required for the model. This can also be
at
done by examining the ACF and PACF plots of the
differenced series.
he
4. Fit the ARIMA model to the time series data using the chosen
values of p, d, and q. This can be done using a variety of
statistical software packages.
in
pp
Ans: Customer segmentation is a technique used by businesses to divide
their customers into groups based on common characteristics such as
demographics, behaviour, and purchasing habits. The goal of customer
A
segmentation is to better understand the needs and preferences of
different customer groups and tailor marketing strategies to each group
accordingly.
s
Here are the steps involved in customer segmentation:
1.
er
Data collection: The first step is to gather data on customers,
including information such as age, gender, income, and purchase
at
history.
2. Data preprocessing: Once the data has been collected, it needs to
he
pp
as silhouette score, Dunn index, or Calinski-Harabasz index.
7. Marketing strategy: Once the customers have been segmented into
groups, businesses can tailor their marketing strategies to each
A
group. For example, a company may create different advertising
campaigns for high-income customers versus low-income
s
customers.
Customer segmentation has many practical applications, including
er
improving customer retention, increasing customer lifetime value, and
optimising marketing spend. It is widely used in industries such as retail,
at
e-commerce, and healthcare to gain insights from large datasets and
make better decisions.
he
Q7. Explain Time series analysis using linear regression. (P4 - Appeared
in
pp
● The linear equation takes the form of:
Y = a + bX + e
where Y is the dependent variable, X is the independent variable, a
A
is the intercept, b is the slope, and e is the error term.
● The slope and intercept can be estimated using least-squares
s
regression, which minimizes the sum of the squared differences
between the predicted values and the actual values.
●
er
Once the slope and intercept are estimated, they can be used to
make predictions about future values of the dependent variable
at
based on the independent variables.
● This is useful in time series analysis because it allows us to identify
he
trends and patterns in the data and make predictions about future
behavior.
Linear regression can also be used to test hypotheses about the
in
●
relationship between the dependent variable and the independent
variables.
ra
pp
A field that encompasses A subset of data science that
a wide range of focuses on using statistical
Definition techniques, tools, and and computational methods
A
methodologies for working to explore, analyze, and
with data extract insights from data
s
Extraction of insights and
Uncovering patterns, trends,
er
knowledge from complex
and relationships in data and
Focus and large data sets using
using that information to
at
advanced statistical and
make data-driven decisions
computational methods
he
pp
ing Skills
for data science. analytics.
A
Data Science makes use
Use of
of machine learning Data Analytics doesn’t make
Machine
algorithms to get use of machine learning.
s
Learning
insights. er
at
Q9. more questions are available in Brainheaters app….
he
.
in
.
ra
B
A
29+ Q/A is available in
s
Brainheaters App er
at
he