0% found this document useful (0 votes)
111 views16 pages

ADS SEM8 Com BH Sample Notes

at The document discusses time series decomposition. It states that time series decomposition involves breaking down a time series into three main components: the trend component, which represents long-term patterns or movements; the seasonal component, which captures periodic patterns that repeat over a fixed interval; and the residual component, which represents random variations not explained by trend or seasonality. Once decomposed, each component can be analyzed separately to better understand the characteristics and behavior of the overall time series. The decomposition can also be used for forecasting by extrapolating the trend and seasonal components.

Uploaded by

Nabeel
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
111 views16 pages

ADS SEM8 Com BH Sample Notes

at The document discusses time series decomposition. It states that time series decomposition involves breaking down a time series into three main components: the trend component, which represents long-term patterns or movements; the seasonal component, which captures periodic patterns that repeat over a fixed interval; and the residual component, which represents random variations not explained by trend or seasonality. Once decomposed, each component can be analyzed separately to better understand the characteristics and behavior of the overall time series. The decomposition can also be used for forecasting by extrapolating the trend and seasonal components.

Uploaded by

Nabeel
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 16

2.

Seasonal Component: The second step is to estimate the


seasonal component of the time series, which represents the
periodic patterns or cycles in the data that repeat over fixed
time intervals. This can be done using various methods, such
as seasonal indices, Fourier analysis, or seasonal regression
models. The seasonal component captures the systematic
variation in the time series due to seasonal effects, such as

pp
weather, holidays, or economic cycles.
3. Residual Component: The third step is to estimate the
residual component of the time series, which represents the

A
random or unpredictable variation in the data that is not
explained by the trend or seasonal components. This can be

s
done by subtracting the estimated trend and seasonal
components from the original time series. The residual
er
component captures the unexplained variation in the time
series and is usually the least important component for
at
forecasting.
● Once the time series has been decomposed into its components,
he

each component can be analyzed separately to better understand


its characteristics and behavior.
For example, the trend component can be used to identify
in


long-term patterns and changes in the data, while the seasonal
component can be used to identify seasonal effects and patterns.
ra

● The residual component can be used to identify outliers and


random fluctuations in the data.
B

● The decomposition of a time series can also be used for forecasting


by extrapolating the trend and seasonal components into the future
and adding them together to obtain a forecast.
● The residual component can be used to estimate the uncertainty or
variability in the forecast

Download Brainheaters App - https://bit.ly/Brainheaters_App


Whatsapp Community Link:-https://chat.whatsapp.com/Gp1v0PbkFoG5DkC8Cyqev4
Q5. Write a short note on the ARIMA Model. (P4 - Appeared 1 Time)
(5-10 Marks)
Ans: ARIMA (Autoregressive Integrated Moving Average) model is a popular
statistical method used for time series analysis and forecasting.
● It is a combination of three methods: autoregression, differencing,
and moving average.

pp
● The ARIMA model is a generalisation of the simpler ARMA
(Autoregressive Moving Average) model, which assumes that the
time series is stationary (i.e., the statistical properties of the series

A
do not change over time).
● However, many time series in real-world applications are
non-stationary, meaning that the statistical properties change over

s
time.

er
ARIMA models can handle non-stationary time series by
incorporating differencing, which removes the trend or seasonality
at
component from the series. The ARIMA model also includes
autoregression and moving average components, which capture
he

the autocorrelation and noise components of the series,


respectively.
● ARIMA models are specified by three parameters:
in

p, d, and q.
● The parameter p represents the autoregression order, which is the
ra

number of lagged values of the dependent variable used to predict


the current value.
B

● The parameter q represents the moving average order, which is the


number of lagged errors used to predict the current value.
● The parameter d represents the differencing order, which is the
number of times the series is differenced to make it stationary.

Download Brainheaters App - https://bit.ly/Brainheaters_App


Whatsapp Community Link:-https://chat.whatsapp.com/Gp1v0PbkFoG5DkC8Cyqev4
● ARIMA models can be used for both time series analysis and
forecasting.
● For time series analysis, ARIMA models can be used to identify the
underlying patterns and trends in the data, and to test for the
presence of seasonality or other cyclical components.
● For forecasting, ARIMA models can be used to predict future values
of the time series based on its past behaviour.

pp
● Here's a flowchart for building an ARIMA model:
1. Examine the time series data to determine if it is stationary. If
it is not stationary, apply differencing to make it stationary.

A
2. Determine the order of differencing (d) required to make the
series stationary. This can be done by examining the

s
autocorrelation function (ACF) and partial autocorrelation
function (PACF) plots of the differenced series.
3.
er
Determine the order of the autoregressive (p) and moving
average (q) terms required for the model. This can also be
at
done by examining the ACF and PACF plots of the
differenced series.
he

4. Fit the ARIMA model to the time series data using the chosen
values of p, d, and q. This can be done using a variety of
statistical software packages.
in

5. Check the residuals of the ARIMA model for autocorrelation,


non-normality, and heteroscedasticity. If any of these issues
ra

are present, re-estimate the model or consider using a


different model.
B

6. Use the ARIMA model to make predictions for future time


periods.
7. Check the accuracy of the ARIMA model predictions using
metrics such as mean absolute error (MAE) or root mean
squared error (RMSE).

Download Brainheaters App - https://bit.ly/Brainheaters_App


Whatsapp Community Link:-https://chat.whatsapp.com/Gp1v0PbkFoG5DkC8Cyqev4
8. Refine the ARIMA model as necessary based on the
prediction accuracy and additional insights gained from the
analysis of the time series data.

Q6. Describe in detail Customer Segmentation. (P4 - Appeared 1 Time)


(5-10 Marks)

pp
Ans: Customer segmentation is a technique used by businesses to divide
their customers into groups based on common characteristics such as
demographics, behaviour, and purchasing habits. The goal of customer

A
segmentation is to better understand the needs and preferences of
different customer groups and tailor marketing strategies to each group
accordingly.

s
Here are the steps involved in customer segmentation:
1.
er
Data collection: The first step is to gather data on customers,
including information such as age, gender, income, and purchase
at
history.
2. Data preprocessing: Once the data has been collected, it needs to
he

be cleaned and processed. This involves removing duplicates, filling


in missing values, and converting categorical variables into
numerical values.
in

3. Feature selection: The next step is to select the relevant features


that will be used to group the customers. This can be done using
ra

various techniques such as correlation analysis, principal


component analysis, or domain expertise.
B

4. Similarity or distance measure: The similarity or distance measure is


a key component of customer segmentation, which is used to
determine how similar or dissimilar two customers are. Common
measures include Euclidean distance, cosine similarity, and
Manhattan distance.

Download Brainheaters App - https://bit.ly/Brainheaters_App


Whatsapp Community Link:-https://chat.whatsapp.com/Gp1v0PbkFoG5DkC8Cyqev4
5. Clustering algorithm: There are several clustering algorithms that
can be used for customer segmentation, including K-means,
hierarchical clustering, and density-based clustering. The choice of
algorithm depends on the specific problem and the characteristics
of the data.
6. Evaluation: After clustering, it is important to evaluate the quality of
the clustering results. This can be done using various metrics such

pp
as silhouette score, Dunn index, or Calinski-Harabasz index.
7. Marketing strategy: Once the customers have been segmented into
groups, businesses can tailor their marketing strategies to each

A
group. For example, a company may create different advertising
campaigns for high-income customers versus low-income

s
customers.
Customer segmentation has many practical applications, including
er
improving customer retention, increasing customer lifetime value, and
optimising marketing spend. It is widely used in industries such as retail,
at
e-commerce, and healthcare to gain insights from large datasets and
make better decisions.
he

Q7. Explain Time series analysis using linear regression. (P4 - Appeared
in

1 Time) (5-10 Marks)


Ans: Time series analysis is a statistical method used to analyze data that
ra

is collected over a period of time.


● In time series analysis, the data is often plotted over time to identify
B

patterns, trends, and other useful information.


● Linear regression is a statistical method that can be used to analyze
time series data. Linear regression is a technique used to model the
relationship between a dependent variable and one or more
independent variables.

Download Brainheaters App - https://bit.ly/Brainheaters_App


Whatsapp Community Link:-https://chat.whatsapp.com/Gp1v0PbkFoG5DkC8Cyqev4
● In time series analysis, the dependent variable is usually the
variable of interest that changes over time, and the independent
variables are time-related variables such as time itself or other
factors that may affect the dependent variable.
● The basic idea behind linear regression in time series analysis is to
use a linear equation to model the relationship between the
dependent variable and the independent variables.

pp
● The linear equation takes the form of:
Y = a + bX + e
where Y is the dependent variable, X is the independent variable, a

A
is the intercept, b is the slope, and e is the error term.
● The slope and intercept can be estimated using least-squares

s
regression, which minimizes the sum of the squared differences
between the predicted values and the actual values.

er
Once the slope and intercept are estimated, they can be used to
make predictions about future values of the dependent variable
at
based on the independent variables.
● This is useful in time series analysis because it allows us to identify
he

trends and patterns in the data and make predictions about future
behavior.
Linear regression can also be used to test hypotheses about the
in


relationship between the dependent variable and the independent
variables.
ra

● For example, we may want to test whether a particular independent


variable has a significant effect on the dependent variable, or
B

whether there is a trend in the data over time.

Download Brainheaters App - https://bit.ly/Brainheaters_App


Whatsapp Community Link:-https://chat.whatsapp.com/Gp1v0PbkFoG5DkC8Cyqev4
Q8. Difference between data science and data analytics. (P4 -
Appeared 1 Time) (5-10 Marks)
Ans:

Data Science Data Analytics


Parameter

pp
A field that encompasses A subset of data science that
a wide range of focuses on using statistical
Definition techniques, tools, and and computational methods

A
methodologies for working to explore, analyze, and
with data extract insights from data

s
Extraction of insights and
Uncovering patterns, trends,
er
knowledge from complex
and relationships in data and
Focus and large data sets using
using that information to
at
advanced statistical and
make data-driven decisions
computational methods
he

Data preparation, data


Exploratory data analysis,
Stages of modeling, data
statistical analysis, and
in

Analysis visualization, and


predictive modeling
communication of insights
ra

Mainly used in business


Used in various fields,
settings to optimize
Application including business,
B

processes, improve customer


s healthcare, finance, and
experience, and increase
more
profitability

Download Brainheaters App - https://bit.ly/Brainheaters_App


Whatsapp Community Link:-https://chat.whatsapp.com/Gp1v0PbkFoG5DkC8Cyqev4
Machine learning, deep Statistical analysis,
Techniques learning, natural language regression analysis,
Used processing, computer clustering, data visualization,
vision, and more and more

In- depth knowledge of Basic Programming skills


Programm
programming is required are necessary for data

pp
ing Skills
for data science. analytics.

A
Data Science makes use
Use of
of machine learning Data Analytics doesn’t make
Machine
algorithms to get use of machine learning.

s
Learning
insights. er
at
Q9. more questions are available in Brainheaters app….
he

.
in

.
ra
B

Download Brainheaters App - https://bit.ly/Brainheaters_App


Whatsapp Community Link:-https://chat.whatsapp.com/Gp1v0PbkFoG5DkC8Cyqev4
pp
Full module-wise notes with

A
29+ Q/A is available in

s
Brainheaters App er
at
he

Download the App Now!


in
ra
B

Download Brainheaters App - https://bit.ly/Brainheaters_App


Whatsapp Community Link:-https://chat.whatsapp.com/Gp1v0PbkFoG5DkC8Cyqev4
pp
A
s
er
at
he
in
ra
B

Download Brainheaters App - https://bit.ly/Brainheaters_App


Whatsapp Community Link:-https://chat.whatsapp.com/Gp1v0PbkFoG5DkC8Cyqev4

You might also like