Bda Report
Bda Report
A Project Report
On
“REVIEW ANALYSIS BY USING PSPARK TOOL”
Submitted in partial fulfilment of the requirement for the award of
BACHELOR OF ENGINEERING
In
COMPUTER SCIENCE & ENGINEERING (DATA SCIENCE)
By
2024 – 2025
1. INTRODUCTION
3. OUTPUT SCREENSHOTS
4. CONCLUSION
INTRODUCTION
3
DESCRIPTION OF TOOLS USED
6
OUTPUT SCREENSHOTS
7
CONCLUSION
This project delivers a unified framework for both retrospective analysis and
short-term forecasting of U.S. stock prices. By combining domain
knowledge (moving averages, RSI, MACD) with a deep sequential
model (LSTM), we achieve the following:
1. Historical Insight:
o Clear visualizations reveal long-run trends (e.g., Apple’s multi-
decade appreciation), volume surges around major events, and
oscillatory behavior in RSI and MACD indicating overbought or
oversold regimes.
o A correlation heatmap highlights relationships among features,
guiding feature selection and pointing out potential
multicollinearity.
2. Predictive Performance:
o The LSTM, trained on 20-day sliding windows of normalized
features, learns temporal dependencies and yields a test‐set R2R^2
of around 0.85–0.90 for large-cap tickers (e.g., AAPL), with an
average prediction error of $1–$3.
o Although it cannot capture abrupt shocks perfectly, it reliably
tracks general uptrends and downtrends in daily closing prices.
3. User Flexibility:
o A single function, run_analysis(symbol), ties together data
ingestion, feature engineering, EDA plots, model training, and
forecast plotting. Users simply pass a ticker (e.g., “MSFT”), and
the notebook produces a comprehensive analysis within minutes.
o The optional interactive widget allows nonprogrammers to trigger
8
analysis by typing a symbol and clicking a button.
4. Modularity and Extensibility:
o The pipeline’s design (clearly separated blocks for loading data,
engineering features, preparing sequences, training the LSTM, and
visualizing results) makes it straightforward to replace or augment
any component.
o Future work could incorporate additional exogenous inputs (news
sentiment, macroeconomic indicators), experiment with alternative
architectures (GRU, Transformer), or extend to multi-day
forecasts.
In summary, this project demonstrates that a well-engineered combination
of technical indicators and an LSTM model can both illuminate historical
market behavior and produce actionable next-day price forecasts. By
packaging everything into a user-friendly function, it provides a practical
tool for analysts and students to explore time-series modeling in finance.