
Assignment 2: Data Scraping, SEM Replication, and Analysis

Objective:
This assignment assesses your practical skills in data acquisition, data preprocessing,
and the application of SEM using a programming language (preferably Python). It tests
your ability to replicate published research and critically analyze the results.

Task:
Part 0: Paper Selection:
1. Choose one of the five papers you reviewed in Assignment 1. Clearly state which
paper you have chosen and explain your reasoning. Factors to consider might
include:
a. The feasibility of replicating the analysis with the data you can scrape.
b. The clarity and completeness of the methodology described in the paper.
c. Your personal interest in the research topic.
d. The availability of code or detailed model specifications from the original
authors (this is a bonus, but not required).

Part 1: Data Scraping


1. Subreddit Selection: Choose a single, active subreddit that is relevant to your
selected paper. The subreddit should have a reasonable volume of posts and
comments over the past year. Clearly state the subreddit you have chosen and
provide a brief justification for your selection (e.g., relevance to a specific research
area, high activity level). Avoid subreddits with overly sensitive or potentially
harmful content.
2. Data Scraping: Write a functional Python script to scrape data from the chosen
subreddit. You may use the Reddit API or a non-API approach (e.g., using requests
and BeautifulSoup). Your script should (a minimal sketch follows this list):
• Collect data for a period of one year. Specify the exact date range you are
collecting data for.
• Extract, at minimum, the following information for each post and comment:
o Post/Comment ID
o Post Title (for posts)
o Post/Comment Body (text)
o Author (username)
o Timestamp (date and time)
o Upvotes/Downvotes (or score)
o Number of comments (for posts)
o Parent ID (for comments, to reconstruct conversation threads)
• Handle potential issues such as:
o Rate limiting (from the Reddit API or website).
o Bot detection (if using a non-API approach).
o Missing or incomplete data.
o Changes to the website structure (if using a non-API approach).
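
A minimal sketch of an API-based scraper using PRAW is shown below. The subreddit
name, credentials, date range, and output filename are placeholders, and the field
names simply mirror the list above. Note that Reddit's listing endpoints return only
roughly the 1,000 most recent posts, so covering a full year may require repeated
runs or search-based queries; PRAW's built-in rate-limit handling is assumed.

    import praw
    import pandas as pd
    from datetime import datetime, timezone

    # Credentials come from https://www.reddit.com/prefs/apps (placeholders here).
    reddit = praw.Reddit(client_id="YOUR_CLIENT_ID",
                         client_secret="YOUR_CLIENT_SECRET",
                         user_agent="assignment2-scraper by u/YOUR_USERNAME")

    # Example date range: calendar year 2024 (adjust to your chosen window).
    START = datetime(2024, 1, 1, tzinfo=timezone.utc).timestamp()
    END = datetime(2025, 1, 1, tzinfo=timezone.utc).timestamp()

    rows = []
    for post in reddit.subreddit("YOUR_SUBREDDIT").new(limit=None):
        if not (START <= post.created_utc < END):
            continue
        rows.append({"id": post.id, "type": "post", "title": post.title,
                     "body": post.selftext, "author": str(post.author),
                     "timestamp": post.created_utc, "score": post.score,
                     "num_comments": post.num_comments, "parent_id": None})
        post.comments.replace_more(limit=0)  # drop "load more comments" stubs
        for c in post.comments.list():
            rows.append({"id": c.id, "type": "comment", "title": None,
                         "body": c.body, "author": str(c.author),
                         "timestamp": c.created_utc, "score": c.score,
                         "num_comments": None, "parent_id": c.parent_id})

    pd.DataFrame(rows).to_csv("reddit_data.csv", index=False)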
3. Scraping Workflow Documentation (PPT): Create a single-slide PowerPoint
presentation that clearly outlines your scraping workflow. This should include:
• A diagram or flowchart illustrating the steps of your scraping process.
• A brief description of the libraries/tools you used.
• An explanation of how you handled rate limiting and/or bot detection.
• A description of any data cleaning or preprocessing steps performed during the
scraping process (e.g., handling HTML entities, removing deleted comments).
• Any limitations or challenges encountered during scraping.

Part 2: SEM Replication


1. Data Preprocessing: Prepare the scraped data for SEM analysis (a sketch of one
such step follows this list). This will likely involve:
• Text preprocessing (e.g., tokenization, stemming/lemmatization, stop word
removal, handling of special characters and URLs).
• Feature engineering (e.g., creating variables based on text analysis, such
as sentiment scores, topic proportions, or measures of linguistic
complexity).
• Creating any necessary dummy variables or interaction terms.
• Handling missing data (e.g., through imputation or deletion).
• Scaling or transforming variables as needed.
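
As one illustration, the sketch below removes deleted comments, cleans the text, and
derives a sentiment-score feature using NLTK's VADER analyzer. The column names
(carried over from the scraping sketch) and the choice of VADER are assumptions;
your chosen paper may call for different features, such as topic proportions or
linguistic-complexity measures.

    import re
    import pandas as pd
    import nltk
    from nltk.sentiment import SentimentIntensityAnalyzer

    nltk.download("vader_lexicon")  # one-time lexicon download

    df = pd.read_csv("reddit_data.csv")
    df = df[~df["body"].isin(["[deleted]", "[removed]"])].dropna(subset=["body"])

    def clean(text: str) -> str:
        text = re.sub(r"http\S+", "", text)    # strip URLs
        text = re.sub(r"&\w+;", " ", text)     # strip leftover HTML entities
        return re.sub(r"\s+", " ", text).strip().lower()

    df["clean_body"] = df["body"].astype(str).map(clean)
    sia = SentimentIntensityAnalyzer()
    df["sentiment"] = df["clean_body"].map(lambda t: sia.polarity_scores(t)["compound"])
    df.to_csv("reddit_preprocessed.csv", index=False)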
2. SEM Implementation: Implement the SEM model from the chosen paper using
Python. The semopy package is recommended (a minimal sketch follows this list),
but you may use other suitable libraries (e.g., lavaan in R via rpy2, or even manual
matrix calculations if you are comfortable with that). Your code should:
• Clearly define the latent variables, observed variables, and their
relationships.
• Specify the estimation method (matching the original paper if possible).
• Calculate and report appropriate model fit indices.
• Estimate the path coefficients and their significance.
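
A minimal semopy sketch is shown below. The measurement and structural equations
are placeholders; replace them with the latent variables and paths specified in your
chosen paper. The obj argument selects the estimation objective (here "MLW",
Wishart maximum likelihood), assuming semopy's standard interface.

    import pandas as pd
    import semopy

    df = pd.read_csv("reddit_preprocessed.csv")

    # Placeholder specification in lavaan-style syntax:
    # "=~" defines a latent variable's indicators, "~" defines a regression path.
    desc = """
    Engagement =~ score + num_comments
    Engagement ~ sentiment
    """

    model = semopy.Model(desc)
    model.fit(df, obj="MLW")
    print(model.inspect())              # path coefficients, std. errors, p-values
    print(semopy.calc_stats(model).T)   # fit indices: chi-square, CFI, TLI, RMSEA, ...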
3. Result Replication: Attempt to replicate the key results of the original study as
closely as possible. This may not be perfectly achievable due to differences in
data, sample size, or specific implementation details, but you should strive for the
closest possible replication.

Part 3: Machine Learning Model Development and Comparison


1. Problem Framing for ML: Based on the research question and variables from
the paper you chose to replicate, define a specific prediction task that is suitable
for an ML model. This is crucial: SEM and ML are often used for different purposes,
although both can be used to predict the same outcome:
• SEM: Primarily focuses on testing hypothesized relationships between
latent and observed variables (explanatory modeling).
• ML: Primarily focuses on prediction (predictive modeling).
• Therefore, you need to translate the research question into a concrete
prediction problem. Examples:
o If the SEM explores factors influencing user engagement on
Reddit: Your ML task could be to predict the number of upvotes a post will
receive based on its text content, author features, and time of posting.
o If the SEM examines the relationship between sentiment and stock
market movements (using Twitter data): Your ML task could be to
predict the direction of stock price movement (up/down) based on
aggregated sentiment scores from tweets.
o If the SEM investigates the impact of online communities on
political polarization: Your ML task could be to classify users into
different ideological groups based on their posting behavior.
• Clearly state the prediction task you've chosen and justify why it's a
relevant and meaningful comparison point to the SEM analysis. Explain
what your target variable (the thing you're trying to predict) and your
predictor variables (the features you'll use to make the prediction) will be.
A sketch of one possible framing follows this list.
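
Continuing the first example above, the sketch below frames upvote prediction as a
supervised regression problem. The feature set is an assumption chosen for
illustration; yours should mirror the constructs in your SEM.

    import pandas as pd

    df = pd.read_csv("reddit_preprocessed.csv")
    posts = df[df["type"] == "post"].copy()

    # Simple illustrative features: sentiment, text length, and hour of posting.
    posts["hour"] = pd.to_datetime(posts["timestamp"], unit="s").dt.hour
    posts["text_length"] = posts["clean_body"].str.len()

    y = posts["score"]                               # target variable
    X = posts[["sentiment", "text_length", "hour"]]  # predictor variables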
2. Model Selection and Justification: Choose at least two different ML models
that are appropriate for your prediction task. Consider models like:
• Regression models: Linear Regression, Ridge/Lasso Regression, Support
Vector Regression (for predicting continuous target variables).
• Classification models: Logistic Regression, Support Vector Machines,
Random Forests, Gradient Boosting Machines (e.g., XGBoost, LightGBM),
Naive Bayes (for predicting categorical target variables).
• Deep learning models: If appropriate for your data and task, you could
consider Transformers, Recurrent Neural Networks (RNNs, especially
LSTMs or GRUs) for text data, or feedforward neural networks for other
types of data.
• Justify your model choices. Explain why each model is suitable for the task
and the type of data you have.
3. Model Training and Evaluation (a minimal sketch follows this list):
• Data splitting and pre-processing: Explain your strategy for dividing the
data into training, validation, and test sets.
• Train each of your chosen ML models on the training data.
• Use the validation set to tune hyperparameters (e.g., regularization
strength, number of trees in a random forest, learning rate) using
appropriate techniques like cross-validation.
• Evaluate the performance of each model on the test set using appropriate
metrics for your prediction task. Examples:
o Regression: Mean Squared Error (MSE), Root Mean Squared Error
(RMSE), R-squared, Mean Absolute Error (MAE).
o Classification: Accuracy, Precision, Recall, F1-score, AUC-ROC
(Area Under the Receiver Operating Characteristic Curve).
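
A minimal training-and-evaluation sketch with scikit-learn is shown below, reusing
the X and y from the framing sketch. The two models, the hyperparameter grids, and
the metrics are illustrative, not prescribed.

    from sklearn.model_selection import train_test_split, GridSearchCV
    from sklearn.linear_model import Ridge
    from sklearn.ensemble import RandomForestRegressor
    from sklearn.metrics import mean_squared_error, r2_score

    # Hold out a test set; GridSearchCV supplies validation via 5-fold CV.
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.2, random_state=42)

    models = {
        "ridge": GridSearchCV(Ridge(), {"alpha": [0.1, 1.0, 10.0]}, cv=5),
        "random_forest": GridSearchCV(
            RandomForestRegressor(random_state=42),
            {"n_estimators": [100, 300], "max_depth": [None, 10]}, cv=5),
    }

    for name, gs in models.items():
        gs.fit(X_train, y_train)
        preds = gs.best_estimator_.predict(X_test)
        rmse = mean_squared_error(y_test, preds) ** 0.5
        print(name, gs.best_params_, "RMSE:", rmse, "R2:", r2_score(y_test, preds))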
4. Comparison with SEM: This is the critical part. Compare the results of your ML
models with the SEM results. Do not expect the ML models to "replicate" the SEM
results. Instead, focus on the following (a comparison sketch follows this list):
• Predictive Power: How well do the ML models predict the target variable
compared to the implied predictions from the SEM? You might need to
derive a way to make predictions from the SEM.
• Feature Importance: For ML models that provide feature importance
scores, examine which features are most important for prediction. How do
these features relate to the variables and relationships in the SEM? Do
they provide any insights that the SEM might have missed?
• Complementary Insights: Discuss how the ML and SEM approaches
provide complementary perspectives on the research problem. SEM helps
understand the underlying mechanisms, while ML focuses on predictive
accuracy. Highlight the strengths and weaknesses of each approach.
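
For the feature-importance part of the comparison, the sketch below pulls
importances from the fitted random forest and lists them next to the SEM path
estimates. It assumes the variable names match across the two analyses and that
model is the fitted semopy object from Part 2.

    import pandas as pd

    rf = models["random_forest"].best_estimator_
    importances = pd.Series(rf.feature_importances_, index=X.columns,
                            name="rf_importance").sort_values(ascending=False)

    paths = model.inspect()                     # semopy estimates from Part 2
    paths = paths[paths["op"] == "~"][["lval", "rval", "Estimate", "p-value"]]

    print(importances)
    print(paths)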

Part 4: Reporting and Analysis (Modified to include ML)


1. Modeling Pipeline Documentation: (Additions in bold)
a. Data Preprocessing: (No changes)
b. Model Assumptions: A clear statement of the assumptions underlying the
SEM model and the ML models you implemented, and a discussion of
whether those assumptions are likely to be met by your data.
c. Model Parameter Fine-Tuning: Describe any parameter tuning or model
adjustments you made for both the SEM and the ML models. Justify
any changes you made.
d. Post-Model Analysis: Describe any analyses you performed after fitting the
SEM and training the ML models.
2. Results and Discussion Report (LaTeX/Overleaf): (Additions in bold)
a. Introduction: (No changes)
b. Methods: Summarize your data scraping, preprocessing, SEM
implementation, and ML model development and evaluation.
c. Results: Present your key findings, including model fit indices, path
coefficients, and ML model performance metrics. Use tables and
figures.
d. Discussion: Compare your results to those of the original study. Compare
the SEM and ML results, focusing on predictive power, feature
importance, and complementary insights. Discuss the limitations of
your replication and comparison, and any potential areas for future
research.
e. Conclusion: Summarize your main conclusions and their implications.

Deliverables:
• Datafile: The scraped and preprocessed data in CSV format.
• Source Code: Your Python script(s) for data scraping, SEM analysis, and ML model
training and evaluation, well-commented and organized.
• Scraping Workflow PPT: The single-slide PowerPoint presentation describing your
scraping workflow.
• Modeling Pipeline Documentation: A detailed, written description of your modeling
pipeline (as a separate document, e.g., a Markdown or text file).
• Results and Discussion Report: A 2-3 page report in LaTeX format (PDF).

Evaluation Criteria:
• Data Scraping: Completeness, accuracy, and efficiency of the data scraping process.
Effective handling of potential issues.
• Data Preprocessing: Appropriateness and thoroughness of preprocessing steps. Clear
justification for choices made.
• SEM Implementation: Correct implementation of the SEM model, including model
specification, estimation, and fit assessment.
• ML Model Development: Appropriate choice of ML models, proper training and
evaluation procedures, and clear justification of choices.
• SEM and ML Comparison: Thoughtful and insightful comparison of the two
approaches, focusing on relevant aspects like predictive power and feature
importance.
• Result Replication: Degree of success in replicating the key findings of the original
study.
• Analysis and Interpretation: Thoughtful and insightful comparison of results,
discussion of limitations, and identification of potential areas for future research.
• Documentation and Reporting: Clear, concise, and well-organized documentation of
all steps, including code comments, workflow descriptions, and the final report.
• Code Quality: Readability, efficiency, and adherence to good coding practices.
• LaTeX Report Quality: Proper use of LaTeX syntax; a well-formatted, structured
report with a professional look.

Academic Integrity and Use of AI Tools


• Original Writing Requirement: All written content in your PowerPoint presentation,
modeling pipeline documentation, and LaTeX report must be your own original work.
The use of Large Language Models (LLMs) or other Generative AI tools (e.g., ChatGPT,
Bard) to generate text for your explanations, justifications, discussions, or
conclusions is strictly prohibited. Any use of such tools for generating written content
will be considered a violation of academic integrity and will result in the assignment
being rejected.
• Code Assistance (Permitted with Disclosure): The use of AI coding assistants
(e.g., GitHub Copilot) for code-related tasks (such as syntax suggestions, debugging,
or generating boilerplate code) is permitted, provided that you clearly acknowledge
their use. If you use an AI coding assistant, include a brief statement in your code
comments indicating which parts of the code were assisted by the tool. The core logic
and structure of your code (including both the scraping and modeling components)
must still be your own. You must be able to fully explain any code you submit,
including the rationale behind your design choices and implementation details.
• Plagiarism: All sources (including code snippets from online resources) must be
properly cited. Any instance of plagiarism (presenting someone else's work as your
own) will result in the assignment being rejected.
