Chapter 0. Course Presentation
Chapter 0. Course Presentation
ANALYSIS
CHAPTER 0
ABOUT
IEDA
PART I
A. COURSE PRESENTATION
• Given their significant contribution to business success, the demand for data
coming years.
expanding field.
B. COURSE PLAN
(1/4)
• The course is designed to address real managerial problems using tools from
“Supervised Machine Learning”, which is a subset of “Artificial Intelligence”.
◦ Phase II (Chapters 2 and 3): Preparation for data analysis, which includes a
review of key statistical tools needed for the course and an overview of the
prerequisites for Supervised Machine Learning (such as training/testing sets,
cross-validation, and understanding variation and bias errors).
B. COURSE PLAN
(3/4)
MYCOURSES ORGANIZATION
PARTICIPATION
• WEIGHT 15%
IEDA AI
1. Focus on a specific subset of AI: 1. All subsets of AI: “Machine Learning” (Supervised
and Unsupervised), and “Deep Learning”;
“Supervised Machine Learning”;
• The goal is to use classification models from the IEDA and AI courses to
predict the direction of price movement (up vs. down) of the explained
variable, rather than the actual price.
• Each group (5 students on average) must select only one topic from those
covered in the Financial Markets course.
• For the chosen topic, each group needs to identify one explained variable and
several explanatory variables (refer to the project guidelines for details).
• Your data source will be “Wharton Research Data Services (WRDS)”. Please
register at WRDS Registration to access the data.
A. PROJECT PRESENTATION
(3/3)
• Note that you have flexibility regarding the time frame and frequency of the
variables. However, be mindful of the frequencies. For example, do not use
explanatory variables with annual variation (e.g., GDP) to predict an explained
variable that varies daily (e.g., daily stock price) or monthly, and vice versa.
B. EXPECTATIONS
• Compare all the models using performance metrics and evaluate their theoretical
strengths and weaknesses;
• Determine which model is the most effective for your dataset and study context.
• Bring your results, including the four Python models, to the coaching session.
C. IMPORTANT INFORMATION
• You will be provided with the complete Python script for all models.
• For any questions regarding group composition (e.g., missing members, individuals
not assigned to any group, etc.), please contact the project coordinators:
◦ Aljona ZORINA: [email protected]
• For any questions related to data, please contact your finance professor or the
project coordinators, and for questions about Supervised Machine Learning, please
https://mycourses.ieseg.fr/course/view.php?id=6216
E. SUBMISSION
(1/4)
1. CODE
• Three “.ipynb” files (i.e., google colab notebooks) are expected:
2. TEMPLATE
◦ Select the best model for your study and justify your choice.
• In summary, your IEDA professors will evaluate both your code and the
◦ If the instructions are not followed (e.g., if you do not submit three .ipynb
files), you will get a zero for the coding part.
◦ Code that has not been adjusted to your dataset (i.e., using the provided
code without modifications) will also result in a zero for the coding part.
◦ Failure to submit the code in the correct format (e.g., submitting a
document instead of an .ipynb file or only providing a link to your code in
the template) will result in a zero for the coding part.
E. SUBMISSION
(4/4)
• Once these instructions are organized, they form an algorithm. This algorithm
represents the “behind-the-scenes” process of the software we use.
• Other programming languages include Java, JavaScript, C++, Ruby, and more.
A. INTRODUCTION TO PYTHON
(2/2)
• This year, however, we will be building our models from scratch, allowing you to
learn the underlying algorithms.
• Don’t worry, the course is not intended to teach coding. Instead, it focuses on
understanding the logic behind the algorithms and the instructions they require.
• To support this, the code will be provided. Your task will be to adapt the code to
your project dataset by making the necessary adjustments and modifications.
B. PYTHON CODE EDITORS
(1/3)
• Like any other written language, Python requires a medium for writing. While
human languages are written using text editors (e.g., MS Word) or physical
media (e.g., paper), programming languages are written using “code editors”.
• Therefore, you will need a code editor to write, edit, and run Python code.
B. PYTHON CODE EDITORS
(2/3)
• On the one hand, an IDE (like PyCharm, Jupyter, Spyder, etc.) is a “program
dedicated to software development”*. It includes a wide range of tools and
features that facilitate coding, but it may require more memory and time to
download and install.
B. PYTHON CODE EDITORS
(3/3)
• On the other hand, a code editor is a text “editor designed to handle codes
(with, for example, syntax highlighting and auto-completion).”*
• For the IEDA course, we will use Google Colab, an online code editor that
only requires a Google account, with no need for download or installation.
C. PYTHON LIBRARIES
(1/3)
• Using libraries in programming avoids the need to write code from scratch.
Instead, you can search for the appropriate library, import it, and use it.
• Just as in research where you visit a library to find the most relevant book for
your topic, in programming you use libraries to find the tools and functions
needed for your specific task.
C. PYTHON LIBRARIES
(3/3)
Among the thousands of libraries available, these are the ones we will
primarily use:
Data cleaning & analysis: Scientific computing: Machine Learning:
missing values, outliers… mathematical operations PCA, HCA, K-NN, DT, RF…