241010_C1.5. Algorithm, Machine Learning and Deep Learning

Chapter 1.5 discusses algorithms and machine learning, outlining various types of algorithms including sorting, searching, greedy, divide and conquer, and dynamic programming, along with their characteristics and real-world applications. It also covers machine learning types such as supervised, unsupervised, semi-supervised, and reinforcement learning, emphasizing their impact on finance through applications like credit scoring and fraud detection. The chapter highlights the challenges associated with data labeling and model performance in machine learning.

Uploaded by

Thùy Dương

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views

241010_C1.5. Algorithm, Machine Learning and Deep Learning

Uploaded by

Thùy Dương

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 47

Chapter 1.

5: Algorithm and Machine learning

Prepared for Technology Applications in Fiance – FTU 2024

1. Algorithms
2. Machine learning
3. Deep learning
Definition
The types of algorithms
Characteristics
Real-world applications
Challenges
Algorithms are fundamental building blocks of computer science, helping solve problems efficiently by following
a defined set of instructions. They process inputs and generate outputs, with their efficiency measured in terms of
time (how fast they run) and space (how much memory they use).
Types of Algorithms
Sorting Algorithms: Sorting arranges data in order.
Search Algorithms: Searching finds a specific item in a dataset.
Greedy Algorithms: Make the best choice at each step, hoping to find a global optimum.
Divide and Conquer: Breaks a problem into smaller sub-problems, solves them recursively, and combines the results.
Dynamic Programming: Solves complex problems by breaking them into smaller sub-problems and storing results to avoid
repetition.
Sorting algorithms are methods to arrange elements in a specific order, typically ascending or descending. Here
are the key sorting algorithms:
Bubble Sort: Compares adjacent elements and swaps them if in the wrong order. Example: List [5, 3, 8, 6, 2]—
compare 5 and 3, swap: [3, 5, 8, 6, 2], then compare 8 and 6, swap: [3, 5, 6, 8, 2]. Continue until sorted: [2, 3, 5, 6,
8].
Selection Sort: Finds the smallest element in the unsorted part and swaps it with the first unsorted element.
Example: List [29, 10, 14, 37, 14]—find smallest (10), swap with 29: [10, 29, 14, 37, 14]. Repeat to get [10, 14,
14, 29, 37].
Insertion Sort: Builds a sorted array one element at a time by inserting elements into their correct position.
Example: List [9, 7, 5, 3, 1]—insert 7 into position: [7, 9, 5, 3, 1]. Repeat to get [1, 3, 5, 7, 9].
Merge Sort: Divides the list into halves, sorts each half, and merges them back together. Example: List [38, 27,
43, 3, 9, 82, 10]—split into [38, 27, 43] and [3, 9, 82, 10], further split and merge: [27, 38, 43], [3, 9, 10, 82].
Final merge: [3, 9, 10, 27, 38, 43, 82].
Quick Sort: Picks a "pivot" element, partitions the list into elements smaller and larger than the pivot, and
recursively sorts the partitions. Example: List [3, 6, 8, 10, 1, 2, 1]—pivot is 6, partition: [3, 1, 2, 1] | [6] | [8, 10].
Recursively sort partitions to get [1, 1, 2, 3, 6, 8, 10].
Heap Sort: Builds a binary heap from the list and repeatedly extracts the largest element. Example: List [12, 11,
13, 5, 6, 7]—build max heap [13, 11, 12, 5, 6, 7], extract 13 and rebuild to get sorted list [5, 6, 7, 11, 12, 13].
Choosing the Right Algorithm:
Simple algorithms like Bubble, Selection, and Insertion Sort are easier to implement but slow for large datasets
(O(n²)). More efficient algorithms like Merge, Quick, and Heap Sort (O(n log n)) handle larger datasets better.
Search algorithms are methods for finding a specific item in a collection of items. Here’s a breakdown of several
key search algorithms with practical examples.
Linear Search A linear search checks each element in a list sequentially until it finds the target value or reaches
the end of the list.
Binary Search A binary search is an efficient algorithm for finding an item from a sorted list by repeatedly
dividing the search interval in half.
Depth-First Search (DFS) DFS is an algorithm for traversing or searching tree or graph data structures. It
explores as far as possible along each branch before backtracking.
Breadth-First Search (BFS) BFS is another tree/graph traversal algorithm that explores all neighbor nodes at the
present depth before moving on to nodes at the next depth level.
A linear search checks each element in a list sequentially until it finds the target value or reaches the end of the
list.
You have a list of students' names: ["John", "Emma", "Alice", "Michael", "Sara"]. If you want to find out if "Alice" is in the
list, you would start from the first name and check each one until you find "Alice" or reach the end of the list.
In the worst case, you may need to check every name.

A binary search is an efficient algorithm for finding an item from a sorted list by repeatedly dividing the search
interval in half.
Consider a sorted list of numbers: [1, 3, 5, 7, 9, 12, 15, 20, 23, 30, 35].
If you want to find the number 23, you would first check the middle element of the list. If it’s less than 23, you adjust your
search to the upper half; if it’s more, you adjust to the lower half.
You repeat this process until you find 23.
Depth-First Search (DFS) is an algorithm for traversing or searching tree or graph data structures. It explores as
far as possible along each branch before backtracking.
Imagine a simple tree structure:
A
/ \
B C
/\ / \
D E F G

Starting from A, you would visit B, then go as deep as possible to D before backtracking to explore E, and then backtrack to
A to explore C, continuing this process until all nodes are visited. ABDBEBACFCG
Breadth-First Search (BFS) is another tree/graph traversal algorithm that explores all neighbor nodes at the
present depth before moving on to nodes at the next depth level.
Using the same tree structure as above:
You start at A and visit all immediate neighbors (B and C). After that, you would visit the next level (D, E, F, G), ensuring all
nodes at the current level are explored before moving deeper. ABACGCFCABDBE
•Linear Search is straightforward but can be slow for large datasets.
•Binary Search is efficient for sorted lists and significantly reduces the number of comparisons needed.
•DFS and BFS are useful for tree and graph traversal, each with different strategies for exploring nodes.
Overview
A greedy algorithm is an optimization technique that makes a series of choices, each of which looks best at the
moment. It aims to find a local optimum in the hope that these local solutions will lead to a global optimum.
Greedy algorithms are often faster and simpler than dynamic programming but don't guarantee the optimal
solution for all problems.
Key Characteristics
Locally Optimal Choice: At each step, the algorithm chooses the option that seems best at the moment without considering
the overall context.
Feasibility: The chosen option must be feasible; it must satisfy the problem's constraints.
Irrevocability: Once a choice is made, it cannot be changed.
1. Coin Change Problem (Minimum Coins)
Problem: Given a set of coin denominations, determine the minimum number of coins needed to make a certain amount.
Example: You have coins of denominations [1, 3, 4] and need to make 6.
Greedy Approach:
•Start with the largest denomination that is less than or equal to the remaining amount.
•Choose the coin of denomination 4, leaving an amount of 2.
•Next, choose the coin of denomination 1 twice.
Result: The coins used are 4 + 1 + 1, resulting in 3 coins.
Note: This approach works well in this case, but it might not always yield the optimal solution for other coin sets.
2. Activity Selection Problem
Problem: Given a set of activities with start and end times, select the maximum number of activities that don't overlap.
Example: Activities: (1, 3), (2, 5), (4, 6), (6, 8).
Greedy Approach:
•Sort activities by their end times.
•Select the first activity (1, 3), then choose the next one that starts after the last selected activity ends.
•Select (4, 6) and (6, 8).
Result: The maximum number of activities selected is 3.
Overview
Divide and conquer is an algorithmic paradigm that involves breaking a problem into smaller subproblems,
solving each subproblem independently, and then combining their solutions to solve the original problem. This
approach is particularly effective for solving problems that can be recursively divided into similar problems.
Merge sort, quick sort and binary search are examples of Divide and conquer.
Key Steps
Divide: Split the problem into smaller, more manageable subproblems. The subproblems should be of the same type as the
original problem.
Conquer: Solve each subproblem independently. If the subproblems are still too large, apply the divide and conquer strategy
recursively.
Combine: Merge the solutions of the subproblems to form the solution to the original problem.
The divide and conquer technique is a powerful tool for solving complex problems by breaking them down into
simpler, smaller problems. It is widely used in algorithms like merge sort, quick sort, binary search, and matrix
multiplication. The effectiveness of this approach lies in its ability to reduce the problem size and manage
complexity efficiently.
Overview
Dynamic programming (DP) is an optimization technique used to solve complex problems by breaking them
down into simpler subproblems. It is especially useful for problems that exhibit overlapping subproblems and
optimal substructure properties.
Key Concepts
Overlapping Subproblems: The problem can be broken down into smaller, overlapping problems that can be solved
independently.
Optimal Substructure: An optimal solution to the problem can be constructed from optimal solutions to its subproblems.

The Fibonacci sequence, coin change problem, and longest common subsequence are classic examples illustrating
its effectiveness.
Fibonacci Sequence
Problem: Calculate the nth Fibonacci number, where the sequence is defined as:
F(0) = 0
F(1) = 1
F(n) = F(n-1) + F(n-2) for n > 1
Naive Approach:
Using simple recursion leads to a lot of repeated calculations. For instance, F(5) requires calculating F(4) and F(3), and F(4) itself
requires F(3) and F(2), resulting in exponential time complexity.
Dynamic Programming Approach:
Memorization: Store results of F(n) in an array to avoid redundant calculations.
Bottom-Up Approach: Calculate Fibonacci numbers iteratively and store them in an array.

Summary
Dynamic programming is a powerful technique for solving optimization problems. By breaking problems into
overlapping subproblems and storing intermediate results, DP can significantly reduce time complexity compared to
naive recursive approaches. The Fibonacci sequence, coin change problem, and longest common subsequence are
classic examples illustrating its effectiveness.
Correctness: Provides the correct output for all inputs. Failing to handle edge cases makes an algorithm incorrect.
Efficiency: Measured in terms of time complexity (how long it takes) and space complexity (how much memory
it uses). Big-O notation (e.g., O(n), O(log n)) is used to describe complexity.
Scalability: Performs well with increasing data size. An algorithm that works on small data may fail with larger
datasets.
Determinism: A deterministic algorithm always produces the same result for the same input, unlike non-
deterministic algorithms that may involve randomness.
Termination: Must complete after a finite number of steps.
Search Engines: Use algorithms like Google’s PageRank to rank web pages and search for relevant content
efficiently.
Financial Markets: Algorithmic trading uses algorithms to automate high-frequency trades based on market data.
Social Media: Recommendation algorithms suggest content based on user behavior (e.g., YouTube, Netflix).
Route Optimization: Navigation systems use algorithms like Dijkstra’s to calculate the shortest routes.
Cryptography: Encryption algorithms (RSA, AES) secure online transactions and communications.
Robotics: Pathfinding algorithms allow robots to navigate environments by finding optimal paths (e.g., A*
search).
Complexity Trade-offs: Balancing speed (time complexity) and memory use (space complexity) can be difficult.
Large Datasets: Algorithms must scale to handle big data efficiently.
Real-time Requirements: Some algorithms must provide results instantly, like in high-frequency trading or self-
driving cars.
Types of ML
Applications
Challenges
Definition
Machine Learning (ML) has significantly impacted finance by enabling more accurate predictions, automated
processes, and risk management. Below is a detailed look into how ML is applied in finance:
Types of Machine Learning in Finance
Supervised Learning
In supervised learning, the model is trained on historical labeled data to predict future outcomes.
Unsupervised Learning
In unsupervised learning, models work with unlabeled data to identify patterns or groupings.
Semi-Supervised Learning
Semi-supervised learning combines a small amount of labeled data with a large amount of unlabeled data to improve the
learning process.
Reinforcement Learning (RL)
Reinforcement learning models learn by interacting with an environment and receiving feedback on their actions (rewards or
penalties).
Supervised Learning Overview
Supervised learning is a type of machine learning where the model is trained using labeled data. In this context,
"labeled" means that each training example is paired with the correct output. The goal of supervised learning is to
learn a mapping from inputs to outputs, so that the model can predict the correct output for new, unseen data.
Key Concepts
Training Data: The dataset used to train the model, consisting of input-output pairs. The model learns from this data by
identifying patterns or relationships between the inputs (features) and outputs (labels).
Labels: The correct output or target value for each example in the training data. This could be a class in classification
problems or a continuous value in regression problems.
Model: The algorithm that learns the relationship between inputs and outputs from the training data. Once trained, the model
can make predictions on new, unseen data.
Types of Supervised Learning
Classification
Problem: Assign a label from a set of predefined categories.
Example: Email spam detection, where the input is an email and the output is either "spam" or "not spam".
Popular Algorithms: Logistic regression, decision trees, support vector machines (SVM), and k-nearest neighbors (KNN).
Regression
Problem: Predict a continuous value.
Example: Predicting house prices based on features like location, size, and number of rooms.
Popular Algorithms: Linear regression, polynomial regression, ridge regression, and lasso regression.
How Supervised Learning Works
Data Collection: Gather labeled data that contains input-output pairs.
Model Training: Use an algorithm to learn the mapping from inputs to outputs. The model minimizes the error by adjusting
its internal parameters using techniques like gradient descent.
Model Testing: After training, the model is evaluated on a separate set of data (testing data) to check its performance and
ability to generalize to new inputs.
Prediction: Once trained, the model can predict the output for new, unseen inputs.
Examples in Finance
Credit Scoring
Input: Customer features such as income, credit history, and employment status.
Output: Predict whether a customer will default on a loan or not (classification problem).
Stock Price Prediction
Input: Historical stock prices, trading volume, and other market indicators.
Output: Predict future stock prices (regression problem).
Fraud Detection
Input: Transaction data, including location, amount, and user behavior.
Output: Classify whether the transaction is fraudulent or not (classification problem).
Benefits of Supervised Learning
Accuracy: High accuracy when enough labeled data is available for training.
Interpretability: Easier to interpret the learned model and understand its decision-making process (e.g., linear regression
models).
Wide Applications: Can be applied to a variety of tasks such as classification, regression, and anomaly detection.
Challenges
Data Labeling: Requires a large amount of labeled data, which can be expensive and time-consuming to obtain.
Overfitting: The model might perform well on the training data but poorly on new data if it learns noise or irrelevant
patterns.
Limited by Training Data: The model’s performance is highly dependent on the quality and representativeness of the
training data.
Unsupervised Learning Overview
Unsupervised learning is a type of machine learning where the model is trained on data without any labeled
output. In other words, the data only includes input features, and the goal of the model is to find hidden patterns,
relationships, or structures within the data. Unlike supervised learning, unsupervised learning does not have
predefined categories or target values for each data point.
Key Concepts
Input Data: The dataset used to train the model contains only features (no labels). The model attempts to learn the inherent
structure or distribution of the data.
Patterns and Structure: The model’s task is to identify meaningful groupings or relationships between the input data points.
No Supervision: There are no explicit labels guiding the model’s learning process, so the model relies solely on the data's
features to find patterns.
How Unsupervised Learning Works
Data Collection: Gather data without labels, where each data point has multiple features.
Model Training: The unsupervised learning algorithm analyzes the input data and attempts to identify patterns,
relationships, or clusters based solely on the features.
Pattern Discovery: The model groups data points into clusters, reduces the dimensionality of data, or detects anomalies.
Interpretation: Once the model has identified patterns, these patterns can be analyzed to extract useful insights or make
decisions.
How Unsupervised Learning Works
Data Collection: Gather data without labels, where each data point has multiple features.
Model Training: The unsupervised learning algorithm analyzes the input data and attempts to identify patterns,
relationships, or clusters based solely on the features.
Pattern Discovery: The model groups data points into clusters, reduces the dimensionality of data, or detects anomalies.
Interpretation: Once the model has identified patterns, these patterns can be analyzed to extract useful insights or make
decisions.
Examples in Finance
1. Customer Segmentation
Input: Customer features such as transaction history, income level, and spending habits.
Output: The algorithm groups customers into different segments based on their behavior. Marketers can then tailor strategies
for each group.
2. Anomaly Detection in Transactions
Input: Transaction data, including time, location, and amount.
Output: Identify outlier transactions that might be fraudulent based on unusual patterns compared to normal user behavior.
3. Portfolio Risk Analysis
Input: Historical returns of assets in a portfolio.
Output: Group similar assets into clusters based on risk and return characteristics, helping investors diversify their portfolios
more effectively.
Benefits of Unsupervised Learning
Discover Hidden Patterns: Can reveal insights and relationships within the data that were not apparent beforehand, useful
for exploratory data analysis.
No Need for Labeled Data: Since labeling data can be costly and time-consuming, unsupervised learning allows models to
learn directly from raw data without the need for manual labeling.
Adaptability: Unsupervised learning algorithms can adapt to new data and discover new patterns over time, making them
suitable for dynamic and evolving environments.
Challenges
Interpretability: The patterns or clusters identified by the model can be difficult to interpret, especially when dealing with
high-dimensional data.
No Direct Supervision: Without labeled data, evaluating the performance of unsupervised learning models is harder. There
is no clear objective like classification accuracy in supervised learning.
Scalability: Some unsupervised learning algorithms, such as hierarchical clustering, may struggle with very large datasets
due to computational limitations.
Common Applications of Unsupervised Learning
Market Basket Analysis: Finding products that are frequently bought together in retail.
Document Clustering: Grouping similar documents based on content, used in information retrieval or topic modeling.
Recommendation Systems: Identifying hidden patterns in user behavior to recommend similar products or content.

In summary, unsupervised learning is a powerful tool for discovering patterns in data without explicit labels. It is
widely used in tasks such as clustering, dimensionality reduction, and anomaly detection, making it highly
valuable in fields like finance, marketing, and data science.
Semi-Supervised Learning Overview
Semi-supervised learning is a machine learning approach that combines a small amount of labeled data with a
large amount of unlabeled data during training. This method leverages the advantages of both supervised and
unsupervised learning, particularly in situations where obtaining labeled data is expensive or time-consuming, but
unlabeled data is abundant.
Key Concepts
Labeled Data: A small portion of the dataset is labeled, meaning the input data points have corresponding output labels or
categories.
Unlabeled Data: A much larger portion of the dataset is unlabeled, containing only the input features without any associated
output labels.
Learning Process: The model learns from both the labeled data (using supervised techniques) and the structure of the
unlabeled data (using unsupervised techniques) to generalize better to unseen data.
How Semi-Supervised Learning Works
Labeling a Small Subset: In a large dataset, only a small fraction of the data points are labeled. These labeled points provide
the foundation for initial learning.
Exploring Unlabeled Data: The model then uses the unlabeled data to identify patterns, clusters, or structures that can help
improve predictions. The assumption is that the unlabeled data contains information that is useful for understanding the
relationships between input features.
Refinement and Prediction: The model refines its learning by combining the information from both labeled and unlabeled
data, leading to better performance, especially when the amount of labeled data is limited.
Examples in Real-World Applications
Text Classification: For example, in spam detection, only a small portion of emails might be labeled as “spam” or “not
spam,” while the vast majority of emails are unlabeled. The model learns to classify spam emails by leveraging both labeled
and unlabeled data.
Image Recognition: In a large collection of images, only a few images might be manually labeled (e.g., dog, cat, car), while
the rest are unlabeled. The model uses both sets to improve classification performance.
Fraud Detection: In financial transactions, there might be few known cases of fraud (labeled data), but millions of
transactions are unlabeled. Semi-supervised learning can help detect fraud by analyzing both labeled fraud cases and the
structure of the remaining transactions.
Benefits of Semi-Supervised Learning
Cost-Efficiency: Reduces the need for a large, fully labeled dataset, which can be expensive and time-consuming to produce.
Improved Accuracy: Semi-supervised learning can improve model accuracy by leveraging unlabeled data to better
understand the data distribution.
Adaptability: Can work well in scenarios where labeled data is scarce, but there is a large amount of unlabeled data, which
is often the case in real-world applications.
Common Algorithms for Semi-Supervised Learning
Self-Training: The model is first trained on labeled data, then predicts labels for the unlabeled data. These predicted labels
are added to the training set, and the model is retrained.
Co-Training: Two models are trained on different views of the data (e.g., different feature subsets). Each model predicts
labels for the unlabeled data, which the other model uses for training.
Graph-Based Methods: The data points are represented as nodes in a graph, where similar data points are connected. The
model propagates labels across the graph, assuming similar data points should have similar labels.
Benefits of Semi-Supervised Learning
Cost-Efficiency: Reduces the need for a large, fully labeled dataset, which can be expensive and time-consuming to produce.
Improved Accuracy: Semi-supervised learning can improve model accuracy by leveraging unlabeled data to better
understand the data distribution.
Adaptability: Can work well in scenarios where labeled data is scarce, but there is a large amount of unlabeled data, which
is often the case in real-world applications.
Common Algorithms for Semi-Supervised Learning
Self-Training: The model is first trained on labeled data, then predicts labels for the unlabeled data. These predicted labels
are added to the training set, and the model is retrained.
Co-Training: Two models are trained on different views of the data (e.g., different feature subsets). Each model predicts
labels for the unlabeled data, which the other model uses for training.
Graph-Based Methods: The data points are represented as nodes in a graph, where similar data points are connected. The
model propagates labels across the graph, assuming similar data points should have similar labels.
Challenges of Semi-Supervised Learning
Labeling Errors: Incorrect labeling of even a small amount of data can lead to poor performance because the model may
propagate these errors to the unlabeled data.
Assumption of Structure: Semi-supervised learning assumes that the unlabeled data contains useful information (e.g.,
clusters), but if this assumption is incorrect, the model may not benefit from the additional data.
Complexity: Combining both labeled and unlabeled data effectively can be complex, requiring careful tuning of algorithms
and assumptions.
Example in Finance: Credit Scoring
In credit scoring, only a limited number of borrowers may have a labeled credit history (good/bad). However, many other
applicants might have limited or no history (unlabeled). By using semi-supervised learning, the model can learn from both
the small labeled dataset and the larger unlabeled set to predict the creditworthiness of new applicants.
Summary
Semi-supervised learning is a powerful approach for situations where labeled data is scarce but unlabeled data is
plentiful. By combining supervised and unsupervised techniques, it can improve model performance in many real-
world tasks like text classification, image recognition, and fraud detection.
Reinforcement Learning (RL) Overview
Reinforcement learning (RL) is a type of machine learning in which an agent learns to make decisions by interacting with an
environment. The agent takes actions to maximize some notion of cumulative reward, learning from the consequences of its
actions over time through trial and error. Unlike supervised learning, where the correct answers are provided, RL relies on
the feedback from the environment to guide the agent toward optimal behavior.
Key Concepts
Agent: The decision-maker that interacts with the environment.
Environment: Everything that the agent interacts with; it provides feedback to the agent based on its actions.
State: A specific situation or configuration of the environment at a given time.
Action: The choices that the agent can make in a given state.
Reward: Feedback received from the environment in response to an action; it can be positive (reward) or negative (penalty).
Policy: The strategy that the agent uses to decide its actions based on the current state.
Value Function: Estimates how good it is to be in a particular state, taking into account future rewards.
Exploration vs. Exploitation: The balance between trying out new actions (exploration) to discover potentially better long-
term strategies and using known actions (exploitation) that yield the highest immediate rewards.
How Reinforcement Learning Works
Interaction with the Environment: The agent perceives the current state of the environment and selects an action based on
its policy.
Receiving Feedback: The environment transitions to a new state based on the agent’s action and provides a reward or
penalty.
Learning: The agent uses this feedback to adjust its policy, improving its future actions to maximize cumulative rewards.
Iteration: This cycle repeats, allowing the agent to learn an optimal policy through repeated interactions and feedback loops.
Types of Reinforcement Learning
Model-Free RL: The agent learns directly from the environment by trial and error without any prior knowledge or
assumptions about how the environment works.
Example: Q-learning, where the agent learns a value function for each action in each state.
Model-Based RL: The agent builds a model of the environment, predicting the consequences of its actions to improve its
decision-making.
Example: The agent predicts how the environment will respond to its actions and uses this knowledge to plan more effectively.
Examples of Reinforcement Learning
Gaming: RL is commonly used in video game AI, where agents learn to play games like Chess, Go, or video games by trial
and error, improving their strategies over time.
Autonomous Vehicles: In self-driving cars, RL helps the vehicle learn to navigate by interacting with its environment,
learning from feedback such as avoiding collisions or optimizing travel time.
Robotics: RL is used to train robots to perform complex tasks like picking up objects or navigating through a room by
learning through repeated trials and improving their actions.
Finance: RL can be used in algorithmic trading, where an agent learns to make buying and selling decisions based on market
data to maximize profits over time.
Benefits of Reinforcement Learning
Autonomy: RL agents can operate in environments with minimal supervision, learning from feedback rather than explicit
instructions.
Adaptability: RL agents can learn and adapt to dynamic environments where the conditions and states change over time.
Generalization: Once trained, RL agents can generalize their knowledge to similar tasks and environments.
Challenges of Reinforcement Learning
Exploration-Exploitation Dilemma: Balancing exploration (trying new actions) and exploitation (relying on known
actions) is a difficult problem in RL.
Sparse Rewards: In some environments, rewards may be rare or delayed, making it hard for the agent to learn effective
policies.
Computational Resources: RL often requires significant computational power, especially for complex environments and
tasks.
Example in Finance: Portfolio Management
In portfolio management, an RL agent can be trained to buy, sell, or hold assets by interacting with the financial
market environment. The agent's goal is to maximize returns over time, learning from feedback based on the profit
or loss generated by each decision. The agent adjusts its policy based on market trends, risk factors, and other
variables.

Summary
Reinforcement learning is a powerful approach that enables machines and agents to learn optimal behavior
through interaction with their environment and receiving feedback. It has wide applications in gaming, robotics,
finance, and autonomous systems, offering the potential for highly adaptive and autonomous decision-making
systems.
Credit Scoring and Risk Assessment
ML models are used to predict creditworthiness by analyzing vast amounts of borrower data (income, credit history, behavior).
Fraud Detection
ML algorithms are critical in detecting fraudulent activities by analyzing transaction data for anomalies that reviewers might miss.
Algorithmic Trading
ML models are used in algorithmic trading to predict price movements and make trades automatically.
Portfolio Management (Robo-Advisors)
Robo-advisors use ML to offer personalized investment advice by analyzing user profiles, risk appetite, and financial goals.
Sentiment Analysis for Market Predictions
ML models analyze large amounts of unstructured data from news, social media, and reports to assess market sentiment.
Risk Management and Forecasting
ML helps banks and financial institutions forecast market risks and manage liquidity, especially in volatile markets.
Regulatory Compliance (RegTech)
ML algorithms are increasingly used to ensure compliance with regulatory requirements by monitoring transactions and analyzing
documentation.
Data Quality and Availability
Financial models depend heavily on data accuracy, and missing or biased data can lead to poor predictions.
Example: Credit scoring models trained on biased data may lead to unfair decisions about who gets loans.
Interpretability and Transparency
Financial regulators often require that models be interpretable. Some ML models (like neural networks) are considered
"black boxes" due to their complexity.
Example: Credit scoring models need to provide clear reasons why a loan was approved or denied.
Model Risk
ML models can degrade over time if not continuously updated with fresh data. In finance, outdated models may lead to
significant losses.
Example: A trading algorithm that doesn’t adapt to new market conditions could result in poor investment decisions.
How Deep learning works
Key concepts
Challenges of DL
Key differences between DL and ML
Deep Learning (DL) is a subset of Machine Learning that uses neural networks with multiple layers (hence
"deep") to model and understand complex patterns in data. DL has revolutionized various fields by enabling
models to perform more accurate predictions, classification, and decision-making without needing human-defined
features. Its application in finance has been growing, particularly in areas requiring sophisticated data analysis and
real-time decision-making.
1. How Deep Learning Works
Deep Learning models are composed of neural networks with multiple layers of neurons (nodes). Each neuron
mimics the human brain, processing inputs, applying weights, biases, and passing outputs to the next layer. The
deeper the network, the more abstract features it can learn from data.
Input Layer: The first layer that receives the data.
Hidden Layers: Multiple layers between input and output where data is processed.
Output Layer: The final layer that provides the prediction or classification result.
Each layer in a deep neural network transforms the input data into more abstract representations, allowing the
model to detect increasingly complex patterns.
Artificial Neural Networks (ANNs)
An ANN is the foundation of deep learning, where neurons are connected in layers, with each neuron receiving inputs, applying a
function, and producing an output.
•Application: Predicting stock prices based on historical financial data and various market indicators.
Convolutional Neural Networks (CNNs)
CNNs are particularly effective for image and pattern recognition, but they can also be used in finance for analyzing time series data
or transaction histories.
•Application: Analyzing graphical data or recognizing patterns in trading charts.
Recurrent Neural Networks (RNNs)
RNNs are designed for sequential data, where previous outputs can influence current ones. This makes them useful in predicting
trends over time.
•Application: Forecasting stock market prices, detecting fraud in transaction sequences, or predicting customer lifetime value based
on behavioral data.
Long Short-Term Memory (LSTM)
LSTMs are a special type of RNN that can capture long-term dependencies in sequential data, solving issues like vanishing gradients
that affect traditional RNNs.
•Application: Predicting long-term trends in markets or managing credit risk over time.
Data Availability and Quality
Deep learning requires vast amounts of high-quality data to train models effectively. In finance, obtaining clean and labeled data can be difficult due to
privacy concerns, fragmented systems, or inconsistencies in how data is recorded.
•Example: Poor-quality data can lead to overfitting or misleading predictions, especially in financial markets where small inaccuracies can result in
significant losses.
Interpretability
Deep learning models, particularly neural networks, are often referred to as "black boxes" due to their complexity and difficulty in understanding how
they arrive at specific decisions. This can be problematic in finance, where transparency is required by regulators and clients.
•Example: A bank using a deep learning model to approve loans may struggle to explain why a certain loan was rejected, creating issues of trust and
compliance with regulators.
Computational Costs
Training deep learning models requires significant computational power and time, especially with large datasets. This can be costly and time-
consuming for financial institutions, limiting their ability to deploy DL in real-time applications.
•Example: High-frequency trading requires rapid decision-making, but deep learning models may not always be fast enough to handle microsecond-
level trading decisions.
Risk of Overfitting
Deep learning models are prone to overfitting, especially when working with small or noisy datasets. Overfitting occurs when the model becomes too
complex and fits the training data too closely, losing its ability to generalize to new data.
•Example: A deep learning model may perform exceptionally well on historical stock price data but fail to adapt to new market conditions, leading to
poor performance in real-world trading.
Data Requirement: DL requires significantly larger datasets compared to traditional ML models.
Feature Engineering: In ML, features must be manually selected, while DL models automatically extract
relevant features from raw data.
Model Complexity: DL models are more complex with multiple hidden layers, while traditional ML models are
often simpler with one or two layers.
Computational Power: DL models demand much higher computational resources, especially during training.
Deep learning offers immense potential in finance, from fraud detection to algorithmic trading, but it also poses
challenges related to data quality, interpretability, and computational requirements. As these challenges are
addressed, the adoption of deep learning in finance will continue to grow, enabling more advanced, accurate, and
real-time decision-making capabilities.

Ai & ML Digital Notes
No ratings yet
Ai & ML Digital Notes
177 pages
Algorithm
No ratings yet
Algorithm
3 pages
04-Searching and Sorting Algorithms
No ratings yet
04-Searching and Sorting Algorithms
1 page
DAA-Unit II
No ratings yet
DAA-Unit II
12 pages
Basics of Data Structures and Algorithms
No ratings yet
Basics of Data Structures and Algorithms
40 pages
Fundamentals of Computing Algorithms
No ratings yet
Fundamentals of Computing Algorithms
60 pages
08
No ratings yet
08
60 pages
DSA PPT
No ratings yet
DSA PPT
23 pages
DCA7104 Analysis and Design of Algorithms
No ratings yet
DCA7104 Analysis and Design of Algorithms
12 pages
Data Structure & Algorithms Complete Interview Guide ?
No ratings yet
Data Structure & Algorithms Complete Interview Guide ?
300 pages
CSC 425 Summary Note
No ratings yet
CSC 425 Summary Note
13 pages
DCA7104 - Analysis and Design of Algorithms
No ratings yet
DCA7104 - Analysis and Design of Algorithms
14 pages
Lecture 1 DAA
No ratings yet
Lecture 1 DAA
52 pages
Unitwise Definitions
No ratings yet
Unitwise Definitions
6 pages
Exam
No ratings yet
Exam
15 pages
Merge Sort Algorithm: Quick Sort Also Uses Divide and Conquer Technique Like
No ratings yet
Merge Sort Algorithm: Quick Sort Also Uses Divide and Conquer Technique Like
5 pages
Daa Lab File
No ratings yet
Daa Lab File
36 pages
Types of Algorithm Design Techniques
No ratings yet
Types of Algorithm Design Techniques
4 pages
Introduction to Algorithms
No ratings yet
Introduction to Algorithms
41 pages
Group Assignment - Theory of Algorithms
No ratings yet
Group Assignment - Theory of Algorithms
11 pages
DSA_Unit_05
No ratings yet
DSA_Unit_05
9 pages
35 Algorithm Types
No ratings yet
35 Algorithm Types
22 pages
Untitled
0% (3)
Untitled
10 pages
Algorithm
No ratings yet
Algorithm
54 pages
Algorithm-Lecture4 - Sorting-1
No ratings yet
Algorithm-Lecture4 - Sorting-1
45 pages
Data Structure Assignment
No ratings yet
Data Structure Assignment
6 pages
Top 6 Algorithms
No ratings yet
Top 6 Algorithms
9 pages
ADA unit-1
No ratings yet
ADA unit-1
14 pages
Da Unit IV
No ratings yet
Da Unit IV
8 pages
2-Divide and Conquer Approach
No ratings yet
2-Divide and Conquer Approach
162 pages
AAA Lecture 6&7 Divide and Conquer
No ratings yet
AAA Lecture 6&7 Divide and Conquer
49 pages
Algorithm Answers
No ratings yet
Algorithm Answers
27 pages
Algorithm Design Unit 4
No ratings yet
Algorithm Design Unit 4
7 pages
Unit 1
No ratings yet
Unit 1
54 pages
Main PPT Lesson
No ratings yet
Main PPT Lesson
85 pages
DAAlab_file
No ratings yet
DAAlab_file
24 pages
Atul GFG Ds Algorithm
No ratings yet
Atul GFG Ds Algorithm
10 pages
Daa Unit 2
No ratings yet
Daa Unit 2
23 pages
Clrs Content
No ratings yet
Clrs Content
5 pages
15. Algorithm Designs
No ratings yet
15. Algorithm Designs
56 pages
DAA Lab
No ratings yet
DAA Lab
178 pages
Divide and Conquer Algorithm
No ratings yet
Divide and Conquer Algorithm
7 pages
Dsa Reviewer 1
No ratings yet
Dsa Reviewer 1
9 pages
Diploma 5 Sem ADA Notes 3
No ratings yet
Diploma 5 Sem ADA Notes 3
18 pages
Interview Preparation Plan for Fresher
No ratings yet
Interview Preparation Plan for Fresher
13 pages
Unit I Design And Analysis of Algorithms continued
No ratings yet
Unit I Design And Analysis of Algorithms continued
21 pages
(A) What Is Randomized Quicksort? Analyse The Expected Running Time of Randomized Quicksort, With The Help of A Suitable Example. Answer
No ratings yet
(A) What Is Randomized Quicksort? Analyse The Expected Running Time of Randomized Quicksort, With The Help of A Suitable Example. Answer
14 pages
Unit 3
No ratings yet
Unit 3
68 pages
Ds Mod5
No ratings yet
Ds Mod5
138 pages
Algorithm Design Paradigms: General Approaches To The Construction of Efficient Solutions To
No ratings yet
Algorithm Design Paradigms: General Approaches To The Construction of Efficient Solutions To
7 pages
Data Structures and Algorithms
No ratings yet
Data Structures and Algorithms
6 pages
Aiml Unit 1
No ratings yet
Aiml Unit 1
10 pages
35 Algorithm Types
No ratings yet
35 Algorithm Types
14 pages
Mca 403
No ratings yet
Mca 403
85 pages
All Algorithms
No ratings yet
All Algorithms
17 pages
DS_class_ppts(Searching,Sorting to AVLtrees) (1)
No ratings yet
DS_class_ppts(Searching,Sorting to AVLtrees) (1)
165 pages
Introduction to Algorithms
From Everand
Introduction to Algorithms
S VASIST
No ratings yet
Mastering Data Structures and Algorithms in C and C++
From Everand
Mastering Data Structures and Algorithms in C and C++
Sachin Naha
No ratings yet
Search Algorithm: Fundamentals and Applications
From Everand
Search Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet
Data Structures and Algorithm
From Everand
Data Structures and Algorithm
Knowledge Flow
No ratings yet
Beam Search: Fundamentals and Applications
From Everand
Beam Search: Fundamentals and Applications
Fouad Sabry
No ratings yet
Unit II-Uninformed Search Strategies
No ratings yet
Unit II-Uninformed Search Strategies
53 pages
Bilal Ahmad Ai & DSS Assign # 01
No ratings yet
Bilal Ahmad Ai & DSS Assign # 01
8 pages
Ai Notes - 15-21
No ratings yet
Ai Notes - 15-21
7 pages
Assignment Unit 2 Problem Solving by Search
No ratings yet
Assignment Unit 2 Problem Solving by Search
2 pages
HPC 1 BFS
No ratings yet
HPC 1 BFS
10 pages
Queue
No ratings yet
Queue
448 pages
Unit 4 - Trees
No ratings yet
Unit 4 - Trees
162 pages
Dsa Course File
No ratings yet
Dsa Course File
161 pages
CS502 Quiz-3 by Vu Topper RM-1
No ratings yet
CS502 Quiz-3 by Vu Topper RM-1
92 pages
Experiment No. 02: PEAS Description For A Medical Diagnosis System: Performance Measure
No ratings yet
Experiment No. 02: PEAS Description For A Medical Diagnosis System: Performance Measure
9 pages
A Brief Study of Graph Data Structure: Ijarcce
No ratings yet
A Brief Study of Graph Data Structure: Ijarcce
5 pages
IV-I NNSC L N
No ratings yet
IV-I NNSC L N
130 pages
Artificial Intelligence
100% (2)
Artificial Intelligence
46 pages
AI Lab Course Handout Spring 2024
No ratings yet
AI Lab Course Handout Spring 2024
10 pages
Uninformed Search
No ratings yet
Uninformed Search
45 pages
AID 5th Semester Deep Learning Laboratory - AD3511 - Lab Manual
No ratings yet
AID 5th Semester Deep Learning Laboratory - AD3511 - Lab Manual
86 pages
DSA Syllabus
No ratings yet
DSA Syllabus
7 pages
AI Unit2 ProblemSolving
No ratings yet
AI Unit2 ProblemSolving
191 pages
Data Structure MCQ Question
0% (1)
Data Structure MCQ Question
6 pages
Lab 1
No ratings yet
Lab 1
34 pages
Data Structure KCS301
No ratings yet
Data Structure KCS301
2 pages
Mid_Sem_Report (1)
No ratings yet
Mid_Sem_Report (1)
10 pages
Problem Solving
No ratings yet
Problem Solving
85 pages
Mathematical Foundation of Computer Science Jan2007 r059210502
No ratings yet
Mathematical Foundation of Computer Science Jan2007 r059210502
10 pages
Assignment Problem
No ratings yet
Assignment Problem
17 pages
Automating Variation and Repeater Analysis in Physical Design of
No ratings yet
Automating Variation and Repeater Analysis in Physical Design of
79 pages
Artificial Intelligence Unit-I Question Bank (Solved) (Theory & MCQ) Theory
No ratings yet
Artificial Intelligence Unit-I Question Bank (Solved) (Theory & MCQ) Theory
78 pages
Imp
No ratings yet
Imp
3 pages
PSC Per.2
No ratings yet
PSC Per.2
9 pages

241010_C1.5. Algorithm, Machine Learning and Deep Learning

Uploaded by

241010_C1.5. Algorithm, Machine Learning and Deep Learning

Uploaded by

Chapter 1.

5: Algorithm and Machine learning

Prepared for Technology Applications in Fiance – FTU 2024

You might also like