
Kolmogorov–Arnold Networks
Umar Jamil
Downloaded from: https://github.com/hkproj/kan-notes
License: Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0):
https://creativecommons.org/licenses/by-nc/4.0/legalcode

Not for commercial use



Topics
• Review of Multilayer Perceptron
• Introduction to data fitting
• Bézier Curves
• B-Splines
• Universal Approximation Theorem
• Kolmogorov-Arnold Representation Theorem
• MLPs vs KAN
• Properties
• Multi-layer KANs
• Parameters count: MLPs vs KANs
• Grid extension
• Interpretability
• Continual training

Prerequisites
• Basics of calculus (derivative)
• Basics of deep learning (backpropagation)



The Multi-layer Perceptron (MLP)
A multilayer perceptron is a neural network made up of multiple layers of neurons, organized in a feed-forward way, with nonlinear activation functions in
between.
How does it work?

[Figure: an MLP with an input layer, two hidden layers (Hidden Layer 1, Hidden Layer 2), and an output layer producing logits for 5 classes (Class 1 … Class 5).]
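A minimal sketch of such an MLP in PyTorch (the layer sizes here are illustrative assumptions, not taken from the slide):

```python
import torch
import torch.nn as nn

# Input -> two hidden layers with nonlinear activations -> output logits for 5 classes.
mlp = nn.Sequential(
    nn.Linear(16, 32), nn.ReLU(),
    nn.Linear(32, 32), nn.ReLU(),
    nn.Linear(32, 5),          # output logits, one per class
)

x = torch.randn(8, 16)         # a batch of 8 inputs with 16 features each
logits = mlp(x)
print(logits.shape)            # torch.Size([8, 5])
```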



The Linear layer in PyTorch
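The code shown on this slide is an image that does not survive in this transcript; here is a minimal sketch of how PyTorch's nn.Linear is typically used (shapes chosen to match the example on the next slide):

```python
import torch
import torch.nn as nn

# A linear layer mapping 3 input features to 5 output features.
linear = nn.Linear(in_features=3, out_features=5)

# PyTorch stores the weight as a (out_features, in_features) matrix
# and the bias as a (out_features,) vector.
print(linear.weight.shape)  # torch.Size([5, 3])
print(linear.bias.shape)    # torch.Size([5])

# A batch of 10 items, each with 3 features.
x = torch.randn(10, 3)
y = linear(x)               # computes x @ weight.T + bias
print(y.shape)              # torch.Size([10, 5])
```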



The Linear layer in detail
A linear layer in an MLP is made of a weight matrix and a bias vector.
O = X Wᵀ + b

In this example, X has shape (10, 3): 10 items (Item 1 … Item 10), each with 3 features (f1, f2, f3). Wᵀ (rows w₁, w₂, w₃) has shape (3, 5), with one column per output neuron n1 … n5, so X Wᵀ has shape (10, 5). The bias vector b, of shape (1, 5), is broadcast to every row of the X Wᵀ table, giving the output O of shape (10, 5).

For a single item with features a₁, a₂, a₃, the first output is:

z₁ = r₁ + b₁ = (Σᵢ₌₁³ aᵢ wᵢ) + b₁
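A quick numeric check of the formula above, using the same shapes as the example (a sketch, not the slide's code):

```python
import torch
import torch.nn.functional as F

X = torch.randn(10, 3)   # 10 items, 3 features each
W = torch.randn(5, 3)    # weight matrix, (out_features, in_features)
b = torch.randn(5)       # bias vector, broadcast over the 10 rows

O_manual = X @ W.T + b           # (10, 3) @ (3, 5) + (1, 5) -> (10, 5)
O_linear = F.linear(X, W, b)     # PyTorch's linear layer computes the same thing

print(O_manual.shape)                      # torch.Size([10, 5])
print(torch.allclose(O_manual, O_linear))  # True
```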



Why do we need activation functions?
After each Linear layer, we usually apply a nonlinear activation function. Why?

O₁ = x W₁ᵀ + b₁

O₂ = O₁ W₂ᵀ + b₂

O₂ = (x W₁ᵀ + b₁) W₂ᵀ + b₂

O₂ = x W₁ᵀ W₂ᵀ + b₁ W₂ᵀ + b₂

As you can see, if we do not apply any activation function, the output is just a linear function of the input, which means that our MLP cannot learn any non-linear mapping between input and output, and most real-world data is non-linear.
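A small sketch illustrating the point: two stacked linear layers with no activation in between collapse into a single linear layer (the shapes are illustrative):

```python
import torch
import torch.nn as nn

lin1 = nn.Linear(3, 8)
lin2 = nn.Linear(8, 5)

x = torch.randn(10, 3)
y_stacked = lin2(lin1(x))  # no activation in between

# Collapse the two layers into one: W = W2 @ W1, b = b1 @ W2.T + b2
W = lin2.weight @ lin1.weight
b = lin1.bias @ lin2.weight.T + lin2.bias
y_single = x @ W.T + b

print(torch.allclose(y_stacked, y_single, atol=1e-6))  # True

# With a nonlinearity (e.g. ReLU) in between, no such collapse is possible.
```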



Introduction to data fitting
Imagine you're making a 2D game and you want to animate your sprite (character) so that it passes through a series of points. One way would be to draw a straight line from one point to the next, but that wouldn't look very good. What if you could create a smoother path instead?



Smooth curves through polynomial fitting
How to find the equation of such a smooth curve?
One way is to write the generic equation of a polynomial curve and force it to pass through the series of points to obtain the coefficients of the equation.
We have 4 points, so we can set up a system of 4 equations, which means we can solve for 4 unknowns: we get a polynomial of degree 3.

y = ax³ + bx² + cx + d

We can write our system of equations as follows and solve it to find the equation of the curve:

5 = a(0)³ + b(0)² + c(0) + d
1 = a(1)³ + b(1)² + c(1) + d
3 = a(2)³ + b(2)² + c(2) + d
2 = a(5)³ + b(5)² + c(5) + d
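A minimal sketch of solving this system numerically (the four points (0, 5), (1, 1), (2, 3), (5, 2) are taken from the equations above):

```python
import numpy as np

# Points the cubic y = a*x^3 + b*x^2 + c*x + d must pass through.
xs = np.array([0.0, 1.0, 2.0, 5.0])
ys = np.array([5.0, 1.0, 3.0, 2.0])

# One row per point: [x^3, x^2, x, 1] @ [a, b, c, d] = y
A = np.stack([xs**3, xs**2, xs, np.ones_like(xs)], axis=1)
coeffs = np.linalg.solve(A, ys)

print("a, b, c, d =", coeffs)
print("check:", A @ coeffs)  # should reproduce ys
```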



What if I have hundreds of points?
If you have N points, you need a polynomial of degree N – 1 if you want the curve to pass through all of them. But as you can see, when we have lots of points, the polynomial starts oscillating wildly at the extremes. We wouldn't want the character in our 2D game to fly off the screen while we're animating it, right?
Thankfully, someone took the time to solve this problem: we have Bézier curves!

Source: https://arachnoid.com/polysolve/



Bézier curves
A Bézier curve is a parametric curve (which means that all the coordinates of the curve depend on an independent variable t, between 0 and 1).
For example, given two points, we can calculate the linear Bézier curve as the following interpolation:

B(t) = P₀ + t(P₁ − P₀) = (1 − t)P₀ + tP₁

Given three points, we can calculate the quadratic Bézier curve defined by them:

Q₀(t) = (1 − t)P₀ + tP₁
Q₁(t) = (1 − t)P₁ + tP₂

B(t) = (1 − t)Q₀(t) + tQ₁(t)
     = (1 − t)[(1 − t)P₀ + tP₁] + t[(1 − t)P₁ + tP₂]
     = (1 − t)²P₀ + 2(1 − t)tP₁ + t²P₂

With four points, we can proceed with a similar reasoning.

Source: Wikipedia
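A quick numeric sanity check of the expansion above (the control points are arbitrary examples):

```python
import numpy as np

P0, P1, P2 = np.array([0.0, 5.0]), np.array([1.0, 1.0]), np.array([2.0, 3.0])

def bezier_nested(t):
    # De Casteljau-style evaluation: two nested linear interpolations.
    Q0 = (1 - t) * P0 + t * P1
    Q1 = (1 - t) * P1 + t * P2
    return (1 - t) * Q0 + t * Q1

def bezier_bernstein(t):
    # Expanded (Bernstein) form of the same quadratic curve.
    return (1 - t) ** 2 * P0 + 2 * (1 - t) * t * P1 + t ** 2 * P2

for t in np.linspace(0.0, 1.0, 5):
    assert np.allclose(bezier_nested(t), bezier_bernstein(t))
print("both forms agree")
```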



Bézier curves: going deeper
Yes, we can go deeper! If we have n + 1 points, we can find the degree-n Bézier curve using the following formula:

B(t) = Σᵢ₌₀ⁿ C(n, i) (1 − t)ⁿ⁻ⁱ tⁱ Pᵢ = Σᵢ₌₀ⁿ bᵢ,ₙ(t) Pᵢ

The bᵢ,ₙ(t) are the Bernstein basis polynomials, and C(n, i) are the binomial coefficients:

C(n, i) = n! / (i! (n − i)!)

[Figure: the four cubic Bernstein basis polynomials, blue: b₀,₃(t), green: b₁,₃(t), red: b₂,₃(t), cyan: b₃,₃(t).]

Source: Wikipedia
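A minimal sketch that evaluates a degree-n Bézier curve directly from the Bernstein form above (the control points are arbitrary examples):

```python
import math
import numpy as np

def bezier(points, t):
    """Evaluate the degree-n Bézier curve for n+1 control points at parameter t."""
    points = np.asarray(points, dtype=float)
    n = len(points) - 1
    result = np.zeros(points.shape[1])
    for i, P in enumerate(points):
        b = math.comb(n, i) * (1 - t) ** (n - i) * t ** i  # Bernstein basis b_{i,n}(t)
        result += b * P
    return result

control_points = [(0, 5), (1, 1), (2, 3), (5, 2)]  # example points from earlier
print(bezier(control_points, 0.0))   # equals the first control point
print(bezier(control_points, 0.5))
print(bezier(control_points, 1.0))   # equals the last control point
```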



From Bézier curves to B-Splines
If you have lots of points (say n), you need a Bézier curve of degree n – 1 to approximate them well, but that can be quite expensive to compute.
Someone wise thought: why don't we stitch together many small Bézier curves between all these points, instead of one big Bézier curve that interpolates all of them?

Source: Wikipedia



B-splines in detail
A degree-k B-spline curve defined by n control points consists of n − k Bézier curve segments.
For example, if we use quadratic Bézier curves and we have 6 points, we get 6 − 2 = 4 Bézier segments.
In this case n = 6 and k = 2.

Source: Wolfram Alpha
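A minimal sketch using SciPy to build a B-spline matching the example above (n = 6 control points, degree k = 2); the control point values and the clamped uniform knot vector are chosen only for illustration:

```python
import numpy as np
from scipy.interpolate import BSpline

k = 2                       # degree (quadratic)
control_points = np.array([(0, 0), (1, 2), (2, -1), (3, 3), (4, 0), (5, 2)], dtype=float)
n = len(control_points)     # 6 control points -> n - k = 4 Bézier segments

# Clamped knot vector: length must be n + k + 1 = 9.
knots = np.array([0, 0, 0, 1, 2, 3, 4, 4, 4], dtype=float)

spline = BSpline(knots, control_points, k)

ts = np.linspace(knots[k], knots[n], 100)  # valid parameter range
curve = spline(ts)                          # (100, 2) array of points on the curve
print(curve[0], curve[-1])
```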



B-splines in detail
The degree of our B-spline also tells us what kind of continuity we get (a degree-k B-spline is Cᵏ⁻¹ continuous at its simple knots).

Source: MIT



Calculating B-splines: algorithm

Source: MIT
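The algorithm shown on the original slide (from the MIT source) is an image that does not survive in this transcript; below is a sketch of the standard Cox–de Boor recursion for the B-spline basis functions Nᵢ,ₖ, which is what such slides typically present:

```python
def bspline_basis(i, k, t, knots):
    """Cox–de Boor recursion: value of basis function N_{i,k} at parameter t."""
    if k == 0:
        return 1.0 if knots[i] <= t < knots[i + 1] else 0.0

    left_den = knots[i + k] - knots[i]
    right_den = knots[i + k + 1] - knots[i + 1]

    left = 0.0 if left_den == 0 else (t - knots[i]) / left_den * bspline_basis(i, k - 1, t, knots)
    right = 0.0 if right_den == 0 else (knots[i + k + 1] - t) / right_den * bspline_basis(i + 1, k - 1, t, knots)
    return left + right

# Example: the six quadratic basis functions over the clamped knot vector used earlier.
knots = [0, 0, 0, 1, 2, 3, 4, 4, 4]
k = 2
values = [bspline_basis(i, k, 2.5, knots) for i in range(len(knots) - k - 1)]
print([round(v, 3) for v in values], "sum =", round(sum(values), 3))  # sum is 1
```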



B-Splines: basis functions

[Figure: the six quadratic B-spline basis functions N₀,₂, N₁,₂, N₂,₂, N₃,₂, N₄,₂ and N₅,₂.]



B-splines: local control
Moving a control point only changes the curve locally (in the proximity of that control point), leaving the more distant Bézier segments unchanged!
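A quick sketch demonstrating local control with SciPy (same illustrative quadratic knot vector as before): moving one control point only changes the curve where that point's basis function is non-zero.

```python
import numpy as np
from scipy.interpolate import BSpline

k = 2
knots = np.array([0, 0, 0, 1, 2, 3, 4, 4, 4], dtype=float)
coeffs = np.array([0.0, 2.0, -1.0, 3.0, 0.0, 2.0])   # illustrative control values

moved = coeffs.copy()
moved[2] += 5.0                                       # move the third control point

ts = np.linspace(0, 4, 200)
diff = np.abs(BSpline(knots, moved, k)(ts) - BSpline(knots, coeffs, k)(ts))
changed = ts[diff > 1e-12]
print("curve changed only for t in", changed.min(), "to", changed.max())
# Only the region where N_{2,2} is non-zero (roughly t in [0, 3]) moves.
```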



Universal Approximation Theorem
We can think of neural networks as function approximators. Usually, we have access to some data points generated by an ideal function that we cannot observe directly. The goal of training a neural network is to approximate this ideal function.
But how do we know if a neural network is powerful enough to model our ideal function? What can we say about the expressive power of neural networks?
This is what the universal approximation theorem is all about: it is a series of results that characterize what neural networks can represent.
It has been proven that neural networks of a certain width (number of neurons) and depth (number of layers) can approximate any continuous function when using suitable non-linear activation functions, for example the ReLU function. Check Wikipedia for more theoretical results.

I want to emphasize what it means to be a universal approximator: it means that given an ideal function (or a family of functions) that models the training data, the network can learn to approximate it as well as we want; that is, given an error ε, we can always find an approximating function that is within this error of the ideal function.
This is, however, a theoretical result; it doesn't tell us how to achieve it in practice. On a practical level, we have many problems:
• Achieving good approximations may take enormous amounts of computational power
• We may need a very large quantity of training data
• Our hardware may not be able to represent certain weights in 32-bit precision
• Our optimizer may get stuck in a local minimum

So, as you can see, just because a neural network can learn anything doesn't mean we are able to learn it in practice. But at least we know that the limits are practical.



Kolmogorov-Arnold representation theorem
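The statement of the theorem on the original slide is an image that does not survive in this transcript. For reference, the standard form of the Kolmogorov–Arnold representation theorem states that any continuous multivariate function f on [0, 1]ⁿ can be written as a finite composition of continuous univariate functions and addition:

f(x₁, …, xₙ) = Σ_{q=0}^{2n} Φ_q ( Σ_{p=1}^{n} φ_{q,p}(x_p) )

where the φ_{q,p} and Φ_q are continuous functions of a single variable.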



Kolmogorov-Arnold Networks
This network can be thought of as two layers applied in sequence:
• The first layer maps the n = 2 input features (x₁, x₂) into 2n + 1 = 5 intermediate features (h₁ … h₅).
• The second layer maps the 5 intermediate features into 1 output feature (o₁).

Instead of having learnable weights, we have learnable functions on the edges, and at each node we sum the outputs of the learnable functions feeding into it.

[Figure: the 2 → 5 → 1 KAN described above, with learnable functions φ₁,₁ … φ₅,₂ on the first layer's edges and φ₁ … φ₅ on the second layer's edges.]
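Putting the two layers together, this example network computes exactly the form given by the Kolmogorov–Arnold representation theorem for n = 2:

o₁ = Σ_{q=1}^{5} φ_q ( φ_{q,1}(x₁) + φ_{q,2}(x₂) )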



MLP vs KAN



Multi-layer KAN

Layer 1
2 input features, 5 output features
total of 2 × 5 = 10 functions to "learn"

Layer 2
5 input features, 1 output feature
total of 5 × 1 = 5 functions to "learn"



Implementation details
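The implementation details on the original slide are an image that does not survive in this transcript. Below is a minimal, simplified sketch of a KAN layer in PyTorch; all names and the exact parameterization are illustrative assumptions, not the author's code. Each edge carries a learnable univariate function (here parameterized as a sum of Gaussian bumps on a fixed grid instead of the B-splines used in the paper) plus a SiLU base term; the structure (sum over edges) is the same.

```python
import torch
import torch.nn as nn

class SimpleKANLayer(nn.Module):
    """Sketch of a KAN layer: one learnable univariate function per (input, output) edge."""

    def __init__(self, in_features, out_features, grid_size=8, x_min=-2.0, x_max=2.0):
        super().__init__()
        self.register_buffer("grid", torch.linspace(x_min, x_max, grid_size))  # fixed centers
        self.coeffs = nn.Parameter(torch.zeros(out_features, in_features, grid_size))
        self.base_weight = nn.Parameter(torch.randn(out_features, in_features) * 0.1)
        self.width = (x_max - x_min) / grid_size

    def forward(self, x):                                           # x: (batch, in_features)
        # Basis values for every input feature at every grid point.
        basis = torch.exp(-((x[..., None] - self.grid) / self.width) ** 2)  # (batch, in, grid)
        # phi_{j,i}(x_i) = base term + learned combination of basis functions, summed over edges.
        spline = torch.einsum("big,oig->bo", basis, self.coeffs)
        base = torch.nn.functional.silu(x) @ self.base_weight.T
        return base + spline

# A 2 -> 5 -> 1 KAN like the example in the slides.
kan = nn.Sequential(SimpleKANLayer(2, 5), SimpleKANLayer(5, 1))
y = kan(torch.randn(10, 2))
print(y.shape)  # torch.Size([10, 1])
```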



Parameters count

Compared to an MLP, we also have (G + k) parameters for each activation function, because we need to learn the coefficients (control points) of its B-spline, where G is the number of grid intervals and k is the spline degree.
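A rough worked comparison, using the count stated above and ignoring implementation extras such as biases and base weights:
• An MLP layer with N inputs and N outputs has about N² weights.
• A KAN layer with N inputs and N outputs has about N² edge functions, each with (G + k) parameters, i.e. about N²(G + k) parameters.
For example, with N = 64, G = 5 and k = 3, that is roughly 4,096 parameters for the MLP layer versus roughly 32,768 for the KAN layer, which is why KANs are typically used with smaller widths than MLPs.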



Grid extension
We can increase the number of "control points" in the B-spline to give it more "degrees of freedom" so that it can approximate more complex functions. This means we can extend the grid of an existing pre-trained network without retraining it from scratch.
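A minimal sketch of the idea outside of any network, using SciPy (the knot vectors and sizes are illustrative): fit a finer-grid spline so that it reproduces the coarse-grid spline it replaces.

```python
import numpy as np
from scipy.interpolate import BSpline, make_lsq_spline

k = 3
# A "pre-trained" coarse spline: few coefficients over a clamped knot vector.
coarse_knots = np.concatenate([[0] * k, np.linspace(0, 1, 6), [1] * k])
coarse_coeffs = np.random.randn(len(coarse_knots) - k - 1)
coarse = BSpline(coarse_knots, coarse_coeffs, k)

# Grid extension: choose a finer knot vector and least-squares fit new
# coefficients so the finer spline matches the coarse one on dense samples.
xs = np.linspace(0, 1, 500)
fine_interior = np.linspace(0, 1, 21)[1:-1]
fine_knots = np.concatenate([[0] * (k + 1), fine_interior, [1] * (k + 1)])
fine = make_lsq_spline(xs, coarse(xs), fine_knots, k)

print("max error after extension:", np.max(np.abs(fine(xs) - coarse(xs))))
```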



Interpretability



Continual learning



Thanks for watching!
Don’t forget to subscribe for
more amazing content on AI
and Machine Learning!

