
Module 10 - Learners Guide

The document discusses Edge AI and its applications, including the use of various hardware devices and optimization techniques such as pruning, clustering, and quantization. It also covers neuromorphic computing, comparing it with conventional AI, and introduces Edge Cloud, highlighting its benefits, use cases, and challenges. The content is aimed at providing insights into the architecture, implementation, and future trends of Edge AI and Edge Cloud technologies.


Edge AI / Embedded AI

20 April 2024
Agenda
• What is Edge AI?
• Why, where, when, and how we use it for different solutions
• Sample hardware devices
• Optimizations for target devices
  i. Board background knowledge – understanding the problem statement, target specifications
  ii. Porting
  iii. Optimizations for edge devices:
      i. Pruning
      ii. Clustering
      iii. Quantization
• Neuromorphic computing
  i. Von Neumann vs neuromorphic architectures
  ii. SNNs
  iii. Neuromorphic computing vs conventional AI (ANN/CNN)
  iv. Applications
  v. Example board
  vi. Demo videos
• Edge Cloud
  I. What is Edge Cloud?
  II. Why Edge Cloud?
  III. Use cases of Edge Cloud
  IV. Challenges
Edge AI
What is Edge AI ?
Conventional Edge Devices
Edge devices are categorized into two major types:
• Microcontroller (MCU)
• Microprocessor (MPU) – desktop PC, CPU, GPU

Casper system architecture of MCU and MPU

Automotive industry application


Why, where, when, and how we use it for different solutions
• Distributed systems
• Intelligence at Edge
• Cloud cost reduction
• Data security
• Processing at data generation spot
Sample edge devices

• Microcontrollers
• Microprocessors
• Raspberry Pi
• Sensors
• Smartphones
• Tablets
• Desktop PC
• GPU
• TPU
• ECU in cars
• Drones
• Connected devices
Background of Board knowledge & Porting

i. Background knowledge of the board required:

• Specifications of the board – RAM, ROM, processor
• Use-case priority – accuracy, latency, etc.
• Connectivity for the board
• IDE support for the board; what is the development environment – Python, C, C++

ii. Porting:
• Streamline the model development with the necessary optimizations specific to the target
• Check for porting support – binary / .py file / MLOps pipeline, etc.
• As it's an AI model, provision for future enhancements has to be considered
• Packages supporting the target board and the AI model
• ONNX model conversion (if needed)
iii. Optimizations

Need for optimization:

• Not all models are compatible with all hardware
• To realize the algorithm on a target with a lower memory footprint, we need to optimize it according to the use case
• It's always a trade-off between latency and accuracy when we optimize any model
• The chart below shows various optimization methods that can be implemented
• We will discuss three major methods in this session – pruning, quantization, and clustering

[Chart] AI model optimizations: Pruning, Quantization, Clustering, Matrix Decomposition, Gradient Scaling in Network Quantization
Pruning
Pruning is used to produce models with a smaller size for inference. It is implemented by removing unimportant connections or neurons. With its reduced size, the model becomes more memory- and energy-efficient and faster at inference, with minimal accuracy loss.
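As an illustration, the "remove unimportant connections" idea can be sketched as magnitude pruning in a few lines of NumPy. This is a minimal sketch of the concept, not the TensorFlow Model Optimization Toolkit API; the function name and threshold rule are our own.

```python
import numpy as np

def prune_by_magnitude(weights, sparsity=0.5):
    """Zero out the smallest-magnitude weights until roughly `sparsity`
    fraction of the entries are zero (ties may prune slightly more)."""
    flat = np.abs(weights).ravel()
    k = int(len(flat) * sparsity)
    if k == 0:
        return weights.copy()
    # The k-th smallest magnitude becomes the pruning threshold.
    threshold = np.partition(flat, k - 1)[k - 1]
    return np.where(np.abs(weights) <= threshold, 0.0, weights)

w = np.array([[0.9, -0.05, 0.4],
              [-0.01, 0.7, 0.02]])
pw = prune_by_magnitude(w, sparsity=0.5)  # half the entries become zero
```

In practice the pruned model is then fine-tuned for a few epochs to recover any lost accuracy, and the zeroed weights compress very well.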
Clustering

• Clustering works by grouping the weights of each layer in a model into a predefined
number of clusters, then sharing the centroid values for the weights belonging to each
individual cluster. This reduces the number of unique weight values in a model, thus
reducing its complexity.
• As a result, clustered models can be compressed more effectively, providing
deployment benefits similar to pruning.
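The grouping-and-centroid-sharing step described above can be sketched with a tiny 1-D k-means over a layer's weights. This is illustrative only; TensorFlow's clustering API wraps this idea differently, and the helper below is our own.

```python
import numpy as np

def cluster_weights(weights, n_clusters=4, n_iter=20):
    """Group weights into n_clusters and replace each weight with its
    cluster centroid (weight sharing), via a simple 1-D k-means."""
    flat = weights.ravel().astype(float)
    # Initialise centroids evenly across the weight range.
    centroids = np.linspace(flat.min(), flat.max(), n_clusters)
    for _ in range(n_iter):
        # Assign each weight to its nearest centroid, then recompute.
        assign = np.argmin(np.abs(flat[:, None] - centroids[None, :]), axis=1)
        for c in range(n_clusters):
            members = flat[assign == c]
            if members.size:
                centroids[c] = members.mean()
    assign = np.argmin(np.abs(flat[:, None] - centroids[None, :]), axis=1)
    return centroids[assign].reshape(weights.shape)

w = np.random.default_rng(0).normal(size=(8, 8))
cw = cluster_weights(w, n_clusters=4)  # at most 4 unique weight values
```

Because the clustered layer holds at most `n_clusters` unique values, it can be stored as small indices plus a centroid table, which is where the compression benefit comes from.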

Development workflow
• As a starting point, check whether the models in hosted models can work for your application. If not, it's recommended to start with the post-training quantization tool, since this is broadly applicable and does not require training data.
• For cases where the accuracy and latency targets are not met, or hardware accelerator support is important, quantization-aware training is the better option. If you want to further reduce your model size, you can try pruning and/or clustering prior to quantizing your models.
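The workflow above can be condensed into a small decision helper. This is purely illustrative; the function and parameter names are our own and not part of any library.

```python
def choose_optimization(has_training_data, targets_met_with_ptq, needs_accelerator):
    """Pick an optimization route following the development workflow:
    start with post-training quantization (PTQ) when possible, and fall
    back to quantization-aware training (QAT) when accuracy/latency
    targets are missed or accelerator support matters."""
    if not has_training_data:
        # PTQ is broadly applicable and needs no training data.
        return "post-training quantization"
    if targets_met_with_ptq and not needs_accelerator:
        return "post-training quantization"
    # Targets missed, or hardware accelerator support is important.
    return "quantization-aware training (optionally after pruning/clustering)"
```

Pruning and clustering can be applied before either route to shrink the model further.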
Quantization
Quantization refers to reducing the precision of weights, parameters, biases, and activations so that they occupy less memory and the model size is reduced. Usually it replaces float32 parameters and inputs with other types, such as float16, INT32, INT16, INT8, INT4, INT1, etc.
There are two ways to perform quantization.
• Post Training Quantization
• Quantization Aware Training
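The underlying arithmetic can be sketched as standard affine (scale/zero-point) quantization to INT8. This is a minimal sketch of the concept; real frameworks apply per-tensor or per-channel variants of this with additional calibration.

```python
import numpy as np

def quantize_int8(x):
    """Affine quantization of a float32 tensor to int8, returning the
    quantized values plus the scale/zero-point needed to dequantize."""
    lo, hi = float(x.min()), float(x.max())
    lo, hi = min(lo, 0.0), max(hi, 0.0)      # range must include zero
    scale = (hi - lo) / 255.0 or 1.0         # map range onto 256 int8 levels
    zero_point = int(round(-128 - lo / scale))
    q = np.clip(np.round(x / scale) + zero_point, -128, 127).astype(np.int8)
    return q, scale, zero_point

def dequantize(q, scale, zero_point):
    return (q.astype(np.float32) - zero_point) * scale

x = np.array([-1.0, 0.0, 0.5, 1.0], dtype=np.float32)
q, s, z = quantize_int8(x)
x_hat = dequantize(q, s, z)   # recovers x to within one quantization step
```

The round-trip error is bounded by the scale (one quantization step), which is why quantization trades a small accuracy loss for a 4x size reduction from float32 to INT8.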
Types of Quantization
Post-Training Quantization (PTQ)
• Quantizing an already-trained model.
• Weights and activations are quantized for deployment.
• Significant reduction in model size
• Loss of accuracy
Quantization-Aware Training (QAT)
• Quantization is considered during the training process
• Requires modification of the training pipeline
• Better accuracy

Comparison of Quantization-Aware Training (left) and Post-Training Quantization (right)
Comparison of quantization methods

| Technique | Data requirements | Size reduction | Accuracy | Supported hardware |
| Post-training float16 quantization | No data | Up to 50% | Insignificant accuracy loss | CPU, GPU |
| Post-training dynamic range quantization | No data | Up to 75% | Smallest accuracy loss | CPU, GPU (Android) |
| Post-training integer quantization | Unlabelled representative sample | Up to 75% | Small accuracy loss | CPU, GPU (Android), EdgeTPU, Hexagon DSP |
| Quantization-aware training | Labelled training data | Up to 75% | Smallest accuracy loss | CPU, GPU (Android), EdgeTPU, Hexagon DSP |

Sample optimization methods using the TensorFlow library, shown in the decision tree
A few model quantizations done for common CNN models

| Model | Top-1 Accuracy (Original) | Top-1 Accuracy (Post-Training Quantized) | Top-1 Accuracy (Quantization-Aware Training) | Latency Original (ms) | Latency PTQ (ms) | Latency QAT (ms) | Size Original (MB) | Size Optimized (MB) |
| Mobilenet-v1-1-224 | 0.709 | 0.657 | 0.70 | 124 | 112 | 64 | 16.9 | 4.3 |
| Mobilenet-v2-1-224 | 0.719 | 0.637 | 0.709 | 89 | 98 | 54 | 14 | 3.6 |
| Inception_v3 | 0.78 | 0.772 | 0.775 | 1130 | 845 | 543 | 95.7 | 23.9 |
| Resnet_v2_101 | 0.770 | 0.768 | N/A | 3973 | 2868 | N/A | 178.3 | 44.9 |

Neuromorphic Computing
Von Neumann vs Neuromorphic Computing
Evolution of SNNs

Neuromorphic Market trends


Spiking Neural Networks(SNN)
The idea is that neurons in the SNN do not transmit information at each propagation cycle (as happens with typical multi-layer perceptron networks), but rather transmit information only when a membrane potential—an intrinsic quality of the neuron related to its membrane electrical charge—reaches a specific value, called the threshold. When the membrane potential reaches the threshold, the neuron fires and generates a signal that travels to other neurons which, in turn, increase or decrease their potentials in response to this signal. A neuron model that fires at the moment of threshold crossing is also called a spiking neuron model.
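The fire-at-threshold behavior described above can be sketched as a leaky integrate-and-fire (LIF) neuron, a common SNN neuron model. The leak, threshold, and input values below are arbitrary illustrative choices, not from any particular neuromorphic platform.

```python
def lif_neuron(input_current, threshold=1.0, leak=0.9, v_rest=0.0):
    """Leaky integrate-and-fire neuron: the membrane potential decays
    toward rest, integrates incoming current, and emits a spike (then
    resets) whenever it crosses the threshold."""
    v = v_rest
    spikes = []
    for i in input_current:
        v = leak * v + i          # leak, then integrate the input
        if v >= threshold:        # threshold crossing -> fire
            spikes.append(1)
            v = v_rest            # reset after the spike
        else:
            spikes.append(0)
    return spikes

# A steady sub-threshold input produces sparse, periodic spikes.
spikes = lif_neuron([0.3] * 10)   # -> [0, 0, 0, 1, 0, 0, 0, 1, 0, 0]
```

Note that the output is sparse and event-driven: the neuron stays silent most of the time, which is the source of the power-efficiency argument for SNNs.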

Neuromorphic Computing
Neuromorphic computing is putting together synthetic neurons that operate
according to the same principles as the human brain.

It operates on Spiking Neural Networks (SNNs), where each "neuron" communicates with other neurons independently. It imitates the organic neural networks found in living brains. Each "neuron" in the SNN can fire independently of the others, and when it does, it sends pulsed signals to other neurons in the network that directly alter the electrical states of those neurons.
Neuromorphic vs Conventional AI

| Sl no | Neuromorphic Computing | Conventional AI |
| 1 | Designed to emulate the structure and behavior of biological neural networks. | Typically uses digital processors, such as CPUs and GPUs, following the von Neumann architecture; memory and processing units are separate. |
| 2 | Emphasizes event-driven and asynchronous processing, where computations occur when inputs or stimuli are detected, similar to how neurons fire in response to signals in the brain. | Processing involves executing software-based algorithms in a sequential or parallel manner; algorithms are designed to process data and make decisions based on predefined rules or learned patterns. |
| 3 | Neuromorphic systems often use spiking neurons and synapses to represent and transmit data. | Conventional AI relies on fixed-precision arithmetic and symbolic representations for data processing. |
| 4 | Aims to achieve high energy efficiency by minimizing data movement, taking advantage of analog and continuous computations, and leveraging the brain's efficient signal-processing mechanisms. | Processing can be energy-intensive due to high-speed data transfer and complex computations, although optimizations and advancements in hardware have improved efficiency. |
ANN vs SNN

SNN:
• Spikes as fundamental units
• Temporal coding
• Sparse and event-driven
• Integration and propagation

ANN:
• Continuous values
• Vector representations
• Fixed precision
• Feedforward processing and backpropagation

** SNN is better than ANN in terms of power efficiency.


Neuromorphic Devices for SNN
Real World Applications
Industries that can be targeted:

1. Home appliances – smart home edge intelligence
2. Automotive
3. Healthcare
4. Aerospace
5. Security
etc.

Automotive industry application


BrainChip Akida Raspberry Pi EvalKit

Akida MiniPCIe card

Demos on the Akida EvalKit:

1. Visual wake-word demo (detect a person standing in the camera's field of view)
2. Edge learning demo
3. Edge face recognition demo
4. Detection demo

Demo videos

We have tested all the demos mentioned above using the BrainChip Akida Raspberry Pi EvalKit.
Edge Cloud
Edge Cloud
What is Edge Cloud ?

• Edge computing describes the process of bringing compute and storage elements closer to the network edge. Edge cloud goes a step further and uses a cloud architecture for that same process.
• Edge cloud computing extends the convenience of the cloud to edge networks. Edge clouds are hosted by micro data centers that store, analyze, and process data faster than is possible using a connection to a data center.
• An edge cloud strategy places intelligent edge nodes closer to local resources, equipment, and devices, with software to deliver services in a way that's like using public cloud services.

Reference: Webinar Recording: How to Build a Basic Edge Cloud (youtube.com)
Why Edge Cloud

•Faster response times. Data within a cloud edge network can be processed and consumed close to where it
is generated, enabling faster response times for enhanced user experiences.

•Optimized bandwidth. By processing more workloads locally, cloud edge networks minimize the need to
transmit massive amounts of data to centralized servers, reducing network usage and lowering bandwidth
needs.

•Increased security. By processing and storing data locally, cloud edge networks limit the distance sensitive
information must travel, and minimize its exposure to threats.

•Simpler data governance. Many countries and jurisdictions have different requirements for how data such as
customer records can be collected, used, stored, protected, and retained. When data travels long distances to
reach cloud data centers, managing these data sovereignty mandates can be complex and time-consuming.
Cloud edge computing simplifies data governance by processing and storing data locally.

•More flexibility and scalability. Cloud edge networks make it easy to scale applications and to run modern apps built on containers or existing apps on virtual machines, all within a single platform.

•Real-time insight. Because cloud edge computing enables data to be processed faster and delivered with
quicker response times, solutions such as analytics platforms can deliver insight to end users with greater
speed and timeliness.
Use cases for Edge Cloud & Challenges
Areas where we can implement Edge Cloud solutions :

• Multimedia experiences
• Manufacturing
• Self-driving vehicles
• Healthcare
• Smart city solutions

Challenges of Edge Cloud :

As a complex and critical system within IT infrastructure, edge cloud presents some challenges:

•Edge device management: Managing many edge devices across multiple locations can be resource
intensive. Ensuring proper configuration, security updates, and maintenance of servers poses a challenge.

•Network connectivity and reliability: Edge cloud relies on a stable connection between the servers at the
edge of the network and a centralized cloud. In remote environments, maintaining consistent network
connectivity may be difficult, leading to unreliable access to cloud services.

•Scalability and resource management: Scaling resources at the edge to handle fluctuations in demand
can be difficult. Achieving seamless resource orchestration and load balancing in a distributed environment
requires careful coordination.
References for Edge Cloud

Links:
https://www.hpe.com/in/en/what-is/edge-to-cloud.html
https://www.ciena.com/insights/what-is/What-is-Edge-Cloud.html
https://www.intel.com/content/www/us/en/edge-computing/edge-cloud.html#articleparagraph_375506681
https://www.vmware.com/topics/glossary/content/edge-cloud.html#:~:text=Edge%20cloud%20is%20cloud%20computing,or%20private%20cloud%20for%20processing
Thank You

www.tataelxsi.com

Confidentiality Notice
This document and all information contained herein is the sole property of Tata Elxsi
Limited and shall not be reproduced or disclosed to a third party without the express
written consent of Tata Elxsi Limited.
