Trying to figure out MCP by
actually building an app from
scratch with open source and SLMs
Julien Simon, Chief Evangelist
julien@arcee.ai
https://www.julien.org
https://github.com/juliensimon/smolagents-mcp-demo
Just another remote procedure call protocol 👴
1982: UNIX RPC – First mainstream RPC, used in NFS.
1994: CORBA – Early cross-language object middleware.
2000: REST – Dominant HTTP API style.
2000: SOAP – Enterprise XML-based APIs.
2007: Thrift – Multi-language RPC.
2010: WebSockets – Real-time, bidirectional communication.
2014: GraphQL – Client-driven data queries.
2015: gRPC – Protobuf + HTTP/2, high-performance.
2024: MCP – Standardizes AI data integration ⬅ YOU ARE HERE
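To make the "just another RPC" point concrete: MCP messages are JSON-RPC 2.0, and a tool invocation is a request whose method is `tools/call`. The sketch below only builds and serializes such a request. The tool name `get_weather` and its `city` argument are hypothetical, and no transport (stdio or HTTP) or session initialization is shown.

```python
import json
from itertools import count

# Monotonic request ids, so responses can be matched to requests.
_ids = count(1)


def make_tool_call(tool_name: str, arguments: dict) -> str:
    """Serialize a JSON-RPC 2.0 request asking an MCP server to run a tool."""
    request = {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }
    return json.dumps(request)


# Hypothetical tool and argument, for illustration only.
message = make_tool_call("get_weather", {"city": "Paris"})
print(message)
```

Apart from the method name and the `params` layout, this is the same shape as any other JSON-RPC call, which is the slide's point.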
Food for thought
Security, Trust, Auditing - CRITICAL for enterprise adoption
• What measures are in place to authenticate and validate the servers your application communicates with?
• How do you ensure each function is accessible only to authorized users under appropriate conditions?
• How do you establish clear identity management to attribute actions accurately within MCP systems?
Discoverability & Routing
• How does your application discover and connect to remote servers dynamically?
• How do you determine the most suitable server and function(s) for each task within your application?
Versioning & Compatibility
• How can you ensure that updates do not disrupt existing functionalities for users?
• How does MCP live alongside other protocols (REST, OpenAI-style function calling, etc.)?
Documentation & Usability
• How do you ensure that function descriptions are detailed and understandable for models?
Performance & Cost Management
• How do you optimize latency and token consumption in agentic systems?
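One possible starting point for the authorization and auditing questions is a deny-by-default allow-list that the application checks before it forwards any tool call. This is a minimal sketch, not something MCP prescribes: the `ToolPolicy` class, the role names, and the tool names are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class ToolPolicy:
    # Hypothetical schema: tool name -> set of roles allowed to call it.
    allowed_roles: dict[str, set[str]] = field(default_factory=dict)
    # Every decision is recorded as (user, tool, allowed) for later auditing.
    audit_log: list[tuple[str, str, bool]] = field(default_factory=list)

    def authorize(self, user: str, role: str, tool: str) -> bool:
        """Return True if `role` may call `tool`; unknown tools are denied."""
        allowed = role in self.allowed_roles.get(tool, set())
        self.audit_log.append((user, tool, allowed))
        return allowed


# Hypothetical roles and tools, for illustration only.
policy = ToolPolicy({"read_report": {"analyst", "admin"}, "delete_report": {"admin"}})
print(policy.authorize("alice", "analyst", "read_report"))
print(policy.authorize("alice", "analyst", "delete_report"))
```

Keeping the check in the client application means it applies regardless of which server ends up handling the call. It does not address the first question, authenticating the server itself.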
Arcee AI - Post-trained models
State-of-the-art tech stack based on open-source libraries
Spectrum (continuous pre-training), MergeKit (merging), DistillKit (distillation), EvolKit (dataset improvement)
Best-in-class models based on open-source architectures
Hugging Face OpenLLM Leaderboard benchmarks
Llama 3.1 70B: 🥇 Best 70B model
Qwen2 1.5B: 🥇 Best 1.5B model
Llama 3.1 8B: 🥇 Best 8B model
Qwen2.5 14B: 🥇 Best 14B model
Qwen2 72B: 🥇 Best Arabic model
Arcee Foundation Models (AFM)
https://www.together.ai/models/afm-4-5b-preview
https://www.arcee.ai/blog/announcing-the-arcee-foundation-model-family
https://www.arcee.ai/blog/deep-dive-afm-4-5b-the-first-arcee-foundational-model
https://www.arcee.ai/blog/extending-afm-4-5b-to-64k-context-length
AFM-4.5B-Preview
https://api.together.ai/models/arcee-ai/AFM-4.5B-Preview
AFM-4.5B-Preview vs. Qwen-3-4B
8/10. Tie on Industrials, loss on Communication Services.
200 questions generated by Claude Sonnet 3.7
20 questions for each one of the top 10 industries in the S&P 500
Judge: DeepSeek-R1 (670B)
https://github.com/juliensimon/radar-evaluator
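A score such as "8/10" counts industries, not questions. One way to get there is to reduce the 20 judged questions in each industry to a single outcome and then count outcomes. The sketch below assumes a simple majority rule over per-question win/loss/tie verdicts. That rule, the function names, and the sample data are assumptions for illustration; the radar-evaluator repository may aggregate differently.

```python
from collections import Counter


def industry_outcome(verdicts: list[str]) -> str:
    """Reduce per-question verdicts ('win', 'loss', 'tie') to one outcome."""
    counts = Counter(verdicts)
    if counts["win"] > counts["loss"]:
        return "win"
    if counts["win"] < counts["loss"]:
        return "loss"
    return "tie"


def scorecard(results: dict[str, list[str]]) -> Counter:
    """Count how many industries ended as a win, a tie, or a loss."""
    return Counter(industry_outcome(v) for v in results.values())


# Hypothetical verdicts: 3 industries, 20 judged questions each.
results = {
    "Energy": ["win"] * 12 + ["loss"] * 8,
    "Industrials": ["win"] * 10 + ["loss"] * 10,
    "Communication Services": ["win"] * 7 + ["loss"] * 13,
}
card = scorecard(results)
print(f"{card['win']}/{len(results)} industries won, {card['tie']} tied, {card['loss']} lost")
```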
AFM-4.5B-Preview vs. Google Gemma-3n-E4B-it
8/10, tie on Healthcare, loss on IT
200 questions generated by Claude Sonnet 3.7
20 questions for each one of the top 10 industries in the S&P 500
Judge: DeepSeek-R1 (670B)
https://github.com/juliensimon/radar-evaluator
AFM-4.5B-Preview vs. Llama-3.2-8B
10/10 😃
200 questions generated by Claude Sonnet 3.7
20 questions for each one of the top 10 industries in the S&P 500
Judge: DeepSeek-R1 (670B)
https://github.com/juliensimon/radar-evaluator
AFM-4.5B-Preview vs. Mixtral-8x7B-Instruct
Almost tied (4/10) with 8% of Mixtral’s size
200 questions generated by Claude Sonnet 3.7
20 questions for each one of the top 10 industries in the S&P 500
Judge: DeepSeek-R1 (670B)
https://github.com/juliensimon/radar-evaluator
Julien Simon, Chief Evangelist
julien@arcee.ai
https://www.julien.org
https://github.com/juliensimon/smolagents-mcp-demo
Models on Hugging Face;
OpenRouter and Together AI
Chat with AFM
AFM blog post
