
Take Home Assessment Task

Summary of task:
You need to build a user-facing, chat-style RAG agentic system in which the user asks queries to an LLM
and the LLM provides the answer using the given tools (tool calls/function calls). The condition is that the
data must be up to date daily, i.e. you will also need to build a data pipeline which updates data daily into
your local MySQL or vector database (a vector DB is out of the scope of the task, but if you do it, that's good).

Concepts/Required Knowledge to complete the task


1. Asynchronous Programming in Python

2. Setting up MySQL Database & Querying the Database

When querying the database, avoid ORMs and use raw SQL queries. You do not need to know
each and every query; simple SELECTs, data filtering, and basic table joins are sufficient.

3. Creating Basic APIs in Python (FastAPI will be the simplest choice here) or a websocket-based
interface with the LLM.

4. Agentic LLM inference

For this task, an agent is simply an LLM with predetermined tools/functions [ex:
fetch_result_from_google(query: str), a function which the agent can use to query Google by
giving us the query argument] at its disposal to solve tasks.
If you are a good developer and are familiar with everything except this agentic part, just
understand it like this: in the LLM system prompt we tell the LLM to use functions based on the
user query, and we give the LLM the function schemas (a function schema is just the definition
of a function: basically the name of the function, the arguments it takes, and the types of those
arguments). The LLM returns which function to use and with what arguments in JSON; we parse
the JSON, execute the function (a dict lookup by function name is safer than Python's eval();
see the sketch after this list), and return the result back to the LLM, which then summarizes the
results and gives the answer to the user.

5. Basic Data Processing

What is a data pipeline? In the simplest definition, it is nothing but the automation of data
processing, from downloading the data to cleaning it to pushing it into the database, with
near-zero manual input once the pipeline is created.

You do not need to set up cron jobs/schedulers for this task; during the demo we will ask you to
just run the pipeline, and we will wait for the data to update in the DB.

6. Ollama for running a local LLM (use qwen2.5 0.5b or 1b) if you don't have an OpenAI key. We advise
using a local LLM for testing purposes, as the OpenAI APIs will be costly.

7. Optional: Basic HTML, CSS, and JS for the chat and input box.

Note: you do not need any fancy, good-looking UI; the simplest plain white HTML page
with a result box and an input box will be perfect, as long as we can chat with the agent
(LLM) through the API. You can use AI for this portion. Streamlit, Gradio, and Solara all work.
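Since points 4 and 6 go together, here is a minimal sketch of the setup: the barebones OpenAI Python client pointed at a local Ollama endpoint, plus one function schema. The tool name get_documents_by_topic and its argument are placeholders for illustration, not part of the assignment.

import asyncio
from openai import AsyncOpenAI

# Point the OpenAI client at the local Ollama endpoint instead of the OpenAI
# API endpoint. Ollama ignores the api_key, but the client requires one.
client = AsyncOpenAI(base_url="http://localhost:11434/v1", api_key="ollama")

# A function schema: just the name of the function, the arguments it takes,
# and the types of those arguments.
TOOLS = [{
    "type": "function",
    "function": {
        "name": "get_documents_by_topic",  # placeholder tool
        "description": "Fetch documents about a topic from the local MySQL DB.",
        "parameters": {
            "type": "object",
            "properties": {"topic": {"type": "string"}},
            "required": ["topic"],
        },
    },
}]

async def ask(query: str):
    response = await client.chat.completions.create(
        model="qwen2.5:0.5b",
        messages=[{"role": "user", "content": query}],
        tools=TOOLS,
    )
    # The model either answers directly or asks for a tool call in JSON.
    return response.choices[0].message

msg = asyncio.run(ask("What executive documents mention artificial intelligence?"))
print(msg.tool_calls or msg.content)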

Outside of the Scope OR Non-required Stuff


1. Using the OpenAI API (a local LLM works)
2. Scraping for data gathering (avoid it; use any data which you can get from a free API and which
updates daily).
Note: weather data APIs do not count.
3. Vector Database Setup
4. Logging of user- and chat-specific agent activity in a queryable format (sounds simple, but it is
complicated and out of demo scope).
5. Fully functional APIs with auth and MongoDB history management.
This would go way beyond the scope of the demo and become a full-on project.
The API does not need to be fully functional; the only required function is that when we hit the
API we can chat with the agent, that's it. History management is not needed, as you would have
to set up a chat_id and user_id system, which again goes beyond the demo. If you do want it,
just use plain SQLite or even a JSON file with demo sample chat_id and user_id values.
You don't even need to make FastAPI APIs; you can use websockets and it will still count. All
that is needed is an interface with the agent.
6. Agentic libraries such as pyautogen, autogen-chat, and langchain are not needed; you can
use the barebones OpenAI Python client (it supports local Ollama; all you need to do is change the
base URL from the OpenAI API endpoint to your local Ollama API endpoint). In fact, we would advise
you to use the barebones OpenAI Python client, as agentic libraries are bloated and complicate
simple stuff unnecessarily, and the OpenAI Python client supports async by default. That said, there
is no specific guideline here; complexity is subjective, and you can use any framework you
like/prefer for making the agent, as long as it supports fully async, non-blocking functionality,
either built in or implemented manually by you. Do note that if you go with the OpenAI client
approach, you will need to build a tool call parser and executor, which is available by default in
almost all agentic libraries.

Conditions

Compulsory Conditions

1. Agent function/tool calls should not be visible to the end user.


2. Data must be daily updating. Static data won't count. Do note that daily updating here does not
mean that if you run the pipeline today you must have data with today's date; it can be a week old.
For the majority of systems, the data update frequency is between 1 day and a week.
3. No code interpreter/execution by the LLM. Only tool/function calls.
4. The agent will not query the data API directly; the agent can only use your MySQL data and any
other utils that you may provide.

Optional but preferred Condition

1. Everything must be async and non-blocking (in the agent and the user-agent communication
interface; for the pipeline it is OK to go with sync, as this is just a demo).

Task Breakdown.
The task is divided mainly into 4 parts.

1. Data pipeline
2. Agent Creation with tool calls functionality
3. API/Communication interface with Agent
4. Basic UI.

1. Data pipeline
The choice of data will determine your system and end project. Here are a few data sources which
meet all the conditions for the required data.
Of course, you can use whatever data you see fit as long as it meets the conditions.

1. Federal Register

This is the USA Federal Register data API, which contains executive documents and other
register-related data. If you use this data, then user queries would, for example, be: what
are the new executive orders by president donald trump this month and
summarize them for me.
Note: queries won't be related to semantic similarity, as we have suggested not setting up
a vector database. Example: what are the executive documents related to
artificial intelligence and security in all the past years

This data updates daily, and you can query the API to fetch date-specific updates (documents
published after or before a given date), and it is free.
You only need to get the 2025 dataset.

2. FDA DRUG RELATED DATA

You can use any dataset from them related to adverse effects of drugs, reported incidents,
or diseases.

Best practices in the data pipeline.

This will make development easy

1. Downloaders and processors must be separate (see the sketch after this list).


2. Pipeline records must be kept for at least 1 week (example: the daily downloaded raw data
and the processed data).
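A minimal sketch of that separation, written sync for simplicity (the task allows a sync pipeline). The Federal Register endpoint and query parameters are assumptions based on its public API docs; verify them before relying on this, and the fields kept by the processor are placeholders.

import datetime
import json
import pathlib
import requests

RAW_DIR = pathlib.Path("data/raw")
PROCESSED_DIR = pathlib.Path("data/processed")

def download(day: str) -> pathlib.Path:
    """Downloader: fetch one day's documents and save the raw JSON untouched."""
    # Endpoint and params are assumptions based on the Federal Register's
    # public API documentation.
    resp = requests.get(
        "https://www.federalregister.gov/api/v1/documents.json",
        params={"conditions[publication_date][is]": day, "per_page": 100},
        timeout=30,
    )
    resp.raise_for_status()
    RAW_DIR.mkdir(parents=True, exist_ok=True)
    path = RAW_DIR / f"{day}.json"
    path.write_text(resp.text)
    return path

def process(raw_path: pathlib.Path) -> pathlib.Path:
    """Processor: read the raw JSON, keep only the fields we need for MySQL."""
    docs = json.loads(raw_path.read_text()).get("results", [])
    rows = [
        {"title": d.get("title"), "type": d.get("type"),
         "publication_date": d.get("publication_date")}
        for d in docs
    ]
    PROCESSED_DIR.mkdir(parents=True, exist_ok=True)
    out = PROCESSED_DIR / raw_path.name
    out.write_text(json.dumps(rows, indent=2))
    return out

if __name__ == "__main__":
    today = datetime.date.today().isoformat()
    process(download(today))  # a loader step would then INSERT rows into MySQL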

2. Agent Creation with tool calls.


"Agent" is really a fancy way of saying an LLM with a specific task and tools at its disposal
to solve it.

You will find many tutorials for this, and you can use any framework.
If you do plan to use the barebones OpenAI Python client or direct aiohttp requests to the
OpenAI/local Ollama endpoint, you will need to build a system to parse/execute the LLM's generated
tool calls, which might be a little extra work, but it is not that hard. All you need to do is parse
the tool call request, execute the function, and return the result.
Make sure to only execute functions which you have defined; otherwise eval() will be very
dangerous to use.
The basic overview is:
User query -> LLM; the LLM thinks and decides whether it needs a tool and, if yes, which tool -> tool
call response -> your system executes the tool -> returns the result back to the LLM -> the LLM again
decides whether to respond (here meaning: answer the user's query) or execute another tool; say it
decides to respond -> returns the response -> user.
If you plan to use an agentic library, that is OK too.
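A minimal sketch of that loop, reusing the client and TOOLS from the earlier sketch. get_documents_by_topic is still a placeholder, and the dict lookup is the "only execute functions which you have defined" rule in code, avoiding eval() entirely.

import json

async def get_documents_by_topic(topic: str) -> str:
    # Placeholder: in the real system this would run a raw SQL query
    # against your MySQL data (see the aiomysql sketch near the end).
    return f"no documents found for {topic!r}"

# Whitelist of callable tools: a dict lookup instead of eval(), so only
# functions you have defined can ever be executed.
TOOL_REGISTRY = {"get_documents_by_topic": get_documents_by_topic}

async def run_agent(query: str) -> str:
    messages = [{"role": "user", "content": query}]
    while True:
        response = await client.chat.completions.create(
            model="qwen2.5:0.5b", messages=messages, tools=TOOLS,
        )
        msg = response.choices[0].message
        if not msg.tool_calls:       # no tool needed: this is the final answer
            return msg.content
        messages.append(msg)         # keep the tool-call turn in the history
        for call in msg.tool_calls:
            fn = TOOL_REGISTRY[call.function.name]      # whitelist lookup
            args = json.loads(call.function.arguments)  # parse the JSON args
            result = await fn(**args)                   # execute the tool
            messages.append({
                "role": "tool",
                "tool_call_id": call.id,
                "content": str(result),
            })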

3. API/Communication interface with Agent & 4. Basic UI

Pretty simple and straightforward: you just need to connect users with the LLM using an API or a
websocket.
In simple terms: the user enters a query in the UI -> the UI calls your API -> the API gives the user's
query to the LLM -> the same process as in (2) happens -> the API returns the result to the UI -> the
UI renders it and shows the response to the user.
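A minimal sketch of the interface, assuming FastAPI and the run_agent coroutine from the previous sketch. Tool calls happen entirely server-side inside run_agent, so the end user only ever sees the final answer (compulsory condition 1).

from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()

class ChatRequest(BaseModel):
    query: str

@app.post("/chat")
async def chat(req: ChatRequest):
    # run_agent executes any tool calls internally; only the final
    # summarized answer is returned, so tool calls are never visible
    # to the end user.
    answer = await run_agent(req.query)
    return {"response": answer}

# run with: uvicorn main:app --reload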

Overall
Data Pipeline -> MySQL: fills in the data daily.

User Query -> YOUR INTERFACE API -> LLM -> tool call using MySQL to get the user-requested data ->
LLM summary -> Response -> the API returns the result to the UI.

Other Important Notes:


Treat it as a demo, not a full project. You can cut corners, for example:
in the data pipeline, only get the past 2 months of data instead of the full 2025;
if you are not able to understand async programming, then stick to sync.
We are aware of the complexity of the task; this is moderately complex, but it is an actual
production/real-world-grade task.
Asynchronous programming is a big ask here. The original task does enforce a fully async system,
but if you are able to architect the overall system yet have a hard time with async programming,
then you can stick to sync.
Using an LLM for code help and for understanding the project is allowed, but make sure to
understand the system design yourself, as this is not a CRUD task.
At the end, all we want to see is whether you can design the actual system. And do note that the
prototype does not need to be perfect; it can be improved later.

A few async libraries which will be useful for your task


1. aiohttp (for making requests)
2. aiofiles (for writing to files)
3. aiomysql (for async MySQL queries; use this one, as a lot of documentation is available for it)
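A minimal aiomysql sketch of the raw-SQL querying mentioned in the required-knowledge list; the table, columns, and credentials are placeholders for your own schema.

import asyncio
import aiomysql

async def fetch_documents(topic: str):
    # Credentials and schema are placeholders; adjust to your local setup.
    pool = await aiomysql.create_pool(
        host="127.0.0.1", user="root", password="secret", db="registry"
    )
    async with pool.acquire() as conn:
        async with conn.cursor(aiomysql.DictCursor) as cur:
            # Raw SQL with a parameterized filter: a simple SELECT is enough.
            await cur.execute(
                "SELECT title, publication_date FROM documents "
                "WHERE title LIKE %s ORDER BY publication_date DESC LIMIT 10",
                (f"%{topic}%",),
            )
            rows = await cur.fetchall()
    pool.close()
    await pool.wait_closed()
    return rows

print(asyncio.run(fetch_documents("artificial intelligence")))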
