RAI AI Engineer Intern Assignments
Thank you for your interest in our internship opening. As the next step in the screening process, please complete the assignments described below.
Role: AI Engineer Intern
Duration: 48 hours
Data Link: Please Click Here to access the data for all the tasks below.
Task 1: OCR
Complexity: Easy
User Story: As a user, I should provide a path to an image, and the program should display the text from the image. (You are free to use open-source models and code, but please ensure there is no complete copy-paste.)
User Story: As a user, I should provide a path to a PDF, and the program should display the text from the PDF. (You are free to use open-source models and code, but please ensure there is no complete copy-paste.)
Hints: Table information should go in the List_items section; other information can be used as the headers of the JSON file. Submit the Colab notebook/code file and the JSON file. A sketch of one possible approach follows.
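A minimal sketch, assuming pytesseract (which needs a local Tesseract install) for images and pypdf for text-based PDFs. The output.json layout with a List_items key follows the hint above; filling List_items would need task-specific table parsing.

```python
# Minimal OCR sketch: image -> text via pytesseract, PDF -> text via pypdf.
# Assumes Tesseract is installed locally (e.g., apt-get install tesseract-ocr).
import json
import sys

from PIL import Image
import pytesseract
from pypdf import PdfReader


def ocr_image(path: str) -> str:
    """Run Tesseract OCR on a single image file."""
    return pytesseract.image_to_string(Image.open(path))


def extract_pdf_text(path: str) -> str:
    """Extract embedded text from a PDF (scanned PDFs would need OCR per page)."""
    reader = PdfReader(path)
    return "\n".join(page.extract_text() or "" for page in reader.pages)


if __name__ == "__main__":
    path = sys.argv[1]
    text = extract_pdf_text(path) if path.lower().endswith(".pdf") else ocr_image(path)
    print(text)
    # Skeleton of the suggested JSON layout: header fields at the top level,
    # table rows under "List_items" (populating them needs task-specific parsing).
    result = {"List_items": [], "raw_text": text}
    with open("output.json", "w") as f:
        json.dump(result, f, indent=2)
```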
Task 3: Crowd Detection in Video
User Story: Train an AI model to detect persons in a video, then analyze the detections to identify crowds. This involves leveraging pre-trained object detection models and applying custom logic to detect and log crowd events.
● A crowd is defined as three or more persons standing close to each other for 10 consecutive frames.
● Identify groups of three or more persons standing close together in a frame.
● Check if the identified groups persist for at least 10 consecutive frames.
● If a crowd is detected, log the frame number and the count of persons in the crowd.
● Save the results in a CSV file with the columns Frame Number and Person Count in Crowd (a sketch of one pipeline follows this list).
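One possible pipeline, sketched under several assumptions: Ultralytics' pretrained YOLOv8 COCO model for person detection (class 0), a fixed pixel-distance threshold DIST_THRESH as the definition of "close", and no identity tracking across frames (a stricter solution would track person IDs, e.g., with ByteTrack).

```python
# Crowd-detection sketch: YOLOv8 person detection + simple proximity grouping.
import csv
from itertools import combinations

import cv2
from ultralytics import YOLO

DIST_THRESH = 100   # max centroid distance (px) to count as "close" -- tune per video
MIN_PERSONS = 3
MIN_FRAMES = 10


def group_sizes(centroids):
    """Union-find grouping of centroids by pairwise distance; returns group sizes >= MIN_PERSONS."""
    parent = list(range(len(centroids)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(centroids)), 2):
        (x1, y1), (x2, y2) = centroids[i], centroids[j]
        if ((x1 - x2) ** 2 + (y1 - y2) ** 2) ** 0.5 <= DIST_THRESH:
            parent[find(i)] = find(j)
    sizes = {}
    for i in range(len(centroids)):
        root = find(i)
        sizes[root] = sizes.get(root, 0) + 1
    return [s for s in sizes.values() if s >= MIN_PERSONS]


model = YOLO("yolov8n.pt")
cap = cv2.VideoCapture("input.mp4")   # placeholder video path
streak, frame_no, rows = 0, 0, []
while True:
    ok, frame = cap.read()
    if not ok:
        break
    frame_no += 1
    boxes = model(frame, verbose=False)[0].boxes
    centroids = [((x1 + x2) / 2, (y1 + y2) / 2)
                 for x1, y1, x2, y2, conf, cls in boxes.data.tolist()
                 if int(cls) == 0]        # class 0 = person in COCO
    groups = group_sizes(centroids)
    streak = streak + 1 if groups else 0  # simplification: persistence of *any* group
    if streak >= MIN_FRAMES:
        rows.append((frame_no, max(groups)))

with open("crowds.csv", "w", newline="") as f:
    writer = csv.writer(f)
    writer.writerow(["Frame Number", "Person Count in Crowd"])
    writer.writerows(rows)
```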
Task 4: Create a Retrieval Augmented Generation (RAG) Application in Streamlit
Complexity: Medium
User-Story: Build an interactive Streamlit application where users can upload multiple documents and chat
with those documents using RAG.
Hints:
● Implement a file uploader to allow users to upload multiple documents (PDF, DOCX, TXT). A sketch of one possible RAG flow is given below.
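A minimal sketch of the RAG flow, assuming sentence-transformers for embeddings and plain cosine similarity for retrieval; the final LLM call is left as a placeholder to be replaced with your provider of choice (OpenAI, a Hugging Face pipeline, etc.).

```python
# RAG sketch for Streamlit: upload PDFs/DOCX/TXT, embed chunks, retrieve, answer.
import numpy as np
import streamlit as st
from sentence_transformers import SentenceTransformer
from pypdf import PdfReader
import docx

embedder = SentenceTransformer("all-MiniLM-L6-v2")


def read_file(f):
    """Extract plain text from an uploaded PDF, DOCX, or TXT file."""
    if f.name.lower().endswith(".pdf"):
        return "\n".join(p.extract_text() or "" for p in PdfReader(f).pages)
    if f.name.lower().endswith(".docx"):
        return "\n".join(p.text for p in docx.Document(f).paragraphs)
    return f.read().decode("utf-8", errors="ignore")


def chunk(text, size=500, overlap=100):
    """Split text into overlapping fixed-size chunks."""
    return [text[i:i + size] for i in range(0, max(len(text) - overlap, 1), size - overlap)]


st.title("Chat with your documents")
files = st.file_uploader("Upload documents", type=["pdf", "docx", "txt"],
                         accept_multiple_files=True)
if files:
    chunks = [c for f in files for c in chunk(read_file(f))]
    vectors = embedder.encode(chunks, normalize_embeddings=True)
    question = st.chat_input("Ask a question about the documents")
    if question:
        q_vec = embedder.encode([question], normalize_embeddings=True)[0]
        top = np.argsort(vectors @ q_vec)[-4:][::-1]   # top-4 chunks by cosine similarity
        context = "\n---\n".join(chunks[i] for i in top)
        prompt = f"Answer using only this context:\n{context}\n\nQuestion: {question}"
        st.chat_message("user").write(question)
        # Placeholder: send `prompt` to your LLM and display its reply here.
        st.chat_message("assistant").write(f"[LLM answer for a {len(prompt)}-char prompt]")
```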
Task 5: Fine-Tune a Large Language Model
User Story: Fine-tune a large language model on a specific dataset to adapt it for a particular task.
Hints:
● Set up Google Colab (GPU) with all required libraries, such as Hugging Face Transformers, PyTorch, etc.
● Collect and preprocess the custom dataset, and format it in a way suitable for fine-tuning (e.g., JSON, CSV).
● Load Llama-2-7B or Llama-3-8B using Hugging Face Transformers and prepare the model for fine-tuning.
● Write a script to fine-tune the model on your dataset, adjusting hyperparameters (learning rate, batch size, epochs) as needed.
● Evaluate the fine-tuned model on a validation set.
● Analyze performance improvements and possible overfitting.
● Save the fine-tuned model and push it to the Hugging Face Hub.
● Document the fine-tuning process, including the dataset, parameters, and results.
● Submit the Colab notebook and a link to the model deployed on the cloud (Hugging Face). A minimal fine-tuning sketch follows this list.
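A minimal LoRA fine-tuning sketch with transformers, peft, and datasets. The model id, dataset path, and hyperparameters are placeholders; the Llama checkpoints are gated on Hugging Face, so any small causal LM (e.g., gpt2) works for a dry run.

```python
# Fine-tuning sketch: LoRA on a causal LM with transformers + peft + datasets.
# Assumes a JSON dataset with a "text" field at dataset.json.
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

model_id = "meta-llama/Llama-2-7b-hf"   # placeholder: swap for your model
tokenizer = AutoTokenizer.from_pretrained(model_id)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# LoRA freezes the base weights and trains small adapter matrices, which is
# what makes 7B-8B models trainable on a single Colab GPU.
model = get_peft_model(model, LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05, task_type="CAUSAL_LM"))

dataset = load_dataset("json", data_files="dataset.json", split="train")
dataset = dataset.train_test_split(test_size=0.1)   # hold out a validation set


def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=512)


tokenized = dataset.map(tokenize, batched=True,
                        remove_columns=dataset["train"].column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments("out", num_train_epochs=3,
                           per_device_train_batch_size=4, learning_rate=2e-4),
    train_dataset=tokenized["train"],
    eval_dataset=tokenized["test"],
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
print(trainer.evaluate())   # compare eval loss to training loss to spot overfitting
model.push_to_hub("your-username/your-finetuned-model")   # placeholder repo id
```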
Task 6: Create a Custom Agent Using LangChain That Can Generate Code and Execute It
Complexity: Hard
User Story: Develop a custom agent using LangChain to generate and execute code based on user prompts.
Hints:
● Define the scope and capabilities of the agent (e.g., languages it can code in, types of tasks it can
handle).
● Integrate a language model (e.g., GPT-3) with LangChain to handle code generation.
● Ensure the agent can run and test code safely.
● Write the logic for the agent to understand prompts, generate code, and execute it.
● Test the agent with various coding tasks and prompts (see the sketch below).
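A minimal sketch of one way to wire this up, assuming langchain, langchain-experimental, and langchain-openai are installed; exact import paths and the initialize_agent helper vary across LangChain versions. Note that PythonREPLTool executes model-generated code in-process, so anything beyond a demo should run in a sandbox (container/VM).

```python
# Code-generating agent sketch: a ReAct agent wired to a Python REPL tool.
# Assumes OPENAI_API_KEY is set in the environment.
from langchain.agents import AgentType, initialize_agent
from langchain_experimental.tools import PythonREPLTool
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(model="gpt-4o-mini", temperature=0)   # placeholder model choice
agent = initialize_agent(
    tools=[PythonREPLTool()],
    llm=llm,
    agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION,
    verbose=True,   # prints the thought/action/observation loop for inspection
)

# The agent loops: the LLM writes Python, the REPL tool runs it, and the output
# is fed back as an observation until the agent can produce a final answer.
print(agent.run("Write and execute Python code that returns the 20th Fibonacci number."))
```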
Task 7: LLM-Powered Data Analysis and Visualization in Streamlit
User Story: Create a visualization pipeline using any LLM of your liking. Build a Streamlit application where the user can upload structured data (CSV or Excel) and ask questions about that file. The LLM should perform the required analysis to answer the user's question: make your LLM generate code based on the user's question and then execute it. (An easy way to do this is to use the agents provided by LangChain; if you use them, you will need to explain how the agent works in the background.)
Hints:
● Implement a file uploader to allow users to upload CSV or Excel files, and parse and preprocess the
uploaded data.
● Use an LLM (e.g., GPT-3) to generate code for data analysis based on user questions.
● Implement LangChain agents to handle code generation and execution.
● Execute the generated code securely.
● Display the results and visualizations in the Streamlit app.
● Test the application with various datasets and questions.
● Validate the accuracy and relevance of the analysis. (A sketch using LangChain's pandas agent follows this list.)
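A minimal sketch using LangChain's pandas DataFrame agent from langchain-experimental (parameter names such as allow_dangerous_code depend on the installed version). Under the hood, the agent prompts the LLM to emit pandas code, executes it through a Python REPL tool, and feeds the output back as an observation in a ReAct loop until it can answer; this is the background behaviour the task asks you to explain.

```python
# Data-Q&A sketch: Streamlit upload + LangChain pandas DataFrame agent.
# Assumes OPENAI_API_KEY is set in the environment.
import pandas as pd
import streamlit as st
from langchain_experimental.agents import create_pandas_dataframe_agent
from langchain_openai import ChatOpenAI

st.title("Ask questions about your data")
file = st.file_uploader("Upload a CSV or Excel file", type=["csv", "xlsx"])
if file:
    df = pd.read_csv(file) if file.name.endswith(".csv") else pd.read_excel(file)
    st.dataframe(df.head())
    question = st.chat_input("Ask a question about this data")
    if question:
        agent = create_pandas_dataframe_agent(
            ChatOpenAI(model="gpt-4o-mini", temperature=0),   # placeholder model
            df,
            allow_dangerous_code=True,   # the agent runs generated code in-process
            verbose=True,
        )
        st.chat_message("user").write(question)
        st.chat_message("assistant").write(agent.run(question))
```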
Submission:
● Send a screen-recorded video of the user story, or upload it to your Google Drive and share the link (please ensure you rename the video to your full name before sending it).
● Once approved, we will ask for the code.
● Please zip the video before sending.
● Naming structure: TASKNUMBER_FULLNAME
Deadline: