
AI-Powered Airline Call Center Optimization
Overview
In an airline call center, agents handle a wide range of customer inquiries—from flight cancellations and reschedules to refunds and complaints. These conversations produce valuable data that can be leveraged through Generative AI. By integrating models from services such as AWS Bedrock, OpenAI, or Together AI, the system can:

• Transcribe audio recordings

• Categorize issues (e.g., Flight Cancellation vs. Refund)

• Compute KPIs to optimize call center operations

This approach should enhance customer satisfaction, reduce call handling times, and
streamline agent performance.

Problem 1: Two-Agent System with Function Calling & Structured Output
Evaluation Criteria:

• Multi-Agent Coordination – Handling two AI agents working together

• Function Calling – One agent invoking another’s functions

• Structured JSON Output – Ensuring responses follow a defined schema

• Prompt Engineering – Effectively guiding AI agents

Problem Statement

You are building a system where two AI agents collaborate to answer user queries about
airline flights:

1. Info Agent

• Has access to a function get_flight_info(flight_number) that returns structured flight data (e.g., destination, departure time, status).

• Responds only in JSON format, with no additional text.

2. QA Agent

• Receives user queries (e.g., “What time does Flight 123 depart?”).

• Calls the Info Agent to fetch relevant flight data.

• Processes the result and returns a structured JSON response in a user-friendly format.

Functions to Implement

1. get_flight_info(flight_number: str) -> dict

• Simulates or mocks flight data retrieval.

• Returns a Python dictionary with keys like:

{
    "flight_number": "AI123",
    "departure_time": "08:00 AM",
    "destination": "Delhi",
    "status": "Delayed"
}

2. info_agent_request(flight_number: str) -> str

• Calls get_flight_info and returns the data as a JSON string.

• No extra text—only valid JSON.

3. qa_agent_respond(user_query: str) -> str

• Extracts the flight number from the query (e.g., “Flight 123”).

• Calls info_agent_request to get the flight’s JSON data.

• Returns a structured JSON response, for example:

{
    "answer": "Flight AI123 departs at 08:00 AM to Delhi. Current status: Delayed."
}

• The output must strictly follow JSON format—no plain text or extra commentary.
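
The problem allows the agent interaction to be simulated, so a minimal, LLM-free sketch of the three functions could look like the following (the mock flight table and the regex used to pull the flight number out of the query are illustrative assumptions, not part of the specification):

import json
import re

# Hypothetical in-memory mock of the flight database backing the Info Agent.
_MOCK_FLIGHTS = {
    "AI123": {
        "flight_number": "AI123",
        "departure_time": "08:00 AM",
        "destination": "Delhi",
        "status": "Delayed",
    },
}


def get_flight_info(flight_number: str) -> dict:
    """Simulate flight data retrieval from the mock database."""
    flight = _MOCK_FLIGHTS.get(flight_number)
    if flight is None:
        return {"error": f"Flight {flight_number} not found in database."}
    return flight


def info_agent_request(flight_number: str) -> str:
    """Info Agent: return the flight data as a JSON string, with no extra text."""
    return json.dumps(get_flight_info(flight_number))


def qa_agent_respond(user_query: str) -> str:
    """QA Agent: extract the flight number, call the Info Agent, and return a JSON answer."""
    match = re.search(r"Flight\s+([A-Za-z]*\d+)", user_query, re.IGNORECASE)
    if match is None:
        return json.dumps({"answer": "No flight number found in the query."})

    flight_number = match.group(1).upper()
    data = json.loads(info_agent_request(flight_number))
    if "error" in data:
        return json.dumps({"answer": f"Flight {flight_number} not found in database."})

    answer = (
        f"Flight {data['flight_number']} departs at {data['departure_time']} "
        f"to {data['destination']}. Current status: {data['status']}."
    )
    return json.dumps({"answer": answer})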

Test Cases:
Function Call: get_flight_info("AI123")
Expected Output: {"flight_number": "AI123", "departure_time": "08:00 AM", "destination": "Delhi", "status": "Delayed"}

Function Call: info_agent_request("AI123")
Expected Output: JSON string of the above dictionary (without extra text)

Function Call: qa_agent_respond("When does Flight AI123 depart?")
Expected Output: {"answer": "Flight AI123 departs at 08:00 AM to Delhi. Current status: Delayed."}

Function Call: qa_agent_respond("What is the status of Flight AI999?")
Expected Output: {"answer": "Flight AI999 not found in database."} (if flight doesn’t exist)
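
Assuming the sketch above, the expected outputs can be spot-checked with a few assertions:

import json

# Quick checks mirroring the test cases above.
assert get_flight_info("AI123")["destination"] == "Delhi"
assert json.loads(info_agent_request("AI123"))["status"] == "Delayed"
assert json.loads(qa_agent_respond("When does Flight AI123 depart?"))["answer"].startswith("Flight AI123 departs at 08:00 AM")
assert "not found" in json.loads(qa_agent_respond("What is the status of Flight AI999?"))["answer"]
print("All Problem 1 checks passed.")
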
Problem 2: Binary Classification for Customer Sentiment
Evaluation Criteria:

• Data Preprocessing – Tokenization, handling missing values, etc.

• Model Training – Logistic Regression, Naive Bayes, or similar

• Prediction & Evaluation – Accuracy, confusion matrix, etc.

• Use of Provided Training Data

Problem Statement

You have a small dataset of customer feedback from an airline. Each row contains a text
snippet and a binary label: positive or negative.

Your task:

1. Train a model to classify feedback as positive or negative.

2. Predict the sentiment for new input text.

Sample Training Data


Text Label

"The flight was on time, and the staff was friendly." positive

"I had to wait 3 hours due to a delay. Terrible!" negative

"Great legroom and comfortable seats." positive

"Lost my luggage, extremely upset about this." negative


"Check-in was smooth, no issues at all." positive

(Full training dataset will be provided.)

Link to download: 2026_ML_test_dataset

Functions to Implement

1. train_sentiment_model(training_data: List[Tuple[str, str]]) -> Any

• Accepts a list of (text, label) pairs where label ∈ { "positive", "negative" }.

• Preprocesses the text (tokenization, lowercasing, etc.).

• Trains a simple model (e.g., LogisticRegression from sklearn).

• Returns the trained model object.

2. predict_sentiment(model: Any, new_text: str) -> str

• Accepts the trained model and a text string.

• Applies the same preprocessing used during training.

• Returns "positive" or "negative" based on the model’s prediction.
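
A minimal sketch using scikit-learn, assuming a TF-IDF + Logistic Regression pipeline (bundling the vectorizer and the classifier in one Pipeline guarantees that prediction reuses the training-time preprocessing):

from typing import Any, List, Tuple

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline


def train_sentiment_model(training_data: List[Tuple[str, str]]) -> Any:
    """Train a TF-IDF + LogisticRegression pipeline on (text, label) pairs."""
    texts = [text for text, _ in training_data]
    labels = [label for _, label in training_data]
    model = Pipeline([
        ("tfidf", TfidfVectorizer(lowercase=True)),   # tokenization + lowercasing
        ("clf", LogisticRegression(max_iter=1000)),
    ])
    model.fit(texts, labels)
    return model


def predict_sentiment(model: Any, new_text: str) -> str:
    """Return "positive" or "negative" for a single feedback string."""
    return str(model.predict([new_text])[0])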

Test Cases
Function Call: train_sentiment_model([("I love this airline", "positive"), ("Worst experience ever", "negative")])
Expected Output: Trained model object (e.g., LogisticRegression)

Function Call: predict_sentiment(model, "The seats were comfortable and service was great!")
Expected Output: Likely "positive" (depending on training)

Function Call: predict_sentiment(model, "They lost my baggage and were very unhelpful!")
Expected Output: Likely "negative"

Function Call: predict_sentiment(model, "Nothing special, just an average flight.")
Expected Output: Could be "positive" or "negative"—depends on the model.
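
A short usage run over the sample rows shown earlier, assuming the sketch above (the predicted labels are only the likely outcomes, not guaranteed):

# Train on the sample rows shown above, then score two new pieces of feedback.
training_data = [
    ("The flight was on time, and the staff was friendly.", "positive"),
    ("I had to wait 3 hours due to a delay. Terrible!", "negative"),
    ("Great legroom and comfortable seats.", "positive"),
    ("Lost my luggage, extremely upset about this.", "negative"),
    ("Check-in was smooth, no issues at all.", "positive"),
]
model = train_sentiment_model(training_data)
print(predict_sentiment(model, "The seats were comfortable and service was great!"))  # likely "positive"
print(predict_sentiment(model, "They lost my baggage and were very unhelpful!"))      # likely "negative"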

Implementation & Submission Guidelines


Prerequisites

Before you begin, ensure you have:

• Python 3.8+ installed

• A virtual environment set up (venv or conda recommended)

• API keys for OpenAI, AWS Bedrock, or Together AI (depending on your choice of LLM provider)

• Required libraries installed:

pip install openai boto3 together

LLM Setup and API Key Configuration

1. Using OpenAI

Getting API Key

1. Sign up or log in at OpenAI.

2. Navigate to API Keys under your Account Settings.

3. Generate a new API key and copy it.


Setting Up API Key in Python
import os
import openai

os.environ["OPENAI_API_KEY"] = "your_openai_api_key"
openai.api_key = os.environ["OPENAI_API_KEY"]  # set explicitly; openai only reads the env var at import time

# Uses the legacy openai<1.0 ChatCompletion interface.
response = openai.ChatCompletion.create(
    model="gpt-4",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "What is AI?"},
    ],
)

print(response["choices"][0]["message"]["content"])

2. Using AWS Bedrock

Getting API Key

1. Sign in to AWS Console.

2. Navigate to AWS IAM → Create a user with Bedrock access.

3. Attach the AmazonBedrockFullAccess policy.

4. Generate and download the Access Key ID and Secret Access Key.

Setting Up API Key in Python


import json

import boto3

bedrock = boto3.client(
    service_name="bedrock-runtime",
    region_name="us-east-1",  # Change region if needed
    aws_access_key_id="your_aws_access_key",
    aws_secret_access_key="your_aws_secret_key",
)

# anthropic.claude-v2 expects a Human/Assistant-formatted prompt and max_tokens_to_sample.
response = bedrock.invoke_model(
    modelId="anthropic.claude-v2",
    body=json.dumps({
        "prompt": "\n\nHuman: What is AI?\n\nAssistant:",
        "max_tokens_to_sample": 100,
    }),
)

print(response["body"].read().decode("utf-8"))

3. Using Together AI

Getting API Key

1. Sign up at Together AI.

2. Go to API Keys and generate a new key.

3. Copy and store it securely.

Setting Up API Key in Python


import os

from together import Together

os.environ["TOGETHER_API_KEY"] = "your_together_api_key"

# Uses the Together Python SDK (v1+) OpenAI-style chat client.
client = Together(api_key=os.environ["TOGETHER_API_KEY"])

response = client.chat.completions.create(
    model="together/gpt-neoxt-20b",  # replace with a chat model ID available on your Together account
    messages=[{"role": "user", "content": "What is AI?"}],
)

print(response.choices[0].message.content)

2. Language & Libraries

• Use Python.

• For Problem 1, simulate/mock the multi-agent interaction.

• For Problem 2, libraries like sklearn, numpy, pandas are permitted.

3. Project Structure
Suggested file structure:

├── agent_system.py    # For Problem 1
├── ml_classifier.py   # For Problem 2
├── README.md          # Setup & usage instructions

4. Testing

• Ensure functions pass all test cases.

• Handle edge cases (e.g., missing flight number, empty text).
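
For example, assuming the sketch functions and trained model from the earlier examples, edge cases can be exercised like this (how empty or malformed input is answered is up to your implementation):

# Query with no flight number at all: the QA agent should still return valid JSON.
print(qa_agent_respond("Tell me about my booking"))

# Flight missing from the (mock) database.
print(qa_agent_respond("What is the status of Flight AI999?"))

# Empty feedback text: the classifier should still return "positive" or "negative".
print(predict_sentiment(model, ""))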

5. Documentation

• Docstrings for every function.

• README should explain:

o Installation

o Running the code

o Approach for multi-agent function calling (optional).

6. Time Constraints

• Recommended: 60 minutes (or as allocated).

• Extra time? Consider:

o Robust error handling

o Hyperparameter tuning

7. Submission Instructions

1. Project Structure

a. Keep each problem in a separate folder.

b. Each folder must contain a README.md with instructions to run the code.

c. Include all necessary files, but DO NOT include __pycache__, venv, or unnecessary logs.

2. API Keys

a. Store API keys in an api_keys.env file inside each folder (see the loading sketch after this list).

b. Do not hardcode API keys in the scripts.

3. Dependencies

a. Each folder must have a requirements.txt specifying the necessary packages.

4. Packaging for Submission

a. Zip the project folder while excluding virtual environments and cache files.

b. The final ZIP file should contain all folders with their respective README.md, requirements.txt, and scripts.

c. Upload the ZIP file to your Google Drive and share it so that anyone with the link can access it.

d. Submit the link via the Google Form below.

e. Google Form for Submission: https://docs.google.com/forms/d/e/1FAIpQLSc8E8Sh32CeKFDrN82DouvMh1DLimzWgiTW_VtmJAmlziophw/viewform?usp=header
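
A minimal sketch of loading keys from api_keys.env, assuming the python-dotenv package (add it to requirements.txt if you use it; the file contents shown are illustrative):

# api_keys.env (one KEY=value per line), for example:
#   OPENAI_API_KEY=your_openai_api_key
#   TOGETHER_API_KEY=your_together_api_key

import os

from dotenv import load_dotenv  # pip install python-dotenv

load_dotenv("api_keys.env")                 # read the file into environment variables
openai_key = os.environ["OPENAI_API_KEY"]   # then read the keys from os.environ as usual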

Example Structure:

submission.zip/
├── problem1/
│   ├── main.py
│   ├── api_keys.env
│   ├── requirements.txt
│   ├── README.md
├── problem2/
│   ├── main.py
│   ├── api_keys.env
│   ├── requirements.txt
│   ├── README.md

All the very best
