GenAI Intern Assignment
GenAI Intern Assignment
Overview
Dataset Details
Setup
Requirements
Evaluation
Test Queries
Deliverables
Overview
This assignment requires you to develop a natural language interface that
translates user input (in plain text) into SQL queries for the Pagila database.
Dataset Details
The Pagila database consists of 21 related tables.
Setup
Clone the repo and run the following command to setup locally.
1. Run: docker-compose up
Requirements
1. Input Format: Plain text questions from the user.
The system should generate accurate SQL queries that align with the
schema constraints (e.g., proper table and column names).
4. Model Considerations:
You may use the free tier of Gemini or any other OSS LLM.
Evaluation
The agent should be be evaluated against a set of 40 natural language queries
(provided below).
50% for queries that are logically correct but have minor errors.
Test Queries
Here is the set of 40 natural language queries that will be used for evaluation,
available as an excel sheet.
Deliverables
1. Codebase with a README explaining setup and usage.
3. A brief demo showing the system in action with multiple test cases.