Python Engineer Problem Statements

The document outlines optional problem statements for applicants to showcase their skills through unique projects or by solving specific tasks. Two main problem statements are provided: designing a collaborative document management backend and building a customer feedback analysis system using machine learning. Applicants are encouraged to submit their solutions via GitHub, with a focus on demonstrating technical proficiency, creativity, and effective communication.

Uploaded by

Sai Rajeev

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views5 pages

Python Engineer Problem Statements

Uploaded by

Sai Rajeev

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

The problem statements are optional.

We understand that many applicants may

already have unique experiences or projects that showcase their skills, and we
value those immensely. If your portfolio includes standout work, you don’t need
to attempt these problems.

However, if you haven’t had the opportunity to work on substantial projects or

wish to demonstrate your capabilities in a focused way, these problem
statements provide an excellent opportunity to shine. Additionally, any work you
do on these problem statements will be owned by you, and you are free to use it
in your portfolio or future applications.

Feel free to select any one of the problem statements that aligns best with your
skills, comfort level, and background.

Problem Statements

1. Design and Implement a Collaborative

Document Management Backend
Your task is to design and implement a backend service for a collaborative document
management application. The application should allow users to upload, share, and
manage documents.

Requirements:

1. Core Features:
o Users can upload and download documents.
o Support basic metadata for documents (e.g., title, author, tags, uploaded date).
o Enable document sharing with other users with different permissions:
§ Read-only
§ Edit
§ Admin
o Implement document versioning (track changes made to documents).
2. Scalability and Performance:
o Design the system to handle millions of documents and thousands of
simultaneous users.
o Ensure fast retrieval of documents and metadata.
3. Search and Filters:
o Allow searching documents by title, tags, or author.
o Provide filters for upload date and file size.
4. Advanced Collaboration (Open-ended):
o Suggest additional features to enhance collaboration (e.g., commenting, real-
time editing).
o Justify the trade-offs and technical decisions involved in implementing them.
5. Deployment:
o Provide a deployment-ready solution (Docker, CI/CD pipelines, etc.).

Deliverables:

1. Codebase:
o Implement the core backend functionality.
o Include tests to validate the correctness of your implementation.
2. Design Document:
o Describe your architecture, design decisions, and trade-offs in a separate file
(Markdown or PDF).
3. Future Enhancements:
o Propose at least two additional features and explain how they could be
implemented.

2. Build and Deploy a Customer Feedback

Analysis System
You are tasked with developing a machine learning pipeline that analyzes customer feedback
data to extract actionable insights. The system should handle data ingestion, preprocessing,
model training, and deployment for real-time predictions.

Data sources: you can use any Kaggle datasets like-

https://www.kaggle.com/datasets/arhamrumi/amazon-product-reviews

Requirements:

1. Data Ingestion and Preprocessing:

o Design a pipeline to handle raw customer feedback (e.g., text reviews) from
multiple sources (CSV files, APIs).
o Clean the data by removing duplicates, handling missing values, and normalizing
text.
o Identify and implement techniques to handle imbalanced datasets.
2. Model Development:
o Train a machine learning model to classify feedback into predefined categories
(e.g., Positive, Negative, Neutral).
o Explore and justify your choice of model(s) (e.g., traditional ML like Random
Forests vs deep learning like BERT).
o Evaluate your model using appropriate metrics like precision, recall, and F1
score.
3. Feature Engineering:
o Extract meaningful features from the feedback text (e.g., TF-IDF, embeddings).
o Include exploratory data analysis (EDA) to support your feature selection
decisions.
4. Model Deployment:
o Develop an API to serve the model for real-time feedback classification.
o Ensure the API can handle high concurrency and provides predictions with low
latency.
5. Monitoring and Feedback Loop:
o Implement a basic monitoring system to track model performance over time
(e.g., data drift, prediction accuracy).
o Suggest a strategy to retrain the model periodically with new data.

Deliverables:

1. Codebase:
o A complete pipeline for preprocessing, model training, and deployment.
o Include scripts for unit testing and performance benchmarking.
2. Documentation:
o A detailed report explaining your pipeline, model selection, and trade-offs.
3. API Endpoint:
o Host your model on a cloud platform or provide a Dockerized setup for running
the API locally.
4. Future Plan:
o Propose enhancements for handling multilingual feedback or incorporating
customer demographic data.
Evaluation Criteria
1. Thought Process and Approach

• Understanding the Problem: Have you clearly identified the requirements and
constraints of the chosen problem statement?
• Solution Design: Does your approach demonstrate a thoughtful design, considering real-
world challenges?
• Reasoning: Are your decisions for architecture, tools, and algorithms supported by
logical explanations?
• Creativity: Have you introduced unique or innovative elements in your solution?

2. Implementation and Technical Proficiency

• Code Quality: Is your code clean, modular, and well-documented?

• Pipeline Completeness:
o For Problem 1 (Collaborative Document Management): Have you addressed key
functionalities like version control, access management, and collaboration
features?
o For Problem 2 (Customer Feedback Analysis): Have you implemented the full
ML pipeline, from data preprocessing to deployment?
• Use of AI/ML (where applicable):
o Did you effectively use AI tools, libraries, or models to solve specific parts of the
problem?
o Is the AI/ML model appropriate for the task and well-integrated into the overall
system?

3. Real-World Application and Scalability

• Practicality: Does your solution address the problem in a realistic, applicable manner?
• Scalability and Robustness:
o Is the backend/system designed to handle large-scale or dynamic workloads?
o How well does the solution handle errors, edge cases, or data variations?
• Performance: Have you demonstrated or explained how your solution optimizes
performance (e.g., response times, model accuracy)?
4. Communication and Presentation

• Documentation: Have you provided clear documentation for your solution, explaining
its components, usage, and setup?
• Clarity: Is your submission organized and easy to understand?

General Guidelines

• Use of AI Tools: Using AI tools is encouraged. However, the focus is on how you
integrate and leverage AI to enhance your solution, not merely on AI-generated
outputs. Submissions that rely solely on AI-generated solutions without demonstrating
your own thought process or understanding will not be considered for further
evaluation.
• Optionality: Remember, these assignments are optional. If you have previous projects
or experience that already showcase your skills, you can choose to highlight those
instead.

Note

This evaluation is designed to assess both technical skills and problem-solving abilities. Whether
your expertise lies in backend development or machine learning, the goal is to demonstrate
your understanding, creativity, and ability to apply skills effectively.

Please submit your solution by emailing a link to your GitHub repository

to [email protected]. Rest assured, we carefully review every
application we receive.

Best of luck!

AI Recruit
No ratings yet
AI Recruit
7 pages
Consumer Complaint Prediction Pipeline
No ratings yet
Consumer Complaint Prediction Pipeline
4 pages
Online Cake Order-Mern
No ratings yet
Online Cake Order-Mern
5 pages
Online Assignment Plagiarism Check
No ratings yet
Online Assignment Plagiarism Check
5 pages
Automated ML Solution for Industrial Use
No ratings yet
Automated ML Solution for Industrial Use
4 pages
Prompt Engineering Assignment - Weather Application
No ratings yet
Prompt Engineering Assignment - Weather Application
5 pages
E-commerce Chatbot Development Case Study
No ratings yet
E-commerce Chatbot Development Case Study
3 pages
Software Developement Prompts
No ratings yet
Software Developement Prompts
14 pages
AI Role Assignment 4
No ratings yet
AI Role Assignment 4
2 pages
Group Project Assignment
No ratings yet
Group Project Assignment
7 pages
(BotAI) - Takehome PS 4
No ratings yet
(BotAI) - Takehome PS 4
3 pages
Arsalan's Project
No ratings yet
Arsalan's Project
4 pages
Investment Predictions
No ratings yet
Investment Predictions
5 pages
AI-ML Intern Assignment
No ratings yet
AI-ML Intern Assignment
5 pages
Online Vehicle Rental Management System-Mern
No ratings yet
Online Vehicle Rental Management System-Mern
5 pages
Problem Statements
No ratings yet
Problem Statements
9 pages
Catering Reserving and Ordering System-Mern
100% (1)
Catering Reserving and Ordering System-Mern
5 pages
Software Engineer Assignment
No ratings yet
Software Engineer Assignment
8 pages
Investment Predictions
No ratings yet
Investment Predictions
5 pages
Sahil Garg Updated For Azure
No ratings yet
Sahil Garg Updated For Azure
8 pages
Flight Fare Prediction Overview
No ratings yet
Flight Fare Prediction Overview
5 pages
Impactful Project Ideas
No ratings yet
Impactful Project Ideas
16 pages
Assignment Data Science
No ratings yet
Assignment Data Science
6 pages
CSI RUBIX24 Problem Statements
No ratings yet
CSI RUBIX24 Problem Statements
15 pages
Arsalan's Project New
No ratings yet
Arsalan's Project New
4 pages
Entertainment Web Application
No ratings yet
Entertainment Web Application
5 pages
Artificial Intelligence and Machine Learning Fundamentals
No ratings yet
Artificial Intelligence and Machine Learning Fundamentals
54 pages
Wa0013.
No ratings yet
Wa0013.
4 pages
Mars Open Projects 2025
No ratings yet
Mars Open Projects 2025
7 pages
67a050ecd4b14 Unstop AIML Intership Assessment
No ratings yet
67a050ecd4b14 Unstop AIML Intership Assessment
1 page
Social Media Web Application
No ratings yet
Social Media Web Application
5 pages
CSE357CV
No ratings yet
CSE357CV
3 pages
Problem Statement 1 - Real-Time Collaborative Document Editing System
No ratings yet
Problem Statement 1 - Real-Time Collaborative Document Editing System
3 pages
Shyena Consultant Ayush S MLOps 5+ Years
No ratings yet
Shyena Consultant Ayush S MLOps 5+ Years
5 pages
Problem Statement:: Project Title Technologies Domain Project Difficulties Level
No ratings yet
Problem Statement:: Project Title Technologies Domain Project Difficulties Level
4 pages
Alcovia - Preprocess Assignment
No ratings yet
Alcovia - Preprocess Assignment
3 pages
Final Year Sem VII
No ratings yet
Final Year Sem VII
23 pages
Summary Example
No ratings yet
Summary Example
2 pages
Hackathon Problem Statements
No ratings yet
Hackathon Problem Statements
3 pages
Gena Iaws
No ratings yet
Gena Iaws
3 pages
Jayadhi Entry-Level Internship Assignment
No ratings yet
Jayadhi Entry-Level Internship Assignment
5 pages
DSML Projects
No ratings yet
DSML Projects
10 pages
Bhaskar CV
No ratings yet
Bhaskar CV
5 pages
CodeChef-VIT'24 Recruitment Task Sheet - 240229 - 205340
No ratings yet
CodeChef-VIT'24 Recruitment Task Sheet - 240229 - 205340
12 pages
BERT for Multi-Label Ticket Classification
No ratings yet
BERT for Multi-Label Ticket Classification
4 pages
Assignment
No ratings yet
Assignment
3 pages
Project Report - M13 Sentiment Analyzer
No ratings yet
Project Report - M13 Sentiment Analyzer
9 pages
Xeno SDE Internship Assignment - 2025
No ratings yet
Xeno SDE Internship Assignment - 2025
5 pages
OpenMic Ai AI Product Engineer (Full Stack Engineer
No ratings yet
OpenMic Ai AI Product Engineer (Full Stack Engineer
4 pages
Kaleema Shaik Resume
No ratings yet
Kaleema Shaik Resume
2 pages
670442b95e11a Hack2Hire Hackathon Overview 1
No ratings yet
670442b95e11a Hack2Hire Hackathon Overview 1
6 pages
Take Home Assignment
No ratings yet
Take Home Assignment
2 pages
AI-Driven Code Review For Software Quality: Step 1: Define The Scope
No ratings yet
AI-Driven Code Review For Software Quality: Step 1: Define The Scope
3 pages
Meit Y
No ratings yet
Meit Y
5 pages
AI Agent UC Berkeley
No ratings yet
AI Agent UC Berkeley
14 pages
Breakdown of The Tasks and Subtasks For The Project
No ratings yet
Breakdown of The Tasks and Subtasks For The Project
4 pages
Senior Software Engineer Test
No ratings yet
Senior Software Engineer Test
3 pages
Customer Sentiment Analysis Project
No ratings yet
Customer Sentiment Analysis Project
3 pages
Max Pain
No ratings yet
Max Pain
16 pages
Docker Commands Cheat Sheet
100% (1)
Docker Commands Cheat Sheet
7 pages
Kubernetes in DevOps
No ratings yet
Kubernetes in DevOps
17 pages
Web Protocols - WebSocket
No ratings yet
Web Protocols - WebSocket
8 pages
List EoT Jan - Jun 2023 Rev.01
No ratings yet
List EoT Jan - Jun 2023 Rev.01
3 pages
Embedded Firmware Design and Development
No ratings yet
Embedded Firmware Design and Development
9 pages
Challenges in Mechanical Engineering Coursework
100% (2)
Challenges in Mechanical Engineering Coursework
8 pages
T 8097 EN Type 3347 Hygienic Angle Valve: Application
No ratings yet
T 8097 EN Type 3347 Hygienic Angle Valve: Application
22 pages
Tech Support & Quality Specialist Profile
No ratings yet
Tech Support & Quality Specialist Profile
2 pages
Lubimax G 337 - TDS
100% (2)
Lubimax G 337 - TDS
2 pages
Statement - 2022 01 19
No ratings yet
Statement - 2022 01 19
4 pages
NSX Battle Card - Final
100% (1)
NSX Battle Card - Final
2 pages
Applied Microsoft SQL Server 2008 Reporting Services PDF
No ratings yet
Applied Microsoft SQL Server 2008 Reporting Services PDF
770 pages
V2500-A5 Fan Trim Balancing
No ratings yet
V2500-A5 Fan Trim Balancing
22 pages
Ctek Mxs 25 Car Battery Charger User Manual
No ratings yet
Ctek Mxs 25 Car Battery Charger User Manual
6 pages
Digital Electronics II Final Exam Guide
No ratings yet
Digital Electronics II Final Exam Guide
2 pages
Master Plumber Exam: Plumbing Materials Quiz
No ratings yet
Master Plumber Exam: Plumbing Materials Quiz
3 pages
Veeam Backup 9 5 U4 User Guide Vsphere
No ratings yet
Veeam Backup 9 5 U4 User Guide Vsphere
1,246 pages
Kea HV479 2024 09 10 13 18 53
No ratings yet
Kea HV479 2024 09 10 13 18 53
3 pages
3 Phase PID Motor 3
No ratings yet
3 Phase PID Motor 3
11 pages
EPM Assignment 1 PDF
No ratings yet
EPM Assignment 1 PDF
4 pages
OOAD Lectures
No ratings yet
OOAD Lectures
104 pages
DBMS Introduction Advantages Types File System
No ratings yet
DBMS Introduction Advantages Types File System
30 pages
RHCSA Exam Prep Guide
100% (2)
RHCSA Exam Prep Guide
42 pages
Sem 8 - Pending Book Submission - Library - Summer 2024
No ratings yet
Sem 8 - Pending Book Submission - Library - Summer 2024
3 pages
Phases of CALL
No ratings yet
Phases of CALL
28 pages
BSC6910 Spare Parts Catalog (V100R025C10 - 01) (PDF) - EN
No ratings yet
BSC6910 Spare Parts Catalog (V100R025C10 - 01) (PDF) - EN
23 pages
JNCIE-SEC-11.a C7 NAT - Pps
No ratings yet
JNCIE-SEC-11.a C7 NAT - Pps
37 pages
Robi Axiata Internship Experience
100% (1)
Robi Axiata Internship Experience
2 pages
Senior Lecturer Resume: Imrul Kaes
No ratings yet
Senior Lecturer Resume: Imrul Kaes
3 pages
MCQ - Question-Part-3
No ratings yet
MCQ - Question-Part-3
1 page
Understanding The Security of Discrete GPUs
No ratings yet
Understanding The Security of Discrete GPUs
11 pages
Shreeji Electrical
No ratings yet
Shreeji Electrical
4 pages
Surveyor CV for Engineering Firms
No ratings yet
Surveyor CV for Engineering Firms
4 pages