Assignment_ML
At Dhineu, we are building cutting-edge solutions at the intersection of Artificial Intelligence, Geospatial
Analytics, and Enterprise Intelligence. As part of our effort to bring on highly capable interns into our
Machine Learning team, we have designed this assignment to evaluate your ability to solve real-world
problems in the domains of Computer Vision, NLP/LLMs, Vision-Language Models, and GeoAI.
This assignment is designed to assess your:
Problem-solving approach
Technical and modeling skills
Creativity in applying ML techniques
Code quality and reproducibility
Ability to communicate your results effectively
You are required to choose one (1) out of the four listed assignment tracks and submit your solution in
the form of a GitHub repository or Drive folder containing code, sample outputs, and a brief write-up.
This is your opportunity to show how you approach open-ended AI problems, and we encourage you to
go beyond the minimum requirements — deploy your models, visualize results, or even identify ethical or
geospatial impacts of your work.
Copyright © 2025 by Dhineu. Permission needs to be granted to use, modify, and distribute this
document.
2. NLP/LLM: Text Summarization & Geo-Tagged QA Pipeline
Objective:
o Build a system that summarizes articles or answers questions from
structured/unstructured text.
o Build a table that logs the geolocation of each news article (one row per article).
Dataset: CNN-DailyMail News Text Summarization (train.csv), available at
https://www.kaggle.com/datasets/gowrishankarp/newspaper-text-summarization-cnn-dailymail
Expected Deliverables:
o Working summarization or QA pipeline
o Script to extract and geocode geographic references from text
o Final output CSV with summarized content along with geo-coordinates
o Brief report on model architecture and pipeline design
Suggested Models: T5, BART, LLaMA-2, GPT-4 API, LangChain, Haystack, or other open-source
frameworks
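One possible shape for the geo-tagging and CSV deliverables is sketched below. It is an illustration under stated assumptions, not a reference solution: the tiny hard-coded GAZETTEER is hypothetical and stands in for a real named-entity recognizer plus a geocoding service, the summary is assumed to come from whichever summarization model you choose, and the column names are placeholders.

```python
import csv
import io
import re

# Hypothetical mini-gazetteer: place name -> (latitude, longitude).
# A real pipeline would use an NER model plus a geocoding service instead.
GAZETTEER = {
    "London": (51.5074, -0.1278),
    "New York": (40.7128, -74.0060),
    "Paris": (48.8566, 2.3522),
}

FIELDS = ["id", "summary", "place", "latitude", "longitude"]


def extract_places(text):
    """Return gazetteer place names found in text, ordered by first appearance."""
    hits = []
    for name in GAZETTEER:
        match = re.search(r"\b" + re.escape(name) + r"\b", text)
        if match:
            hits.append((match.start(), name))
    return [name for _, name in sorted(hits)]


def build_rows(articles):
    """Build one output row per (article_id, summary, full_text) tuple."""
    rows = []
    for article_id, summary, full_text in articles:
        places = extract_places(full_text)
        # Assumption: the first place mentioned is treated as the article's location.
        place = places[0] if places else ""
        lat, lon = GAZETTEER[place] if place else ("", "")
        rows.append({
            "id": article_id,
            "summary": summary,
            "place": place,
            "latitude": lat,
            "longitude": lon,
        })
    return rows


def rows_to_csv(rows):
    """Serialize rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

Taking the first mentioned place is a deliberate simplification. Articles often mention several locations, so a fuller solution might log every match or pick the most frequent one.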
3. Vision-Language Models (VLMs): Image Captioning
Objective: Generate text captions for street-view images using vision-language models.
Dataset: GoPro street-view footage, available at
https://dhineumlassignment.s3.us-east-1.amazonaws.com/goPro.zip
Expected Deliverables:
o Trained captioning model
o Sample predictions (images + generated captions)
o Evaluation using BLEU, CIDEr, METEOR scores
o Short report on approach, architecture used, and results
Suggested Models: BLIP, CLIP, Flamingo, or similar open-source models
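To make the BLEU part of the evaluation concrete, here is a minimal sketch of an unsmoothed sentence-level BLEU score (clipped n-gram precision with a brevity penalty). It assumes simple whitespace tokenization and a default of bigrams (max_n=2), both of which are illustrative choices. It does not cover CIDEr or METEOR, and for reported results an established evaluation package is likely the safer option.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(candidate, references, max_n=2):
    """Unsmoothed BLEU for one caption against one or more reference captions."""
    cand = candidate.lower().split()
    refs = [ref.lower().split() for ref in references]
    if not cand or not refs:
        return 0.0

    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts = ngrams(cand, n)
        total = sum(cand_counts.values())
        if total == 0:
            return 0.0
        # Clip each candidate n-gram count by its maximum count in any reference.
        max_ref = Counter()
        for ref in refs:
            for gram, count in ngrams(ref, n).items():
                max_ref[gram] = max(max_ref[gram], count)
        clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
        if clipped == 0:
            return 0.0  # without smoothing, any zero precision gives BLEU = 0
        log_precisions.append(math.log(clipped / total))

    # Brevity penalty uses the reference length closest to the candidate length.
    ref_len = min((abs(len(ref) - len(cand)), len(ref)) for ref in refs)[1]
    brevity = 1.0 if len(cand) > ref_len else math.exp(1 - ref_len / len(cand))
    return brevity * math.exp(sum(log_precisions) / max_n)
```

Because the score is unsmoothed, short captions with no matching bigram score exactly zero. Corpus-level BLEU or a smoothed variant usually gives more stable numbers on small sample sets.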
4. GeoAI: Satellite Image Change Detection
Objective: Detect changes in satellite/aerial images over time.
Dataset: Contains before and after images along with labels (buildings). Download it from
https://dhineumlassignment.s3.us-east-1.amazonaws.com/CDD.zip
Expected Deliverables:
o Preprocessed data pipeline
o Change detection outputs (before/after images with detected changes)
o Evaluation metrics or visual inspection outputs
o Brief write-up on methodology and model results
Suggested Models: Siamese networks, UNet variants, ChangeFormer, or other open-source
change detection models
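As a starting point for the evaluation deliverable, the sketch below shows a naive pixel-differencing baseline and the usual binary-mask metrics (precision, recall, F1, IoU). It assumes images are already loaded as equally sized 2D lists of grayscale values and that labels are 0/1 change masks. The function names and the threshold are hypothetical, and the differencing baseline is only a sanity check, not one of the suggested models.

```python
def difference_mask(before, after, threshold=50):
    """Mark a pixel as changed (1) when its absolute intensity change exceeds threshold."""
    return [
        [1 if abs(a - b) > threshold else 0 for b, a in zip(before_row, after_row)]
        for before_row, after_row in zip(before, after)
    ]


def change_metrics(predicted, target):
    """Compare two binary masks pixel by pixel and return standard metrics."""
    tp = fp = fn = 0
    for pred_row, true_row in zip(predicted, target):
        for p, t in zip(pred_row, true_row):
            if p and t:
                tp += 1
            elif p and not t:
                fp += 1
            elif t and not p:
                fn += 1

    # Undefined ratios (empty denominators) are reported as 0.0 in this sketch.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    iou = tp / (tp + fp + fn) if tp + fp + fn else 0.0
    return {"precision": precision, "recall": recall, "f1": f1, "iou": iou}
```

The same change_metrics function can score the output of a learned model once its predictions are thresholded into a 0/1 mask, which makes the baseline and the model directly comparable.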
Submission Guidelines
Submit a GitHub repo or Google Drive link containing:
o Code (Jupyter Notebook or Python scripts)
o Preprocessed sample dataset (or a script to download)
o Final report (max 3 pages)
o README with setup instructions
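If you ship a download script instead of a preprocessed sample, it could look roughly like the sketch below. The function names and the default data directory are hypothetical, and it assumes a direct, unauthenticated link to a zip archive. Kaggle datasets generally require a logged-in session or API credentials, so this approach would not work for them as written.

```python
import shutil
import urllib.request
import zipfile
from pathlib import Path
from urllib.parse import urlparse


def filename_from_url(url):
    """Return the last path component of a URL, ignoring any query string."""
    return Path(urlparse(url).path).name


def download_and_extract(url, data_dir="data"):
    """Download url into data_dir (skipping existing files) and unzip zip archives."""
    data_dir = Path(data_dir)
    data_dir.mkdir(parents=True, exist_ok=True)
    archive = data_dir / filename_from_url(url)

    if not archive.exists():
        # Stream the response to disk rather than loading it all into memory.
        with urllib.request.urlopen(url) as response, open(archive, "wb") as out:
            shutil.copyfileobj(response, out)

    if zipfile.is_zipfile(archive):
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(data_dir / archive.stem)
    return archive
```

Calling download_and_extract with a dataset link from the track description would place the archive and its extracted contents under data/. Skipping files that already exist keeps repeated runs of the setup instructions cheap.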
Bonus Points For:
Deploying your model via a simple API (FastAPI/Flask)
Creating a basic visualization dashboard (Streamlit/Gradio)
Writing clean, well-documented, modular code
Demonstrating awareness of ethical AI or bias concerns
Evaluation Criteria:
Your submission will be evaluated based on the following:
Problem Understanding – Clarity in problem definition and domain relevance
Model Choice & Justification – Appropriateness and reasoning behind selected approach
Data Handling – Quality of preprocessing, transformation, and use of dataset
Implementation & Code Quality – Clean, modular, and reproducible code
Results & Evaluation – Use of appropriate metrics and insights from results
Reporting – Clarity and completeness of your write-up
Bonus Points – Deployment, dashboards, visualization, or ethical/geospatial considerations
This assignment will help us evaluate your proficiency in coding and ML concepts, as well as your
problem-solving abilities. Good luck!