Hamoye Data Science Internships Handbook Spring 2024
Welcome to Hamoye
Mission
Theme
Program Structure
Admission Requirements
Program Tracks
Data Science
MLOps/Data Engineering
DevOps
Data Storytelling
Program Curricula
A. Foundational Courses
B. Curricula Courses
Data Science Track
Data Engineering Track
DevOps Track
Data Storytelling Track
C. Syllabi for all Tracks Courses
D. Mandatory Projects
Premiere Project
Capstone Project
Project Groupings
E. Webinars and Career Coaching sessions with industry Experts
F. Certification and Hamoye Fellowship
G. Non-Certification
H. Other offerings
Program Timeline
How To Use the Platform
Communication Channels
Social Media
Code of Conduct
Frequently asked Questions
Welcome to Hamoye
Our mission is to help prepare young professionals and college interns for their chosen
careers and the future of work. It is well known that landing a first professional job
after college is universally challenging. Depending on the country, it may be
the most difficult challenge young graduates will ever face. Whether one is successful at
it or not could determine the course of one’s life. This is where Hamoye comes in. We
offer world-class, specially designed, internship-style learning programs to help
prospective interns acquire real-world work experience, all for free. What’s the catch?
There is no catch! Our goal is to help struggling young professionals or college
students who want to set themselves up for success with some of the tools they need to
differentiate themselves in the tough tech job market.
Our internship programs simulate the real work environment, while cutting out all of the
unnecessary distractions that come with traditional internships. Not only do we have the
capacity to provide everyone with an opportunity to acquire job-ready expertise, we
have the special sauce to ensure our interns get the best out of the experience. Interns
will work both alone and with peers in a fast-paced, competitive environment that keeps
them motivated throughout their chosen program.
Mission
To jumpstart desirable careers for millions of people all over the world and develop an
army of problem solvers for the world. We are a tech startup founded to help young
professionals acquire in-demand expertise by redesigning how internships work.
Theme
Building the future of work
Program Structure
Hamoye Data Science Internship is a 4-month remote internship program targeted at
individuals who have solid analytical skills and wish to start a career or advance their
skills for rapid career development in Machine Learning Ops/Data Engineering, Data
Science, Data Storytelling, DevOps and Generative AI. Interns will learn the rudiments
of machine learning, coding, statistics, and models for data science. Participants will be
required to progress through a competition of six sprints, each lasting 2-3 weeks. Each sprint
has a pre-set training curriculum and a real-world use case challenge.
At the end of this internship, interns will apply skills they learned to collaborate and
provide data-centric solutions to real world problems. For instance, interns can use a
dataset to build a machine learning model. It is expected that interns who enroll in this
option already possess basic programming skills.
Admission Requirements
This internship is open to everyone, professionals and students in higher institutions
alike, who is ready to commit the time and effort that the rigorous program demands. No
entrance exam is required.
Program Tracks
The program is divided into five tracks:
Generative AI
Data Science
The data science track includes multiple carefully crafted courses. These include
Introduction to Python for Machine Learning, Regression in Machine Learning,
Classification in Machine Learning, Neural Networks, Image Recognition, and Object
Detection, and Practical Time Series Analysis. At the end of the course curriculum is the
capstone project.
MLOps/Data Engineering
The data engineering track shares the first two modules with Data Science, i.e.
Introduction to Python for Machine Learning and Regression in Machine Learning.
However, in addition to these introductory courses, it also covers multiple courses in
MLOps, specifically Introduction to Kubernetes, Machine Learning Operations with
Kubeflow, Kubeflow Components and Pipelines, followed by a capstone project.
DevOps
DevOps is the combination of cultural philosophies, practices, and tools that increases
an organization’s ability to deliver applications and services at high velocity. The
curriculum begins with Introduction to DevOps, followed by CI/CD (continuous
integration and continuous delivery or continuous deployment), Containers and
Infrastructure as Code, and Cloud Computing, each course with its compulsory labs. At
the end of the course curriculum is the capstone project.
Data Storytelling
Data storytelling is the practice of building a narrative around a set of data and its
accompanying visualizations, to help convey the meaning of that data in a powerful and
compelling fashion. The curriculum starts with Introduction to Python for Machine
Learning, followed by Data Storytelling specific courses such as Becoming a Data
Storyteller, Python Libraries for Data Storytelling, Data Storytelling in Pattern
Recognition, and Data Curation for Storytelling. As in the other tracks, the course
curriculum ends with the capstone project.
Program Curricula
The internship is a carefully designed sequence of courses and projects that takes interns
from beginner to advanced level in their respective tracks. Course curricula and projects
have been designed with inputs from experts around the world. Each track has
respective capstone projects, structured to allow interns to leverage all the knowledge
and skills gained from the course while working on their chosen projects with the
guidance of professionals across the world. The projects will boost interns’ portfolios
and prove their job-readiness to prospective employers.
A. Foundational Courses
- Grades do not count towards final certification
Before the internship begins, interns are free to take some introductory courses to whet
their appetite as they prepare for the real deal. This module will expose interns to the basic
programming language of data science and introduce them to the world of machine
learning, some of the most important Python libraries, and the classification of machine
learning problems. The courses in this module include:
Note that these courses are optional, as it is expected that every intern is already
familiar with the concepts therein.
B. Curricula Courses
- All Grades count towards final certification
Generative AI Track
Stage A
Introduction to Generative AI
Artificial Intelligence (AI) has made significant advancements, impacting businesses,
societies, and individuals. For about the last decade, deep learning has evolved to
process and generate unstructured data like text, images, videos, and more. These
advanced AI models, which are based on deep learning, have gained popularity in
various industries, and include large language models (LLMs). There is currently a
significant level of hype in both the media and the industry surrounding AI. This is driven
by various factors, including advancements in technology, high-profile applications, and
the potential for transformative impacts across multiple sectors.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.
Stage B
Getting Started with OpenAI: Fundamentals of Prompt Engineering
This course offers a basic overview of OpenAI's technology suite, focusing on the
foundational principles of prompt engineering to effectively communicate with AI
models. Participants will learn how to craft precise prompts to generate desired
responses, unlocking the full potential of AI in various applications.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Stage C
Advanced Prompt Engineering and LangChain
Better prompts produce better results across different tasks. This course covers more
advanced prompt engineering tips to enhance prompts, optimize performance, and fortify
the security of LLM-based applications.
The quiz for this course on the platform needs to be submitted before the course due date.
Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.
Stage D
Model Fine-Tuning
The course covers the concepts of pattern recognition, storytelling, and their
relationship. Although originally developed using Wolfram Programming Language,
interns are expected to go through the course as a guide, and produce a storytelling
project using Python Programming Language as the language of choice for data
analysis and visualization.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.
Stage E
Gen AI: End To End Project
Here we will build on top of this novel transformer architecture. While text-to-text
generative models like GPT are impressive, one of the most versatile solutions that AI
companies offer is the ability to generate text embeddings based on powerful LLMs.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.
Data Science Track
Stage A
Introduction to Python for Machine Learning
This course will expose interns to machine learning techniques and how they can serve
as an invaluable tool for solving the grand challenges facing humanity, such as
climate change. Interns will work on cleaning, wrangling, exploring, and providing summary
statistics and interesting visualizations on a public utility dataset containing millions of rows
and tens of files in structured format.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.
Stage B
Regression in Machine Learning
In this course, interns will develop a multivariate multiple regression model to study the
effect of eight input variables on two output variables, which are the heating load and
the cooling load, of residential buildings.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.
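As a rough illustration of what a multivariate multiple regression with eight input variables and two output variables looks like in code, here is a minimal sketch using ordinary least squares in NumPy. The data is synthetic and generated only for this example; the variable names, the sample size, and the choice of plain least squares are assumptions, not the course's actual dataset or method.

```python
import numpy as np

# Synthetic stand-in data (an assumption): 200 buildings, 8 input variables,
# and 2 output variables standing in for heating load and cooling load.
rng = np.random.default_rng(seed=0)
n_samples, n_inputs, n_outputs = 200, 8, 2
X = rng.normal(size=(n_samples, n_inputs))
true_coefficients = rng.normal(size=(n_inputs, n_outputs))
true_intercepts = np.array([20.0, 25.0])
noise = 0.1 * rng.normal(size=(n_samples, n_outputs))
Y = X @ true_coefficients + true_intercepts + noise

# Add a column of ones so the intercepts are fitted with the coefficients.
X_design = np.hstack([np.ones((n_samples, 1)), X])

# One least-squares solve fits both output columns at once.
weights, *_ = np.linalg.lstsq(X_design, Y, rcond=None)
intercepts, coefficients = weights[0], weights[1:]

# Coefficient of determination (R^2) for each output variable.
predictions = X_design @ weights
residual_ss = ((Y - predictions) ** 2).sum(axis=0)
total_ss = ((Y - Y.mean(axis=0)) ** 2).sum(axis=0)
r_squared = 1.0 - residual_ss / total_ss

print("coefficients shape:", coefficients.shape)
print("R^2 per output:", r_squared)
```

The coefficient matrix has one row per input and one column per output, so each column can be read as a separate regression that shares the same inputs.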
Stage C
Classification in Machine Learning
In this course, interns will use provided datasets to classify and predict the quality
metrics (score) of the ecological footprint data for the different countries. This data
includes total and per capita national biocapacity, the ecological footprint of
consumption, the ecological footprint of production, and total area in hectares.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.
Stage D
Neural Networks, Image Recognition, and Object Detection
In this course, interns will explore the building blocks of neural networks, which are
versatile, powerful, and scalable. Interns will learn how to use them to tackle complex
problems like the use of convolutional neural networks for image recognition.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.
Stage E
Practical Time Series Analysis
In this course, interns will explore and build time series forecasting models for
measurements of electric power consumption in one household with a one-minute
sampling rate over a period of almost 4 years.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.
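To give a feel for working with one-minute power readings like those in the time series course, the sketch below downsamples minute-level data to hourly means and scores a seasonal-naive baseline forecast. The data is synthetic (14 days rather than almost 4 years), and the variable names and the choice of baseline are assumptions made for illustration only.

```python
import numpy as np

# Synthetic stand-in data (an assumption): 14 days of one-minute readings
# of household power in kilowatts, with a daily cycle plus noise.
rng = np.random.default_rng(seed=0)
minutes_per_hour, hours_per_day, n_days = 60, 24, 14
n_minutes = minutes_per_hour * hours_per_day * n_days
minute_index = np.arange(n_minutes)
daily_cycle = np.sin(2 * np.pi * minute_index / (minutes_per_hour * hours_per_day))
power_kw = 1.5 + 0.8 * daily_cycle + 0.1 * rng.normal(size=n_minutes)

# Downsample from one-minute readings to hourly means.
hourly_kw = power_kw.reshape(-1, minutes_per_hour).mean(axis=1)

# Hold out the last day, then forecast it with a seasonal-naive baseline:
# each hour is predicted to equal the same hour on the previous day.
train, test = hourly_kw[:-hours_per_day], hourly_kw[-hours_per_day:]
forecast = train[-hours_per_day:]

mean_absolute_error = np.abs(test - forecast).mean()
print("hourly points:", hourly_kw.size)
print("seasonal-naive MAE (kW):", round(float(mean_absolute_error), 3))
```

A baseline like this is a useful yardstick: a forecasting model is only worth keeping if it beats the seasonal-naive error on held-out data.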
Data Engineering Track
Stage A
Introduction to Python for Machine Learning
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.
Stage B
Regression in Machine Learning
In this course, interns will develop a multivariate multiple regression model to study the
effect of eight input variables on two output variables, which are the heating load and
the cooling load, of residential buildings.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.
Stage C
Introduction to Kubernetes
This course explores what Kubernetes is, its architecture and building blocks, how it can
be run on a local system or in the cloud, different ways to configure and protect
sensitive information, and how to let external applications access a
Kubernetes application.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Stage D
Machine Learning Operations with Kubeflow
In this course, interns will learn about the application of machine learning operations
with Kubeflow, and how Kubeflow can make deployments of machine learning
workflows on Kubernetes simple, portable, and scalable, resulting in faster releases and
lower operating costs.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Stage E
Kubeflow Components and Pipelines
In this course, interns will learn how to build Kubeflow components and connect them in
a pipeline. This course will also provide the basic knowledge of building machine
learning pipelines using Kubeflow.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
DevOps and Cloud Engineering Track
Stage A
Introduction to Python for Machine Learning
This course will expose interns to machine learning techniques and how they can serve
as an invaluable tool for solving the grand challenges facing humanity, such as
climate change. Interns will work on cleaning, wrangling, exploring, and providing summary
statistics and interesting visualizations on a public utility dataset containing millions of rows
and tens of files in structured format.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.
Stage B
Introduction to DevOps
DevOps is a compound of development (Dev) and operations (Ops). This course delves into the
essential topics that businesses must address in order to begin their DevOps
transformation journey, such as altering the team's mentality and setting timeframes
and targets.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.
Stage C
CI/CD
Continuous integration (CI) is the practice of automatically building and testing code whenever a
team member pushes changes to version control.
CI encourages developers to contribute their code and unit tests after each small task is
completed, by merging their modifications into a shared version control repository.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Completing the Labs is mandatory and each lab link should be uploaded by the intern to
the database provided at the start of the course.
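To make the idea of "code and unit tests" in continuous integration concrete, here is a minimal sketch of the kind of check a CI server could run automatically on every push. The function, the test names, and the use of Python's built-in unittest module are illustrative assumptions; the course labs may use different tools.

```python
import unittest


def celsius_to_fahrenheit(celsius: float) -> float:
    """Convert a temperature from degrees Celsius to degrees Fahrenheit."""
    return celsius * 9.0 / 5.0 + 32.0


class CelsiusToFahrenheitTest(unittest.TestCase):
    """Unit tests that a CI job would run after each merge."""

    def test_freezing_point(self):
        self.assertEqual(celsius_to_fahrenheit(0.0), 32.0)

    def test_boiling_point(self):
        self.assertEqual(celsius_to_fahrenheit(100.0), 212.0)


# Load and run the tests programmatically, as a CI step might do.
suite = unittest.defaultTestLoader.loadTestsFromTestCase(CelsiusToFahrenheitTest)
result = unittest.TextTestRunner(verbosity=1).run(suite)
print("all tests passed:", result.wasSuccessful())
```

If any test fails, the CI job fails, and the team learns about the broken change before it reaches the shared repository's main line of work.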
Stage D
Containers and Infrastructure as Code
Containers are a solution to the challenge of moving software from one computing
environment to another while keeping it running consistently. Did you know that containers
are not the same thing as Docker? If you don't, you're in the right place. Enjoy the course!
In the Infrastructure as Code part of this course, you'll learn more about Chef, Ansible, and
Terraform. Are you ready?
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Completing the Labs is mandatory and each lab link should be uploaded by the intern to
the database provided at the start of the course.
Stage E
Cloud Computing
In this course, we hope you'll get a good foundation of what Cloud Computing is all about and
how to get started.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Completing the Labs is mandatory and each lab link should be uploaded by the intern to
the database provided at the start of the course.
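Data Storytelling Track
Stage A
Introduction to Python for Machine Learning
This course will expose interns to machine learning techniques and how they can serve
as an invaluable tool for solving the grand challenges facing humanity, such as
climate change. Interns will work on cleaning, wrangling, exploring, and providing summary
statistics and interesting visualizations on a public utility dataset containing millions of rows
and tens of files in structured format.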
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.
Stage B
Becoming a Data Storyteller
In this course, interns will learn the basics of technical writing, and how to become
technical writers. This course outlines and defines the technical writing process, best
practices, and steps to launch their technical writing career.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Stage C
Python Libraries for Data Storytelling
This course will teach interns to use different Python libraries such as seaborn, Matplotlib,
ggplot, and Plotly by building data storytelling projects.
The quiz for this course on the platform needs to be submitted before the course due date.
Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.
Stage D
Data Storytelling in Pattern Recognition
The course covers the concepts of pattern recognition, storytelling, and their
relationship. Although originally developed using Wolfram Programming Language,
interns are expected to go through the course as a guide, and produce a storytelling
project using Python Programming Language as the language of choice for data
analysis and visualization.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.
Stage E
Data Curation for Storytelling
In this course, interns will learn the what, the why, and the how of data curation.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.
Track Selection
In the initial week of Stage A, all interns, with the exception of those pursuing the
Generative AI track, must fill out a mandatory track selection form. It is important for
interns to be mindful of the form's submission deadline.
D. Mandatory Projects
There will be two mandatory projects to be completed in assigned groups, and only
active members of the groups will be rewarded, so no slacking, folks!
Premiere Project
Upon the start of Stage B, interns will be organized into groups and provided
information for the first presentation project of this internship. Groups include interns in
the Data Science, Data Engineering, Data Storytelling, DevOps & Cloud Engineering,
and Generative AI tracks. Find premiere project documentation samples here.
Capstone Project
At the start of the program, interns can get a glimpse of the various projects that will be
worked on by visiting our blog. The capstone projects allow interns to leverage what
they've learnt and provide data-driven solutions to some of the problems facing
humanity. These capstone projects have been designed to address different challenges
across many sectors. We are advocates of learning by doing since we believe that
true mastery of any skill, especially something as practical as coding, comes through
doing.
As a result, interns who are enrolled in any Hamoye program must be ready to get
their hands dirty with work, which will make refined experts out of them.
Our past interns have testified to the efficacy of capstone projects, and their testimonies
and experiences can be found here and here.
Project Groupings
To reiterate, group projects are a requirement of the Hamoye Internships. Interns are
placed in small groups so that every member can contribute to all the
projects that they will be involved in.
Before the conclusion of Stage B, all interns will find out what group they have been
placed in, and the premiere projects must be executed within these groups. Just before
Stage D is concluded, interns will be regrouped into their capstone project groups.
Group members are encouraged to meet virtually and get acquainted with
each other in order to build the rapport they will need when the projects start.
Working in groups will help interns to simulate a typical work environment, and they will
find out it might be easier to work with some group members than others. Nonetheless,
everyone has to actively participate in the project to get the desired outcome.
Group members will be asked to choose a Group Lead, Assistant Group Lead and a
Query Analyst.
Important: No group switch or swap is allowed. Interns who do not adhere to this
rule will be scored zero in the premiere or capstone project or both, depending on
when the switch/swap occurred. Group Leads and Assistant Group Leads should also
not allow interns who are not on their group list to participate in their projects.
In addition, interns who score 60% or more in each course quiz they complete will
also receive an automatically generated certificate for the course as soon as the
course is officially closed on hamoye.com.
Aside from the core criteria listed above, here are some of your vital responsibilities as an
intern that will equally affect your chances of becoming a Hamoye Fellow:
1. Include your correct 16-digit student ID in forms whenever you submit projects.
Here is a guide to help you find your Hamoye Student ID. Inputting an incorrect
Student ID would guarantee a zero grade in that project.
2. Double check your code and projects to ensure that they are correctly uploaded;
if not, identify empty uploads and re-upload files. There will be no other chances
to resubmit your projects after the due dates. Click this link to learn how to submit
your tag along project at the end of each stage.
3. Ensure you have a stable internet connection when attempting quizzes and
project uploads. There will be no other chances to reattempt quizzes once a quiz
is submitted. Learn how to save your quiz progress here.
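Since an incorrect Student ID leads to a zero grade, it may help to sanity-check the ID before pasting it into a form. The helper below is a minimal sketch that assumes the ID consists of exactly 16 numeric digits; the function name and the sample IDs are hypothetical, and the check cannot confirm that an ID actually belongs to you.

```python
def is_valid_student_id(student_id: str) -> bool:
    """Return True when the ID, ignoring surrounding spaces, is exactly 16 ASCII digits."""
    cleaned = student_id.strip()
    return len(cleaned) == 16 and cleaned.isascii() and cleaned.isdigit()


# Hypothetical sample values, not real Hamoye Student IDs.
print(is_valid_student_id("1234567890123456"))   # 16 digits
print(is_valid_student_id("12345678"))           # too short
print(is_valid_student_id("12345678901234AB"))   # contains letters
```

A format check like this only catches typing slips such as a missing digit, so it is still worth comparing the ID against your profile page before submitting.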
The Hamoye fellowship is a body of Hamoye’s past interns, all of whom have been
certified and found eligible for employment opportunities within Hamoye and other data
science firms.
G. Non-Certification
There could be instances where students don’t meet the criteria set out in Section E,
for example because of illness, a family situation, or some other unforeseen circumstance. This is
why each successfully completed course with a grade of 60% or more is rewarded with
a certificate, so that you are rewarded as you go. Basically, everyone who puts in the
work as best as they can should have one or more certificates to show for the different
courses completed with a score of 60% or more.
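The per-course certificate rule can be sketched as a simple threshold check. The 60% pass mark comes from this handbook; the function name, the data layout, and the sample scores are assumptions for illustration, and the sketch ignores the separate condition that the course must be officially closed.

```python
from typing import Dict, List

PASS_MARK_PERCENT = 60.0  # threshold stated in this handbook


def courses_with_certificates(quiz_scores: Dict[str, float]) -> List[str]:
    """Return the courses whose quiz score is 60% or more."""
    return [
        course
        for course, score in quiz_scores.items()
        if score >= PASS_MARK_PERCENT
    ]


# Hypothetical scores for one intern.
scores = {
    "Introduction to Python for Machine Learning": 85.0,
    "Regression in Machine Learning": 59.5,
    "Classification in Machine Learning": 60.0,
}
print(courses_with_certificates(scores))
```

Note that a score of exactly 60% qualifies, while 59.5% does not.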
H. Other offerings
- Awards for best intern per sprint
- Career coaching and recommendations
- Published project articles on the Hamoye blog
- Membership of OneCV: Hamoye’s exclusive job market
- Access to Hamoye AI Labs Faculty
Program Timeline
Component    Period    Duration (weeks)    Dates
How To Use the Platform
Our Platform is uniquely designed to give our interns the best learning experience.
Aside from solely being a learning management system, we have incorporated features
that enable a holistic learning experience, consisting of Study groups, Chat boxes, and
much more. It is important that interns are first signed up on hamoye.com and then
familiarize themselves with the platform before the start of the internship. The following
has been prepared to help you enjoy your first time on hamoye.com:
Visit our blog for more guides on how to use the platform.
Communication Channels
Communication is best done when it goes two ways. Interns will not spend 100% of the
time studying materials; there will always be avenues for constructive conversations
between them and the organization.
The following channels will be used for the purpose of communication during the
program: Hamoye platform discussion forum, email, Zoom, and Slack.
Zoom/YouTube livestream will be the official channel for all audio and video calls
including webinars.
Slack will be the interactive space for the course. We understand the importance of
interaction, and this space is created with that in mind. Interns are encouraged to
interact with each other and also with the Hamoye Team regarding any questions they
might have. Different channels have been created to serve several purposes, and more
may be created as the need arises.
Hamoye.com: For each of the courses, there is a discussion button that allows interns
to share ideas around concepts being taught, ask questions from their peers, and
provide answers to questions.
Social Media
Stay connected with us on social media in order to stay updated on recent happenings.
We also take to our social media channels to celebrate outstanding interns.
Code of Conduct
The Code of Conduct aims to ensure that all Interns understand the standard of conduct
required. Interns are expected to uphold the Code of Conduct at all times while learning,
carrying out their duties and interactions.
A. Be Responsible: Carry out all learning activities responsibly, safely, and in a
competent manner. The primary responsibility is to learn; we expect interns to conduct
themselves in a manner worthy of emulation, without being judgemental or biased.
Avoid providing other interns with information that is misrepresented or falsified.
B. Be Respectful: Respect other Interns and honor their integrity without prejudice to
economic status, abilities, race and customs, cultural and religious needs.
C. Be Present: Attend all learning activities, i.e. Projects, Webinars, Weekly Coding
Clubs, and Slack Engagements, diligently; if indisposed, interns should inform
community members and the Hamoye Team Representative as early as possible. Avoid
abandoning learning abruptly, especially during periods of assigned group
responsibilities, as this will have a negative effect on the learning outcomes, i.e. grades, of
the other group members. Should there be an absolute need for abrupt withdrawal from
learning, interns should inform the representatives of the Hamoye Team before leaving.
All information confidential to Hamoye, especially personal data of interns, must not be
disclosed or used for any other purposes unless absolutely necessary to help our
interns' learning experience with Hamoye. It is advisable to check with the Hamoye
Team Representative or the Data Protection Officer at [email protected] if there are any
queries or doubts. Photographs and videos are considered personal data of the person
photographed or filmed. If an intern would like to take photographs or videos of other
interns, consent must be sought from them in a private and personal arrangement. As
an intern, you must destroy all copies of the name list and any documents containing
personal data of other interns, and all other confidential information of Hamoye,
immediately after your learning ends.
iv. Do’s
As an intern, you are required to:
A. Be aware that you will be identified with Hamoye, hence, it is required of you to
understand the mission and core values of the organization
B. Be punctual for your learning activities and engagements
C. Lead by example and be good role models for other Interns
D. Be mindful of interns who are overly friendly or attention-seeking. Maintain
appropriate behavior and avoid any inappropriate contact with other
interns.
E. Maintain confidentiality and respect the privacy and dignity of other interns.
F. Respect the feelings of other interns.
G. Be generous with encouragement and praise by regarding and celebrating the
talents and achievements of other interns.
H. Instill independence and self-confidence by providing guidance to other interns to
complete their tasks by themselves, rather than doing things for them.
I. Consult the Hamoye staff when in doubt
J. Give compliments or feedback to Hamoye staff on positive or adverse encounters
while learning
K. Report immediately to the Hamoye team representative, should you be aware of
any information from other interns that is of potential threat or harm to Interns
and/or others.
v. Don'ts
As an intern in the Hamoye programs, the following are prohibited:
E. Uploading interns’ photographs to an online platform, including websites, blogs,
or any form of social media, without prior permission from the people in the
photograph.
F. Sharing information about the interns and Hamoye with others, unless for
promotional purposes.
G. Cheating on the quizzes or in the coding labs is strictly prohibited.
Quizzes and labs are done independently and not with a friend or as a group.
Interns found cheating will be promptly removed from the program. Retroactive
dismissal will also be carried out if it is later found that collusion
occurred.
Hamoye internships offer unique opportunities for interns that simulate exact activities in
a corporate environment. You will work on real life projects with your peers across the
world. You will also be privileged to attend webinars conducted by experts in the data
science and machine learning fields and unblocking sessions conducted by the Hamoye
faculty. Certificates of completion and lots more will be awarded to interns who
conclude their internship with excellence. The internship handbook explicitly explains the
internship structure.
I am yet to take my quiz/ I can’t find the quiz in my course, please how do I go
about that?
Here is a simple guide to achieve that.
How do I ensure the optimum experience during my quiz taking? What if I want to
save my quiz progress?
It is important to save your quiz progress as often as possible while taking your quiz in
order to ensure the optimum experience. Take a look at this too.
We are currently preparing for our premiere project. Please, is there any template
for our slides?
Yes, we have one. Check it out here. You might also want to consider these guidelines.
I need a Kaggle account to conclude this course, how do I get my phone number
verified fast?
Here’s how.
No, one cannot. In any case you can re-register for the next cohort and start the
application process again.
The course materials will be delivered in various forms on the Hamoye web app. The
course will be largely delivered in scripts, a bit in videos, and one-on-one interactions with
the Hamoye team.
My quiz automatically logged me out and graded me low despite having enough
time. How can I retrieve my lost time?
The quiz on the platform can only be taken once, so be sure to have a strong internet
connection before attempting the quiz. Most importantly, be sure to save your quiz
progress as often as possible while taking your quiz. This medium article is helpful.
After your registration process is completed, you will be added to a Slack workspace via
your email address (kindly ensure a valid email address is provided). Our programs
coordinator will send timely details and information pertaining to the internship via the
Slack channel, and all interactions and engagements between our interns are carried
out within this workspace. Once the internship begins and you have been assigned to your
group, you can then begin to enjoy learning with the community.
The course will be delivered in an e-learning style. Hence, a good internet connection
and a reliable device (tablet, desktop, laptop) are essential for learning.
What’s a data scientist or a machine learning expert without real-life project experience?
From the start of the internship, interns will be able to choose from a plethora of projects
on the Hamoye website, and a timeline of about 4 weeks will be allotted to work on the project,
with peers or independently. The project will be judged by a group of experts across various
sectors.
The Hamoye platform is replete with diverse projects across various sectors. At the
beginning of the internship, together with your group, you will be assigned the projects
which you will work on and present to industry experts a few weeks after the start of the
program and at the end of the internship.
● Data Science
● MLOps/Data Engineering
● DevOps & Cloud Engineering
● Data Storytelling
No, you are expected to select only one track. If you choose more than one track, you
will be disqualified from the program.
● Every participant has the opportunity to learn using a pre-set training curriculum,
connect, collaborate and build a real-world use case.
● Every participant who successfully completes a sprint and the associated
challenge will earn a certificate from Hamoye for each sprint completed.
● Participants will have an opportunity to work on a unique AI open-source project.
● The top participants from each sprint will be declared sprint winners and awarded
prizes.
● Finalists will be invited to join Hamoye AI Labs or our partner companies across
the world.
Data Science
Which application do you recommend for my coding task in order to arrive at the
accurate answer? Google Colab or Jupyter Notebook?
Either of them should work. Best practice is to keep your version of Jupyter
Notebook up to date, i.e. at or close to the most recent version.
Are there tag along projects at every stage?
Every stage in this track has a compulsory tag along project.
Data Engineering
Is there a tag along project for stage C?
No, for the DE track, there are only tag along projects for Stages A and B.
I discovered that I was only able to create a limited GCP account. What do I do,
since I still have to use it till Stage E?
It is advisable to begin with one of the accounts (GCP or Azure), use up the free
trial, and then create the other account. One of these products
lasts 30 days, and that should span two stages; the other spans 12 months and would
be sufficient for all the stages in this program.
Data Storytelling
Is there a tag along project for Stage B?
No, there is no tag along project for Stage B in the DST track.