0% found this document useful (0 votes)
54 views29 pages

Hamoye Data Science Internships Handbook Spring 2024

Uploaded by

Ronaldo Shah
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
54 views29 pages

Hamoye Data Science Internships Handbook Spring 2024

Uploaded by

Ronaldo Shah
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 29

Table of Contents

Welcome to Hamoye
Mission
Theme
Program Structure
Admission Requirements
Program Tracks
Data Science
MLOps/Data Engineering
DevOps
Data Storytelling
Program Curricula
A. Foundational Courses
B. Curricula Courses
Data Science Track
Data Engineering Track
DevOps Track
Data Storytelling Track
C. Syllabi for all Tracks Courses
D. Mandatory Projects
Premiere Project
Capstone Project
Project Groupings
E. Webinars and Career Coaching sessions with industry Experts
F. Certification and Hamoye Fellowship
G. Non-Certification
H. Other offerings
Program Timeline
How To Use the Platform
Communication Channels
Social Media
Code of Conduct
Frequently asked Questions

1
Welcome to Hamoye
Our mission is to help prepare young professionals and college interns for their choice
careers and the future of work. It is a known fact that getting the first professional job,
post college education, is universally challenging. Depending on the country, it may be
the most difficult challenge young graduates will ever face. Whether one is successful at
it or not could determine the course of one’s life. This is where Hamoye comes in. We
offer world class, specially designed, internship style learning programs, to help
prospective interns acquire real world work experience; all for free. What’s the catch?
There is no catch! Our goal is to help struggling young professionals or college
students who want to set themselves up for success with some of the tools they need to
differentiate themselves in the tough tech job market.

Our internship programs simulate the real work environment, while cutting out all of the
unnecessary distractions that come with traditional internships. Not only do we have the
capacity to provide everyone with an opportunity to acquire job-ready expertise, we
have the special sauce to ensure our interns get the best out of the experience. Also,
interns will work alone and with peers, in a fast paced, competitive environment, to keep
them motivated throughout their choice program.

Our internship programs are offered 3 to 4 times a year and involve:


(i) specially designed short courses with real-world projects to introduce interns to the
tools they need to stand out
(ii) exposure to industry experts via webinars
(iii) career coaching and
(iv) presentation project with peers.

Our Social Learning Management Platform (hamoye.com) is designed to help interns


participate from anywhere in the world, track their accomplishments, and showcase
themselves to potential employers.

2
Mission
To jumpstart desirable careers for millions of people all over the world and develop an
army of problem solvers for the world. We are a tech startup founded to help young
professionals acquire in-demand expertise by redesigning how internships work.

Theme
Building the future of work

Program Structure
Hamoye Data Science Internship is a 4-month remote internship program targeted at
individuals who have solid analytical skills and wish to start a career or advance their
skills for rapid career development in Machine Learning Ops/Data Engineering, Data
Science, Data Storytelling, DevOps and Generative AI. Interns will learn the rudiments
of machine learning, coding, statistics, and models for data science. Participants will be
required to progress through a 6-sprint competition, each lasting 2-3 weeks. Each sprint
has a pre-set training curriculum and a real-world use case challenge.

At the end of this internship, interns will apply skills they learned to collaborate and
provide data-centric solutions to real world problems. For instance, interns can use a
dataset to build a machine learning model. It is expected that interns who enroll in this
option already possess basic programming skills.

Admission Requirements
This internship is open to everyone- professionals and students in higher institutions
who are ready to commit the time and effort to scale the rigorous program- no entrance
exam required.

3
Program Tracks
The program is divided into five tracks:

Generative AI

Generative AI is a transformative field that focuses on creating AI models capable of


generating new and innovative content autonomously. Our curriculum is designed to
immerse learners in this exciting domain, starting with foundational courses like
Introduction to Generative AI and OpenAI & Prompt Engineering Basics. As learners
progress, they delve deeper into advanced topics such as Advanced Prompt
Engineering & Langchain and Fine-Tuning Pretrained Models. The journey culminates
with an End-to-End Project Integration, where learners apply their skills to create
comprehensive generative AI solutions

Data Science
The data science track includes multiple carefully crafted courses. These include
Introduction to Python for Machine Learning, Regression in Machine Learning,
Classification in Machine Learning, Neural Network, Image Recognition and Object
Detection, Practical Time Series Analysis. At the end of the course curriculum is the
capstone project.

MLOps/Data Engineering
The data engineering track shares the first two modules with data Science i.e.
Introduction to Python for Machine Learning and Regression in Machine Learning.
However, in addition to these introductory courses, it also covers multiple courses in
MLOps, specifically Introduction to Kubernetes, Machine Learning Operations with
Kubeflow, Kubeflow Components and Pipelines, followed by a capstone project.

DevOps
DevOps is the combination of cultural philosophies, practices, and tools that increases
an organization’s ability to deliver applications and services at high velocity. The
curriculum begins with Introduction to DevOps, followed by CICD (continuous
integration and continuous delivery or continuous deployment), Containers and
Infrastructure as Code, and Cloud computing; each course with its compulsory labs. At
the end of the course curriculum is the capstone project.

4
Data Storytelling
Data storytelling is the practice of building a narrative around a set of data and its
accompanying visualizations, to help convey the meaning of that data in a powerful and
compelling fashion. The curriculum starts with Introduction to Python for Machine
Learning and then Data Storytelling specific courses such as Becoming a Data
Storyteller, Python Libraries for Data Storytelling, Data Storytelling in Pattern
Recognition, and Data Curation for Storytelling follows at the end of the course

5
Program Curricula
The internship is a careful culmination of courses and projects designed to take interns
from beginner to advanced level in their respective tiers. Course curricula and projects
have been designed with inputs from experts around the world. Each track has
respective capstone projects, structured to allow interns to leverage all the knowledge
and skills gained from the course while working on their chosen projects with the
guidance of professionals across the world. The projects will boost interns’ portfolios
and prove their job-readiness to prospective employers.

A. Foundational Courses
- Grades do not count towards final certification

Before the internship begins, interns will have the liberty to take some appetite whetting
courses as they prepare for the real deal. This module will expose interns to the basic
programming language in data science and introduce them into the world of machine
learning, some of the most important python libraries, and classification of machine
learning problems. The courses in this module include:

1. Introduction to Python With Google Colab & Anaconda


2. Version Control Using Git

Note that these courses are optional, as it is expected that every intern is already
familiar with the concepts therein.

6
B. Curricula Courses
- All Grades count towards final certification

Generative AI Track
Stage A
Introduction to Generative AI
Artificial Intelligence (AI) has made significant advancements, impacting businesses,
societies, and individuals. For about the last decade, deep learning has evolved to
process and generate unstructured data like text, images, videos, and more. These
advanced AI models, which are based on deep learning, have gained popularity in
various industries, and include large language models (LLMs).There is currently a
significant level of hype in both the media and the industry surrounding AI. This is driven
by various factors, including advancements in technology, high-profile applications, and
the potential for transformative impacts across multiple sectors.

The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.

Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.

Stage B
Getting Started with OpenAI: Fundamentals of Prompt Engineering
This course offers a basic overview of OpenAI's technology suite, focusing on the
foundational principles of prompt engineering to effectively communicate with AI
models. Participants will learn how to craft precise prompts to generate desired
responses, unlocking the full potential of AI in various applications.

The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.

Stage C
Advanced Prompt Engineering and Langchain
It should be obvious that it helps to improve prompts to get better results on different
tasks. Here we will get some more advanced tips for prompt engineering to enhance our
prompts, optimize performance and fortify security of our LLM-based applications

7
Quiz for this course on the platform needs to be submitted before the course due date.

Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.

Stage D
Model Fine-Tuning
The course covers the concepts of pattern recognition, storytelling, and their
relationship. Although originally developed using Wolfram Programming Language,
interns are expected to go through the course as a guide ,and produce a storytelling
project using Python Programming Language as the language of choice for data
analysis and visualization.

The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.

Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.

Stage E
Gen AI: End To End Project

Here we will build on top of this novel transformer architecture. While text-to-text
generative models like GPT are impressive, one of the most versatile solutions that AI
companies offer is the ability to generate text embeddings based on powerful LLms.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.

Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.

Data Science Track

Stage A

8
Introduction to Python for Machine Learning
This course will expose interns to machine learning techniques and how they can serve
as an invaluable tool for solving one of the grand challenges posed to humanity such as
climate change. Interns will work on cleaning, wrangling, exploring, providing summary
statistics and interesting visualizations on a public utility data containing millions of rows
and tens of files in structured format.

The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission

Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.

Stage B
Regression in Machine Learning
In this course, interns will develop a multivariate multiple regression model to study the
effect of eight input variables on two output variables, which are the heating load and
the cooling load, of residential buildings.

The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission

Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.

Stage C
Classification in Machine Learning
In this course, interns will use provided datasets to classify and predict the quality
metrics (score) of the ecological footprint data for the different countries. This data
includes total and per capita national biocapacity, the ecological footprint of
consumption, the ecological footprint of production, and total area in hectares.

The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission

Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.

Stage D
Neural Networks, Image Recognition, and Object Detection

9
In this course, interns will explore the building blocks of neural networks, which are
versatile, powerful, and scalable. Interns will learn how to use them to tackle complex
problems like the use of convolutional neural networks for image recognition.

The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission

Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.

Stage E
Practical Time Series Analysis
In this course, interns will explore and build time series forecasting models for
measurements of electric power consumption in one household with a one-minute
sampling rate over a period of almost 4 years.
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission

Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.

Data Engineering Track


Stage A
Introduction to Python for Machine Learning
This course will expose interns to machine learning techniques and how they can serve
as an invaluable tool for solving one of the grand challenges posed to humanity such as
Climate Change. Interns will work on cleaning, wrangling, exploring, providing summary
statistics and interesting visualizations on a public utility data containing millions of rows
and tens of files in structured format.

The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission

Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.

Stage B
Regression in Machine Learning

10
In this course, interns will develop a multivariate multiple regression model to study the
effect of eight input variables on two output variables, which are the heating load and
the cooling load, of residential buildings.

The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.

Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.

Stage C
Introduction to Kubernetes
This course explores what kubernetes is, its architecture and building blocks, how it can
be run on our local system or in the cloud, different ways we can configure and protect
sensitive information, and how one can let external applications access one’s
kubernetes application.

The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.

Stage D
Machine Learning Operations with Kubeflow
In this course, interns will learn about the application of machine learning operations
with Kubeflow, how Kubeflow can be used to make deployments of machine learning
workflows on kubernetes simple, portable and scalable, resulting in faster release and
lower operating costs.

The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.

Stage E
Kubeflow components and Pipeline
In this course, interns will learn how to build Kubeflow components and connect them in
a pipeline. This course will also provide the basic knowledge of building machine
learning pipelines using Kubeflow.

The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.

11
DevOps and Cloud Engineering Track
Stage A
Introduction to Python for Machine Learning
This course will expose interns to machine learning techniques and how they can serve
as an invaluable tool for solving one of the grand challenges posed to humanity such as
Climate Change. Interns will work on cleaning, wrangling, exploring, providing summary
statistics and interesting visualizations on a public utility data containing millions of rows
and tens of files in structured format.

The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission

Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.

Stage B
Introduction to DevOps
The compound of development (Dev) and operations (Ops). This course delves into the
essential topics that businesses must address in order to begin their DevOps
transformation Journey, such as altering the team's mentality and setting timeframes
and targets.

The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.

Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.

Stage C
CICD

Continuous integration (CI) is the practice of automatically building and testing code whenever a
team member pushes changes to version control.

After each little job is completed, CI encourages developers to contribute their code and unit
tests by merging their modifications into a shared version control repository.

12
The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.

Completing the Labs is mandatory and each lab link should be uploaded by the intern to
the database provided at the start of the course.

Stage D
Containers and Infrastructure as Code
Containers are a solution to the challenge of moving software from one computing
environment to another while keeping it running consistently. Did you know Containers
are different from Dockers? If you don't, you're in the right place, enjoy the course!

In the Infrastructure As Code part of this course, you'll be learning more on Chef, Ansible and
Terraform, are you ready?

The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Completing the Labs is mandatory and each lab link should be uploaded by the intern to
the database provided at the start of the course.

Stage E
Cloud Computing
In this course, we hope you'll get a good foundation of what Cloud Computing is all about and
how to get started.

The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.
Completing the Labs is mandatory and each lab link should be uploaded by the intern to
the database provided at the start of the course.

Data Storytelling Track


Stage A
Introduction to Python for Machine Learning
This course will expose interns to machine learning techniques and how they can serve
as an invaluable tool for solving one of the grand challenges posed to humanity such as
Climate Change. Interns will work on cleaning, wrangling, exploring, providing summary

13
statistics and interesting visualizations on a public utility data containing millions of rows
and tens of files in structured format.

The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.

Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.

Stage B
Becoming a Data Storyteller
In this course, interns will learn the basics of technical writing, and how to become
technical writers. This course outlines and defines the technical writing process, best
practices, and steps to launch their technical writing career.

The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.

Stage C
Python Libraries for Data Storytelling
This course will teach interns the different python libraries such as seaborn, matplotlib,
GGplot, plotly, etc. by building data storytelling projects.

Quiz for this course on the platform needs to be submitted before the course due date.

Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.

Stage D
Data Storytelling in Pattern Recognition
The course covers the concepts of pattern recognition, storytelling, and their
relationship. Although originally developed using Wolfram Programming Language,
interns are expected to go through the course as a guide ,and produce a storytelling
project using Python Programming Language as the language of choice for data
analysis and visualization.

The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.

14
Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.

Stage E
Data Curation for Storytelling
In this course, interns will learn the what, the why, and the how of data curation.

The quiz(zes) for this course on the platform needs to be attempted and submitted
before the deadline for submission.

Completing the coding exercise is mandatory and the link should be uploaded by the
intern to the database provided at the start of the course.

Track Selection
In the initial week of Stage A, all interns, with the exception of those pursuing the
Generative AI track, must fill out a mandatory track selection form. It is important for
interns to be mindful of the form's submission deadline.

C. Syllabi for all Tracks Courses


● HDSC Spring '24 Stage A syllabus (all Tracks except Gen AI Track).pdf
● Hamoye Internships Spring '24 Generative AI Syllabus.pdf
● Hamoye Internships Spring '24 Data Science Syllabus.pdf
● Hamoye Internships Spring '24 Data Engineering/Mlops Syllabus.pdf
● Hamoye Internships Spring '24 DevOps Syllabus.pdf
● Hamoye Internships Spring '24 Data Storytelling Syllabus.pdf

D. Mandatory Projects

There will be two mandatory projects to be completed in assigned groups, and only
active members of the groups will be rewarded, so no slacking folks!

15
Premiere Project

Upon the start of Stage B, interns will be organized into groups and provided
information for the first presentation project of this internship. Groups include interns in
the Data Science, Data Engineering, Data Storytelling, DevOps & Cloud Engineering ,
and Generative AI tracks. Find premiere project documentation samples here

Capstone Project

At the start of the program, interns can get a glimpse of the various projects that will be
worked on by visiting our blog. The capstone projects allow interns to leverage on what
they've learnt, and provide data-driven solutions to some of the problems facing
humanity. These capstone projects have been designed to address different challenges
across many sectors. We are advocates of learning by doing since we believe that
true mastery of any skill, especially something as practical as coding, comes through
doing.

As a result, interns who are enrolled in any Hamoye programs must be ready to get
their hands dirty with work, which will make refined experts out of them.

Our past interns have testified to the efficacy of capstone projects, and their testimonies
and experiences can be found here and here.

Project Groupings
To reiterate, Group Projects are a requirement of the Hamoye Internships. Interns are
grouped into smaller sizes to accommodate the contribution of every member in all the
projects that they will be involved in.

Before the conclusion of Stage B, all interns will find out what group they have been
placed in, and the premiere projects must be executed within these groups. Just before
stage D is concluded, interns will be regrouped into their capstone project groups.

Group members are encouraged to meet each other virtually and get acquainted with
each other in order to facilitate the rapport they need when the projects will start.

16
Working in groups will help interns to simulate a typical work environment, and they will
find out it might be easier to work with some group members than others. Nonetheless,
everyone has to actively participate in the project to get the desired outcome.

Group members will be asked to choose a Group Lead, Assistant Group Lead and a
Query Analyst.

Important: No group switch or swap is allowed, Interns who do not adhere to this
rule will be scored zero in the premiere or capstone project or both, depending on
when the switch/swap occurred. Group and Assistant Group Leads should also
not allow interns who are not on their group list to participate in their projects.

E. Webinars and Career Coaching sessions


with industry Experts
All of the webinars and career coaching sessions are mandatory, and no intern should
miss out as attendance will be taken at each session. We want our interns to be
equipped to make decisions, get ideas and be motivated by those who have walked in
their shoes and who know exactly what it takes to find a fulfilling career. We spend a lot
of time organizing these sessions, the industry experts as well spend time in preparing
for these presentations, the least interns should do is join all webinars.

F. Certification and Hamoye Fellowship


The following must be fulfilled to obtain a Hamoye Fellowship Certificate. All interns who
wish to have a certificate must endeavor to fulfill these criteria:

1) Completion of all mandatory course quizzes completed on Hamoye.com


2) Completion of all mandatory coding/tag-along project or other coding
assignments
3) Active participation in all group projects.
4) Attendance from start to finish of all webinars
5) Obtain an average of 60%, no exceptions.

17
In addition, interns who score 60% or more in each course quiz they complete will
also receive an automatically generated certificate for the course as soon as the
course is officially closed on hamoye.com

Asides the core criteria listed above, here are some of your vital responsibilities as an
intern that will equally affect your chances of being an Hamoye Fellow:
1. Include your correct 16-digit student ID in forms whenever you submit projects.
Here is a guide to help you find your Hamoye Student ID. Inputting an incorrect
Student ID would guarantee a zero grade in that project.
2. Double check your codes and projects to ensure that they are correctly uploaded;
if not, identify empty uploads and re-upload files. There will be no other chances
to resubmit your projects after the due dates.Click this link to learn how to submit
your tag along project at the end of each stage
3. Ensure you have a stable internet connection when attempting quizzes and
project uploads. There will be no other chances to reattempt quizzes once a quiz
is submitted. Learn how to save your quiz progress here.

The Hamoye fellowship is a body of Hamoye’s past interns, all of whom have been
certified and found eligible for employment opportunities within Hamoye and other data
science firms.

G. Non-Certification
There could be instances where students don’t meet the criteria set out in Section E.
For example, illness, family situation or some other unforeseen circumstances. This is
why each successfully completed course with a grade of 60% or more is rewarded with
a certificate, so that you are rewarded as you go. Basically, everyone who puts in the
work as best as they can should have one or more certificates to show for the different
courses completed with a score of 60% or more.

H. Other offerings
- Awards for best intern per sprint
- Career coaching and recommendations
- Published project articles on the Hamoye blog
- Membership of OneCV: Hamoye’s exclusive job market
- Access to Hamoye AI Labs Faculty

18
Program Timeline
Component Period Duration Dates
(weeks)

Onboarding/Orientation Week 0 0 27th March- 2nd April

Stage A Weeks 1 & 2 2 3rd - 16th April

Stage B Weeks 3 & 4 2 17th- 30th April

BREAK Week 5 1 1st - 7th May

Stage C Weeks 6 & 7 2 8th - 21st May

PREMIERE PROJECT Week 8 1 25th May


PRESENTATION

Stage D Weeks 9 & 10 2 22nd May - 4th June

Stage E Weeks 11&12 2 5th - 18th June

Stage F- CAPSTONE Weeks 13-16 5 19th June - 24th July


PROJECT Presentation - 27th July
PRESENTATION

19
How To Use the Platform
Our Platform is uniquely designed to give our interns the best learning experience.
Aside from solely being a learning management system, we have incorporated features
that enable a holistic learning experience, consisting of Study groups, Chat boxes, and
much more. It is important that interns are first signed up on hamoye.com and then
familiarize themselves with the platform before the start of the internship. The following
has been prepared to help you enjoy your first time on hamoye.com:

● See how to create an account on hamoye.com


● See how to take a course on hamoye.com
● How to save your quiz progress
● How to rate or review on the platform
● How to retrieve your unique User ID

Visit our blog for more guides on how to use the platform

Communication Channels
Communication is best done when it goes two ways. Interns will not spend 100% of the
time studying materials, there will always be avenues for constructive conversations
between them and the organization.

The following channels will be used for the purpose of communication during the
program: Hamoye platform discussion forum, email, Zoom, and Slack.

Zoom/Youtube livestream will be the official channel for all audio and video calls
including webinars.

Slack will be the interactive space for the course. We understand the importance of
interaction, and this space is created with that in mind. Interns are encouraged to
interact with each other and also with the Hamoye Team regarding any questions they
might have. Different channels have been created to serve several purposes, and more
may be created as the need arises

20
Hamoye.com: For each of the courses, there is a discussion button that allows interns
to share ideas around concepts being taught, ask questions from their peers, and
provide answers to questions.

Social Media
Stay connected with us on social media in order to stay updated on recent happenings.
We also take to our social media channels to celebrate outstanding interns.

Visit www.hamoye.com/social to follow us on our various social media platforms


@hamoyehq.

Code of Conduct

The Code of Conduct aims to ensure that all Interns understand the standard of conduct
required. Interns are expected to uphold the Code of Conduct at all times while learning,
carrying out their duties and interactions.

i. Expectations From Our Interns

Interns are expected at all times to:

A. Be Responsible: Carry out all their learning activities responsibly, safely, and in a
competent manner. The primary responsibility is to learn; we expect interns to conduct
themselves in a manner worthy of emulation, without being judgemental or biased.
Avoid providing information to other Interns that are misrepresented or falsified.

B. Be Respectful: Respect other Interns and honor their integrity without prejudice to
economic status, abilities, race and customs, cultural and religious needs.

21
C. Be Present: Attend all learning activities i.e Projects, Webinars, Weekly Coding
Clubs, and Slack Engagements diligently; if indisposed, interns should inform
community members, and the Hamoye Team Representative as early as possible. Avoid
abandoning learning abruptly, especially during the periods of assigned group
responsibilities, as this will have a negative effect on the learning outcome i.e grades, of
the other group members. Should there be an absolute need for abrupt withdrawal from
learning, interns should inform the representatives of the Hamoye Team before leaving.

D. Be Discerning: Behave in the most appropriate or desirable manner while learning


such that liabilities will neither be created, nor Hamoye disreputed. Avoid engaging in
activities or actions that will bring any harm (physical or mental) to a person or property.
Exercise discretion in extending help, especially in monetary terms beyond the scope of
their learning, or seeking/accepting rewards, benefits or gifts without authorisation.

ii. Protection of Confidential Information

All information confidential to Hamoye, especially personal data of interns, must not be
disclosed or used for any other purposes unless absolutely necessary to help our
Interns learning experience with Hamoye. It is advisable to check with the Hamoye
Team Representative or the Data Protection Officer at [email protected] if there are any
queries or doubts. Photographs and videos are considered personal data of the person
photographed or filmed. If an Intern would like to take photographs or videos of other
Interns, consent must be sought from them in a private and personal arrangement. As
an Intern, you must destroy all copies of the name list and any documents containing
personal data of other Interns, and all other confidential information of Hamoye
immediately after their learning ends.

iii. Conflict of Interest:

Interns are not expected to:

A. Assume learning with other organizations. Having other personal commitments,


or situations that will give rise to conflict of interest in the period of your learning
with Hamoye.
B. Use or allow others to use Hamoye’s name, property, resources, information, or
funds for any purpose other than that required for their learning activities.
C. Act as a spokesperson for Hamoye unless prior permission or authority has been
granted.

22
iv. Do’s
As an intern, you are required to;

A. Be aware that you will be identified with Hamoye, hence, it is required of you to
understand the mission and core values of the organization
B. Be punctual for your learning activities and engagements
C. Lead by example and be good role models for other Interns
D. Be mindful of interns who are overly-friendly or attention-seeking. Maintain
appropriate behavior and prevent having any otherwise contacts with other
Interns.
E. Maintain confidentiality and respect the privacy and dignity of other interns.
F. Respect the feelings of other interns.
G. Be generous with encouragement and praise by regarding and celebrating the
talents and achievements of other interns.
H. Instill independence and self-confidence by providing guidance to other interns to
complete their tasks by themselves, rather than doing things for them.
I. Consult the Hamoye staff when in doubt
J. Give compliments or feedback to Hamoye staff on positive or adverse encounters
while learning
K. Report immediately to the Hamoye team representative, should you be aware of
any information from other interns that is of potential threat or harm to Interns
and/or others.

v. Don'ts
As an intern in the Hamoye programs, the following are considered as prohibited;

A. Soliciting sales or promotion of services of any kind on our platforms.


B. Stepping in to take over tasks, assignments, and projects given to other interns
for monetary purposes or otherwise.
C. Making empty promises or giving any form of monetary or material reward to
other interns without first consulting the associate in charge of the Hamoye team.
D. Verbally, emotionally, or physically abusing the interns or Hamoye staff,
especially using profanities or derogatory remarks.

23
E. Uploading interns’ photographs to an online platform, including websites, blogs,
or any form of social media without prior permissions from people in the
photograph.
F. Sharing information about the interns and Hamoye with others, unless for
promotional purposes.
G. Cheating is strictly prohibited either on the quizzes and/or in the coding labs.
Quizzes and Labs are done independently and not with a friend or as a group.
Interns found doing this will be promptly removed from the program. Retroactive
dismissal will also be carried out if it’s found out later on that colluding had
occurred.

Frequently asked Questions


Let’s take a look at some of the questions that frequently pop up during the course of
the internship.

What does HDSC mean?


HDSC is an acronym for Hamoye Data Science Internship
Is it a paid program?

HDSC is not a paid internship. The program is entirely free.

How’s Hamoye internship program different from other programs?

Hamoye internships offer unique opportunities for interns that simulate exact activities in
a corporate environment. You will work on real life projects with your peers across the
world. You will also be privileged to attend webinars conducted by experts in the data
science and machine learning fields and unblocking sessions conducted by the Hamoye
faculty. Certificates of completion and lots more will be awarded to interns who
excellently conclude their internship.The internship handbook explicitly explains the
internship structure.

How do I submit my tag-along project?


Click this link to learn how to submit your tag along project at the end of each stage

What is the ask me session all about?


Take a look at this to learn more about it

I am yet to take my quiz/ I can’t find the quiz in my course, please how do I go
about that?

24
Here is a simple guide to achieve that

How do I ensure the optimum experience during my quiz taking? What if I want to
save my quiz progress?
It is important you save your quiz progress as often as possible while taking your quiz in
order to ensure Take a look at this too

I have completed a course, how do you rate or review on the platform?


Here is a guide on how to rate a course on the platform

When will I be added to my project group?


Only interns that fill the track selection form within the stipulated deadline will be added
to a group. Grouping will be communicated in due course.

We are currently preparing for our premier project, please is there any template
for our slides?
Yes we have, check it out here. You might also want to consider these guidelines

I need a Kaggle account to conclude this course, how do I get my phone number
verified fast?
Here’s how.

How do I access a course certificate after I have successfully completed it on the


Hamoye platform?
Click here to find out.

Can I defer from one cohort to another?

No, one cannot. In any case you can re-register for the next cohort and start the
application process again.

Will a certificate be issued to me at the end of the program?

Please scroll up to section E (Certification and Hamoye Fellowship) and Section F


(Non-Certification), above for details.

Through what media will courses be taught?

The course materials will be delivered in various forms on the Hamoye web app. The
course will be largely delivered in scripts, a bit in videos, and one-one interactions with
the Hamoye team.

25
My quiz automatically logged me out and graded me low despite having enough
time. How can I retrieve my lost time?
The quiz on the platform can only be taken once, so be sure to have a strong internet
connection before attempting the quiz. Most importantly, be sure to save your quiz
progress as often as possible while taking your quiz. This medium article is helpful.

How does community engagement work on Slack?

After your registration process is completed, you will be added to a Slack workspace via
your email address (kindly ensure a valid email address is provided). Our programs
coordinator will send in timely details and information pertaining to the internship via the
slack channel and all interactions and engagements between all our Interns are carried
on within this confines. Once the internship begins and you have been assigned to your
group, you can then begin to enjoy learning with the community.

What materials would I need to be a part of the learning process?

The course will be delivered in an e-learning style. Hence, a good internet connection
and a reliable device (tablet, desktop, laptop) are essential for learning.

Will there be chances to work on any real projects?

What’s a data scientist or a machine learning expert without real-life project experience?
From the start of the internship, interns will be able to choose from a plethora of projects
on Hamoye website and about a 4-week timeline will be allotted to work on the project,
with peers or independently, which will be judged by a group of experts across various
sectors

How do I know the capstone project I will be working on?

The Hamoye platform is replete with diverse projects across various sectors. At the
beginning of the internship, together with your group, you will be assigned the projects
which you will work on and present to industry experts a few weeks after the start of the
program and at the end of the internship.

Can I switch from a project group I have been assigned?

No, you will score a zero in that project if you do so.

How many tracks exist in the internship?

The internship is divided into four tracks:

● Data Science
● MLOps/Data Engineering

26
● DevOps & Cloud Engineering
● Data Storytelling

Can I choose more than one track?

No, you are expected to select only one track and if you chose more than one track, you
will be disqualified from the program.

How many stages are in the program?

Participants will be required to progress through a 6-sprint competition, each lasting 2 to


3 weeks.

Will I be paid monthly during the duration of the Internship?


Participants will not be paid monthly for the duration of the program. Although, the top
participants from each sprint and track will be declared sprint winners. Sprint winners
across all 5 sprints will receive awards to help defray costs of mobile data.

How many hours am I expected to devote for the internship?


The number of hours to be devoted for the internship is subjective and based on your
current level of expertise. As a beginner, it is expected that you devote at least 35–40
hours per week due to the rigorous nature of the program.

What do I tend to benefit from the Data Science internship?

● Every participant has the opportunity to learn using a pre-set training curriculum,
connect, collaborate and build a real-world use case.
● Every participant who successfully completes a sprint and the associated
challenge will earn a certificate from Hamoye for each sprint completed.
● Participants will have an opportunity to work on a unique AI open-source project.
● The top participants from each sprint will be declared sprint winners and awarded
prizes
● Finalists will be invited to join Hamoye AI Labs or our partner companies across
the world.

Data Science
Which application do I recommend for my coding task in order to arrive at the
accurate answer? Google Colab or Jupyter notebook?
Either of them should work. Best practice demands that your version of Jupyter
Notebook is at least up to date, i.e it should be or closer to the recent version.

27
Are there tag along projects at every stage?
Every stage in this track has a compulsory tag along project.

Data Engineering
Is there a tag along project for stage C?
No, for DE track, there are only tag along projects for stage A and B

How do I create a GCP account?


Click this link to learn more about creating your account

How do I create my azure account?


Learn how to create your azure account here

I discovered that I was only able to create a limited GCP account, what do I do
since I still have to use it till stage E
It is advisable you begin with one of the accounts first (GCP OR Azure), use up the free
trial and then create the other account (GCP OR Azure as well). One of these products
lasts 30 days, and that should span two stages, the other spans 12 months and would
be sufficient for all the stages in this program.

Data Storytelling
Is there a tag along project for Stage B
No, there is no tag along project for Stage B in the DST track.

What website do I scrap for my Stage C course?


Since no website was stated, you are expected to choose any proffered website for the
task.

Find more frequently asked questions here

28

You might also like