0% found this document useful (0 votes)
5 views

ITvitae Data Science Program - Introduction

The document outlines the kickoff agenda for the Itvitae Data Science program led by Angel Sevilla Camins in April 2024, detailing morning and afternoon sessions focused on data science introduction, curriculum, and toolkit setup. It also describes Eraneos, the consulting group behind the program, emphasizing their expertise in data and AI across various industries. The curriculum includes two modules covering Data Science and additional skills like Deep Learning and Natural Language Processing, with a structured schedule of lessons and assessments.

Uploaded by

Bart Mania
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views

ITvitae Data Science Program - Introduction

The document outlines the kickoff agenda for the Itvitae Data Science program led by Angel Sevilla Camins in April 2024, detailing morning and afternoon sessions focused on data science introduction, curriculum, and toolkit setup. It also describes Eraneos, the consulting group behind the program, emphasizing their expertise in data and AI across various industries. The curriculum includes two modules covering Data Science and additional skills like Deep Learning and Natural Language Processing, with a structured schedule of lessons and assessments.

Uploaded by

Bart Mania
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 35

Data Science Program

Itvitae Kickoff
Angel Sevilla Camins
April 2024
Kickoff Itvitae

AGENDA

Morning

• Eraneos
• Introduction Data Science
• Curriculum
• Tooling
• Materials

After Lunch

• Lab: Toolkit Set-up

2023 2
Eraneos

2023 3
Management & Technology Consulting Group

We help our customers unlock the full potential of digital

Netherlands 2021
Germany top consultancies join forces
Luxembourg Austria
USA
Spain Switzerland
China
14
offices in nine countries
Singapore

1000+
dedicated professionals

Trusted Experienced Awarded

by Fortune 500 companies,


governmental organizations
and hidden champions
in shaping organizations with
our consultants, data
experts and cybersecurity
as a leading consulting
company and employer

Business and technology
have been in our DNA for
more than 30 years.

specialists

2023 4
Who are we?
Eraneos Analytics BV / Data & AI is the only full-stack data activation expert offering innovative solutions that help clients
transform, innovate and grow.
 Founded in 2000; Based in the Netherlands; offices in Amsterdam and Groningen, employing 50 people
 Innovative minds, team of enthusiastic men and women: Software Engineers, Data Engineers, Machine Learning
Engineers, Data Scientists, A.I. Experts, Data Architects, Data Consultants and Project Managers
 Building Solid Partnership with our clients and delivering meaningful results for their business
 Merged with Quint in 2020, since 2021 part of eraneos group

2023 5
Management & Technology Consulting Group

Data & AI - Areas and industries of expertise

Our areas of expertise


Digital Business Sourcing & Cyber Security Organizational Technology & Data & AI
& Innovation IT Advisory & Privacy Excellence & Platforms
Transformation
Leveraging Data & AI to help clients
transform, grow, and innovate
Implementation Consulting Training
We enable business value with specific and
innovative end-to-end Data and AI solutions
in a responsible and seamless manner. “
Public Services Real Estate Manufacturing Life Science Automotive Healthcare

Retail &
Financial Transportation Professional Energy & Technology,
Consumer
Services & Logistics Services Utilities Media & Telecom
Goods
We do this by:
Our industries of expertise
• Helping customers to organize their data.

• Delivering data and AI capabilities.

• Solving actual real-world business problems.

• Inspiring and continuously disrupting.

2023 6
Introduction Data
Science

2023 7
Kickoff Itvitae

Presentation

2023 8
Kickoff Itvitae

Introduction round

•Background, education

•Hobbies, interest

•Which are your expectations


about this course?

2023 9
Kickoff Itvitae

Introduction Data Science

“Data science is the


extraction of knowledge
from data.”

2023 10
Kickoff Itvitae

The Evolution of data science

2023 11
Kickoff Itvitae

What does a data scientist?

2023 12
The Data Scientist – 60 Second Data Science: The https://www.youtube.com/watch?v=i2jwZcWicSY
Kickoff Itvitae

Data science – an overlap of disciplines

Definition:
Data science combines the scientific
method, math and statistics,
specialized programming, advanced
analytics, AI, and even storytelling to
uncover and explain the business
insights buried in data.

Source: https://www.ibm.com/cloud/learn/data-science-introduction & https://ge.iitm.ac.in/I2MP/data-science/


2023 13
Kickoff Itvitae

The data scientist’s skills

Defining the
Programming Data Gathering
Problem

Data Preprocessing
Mathematics
Data Scientist’s
Skills

Visualization
Statistics

Big Data Communicating


Machine Learning
Engineering Results
2023 14
Title

2023 15
Kickoff Itvitae

The data science process

2023 16
Kickoff Itvitae

Not every data scientist does the same

2023 17
Kickoff Itvitae

AIRBNB data science flavors

2023 18
Explainable AI

What does ‘science’ do in data science?

The scientific method:

 Ask a question.

 State a hypothesis about the answer to the question.

 Make a testable prediction that would provide evidence in


favor of the hypothesis if correct.

 Or better: try to falsify it.

 Test the prediction via an experiment involving data.

 Draw the appropriate conclusions through analyses of


experimental results.

2023 19
Curriculum

2023 20
Kickoff Itvitae

Curriculum - what's on the program?

2 Modules:

1. Data Science

2. Additional skills:
1. Deep Learning
2. Natural Language Processing (NLP)
3. Big Data Engineering

2023 21
Kickoff Itvitae

How are we going to proceed?

• Getting Started: Data Scientist's Toolkit

• Build up knowledge and skills in each of the 2 modules

• Each lesson is a mix of theory, lab exercises and example cases

• In addition, info about best practices, tips & tricks, templates for workflow and code

• 2 days a week with guidance

• 2 tests, one after Data Science and other after the rest, equally important.

• Interactive, learning from each other

• Asking questions is important


2023 22
Kickoff Itvitae

Module 1 – Data science (1)

Date Weekday Description


16/Apr/24 Tuesday Introduction and tools setup
18/Apr/24 Thursday Python review
23/Apr/24 Tuesday Data Preprocessing
25/Apr/24 Thursday Visualization
30/Apr/24 Tuesday Python Machine Learning Basics 1
02/May/24 Thursday Python Machine Learning Basics 2
07/May/24 Tuesday Local models
09/May/24 Thursday Vakantie
14/May/24 Tuesday Web Scraping
16/May/24 Thursday Bayesian Learning
21/May/24 Tuesday Vakantie

2023 23
Kickoff Itvitae

Module 1 – Data science (2)

Date Weekday Description


23/May/24 Thursday Vakantie
28/May/24 Tuesday Support Vector Machines
30/May/24 Thursday Decision-Regression-Trees
04/Jun/24 Tuesday Ensemble-Models
06/Jun/24 Thursday Review
11/Jun/24 Tuesday Assessment Data Science

2023 24
Kickoff Itvitae

Module 2 – Additional skills (1)

Date Weekday Description


13/Jun/24 Thursday Deep Learning with Tensorflow and Keras 1
18/Jun/24 Tuesday Deep Learning with Tensorflow and Keras 2
20/Jun/24 Thursday Deep Learning with Tensorflow and Keras 3
25/Jun/24 Tuesday Natural Language Processing 1
27/Jun/24 Thursday Natural Language Processing 2
02/Jul/24 Tuesday Model explanation with SHAP
04/Jul/24 Thursday Version Control: git and gitflow
09/Jul/24 Tuesday Databricks 1 Dataframes
11/Jul/24 Thursday Databricks 2 SparkML
16/Jul/24 Tuesday Vakantie
18/Jul/24 Thursday Vakantie
23/Jul/24 Tuesday Vakantie
25/Jul/24 Thursday Vakantie
2023 25
Kickoff Itvitae

Module 2 – Additional skills (2)


Date Weekday Description
30/Jul/24 Tuesday Vakantie
01/Aug/24 Thursday Vakantie
06/Aug/24 Tuesday Databricks 3 Streaming
08/Aug/24 Thursday Databricks 4 Optimization/testing
13/Aug/24 Tuesday Docker
15/Aug/24 Thursday Kubernetes Basics
20/Aug/24 Tuesday Python TDD
22/Aug/24 Thursday Azure CI/CD Azure
27/Aug/24 Tuesday Azure Data Factory
29/Aug/24 Thursday Terraform
03/Sep/24 Tuesday Azure ML Studio 1
05/Sep/24 Thursday Azure ML Studio 2
10/Sep/24 Tuesday Review
12/Sep/24 Thursday Assessment Big Data
2023 26
Diplomas + ceremony
Toolkit

2023 27
Kickoff Itvitae

Data Science Toolkit

2023 28
Lab: Toolkit setup

2023 29
Kickoff Itvitae

Installing Core Toolkit:

• WSL2 https://learn.microsoft.com/en-us/windows/wsl/install

• Python (Anaconda): https://www.continuum.io/downloads

• Docker https://docs.docker.com/get-docker/

• Git: https://git-scm.com/downloads

• Access to the course materials

2023 30
Materials

2023 31
Kickoff Itvitae

Demo

• Working with Jupyter Notebooks

• Data Visualization in Python

• Machine Learning Basics using Scikit-learn

2023 32
Kickoff Itvitae

ADDITIONAL MATERIAL

• Pluralsight

• DataFramed podcast

• Data Scientists analysis at Microsoft:


http://web.cs.ucla.edu/~miryung/Publications/tse2017-datascientists.pdf

• 50 years of data science:


https://courses.csail.mit.edu/18.337/2015/docs/50YearsDataScience.pdf

• What is a Data Scientist: https://www.youtube.com/watch?v=iQBat7e0MQs

• Data Scientists analysis at Microsoft:


http://web.cs.ucla.edu/~miryung/Publications/tse2017-datascientists.pdf

• Ciara Byrne = Development Workflows for Data Scientists (Free eBook):


https://data-science.github.com/report.pdf

2023 33
Kickoff Itvitae

Questions?

• Contact:

 Chapter Lead Data Science: Angel Sevilla [email protected]

2023 34
THANK YOU FOR YOUR
ATTENTION

2023 35

You might also like