MySkill X Lion Parcel - Short Class Data Science Fundamental


Portfolio - Short Class
Data Science
Owner: Muhammad Anis

Build your skill and portfolio via myskill.id/bootcamp


Course Summary
Learning Points and Summaries

The Anatomy of a Data Team: Data Engineer, Data Analyst, and Data Scientist

- Data Engineers are also called Data Architects, ETL Developers, or Big Data Developers.
  Data Engineers build the data warehouse.
- Data Analysts are also called BI Analysts/Engineers, Business Analysts, or Statisticians.
  Data Analysts analyze the current situation (what is going on).
- Data Scientists are also called Machine Learning Engineers or Machine Learning Ops.
  Data Scientists predict what is going to happen in the future.

How To Become a Data Scientist

1. Get good at statistics and maths
2. Learn to code
3. Understand databases
4. Master data munging, visualization, and reporting
5. Level up with big data
6. Build your data portfolio
7. Do internships, join bootcamps, and get a job
8. Follow and engage with the community

#RintisKarirImpi

A Day in the Life of a Data Scientist

Data Scientists help solve business problems and create valuable insights for the future.
1. Business Problem: Understand the problem using data analysis.
2. Learning: Study and prepare the data needed for analysis.
3. Reflection and Communication: Analyze data, create models, and communicate results.
4. Business Value: Ensure the results of the analysis benefit the business.

Machine Learning

- Machine Learning is a subfield of Artificial Intelligence, which is broadly defined as the capability of a machine to imitate intelligent human behavior.
- Machine Learning techniques comprise:
1. Supervised Learning: develops a predictive model based on both input and output data
2. Unsupervised Learning: groups and interprets data based only on input data
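The difference between the two techniques above can be sketched on a toy example. This is only an illustration under stated assumptions: the measurements, the labels, and the helper names (`fit_centroids`, `predict`, `two_means`) are hypothetical and are not part of the course material. The supervised part learns from inputs together with their known outputs, while the unsupervised part groups the same inputs without seeing any labels.

```python
# Toy 1-D measurements (hypothetical values, for illustration only).
inputs = [1.0, 1.2, 0.8, 5.0, 5.3, 4.9]
labels = ["low", "low", "low", "high", "high", "high"]  # known outputs


def fit_centroids(xs, ys):
    """Supervised: use inputs AND outputs to learn one mean per label."""
    sums, counts = {}, {}
    for x, y in zip(xs, ys):
        sums[y] = sums.get(y, 0.0) + x
        counts[y] = counts.get(y, 0) + 1
    return {y: sums[y] / counts[y] for y in sums}


def predict(centroids, x):
    """Return the label whose learned mean is closest to x."""
    return min(centroids, key=lambda y: abs(centroids[y] - x))


def two_means(xs, iterations=10):
    """Unsupervised: split inputs into two groups without any labels."""
    c0, c1 = min(xs), max(xs)  # start from the two extreme points
    groups = []
    for _ in range(iterations):
        # Assign each point to the nearer of the two current centers.
        groups = [0 if abs(x - c0) <= abs(x - c1) else 1 for x in xs]
        g0 = [x for x, g in zip(xs, groups) if g == 0]
        g1 = [x for x, g in zip(xs, groups) if g == 1]
        # Move each center to the mean of its group.
        c0 = sum(g0) / len(g0)
        c1 = sum(g1) / len(g1)
    return groups


centroids = fit_centroids(inputs, labels)
print(predict(centroids, 4.5))  # supervised prediction for a new input
print(two_means(inputs))        # unsupervised group index per input
```

The supervised model can name its prediction ("low" or "high") because it was trained on labeled outputs. The unsupervised grouping only returns anonymous group indices, which still have to be interpreted.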

Task: Diabetes Prediction System with KNN Algorithm
DESCRIPTION AND DATASET :
The dataset consists of predictor variables and an outcome that indicates whether or not a person has diabetes. This mini project aims to build a machine learning model using supervised learning and the k-Nearest Neighbor (KNN) algorithm in Python.
DATASET : https://bit.ly/DatasetSCDataMySkillxLionParcel
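To make the KNN idea concrete, here is a minimal from-scratch sketch of how a k-nearest-neighbor classifier decides on a label: it measures the distance from a new row to every training row and takes a majority vote among the k closest. The two feature columns (glucose, BMI), the sample values, and the function name `knn_predict` are assumptions made for illustration. They are not taken from the class dataset, whose columns are not listed here.

```python
import math
from collections import Counter


def knn_predict(train_rows, train_labels, query, k=3):
    """Classify `query` by majority vote among its k nearest training rows."""
    # Euclidean distance from the query to every training row.
    distances = [
        (math.dist(row, query), label)
        for row, label in zip(train_rows, train_labels)
    ]
    # Keep the k closest rows and count how often each label appears.
    nearest = sorted(distances, key=lambda pair: pair[0])[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical (glucose, BMI) rows; 1 = diabetic, 0 = not diabetic.
train_rows = [
    (85, 22.0), (90, 24.5), (95, 23.1),
    (160, 33.0), (170, 35.2), (155, 31.8),
]
train_labels = [0, 0, 0, 1, 1, 1]

print(knn_predict(train_rows, train_labels, (165, 34.0)))
print(knn_predict(train_rows, train_labels, (88, 23.0)))
```

An odd `k` is a common choice for a two-class problem because it avoids tied votes.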

TOOLS REQUIRED :
- Notebook code editor
- Programming language
- Python package

RELATED DOCUMENTATION:
https://colab.research.google.com/
https://pandas.pydata.org/docs/user_guide/10min.html
https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.train_test_split.html
https://scikit-learn.org/stable/modules/generated/sklearn.neighbors.KNeighborsClassifier.html
https://scikit-learn.org/stable/modules/generated/sklearn.metrics.classification_report.html
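The scikit-learn functions referenced above fit together roughly as follows. This is a sketch under assumptions, not the instructor's notebook. It uses a synthetic dataset from `make_classification` as a stand-in, because the real file's column names are not given here. With the actual dataset, the features and outcome would instead be loaded with pandas. The values `n_neighbors=5` and `test_size=0.2` are illustrative choices.

```python
from sklearn.datasets import make_classification
from sklearn.metrics import classification_report
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Synthetic stand-in for the class dataset (assumption: 8 numeric
# predictors and a binary outcome; the real columns are not listed here).
X, y = make_classification(
    n_samples=300, n_features=8, n_informative=5, random_state=42
)

# Hold out 20% of the rows for evaluation, keeping the class balance.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42, stratify=y
)

# Fit KNN on the training rows, then predict the held-out rows.
model = KNeighborsClassifier(n_neighbors=5)
model.fit(X_train, y_train)
y_pred = model.predict(X_test)

# Precision, recall, and F1 per class on the held-out rows.
report = classification_report(y_test, y_pred)
print(report)
```

Because KNN relies on distances, features on very different scales are often standardized before fitting. That step is omitted here to keep the sketch short.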

Practice Result
Take a screenshot of your practice results from processing the data as demonstrated by the instructor. Add the screenshot to this slide.
