DA106 Week1 Material
DA106 Week1 Material
o Introduction to data science Data science is the field of study that combines
o Domain expertise
o Relationship between data science and
o Programming skills
artificial intelligence o Knowledge of math and statistics
o Comprehend the process of data to extract meaningful insights from data
Data Science and Machine Learning Data Science and Machine Learning
“machine learning” Machine learning systems
generate insights
● Computers automatically detect patterns
and make predictions or decisions from that analysts and business users
data translate into tangible
business value.
● Learn from data without relying on a
predetermined mathematical model.
● Is a subset of Artificial Intelligence (AI). Solving useful problems,
otherwise useless
Data Science and Machine Learning Applications of Data Science
Actual data science workflow can BUSINESS
be complex! ● Help businesses to increase business value of
its available data for competitive advantage
against their competitors.
● Understand customers better
● Take better decisions
Data gathering
○ Collected using sensors or manual labour or
mining data from web
○ Digital data collected by sensors
○ Manual annotation and data entry to
computer from physical documents
○ Extract data from internet using scripts
Data Generation Data Categories
Data pre-processing is used to apply transformations to convert ● Quantitative data: This data can be described
using numbers, and basic mathematical
unstructured data into a structured counterpart.
procedures, including addition, are possible on
the set.
Key characteristics:
“This Wednesday morn, are you early to rise? Then look East. The ● Qualitative data: This data cannot be
Crescent Moon joins Venus & Saturn. Afloat in the dawn skies.”
described using numbers and basic
mathematics. This data is generally thought of
as being described using natural categories
and language.
Quantitative and Qualitative Quantitative and Qualitative