intro to DS

The document provides an introduction to data and data science, defining data as recorded observations and data science as a multidisciplinary field that extracts knowledge from large datasets. It outlines the roles of data scientists, the types of data, and the applications of data science in various industries. Key concepts such as big data, data mining, and the data science process are also discussed.

Data Science

Prof. Prachi Verma, Assistant Professor


Computer Science & Engineering
CHAPTER-1

Introduction to core concepts and technologies


What is Data?

• Data are observations that are measured and communicated in such a way as to
be intelligible to both the recorder and the reader.
• So, you as a person are not data, but recorded observations about you are data.
• In recent times, the use of data visualization has grown rapidly owing to the
increasing popularity and effectiveness of visual reports and dashboards.
• For example, your name when written down is data; a digital recording of you
speaking your name is data; and a digital photograph of your face or a video of
you dancing is data.
What is Data Science?

• An area that manages, manipulates, extracts, and interprets knowledge from
tremendous amounts of data.
• Data science (DS) is a multidisciplinary field of study whose goal is to address
the challenges of big data.
• Data science principles apply to all data, big and small.
• Theories and techniques from many fields and disciplines are used to investigate
and analyze large amounts of data to help decision makers in many industries
such as science, engineering, economics, politics, finance, and education.
Data Scientists
• Data scientists are the key to realizing the opportunities presented by big data.
They bring structure to it, find compelling patterns in it, and advise executives
on the implications for products, processes, and decisions.
• What do data scientists do? They work in areas such as:
• National Security
• Cyber Security
• Business Analytics
• Engineering Healthcare
• And many more…
Terminology
• Big Data
• Clustering
• Data Governance
• Data Mining
• Data Set
Data Science Process
• Data Science process is as given below:
Data Science Components
• The components of data science are:
• Statistics
• Visualisation
• Machine learning
• Deep learning
Types of Data
• Categorical Data (Nominal, Ordinal)
• Numerical Data (Discrete, Continuous, Interval, Ratio)
Types of Data

• Categorical Data

• Categorical data represents characteristics, such as a person’s gender or
language. Categorical data can also take on numerical values (for example, 1 for
female and 0 for male), but those numbers have no mathematical meaning.
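
A minimal Python sketch of the point about numeric codes, assuming a small hypothetical list of responses; the data and the two coding schemes are illustrative and not taken from the slides. It suggests why the codes are only labels: swapping them changes any arithmetic done on the codes, while the per-category counts stay the same.

```python
from collections import Counter

# Hypothetical survey responses (illustrative data, not from the slides).
responses = ["female", "male", "female", "female", "male"]

# Two equally valid coding schemes; the numbers are labels only.
coding_a = {"female": 1, "male": 0}
coding_b = {"female": 0, "male": 1}

codes_a = [coding_a[r] for r in responses]
codes_b = [coding_b[r] for r in responses]

# Arithmetic on the codes depends on which arbitrary scheme was chosen.
mean_a = sum(codes_a) / len(codes_a)
mean_b = sum(codes_b) / len(codes_b)

# Counting per category does not depend on the coding at all.
counts = Counter(responses)

print(codes_a, mean_a)
print(codes_b, mean_b)
print(counts)
```

The two means differ only because the labels were swapped, which is why summaries of categorical data are usually counts or proportions per category.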
Types of Data

• Nominal - Nominal data are recorded as categories with no inherent order. For
this reason, nominal data is often loosely called categorical data, although
categorical data also includes ordinal data. For example, rocks can be generally
categorized as igneous, sedimentary and metamorphic.

• Ordinal - Ordinal data are recorded as the rank order of scores (1st, 2nd, 3rd,
etc.). An example of ordinal data is the result of a horse race, which says only
which horses arrived first, second, or third but includes no information about
race times.
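
The horse race example can be sketched in Python as follows. The horse names, finishing times, and the helper `rank_order` are hypothetical and chosen only for illustration. The sketch shows that two races with very different time gaps produce the same ordinal result.

```python
def rank_order(times_by_horse):
    """Return horse names ordered from fastest to slowest finishing time."""
    return sorted(times_by_horse, key=times_by_horse.get)

# Hypothetical finishing times in seconds (illustrative, not from the slides).
close_race = {"Comet": 92.1, "Blaze": 92.2, "Aster": 92.3}
runaway_race = {"Comet": 80.0, "Blaze": 95.0, "Aster": 120.0}

close_order = rank_order(close_race)
runaway_order = rank_order(runaway_race)

# Both races give the same ordinal result even though the gaps differ greatly.
print(close_order)
print(runaway_order)
print(close_order == runaway_order)
```

Once only the rank order is kept, the size of the gaps between finishers cannot be recovered.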
Types of Data

• Interval - Interval data record not just the order of the data points, but also
the size of the intervals between them. A highly familiar example of interval
scale measurement is temperature on the Celsius scale. In this particular scale,
the unit of measurement is 1/100 of the temperature difference between the
freezing and boiling points of water. The zero point, however, is arbitrary.

• Ratio - Ratio data are recorded on an interval scale with a true zero point. Mass,
length, time, plane angle, energy and electric charge are examples of physical
measures that are ratio scales. Informally, the distinguishing feature of a ratio
scale is the possession of a meaningful, non-arbitrary zero value, so that ratios
between values make sense. For example, the Kelvin temperature scale has a
non-arbitrary zero point of absolute zero.
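
A short Python sketch contrasting the interval and ratio scales, assuming two hypothetical temperature readings; the helper `celsius_to_kelvin` is introduced here for illustration. Differences agree on both scales, but a ratio is only meaningful on the Kelvin scale, where zero is not arbitrary.

```python
def celsius_to_kelvin(celsius):
    """Convert a Celsius temperature to kelvin (offset of 273.15)."""
    return celsius + 273.15

# Hypothetical readings in degrees Celsius (illustrative values).
t1_c, t2_c = 10.0, 20.0
t1_k, t2_k = celsius_to_kelvin(t1_c), celsius_to_kelvin(t2_c)

# Differences are meaningful on both scales and agree with each other.
diff_c = t2_c - t1_c
diff_k = t2_k - t1_k

# Ratios are only meaningful on the ratio scale (kelvin).
ratio_c = t2_c / t1_c  # 2.0, yet 20 °C is not "twice as hot" as 10 °C
ratio_k = t2_k / t1_k  # roughly 1.035

print(diff_c, round(diff_k, 6))
print(ratio_c, round(ratio_k, 3))
```

The Celsius ratio of 2.0 is an artifact of where the zero point was placed; the Kelvin ratio reflects the actual proportion between the two temperatures.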
Application of Data Science

• Internet Search
• Recommendation Systems
• Image & Speech Recognition
• Gaming World
• Online Price Comparison
Thank You
