0% found this document useful (0 votes)
52 views

Data Science With Python

Data science involves extracting hidden patterns from datasets through data mining and using that information to make decisions. It works with large, unlimited datasets without constraints. Python is a dynamically typed language, meaning variable types are determined at runtime rather than compile time. Classification predicts discrete outcomes while regression predicts continuous variable values. Decision trees organize data into a tree structure to classify it according to conditions.

Uploaded by

Hafiza Maria
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
52 views

Data Science With Python

Data science involves extracting hidden patterns from datasets through data mining and using that information to make decisions. It works with large, unlimited datasets without constraints. Python is a dynamically typed language, meaning variable types are determined at runtime rather than compile time. Classification predicts discrete outcomes while regression predicts continuous variable values. Decision trees organize data into a tree structure to classify it according to conditions.

Uploaded by

Hafiza Maria
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 23

Data Science With

Python
Data vs Information
• Data is a collection of facts, such as numbers, words,
measurements, observations or just descriptions of things.
• If data is not put into context, it doesn't do anything to a
human or computer.
• Information is the result achieved when you have processed,
interpreted, and organized facts.
A real-world example
• Data  Data of Facebook users.

• Information  Data of people on Facebook who are over 24


years and have BS degree. (Information is just
categorization/context of data)
Database vs Dataset
Database vs Dataset
• Database  A database is generally used for storing related,
structured data, with well defined data formats (input and
output are deterministic).

• Dataset  A collection of facts/figures without having any


relationship between them (may have relations, but such
relations are hidden).
Real-World Example
• Database  NADRA database, having records of family
relationships/profession/wealth.

• Dataset  Data of people died in titanic (name, age, seat


numbers, marital status and gender etc.).
Data Science
Data Science vs Data Mining
• Finding hidden patterns/relationships from a dataset is simply
known as data mining.
• Using that extracted information form a dataset in a decision
making process is called data science.
Real-World Example
• Data Mining  Finding Which snacks are bought more often
in super store.

• Data Science  Putting those snacks at the most accessible


shelf in the superstore.
Big Data?
Big Data?
• Data without any perceivable limitations.
Real-World Example
• Data generated by Social Media, you cannot predict easily
how much data will be generated in next 2 years.
• Its big data.
Real-World Example
• Would NADRA’s data be called big data?
Real-World Example
• Would NADRA’s data be called big data?
• No.
• Its is much easy to predict how much data will be generated in
next two years.
Python
Dynamic vs Static Languages
Statically typed languages
• A language is statically typed if the type of a variable is known at
compile time. For some languages this means that you as the
programmer must specify what type each variable is.
• The main advantage here is that all kinds of checking can be done by
the compiler, and therefore a lot of trivial bugs are caught at a very
early stage.
• Compiling.
• Examples: C, C++, Java, Rust, Go, Scala
Dynamic vs Static Languages
Dynamically typed languages
• A language is dynamically typed if the type is associated with run-time
values, and not named variables/fields/etc. This means that you as a
programmer can write a little quicker because you do not have to
specify types every time.
• Interpretation.
• Examples: Perl, Ruby, Python, PHP, JavaScript
Classification vs Regression
Classification vs Regression
• Classification  Division/prediction into predefine classes or
groups. (discrete values)

• Regression  prediction of future trends (continuous values).


Real-World Example
• Classification  Predicting the bad and good neighborhood
for property purchase.

• Regression  Predicting the values of property will be in


future.
Data Structures
• A data structure is a specialized format for organizing,
processing, retrieving and storing data.
Data Structures
• A decision tree is one of the supervised machine leaning
algorithm.
• This algorithm can be used for regression and classification
problems.
• Yet, is mostly used for classification problems.
• A decision tree follows a set of if-else conditions to visualize
the data and classify it according to the conditions.
Data Structures

You might also like