DATA
SCIENCE
Presented by
Rajesh P
Sanjay Kumar M
CSE III Year
WHAT IS DATA SCIENCE ?
•Data science is an interdisciplinary field that uses scientific
methods, algorithms, processes, and systems to extract
knowledge and insights from structured and unstructured
data.
•In today's data-driven world, data science plays a crucial
role in helping organizations make informed decisions,
optimize processes, and gain a competitive edge.
•The data science workflow typically involves data collection,
data preprocessing, exploratory data analysis (EDA),
modeling, and interpretation of results.
2
DATA
COLLECTION
• Data collection is the process of gathering relevant data from various
sources, including databases, APIs, sensors, and more.
• Data preprocessing involves cleaning and transforming the data to make it
suitable for analysis.
• Common data preprocessing tasks include handling missing values, dealing
with outliers, and normalizing or scaling features.
3
EXPLORATORY DATA
ANALYSIS
•EDA usually stands for Exploratory Data
Analysis.
•EDA is a critical step in data science that
involves visualizing and summarizing data to
gain insights and detect patterns.
•EDA techniques may include creating
histograms, scatter plots, and correlation
matrices to understand the data's distribution
and relationships.
•EDA helps identify potential areas for further
analysis and model development.
5
MACHINE LEARNING
• Machine learning analyzes and examines large chunks of
data automatically.
• It automates the data analysis process and makes
predictions in real-time without any human involvement.
• You can further build and train the data model to make
real-time predictions.
2
DATA VISUALIZATION
•Data visualization is the graphical
representation of data to help users understand
its meaning and patterns.
•Effective data visualization can aid in
storytelling and communicating complex
findings to a non-technical audience.
•Tools like Matplotlib, Seaborn, and Tableau are
commonly used for creating data visualizations.
5
APPLICATION
•Data science is applied in various industries, such as healthcare, finance,
marketing, and e-commerce.
•Examples of data science applications include personalized recommendation
systems, fraud detection algorithms, predictive maintenance in
manufacturing, and sentiment analysis in social media.
•These applications have a tangible impact on improving decision-making and
efficiency.
3
THANK YOU
FOR
LISTENING