Role of the Database in Data Science

This document discusses the role of databases in data science. It defines databases as organized collections of data stored electronically, and data science as using scientific methods to extract knowledge from structured and unstructured data. The document states that much of the world's data resides in databases, which data scientists use to store, retrieve, and test data. It also notes that data scientists analyze data from databases to make organizational decisions, and use SQL to query, manipulate, and prepare big data.
Copyright © All Rights Reserved

ROLE OF THE DATABASE IN DATA SCIENCE
WHAT IS A DATABASE?
A database is an organized collection of data, generally stored and
accessed electronically from a computer system. More complex databases
are often developed using formal design and modeling techniques.
WHAT IS DATA SCIENCE?
• Data science is a multi-disciplinary field that uses scientific methods,
processes, algorithms and systems to extract knowledge and insights
from structured and unstructured data.
• Data science is closely related to data mining and big data: "use the
most powerful hardware, the most powerful programming systems, and
the most efficient algorithms to solve problems."
• Data science is a tool that can help reduce costs and support better
decisions. It is about looking for solutions to abstract business
problems.
UNDERSTAND

• Data science is the study of drawing insights and making decisions
from data (structured and unstructured).
• A database is an organized collection of data.
UNSTRUCTURED DATA
• Everything that cannot be readily classified and fit into a neat box,
e.g. photos and graphic images, videos, and streaming instrument data.
• Raw text:
  • Documents and digital libraries
  • Substring indexing and regular expressions
  • Natural language parsing
  • Extracting requirements from job postings
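
The raw-text items above (regular expressions, extracting requirements from job postings) can be illustrated with a small sketch. It is only one possible approach: the posting text, the skill list, and the helper names `extract_skills` and `extract_years` are all invented for illustration and do not come from the slides.

```python
import re

# Hypothetical job posting, invented for this example.
posting = """
Data Scientist (Remote)
Requirements:
- 3+ years of experience with Python and SQL
- Familiarity with Hive or Spark
- Strong communication skills
"""

# Hypothetical vocabulary of skills to search for.
SKILLS = ["Python", "SQL", "Hive", "Spark", "R"]

def extract_skills(text, skills=SKILLS):
    """Return the entries of `skills` that occur in `text` as whole words."""
    found = []
    for skill in skills:
        # \b word boundaries keep "R" from matching inside "Requirements".
        if re.search(r"\b" + re.escape(skill) + r"\b", text):
            found.append(skill)
    return found

def extract_years(text):
    """Return every 'N years' / 'N+ years' figure in `text` as an integer."""
    return [int(n) for n in re.findall(r"(\d+)\+?\s*years", text)]

print(extract_skills(posting))
print(extract_years(posting))
```

Once the requirements are pulled out like this, the unstructured text has been turned into structured fields that could be stored in a database table.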
STRUCTURED DATA
• Structured data refers to any data that resides in a fixed field within a
record or file. This includes data contained in relational databases
and spreadsheets.
• Characteristics of structured data:
  • Structured data first depends on creating a data model: a model of
  the types of business data that will be recorded and how they will be
  stored, processed and accessed.
  • It is easily entered, stored, queried and analyzed. For a long time,
  structured data was the only way to manage data effectively.
  • It is managed using Structured Query Language (SQL).
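
As a rough illustration of the points above, the sketch below defines a tiny data model and queries it with SQL. It assumes Python's built-in `sqlite3` module as a stand-in for any relational database; the `customers` table and its rows are hypothetical.

```python
import sqlite3

# In-memory SQLite database, used here as a stand-in for any relational database.
conn = sqlite3.connect(":memory:")

# The data model: every record has the same fixed, typed fields.
conn.execute("""
    CREATE TABLE customers (
        id          INTEGER PRIMARY KEY,
        name        TEXT    NOT NULL,
        country     TEXT    NOT NULL,
        signup_year INTEGER NOT NULL
    )
""")

# Hypothetical rows, invented for this example.
conn.executemany(
    "INSERT INTO customers (name, country, signup_year) VALUES (?, ?, ?)",
    [("Asha", "IN", 2021), ("Ben", "US", 2022), ("Chen", "CN", 2021)],
)

# Because the fields are fixed, the data is easy to query with SQL.
rows = conn.execute(
    "SELECT name FROM customers WHERE signup_year = ? ORDER BY name",
    (2021,),
).fetchall()
names = [row[0] for row in rows]
print(names)
```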
DATABASE ROLE IN DATA SCIENCE

• Much of the world's data resides in databases. SQL (Structured Query
Language) is a powerful language for communicating with databases and
extracting data from them.
• Data scientists work with data, and much of that data, structured and
unstructured, is stored in databases. Anyone who wants to work with data
therefore needs a database.
• Data scientists use databases to store and retrieve data and to set up
test environments.
• For querying and manipulating data, data scientists use SQL or a
similar language such as HiveQL, the query language of Apache Hive.
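
The store, retrieve, and manipulate cycle described above might look like the following sketch. It assumes an in-memory SQLite database (via Python's `sqlite3` module) as a throwaway test environment; the `readings` table and the `make_test_db` helper are hypothetical names chosen for this example.

```python
import sqlite3

def make_test_db():
    """Create a throwaway in-memory database with a hypothetical `readings` table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE readings (sensor TEXT, value REAL)")
    return conn

conn = make_test_db()

# Store: insert a few invented rows.
conn.executemany(
    "INSERT INTO readings VALUES (?, ?)",
    [("a", 1.5), ("a", 2.5), ("b", 10.0)],
)

# Manipulate: rescale one sensor, then drop readings below a threshold.
conn.execute("UPDATE readings SET value = value * 2 WHERE sensor = ?", ("b",))
conn.execute("DELETE FROM readings WHERE value < ?", (2.0,))

# Retrieve: read back what is left, in a stable order.
remaining = conn.execute(
    "SELECT sensor, value FROM readings ORDER BY sensor, value"
).fetchall()
print(remaining)
```

Because the database lives only in memory, it disappears when the connection closes, which makes it convenient for experiments that should not touch production data.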
DATABASE ROLE IN DATA SCIENCE

• Data scientists run analytics tasks over data stored in Oracle
Database or another database server.
• Data scientists analyze data from databases to make decisions
for organizations.
• When working with big data processing tools, data scientists use SQL
for data preparation and wrangling.
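
A minimal sketch of SQL-based data preparation follows, again assuming SQLite through Python's `sqlite3` module rather than Oracle or a big data engine. The `raw_sales` table and its messy rows are invented for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE raw_sales (region TEXT, amount REAL)")

# Hypothetical messy input: stray whitespace, mixed case, missing values.
conn.executemany(
    "INSERT INTO raw_sales VALUES (?, ?)",
    [
        (" north ", 100.0),
        ("North", 50.0),
        ("south", None),
        ("South", 70.0),
        (None, 30.0),
    ],
)

# Wrangling in SQL: normalize region names, drop incomplete rows,
# then aggregate per region so the result can inform a decision.
summary = conn.execute("""
    SELECT UPPER(TRIM(region)) AS region,
           COUNT(*)            AS n_sales,
           SUM(amount)         AS total
    FROM raw_sales
    WHERE region IS NOT NULL AND amount IS NOT NULL
    GROUP BY UPPER(TRIM(region))
    ORDER BY total DESC
""").fetchall()
print(summary)
```

The cleaned summary, rather than the raw rows, is what would typically feed an analysis or a report.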
Thank You