
Unit1 - Notes

Data science is the study of large volumes of data to uncover patterns and inform business decisions using advanced tools and techniques. The data science lifecycle consists of five stages: Capture, Maintain, Process, Analyze, and Communicate, each focusing on different aspects of data handling and analysis. Applications of data science span various fields including search engines, transportation, finance, e-commerce, healthcare, and more, enhancing efficiency and decision-making processes.

Uploaded by

APARNA R

What Is Data Science?

Data science is the domain of study that deals with vast volumes of data
using modern tools and techniques to find unseen patterns, derive
meaningful information, and make business decisions. Data science uses
complex machine learning algorithms to build predictive models. The data
used for analysis can come from many different sources and be presented
in various formats.

Data science is the in-depth study of large quantities of data, which
involves extracting meaningful information from raw, structured, and
unstructured data. Extracting that information requires processing the
data, which can be done with statistical techniques and algorithms,
scientific methods, and a range of technologies. Data science is also
described as the future of artificial intelligence.

The Data Science Lifecycle

Now that you know what data science is, let us turn to the data science
lifecycle. The lifecycle consists of five distinct stages, each with its
own tasks:

1. Capture: Data Acquisition, Data Entry, Signal Reception, Data
Extraction. This stage involves gathering raw structured and
unstructured data.

2. Maintain: Data Warehousing, Data Cleansing, Data Staging, Data
Processing, Data Architecture. This stage covers taking the raw data
and putting it in a form that can be used.

3. Process: Data Mining, Clustering/Classification, Data Modeling, Data
Summarization. Data scientists take the prepared data and examine its
patterns, ranges, and biases to determine how useful it will be in
predictive analysis.

4. Analyze: Exploratory/Confirmatory, Predictive Analysis, Regression, Text
Mining, Qualitative Analysis. Here is the real meat of the lifecycle. This
stage involves performing the various analyses on the data.

5. Communicate: Data Reporting, Data Visualization, Business
Intelligence, Decision Making. In this final step, analysts prepare the
analyses in easily readable forms such as charts, graphs, and reports.
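
The five stages can be sketched as a tiny pipeline. This is only an illustrative sketch, not a standard implementation: the sample records, the function names, and the choice of "average sales per city" as the analysis are all assumptions made for the example.

```python
from statistics import mean

# Hypothetical raw records standing in for data gathered during Capture.
RAW_ROWS = [
    {"city": "Delhi", "sales": "120"},
    {"city": "delhi ", "sales": "80"},   # inconsistent spelling and spacing
    {"city": "Mumbai", "sales": ""},     # missing value
    {"city": "Mumbai", "sales": "200"},
]

def capture():
    """Capture: gather the raw data (here, just copy the sample rows)."""
    return list(RAW_ROWS)

def maintain(rows):
    """Maintain: normalize city names and drop rows with missing sales."""
    cleaned = []
    for row in rows:
        if not row["sales"]:
            continue
        cleaned.append({"city": row["city"].strip().title(),
                        "sales": float(row["sales"])})
    return cleaned

def process(rows):
    """Process: group the cleaned sales figures by city."""
    groups = {}
    for row in rows:
        groups.setdefault(row["city"], []).append(row["sales"])
    return groups

def analyze(groups):
    """Analyze: compute the average sales per city."""
    return {city: mean(values) for city, values in groups.items()}

def communicate(summary):
    """Communicate: turn the result into short readable report lines."""
    return [f"{city}: average sales {avg:.1f}"
            for city, avg in sorted(summary.items())]

report = communicate(analyze(process(maintain(capture()))))
for line in report:
    print(line)
```

Each function takes the output of the previous stage, which mirrors how the lifecycle hands data from one stage to the next.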

Data Science Tools


The data science profession is challenging, but fortunately, there are
plenty of tools available to help data scientists succeed at their jobs.

• Data Analysis: SAS, Jupyter, RStudio, MATLAB, Excel, RapidMiner

• Data Warehousing: Informatica/Talend, AWS Redshift

• Data Visualization: Jupyter, Tableau, Cognos, RAW

• Machine Learning: Spark MLlib, Mahout, Azure ML Studio

Applications of Data Science

1. In Search Engines

One of the most useful applications of data science is in search engines.
When we want to find something on the internet, we mostly use search
engines such as Google and Yahoo. Data science is used to return relevant
search results faster.

For example, when we search for “Data Structure and algorithm courses”
in a web browser, the first link we get may be GeeksforGeeks Courses.
This happens because the GeeksforGeeks website is visited most often for
information about data structure courses and computer-related subjects.
This analysis is done using data science, so the most visited web links
appear at the top.
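
The idea of showing the most visited matching links first can be sketched as follows. This is a simplified illustration under stated assumptions: the pages, URLs, and visit counts are made up, and real search engines combine many more signals than visit counts.

```python
# Hypothetical pages with made-up visit counts (not real traffic data).
PAGES = [
    {"url": "site-a.example/dsa",
     "title": "Data Structure and Algorithm Courses", "visits": 900},
    {"url": "site-b.example/dsa",
     "title": "Algorithm and Data Structure Courses Online", "visits": 300},
    {"url": "site-c.example/cooking",
     "title": "Cooking Courses", "visits": 5000},
]

def search(query, pages):
    """Return URLs of pages whose title contains every query word
    (as a substring), ordered from most visited to least visited."""
    words = query.lower().split()
    matches = [p for p in pages
               if all(w in p["title"].lower() for w in words)]
    matches.sort(key=lambda p: p["visits"], reverse=True)
    return [p["url"] for p in matches]

results = search("data structure courses", PAGES)
print(results)
```

The cooking page has the most visits overall but is left out because it does not match the query, so relevance is checked before popularity is used for ordering.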

2. In Transport

Data science has also entered the transport field, for example in
driverless cars. Driverless cars can help reduce the number of accidents.

For example, in driverless cars, training data is fed into an algorithm
and analyzed with data science techniques to learn things such as the
speed limit on highways, busy streets, and narrow roads, and how to
handle different situations while driving.

3. In Finance

Data science plays a key role in the financial industry. Financial
institutions constantly face fraud and the risk of losses, so they need to
automate risk-of-loss analysis in order to make strategic decisions. They
also use data science analytics tools to forecast future outcomes, such as
customer lifetime value and stock market movements.

For example, data science plays a central part in the stock market. It is
used to examine past behavior using historical data, with the goal of
predicting future outcomes. The data is analyzed so that future stock
prices can be estimated over a set time frame.
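
As a rough illustration of estimating a future value from past data, the sketch below fits a straight line to a few past prices using ordinary least squares and extends it by one day. The prices are invented for the example and the helper name fit_line is an assumption; real stock prices are noisy, and practical forecasting models are considerably more involved than a single trend line.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Made-up closing prices for five past days.
days = [1, 2, 3, 4, 5]
prices = [10.0, 12.0, 14.0, 16.0, 18.0]

slope, intercept = fit_line(days, prices)
forecast_day_6 = slope * 6 + intercept  # extend the trend by one day
print(forecast_day_6)
```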

4. In E-Commerce

E-commerce websites such as Amazon and Flipkart use data science to
create a better user experience with personalized recommendations.

For example, when we search for something on an e-commerce website, we
get suggestions similar to our past choices, as well as recommendations
based on the most bought, most highly rated, and most searched products.
This is all done with the help of data science.
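
One simple way to produce "bought together" suggestions is to count which items appear in the same past orders. The sketch below assumes a small made-up order history and a hypothetical recommend function; it is not how any particular website implements recommendations.

```python
from collections import Counter

# Made-up past orders; each order is the set of items bought together.
ORDERS = [
    {"phone", "case", "charger"},
    {"phone", "case"},
    {"phone", "earphones"},
    {"laptop", "mouse"},
]

def recommend(item, orders, top_n=2):
    """Suggest the items most often bought together with `item`."""
    counts = Counter()
    for order in orders:
        if item in order:
            for other in order:
                if other != item:
                    counts[other] += 1
    # Most frequent first; ties are broken alphabetically.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [name for name, _ in ranked[:top_n]]

suggestions = recommend("phone", ORDERS)
print(suggestions)
```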

5. In Health Care

In the healthcare industry, data science acts as a boon. Data science is
used for:

• Tumor detection

• Drug discovery

• Medical image analysis

• Virtual medical bots

• Genetics and genomics

• Predictive modeling for diagnosis

6. Image Recognition

Data science is currently also used in image recognition. For example,
when we upload a picture of ourselves with a friend on Facebook,
Facebook suggests tagging the people in the picture. This is done with
the help of machine learning and data science. When an image is
recognized, the faces in it are compared against the user's Facebook
friends, and if a face matches someone's profile, Facebook suggests
auto-tagging that person.

7. Targeting Recommendation

Targeted recommendation is one of the most important applications of
data science. Whatever a user searches for on the internet, he or she
will then see related posts everywhere. An example makes this clear:
suppose I want a mobile phone, so I search for it on Google, and then I
change my mind and decide to buy it offline. Data science helps the
companies that pay for advertisements for that phone. Everywhere on the
internet, on social media, on websites, and in apps, I will see
recommendations for the phone I searched for, which nudges me to buy it
online.

8. Airline Route Planning

Data science is also helping the airline sector grow, for example by
making flight delays easier to predict. It also helps decide whether a
flight should fly directly to its destination or make a stop on the way.
For instance, a flight from Delhi to the U.S.A. can take a direct route
or halt in between before reaching its destination.

9. Data Science in Gaming

In most games where a user plays against a computer opponent, data
science concepts are used together with machine learning so that the
computer improves its performance based on past data. Many games, such
as chess and EA Sports titles, use data science concepts.

10. Medicine and Drug Development

The process of creating a medicine is difficult and time-consuming, and
it must be carried out with full discipline because lives are at stake.
Without data science, developing a new medicine or drug takes a great
deal of time, resources, and money. With data science, the process
becomes easier because the likelihood of success can be estimated from
biological data and other factors. Algorithms based on data science can
help forecast how a drug will act in the human body before lab
experiments are run.

11. In Delivery Logistics

Logistics companies such as DHL and FedEx make use of data science. It
helps these companies find the best route for shipping their products,
the best time for delivery, the best mode of transport to reach the
destination, and so on.
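
Finding the best route can be illustrated with a shortest-path search. The sketch below uses Dijkstra's algorithm on a small made-up road network; the place names and travel costs are assumptions, and this is not a description of how any particular company plans its routes.

```python
import heapq

# Made-up road network: travel cost from each place to its neighbors.
ROADS = {
    "Warehouse": {"A": 4, "B": 1},
    "A": {"Customer": 1},
    "B": {"A": 2, "Customer": 6},
    "Customer": {},
}

def shortest_route(graph, start, goal):
    """Dijkstra's algorithm: return (total_cost, path), or None if the
    goal cannot be reached from the start."""
    queue = [(0, start, [start])]  # (cost so far, current node, path taken)
    visited = set()
    while queue:
        cost, node, path = heapq.heappop(queue)
        if node == goal:
            return cost, path
        if node in visited:
            continue
        visited.add(node)
        for neighbor, weight in graph[node].items():
            if neighbor not in visited:
                heapq.heappush(queue,
                               (cost + weight, neighbor, path + [neighbor]))
    return None

best = shortest_route(ROADS, "Warehouse", "Customer")
print(best)
```

In this network the cheapest route goes through B and then A, even though the road from the warehouse to A looks more direct.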

12. Autocomplete

The autocomplete feature is an important application of data science:
the user types just a few letters or words and is offered a completion
of the line. In Google Mail, for example, when we write a formal email,
autocomplete offers an efficient way to complete the whole line.
Autocomplete is also widely used in search engines, on social media, and
in various apps.
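
A very small version of autocomplete can be built by matching the typed prefix against past entries and showing the most frequent matches first. The history list and the autocomplete function below are assumptions for illustration; production systems typically rely on far larger datasets and language models.

```python
from collections import Counter

# Made-up history of previously typed queries.
PAST_QUERIES = [
    "data science",
    "data structures",
    "data science",
    "database",
    "deep learning",
]

def autocomplete(prefix, history, limit=3):
    """Return up to `limit` past queries that start with `prefix`,
    most frequent first (ties broken alphabetically)."""
    counts = Counter(history)
    prefix = prefix.lower()
    matches = [q for q in counts if q.startswith(prefix)]
    matches.sort(key=lambda q: (-counts[q], q))
    return matches[:limit]

completions = autocomplete("data", PAST_QUERIES)
print(completions)
```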
