Data Analytics-Moutran Diane
STUDENT ID # 202200057
We now live in the era of big data. Data in its raw form does not mean anything. It needs to be
analyzed to yield useful information that helps drive smart business decisions, solve
challenges and develop new and innovative products and services [1]. This process is known as Data
Analytics.
The terms below are related to Data Analytics:
1-DATA MINING
Data Mining is the process of sorting through large datasets to find valuable information,
patterns and relationships. It helps increase efficiency in business operations [2].
Example: Stock market companies use data mining to extract hidden predictive information that
can lead to an increase in revenues and a decrease in costs.
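As a rough illustration of finding relationships in a dataset, the sketch below counts which pairs of items appear together most often in a set of transactions. It is a minimal example under stated assumptions: the transactions, the function name frequent_pairs and the min_support threshold are all invented for illustration, and real data mining works on far larger datasets with more advanced techniques.

```python
from collections import Counter
from itertools import combinations


def frequent_pairs(transactions, min_support):
    """Count how often each pair of items occurs together and keep
    the pairs seen in at least `min_support` transactions."""
    counts = Counter()
    for items in transactions:
        # sorted(set(...)) removes duplicates and gives each pair one canonical order
        for pair in combinations(sorted(set(items)), 2):
            counts[pair] += 1
    return {pair: n for pair, n in counts.items() if n >= min_support}


# Hypothetical toy data, invented for this example
transactions = [
    ["bread", "milk"],
    ["bread", "butter", "milk"],
    ["butter", "milk"],
    ["bread", "milk", "eggs"],
]

pairs = frequent_pairs(transactions, min_support=2)
print(pairs)
```

Here the pair (bread, milk) stands out as the most frequent combination, which is the kind of pattern a business could act on.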
2-DATA LAKE
A Data Lake is a big data storage repository that holds raw data in its native format before it
is organized and structured. It stores the data in files that are highly accessible and easily
updated, at a low cost [3].
Example: Google Cloud Storage can be used to store raw data.
3-DATA MANAGEMENT
Data Management is simply the practice of managing information. Its purpose is to ensure that
data is always accurate, consistent, secure, reliable and accessible so it can be analyzed for
business decisions [4].
Example: Blackboard, a platform used by colleges, serves as a reporting solution that helps
students understand and optimize every dimension of their studies and institution.
4-DATA VISUALIZATION
Data Visualization is the process of transforming large amounts of raw data into graphs, charts,
images and even videos to make it more understandable. It allows us to gain insights, discover
new patterns and spot trends in a user-friendly, creative way, so the data becomes much easier
to interpret [5].
Examples: Pie charts, bar charts, maps (cartography), color-coded dashboards.
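To show the idea of turning raw numbers into a picture, here is a minimal sketch that draws a bar chart using only text characters. The sales figures and the function name text_bar_chart are assumptions made for this example. A real project would more likely use a charting library or a dashboard tool.

```python
def text_bar_chart(values, width=20):
    """Return one line per label, with a bar scaled to the largest value."""
    largest = max(values.values())
    lines = []
    for label, value in values.items():
        # Scale each bar so the largest value fills the full width
        bar = "#" * round(value / largest * width)
        lines.append(f"{label:<8} {bar} {value}")
    return lines


# Hypothetical quarterly sales, invented for this example
sales = {"Q1": 50, "Q2": 100, "Q3": 75}

chart = text_bar_chart(sales)
print("\n".join(chart))
```

Even in this simple form, the longest bar makes the best quarter visible at a glance, without reading the numbers.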
5-DATA VERACITY
Data Veracity is defined as the accuracy or truthfulness of a data set. It covers not just the
quality of the data itself but also how trustworthy its source, type and processing are [6].
Example: Wikipedia can be edited by anyone at any time, so its content generally cannot be
trusted without checking it against another source.
6-DATA-AS-A-SERVICE
Data-as-a-Service (DaaS) is a data management strategy that treats data as a product. It is
built on the premise that data can be delivered to the user on demand via the cloud. Users no
longer have to install software locally for data storage, integration and processing. Instead,
this is done in the cloud [7].
Example: iCloud, Dropbox, WeTransfer.
7-INTERNET OF THINGS (IoT)
The Internet of Things (IoT) is a giant network of connected devices. These devices gather
and share data about how they are used and the environment in which they operate. This is
done using sensors embedded in each physical device, which continuously emit data about the
working state of the device [8].
Example: A smartwatch tracking daily activities.
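The sketch below imitates how a sensor might emit readings that are then collected and summarized. It is only an illustration under stated assumptions: the Reading class, the device name "watch-01" and the step counts are invented, and a real smartwatch would send its data over a network rather than through a Python generator.

```python
from dataclasses import dataclass


@dataclass
class Reading:
    device_id: str
    minute: int
    steps: int


def emit_readings(device_id, steps_per_minute):
    """Yield one reading per minute, the way an embedded sensor
    might report its state at regular intervals."""
    for minute, steps in enumerate(steps_per_minute):
        yield Reading(device_id, minute, steps)


def total_steps(readings):
    """Aggregate the shared readings into a single activity figure."""
    return sum(reading.steps for reading in readings)


# Hypothetical step counts for four consecutive minutes
readings = list(emit_readings("watch-01", [0, 42, 110, 95]))
activity_total = total_steps(readings)
print(activity_total)
```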
8-ARTIFICIAL INTELLIGENCE
Artificial Intelligence is the building of smart machines and systems that can acquire and process
data in the way that the human brain would [9].
Example: Siri, the virtual assistant built into Apple devices, recognizes and understands
spoken human language.
9-DATA PIPELINE
A Data Pipeline is a mechanism used to safely move raw data from a source (such as an
application) to a destination (such as a data warehouse). Along the way, the data is optimized
and organized [10].
Example: A debit card company would need a data pipeline to safely move its data through the
payment processing system.
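The movement from source to destination can be sketched as three small steps: extract, transform and load. This is a minimal illustration, not how a payment system is actually built. The sample records, the table name payments and the in-memory SQLite database standing in for a data warehouse are all assumptions made for the example.

```python
import csv
import io
import sqlite3

# Hypothetical raw export from a source application (note the messy values)
RAW_CSV = """card_id,amount,currency
A1, 12.50 ,usd
B2,,usd
C3,7.00,eur
"""


def extract(raw_text):
    """Read the raw records from the source."""
    return list(csv.DictReader(io.StringIO(raw_text)))


def transform(rows):
    """Organize the data: drop rows with no amount, trim spaces,
    convert amounts to numbers and upper-case the currency."""
    cleaned = []
    for row in rows:
        amount = row["amount"].strip()
        if not amount:
            continue  # skip incomplete records
        cleaned.append(
            (row["card_id"].strip(), float(amount), row["currency"].strip().upper())
        )
    return cleaned


def load(rows, conn):
    """Write the cleaned records to the destination."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS payments (card_id TEXT, amount REAL, currency TEXT)"
    )
    conn.executemany("INSERT INTO payments VALUES (?, ?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")  # stand-in for a data warehouse
load(transform(extract(RAW_CSV)), conn)
stored = conn.execute(
    "SELECT card_id, amount, currency FROM payments ORDER BY card_id"
).fetchall()
print(stored)
```

Only the two complete records reach the destination, which shows how data is cleaned and organized on its way through the pipeline.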
10-DATA ENGINEERING
Data Engineering is the work of collecting, managing and converting raw data into useful
information for analysts. It is also responsible for developing and maintaining data pipelines
to ensure data quality and accuracy [11].
11-METADATA
Metadata is “data about your data”. It gives information about the source and the identity of
the data [12].
Example: The subject, date, time and format of an email sent or received.
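The email example can be made concrete with Python's standard email module, which separates a message's headers (the metadata) from its body (the data). The message below, including the addresses and the subject, is invented for illustration.

```python
from email import message_from_string

# Hypothetical email, invented for this example
RAW_EMAIL = """From: alice@example.com
To: bob@example.com
Subject: Quarterly report
Date: Mon, 03 Oct 2022 09:15:00 +0000
Content-Type: text/plain

Hi Bob, the report is attached.
"""

msg = message_from_string(RAW_EMAIL)

# The headers describe the message without being part of its content
metadata = {
    "subject": msg["Subject"],
    "date": msg["Date"],
    "format": msg.get_content_type(),
}
body = msg.get_payload()  # the actual data

print(metadata)
```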
REFERENCES
3- PHILIP RUSSOM, “Data Lakes Purposes, Practices, Patterns, and Platforms – Best
Practices Report” Q1-2017 page 5
9- BERNARD MARR, “The Key Definitions of Artificial Intelligence (AI) That Explain Its
Importance”,
https://bernardmarr.com/the-key-definitions-of-artificial-intelligence-ai-that-explain-its-importance/