4.0 Introduction to Data
4.0 Introduction to Data
• `In common usage, data (/ˈdeɪtə/, also US: /ˈdætə/) is a collection of discrete or
continuous values that convey information, describing the quantity, quality, fact,
statistics, other basic units of meaning, or simply sequences of symbols that may
be further interpreted formally. A datum is an individual value in a collection of
data. Data are usually organized into structures such as tables that provide
additional context and meaning, and may themselves be used as data in larger
structures` -- wikipedia
Types of Data in general
• Data can be classified in several ways depending on its nature, structure, and
how it’s processed. Let’s break it down into several key categories:
Examples:
• Social media data (Twitter, Facebook)
• Sensor data from IoT devices
• Streaming data from real-time systems like financial markets or website logs
Nature of data according to Data
science
In data science, the nature of data refers to the characteristics, types, and structures
of the data that are used for analysis, modeling, and decision-making. Data can vary
in its form, source, granularity, and structure, influencing how it is processed
and analyzed
The mentioned data types are used in machine learning however there are more for
example
Granularity of Data
Granularity refers to the level of detail in the data.
• Fine-grained data: Highly detailed data, such as individual transactions, which
provides more insight but can be difficult to aggregate and process. Like Sensor
data collected from a smart device every second.
Examples:
• GIS (Geographic Information System) Data: Maps, satellite images, and
data about locations (e.g., elevation, land use).
• GPS Data: Location tracking data from smartphones or vehicles.