Big Data Analitics Assignment One
Big Data Analitics Assignment One
Mini Assignment
Name ID No
The following are three definitions of big data, each highlighting its major dimensions.
A. Definition by Gartner:
Definition:
"Big data is high-volume, high-velocity, and/or high-variety information assets that demand
cost-effective, innovative forms of information processing for enhanced insight and decision
making."
Major Dimensions:
• Velocity: The speed at which new data is generated and moves around.
B. Definition by IBM:
Definition:
"Big data is the term applied to data sets whose size or type is beyond the ability of traditional
relational databases to capture, manage, and process the data with low latency. It has one or
more of the following characteristics – high volume, high velocity, and high variety. High
veracity and high value are also considerations."
Major Dimensions:
• Value: The potential insights and benefits that can be derived from the data.
C. Definition by McKinsey & Company:
Definition:
"Big data refers to datasets whose size is beyond the ability of typical database software tools
to capture, store, manage, and analyze. This data is characterized by its sheer volume, the speed
at which it must be processed, and the variety of forms it takes."
Major Dimensions:
• Volume: The large scale of data that exceeds traditional storage capabilities.
2. Review three source of data in our context preferable if the source is from different
discipline?
Soil Data
Soil data is critical for agriculture, environmental management, and land-use planning.
Understanding soil characteristics helps in optimizing crop production, managing soil health,
and mitigating environmental impacts.
Major Dimensions:
• Variety: Includes soil composition, fertility, erosion, and land use practices.
Rural Land Administration data is crucial for managing land resources, ensuring land tenure
security, and supporting agricultural development.
Volume: - Large datasets containing detailed records of land ownership, usage rights,
boundaries, and transactions.
• Example: Records of millions of land parcels, each with specific details about
ownership and land use.
Variety: - Includes a wide range of data types such as cadastral maps, land registration
documents, land use plans, and demographic information of landholders.
Velocity: - Data is continuously updated as land transactions occur, ownership changes, and
new surveys are conducted.
• Example: Real-time updates to the land registry when a land transaction is completed
or a boundary is redefined.
Livestock data
Livestock data is vital for managing and improving the livestock sector, ensuring animal health,
optimizing production, and informing policy decisions.
Example: Records of millions of cattle, sheep, goats, poultry, and other livestock across
different regions.
Variety: -Includes diverse data types such as animal identification, health records, production
metrics, breeding information, and economic data.
• Example: Data on birth rates, vaccination records, milk production, meat yield, and
genetic information.
Velocity: -Data is continuously updated based on ongoing activities such as births, sales,
medical treatments, and productivity measures.