1.6 Understanding The Data Landscape
1.6 Understanding The Data Landscape
UNIT NO 1
IV
VIII
1.Types of Data
2.Data Sources
3.Data Collection Methods
SUBJECT CODE
20CJEL803
BRIEF NAME
TYPE THE SUBJECT HISTORY
HERE OF WEB ANALYTICS
HERE
Types of Data:
Structured Data:
Organized and easily stored in relational databases, including numerical values,
customer records, and financial transactions.
Unstructured Data:
Complex and varied formats such as emails, social media posts, images, and
videos that require specialized processing techniques.
Semi-Structured Data:
A mix of both structured and unstructured data, such as JSON and XML files,
providing flexibility in data storage and retrieval.
Big Data:
Large-scale data sets characterized by volume, velocity, and variety, necessitating
advanced analytics and processing tools.
SUBJECT CODE
20CJEL803
BRIEF NAME
TYPE THE SUBJECT HISTORY
HERE OF WEB ANALYTICS
Collected directly from users through interactions with websites, apps, and
business platforms.
Second-Party Data:
Shared data obtained from trusted partners for enhanced insights and audience
targeting.
Third-Party
- Data:
Acquired from external providers, commonly used for market research,
advertising, and competitive analysis.
Open Data:
Publicly available data from government agencies, research institutions, and
organizations to support transparency and innovation.
SUBJECT CODE
20CJEL803
BRIEF NAME
TYPE THE SUBJECT HISTORY
HERE OF WEB ANALYTICS
Managing the increasing volume of data while ensuring efficient storage and
retrieval.
Adoption of cloud computing and distributed storage solutions to
accommodate growth.
SUBJECT CODE
20CJEL803
BRIEF NAME
TYPE THE SUBJECT HISTORY
HERE OF WEB ANALYTICS
Artificial
. Intelligence and Machine Learning:
Enhancing data analysis capabilities with predictive modeling and automated insights.
AI-driven data cleaning and augmentation techniques to improve data quality.
Edge Computing:
Processing data closer to the source to reduce latency and improve real-time
decision-making.
Integration with IoT for faster and decentralized analytics.
Data Democratization:
Enabling access to data for non-technical users through intuitive visualization and
self-service analytics tools. Promoting data literacy initiatives within organizations.