0% found this document useful (0 votes)
12 views15 pages

Data Collection and Storage

Uploaded by

roushannitin4596
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
12 views15 pages

Data Collection and Storage

Uploaded by

roushannitin4596
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 15

Different types of Data Collection

Techniques
 Web Scraping: Web scraping involves automatically extracting data from websites and web pages.

 It is commonly used to gather data from websites, including product details, news articles, and user
reviews.
 Web scraping is used in a variety of digital businesses that rely on data harvesting. Legitimate use cases
include:
 Search engine bots crawling a site, analyzing its content and then ranking it.
 Price comparison sites deploying bots to auto-fetch prices and product descriptions for allied seller
websites.
 Social Media Data Collection:
 Data scientists collect and analyze data from social media platforms to gain insights into user
behavior, sentiment, and trends.
 APIs provided by platforms like Twitter, Facebook, and Instagram are commonly used for data
retrieval.
 On Facebook, a social media giant with a return on equity forecast to be high in 3 years (38.82%),
data includes numbers of likes, increases in followers, or number of shares. On Twitter, results
include numbers of impressions or retweets.
 Time Series Data Collection:
 Time series data is collected at regular intervals over time and is essential for forecasting and analysis.
Examples include stock prices, weather measurements, and IoT sensor data.
 Time series analysis typically requires a large number of data points to ensure consistency and
reliability. An extensive data set ensures you have a representative sample size and that analysis can
cut through noisy data.
 It also ensures that any trends or patterns discovered are not outliers and can account for seasonal
variance.
 Sensor data is the information collected by a Data Sensor component when, at an instance in time, it scans one or
more IMS database environments and measures the specified conditions (or states) occurring in those
environments.

 Often coming together in a network, sensors generate mass quantities of sensor data that may or may not be
immediately useful for decision-makers. Each of these data points is captured at a specific moment in time,
effectively transforming sensor data into time series data that can be analyzed across this additional dimension.
 Mobile Apps Data:
 Mobile applications and IoT devices collect data from sensors like GPS, accelerometers, and
gyroscopes.
 This data is used for various purposes, including location tracking, health monitoring, and
environmental sensing.
 Audio data refers to any kind of digital data that represents a sound. This can include speech, music,
environmental sounds, or any other type of audio signal.

 Audio data is typically stored in digital formats such as WAV, MP3, or AAC, and can be processed
and manipulated using various software tools and techniques.
 Video Data Collection:Video data collection is the process of capturing video data. It can be done
manually, via handheld devices like phones and cameras, with the data uploaded through a data
collection platform.

 For more streamlined and efficient processing, video data collection can be automated or streamlined
by using in-production devices or gleamed through existing data sources (e.g. security cameras, car
dashboard cameras, etc.).
 Satellite and Remote Sensing Data: Remote sensing technology collects data from satellites and
sensors, providing information about Earth's surface, weather patterns, and environmental changes.

 Satellite data provides satellite imagery and earth observation data of the earth’s surface and its
atmosphere. Satellites also provide images of other planets. Resolution images of the earth indicate
changes in land cover, cloud cover, ocean levels, ice cover, and atmospheric composition.
 Healthcare data refers to a wide range of information related to the health and medical history of
individuals, patients, and populations. This data is collected, stored, and analyzed to support medical
treatment, research, healthcare management, and decision-making. Healthcare data can encompass various
types of information.
 Text and Document Analysis:-The process involves evaluating electronic and physical documents to
interpret them, gain an understanding of their meaning and develop upon the information they provide.

 This field of analysis is crucial in various domains, including natural language processing (NLP), data
mining, information retrieval, sentiment analysis, and content summarization. Here are the key
components and techniques involved in text and document analysis:
 Crowdsourcing is a method of obtaining ideas, services, content, or contributions by soliciting input,
work, or information from a large group of people, typically from an online community or the general
public.

 It leverages the collective intelligence and efforts of a diverse group of individuals to achieve a
common goal. Crowdsourcing has become increasingly popular in various fields and industries,
including technology, research, business, and creative endeavors.
 Biometric Data Collection:

 In addition to biographic data, many ID systems collect fingerprints, iris scans, facial images, and/or
other biometry to use for biometric recognition—automatic recognition of individuals based on their
biological or behavioral characteristics.

 This process involves comparing a template generated from a live biometric sample (e.g., a
fingerprint or selfie) to previously stored biometric(s) to determine the probability that they are a
match.
 Log Files and Server Data:
 Server log files record user interactions and activities on websites and applications.
 Analyzing log data helps in understanding user behavior, performance issues, and security events.

You might also like