0% found this document useful (0 votes)
48 views10 pages

Lecture 1 (With Ans)

The document discusses computational tools for statistics and data analysis. It defines big data as large datasets that cannot be captured, organized or processed within a reasonable time frame using traditional software. Examples of big data sources and characteristics like volume, velocity and variety are provided. The differences between big data and traditional databases in terms of data sources, frequency and structure are explored. Several real-world applications of data analysis in fields such as banking, education, transportation and more are described through case studies.

Uploaded by

劉泳
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
48 views10 pages

Lecture 1 (With Ans)

The document discusses computational tools for statistics and data analysis. It defines big data as large datasets that cannot be captured, organized or processed within a reasonable time frame using traditional software. Examples of big data sources and characteristics like volume, velocity and variety are provided. The differences between big data and traditional databases in terms of data sources, frequency and structure are explored. Several real-world applications of data analysis in fields such as banking, education, transportation and more are described through case studies.

Uploaded by

劉泳
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 10

SEHH1071

C O M P U TAT I O N A L T O O L S
F O R S TAT I S T I C S

I N T R O D U C T I O N T O D ATA - A L L A B O U T D ATA !
OUTLINE

• Basic and trends in Data

• Big Data Analysis vs Traditional Data Analysis

• Application of Data Analysis


RECAP

Data Information
Stored Presented
Raw Processed
Technical based Application based
Collected Analyzed
Not ready for understanding Easy to understand
Input Output
WHAT IS SO SPECIAL ABOUT BIG
DATA?
• Definition
– Big data includes datasets that have the size beyond the capability of traditional software and tools
to capture, organize and process data with a reasonable time frame

– Big data is with high volume, high velocity and high variety that need to make use of the data mining
and unstructured tools to obtain pattern and information for decision making.
WHAT IS BIG DATA? WHY?

Some big data facts about IG:


Volume
- >800 million Instagram users

- 500 millions are active every day

- >30% users look at their IG more


than once every day Velocity Variety
- >90 millions photos and videos are
shared everyday

Beyond the capacity of the conventional database system


BIG DATA VS TRADITIONAL DATABASE
• New source of data

• Data frequency

• Data/problem structure

• Not Only SQL


– Unstructured (may not have predefined schema as relational database)
• Relational database
– data are stored in inter-related tables that contain rows and columns
– use of foreign keys to reference the tables

• Expand horizontally when scale increases


APPLICATION OF DATA ANALYSIS
NOWADAYS
• Banking • Education
– Customer spending pattern – Analysis Learning through online planform
– Credit risk analysis
• Government and charity
• Communication and media – Operation efficiency
– Accurate and useful Information delivery
– Pattern of information demand • Transportation
– Routing
• Healthcare
– Effective and efficient utilization of medical resources • Business
– Clustering and segmentation
– Recommendation

https://www.youtube.com/watch?v=rl7ZBqjB6MI
APPLICATION OF DATA ANALYSIS NOWADAYS
• Case 1: IG Story for marketing

– Goals:
• Using KPI to analysis the effectiveness of
your advertisement in your IG

– Inputs:
• Views
• Tags (Geotag, hashtag)
• Taps (tap back, tap forward)

– Outputs (e.g. Instagram insight)


APPLICATION OF DATA ANALYSIS NOWADAYS
• Case 2: Research Study on
anthropologist with social media
photos

– Goals:
• Investigate the cultural phenomenon

– Inputs:
• Aanalyze over 100 million of photos
being uploaded to social media

Source: https://www.technologyreview.com/s/608116/data-mining-100-million-instagram-photos-reveals-global-
clothing-patterns/
APPLICATION OF DATA ANALYSIS
NOWADAYS
• Case 3: Public Transport System in Jakarta

– Goals: Improving the public transport system in the city (e.g. bus scheduling)

– Inputs:
• Real time GPS data from bus
• Passengers tap-in data

– Outputs:

• Real Time arrival time


• Congestion information

This Photo by Unknown Author is licensed under CC BY-SA


Source: Global Pulse, 2017. USING BIG DATA ANALYTICS FOR IMPROVED
PUBLIC TRANSPORT

You might also like