Lecture 3 - What Is Big Data Analytics - GEOG 3226 - 2023
GEOG 3226
Urban Data Science
Source: utimaco.com
• Big data only becomes useful when
we do something with it
• You have big urban data, but you don’t know where to start.
These four questions can be good cornerstones!
Source: techtarget.com
• Descriptive analytics is relatively
accessible and easy-to-use
• Basis for Planning and Strategy: Understanding past trends and patterns
provides valuable insights that can be the foundation for strategic planning and
decision-making
1. Hypothesis testing
• When applied in diagnostic analytics, _____ are oriented toward the past
• Example: “I assume this month’s decline in COVID-19 infections
was caused by the recent increase in vaccination.”
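A hypothesis like the one above can be checked against past data. The sketch below is one possible approach, an exact permutation test written with the standard library only; the weekly case counts are made-up numbers for illustration, and the function name `permutation_p_value` is hypothetical. Note that a small p-value only says the drop is unlikely to be chance; it does not by itself show that vaccination caused it.

```python
from itertools import combinations


def permutation_p_value(before, after):
    """One-sided exact permutation test.

    Returns the share of all possible regroupings of the pooled values
    whose mean difference (first group minus second group) is at least
    as large as the observed difference mean(before) - mean(after).
    """
    pooled = list(before) + list(after)
    k = len(before)
    rest = len(pooled) - k
    total = sum(pooled)
    observed = sum(before) / k - sum(after) / rest

    extreme = 0
    n_splits = 0
    for idx in combinations(range(len(pooled)), k):
        group_sum = sum(pooled[i] for i in idx)
        diff = group_sum / k - (total - group_sum) / rest
        n_splits += 1
        # Small tolerance guards against floating-point rounding.
        if diff >= observed - 1e-12:
            extreme += 1
    return extreme / n_splits


# Made-up weekly infection counts, for illustration only.
cases_before = [100, 110, 105, 108, 102]  # weeks before the vaccination increase
cases_after = [60, 65, 70, 62, 68]        # weeks after the vaccination increase

p = permutation_p_value(cases_before, cases_after)
print(f"p-value = {p:.4f}")
```

With only five weeks per group there are 252 possible regroupings, so the test can enumerate all of them instead of sampling.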
Income
Education
However, money spent on pets and the number of lawyers do not cause each other. 😳?
Source: vox.com
• While determining causation is ideal, correlation can still offer the insight
needed to make sense of your data and use it to make good decisions
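To make the idea of correlation concrete, here is a minimal sketch of the Pearson correlation coefficient in plain Python. The two series are invented numbers that merely echo the pets-and-lawyers example; they are not the data behind the figure, and the helper name `pearson_r` is an assumption.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length (at least 2)")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Invented yearly values, for illustration only.
pet_spending = [40.0, 42.0, 45.0, 48.0, 52.0, 55.0]  # e.g. billions of dollars
lawyers = [1.00, 1.03, 1.07, 1.10, 1.15, 1.18]       # e.g. millions of lawyers

r = pearson_r(pet_spending, lawyers)
print(f"r = {r:.3f}")
```

A value of r close to 1 here only says the two series rise together over time. It says nothing about whether one drives the other.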
Diagnostic regression analysis (Week 11)
• Some relationships between variables require in-depth examination by
_____ analysis, which can be used to determine the relationship between
1) two variables (simple linear regression) or 2) three or more variables (multiple regression)
• Regression can provide insights into the structure of the relationship between
variables and provides measures of how well the data fit that relationship
• Such insights can prove extremely valuable for analyzing historical trends
(diagnostic analytics) and developing forecasts (predictive analytics)
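As a rough illustration of the two-variable case, the sketch below fits a straight line by ordinary least squares and reports R², one common measure of how well the data fit the relationship. The temperature and sales figures are hypothetical, and `fit_line` is an illustrative helper, not a library function.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = intercept + slope * x.

    Returns (slope, intercept, r_squared).
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # R^2 = 1 - (residual sum of squares / total sum of squares)
    ss_res = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return slope, intercept, 1 - ss_res / ss_tot


# Hypothetical daily observations, for illustration only.
temperature = [10, 15, 20, 25, 30]      # degrees Celsius
drinks_sold = [120, 150, 185, 210, 245]  # cold drinks per day

slope, intercept, r_squared = fit_line(temperature, drinks_sold)
forecast = intercept + slope * 28  # predicted sales on a 28-degree day
print(f"slope={slope:.2f}, intercept={intercept:.2f}, R^2={r_squared:.4f}")
print(f"forecast at 28 degrees: {forecast:.1f}")
```

The fitted slope and R² describe the historical relationship (diagnostic use), while plugging a new temperature into the fitted line gives a forecast (predictive use).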
Example of diagnostic analytics: Correlation
• Example: Starbucks
• Based on the correlation found between the “feels-like” temperature and the
number of Frappuccinos sold,
Starbucks predicts sales volume and runs localized marketing efforts
https://mediaspace.esri.com/media/t/1_wcl6obys (9:20)
Benefits of diagnostic analytics
• Root Cause Analysis: Diagnostic analytics allows analysts to identify the root
causes of observed outcomes, helping them understand the underlying factors
contributing to a specific result
Source: theathletic.com, soccertake.com
• Clubs use analytics to predict future performance, simulate game scenarios, and
make strategic decisions.
• Clubs start to use artificial intelligence and machine learning for data analysis,
providing deeper insights and more accurate predictions.
Benefits of predictive analytics
Source: medium.com
After Exam 1
Source: Big Data and Social Science : Data Science Methods and Tools for
Research and Practice, edited by Ian Foster, et al., CRC Press LLC, 2020
• 3. Text analysis
• 4. Machine learning
• 5. Network analysis
• Identification of _____: Outliers can heavily influence statistical tests and models.
EDA helps in detecting and, if necessary, addressing these outliers
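One common rule of thumb for flagging outliers during EDA is the 1.5 × IQR fence. The sketch below applies it with the standard library; the travel times are invented values, `find_outliers` is an illustrative name, and the rule is just one of several reasonable choices.

```python
from statistics import quantiles


def find_outliers(values, k=1.5):
    """Return values lying more than k * IQR outside the quartiles."""
    q1, _median, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lower or v > upper]


# Invented transit travel times in minutes, for illustration only.
travel_times = [22, 25, 27, 24, 26, 23, 28, 25, 95]

outliers = find_outliers(travel_times)
print(outliers)
```

Whether a flagged value should be removed, corrected, or kept depends on the context: it may be a data error, or it may be the most interesting observation in the data set.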
Outliers
Colonial rulers
Colonial subjects
Transport poverty
Kim, Y., Lee, J., Kim, J., & Nakajima, N. (2021). The Disparity in Transit Travel Time
between Koreans and Japanese in 1930s Colonial Seoul. Findings.
• Main purpose
– Seeing what the data can tell us
before diving into the formal
modeling or hypothesis testing task
Source: https://towardsdatascience.com/exploratory-spatial-data-analysis-esda-spatial-autocorrelation-7592edcbef9a
• Data visualization: Represent data in a meaningful, visual way that users can
interpret and easily comprehend. Provide an accessible way to see and
understand trends, outliers, and patterns in data
Jang, K. M., Chen, J., Kang, Y., Kim, J., Lee, J., & Duarte, F. (2023). Understanding Place Identity
with Generative AI. arXiv preprint arXiv:2306.04662.
• Real-world example:
Voice Assistants like Apple's Siri.
• Real-world example:
Platforms like Netflix or Spotify suggest movies, shows, or songs based on your previous
choices. Machine learning algorithms analyze your viewing or listening habits (and those
of similar users) to recommend content you might enjoy
• Real-world example:
Photo Tagging Features on platforms like
Facebook. When you upload a photo, you
might notice the platform automatically
recognizing and tagging people in the image
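The recommendation example above (suggesting content based on your habits and those of similar users) can be sketched as a toy user-based collaborative filter. This is a deliberately simplified illustration and not how Netflix or Spotify actually work; the users, titles, ratings, and function names are all hypothetical.

```python
from math import sqrt


def cosine(u, v):
    """Cosine similarity between two users' rating dictionaries."""
    shared = set(u) & set(v)
    dot = sum(u[item] * v[item] for item in shared)
    norm_u = sqrt(sum(r * r for r in u.values()))
    norm_v = sqrt(sum(r * r for r in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def recommend(target, ratings, top_n=2):
    """Rank items the target has not rated, weighting each other user's
    rating by how similar that user is to the target."""
    scores = {}
    for other, other_ratings in ratings.items():
        if other == target:
            continue
        similarity = cosine(ratings[target], other_ratings)
        for item, rating in other_ratings.items():
            if item not in ratings[target]:
                scores[item] = scores.get(item, 0.0) + similarity * rating
    return sorted(scores, key=scores.get, reverse=True)[:top_n]


# Hypothetical users and ratings on a 1-5 scale, for illustration only.
ratings = {
    "ana": {"Film A": 5, "Film B": 4},
    "ben": {"Film A": 5, "Film B": 5, "Film C": 5},
    "cy": {"Film B": 1, "Film D": 5, "Film E": 4},
}

suggestions = recommend("ana", ratings)
print(suggestions)
```

Because "ben" rates the shared films much as "ana" does, his other favorite ranks first; "cy" has different tastes, so her picks carry little weight.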
– How to use big, real-time data for urban human mobility and accessibility analysis?