Module 1
Module 1
Module 1
Total Percent
Technology Total Jobs Research Research
Jobs Jobs
Content/quotes from: “Could Machine Learning Be a General-Purpose Technology? Evidence from Online Job Postings” by Goldfarb, Taska & Teodoridis.
ML’s Promise as a GPT (cont.)
Jobs widespread across industries
Industry NAICS2 AI GIS Quantum Fracking Robotics Nanotech IoT CRISPR Cloud Total
Accommodation and Food Services 5,389 883 4 5 1,516 5 828 2 15,090 5,594,113
Admin & Support & Waste Mgmt & Remediation Srvcs. 4,142 2,099 0 75 2,363 6 1,836 9 26,763 2,237,225
Agriculture, Forestry, Fishing and Hunting 820 1,388 0 3 384 0 146 0 2,310 97,644
Arts, Entertainment and Recreation 437 917 0 2 925 7 109 0 2,591 578,889
Construction
Educational Services
335
12,467
2,406
6,836
1
246
45
62
2,330
10,868
22
1,764
361
1,503
High volume
0
1,384
3,709
17,848
850,081
4,018,454
of jobs and
Finance and Insurance 40,261 3,871 26 32 7,809 7 4,569 35 98,700 5,787,861
Health Care and Social Assistance 102,401 34,512 158 612 114,965 734 28,477 1,793 481,770 38,700,000
Information 32,613 4,483 189 26 6,213 85 17,079 2 130,353 2,229,564
Management of Companies and Enterprises
Manufacturing
348
37,215
302
7,738
0
257
4
144
160
73,053
2
582
108
24,472
research jobs
0
1,733
1,156
96,368
80,359
6,307,396
Mining 674 1,098 0 828 489 2 257 0 1,736 216,880
Other Services (except Public Administration) 1,614 1,695 0 77 2,314 12 181 5 5,786 1,120,323
Professional, Scientific, & Technical Services 97,704 39,478 5,403 395 36,715 1,550 33,912 1,683 435,035 7,333,834
Public Administration 4,838 21,175 41 29 3,467 70 680 46 13,446 2,448,184
Real Estate Rental and Leasing 2,086 3,302 5 16 3,815 0 500 1 8,000 1,293,048
Retail Trade 21,705 1,220 14 132 11,214 21 5,917 2 54,247 8,720,486
Transportation and Warehousing 3,411 2,639 1 398 2,803 6 1,236 2 11,433 4,668,538
Utilities 801 3,896 0 51 886 6 493 1 2,877 333,522
Wholesale Trade 1,311 2,680 10 21 2,185 2 1,842 18 6,071 620,793
Total 370,572 142,645 6,355 2,957 284,474 4,883 124,506 6,716 1,415,289 93,237,194
Table A3.1. Number of jobs in data by industry and technology (2015-2018) Research by Goldfarb et al (2019)
ML’s Promise as a GPT (cont.)
Jobs widespread across industries
Industry NAICS2 AI GIS Quantum Fracking Robotics Nanotech IoT CRISPR Cloud Total
Accommodation and Food Services 5,389 883 4 5 1,516 5 828 2 15,090 5,594,113
Admin & Support & Waste Mgmt & Remediation Srvcs. 4,142 2,099 0 75 2,363 6 1,836 9 26,763 2,237,225
Agriculture, Forestry, Fishing and Hunting 820 1,388 0 3 384 0 146 0 2,310 97,644
Arts, Entertainment and Recreation 437 917 0 2 925 7 109 0 2,591 578,889
Construction
Educational Services
335
12,467
2,406
6,836
1
246
45
62
2,330
10,868
22
1,764
361
1,503
High volume
0
1,384
3,709
17,848
850,081
4,018,454
of jobs and
Finance and Insurance 40,261 3,871 26 32 7,809 7 4,569 35 98,700 5,787,861
Health Care and Social Assistance 102,401 34,512 158 612 114,965 734 28,477 1,793 481,770 38,700,000
Information 32,613 4,483 189 26 6,213 85 17,079 2 130,353 2,229,564
Management of Companies and Enterprises
Manufacturing
348
37,215
302
7,738
0
257
4
144
160
73,053
2
582
108
24,472
research jobs
0
1,733
1,156
96,368
80,359
6,307,396
Mining 674 1,098 0 828 489 2 257 0 1,736 216,880
Other Services (except Public Administration) 1,614 1,695 0 77 2,314 12 181 5 5,786 1,120,323
Professional, Scientific, & Technical Services 97,704 39,478 5,403 395 36,715 1,550 33,912 1,683 435,035 7,333,834
Public Administration 4,838 21,175 41 29 3,467 70 680 46 13,446 2,448,184
Real Estate Rental and Leasing 2,086 3,302 5 16 3,815 0 500 1 8,000 1,293,048
Retail Trade 21,705 1,220 14 132 11,214 21 5,917 2 54,247 8,720,486
Transportation and Warehousing 3,411 2,639 1 398 2,803 6 1,236 2 11,433 4,668,538
Utilities 801 3,896 0 51 886 6 493 1 2,877 333,522
Wholesale Trade 1,311 2,680 10 21 2,185 2 1,842 18 6,071 620,793
Total 370,572 142,645 6,355 2,957 284,474 4,883 124,506 6,716 1,415,289 93,237,194
Table A3.1. Number of jobs in data by industry and technology (2015-2018) Research by Goldfarb et al (2019)
ML’s Promise as a GPT (cont.)
Research jobs also widespread across industries
Industry NAICS2 AI GIS Quantum Fracking Robotics Nanotech IoT CRISPR Cloud Total
Accommodation and Food Services 438 24 1 0 43 2 32 2 214 13,410
Admin & Support & Waste Mgmt & Remediation Srvcs. 506 56 0 3 69 1 55 9 472 16,768
Agriculture, Forestry, Fishing and Hunting 243 220 0 0 24 0 6 0 130 9,847
Arts, Entertainment and Recreation 51 110 0 0 12 4 10 0 57 7,823
Construction
Educational Services
53
3,991
76
1,785
0
71
0
14
68
1,979
1
682
13
139
High volume
0
1,095
69
1,023
6,204
267,403
of jobs and
Finance and Insurance 6,409 271 15 3 309 2 249 29 2,132 114,244
Health Care and Social Assistance 11,974 1,905 63 21 8,186 366 895 1,393 7,299 1,078,322
Information 6,148 144 19 0 584 21 404 1 3,383 53,490
Management of Companies and Enterprises
Manufacturing
42
7,682
7
382
0
103
0
35
6
6,024
0
248
9
1,598
research jobs
0
1,363
19
3,306
1,061
357,069
Mining 108 51 0 25 34 1 70 0 37 4,251
Other Services (except Public Administration) 273 162 0 3 102 2 3 0 104 15,440
Professional, Scientific, & Technical Services 13,595 1,423 1,048 23 3,704 1,134 1,798 1,305 8,845 391,943
Public Administration 728 1,544 17 5 401 33 55 37 287 149,862
Real Estate Rental and Leasing 371 544 2 1 83 0 30 0 193 9,976
Retail Trade 3,599 129 9 2 570 14 290 2 1,470 35,200
Transportation and Warehousing 643 98 0 1 95 1 27 1 188 13,639
Utilities 99 28 0 5 29 5 19 0 36 5,762
Wholesale Trade 197 34 0 0 126 2 48 13 128 16,492
Total 57,150 8,993 1,348 141 22,448 2,519 5,750 5,250 29,392 2,568,206
Table A3.2. Number of research jobs in data by industry and technology (2015-2018) Research by Goldfarb et al (2019)
AI and Business
• AI (specifically ML) shows early indicators of being a general-
purpose technology
• Implications for firms if ML is a GPT:
• Realize that most industries are likely to change
• Be patient: the transformative impact may come with a lag
• To effectively leverage the opportunities, managers need to
understand the technology and its applications, and make
changes to their “business models, technology infrastructure,
organizational processes, and culture”
Content from: “Could Machine Learning Be a General-Purpose Technology? Evidence from Online Job Postings” by Goldfarb, Taska &
Teodoridis
Content/Quote from: https://www.simplilearn.com/introduction-to-digital-transformation-tutorial
AI Fundamentals for Non-Data Scientists
Course Introduction
• Data has been very important to businesses for multiple decades, but the
emphasis on big data is relatively new
• Computing capacity: the capacity to store data has increased and the
associated cost has decreased
• Data generation: the world is going digital, more people and things are
connected than ever before
Content/quotes from: “Big Data” by Victor Mayer- Schonberger and Kenneth Cukier, Images: https://www.axios.com/fewer-people-use-facebook-for-news-
13273e71-ab51-49cc-b4ee-1588a241c0c8.html, https://www.imore.com/best-fitbit, https://www.digitaltrends.com/cell-phone-reviews/apple-iphone-xs-review/
Drivers of Big Data
Content/quotes from: “Big Data” by Victor Mayer- Schonberger and Kenneth Cukier
Big Data in Healthcare
Availability
Availability
Tool Developers
Data Experts: Data architects, governance, policy
Decision Making - Exec. & Management: Apply data to solve business issues
Industry Vertical Domain Expertise: Identify relevant business issues, ask the right questions
Choosing a Big Data Tool
“Data Mining”
Machine Learning
AI Fundamentals for Non-Data Scientists
Data Management Tools
• Database
• Data warehouse
• A particular kind of DBMS
• Specialized in the type of data it stores - historic data from many sources
• Specialized in the purpose it serves - analytics
• Example: Azure SQL Data Warehouse, Google BigQuery, Snowflake,
Amazon Redshift
• Big data tools like Hadoop serve two main purposes, storage and processing
• Storage - lots of data (usually in a distributed fashion)
• Processing - involves distributed processing across nodes, parallelizing
the data process as much as possible
• Popular Hadoop distribution: Cloudera
• Database:
• “A structured collection of data”
• Example: an Excel spreadsheet
• Database Management System (DBMS):
• “Allows user to access and manage the database”
• Examples: Excel, Oracle, MongoDB
• Often also referred to as a database
• Data Warehouse:
• A particular kind of DBMS, “specialized in the data it stores - historic data
from many sources - and the purpose it serves - analytics”
• Azure SQL Data Warehouse, Google BigQuery, Snowflake, Amazon Redshift
Content/quotes from: https://www.dremio.com/what-is-a-data-warehouse/
How Do Data Warehouses Work?
• The next step, and the core business opportunity, involves using data to
make predictions about the future, called predictive analytics
• Can we predict demand?
• Can we predict fraud?
Content/quotes from: “Prediction Machines” by Ajay Agrawal, Joshua Gans, and Avi Goldfarb
Predictive Analytics: Amazon Recommendations
Amazon’s Recommendations
Content/quotes from: “Prediction Machines” by Ajay Agrawal, Joshua Gans, and Avi Goldfarb, Images: https://www.netsville.com/2018/11/13/amazon-hq2-bringing-25k-jobs-to-2-new-east-
coast-locations/amazon-logo-square/, https://icon-library.net/icon/shopping-icon-on-amazon-25.html, https://www.imore.com/amazon-app-everything-you-need-know,
https://services.amazon.com/handmade/getting-started-guide.html, https://www.mageplaza.com/blog/product-recomendation-how-amazon-succeeds-with-it.html
Predictive Analytics: Amazon Payments
• Big data & predictive analytics have direct benefits for Amazon
Content/quotes from: “Prediction Machines” by Ajay Agrawal, Joshua Gans, and Avi Goldfarb
AI Fundamentals for Non-Data Scientists
Introduction to Artificial Intelligence
Definition of AI
• “The theory and development of computer systems
able to perform tasks that normally require human
intelligence” - Merriam Webster
• Understanding language, reasoning, speech
recognition, decision-making, navigating the visual
world, manipulating physical objects, etc.
At its core, AI is about
getting computers to do
things that require human
intelligence.
Images: https://www.securitymagazine.com/articles/90871-whats-the-real-role-of-ai-and-ml-in-cybersecurity
Intelligence on a Spectrum
Origins
• Alan Turing, Cryptography
• Can machines think?
• “My contention is that machines can be constructed
which will simulate the behavior of the human mind
very closely.”
• Proposed the Turing test for machine intelligence
(1950) - Can machines do well in imitation games?
• Soon after, a workshop called the Dartmouth Summer Research Project on
Artificial Intelligence was held, which historians believe is what coined the
term AI
Content/quotes from: “A Human’s Guide to Machine Intelligence” by Kartik Hosanagar
“ Calling it AI made it extremely ambitious, and it inspired
many people to enter the field, which has been
responsible for a lot of the progress.”
— PEDRO DOMINGOS
AI in the Press
Much of what the general public knows about AI is about AI playing popular
games, which has been widely covered in the press
Images: https://www.nytimes.com/2011/02/17/science/17jeopardy-watson.html
Progression of AI in Games
• Idea: Learning from data where you have cleanly labeled outputs and
inputs
• Inputs can be referred to as features/covariates
• Outputs are often called targets (this is what we are trying to predict)
• Example - Email data
• The output we are trying to predict is whether an email is spam or not
• The inputs, or features/covariates, are the actual text in the email
• Supervised learning is used when there is cleanly labeled past data,
which have a “correct” answer
• Certain data have been labeled as “spam”
• Certain other data have been labeled as “not spam”
Supervised Learning Example
• Suppose you have two ad copies and you don’t know which will attract
more clicks (and therefore visitors to your website)
• Traditional A/B testing involves showing ad A 50% of the time and ad B 50%
of the time, and then assessing which ad performed better
Ad A Ad B
• Machine learning can improve upon A/B testing through bandit algorithms
• Bandit algorithms update beliefs based upon performance
• They spend more time on best
performers early on while still
learning and improving over time
• The bandit begins by showing
50% A & 50% B, but slowly starts
allocating more and more traffic
to the higher-performing ad as it
learns and confirms which one is
better
AI Fundamentals for Non-Data Scientists
Reinforcement Learning
Conversion rate
What is accuracy?
• Ability to make correct predictions on data you haven’t seen
Quantity of data
• Number of distinct observations
Quantity of data
• Number of distinct observations
Quantity of data
• Number of distinct observations
• Different characteristics about your observations
Quantity of data
• Number of distinct observations
• Different characteristics about your observations