JSS Journal of Statistical Software

June 2017, Volume 78, Book Review 2. doi: 10.18637/jss.v078.b02

Reviewer: Stefano M. Iacus

University of Milan

Big Data and Social Science – A Practical Guide to Methods and Tools

Ian Foster, Rayid Ghani, Ron S. Jarmin, Frauke Kreuter, Julia Lane
Chapman & Hall/CRC, Boca Raton, 2016.
ISBN 978-1-498-75140-7. 356 pp. GBP 25.95 (H).
http://www.bigdatasocialscience.com/

Social science practitioners and researchers face ever larger and more complex datasets.
There is no way around the “big data” issue: big data are now a commodity in the
social sciences. Still, the social data scientist is not yet a common profile in academia and
research institutions. The reason for this incomplete transition, from social scientist to
social data scientist, is the lack of access to proper statistical methods and computer science
tools. This book tries to fill the gap between the willingness to perform a big data analysis in
the social sciences and the actual competence to do it. The authors have made a great effort in
this direction and they succeed to some extent. As data science is a mix of skills and background
knowledge from different fields, it is clearly impossible to fill all the gaps. Therefore, this
book, at times, must remain on the surface.
The book is divided into three parts. The first one, “Model and Curation”, explains how to
deal with simple web scraping and then describes the more complex use of APIs, although it
does not really explain the details of some basic, yet necessary, steps and problems faced by
practitioners, such as authentication, tokens, etc. Apart from that, the Python code presented is
sufficient to understand the API concept. Some practical examples of interactions with the
ORCID and Twitter APIs are also explained. The ORCID to Twitter case study also introduces
the other problem of big data, namely the record linkage issue, i.e., how to put together
data from different sources. This chapter describes applications of survey methodology to
the big data context, like probabilistic record linkage and other useful techniques. Then,
a standard chapter follows about database systems (where do I store the data once I have
them?). Finally, the authors present the distributed computing paradigm to solve elementary
but scalable tasks on huge amounts of data. To date, this is one of the non-specialist books
that treat the topic in sufficient detail. I think this is a plus.
The second part of the book, called “Modelling and Analysis”, goes through the standard
machine learning topics but avoids discussing deep learning, which is quite trendy these
days and available through several open source frameworks. The next two chapters contain
a non-exhaustive review of some text analysis techniques. This part is probably too elementary,
and recent approaches like Word2Vec or aggregated sentiment analysis approaches are
completely missing, although they are quite popular in the social sciences. This second part of the
book ends with the basic ideas of social network analysis.
The third part of the book is dedicated to “Inference and Ethics” but actually starts with
effective data visualization, another fundamental topic. Inference is considered in terms of
errors more than in terms of statistical models (although some are presented). This chapter
clearly addresses the problem of how and where errors arise in big data analysis. This is an
often underestimated problem in social data science. Some solutions to these problems are
also, very briefly, presented. The last chapter discusses Privacy and Confidentiality. This is a
topic usually neglected in technical books but a real concern for the social data scientist,
especially if he or she works in a governmental authority or public institution. This problem
is two-fold: on the one hand, it is largely a legal matter (how to preserve the privacy of the
subjects under investigation, given that many sources of data can be combined); on the other
hand, it concerns the distribution of data for replicability, which is more and more common in
the social sciences.
The book also contains a “Workbook” chapter, which is a collection of Jupyter notebooks
that explain how to replicate most of the examples presented in the book. These notebooks
spare the practitioner the burden of learning everything from scratch: he or she can work
through the examples first and only later dig deeper into the code.
In summary, although there is a growing number of books related to social science and big
data, this volume covers several non-trivial aspects which make it worth having in one's
library, possibly alongside other similar textbooks as a good complement to them. Not all
subjects are treated in full detail, but even where they are not, the overview offered is,
most of the time, valuable, and the reader can always examine other specialized texts later on.

Reviewer:
Stefano M. Iacus
Department of Economics, Management and Quantitative Methods
University of Milan
Via Conservatorio 7, I-20123 Milan, Italy
E-mail: [email protected]

Journal of Statistical Software http://www.jstatsoft.org/

published by the Foundation for Open Access Statistics http://www.foastat.org/
June 2017, Volume 78, Book Review 2 Published: 2017-06-01
doi:10.18637/jss.v078.b02
