CERN Accelerating science

Talk
Title PHYSTAT seminar: From COVID-19 Testing to Election Prediction: How Small Are Our Big Data?
Video
If you experience any problem watching the video, click the download button below
Download Embed
Show n. of views
Mp4:480p
(presenter)
720p
(presenter)
1080p
(presenter)
240p
(presenter)
360p
(presenter)
Copy-paste this code into your page:
Author(s) Meng, Xiao-Li (speaker) (Harvard University)
Corporate author(s) CERN. Geneva
Imprint 2021-07-07. - 4531.
Series (EP-IT Data science seminars)
Lecture note on 2021-07-07T15:00:00
Subject category EP-IT Data science seminars
Abstract

The term “Big Data” emphasizes data quantity, not quality. What will be the effective sample size when we take into account the deterioration of data quality because of, for example, the selection bias in COVID-19 testing or the non-response bias in 2016 US Election polling results? This talk provides an answer to such questions, based on the concept of data defect index (ddi) developed in [1]. It will also discuss briefly the application of ddi for 2020 US Election, as reported in [2].

[1] Xiao-Li Meng (2018) “Statistical paradises and paradoxes in big data (I): Law of large populations, big data paradox, and the 2016 US presidential election”, Annals of Applied Statistics 12 685.

[2] M. Isakov and S. Kuriwaki (2020) “Towards Principled Unskewing: Viewing 2020 Election Polls Through a Corrective Lens from 2016”, Harvard Data Science Review, https://hdsr.mitpress.mit.edu/pub/cnxbwum6/release/3

 Xiao-Li Meng is Professor of Statistics at Harvard. He was Chair of their Statistics Department, and is the Founding Editor in Chief of the Harvard Data Science Review. His research interests include ‘Statistical theory for data science’; and ‘Signal extraction and uncertainty estimates’. He is a very popular lecturer.

EP/IT Data Science Seminar: https://cern.zoom.us/j/98545267593?pwd=akZWdmlyK01zbjFQa0x2c2ZXWW9ydz09
Copyright/License © 2021-2024 CERN
Submitted by [email protected]

 


 Záznam vytvorený 2021-07-19, zmenený 2024-06-26


External link:
Nahraj plný text
Event details