PHYSTAT seminar: From COVID-19 Testing to Election Prediction:  How Small Are Our Big Data?

Talk

Title

PHYSTAT seminar: From COVID-19 Testing to Election Prediction: How Small Are Our Big Data?

Video

If you experience any problem watching the video, click the download button below

Download Embed

Show n. of views

Mp4:

Copy-paste this code into your page:
<iframe width="640" height="360" frameborder="0" src="http://cds.cern.ch/video/2776121?" allowfullscreen></iframe>

Author(s)

Meng, Xiao-Li (speaker) (Harvard University)

Corporate author(s)

CERN. Geneva

Imprint

2021-07-07. - 4531.

Series

(EP-IT Data science seminars)

Lecture note

on 2021-07-07T15:00:00

Subject category

EP-IT Data science seminars

Abstract

The term “Big Data” emphasizes data quantity, not quality. What will be the effective sample size when we take into account the deterioration of data quality because of, for example, the selection bias in COVID-19 testing or the non-response bias in 2016 US Election polling results? This talk provides an answer to such questions, based on the concept of data defect index (ddi) developed in [1]. It will also discuss briefly the application of ddi for 2020 US Election, as reported in [2].

[1] Xiao-Li Meng (2018) “Statistical paradises and paradoxes in big data (I): Law of large populations, big data paradox, and the 2016 US presidential election”, Annals of Applied Statistics 12 685.

[2] M. Isakov and S. Kuriwaki (2020) “Towards Principled Unskewing: Viewing 2020 Election Polls Through a Corrective Lens from 2016”, Harvard Data Science Review, https://hdsr.mitpress.mit.edu/pub/cnxbwum6/release/3

Xiao-Li Meng is Professor of Statistics at Harvard. He was Chair of their Statistics Department, and is the Founding Editor in Chief of the Harvard Data Science Review. His research interests include ‘Statistical theory for data science’; and ‘Signal extraction and uncertainty estimates’. He is a very popular lecturer.

EP/IT Data Science Seminar: https://cern.zoom.us/j/98545267593?pwd=akZWdmlyK01zbjFQa0x2c2ZXWW9ydz09

Submitted by

[email protected]

Back to search

Record created 2021-07-19, last modified 2024-06-26

Similar records

External link:

Event details

Add to personal basket
Export as BibTeX, MARC, MARCXML, DC, EndNote, NLM, RefWorks

CERN Document Server

Access articles, reports and multimedia content in HEP

Main menu

CERN Accelerating science