Tracing State-Level Obesity Prevalence from Sentence Embeddings of Tweets: A Feasibility Study

Zhang, Xiaoyi; Athanasiadou, Rodoniki; Razavian, Narges

arXiv:1911.11324 (cs)

[Submitted on 26 Nov 2019 (v1), last revised 2 Dec 2019 (this version, v2)]

Abstract: Twitter data has been shown to be broadly applicable for public health surveillance. Previous public health studies based on Twitter data have largely relied on keyword matching or topic models to cluster relevant tweets. However, both methods suffer from the short length of texts and the unpredictable noise that naturally occurs in user-generated content. In response, we introduce a deep learning approach that uses hashtags as a form of supervision and learns tweet embeddings for extracting informative textual features. In this case study, we address the specific task of estimating state-level obesity prevalence from diet-related textual features. Our approach yields an estimate that correlates strongly with government data and outperforms the keyword-matching baseline. The results also demonstrate the potential of discovering risk factors using the textual features. This method is general-purpose and can be applied to a wide range of Twitter-based public health studies.

Subjects:	Computation and Language (cs.CL); Social and Information Networks (cs.SI)
Cite as:	arXiv:1911.11324 [cs.CL]
	(or arXiv:1911.11324v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1911.11324

Submission history

From: Xiaoyi Zhang
[v1] Tue, 26 Nov 2019 03:57:15 UTC (294 KB)
[v2] Mon, 2 Dec 2019 21:30:29 UTC (294 KB)
