Improved and Robust Controversy Detection in General Web Pages Using Semantic Approaches under Large Scale Conditions

Linmans, J.; van de Velde, B.; Kanoulas, E.

doi:https://doi.org/10.1145/3269206.3269301

item 1 out of 1

return to search results

Author: J. Linmans
B. van de Velde
E. Kanoulas
Year: 2018
Title: Improved and Robust Controversy Detection in General Web Pages Using Semantic Approaches under Large Scale Conditions
Event: 27th ACM International Conference on Information and Knowledge Management
Book/source title: CIKM'18
Book/source subtitle: proceedings of the 2018 ACM International Conference on Information and Knowledge Management : October 22-26, 2018, Torino, Italy
Pages (from-to): 1647-1650
Number of pages: 4
Publisher: New York, NY: The Association for Computing Machinery
ISBN (electronic): 9781450360142
Document type: Conference contribution
Faculty: Faculty of Science (FNWI)
Faculty of Economics and Business (FEB)
Institute: Informatics Institute (IVI)
Amsterdam Business School Research Institute (ABS-RI)
Abstract: Detecting controversy in general web pages is a daunting task, but increasingly essential to efficiently moderate discussions and effectively filter problematic content. Unfortunately, controversies occur across many topics and domains, with great changes over time. This paper investigates neural classifiers as a more robust methodology for controversy detection in general web pages. Current models have often cast controversy detection on general web pages as Wikipedia linking, or exact lexical matching tasks. The diverse and changing nature of controversies suggest that semantic approaches are better able to detect controversy. We train neural networks that can capture semantic information from texts using weak signal data. By leveraging the semantic properties of word embeddings we robustly improve on existing controversy detection methods. To evaluate model stability over time and to unseen topics, we asses model performance under varying training conditions to test cross-temporal, cross-topic, cross-domain performance and annotator congruence. In doing so, we demonstrate that weak-signal based neural approaches are closer to human estimates of controversy and are more robust to the inherent variability of controversies.
URL: go to publisher's site
Language: English
Persistent Identifier: https://hdl.handle.net/11245.1/abb78f43-9d12-4334-8e97-0ec91ce83881

Downloads

p1647-linmans(Final published version)

Disclaimer/Complaints regulations

If you believe that digital publication of certain material infringes any of your rights or (privacy) interests, please let the Library know, stating your reasons. In case of a legitimate complaint, the Library will make the material inaccessible and/or remove it from the website. Please Ask the Library, or send a letter to: Library of the University of Amsterdam, Secretariat, Singel 425, 1012 WP Amsterdam, The Netherlands. You will be contacted as soon as possible.

p1647-linmans(Final published version)

Disclaimer/Complaints regulations