Skip to main content

Showing 1–17 of 17 results for author: Vyas, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.16356  [pdf, other

    cs.CL

    A Multi-Modal Multilingual Benchmark for Document Image Classification

    Authors: Yoshinari Fujinuma, Siddharth Varia, Nishant Sankaran, Srikar Appalaraju, Bonan Min, Yogarshi Vyas

    Abstract: Document image classification is different from plain-text document classification and consists of classifying a document by understanding the content and structure of documents such as forms, emails, and other such documents. We show that the only existing dataset for this task (Lewis et al., 2006) has several limitations and we introduce two newly curated multilingual datasets WIKI-DOC and MULTI… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 (Findings)

  2. arXiv:2305.17127  [pdf, other

    cs.CL

    Characterizing and Measuring Linguistic Dataset Drift

    Authors: Tyler A. Chang, Kishaloy Halder, Neha Anna John, Yogarshi Vyas, Yassine Benajiba, Miguel Ballesteros, Dan Roth

    Abstract: NLP models often degrade in performance when real world data distributions differ markedly from training data. However, existing dataset drift metrics in NLP have generally not considered specific dimensions of linguistic drift that affect model performance, and they have not been validated in their ability to predict model performance at the individual example level, where such metrics are often… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023

  3. arXiv:2305.13191  [pdf, other

    cs.CL cs.AI cs.LG

    Taxonomy Expansion for Named Entity Recognition

    Authors: Karthikeyan K, Yogarshi Vyas, Jie Ma, Giovanni Paolini, Neha Anna John, Shuai Wang, Yassine Benajiba, Vittorio Castelli, Dan Roth, Miguel Ballesteros

    Abstract: Training a Named Entity Recognition (NER) model often involves fixing a taxonomy of entity types. However, requirements evolve and we might need the NER model to recognize additional entity types. A simple approach is to re-annotate entire dataset with both existing and additional entity types and then train the model on the re-annotated dataset. However, this is an extremely laborious task. To re… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  4. arXiv:2305.11242  [pdf, other

    cs.CL

    Comparing Biases and the Impact of Multilingual Training across Multiple Languages

    Authors: Sharon Levy, Neha Anna John, Ling Liu, Yogarshi Vyas, Jie Ma, Yoshinari Fujinuma, Miguel Ballesteros, Vittorio Castelli, Dan Roth

    Abstract: Studies in bias and fairness in natural language processing have primarily examined social biases within a single language and/or across few attributes (e.g. gender, race). However, biases can manifest differently across various languages for individual attributes. As a result, it is critical to examine biases within each language and attribute. Of equal importance is to study how these biases com… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

  5. arXiv:2303.11660  [pdf, other

    cs.CL

    Simple Yet Effective Synthetic Dataset Construction for Unsupervised Opinion Summarization

    Authors: Ming Shen, Jie Ma, Shuai Wang, Yogarshi Vyas, Kalpit Dixit, Miguel Ballesteros, Yassine Benajiba

    Abstract: Opinion summarization provides an important solution for summarizing opinions expressed among a large number of reviews. However, generating aspect-specific and general summaries is challenging due to the lack of annotated data. In this work, we propose two simple yet effective unsupervised approaches to generate both aspect-specific and general opinion summaries by training on synthetic datasets… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: EACL 2023 Findings

  6. arXiv:2302.12297  [pdf, other

    cs.CL

    Dynamic Benchmarking of Masked Language Models on Temporal Concept Drift with Multiple Views

    Authors: Katerina Margatina, Shuai Wang, Yogarshi Vyas, Neha Anna John, Yassine Benajiba, Miguel Ballesteros

    Abstract: Temporal concept drift refers to the problem of data changing over time. In NLP, that would entail that language (e.g. new expressions, meaning shifts) and factual knowledge (e.g. new concepts, updated facts) evolve over time. Focusing on the latter, we benchmark $11$ pretrained masked language models (MLMs) on a series of tests designed to evaluate the effect of temporal concept drift, as it is c… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: To appear at EACL 2023. Our code will be available at https://github.com/amazon-science/temporal-robustness

  7. arXiv:2210.05613  [pdf, other

    cs.CL cs.AI

    Contrastive Training Improves Zero-Shot Classification of Semi-structured Documents

    Authors: Muhammad Khalifa, Yogarshi Vyas, Shuai Wang, Graham Horwood, Sunil Mallya, Miguel Ballesteros

    Abstract: We investigate semi-structured document classification in a zero-shot setting. Classification of semi-structured documents is more challenging than that of standard unstructured documents, as positional, layout, and style information play a vital role in interpreting such documents. The standard classification setting where categories are fixed during both training and testing falls short in dynam… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

  8. arXiv:2203.11258  [pdf, other

    cs.CL

    Efficient Classification of Long Documents Using Transformers

    Authors: Hyunji Hayley Park, Yogarshi Vyas, Kashif Shah

    Abstract: Several methods have been proposed for classifying long textual documents using Transformers. However, there is a lack of consensus on a benchmark to enable a fair comparison among different approaches. In this paper, we provide a comprehensive evaluation of the relative efficacy measured against various baselines and diverse datasets -- both in terms of accuracy as well as time and space overhead… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: Accepted to ACL 2022; 8 pages

  9. Human-State-Aware Controller for a Tethered Aerial Robot Guiding a Human by Physical Interaction

    Authors: Mike Allenspach, Yash Vyas, Matthias Rubio, Roland Siegwart, Marco Tognon

    Abstract: With the rapid development of Aerial Physical Interaction, the possibility to have aerial robots physically interacting with humans is attracting a growing interest. In one of our previous works, we considered one of the first systems in which a human is physically connected to an aerial vehicle by a cable. There, we developed a compliant controller that allows the robot to pull the human toward a… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

  10. arXiv:2108.12358  [pdf, other

    cs.RO

    Modelling and Estimation of Human Walking Gait for Physical Human-Robot Interaction

    Authors: Yash Vyas, Mike Allenspach, Christian Lanegger, Roland Siegwart, Marco Tognon

    Abstract: An approach to model and estimate human walking kinematics in real-time for Physical Human-Robot Interaction is presented. The human gait velocity along the forward and vertical direction of motion is modelled according to the Yoyo-model. We designed an Extended Kalman Filter (EKF) algorithm to estimate the frequency, bias and trigonometric state of a biased sinusoidal signal, from which the kinem… ▽ More

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: To be published in The 1st AIRPHARO Workshop on Aerial Robotic Systems Physically Interacting with the Environment, 4-5/10/2021

  11. arXiv:2106.14574  [pdf, other

    cs.CL

    Quantifying Social Biases in NLP: A Generalization and Empirical Comparison of Extrinsic Fairness Metrics

    Authors: Paula Czarnowska, Yogarshi Vyas, Kashif Shah

    Abstract: Measuring bias is key for better understanding and addressing unfairness in NLP/ML models. This is often done via fairness metrics which quantify the differences in a model's behaviour across a range of demographic groups. In this work, we shed more light on the differences and similarities between the fairness metrics used in NLP. First, we unify a broad range of existing metrics under three gene… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

    Comments: Accepted for publication in Transaction of the Association for Computational Linguistics (TACL), 2021. The arXiv version is a pre-MIT Press publication version

  12. arXiv:2010.11333  [pdf, other

    cs.CL

    Linking Entities to Unseen Knowledge Bases with Arbitrary Schemas

    Authors: Yogarshi Vyas, Miguel Ballesteros

    Abstract: In entity linking, mentions of named entities in raw text are disambiguated against a knowledge base (KB). This work focuses on linking to unseen KBs that do not have training data and whose schema is unknown during training. Our approach relies on methods to flexibly convert entities from arbitrary KBs with several attribute-value pairs into flat strings, which we use in conjunction with state-of… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

  13. arXiv:2004.04295  [pdf, ps, other

    cs.CL

    Severing the Edge Between Before and After: Neural Architectures for Temporal Ordering of Events

    Authors: Miguel Ballesteros, Rishita Anubhai, Shuai Wang, Nima Pourdamghani, Yogarshi Vyas, Jie Ma, Parminder Bhatia, Kathleen McKeown, Yaser Al-Onaizan

    Abstract: In this paper, we propose a neural architecture and a set of training methods for ordering events by predicting temporal relations. Our proposed models receive a pair of events within a span of text as input and they identify temporal relations (Before, After, Equal, Vague) between them. Given that a key challenge with this task is the scarcity of annotated data, our models rely on either pretrain… ▽ More

    Submitted 8 April, 2020; originally announced April 2020.

  14. arXiv:1803.11291  [pdf, other

    cs.CL

    Robust Cross-lingual Hypernymy Detection using Dependency Context

    Authors: Shyam Upadhyay, Yogarshi Vyas, Marine Carpuat, Dan Roth

    Abstract: Cross-lingual Hypernymy Detection involves determining if a word in one language ("fruit") is a hypernym of a word in another language ("pomme" i.e. apple in French). The ability to detect hypernymy cross-lingually can aid in solving cross-lingual versions of tasks such as textual entailment and event coreference. We propose BISPARSE-DEP, a family of unsupervised approaches for cross-lingual hyper… ▽ More

    Submitted 29 March, 2018; originally announced March 2018.

    Comments: NAACL 2018. SU and YV contributed equally

  15. arXiv:1803.11112  [pdf, other

    cs.CL

    Identifying Semantic Divergences in Parallel Text without Annotations

    Authors: Yogarshi Vyas, Xing Niu, Marine Carpuat

    Abstract: Recognizing that even correct translations are not always semantically equivalent, we automatically detect meaning divergences in parallel sentence pairs with a deep neural model of bilingual semantic similarity which can be trained for any parallel corpus without any manual annotation. We show that our semantic model detects divergences more accurately than models based on surface features derive… ▽ More

    Submitted 29 March, 2018; originally announced March 2018.

    Comments: Accepted as a full paper to NAACL 2018

  16. arXiv:1611.05118  [pdf, other

    cs.CV cs.CL

    The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives

    Authors: Mohit Iyyer, Varun Manjunatha, Anupam Guha, Yogarshi Vyas, Jordan Boyd-Graber, Hal Daumé III, Larry Davis

    Abstract: Visual narrative is often a combination of explicit information and judicious omissions, relying on the viewer to supply missing details. In comics, most movements in time and space are hidden in the "gutters" between panels. To follow the story, readers logically connect panels together by inferring unseen actions through a process called "closure". While computers can now describe what is explic… ▽ More

    Submitted 7 May, 2017; v1 submitted 15 November, 2016; originally announced November 2016.

  17. arXiv:1510.07586  [pdf, ps, other

    cs.CL

    Parser for Abstract Meaning Representation using Learning to Search

    Authors: Sudha Rao, Yogarshi Vyas, Hal Daume III, Philip Resnik

    Abstract: We develop a novel technique to parse English sentences into Abstract Meaning Representation (AMR) using SEARN, a Learning to Search approach, by modeling the concept and the relation learning in a unified framework. We evaluate our parser on multiple datasets from varied domains and show an absolute improvement of 2% to 6% over the state-of-the-art. Additionally we show that using the most freque… ▽ More

    Submitted 26 October, 2015; originally announced October 2015.