Showing 1–50 of 79 results for author: Collier, N

Searching in archive cs.
  1. arXiv:2408.03094  [pdf, other]

    cs.CL

    500xCompressor: Generalized Prompt Compression for Large Language Models

    Authors: Zongqian Li, Yixuan Su, Nigel Collier

    Abstract: Prompt compression is crucial for enhancing inference speed, reducing costs, and improving user experience. However, current methods face challenges such as low compression ratios and potential data leakage during evaluation. To address these issues, we propose 500xCompressor, a method that compresses extensive natural language contexts into a minimum of one single special token. The 500xCompresso…

    Submitted 6 August, 2024; originally announced August 2024.

  2. arXiv:2406.19524  [pdf, other]

    stat.ML cs.LG stat.AP

    Bayesian calibration of stochastic agent based model via random forest

    Authors: Connor Robertson, Cosmin Safta, Nicholson Collier, Jonathan Ozik, Jaideep Ray

    Abstract: Agent-based models (ABM) provide an excellent framework for modeling outbreaks and interventions in epidemiology by explicitly accounting for diverse individual interactions and environments. However, these models are usually stochastic and highly parametrized, requiring precise calibration for predictive performance. When considering realistic numbers of agents and properly accounting for stochas…

    Submitted 27 June, 2024; originally announced June 2024.

  3. arXiv:2406.17095  [pdf, other]

    cs.CL

    Attention Instruction: Amplifying Attention in the Middle via Prompting

    Authors: Meiru Zhang, Zaiqiao Meng, Nigel Collier

    Abstract: The context window of large language models has been extended to 128k tokens or more. However, language models still suffer from position bias and have difficulty in accessing and using the middle part of the context due to the lack of attention. We examine the relative position awareness of LLMs and the feasibility of mitigating disproportional attention through prompting. We augment the original…

    Submitted 24 June, 2024; originally announced June 2024.

  4. arXiv:2406.11657  [pdf, other]

    cs.CL cs.CY

    Can LLM be a Personalized Judge?

    Authors: Yijiang River Dong, Tiancheng Hu, Nigel Collier

    Abstract: Ensuring that large language models (LLMs) reflect diverse user values and preferences is crucial as their user bases expand globally. It is therefore encouraging to see the growing interest in LLM personalization within the research community. However, current works often rely on the LLM-as-a-Judge approach for evaluation without thoroughly examining its validity. In this paper, we investigate th…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Our code is available at https://github.com/dong-river/Personalized-Judge

  5. arXiv:2406.11370  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG

    Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments

    Authors: Han Zhou, Xingchen Wan, Yinhong Liu, Nigel Collier, Ivan Vulić, Anna Korhonen

    Abstract: Large language models (LLMs) have shown promising abilities as cost-effective and reference-free evaluators for assessing language generation quality. In particular, pairwise LLM evaluators, which compare two generated texts and determine the preferred one, have been employed in a wide range of applications. However, LLMs exhibit preference biases and worrying sensitivity to prompt designs. In thi…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 5 pages, 3 figures, 1 table (12 pages, 4 figures, 6 tables including references and appendices)

  6. arXiv:2406.02537  [pdf, other]

    cs.CL cs.CV cs.LG

    TopViewRS: Vision-Language Models as Top-View Spatial Reasoners

    Authors: Chengzu Li, Caiqi Zhang, Han Zhou, Nigel Collier, Anna Korhonen, Ivan Vulić

    Abstract: Top-view perspective denotes a typical way in which humans read and reason over different types of maps, and it is vital for localization and navigation of humans as well as of `non-human' agents, such as the ones backed by large Vision-Language Models (VLMs). Nonetheless, spatial reasoning capabilities of modern VLMs remain unattested and underexplored. In this work, we thus study their capabilit…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 9 pages, 3 figures, 3 tables (21 pages, 4 figures, 15 tables including references and appendices)

  7. arXiv:2403.20279  [pdf, other]

    cs.CL

    LUQ: Long-text Uncertainty Quantification for LLMs

    Authors: Caiqi Zhang, Fangyu Liu, Marco Basaldella, Nigel Collier

    Abstract: Large Language Models (LLMs) have demonstrated remarkable capability in a variety of NLP tasks. However, LLMs are also prone to generate nonfactual content. Uncertainty Quantification (UQ) is pivotal in enhancing our understanding of a model's confidence on its generation, thereby aiding in the mitigation of nonfactual outputs. Existing research on UQ predominantly targets short text generation, t…

    Submitted 11 July, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

  8. arXiv:2403.16950  [pdf, other]

    cs.CL cs.AI cs.LG

    Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators

    Authors: Yinhong Liu, Han Zhou, Zhijiang Guo, Ehsan Shareghi, Ivan Vulić, Anna Korhonen, Nigel Collier

    Abstract: Large Language Models (LLMs) have demonstrated promising capabilities as automatic evaluators in assessing the quality of generated natural language. However, LLMs still exhibit biases in evaluation and often struggle to generate coherent evaluations that align with human assessments. In this work, we first conduct a systematic study of the misalignment between LLM evaluators and human judgement,…

    Submitted 10 August, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: This paper has been accepted by COLM 2024

  9. arXiv:2402.10811  [pdf, other]

    cs.CL cs.CY

    Quantifying the Persona Effect in LLM Simulations

    Authors: Tiancheng Hu, Nigel Collier

    Abstract: Large language models (LLMs) have shown remarkable promise in simulating human language and behavior. This study investigates how integrating persona variables-demographic, social, and behavioral factors-impacts LLMs' ability to simulate diverse perspectives. We find that persona variables account for <10% variance in annotations in existing subjective NLP datasets. Nonetheless, incorporating pers…

    Submitted 17 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: ACL 2024 Main

  10. arXiv:2402.10175  [pdf, other]

    cs.CL

    Unlocking Structure Measuring: Introducing PDD, an Automatic Metric for Positional Discourse Coherence

    Authors: Yinhong Liu, Yixuan Su, Ehsan Shareghi, Nigel Collier

    Abstract: Recent large language models (LLMs) have shown remarkable performance in aligning generated text with user intentions across various tasks. When it comes to long-form text generation, there has been a growing interest in generation from a discourse coherence perspective. However, existing lexical or semantic metrics such as BLEU, ROUGE, BertScore cannot effectively capture the discourse coherence.…

    Submitted 2 April, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Accepted by NAACL 2024 main conference

  11. arXiv:2402.10137  [pdf, other]

    cs.CL

    TOAD: Task-Oriented Automatic Dialogs with Diverse Response Styles

    Authors: Yinhong Liu, Yimai Fang, David Vandyke, Nigel Collier

    Abstract: In light of recent advances in large language models (LLMs), the expectations for the next generation of virtual assistants include enhanced naturalness and adaptability across diverse usage scenarios. However, the creation of high-quality annotated data for Task-Oriented Dialog (TOD) is recognized to be slow and costly. To address these challenges, we introduce Task-Oriented Automatic Dialogs (TO…

    Submitted 6 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Accepted by ACL 2024

  12. arXiv:2312.12299  [pdf, other]

    cs.CL cs.AI

    Instruct-SCTG: Guiding Sequential Controlled Text Generation through Instructions

    Authors: Yinhong Liu, Yixuan Su, Ehsan Shareghi, Nigel Collier

    Abstract: Instruction-tuned large language models have shown remarkable performance in aligning generated text with user intentions across various tasks. However, maintaining human-like discourse structure in the generated text remains a challenging research question. In this paper, we propose Instruct-SCTG, a flexible and effective sequential framework that harnesses instruction-tuned language models to ge…

    Submitted 19 December, 2023; originally announced December 2023.

  13. arXiv:2310.15819  [pdf, other]

    cs.CL cs.CY

    Generative Language Models Exhibit Social Identity Biases

    Authors: Tiancheng Hu, Yara Kyrychenko, Steve Rathje, Nigel Collier, Sander van der Linden, Jon Roozenbeek

    Abstract: The surge in popularity of large language models has given rise to concerns about biases that these models could learn from humans. We investigate whether ingroup solidarity and outgroup hostility, fundamental social identity biases known from social psychology, are present in 56 large language models. We find that almost all foundational language models and some instruction fine-tuned models exhi…

    Submitted 17 June, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: supplementary material, data, and code see https://osf.io/9ht32/?view_only=f0ab4b23325f4c31ad3e12a7353b55f5

  14. arXiv:2310.13394  [pdf, other]

    cs.CL cs.AI cs.CY

    POSQA: Probe the World Models of LLMs with Size Comparisons

    Authors: Chang Shu, Jiuzhou Han, Fangyu Liu, Ehsan Shareghi, Nigel Collier

    Abstract: Embodied language comprehension emphasizes that language understanding is not solely a matter of mental processing in the brain but also involves interactions with the physical and social environment. With the explosive growth of Large Language Models (LLMs) and their already ubiquitous presence in our daily lives, it is becoming increasingly necessary to verify their real-world understanding. Ins…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted by EMNLP 2023 Findings

  15. arXiv:2310.10226  [pdf, other]

    cs.CL

    Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective

    Authors: Huayang Li, Tian Lan, Zihao Fu, Deng Cai, Lemao Liu, Nigel Collier, Taro Watanabe, Yixuan Su

    Abstract: There are a number of diverging hypotheses about the neural text degeneration problem, i.e., generating repetitive and dull loops, which makes this problem both interesting and confusing. In this work, we aim to advance our understanding by presenting a straightforward and fundamental explanation from the data perspective. Our preliminary investigation reveals a strong correlation between the dege…

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  16. arXiv:2310.05915  [pdf, other]

    cs.CL cs.AI cs.LG

    FireAct: Toward Language Agent Fine-tuning

    Authors: Baian Chen, Chang Shu, Ehsan Shareghi, Nigel Collier, Karthik Narasimhan, Shunyu Yao

    Abstract: Recent efforts have augmented language models (LMs) with external tools or environments, leading to the development of language agents that can reason and act. However, most of these agents rely on few-shot prompting techniques with off-the-shelf LMs. In this paper, we investigate and argue for the overlooked direction of fine-tuning LMs to obtain language agents. Using a setup of question answeri…

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: Code, data, and models are available at https://fireact-agent.github.io

  17. arXiv:2308.16463  [pdf, other]

    cs.CV cs.CL

    Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models

    Authors: Yupan Huang, Zaiqiao Meng, Fangyu Liu, Yixuan Su, Nigel Collier, Yutong Lu

    Abstract: Large language models exhibit enhanced zero-shot performance on various tasks when fine-tuned with instruction-following data. Multimodal instruction-following models extend these capabilities by integrating both text and images. However, existing models such as MiniGPT-4 face challenges in maintaining dialogue coherence in scenarios involving multiple images. A primary reason is the lack of a spe…

    Submitted 1 October, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: Reduced main content to 9 pages; typos corrected

  18. PSI/J: A Portable Interface for Submitting, Monitoring, and Managing Jobs

    Authors: Mihael Hategan-Marandiuc, Andre Merzky, Nicholson Collier, Ketan Maheshwari, Jonathan Ozik, Matteo Turilli, Andreas Wilke, Justin M. Wozniak, Kyle Chard, Ian Foster, Rafael Ferreira da Silva, Shantenu Jha, Daniel Laney

    Abstract: It is generally desirable for high-performance computing (HPC) applications to be portable between HPC systems, for example to make use of more performant hardware, make effective use of allocations, and to co-locate compute jobs with large datasets. Unfortunately, moving scientific applications between HPC systems is challenging for various reasons, most notably that HPC systems have different HP…

    Submitted 20 September, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

  19. arXiv:2305.14480  [pdf, other]

    cs.CL cs.AI

    BAND: Biomedical Alert News Dataset

    Authors: Zihao Fu, Meiru Zhang, Zaiqiao Meng, Yannan Shen, David Buckeridge, Nigel Collier

    Abstract: Infectious disease outbreaks continue to pose a significant threat to human health and well-being. To improve disease surveillance and understanding of disease spread, several surveillance systems have been developed to monitor daily news alerts and social media. However, existing systems lack thorough epidemiological analysis in relation to corresponding alerts or news, largely due to the scarcit…

    Submitted 15 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  20. arXiv:2305.13066  [pdf, other]

    cs.CL cs.AI

    Biomedical Named Entity Recognition via Dictionary-based Synonym Generalization

    Authors: Zihao Fu, Yixuan Su, Zaiqiao Meng, Nigel Collier

    Abstract: Biomedical named entity recognition is one of the core tasks in biomedical natural language processing (BioNLP). To tackle this task, numerous supervised/distantly supervised approaches have been proposed. Despite their remarkable success, these approaches inescapably demand laborious human effort. To alleviate the need of human effort, dictionary-based approaches have been proposed to extract nam…

    Submitted 13 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

  21. arXiv:2305.12392  [pdf, other]

    cs.CL

    PiVe: Prompting with Iterative Verification Improving Graph-based Generative Capability of LLMs

    Authors: Jiuzhou Han, Nigel Collier, Wray Buntine, Ehsan Shareghi

    Abstract: Large language models (LLMs) have shown great abilities of solving various natural language tasks in different domains. Due to the training objective of LLMs and their pre-training data, LLMs are not very well equipped for tasks involving structured data generation. We propose a framework, Prompting with Iterative Verification (PiVe), to improve graph-based generative capability of LLMs. We show h…

    Submitted 30 May, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: Our code and data are at https://github.com/Jiuzhouh/PiVe (accepted to ACL 2024 Findings)

  22. arXiv:2304.14244  [pdf, other]

    cs.DC

    Developing Distributed High-performance Computing Capabilities of an Open Science Platform for Robust Epidemic Analysis

    Authors: Nicholson Collier, Justin M. Wozniak, Abby Stevens, Yadu Babuji, Mickaël Binois, Arindam Fadikar, Alexandra Würth, Kyle Chard, Jonathan Ozik

    Abstract: COVID-19 had an unprecedented impact on scientific collaboration. The pandemic and its broad response from the scientific community has forged new relationships among domain experts, mathematical modelers, and scientific computing specialists. Computationally, however, it also revealed critical gaps in the ability of researchers to exploit advanced computing systems. These challenging areas includ…

    Submitted 10 May, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

  23. arXiv:2304.04052  [pdf, other]

    cs.CL cs.AI cs.LG

    Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder

    Authors: Zihao Fu, Wai Lam, Qian Yu, Anthony Man-Cho So, Shengding Hu, Zhiyuan Liu, Nigel Collier

    Abstract: The sequence-to-sequence (seq2seq) task aims at generating the target sequence based on the given input source sequence. Traditionally, most of the seq2seq task is resolved by the Encoder-Decoder framework which requires an encoder to encode the source sequence and a decoder to generate the target text. Recently, a bunch of new approaches have emerged that apply decoder-only language models direct…

    Submitted 8 April, 2023; originally announced April 2023.

  24. Workflows Community Summit 2022: A Roadmap Revolution

    Authors: Rafael Ferreira da Silva, Rosa M. Badia, Venkat Bala, Debbie Bard, Peer-Timo Bremer, Ian Buckley, Silvina Caino-Lores, Kyle Chard, Carole Goble, Shantenu Jha, Daniel S. Katz, Daniel Laney, Manish Parashar, Frederic Suter, Nick Tyler, Thomas Uram, Ilkay Altintas, Stefan Andersson, William Arndt, Juan Aznar, Jonathan Bader, Bartosz Balis, Chris Blanton, Kelly Rosa Braghetto, Aharon Brodutch, et al. (80 additional authors not shown)

    Abstract: Scientific workflows have become integral tools in broad scientific computing use cases. Science discovery is increasingly dependent on workflows to orchestrate large and complex scientific experiments that range from execution of a cloud-based data preprocessing pipeline to multi-facility instrument-to-edge-to-HPC computational workflows. Given the changing landscape of scientific computing and t…

    Submitted 31 March, 2023; originally announced April 2023.

    Report number: ORNL/TM-2023/2885

  25. arXiv:2303.14452  [pdf, other]

    cs.CL

    COFFEE: A Contrastive Oracle-Free Framework for Event Extraction

    Authors: Meiru Zhang, Yixuan Su, Zaiqiao Meng, Zihao Fu, Nigel Collier

    Abstract: Event extraction is a complex information extraction task that involves extracting events from unstructured text. Prior classification-based methods require comprehensive entity annotations for joint training, while newer generation-based methods rely on heuristic templates containing oracle information such as event type, which is often unavailable in real-world scenarios. In this study, we consi…

    Submitted 3 September, 2024; v1 submitted 25 March, 2023; originally announced March 2023.

    Comments: Accepted to MATCHING Workshop at ACL 2023

  26. arXiv:2301.09820  [pdf, other]

    cs.LG cs.AI cs.CL

    A Stability Analysis of Fine-Tuning a Pre-Trained Model

    Authors: Zihao Fu, Anthony Man-Cho So, Nigel Collier

    Abstract: Fine-tuning a pre-trained model (such as BERT, ALBERT, RoBERTa, T5, GPT, etc.) has proven to be one of the most promising paradigms in recent NLP research. However, numerous recent works indicate that fine-tuning suffers from the instability problem, i.e., tuning the same model under the same setting results in significantly different performance. Many recent works have proposed different methods…

    Submitted 7 December, 2023; v1 submitted 24 January, 2023; originally announced January 2023.

  27. arXiv:2212.10505  [pdf, other]

    cs.CL cs.AI cs.CV

    DePlot: One-shot visual language reasoning by plot-to-table translation

    Authors: Fangyu Liu, Julian Martin Eisenschlos, Francesco Piccinno, Syrine Krichene, Chenxi Pang, Kenton Lee, Mandar Joshi, Wenhu Chen, Nigel Collier, Yasemin Altun

    Abstract: Visual language such as charts and plots is ubiquitous in the human world. Comprehending plots and charts requires strong reasoning skills. Prior state-of-the-art (SOTA) models require at least tens of thousands of training examples and their reasoning capabilities are still much limited, especially on complex human-written queries. This paper presents the first one-shot solution to visual languag…

    Submitted 23 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: ACL 2023 (Findings)

  28. arXiv:2212.09662  [pdf, other]

    cs.CL cs.AI cs.CV

    MatCha: Enhancing Visual Language Pretraining with Math Reasoning and Chart Derendering

    Authors: Fangyu Liu, Francesco Piccinno, Syrine Krichene, Chenxi Pang, Kenton Lee, Mandar Joshi, Yasemin Altun, Nigel Collier, Julian Martin Eisenschlos

    Abstract: Visual language data such as plots, charts, and infographics are ubiquitous in the human world. However, state-of-the-art vision-language models do not perform well on these data. We propose MatCha (Math reasoning and Chart derendering pretraining) to enhance visual language models' capabilities in jointly modeling charts/plots and language data. Specifically, we propose several pretraining tasks…

    Submitted 23 May, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: ACL 2023

  29. arXiv:2212.05093  [pdf, other]

    cs.CL

    Plug-and-Play Recipe Generation with Content Planning

    Authors: Yinhong Liu, Yixuan Su, Ehsan Shareghi, Nigel Collier

    Abstract: Recent pre-trained language models have shown promising capabilities in generating fluent and realistic natural language text. However, generating multi-sentence text with global content planning has been a long-existing research question. Current approaches for controlled text generation can hardly address this issue, as they usually condition on single known control attributes. In this study, we…

    Submitted 9 December, 2022; originally announced December 2022.

    Comments: Paper accepted by EMNLP 2022 GEM workshop

  30. arXiv:2211.15583  [pdf, other]

    cs.CL cs.AI cs.LG

    On the Effectiveness of Parameter-Efficient Fine-Tuning

    Authors: Zihao Fu, Haoran Yang, Anthony Man-Cho So, Wai Lam, Lidong Bing, Nigel Collier

    Abstract: Fine-tuning pre-trained models has been ubiquitously proven to be effective in a wide range of NLP tasks. However, fine-tuning the whole model is parameter inefficient as it always yields an entirely new model for each task. Currently, many research works propose to only fine-tune a small portion of the parameters while keeping most of the parameters shared across different tasks. These methods ac…

    Submitted 28 November, 2022; originally announced November 2022.

  31. arXiv:2210.14140  [pdf, other]

    cs.CL

    Contrastive Search Is What You Need For Neural Text Generation

    Authors: Yixuan Su, Nigel Collier

    Abstract: Generating text with autoregressive language models (LMs) is of great importance to many natural language processing (NLP) applications. Previous solutions for this task often produce text that contains degenerative expressions or lacks semantic consistency. Recently, Su et al. introduced a new decoding method, contrastive search, based on the isotropic representation space of the language model a…

    Submitted 14 February, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: TMLR'23

  32. arXiv:2209.15108  [pdf, other]

    cs.CL cs.AI cs.LG

    How to tackle an emerging topic? Combining strong and weak labels for Covid news NER

    Authors: Aleksander Ficek, Fangyu Liu, Nigel Collier

    Abstract: Being able to train Named Entity Recognition (NER) models for emerging topics is crucial for many real-world applications especially in the medical domain where new topics are continuously evolving out of the scope of existing models and datasets. For a realistic evaluation setup, we introduce a novel COVID-19 news NER dataset (COVIDNEWS-NER) and release 3000 entries of hand annotated strongly lab…

    Submitted 8 October, 2022; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: AACL-IJCNLP 2022

  33. arXiv:2209.12786  [pdf, other]

    cs.CL cs.AI

    Do ever larger octopi still amplify reporting biases? Evidence from judgments of typical colour

    Authors: Fangyu Liu, Julian Martin Eisenschlos, Jeremy R. Cole, Nigel Collier

    Abstract: Language models (LMs) trained on raw texts have no direct access to the physical world. Gordon and Van Durme (2013) point out that LMs can thus suffer from reporting bias: texts rarely report on common facts, instead focusing on the unusual aspects of a situation. If LMs are only trained on text corpora and naively memorise local co-occurrence statistics, they thus naturally would learn a biased v…

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: AACL 2022

  34. arXiv:2208.11981  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    On Reality and the Limits of Language Data: Aligning LLMs with Human Norms

    Authors: Nigel H. Collier, Fangyu Liu, Ehsan Shareghi

    Abstract: Recent advancements in Large Language Models (LLMs) harness linguistic associations in vast natural language data for practical applications. However, their ability to understand the physical world using only language data remains a question. After reviewing existing protocols, we explore this question using a novel and tightly controlled reasoning test (ART) and compare human norms against versio…

    Submitted 9 May, 2023; v1 submitted 25 August, 2022; originally announced August 2022.

    Comments: 9 pages; data available, see https://sites.google.com/site/nhcollier/projects/art

  35. arXiv:2205.02655  [pdf, other]

    cs.CV cs.CL

    Language Models Can See: Plugging Visual Controls in Text Generation

    Authors: Yixuan Su, Tian Lan, Yahui Liu, Fangyu Liu, Dani Yogatama, Yan Wang, Lingpeng Kong, Nigel Collier

    Abstract: Generative language models (LMs) such as GPT-2/3 can be prompted to generate text with remarkable quality. While they are designed for text-prompted generation, it remains an open question how the generation process could be guided by modalities beyond text such as images. In this work, we propose a training-free framework, called MAGIC (iMAge-Guided text generatIon with CLIP), for plugging in vis…

    Submitted 30 May, 2022; v1 submitted 5 May, 2022; originally announced May 2022.

    Comments: 21 pages, 5 figures, 5 tables; (v2 adds some experimental details)

  36. arXiv:2205.00363  [pdf, other]

    cs.CL cs.AI cs.CV

    Visual Spatial Reasoning

    Authors: Fangyu Liu, Guy Emerson, Nigel Collier

    Abstract: Spatial relations are a basic part of human cognition. However, they are expressed in natural language in a variety of ways, and previous work has suggested that current vision-and-language models (VLMs) struggle to capture relational information. In this paper, we present Visual Spatial Reasoning (VSR), a dataset containing more than 10k natural text-image pairs with 66 types of spatial relations…

    Submitted 22 March, 2023; v1 submitted 30 April, 2022; originally announced May 2022.

    Comments: TACL camera-ready version; code and data available at https://github.com/cambridgeltl/visual-spatial-reasoning

  37. arXiv:2205.00267  [pdf, other]

    cs.CL

    Probing Cross-Lingual Lexical Knowledge from Multilingual Sentence Encoders

    Authors: Ivan Vulić, Goran Glavaš, Fangyu Liu, Nigel Collier, Edoardo Maria Ponti, Anna Korhonen

    Abstract: Pretrained multilingual language models (LMs) can be successfully transformed into multilingual sentence encoders (SEs; e.g., LaBSE, xMPNet) via additional fine-tuning or model distillation with parallel data. However, it remains unclear how to best leverage them to represent sub-sentence lexical items (i.e., words and phrases) in cross-lingual lexical tasks. In this work, we probe SEs for the amo…

    Submitted 13 October, 2022; v1 submitted 30 April, 2022; originally announced May 2022.

  38. arXiv:2203.08307  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Improving Word Translation via Two-Stage Contrastive Learning

    Authors: Yaoyiran Li, Fangyu Liu, Nigel Collier, Anna Korhonen, Ivan Vulić

    Abstract: Word translation or bilingual lexicon induction (BLI) is a key cross-lingual task, aiming to bridge the lexical gap between different languages. In this work, we propose a robust and effective two-stage contrastive learning framework for the BLI task. At Stage C1, we propose to refine standard cross-lingual linear maps between static word embeddings (WEs) via a contrastive learning objective; we a…

    Submitted 29 June, 2024; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: ACL 2022 Main

    Journal ref: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022, pages 4353-4374

  39. arXiv:2202.06417  [pdf, other]

    cs.CL

    A Contrastive Framework for Neural Text Generation

    Authors: Yixuan Su, Tian Lan, Yan Wang, Dani Yogatama, Lingpeng Kong, Nigel Collier

    Abstract: Text generation is of great importance to many natural language processing applications. However, maximization-based decoding methods (e.g. beam search) of neural language models often lead to degenerate solutions -- the generated text is unnatural and contains undesirable repetitions. Existing approaches introduce stochasticity via sampling or modify training objectives to decrease probabilities…

    Submitted 26 September, 2022; v1 submitted 13 February, 2022; originally announced February 2022.

    Comments: NeurIPS 2022

  40. arXiv:2111.04198  [pdf, other]

    cs.CL

    TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning

    Authors: Yixuan Su, Fangyu Liu, Zaiqiao Meng, Tian Lan, Lei Shu, Ehsan Shareghi, Nigel Collier

    Abstract: Masked language models (MLMs) such as BERT and RoBERTa have revolutionized the field of Natural Language Understanding in the past few years. However, existing pre-trained MLMs often output an anisotropic distribution of token representations that occupies a narrow subset of the entire representation space. Such token representations are not ideal, especially for tasks that demand discriminative s…

    Submitted 28 April, 2022; v1 submitted 7 November, 2021; originally announced November 2021.

    Comments: Camera-ready for NAACL 2022

  41. arXiv:2110.08443  [pdf, other]

    cs.CL

    Prix-LM: Pretraining for Multilingual Knowledge Base Construction

    Authors: Wenxuan Zhou, Fangyu Liu, Ivan Vulić, Nigel Collier, Muhao Chen

    Abstract: Knowledge bases (KBs) contain plenty of structured world and commonsense knowledge. As such, they often complement distributional text-based information and facilitate various downstream tasks. Since their manual construction is resource- and time-intensive, recent efforts have tried leveraging large pretrained language models (PLMs) to generate additional monolingual knowledge facts for KBs. Howe…

    Submitted 9 March, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: ACL 2022

  42. arXiv:2110.08173  [pdf, other]

    cs.CL

    Rewire-then-Probe: A Contrastive Recipe for Probing Biomedical Knowledge of Pre-trained Language Models

    Authors: Zaiqiao Meng, Fangyu Liu, Ehsan Shareghi, Yixuan Su, Charlotte Collins, Nigel Collier

    Abstract: Knowledge probing is crucial for understanding the knowledge transfer mechanism behind the pre-trained language models (PLMs). Despite the growing progress of probing knowledge for PLMs in the general domain, specialised areas such as biomedical domain are vastly under-explored. To catalyse the research in this direction, we release a well-curated biomedical knowledge probing benchmark, MedLAMA, w…

    Submitted 22 May, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: ACL 2022; code and data are released at https://github.com/cambridgeltl/medlama

  43. arXiv:2109.13238  [pdf]

    cs.CL cs.AI cs.CV

    Visually Grounded Reasoning across Languages and Cultures

    Authors: Fangyu Liu, Emanuele Bugliarello, Edoardo Maria Ponti, Siva Reddy, Nigel Collier, Desmond Elliott

    Abstract: The design of widespread vision-and-language datasets and pre-trained encoders directly adopts, or draws inspiration from, the concepts and images of ImageNet. While one can hardly overestimate how much this benchmark contributed to progress in computer vision, it is mostly derived from lexical databases and image queries in English, resulting in source material with a North American or Western Eu…

    Submitted 21 October, 2021; v1 submitted 28 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021; Fangyu and Emanuele contributed equally; MaRVL website: https://marvl-challenge.github.io

  44. arXiv:2109.09237  [pdf, other]

    cs.CL

    MirrorWiC: On Eliciting Word-in-Context Representations from Pretrained Language Models

    Authors: Qianchu Liu, Fangyu Liu, Nigel Collier, Anna Korhonen, Ivan Vulić

    Abstract: Recent work indicated that pretrained language models (PLMs) such as BERT and RoBERTa can be transformed into effective sentence and word encoders even via simple self-supervised techniques. Inspired by this line of work, in this paper we propose a fully unsupervised approach to improving word-in-context (WiC) representations in PLMs, achieved via a simple and efficient WiC-targeted fine-tuning pr…

    Submitted 19 September, 2021; originally announced September 2021.

    Comments: CoNLL 2021

  45. arXiv:2109.04810  [pdf, other]

    cs.CL

    Mixture-of-Partitions: Infusing Large Biomedical Knowledge Graphs into BERT

    Authors: Zaiqiao Meng, Fangyu Liu, Thomas Hikaru Clark, Ehsan Shareghi, Nigel Collier

    Abstract: Infusing factual knowledge into pre-trained models is fundamental for many knowledge-intensive tasks. In this paper, we proposed Mixture-of-Partitions (MoP), an infusion approach that can handle a very large knowledge graph (KG) by partitioning it into smaller sub-graphs and infusing their specific knowledge into various BERT models using lightweight adapters. To leverage the overall factual knowl…

    Submitted 10 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021 camera-ready version

  46. arXiv:2108.13740  [pdf, other]

    cs.CL

    Plan-then-Generate: Controlled Data-to-Text Generation via Planning

    Authors: Yixuan Su, David Vandyke, Sihui Wang, Yimai Fang, Nigel Collier

    Abstract: Recent developments in neural networks have led to the advance in data-to-text generation. However, the lack of ability of neural models to control the structure of generated output can be limiting in certain real-world applications. In this study, we propose a novel Plan-then-Generate (PlanGen) framework to improve the controllability of neural data-to-text models. Extensive experiments and analy…

    Submitted 31 August, 2021; originally announced August 2021.

    Comments: Accepted to Findings of EMNLP 2021

  47. arXiv:2108.12516  [pdf, other]

    cs.CL

    Few-Shot Table-to-Text Generation with Prototype Memory

    Authors: Yixuan Su, Zaiqiao Meng, Simon Baker, Nigel Collier

    Abstract: Neural table-to-text generation models have achieved remarkable progress on an array of tasks. However, due to the data-hungry nature of neural models, their performances strongly rely on large-scale training examples, limiting their applicability in real-world applications. To address this, we propose a new framework: Prototype-to-Generate (P2G), for table-to-text generation under the few-shot sc…

    Submitted 31 August, 2021; v1 submitted 27 August, 2021; originally announced August 2021.

    Comments: Accepted to Findings of EMNLP 2021

  48. arXiv:2105.14398  [pdf, other]

    cs.CL cs.AI cs.LG

    Learning Domain-Specialised Representations for Cross-Lingual Biomedical Entity Linking

    Authors: Fangyu Liu, Ivan Vulić, Anna Korhonen, Nigel Collier

    Abstract: Injecting external domain-specific knowledge (e.g., UMLS) into pretrained language models (LMs) advances their capability to handle specialised in-domain tasks such as biomedical entity linking (BEL). However, such abundant expert knowledge is available only for a handful of languages (e.g., English). In this work, by proposing a novel cross-lingual biomedical entity linking task (XL-BEL) and esta…

    Submitted 29 May, 2021; originally announced May 2021.

    Comments: ACL-IJCNLP 2021

  49. arXiv:2104.08027  [pdf, other]

    cs.CL cs.AI cs.LG

    Fast, Effective, and Self-Supervised: Transforming Masked Language Models into Universal Lexical and Sentence Encoders

    Authors: Fangyu Liu, Ivan Vulić, Anna Korhonen, Nigel Collier

    Abstract: Pretrained Masked Language Models (MLMs) have revolutionised NLP in recent years. However, previous work has indicated that off-the-shelf MLMs are not effective as universal lexical or sentence encoders without further task-specific fine-tuning on NLI, sentence similarity, or paraphrasing tasks using annotated task data. In this work, we demonstrate that it is possible to turn MLMs into effective…

    Submitted 9 September, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: EMNLP 2021 camera-ready version

  50. arXiv:2102.08220  [pdf, other]

    cs.CL

    Non-Autoregressive Text Generation with Pre-trained Language Models

    Authors: Yixuan Su, Deng Cai, Yan Wang, David Vandyke, Simon Baker, Piji Li, Nigel Collier

    Abstract: Non-autoregressive generation (NAG) has recently attracted great attention due to its fast inference speed. However, the generation quality of existing NAG models still lags behind their autoregressive counterparts. In this work, we show that BERT can be employed as the backbone of a NAG model to greatly improve performance. Additionally, we devise mechanisms to alleviate the two common problems o…

    Submitted 16 February, 2021; originally announced February 2021.

    Comments: Accepted to EACL 2021