Showing 1–11 of 11 results for author: Shailza

Searching in archive cs.

  1. Feature boosting with efficient attention for scene parsing

    Authors: Vivek Singh, Shailza Sharma, Fabio Cuzzolin

    Abstract: The complexity of scene parsing grows with the number of object and scene classes, which is higher in unrestricted open scenes. The biggest challenge is to model the spatial relation between scene elements while succeeding in identifying objects at smaller scales. This paper presents a novel feature-boosting network that gathers spatial context from multiple levels of feature extraction and comput…

    Submitted 29 February, 2024; originally announced February 2024.

  2. arXiv:2211.10883  [pdf, ps, other]

    cs.CV

    Audio-visual video face hallucination with frequency supervision and cross modality support by speech based lip reading loss

    Authors: Shailza Sharma, Abhinav Dhall, Vinay Kumar, Vivek Singh Bawa

    Abstract: Recently, there has been numerous breakthroughs in face hallucination tasks. However, the task remains rather challenging in videos in comparison to the images due to inherent consistency issues. The presence of extra temporal dimension in video face hallucination makes it non-trivial to learn the facial motion through out the sequence. In order to learn these fine spatio-temporal motion details,…

    Submitted 20 November, 2022; originally announced November 2022.

  3. arXiv:2206.11249  [pdf, other]

    cs.CL cs.AI cs.LG

    GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

    Authors: Sebastian Gehrmann, Abhik Bhattacharjee, Abinaya Mahendiran, Alex Wang, Alexandros Papangelis, Aman Madaan, Angelina McMillan-Major, Anna Shvets, Ashish Upadhyay, Bingsheng Yao, Bryan Wilie, Chandra Bhagavatula, Chaobin You, Craig Thomson, Cristina Garbacea, Dakuo Wang, Daniel Deutsch, Deyi Xiong, Di Jin, Dimitra Gkatzia, Dragomir Radev, Elizabeth Clark, Esin Durmus, Faisal Ladhak, Filip Ginter, et al. (52 additional authors not shown)

    Abstract: Evaluation in machine learning is usually informed by past choices, for example which datasets or metrics to use. This standardization enables the comparison on equal footing using leaderboards, but the evaluation choices become sub-optimal as better alternatives arise. This problem is especially pertinent in natural language generation which requires ever-improving suites of datasets, metrics, an…

    Submitted 24 June, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

  4. arXiv:2112.06924  [pdf, other]

    cs.CL cs.LG

    Generating Fluent Fact Checking Explanations with Unsupervised Post-Editing

    Authors: Shailza Jolly, Pepa Atanasova, Isabelle Augenstein

    Abstract: Fact-checking systems have become important tools to verify fake and misguiding news. These systems become more trustworthy when human-readable explanations accompany the veracity labels. However, manual collection of such explanations is expensive and time-consuming. Recent works frame explanation generation as extractive summarization, and propose to automatically select a sufficient subset of t…

    Submitted 13 December, 2021; originally announced December 2021.

  5. arXiv:2112.02770  [pdf, other]

    cs.CL

    Search and Learn: Improving Semantic Coverage for Data-to-Text Generation

    Authors: Shailza Jolly, Zi Xuan Zhang, Andreas Dengel, Lili Mou

    Abstract: Data-to-text generation systems aim to generate text descriptions based on input data (often represented in the tabular form). A typical system uses huge training samples for learning the correspondence between tables and texts. However, large training sets are expensive to obtain, limiting the applicability of these approaches in real-world scenarios. In this work, we focus on few-shot data-to-te…

    Submitted 5 December, 2021; originally announced December 2021.

    Comments: Accepted by AAAI'22

  6. arXiv:2110.01880  [pdf, ps, other]

    cs.CV cs.AI

    Frequency Aware Face Hallucination Generative Adversarial Network with Semantic Structural Constraint

    Authors: Shailza Sharma, Abhinav Dhall, Vinay Kumar

    Abstract: In this paper, we address the issue of face hallucination. Most current face hallucination methods rely on two-dimensional facial priors to generate high resolution face images from low resolution face images. These methods are only capable of assimilating global information into the generated image. Still there exist some inherent problems in these methods; such as, local features, subtle structu…

    Submitted 5 October, 2021; originally announced October 2021.

    Comments: 12 pages, 12 figures, submitted to IEEE Transactions on Computational Imaging

    ACM Class: I.4.3; I.4.9

  7. arXiv:2102.01672  [pdf, other]

    cs.CL cs.AI cs.LG

    The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics

    Authors: Sebastian Gehrmann, Tosin Adewumi, Karmanya Aggarwal, Pawan Sasanka Ammanamanchi, Aremu Anuoluwapo, Antoine Bosselut, Khyathi Raghavi Chandu, Miruna Clinciu, Dipanjan Das, Kaustubh D. Dhole, Wanyu Du, Esin Durmus, Ondřej Dušek, Chris Emezue, Varun Gangal, Cristina Garbacea, Tatsunori Hashimoto, Yufang Hou, Yacine Jernite, Harsh Jhamtani, Yangfeng Ji, Shailza Jolly, Mihir Kale, Dhruv Kumar, Faisal Ladhak, et al. (31 additional authors not shown)

    Abstract: We introduce GEM, a living benchmark for natural language Generation (NLG), its Evaluation, and Metrics. Measuring progress in NLG relies on a constantly evolving ecosystem of automated metrics, datasets, and human evaluation standards. Due to this moving target, new models often still evaluate on divergent anglo-centric corpora with well-established, but flawed, metrics. This disconnect makes it…

    Submitted 1 April, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

  8. arXiv:2010.14953  [pdf, other]

    cs.CV

    Leveraging Visual Question Answering to Improve Text-to-Image Synthesis

    Authors: Stanislav Frolov, Shailza Jolly, Jörn Hees, Andreas Dengel

    Abstract: Generating images from textual descriptions has recently attracted a lot of interest. While current models can generate photo-realistic images of individual objects such as birds and human faces, synthesising images with multiple objects is still very difficult. In this paper, we propose an effective way to combine Text-to-Image (T2I) synthesis with Visual Question Answering (VQA) to improve the i…

    Submitted 28 October, 2020; originally announced October 2020.

    Comments: Accepted to the LANTERN workshop at COLING 2020

  9. arXiv:2003.11844  [pdf, other]

    cs.CV

    P $\approx$ NP, at least in Visual Question Answering

    Authors: Shailza Jolly, Sebastian Palacio, Joachim Folz, Federico Raue, Joern Hees, Andreas Dengel

    Abstract: In recent years, progress in the Visual Question Answering (VQA) field has largely been driven by public challenges and large datasets. One of the most widely-used of these is the VQA 2.0 dataset, consisting of polar ("yes/no") and non-polar questions. Looking at the question distribution over all answers, we find that the answers "yes" and "no" account for 38 % of the questions, while the remaini…

    Submitted 27 March, 2020; v1 submitted 26 March, 2020; originally announced March 2020.

  10. arXiv:1809.04344  [pdf, other]

    cs.CV cs.AI cs.CL

    The Wisdom of MaSSeS: Majority, Subjectivity, and Semantic Similarity in the Evaluation of VQA

    Authors: Shailza Jolly, Sandro Pezzelle, Tassilo Klein, Andreas Dengel, Moin Nabi

    Abstract: We introduce MASSES, a simple evaluation metric for the task of Visual Question Answering (VQA). In its standard form, the VQA task is operationalized as follows: Given an image and an open-ended question in natural language, systems are required to provide a suitable answer. Currently, model performance is evaluated by means of a somehow simplistic metric: If the predicted answer is chosen by at…

    Submitted 12 September, 2018; originally announced September 2018.

    Comments: 10 pages, 7 figures

  11. arXiv:1808.08402  [pdf, other]

    cs.CV cs.MM

    How do Convolutional Neural Networks Learn Design?

    Authors: Shailza Jolly, Brian Kenji Iwana, Ryohei Kuroki, Seiichi Uchida

    Abstract: In this paper, we aim to understand the design principles in book cover images which are carefully crafted by experts. Book covers are designed in a unique way, specific to genres which convey important information to their readers. By using Convolutional Neural Networks (CNN) to predict book genres from cover images, visual cues which distinguish genres can be highlighted and analyzed. In order t…

    Submitted 25 August, 2018; originally announced August 2018.

    Comments: Accepted by ICPR 2018