Showing 1–50 of 57 results for author: Choudhary, S

  1. arXiv:2408.05646  [pdf, other]

    cs.LG cs.AI cs.CL

    Eigen Attention: Attention in Low-Rank Space for KV Cache Compression

    Authors: Utkarsh Saxena, Gobinda Saha, Sakshi Choudhary, Kaushik Roy

    Abstract: Large language models (LLMs) represent a groundbreaking advancement in the domain of natural language processing due to their impressive reasoning abilities. Recently, there has been considerable interest in increasing the context lengths for these models to enhance their applicability to complex tasks. However, at long context lengths and large batch sizes, the key-value (KV) cache, which stores…

    Submitted 10 August, 2024; originally announced August 2024.

    Comments: 12 pages, 6 figures, 6 tables

  2. arXiv:2407.05404  [pdf, other]

    cs.CL cs.AI cs.CV cs.LG

    iSign: A Benchmark for Indian Sign Language Processing

    Authors: Abhinav Joshi, Romit Mohanty, Mounika Kanakanti, Andesha Mangla, Sudeep Choudhary, Monali Barbate, Ashutosh Modi

    Abstract: Indian Sign Language has limited resources for developing machine learning and data-driven approaches for automated language processing. Though text/audio-based language processing techniques have shown colossal research interest and tremendous improvements in the last few years, Sign Languages still need to catch up due to the need for more resources. To bridge this gap, in this work, we propose…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Accepted at ACL 2024 Findings. 18 Pages (9 Pages + References + Appendix)

  3. arXiv:2406.19150  [pdf, other]

    cs.CV cs.AI cs.IR

    RAVEN: Multitask Retrieval Augmented Vision-Language Learning

    Authors: Varun Nagaraj Rao, Siddharth Choudhary, Aditya Deshpande, Ravi Kumar Satzoda, Srikar Appalaraju

    Abstract: The scaling of large language models to encode all the world's knowledge in model parameters is unsustainable and has exacerbated resource barriers. Retrieval-Augmented Generation (RAG) presents a potential solution, yet its application to vision-language models (VLMs) is under explored. Existing methods focus on models designed for single tasks. Furthermore, they're limited by the need for resour…

    Submitted 27 June, 2024; originally announced June 2024.

  4. arXiv:2406.04744  [pdf, other]

    cs.CL

    CRAG -- Comprehensive RAG Benchmark

    Authors: Xiao Yang, Kai Sun, Hao Xin, Yushi Sun, Nikita Bhalla, Xiangsen Chen, Sajal Choudhary, Rongze Daniel Gui, Ziran Will Jiang, Ziyu Jiang, Lingkun Kong, Brian Moran, Jiaqi Wang, Yifan Ethan Xu, An Yan, Chenyu Yang, Eting Yuan, Hanwen Zha, Nan Tang, Lei Chen, Nicolas Scheffer, Yue Liu, Nirav Shah, Rakesh Wanga, Anuj Kumar, et al. (2 additional authors not shown)

    Abstract: Retrieval-Augmented Generation (RAG) has recently emerged as a promising solution to alleviate Large Language Model (LLM)'s deficiency in lack of knowledge. Existing RAG datasets, however, do not adequately represent the diverse and dynamic nature of real-world Question Answering (QA) tasks. To bridge this gap, we introduce the Comprehensive RAG Benchmark (CRAG), a factual question answering bench…

    Submitted 7 June, 2024; originally announced June 2024.

  5. arXiv:2405.15551  [pdf, other]

    cs.LG

    Thinking Forward: Memory-Efficient Federated Finetuning of Language Models

    Authors: Kunjal Panchal, Nisarg Parikh, Sunav Choudhary, Lijun Zhang, Yuriy Brun, Hui Guan

    Abstract: Finetuning large language models (LLMs) in federated learning (FL) settings has become important as it allows resource-constrained devices to finetune a model using private data. However, finetuning LLMs using backpropagation requires excessive memory (especially from intermediate activations) for resource-constrained devices. While Forward-mode Auto-Differentiation (AD) can reduce memory footprin…

    Submitted 24 May, 2024; originally announced May 2024.

  6. arXiv:2405.14377  [pdf, other]

    cs.LG cs.AI

    CoMERA: Computing- and Memory-Efficient Training via Rank-Adaptive Tensor Optimization

    Authors: Zi Yang, Samridhi Choudhary, Xinfeng Xie, Cao Gao, Siegfried Kunzmann, Zheng Zhang

    Abstract: Training large AI models such as deep learning recommendation systems and foundation language (or multi-modal) models costs massive GPUs and computing time. The high training cost has become only affordable to big tech companies, meanwhile also causing increasing concerns about the environmental impact. This paper presents CoMERA, a Computing- and Memory-Efficient training method via Rank-Adaptive…

    Submitted 23 May, 2024; originally announced May 2024.

  7. arXiv:2405.13961  [pdf, other]

    cs.LG cs.DC cs.MA

    SADDLe: Sharpness-Aware Decentralized Deep Learning with Heterogeneous Data

    Authors: Sakshi Choudhary, Sai Aparna Aketi, Kaushik Roy

    Abstract: Decentralized training enables learning with distributed datasets generated at different locations without relying on a central server. In realistic scenarios, the data distribution across these sparsely connected learning agents can be significantly heterogeneous, leading to local model over-fitting and poor global model generalization. Another challenge is the high communication cost of training…

    Submitted 22 May, 2024; originally announced May 2024.

  8. arXiv:2403.14003  [pdf, other]

    cs.CV cs.CL cs.LG

    Multi-Modal Hallucination Control by Visual Information Grounding

    Authors: Alessandro Favero, Luca Zancato, Matthew Trager, Siddharth Choudhary, Pramuditha Perera, Alessandro Achille, Ashwin Swaminathan, Stefano Soatto

    Abstract: Generative Vision-Language Models (VLMs) are prone to generate plausible-sounding textual answers that, however, are not always grounded in the input image. We investigate this phenomenon, usually referred to as "hallucination" and show that it stems from an excessive reliance on the language prior. In particular, we show that as more tokens are generated, the reliance on the visual prompt decreas…

    Submitted 20 March, 2024; originally announced March 2024.

    Journal ref: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024

  9. arXiv:2403.06319  [pdf, other]

    cs.LG cs.CR

    Fake or Compromised? Making Sense of Malicious Clients in Federated Learning

    Authors: Hamid Mozaffari, Sunav Choudhary, Amir Houmansadr

    Abstract: Federated learning (FL) is a distributed machine learning paradigm that enables training models on decentralized data. The field of FL security against poisoning attacks is plagued with confusion due to the proliferation of research that makes different assumptions about the capabilities of adversaries and the adversary models they operate under. Our work aims to clarify this confusion by presenti…

    Submitted 10 March, 2024; originally announced March 2024.

  10. arXiv:2403.05764  [pdf, ps, other]

    quant-ph cs.AI

    Investigation into the Potential of Parallel Quantum Annealing for Simultaneous Optimization of Multiple Problems: A Comprehensive Study

    Authors: Arit Kumar Bishwas, Anuraj Som, Saurabh Choudhary

    Abstract: Parallel Quantum Annealing is a technique to solve multiple optimization problems simultaneously. Parallel quantum annealing aims to optimize the utilization of available qubits on a quantum topology by addressing multiple independent problems in a single annealing cycle. This study provides insights into the potential and the limitations of this parallelization method. The experiments consisting…

    Submitted 8 March, 2024; originally announced March 2024.

  11. arXiv:2403.03292  [pdf, other]

    cs.LG cs.DC

    Averaging Rate Scheduler for Decentralized Learning on Heterogeneous Data

    Authors: Sai Aparna Aketi, Sakshi Choudhary, Kaushik Roy

    Abstract: State-of-the-art decentralized learning algorithms typically require the data distribution to be Independent and Identically Distributed (IID). However, in practical scenarios, the data distribution across the agents can have significant heterogeneity. In this work, we propose averaging rate scheduling as a simple yet effective way to reduce the impact of heterogeneity in decentralized learning. O…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 9 pages, 3 figures, 4 tables. arXiv admin note: text overlap with arXiv:2305.04792

  12. arXiv:2402.03388  [pdf, other]

    cs.AI cs.IR cs.LG

    Delivery Optimized Discovery in Behavioral User Segmentation under Budget Constraint

    Authors: Harshita Chopra, Atanu R. Sinha, Sunav Choudhary, Ryan A. Rossi, Paavan Kumar Indela, Veda Pranav Parwatala, Srinjayee Paul, Aurghya Maiti

    Abstract: Users' behavioral footprints online enable firms to discover behavior-based user segments (or, segments) and deliver segment specific messages to users. Following the discovery of segments, delivery of messages to users through preferred media channels like Facebook and Google can be challenging, as only a portion of users in a behavior segment find match in a medium, and only a fraction of those…

    Submitted 15 March, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

  13. A Holistic Approach on Smart Garment for Patients with Juvenile Idiopathic Arthritis

    Authors: Safal Choudhary, Princy Randhawa, Sampath Kumar P Jinka, Shiva Prasad H. C

    Abstract: Juvenile Idiopathic Arthritis (JIA) is a widespread and chronic condition that affects children and adolescents worldwide. The person suffering from JIA is characterized by chronic joint inflammation leading to pain, swelling, stiffness, and limited body movements. Individuals suffering from JIA require ongoing treatment for their lifetime. Beyond inflammation, JIA patients have expressed concerns…

    Submitted 25 December, 2023; originally announced January 2024.

    Comments: 08 pages

    Journal ref: 10.3390/engproc2023059083;2023

  14. arXiv:2312.14461  [pdf, other]

    cs.CR cs.AI cs.LG

    Attacking Byzantine Robust Aggregation in High Dimensions

    Authors: Sarthak Choudhary, Aashish Kolluri, Prateek Saxena

    Abstract: Training modern neural networks or models typically requires averaging over a sample of high-dimensional vectors. Poisoning attacks can skew or bias the average vectors used to train the model, forcing the model to learn specific patterns or avoid learning anything useful. Byzantine robust aggregation is a principled algorithmic defense against such biasing. Robust aggregators can bound the maximu…

    Submitted 19 April, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

  15. arXiv:2311.10812  [pdf, other]

    cs.CV cs.GR cs.LG

    SplatArmor: Articulated Gaussian splatting for animatable humans from monocular RGB videos

    Authors: Rohit Jena, Ganesh Subramanian Iyer, Siddharth Choudhary, Brandon Smith, Pratik Chaudhari, James Gee

    Abstract: We propose SplatArmor, a novel approach for recovering detailed and animatable human models by `armoring' a parameterized body model with 3D Gaussians. Our approach represents the human as a set of 3D Gaussians within a canonical space, whose articulation is defined by extending the skinning of the underlying SMPL geometry to arbitrary locations in the canonical space. To account for pose-dependen…

    Submitted 17 November, 2023; originally announced November 2023.

  16. arXiv:2307.02773  [pdf, other]

    cs.CV cs.HC

    SeLiNet: Sentiment enriched Lightweight Network for Emotion Recognition in Images

    Authors: Tuneer Khargonkar, Shwetank Choudhary, Sumit Kumar, Barath Raj KR

    Abstract: In this paper, we propose a sentiment-enriched lightweight network SeLiNet and an end-to-end on-device pipeline for contextual emotion recognition in images. SeLiNet model consists of body feature extractor, image aesthetics feature extractor, and learning-based fusion network which jointly estimates discrete emotion and human sentiments tasks. On the EMOTIC dataset, the proposed approach achieves…

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: Paper submitted in ISCAS 2023

  17. arXiv:2306.01076  [pdf, ps, other]

    cs.CL cs.AI

    Quantization-Aware and Tensor-Compressed Training of Transformers for Natural Language Understanding

    Authors: Zi Yang, Samridhi Choudhary, Siegfried Kunzmann, Zheng Zhang

    Abstract: Fine-tuned transformer models have shown superior performances in many natural language tasks. However, the large model size prohibits deploying high-performance transformer models on resource-constrained devices. This paper proposes a quantization-aware tensor-compressed training approach to reduce the model size, arithmetic operations, and ultimately runtime latency of transformer-based models.…

    Submitted 8 July, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

  18. LEAN: Light and Efficient Audio Classification Network

    Authors: Shwetank Choudhary, CR Karthik, Punuru Sri Lakshmi, Sumit Kumar

    Abstract: Over the past few years, audio classification task on large-scale dataset such as AudioSet has been an important research area. Several deeper Convolution-based Neural networks have shown compelling performance notably Vggish, YAMNet, and Pretrained Audio Neural Network (PANN). These models are available as pretrained architecture for transfer learning as well as specific audio task adoption. In t…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted at INDICON 2022

  19. arXiv:2303.15378  [pdf, other]

    cs.LG cs.DC cs.MA

    CoDeC: Communication-Efficient Decentralized Continual Learning

    Authors: Sakshi Choudhary, Sai Aparna Aketi, Gobinda Saha, Kaushik Roy

    Abstract: Training at the edge utilizes continuously evolving data generated at different locations. Privacy concerns prohibit the co-location of this spatially as well as temporally distributed data, deeming it crucial to design training algorithms that enable efficient continual learning over decentralized private data. Decentralized learning allows serverless training with spatially distributed data. A f…

    Submitted 27 March, 2023; originally announced March 2023.

  20. arXiv:2303.08808  [pdf, other]

    cs.CV

    Mesh Strikes Back: Fast and Efficient Human Reconstruction from RGB videos

    Authors: Rohit Jena, Pratik Chaudhari, James Gee, Ganesh Iyer, Siddharth Choudhary, Brandon M. Smith

    Abstract: Human reconstruction and synthesis from monocular RGB videos is a challenging problem due to clothing, occlusion, texture discontinuities and sharpness, and frame-specific pose changes. Many methods employ deferred rendering, NeRFs and implicit methods to represent clothed humans, on the premise that mesh-based representations cannot capture complex clothing and textures from RGB, silhouettes, and…

    Submitted 15 March, 2023; originally announced March 2023.

  21. arXiv:2302.13053  [pdf, other]

    cs.LG cs.AI cs.IR

    Scalable Neural Network Training over Distributed Graphs

    Authors: Aashish Kolluri, Sarthak Choudhary, Bryan Hooi, Prateek Saxena

    Abstract: Graph neural networks (GNNs) fuel diverse machine learning tasks involving graph-structured data, ranging from predicting protein structures to serving personalized recommendations. Real-world graph data must often be stored distributed across many machines not just because of capacity constraints, but because of compliance with data residency or privacy laws. In such setups, network communication…

    Submitted 11 February, 2024; v1 submitted 25 February, 2023; originally announced February 2023.

  22. A Multimodal Sensing Ring for Quantification of Scratch Intensity

    Authors: Akhil Padmanabha, Sonal Choudhary, Carmel Majidi, Zackory Erickson

    Abstract: An objective measurement of chronic itch is necessary for improvements in patient care for numerous medical conditions. While wearables have shown promise for scratch detection, they are currently unable to estimate scratch intensity, preventing a comprehensive understanding of the effect of itch on an individual. In this work, we present a framework for the estimation of scratch intensity in addi…

    Submitted 31 October, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Journal ref: Commun Med 3, 115 (2023)

  23. arXiv:2211.15281  [pdf, other]

    cs.LG

    Flow: Per-Instance Personalized Federated Learning Through Dynamic Routing

    Authors: Kunjal Panchal, Sunav Choudhary, Nisarg Parikh, Lijun Zhang, Hui Guan

    Abstract: Personalization in Federated Learning (FL) aims to modify a collaboratively trained global model according to each client. Current approaches to personalization in FL are at a coarse granularity, i.e. all the input instances of a client use the same personalized model. This ignores the fact that some instances are more accurately handled by the global model due to better generalizability. To addre…

    Submitted 10 February, 2024; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: 37th Annual Conference on Neural Information Processing Systems (NeurIPS), 2023

  24. arXiv:2209.04333  [pdf, other]

    cs.CL

    Ranking-Enhanced Unsupervised Sentence Representation Learning

    Authors: Yeon Seonwoo, Guoyin Wang, Changmin Seo, Sajal Choudhary, Jiwei Li, Xiang Li, Puyang Xu, Sunghyun Park, Alice Oh

    Abstract: Unsupervised sentence representation learning has progressed through contrastive learning and data augmentation methods such as dropout masking. Despite this progress, sentence encoders are still limited to using only an input sentence when predicting its semantic vector. In this work, we show that the semantic meaning of a sentence is also determined by nearest-neighbor sentences that are similar…

    Submitted 18 May, 2023; v1 submitted 9 September, 2022; originally announced September 2022.

    Comments: ACL 2023

  25. arXiv:2207.01551  [pdf, other]

    cs.DS

    Correlated Stochastic Knapsack with a Submodular Objective

    Authors: Sheng Yang, Samir Khuller, Sunav Choudhary, Subrata Mitra, Kanak Mahadik

    Abstract: We study the correlated stochastic knapsack problem of a submodular target function, with optional additional constraints. We utilize the multilinear extension of submodular function, and bundle it with an adaptation of the relaxed linear constraints from Ma [Mathematics of Operations Research, Volume 43(3), 2018] on correlated stochastic knapsack problem. The relaxation is then solved by the stoc…

    Submitted 3 August, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: Accepted to ESA 2022. (fix typo in previous version)

  26. arXiv:2203.17081  [pdf, other]

    cs.LG cs.AI cs.CL cs.IR

    Interpretation of Black Box NLP Models: A Survey

    Authors: Shivani Choudhary, Niladri Chatterjee, Subir Kumar Saha

    Abstract: An increasing number of machine learning models have been deployed in domains with high stakes such as finance and healthcare. Despite their superior performances, many models are black boxes in nature which are hard to explain. There are growing efforts for researchers to develop methods to interpret these black-box models. Post hoc explanations based on perturbations, such as LIME, are widely us…

    Submitted 31 March, 2022; originally announced March 2022.

  27. arXiv:2203.17042  [pdf, other]

    cs.IR cs.AI cs.CY

    IITD-DBAI: Multi-Stage Retrieval with Pseudo-Relevance Feedback and Query Reformulation

    Authors: Shivani Choudhary

    Abstract: Resolving the contextual dependency is one of the most challenging tasks in the Conversational system. Our submission to CAsT-2021 aimed to preserve the key terms and the context in all subsequent turns and use classical Information retrieval methods. It was aimed to pull as relevant documents as possible from the corpus. We have participated in automatic track and submitted two runs in the CAsT-2…

    Submitted 31 March, 2022; originally announced March 2022.

  28. arXiv:2110.11719  [pdf, other]

    cs.PF

    Experience with PCIe streaming on FPGA for high throughput ML inferencing

    Authors: Piyush Manavar, Manoj Nambiar, Nupur Sumeet, Rekha Singhal, Sharod Choudhary, Amey Pandit

    Abstract: Achieving maximum possible rate of inferencing with minimum hardware resources plays a major role in reducing enterprise operational costs. In this paper we explore use of PCIe streaming on FPGA based platforms to achieve high throughput. PCIe streaming is a unique capability available on FPGA that eliminates the need for memory copy overheads. We have presented our results for inferences on a gra…

    Submitted 22 October, 2021; originally announced October 2021.

    MSC Class: 68T99; ACM Class: C.4

  29. arXiv:2108.04558  [pdf, other]

    cs.CV

    Understanding Character Recognition using Visual Explanations Derived from the Human Visual System and Deep Networks

    Authors: Chetan Ralekar, Shubham Choudhary, Tapan Kumar Gandhi, Santanu Chaudhury

    Abstract: Human observers engage in selective information uptake when classifying visual patterns. The same is true of deep neural networks, which currently constitute the best performing artificial vision systems. Our goal is to examine the congruence, or lack thereof, in the information-gathering strategies of the two systems. We have operationalized our investigation as a character recognition task. We h…

    Submitted 29 August, 2021; v1 submitted 10 August, 2021; originally announced August 2021.

  30. arXiv:2107.07842  [pdf, other]

    cs.IR cs.AI

    A Survey of Knowledge Graph Embedding and Their Applications

    Authors: Shivani Choudhary, Tarun Luthra, Ashima Mittal, Rajat Singh

    Abstract: Knowledge Graph embedding provides a versatile technique for representing knowledge. These techniques can be used in a variety of applications such as completion of knowledge graph to predict missing information, recommender systems, question answering, query expansion, etc. The information embedded in Knowledge graph though being structured is challenging to consume in a real-world application. K…

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: 11 pages, 9 figures

  31. arXiv:2106.12871  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    DCoM: A Deep Column Mapper for Semantic Data Type Detection

    Authors: Subhadip Maji, Swapna Sourav Rout, Sudeep Choudhary

    Abstract: Detection of semantic data types is a very crucial task in data science for automated data cleaning, schema matching, data discovery, semantic data type normalization and sensitive data identification. Existing methods include regular expression-based or dictionary lookup-based methods that are not robust to dirty as well unseen data and are limited to a very less number of semantic data types to…

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: 9 pages, 2 figures, 7 tables

  32. arXiv:2106.09009  [pdf, other]

    cs.CL cs.AI cs.SD eess.AS

    End-to-End Spoken Language Understanding for Generalized Voice Assistants

    Authors: Michael Saxon, Samridhi Choudhary, Joseph P. McKenna, Athanasios Mouchtaris

    Abstract: End-to-end (E2E) spoken language understanding (SLU) systems predict utterance semantics directly from speech using a single model. Previous work in this area has focused on targeted tasks in fixed domains, where the output semantic structure is assumed a priori and the input speech is of limited complexity. In this work we present our approach to developing an E2E model for generalized SLU in com…

    Submitted 19 July, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: Accepted to Interspeech 2021; 5 pages, 2 tables, 1 figure

    Journal ref: Proc. Interspeech 2021, 4738-4742

  33. arXiv:2106.01251  [pdf, other]

    cs.CL cs.IR

    Multilingual Medical Question Answering and Information Retrieval for Rural Health Intelligence Access

    Authors: Vishal Vinod, Susmit Agrawal, Vipul Gaurav, Pallavi R, Savita Choudhary

    Abstract: In rural regions of several developing countries, access to quality healthcare, medical infrastructure, and professional diagnosis is largely unavailable. Many of these regions are gradually gaining access to internet infrastructure, although not with a strong enough connection to allow for sustained communication with a medical practitioner. Several deaths resulting from this lack of medical acce…

    Submitted 2 June, 2021; originally announced June 2021.

    Journal ref: ICLR 2021 Workshop

  34. arXiv:2104.13216  [pdf, other]

    cs.LG cs.AI

    Handling Long-Tail Queries with Slice-Aware Conversational Systems

    Authors: Cheng Wang, Sun Kim, Taiwoo Park, Sajal Choudhary, Sunghyun Park, Young-Bum Kim, Ruhi Sarikaya, Sungjin Lee

    Abstract: We have been witnessing the usefulness of conversational AI systems such as Siri and Alexa, directly impacting our daily lives. These systems normally rely on machine learning models evolving over time to provide quality user experience. However, the development and improvement of the models are challenging because they need to support both high (head) and low (tail) usage scenarios, requiring fin…

    Submitted 26 April, 2021; originally announced April 2021.

    Comments: Published at ICLR 2021 Workshop on Weakly Supervised Learning

  35. arXiv:2012.00124  [pdf, other]

    cs.CL cs.AI cs.LG

    Extreme Model Compression for On-device Natural Language Understanding

    Authors: Kanthashree Mysore Sathyendra, Samridhi Choudhary, Leah Nicolich-Henkin

    Abstract: In this paper, we propose and experiment with techniques for extreme compression of neural natural language understanding (NLU) models, making them suitable for execution on resource-constrained devices. We propose a task-aware, end-to-end compression approach that performs word-embedding compression jointly with NLU task learning. We show our results on a large-scale, commercial NLU system traine…

    Submitted 30 November, 2020; originally announced December 2020.

    Comments: Long paper at COLING 2020

  36. arXiv:2011.09044  [pdf, other]

    eess.AS cs.CL cs.SD

    Tie Your Embeddings Down: Cross-Modal Latent Spaces for End-to-end Spoken Language Understanding

    Authors: Bhuvan Agrawal, Markus Müller, Martin Radfar, Samridhi Choudhary, Athanasios Mouchtaris, Siegfried Kunzmann

    Abstract: End-to-end (E2E) spoken language understanding (SLU) systems can infer the semantics of a spoken utterance directly from an audio signal. However, training an E2E system remains a challenge, largely due to the scarcity of paired audio-semantics data. In this paper, we treat an E2E system as a multi-modal model, with audio and text functioning as its two modalities, and use a cross-modal latent spa…

    Submitted 15 April, 2021; v1 submitted 17 November, 2020; originally announced November 2020.

    Comments: 7 pages, 6 figures

  37. arXiv:2008.02858  [pdf, other]

    cs.CL cs.SD eess.AS

    Semantic Complexity in End-to-End Spoken Language Understanding

    Authors: Joseph P. McKenna, Samridhi Choudhary, Michael Saxon, Grant P. Strimel, Athanasios Mouchtaris

    Abstract: End-to-end spoken language understanding (SLU) models are a class of model architectures that predict semantics directly from speech. Because of their input and output types, we refer to them as speech-to-interpretation (STI) models. Previous works have successfully applied STI models to targeted use cases, such as recognizing home automation commands, however no study has yet addressed how these…

    Submitted 6 August, 2020; originally announced August 2020.

    Comments: Accepted at Interspeech, 2020

  38. arXiv:1912.00818  [pdf, other]

    cs.LG cs.DC stat.ML

    Federated Learning with Personalization Layers

    Authors: Manoj Ghuhan Arivazhagan, Vinay Aggarwal, Aaditya Kumar Singh, Sunav Choudhary

    Abstract: The emerging paradigm of federated learning strives to enable collaborative training of machine learning models on the network edge without centrally aggregating raw data and hence, improving data privacy. This sharply deviates from traditional machine learning and necessitates the design of algorithms robust to various sources of heterogeneity. Specifically, statistical heterogeneity of data acro…

    Submitted 2 December, 2019; originally announced December 2019.

  39. arXiv:1911.12740  [pdf, other]

    cs.LG stat.ML

    Data-Driven Compression of Convolutional Neural Networks

    Authors: Ramit Pahwa, Manoj Ghuhan Arivazhagan, Ankur Garg, Siddarth Krishnamoorthy, Rohit Saxena, Sunav Choudhary

    Abstract: Deploying trained convolutional neural networks (CNNs) to mobile devices is a challenging task because of the simultaneous requirements of the deployed model to be fast, lightweight and accurate. Designing and training a CNN architecture that does well on all three metrics is highly non-trivial and can be very time-consuming if done by hand. One way to solve this problem is to compress the trained…

    Submitted 28 November, 2019; originally announced November 2019.

    Comments: 17 pages, 10 tables, 1 figure

  40. arXiv:1910.04402  [pdf, ps, other]

    eess.SP cs.NI

    Scheduling in Wireless Networks with Spatial Reuse of Spectrum as Restless Bandits

    Authors: Vivek S. Borkar, Shantanu Choudhary, Vaibhav Kumar Gupta, Gaurav S. Kasbekar

    Abstract: We study the problem of scheduling packet transmissions with the aim of minimizing the energy consumption and data transmission delay of users in a wireless network in which spatial reuse of spectrum is employed. We approach this problem using the theory of Whittle index for cost minimizing restless bandits, which has been used to effectively solve problems in a variety of applications. We design…

    Submitted 8 June, 2020; v1 submitted 10 October, 2019; originally announced October 2019.

    Comments: Revision

  41. arXiv:1901.07646  [pdf, other]

    cs.RO cs.AI

    Learning Configuration Space Belief Model from Collision Checks for Motion Planning

    Authors: Sumit Kumar, Shushman Choudhary, Siddhartha Srinivasa

    Abstract: For motion planning in high dimensional configuration spaces, a significant computational bottleneck is collision detection. Our aim is to reduce the expected number of collision checks by creating a belief model of the configuration space using results from collision tests. We assume the robot's configuration space to be a continuous ambient space whereby neighbouring points tend to share the sam…

    Submitted 9 February, 2019; v1 submitted 22 January, 2019; originally announced January 2019.

  42. arXiv:1812.04215  [pdf, other]

    cs.CV

    Automatic Feature Weight Determination using Indexing and Pseudo-Relevance Feedback for Multi-feature Content-Based Image Retrieval

    Authors: Asheet Kumar, Shivam Choudhary, Vaibhav Singh Khokhar, Vikas Meena, Chiranjoy Chattopadhyay

    Abstract: Content-based image retrieval (CBIR) is one of the most active research areas in multimedia information retrieval. Given a query image, the task is to search relevant images in a repository. Low level features like color, texture, and shape feature vectors of an image are always considered to be an important attribute in CBIR system. Thus the performance of the CBIR system can be enhanced by combi…

    Submitted 10 December, 2018; originally announced December 2018.

    Comments: 9 pages, 6 figures

  43. arXiv:1802.04422  [pdf, other]

    stat.ML cs.CY cs.LG

    A comparative study of fairness-enhancing interventions in machine learning

    Authors: Sorelle A. Friedler, Carlos Scheidegger, Suresh Venkatasubramanian, Sonam Choudhary, Evan P. Hamilton, Derek Roth

    Abstract: Computers are increasingly used to make decisions that have significant impact in people's lives. Often, these predictions can affect different population subgroups disproportionately. As a result, the issue of fairness has received much recent interest, and a number of fairness-enhanced classifiers and predictors have appeared in the literature. This paper seeks to study the following questions:…

    Submitted 12 February, 2018; originally announced February 2018.

  44. arXiv:1710.05772  [pdf, other]

    cs.RO

    Data-Efficient Decentralized Visual SLAM

    Authors: Titus Cieslewski, Siddharth Choudhary, Davide Scaramuzza

    Abstract: Decentralized visual simultaneous localization and mapping (SLAM) is a powerful tool for multi-robot applications in environments where absolute positioning systems are not available. Being visual, it relies on cameras, cheap, lightweight and versatile sensors, and being decentralized, it does not rely on communication to a central ground station. In this work, we integrate state-of-the-art decent…

    Submitted 16 October, 2017; originally announced October 2017.

    Comments: 8 pages, submitted to ICRA 2018

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), 2018

  45. Advanced Page Rank Algorithm with Semantics, In Links, Out Links and Google Analytics

    Authors: Aritra Banerjee, Shrey Choudhary

    Abstract: In this paper we have modified the existing page ranking mechanism as an advanced Page Rank Algorithm based on Semantics, Inlinks, Outlinks and Google Analytics. We have used Semantics page ranking to rank pages according to the word searched and match it with the metadata of the website and provide a value of rank according to the highest priority. We have also used Google Analytics to store the num…

    Submitted 7 September, 2017; originally announced September 2017.

    Comments: 6 pages, 2 figures, Published with International Journal of Computer Trends and Technology (IJCTT)

    Journal ref: International Journal of Computer Trends and Technology (IJCTT) V50 (3):137-142, August 2017. ISSN:2231-2803. Published by Seventh Sense Research Group

  46. arXiv:1708.00897  [pdf, other]

    cs.CL

    Domain Aware Neural Dialog System

    Authors: Sajal Choudhary, Prerna Srivastava, Lyle Ungar, João Sedoc

    Abstract: We investigate the task of building a domain aware chat system which generates intelligent responses in a conversation comprising different domains. The domain, in this case, is the topic or theme of the conversation. To achieve this, we present DOM-Seq2Seq, a domain aware neural network model based on the novel technique of using domain-targeted sequence-to-sequence models (Sutskever et al., 2…

    Submitted 2 August, 2017; originally announced August 2017.

  47. arXiv:1707.04546  [pdf, other]

    cs.CL cs.SI

    Linguistic Markers of Influence in Informal Interactions

    Authors: Shrimai Prabhumoye, Samridhi Choudhary, Evangelia Spiliopoulou, Christopher Bogart, Carolyn Penstein Rose, Alan W Black

    Abstract: There has been a long-standing interest in understanding 'Social Influence' both in Social Sciences and in Computational Linguistics. In this paper, we present a novel approach to study and measure interpersonal influence in daily interactions. Motivated by the basic principles of influence, we attempt to identify indicative linguistic features of the posts in an online knitting community. We pres…

    Submitted 14 July, 2017; originally announced July 2017.

    Comments: 10 pages, Accepted in NLP+CSS workshop for ACL (Association for Computational Linguistics) 2017

  48. arXiv:1702.03435  [pdf, other]

    cs.RO cs.CV

    Distributed Mapping with Privacy and Communication Constraints: Lightweight Algorithms and Object-based Models

    Authors: Siddharth Choudhary, Luca Carlone, Carlos Nieto, John Rogers, Henrik I. Christensen, Frank Dellaert

    Abstract: We consider the following problem: a team of robots is deployed in an unknown environment and it has to collaboratively build a map of the area without a reliable infrastructure for communication. The backbone for modern mapping techniques is pose graph optimization, which estimates the trajectory of the robots, from which the map can be easily built. The first contribution of this paper is a set…

    Submitted 11 February, 2017; originally announced February 2017.

    Comments: preprint for IJRR submission

  49. arXiv:1608.03624  [pdf, other]

    cs.SE

    From Manual Android Tests to Automated and Platform Independent Test Scripts

    Authors: Mattia Fazzini, Eduardo Noronha de A. Freitas, Shauvik Roy Choudhary, Alessandro Orso

    Abstract: Because mobile apps are extremely popular and often mission critical nowadays, companies invest a great deal of resources in testing the apps they provide to their customers. Testing is particularly important for Android apps, which must run on a multitude of devices and operating system versions. Unfortunately, as we confirmed in many interviews with quality assurance professionals, app testing i…

    Submitted 11 August, 2016; originally announced August 2016.

  50. arXiv:1601.07254  [pdf, other]

    cs.IT

    Active Target Localization using Low-Rank Matrix Completion and Unimodal Regression

    Authors: Sunav Choudhary, Naveen Kumar, Srikanth Narayanan, Urbashi Mitra

    Abstract: The detection and localization of a target from samples of its generated field is a problem of interest in a broad range of applications. Often, the target field admits structural properties that enable the design of lower sample detection strategies with good performance. This paper designs a sampling and localization strategy which exploits separability and unimodality in target fields and theor…

    Submitted 26 January, 2016; originally announced January 2016.

    Comments: 24 pages, 10 figures