Skip to main content

Showing 1–6 of 6 results for author: Runwal, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.15734  [pdf, other

    cs.AI cs.MA

    TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSON

    Authors: John Chong Min Tan, Prince Saroj, Bharat Runwal, Hardik Maheshwari, Brian Lim Yi Sheng, Richard Cottrill, Alankrit Chona, Ambuj Kumar, Mehul Motani

    Abstract: TaskGen is an open-sourced agentic framework which uses an Agent to solve an arbitrary task by breaking them down into subtasks. Each subtask is mapped to an Equipped Function or another Agent to execute. In order to reduce verbosity (and hence token usage), TaskGen uses StrictJSON that ensures JSON output from the Large Language Model (LLM), along with additional features such as type checking an… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: 53 pages

  2. arXiv:2404.18239  [pdf, other

    cs.LG cs.CL

    SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning

    Authors: Jinghan Jia, Yihua Zhang, Yimeng Zhang, Jiancheng Liu, Bharat Runwal, James Diffenderfer, Bhavya Kailkhura, Sijia Liu

    Abstract: Large Language Models (LLMs) have highlighted the necessity of effective unlearning mechanisms to comply with data regulations and ethical AI practices. LLM unlearning aims at removing undesired data influences and associated model capabilities without compromising utility beyond the scope of unlearning. While interest in studying LLM unlearning is growing, the impact of the optimizer choice for L… ▽ More

    Submitted 24 June, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

  3. arXiv:2402.01911  [pdf, other

    cs.LG

    From PEFT to DEFT: Parameter Efficient Finetuning for Reducing Activation Density in Transformers

    Authors: Bharat Runwal, Tejaswini Pedapati, Pin-Yu Chen

    Abstract: Pretrained Language Models (PLMs) have become the de facto starting point for fine-tuning on downstream tasks. However, as model sizes continue to increase, traditional fine-tuning of all the parameters becomes challenging. To address this, parameter-efficient fine-tuning (PEFT) methods have gained popularity as a means to adapt PLMs effectively. In parallel, recent studies have revealed the prese… ▽ More

    Submitted 14 July, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Preprint

  4. arXiv:2308.14969  [pdf, other

    cs.LG cs.CV

    Uncovering the Hidden Cost of Model Compression

    Authors: Diganta Misra, Muawiz Chaudhary, Agam Goyal, Bharat Runwal, Pin Yu Chen

    Abstract: In an age dominated by resource-intensive foundation models, the ability to efficiently adapt to downstream tasks is crucial. Visual Prompting (VP), drawing inspiration from the prompting techniques employed in Large Language Models (LLMs), has emerged as a pivotal method for transfer learning in the realm of computer vision. As the importance of efficiency continues to rise, research into model c… ▽ More

    Submitted 15 March, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: Preprint

  5. arXiv:2208.01853  [pdf, other

    cs.LG

    Robust Graph Neural Networks using Weighted Graph Laplacian

    Authors: Bharat Runwal, Vivek, Sandeep Kumar

    Abstract: Graph neural network (GNN) is achieving remarkable performances in a variety of application domains. However, GNN is vulnerable to noise and adversarial attacks in input data. Making GNN robust against noises and adversarial attacks is an important problem. The existing defense methods for GNNs are computationally demanding and are not scalable. In this paper, we propose a generic framework for ro… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: Accepted at IEEE International Conference on Signal Processing and Communications (SPCOM), 2022

  6. arXiv:2204.01640  [pdf, other

    cs.LG cs.CV

    APP: Anytime Progressive Pruning

    Authors: Diganta Misra, Bharat Runwal, Tianlong Chen, Zhangyang Wang, Irina Rish

    Abstract: With the latest advances in deep learning, there has been a lot of focus on the online learning paradigm due to its relevance in practical settings. Although many methods have been investigated for optimal learning settings in scenarios where the data stream is continuous over time, sparse networks training in such settings have often been overlooked. In this paper, we explore the problem of train… ▽ More

    Submitted 1 June, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

    Comments: 21 pages including 4 pages of references. Preprint version