Skip to main content

Showing 1–8 of 8 results for author: Agnihotri, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.09574  [pdf, other

    cs.LG

    Online Bandit Learning with Offline Preference Data

    Authors: Akhil Agnihotri, Rahul Jain, Deepak Ramachandran, Zheng Wen

    Abstract: Reinforcement Learning with Human Feedback (RLHF) is at the core of fine-tuning methods for generative AI models for language and images. Such feedback is often sought as rank or preference feedback from human raters, as opposed to eliciting scores since the latter tends to be very noisy. On the other hand, RL theory and algorithms predominantly assume that a reward feedback is available. In parti… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2406.09563  [pdf, other

    cs.LG

    e-COP : Episodic Constrained Optimization of Policies

    Authors: Akhil Agnihotri, Rahul Jain, Deepak Ramachandran, Sahil Singla

    Abstract: In this paper, we present the $\texttt{e-COP}$ algorithm, the first policy optimization algorithm for constrained Reinforcement Learning (RL) in episodic (finite horizon) settings. Such formulations are applicable when there are separate sets of optimization criteria and constraints on a system's behavior. We approach this problem by first establishing a policy difference lemma for the episodic se… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2311.08422  [pdf

    cs.LG

    k-Parameter Approach for False In-Season Anomaly Suppression in Daily Time Series Anomaly Detection

    Authors: Vincent Yuansang Zha, Vaishnavi Kommaraju, Okenna Obi-Njoku, Vijay Dakshinamoorthy, Anirudh Agnihotri, Nantes Kirsten

    Abstract: Detecting anomalies in a daily time series with a weekly pattern is a common task with a wide range of applications. A typical way of performing the task is by using decomposition method. However, the method often generates false positive results where a data point falls within its weekly range but is just off from its weekday position. We refer to this type of anomalies as "in-season anomalies",… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: 5 pages, 7 figures

  4. arXiv:2304.00763  [pdf

    cs.CV

    BOLLWM: A real-world dataset for bollworm pest monitoring from cotton fields in India

    Authors: Jerome White, Chandan Agrawal, Anmol Ojha, Apoorv Agnihotri, Makkunda Sharma, Jigar Doshi

    Abstract: This paper presents a dataset of agricultural pest images captured over five years by thousands of small holder farmers and farming extension workers across India. The dataset has been used to support a mobile application that relies on artificial intelligence to assist farmers with pest management decisions. Creation came from a mix of organized data collection, and from mobile application usage… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Journal ref: ICLR 2023 workshop on Practical Machine Learning for Developing Countries

  5. arXiv:2302.00808  [pdf, other

    cs.LG cs.AI

    ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints

    Authors: Akhil Agnihotri, Rahul Jain, Haipeng Luo

    Abstract: Reinforcement Learning (RL) for constrained MDPs (CMDPs) is an increasingly important problem for various applications. Often, the average criterion is more suitable than the discounted criterion. Yet, RL for average-CMDPs (ACMDPs) remains a challenging problem. Algorithms designed for discounted constrained RL problems often do not perform well for the average CMDP setting. In this paper, we intr… ▽ More

    Submitted 24 May, 2024; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: To appear in Proceedings of the $\mathit{41}^{st}$ International Conference on Machine Learning (ICML), Vienna, Austria. PMLR 235, 2024

  6. arXiv:2208.06359  [pdf

    cs.LG cs.CV

    A Case for Rejection in Low Resource ML Deployment

    Authors: Jerome White, Pulkit Madaan, Nikhil Shenoy, Apoorv Agnihotri, Makkunda Sharma, Jigar Doshi

    Abstract: Building reliable AI decision support systems requires a robust set of data on which to train models; both with respect to quantity and diversity. Obtaining such datasets can be difficult in resource limited settings, or for applications in early stages of deployment. Sample rejection is one way to work around this challenge, however much of the existing work in this area is ill-suited for such sc… ▽ More

    Submitted 15 August, 2022; v1 submitted 12 August, 2022; originally announced August 2022.

    Journal ref: NeurIPS 2022 workshop on Challenges In Deploying And Monitoring Machine Learning Systems

  7. arXiv:2105.00373  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Investigating the Impact of Multi-LiDAR Placement on Object Detection for Autonomous Driving

    Authors: Hanjiang Hu, Zuxin Liu, Sharad Chitlangia, Akhil Agnihotri, Ding Zhao

    Abstract: The past few years have witnessed an increasing interest in improving the perception performance of LiDARs on autonomous vehicles. While most of the existing works focus on developing new deep learning algorithms or model architectures, we study the problem from the physical design perspective, i.e., how different placements of multiple LiDARs influence the learning-based perception. To this end,… ▽ More

    Submitted 4 May, 2022; v1 submitted 1 May, 2021; originally announced May 2021.

    Comments: CVPR 2022 camera-ready version:15 pages, 14 figures, 9 tables

  8. arXiv:1909.03854  [pdf

    cs.RO

    A Convolutional Neural Network Approach Towards Self-Driving Cars

    Authors: Akhil Agnihotri, Prathamesh Saraf, Kriti Rajesh Bapnad

    Abstract: A convolutional neural network (CNN) approach is used to implement a level 2 autonomous vehicle by mapping pixels from the camera input to the steering commands. The network automatically learns the maximum variable features from the camera input, hence requires minimal human intervention. Given realistic frames as input, the driving policy trained on the dataset by NVIDIA and Udacity can adapt to… ▽ More

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: 4 pages, 7 figures