Skip to main content

Showing 1–50 of 672 results for author: Jain, R

.
  1. arXiv:2408.14670  [pdf, ps, other

    cs.CC

    Lossy Catalytic Computation

    Authors: Chetan Gupta, Rahul Jain, Vimal Raj Sharma, Raghunath Tewari

    Abstract: A catalytic Turing machine is a variant of a Turing machine in which there exists an auxiliary tape in addition to the input tape and the work tape. This auxiliary tape is initially filled with arbitrary content. The machine can read and write on the auxiliary tape, but it is constrained to restore its initial content when it halts. Studying such a model and finding its powers and limitations has… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

  2. arXiv:2408.11919  [pdf, other

    cs.DC

    PAL: A Variability-Aware Policy for Scheduling ML Workloads in GPU Clusters

    Authors: Rutwik Jain, Brandon Tran, Keting Chen, Matthew D. Sinclair, Shivaram Venkataraman

    Abstract: Large-scale computing systems are increasingly using accelerators such as GPUs to enable peta- and exa-scale levels of compute to meet the needs of Machine Learning (ML) and scientific computing applications. Given the widespread and growing use of ML, including in some scientific applications, optimizing these clusters for ML workloads is particularly important. However, recent work has demonstra… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

  3. arXiv:2408.09419  [pdf, other

    cond-mat.stat-mech quant-ph

    Dynamical response and time correlation functions in random quantum systems

    Authors: Sudhir Ranjan Jain, Pierre Gaspard

    Abstract: Time-dependent response and correlation functions are studied in random quantum systems composed of infinitely many parts without mutual interaction and defined with statistically independent random matrices. The latter are taken within the three Wigner-Dyson universality classes. In these systems, the response functions are shown to be exactly given by statistical averages over the random-matrix… ▽ More

    Submitted 18 August, 2024; originally announced August 2024.

  4. arXiv:2408.09125  [pdf, other

    cs.LG cs.AI

    Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning

    Authors: Rishabh Agrawal, Nathan Dahlin, Rahul Jain, Ashutosh Nayyar

    Abstract: Imitation learning (IL) is notably effective for robotic tasks where directly programming behaviors or defining optimal control costs is challenging. In this work, we address a scenario where the imitator relies solely on observed behavior and cannot make environmental interactions during learning. It does not have additional supplementary datasets beyond the expert's dataset nor any information a… ▽ More

    Submitted 17 August, 2024; originally announced August 2024.

  5. arXiv:2408.08967  [pdf, other

    cs.CR

    Phishing Codebook: A Structured Framework for the Characterization of Phishing Emails

    Authors: Tarini Saka, Rachiyta Jain, Kami Vaniea, Nadin Kökciyan

    Abstract: Phishing is one of the most prevalent and expensive types of cybercrime faced by organizations and individuals worldwide. Most prior research has focused on various technical features and traditional representations of text to characterize phishing emails. There is a significant knowledge gap about the qualitative traits embedded in them, which could be useful in a range of phishing mitigation tas… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

    Comments: 18 pages

  6. arXiv:2408.06722  [pdf, other

    quant-ph

    Quantum cloning transformation unlocks the potential of W class of states in a secret sharing protocol

    Authors: Rashi Jain, Satyabrata Adhikari

    Abstract: One of the most challenging problems is to share a secret because the sender does not trust the receiver completely. Thus, the sender provides one part of the information to the receiver and shares the other part of the information to a third party on whom the sender can rely. The secret can be revealed when the receiver and the third party agree to cooperate. This is the essence of the secret-sha… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

    Comments: 11 pages, 3 figures

  7. arXiv:2408.06207  [pdf

    cs.NI quant-ph

    Multi-tree Quantum Routing in Realistic Topologies

    Authors: Zebo Yang, Ali Ghubaish, Raj Jain, Ramana Kompella, Hassan Shapourian

    Abstract: In entanglement distribution networks, communication between two nodes necessitates the generation of end-to-end entanglement by entanglement swapping at intermediate nodes. Efficiently creating end-to-end entanglements over long distances is a key objective. In our prior study on asynchronous routing, we enhanced these entanglement rates by leveraging solely the local knowledge of the entanglemen… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: This article has been accepted for publication in IEEE Communications Magazine

  8. arXiv:2407.16410  [pdf, other

    cs.CR cs.SE

    Securing Tomorrow's Smart Cities: Investigating Software Security in Internet of Vehicles and Deep Learning Technologies

    Authors: Ridhi Jain, Norbert Tihanyi, Mohamed Amine Ferrag

    Abstract: Integrating Deep Learning (DL) techniques in the Internet of Vehicles (IoV) introduces many security challenges and issues that require thorough examination. This literature review delves into the inherent vulnerabilities and risks associated with DL in IoV systems, shedding light on the multifaceted nature of security threats. Through an extensive analysis of existing research, we explore potenti… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  9. arXiv:2407.15373  [pdf, other

    cs.HC

    avaTTAR: Table Tennis Stroke Training with On-body and Detached Visualization in Augmented Reality

    Authors: Dizhi Ma, Xiyun Hu, Jingyu Shi, Mayank Patel, Rahul Jain, Ziyi Liu, Zhengzhe Zhu, Karthik Ramani

    Abstract: Table tennis stroke training is a critical aspect of player development. We designed a new augmented reality (AR) system, avaTTAR, for table tennis stroke training. The system provides both "on-body" (first-person view) and "detached" (third-person view) visual cues, enabling users to visualize target strokes and correct their attempts effectively with this dual perspectives setup. By employing a… ▽ More

    Submitted 26 July, 2024; v1 submitted 22 July, 2024; originally announced July 2024.

  10. arXiv:2407.09434  [pdf, other

    cs.LG cs.AI cs.CE eess.SY

    A Perspective on Foundation Models for the Electric Power Grid

    Authors: Hendrik F. Hamann, Thomas Brunschwiler, Blazhe Gjorgiev, Leonardo S. A. Martins, Alban Puech, Anna Varbella, Jonas Weiss, Juan Bernabe-Moreno, Alexandre Blondin Massé, Seong Choi, Ian Foster, Bri-Mathias Hodge, Rishabh Jain, Kibaek Kim, Vincent Mai, François Mirallès, Martin De Montigny, Octavio Ramos-Leaños, Hussein Suprême, Le Xie, El-Nasser S. Youssef, Arnaud Zinflou, Alexander J. Belvi, Ricardo J. Bessa, Bishnu Prasad Bhattari , et al. (2 additional authors not shown)

    Abstract: Foundation models (FMs) currently dominate news headlines. They employ advanced deep learning architectures to extract structural information autonomously from vast datasets through self-supervision. The resulting rich representations of complex systems and dynamics can be applied to many downstream applications. Therefore, FMs can find uses in electric power grids, challenged by the energy transi… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Lead contact: H.F.H.; Major equal contributors: H.F.H., T.B., B.G., L.S.A.M., A.P., A.V., J.W.; Significant equal contributors: J.B., A.B.M., S.C., I.F., B.H., R.J., K.K., V.M., F.M., M.D.M., O.R., H.S., L.X., E.S.Y., A.Z.; Other equal contributors: A.J.B., R.J.B., B.P.B., J.S., S.S

  11. arXiv:2407.09180  [pdf, other

    cs.AR

    iMIV: in-Memory Integrity Verification for NVM

    Authors: Rajat Jain, Aravinda Prasad, Sreenivas Subramoney, Arkaprava Basu

    Abstract: Non-volatile Memory (NVM) could bridge the gap between memory and storage. However, NVMs are susceptible to data remanence attacks. Thus, multiple security metadata must persist along with the data to protect the confidentiality and integrity of NVM-resident data. Persisting Bonsai Merkel Tree (BMT) nodes, critical for data integrity, can add significant overheads due to need to write large amount… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  12. arXiv:2407.01904  [pdf, other

    cs.DS

    From Directed Steiner Tree to Directed Polymatroid Steiner Tree in Planar Graphs

    Authors: Chandra Chekuri, Rhea Jain, Shubhang Kulkarni, Da Wei Zheng, Weihao Zhu

    Abstract: In the Directed Steiner Tree (DST) problem the input is a directed edge-weighted graph $G=(V,E)$, a root vertex $r$ and a set $S \subseteq V$ of $k$ terminals. The goal is to find a min-cost subgraph that connects $r$ to each of the terminals. DST admits an $O(\log^2 k/\log \log k)$-approximation in quasi-polynomial time, and an $O(k^ε)$-approximation for any fixed $ε> 0$ in polynomial-time. Resol… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  13. arXiv:2407.00527  [pdf, other

    math.OC

    Assessing the Value of Coupling Thermal Energy Storage with Air-Source Heat Pumps for Residential Space Heating in U.S. Cities

    Authors: An T. Pham, Bryan Kinzer, Ritvik Jain, Rohini Bala Chandran, Michael T. Craig

    Abstract: Widespread air source heat pump (ASHP) adoption faces several challenges that on-site thermal energy storage (TES), particularly thermochemical salt hydrate TES, can mitigate. No techno-economic analyses for salt-hydrate-based TES in residential applications exist. We quantify the residential space heating value of four salt hydrate TES materials - MgSO4, MgCl2, K2CO3, and SrBr2 - coupled with ASH… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  14. arXiv:2406.18058  [pdf

    cs.CR

    Fuzzing at Scale: The Untold Story of the Scheduler

    Authors: Ivica Nikolic, Racchit Jain

    Abstract: How to search for bugs in 1,000 programs using a pre-existing fuzzer and a standard PC? We consider this problem and show that a well-designed strategy that determines which programs to fuzz and for how long can greatly impact the number of bugs found across the programs. In fact, the impact of employing an effective strategy is comparable to that of utilizing a state-of-the-art fuzzer. The consid… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  15. arXiv:2406.16957  [pdf, other

    eess.SP

    Towards a data-driven and scalable approach for window operation detection in multi-family residential buildings

    Authors: Juliet Nwagwu Ume-Ezeoke, Kopal Nihar, Catherine Gorle, Rishee Jain

    Abstract: Natural cooling, utilizing non-mechanical cooling, presents a low-carbon and low-cost way to provide thermal comfort in residential buildings. However, designing naturally cooled buildings requires a clear understanding of how opening and closing windows affect occupants' comfort. Predicting when and why occupants open windows is a challenging task, often relying on specialized sensors and buildin… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  16. arXiv:2406.15754  [pdf, other

    cs.CV cs.CL cs.LG cs.SD eess.AS

    Multimodal Segmentation for Vocal Tract Modeling

    Authors: Rishi Jain, Bohan Yu, Peter Wu, Tejas Prabhune, Gopala Anumanchipalli

    Abstract: Accurate modeling of the vocal tract is necessary to construct articulatory representations for interpretable speech processing and linguistics. However, vocal tract modeling is challenging because many internal articulators are occluded from external motion capture technologies. Real-time magnetic resonance imaging (RT-MRI) allows measuring precise movements of internal articulators during speech… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: Interspeech 2024

  17. arXiv:2406.15563  [pdf, ps, other

    cs.DS cs.DM

    Exponential Time Approximation for Coloring 3-Colorable Graphs

    Authors: Venkatesan Guruswami, Rhea Jain

    Abstract: The problem of efficiently coloring $3$-colorable graphs with few colors has received much attention on both the algorithmic and inapproximability fronts. We consider exponential time approximations, in which given a parameter $r$, we aim to develop an $r$-approximation algorithm with the best possible runtime, providing a tradeoff between runtime and approximation ratio. In this vein, an algorith… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  18. arXiv:2406.14290  [pdf, ps, other

    cs.CY cs.SI

    Examining the Implications of Deepfakes for Election Integrity

    Authors: Hriday Ranka, Mokshit Surana, Neel Kothari, Veer Pariawala, Pratyay Banerjee, Aditya Surve, Sainath Reddy Sankepally, Raghav Jain, Jhagrut Lalwani, Swapneel Mehta

    Abstract: It is becoming cheaper to launch disinformation operations at scale using AI-generated content, in particular 'deepfake' technology. We have observed instances of deepfakes in political campaigns, where generated content is employed to both bolster the credibility of certain narratives (reinforcing outcomes) and manipulate public perception to the detriment of targeted candidates or causes (advers… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted at the AAAI 2024 conference, AI for Credible Elections Workshop-AI4CE 2024

  19. arXiv:2406.09574  [pdf, other

    cs.LG

    Online Bandit Learning with Offline Preference Data

    Authors: Akhil Agnihotri, Rahul Jain, Deepak Ramachandran, Zheng Wen

    Abstract: Reinforcement Learning with Human Feedback (RLHF) is at the core of fine-tuning methods for generative AI models for language and images. Such feedback is often sought as rank or preference feedback from human raters, as opposed to eliciting scores since the latter tends to be very noisy. On the other hand, RL theory and algorithms predominantly assume that a reward feedback is available. In parti… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  20. arXiv:2406.09563  [pdf, other

    cs.LG

    e-COP : Episodic Constrained Optimization of Policies

    Authors: Akhil Agnihotri, Rahul Jain, Deepak Ramachandran, Sahil Singla

    Abstract: In this paper, we present the $\texttt{e-COP}$ algorithm, the first policy optimization algorithm for constrained Reinforcement Learning (RL) in episodic (finite horizon) settings. Such formulations are applicable when there are separate sets of optimization criteria and constraints on a system's behavior. We approach this problem by first establishing a policy difference lemma for the episodic se… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  21. arXiv:2406.08354  [pdf, other

    cs.CV cs.AI cs.LG

    DocSynthv2: A Practical Autoregressive Modeling for Document Generation

    Authors: Sanket Biswas, Rajiv Jain, Vlad I. Morariu, Jiuxiang Gu, Puneet Mathur, Curtis Wigington, Tong Sun, Josep Lladós

    Abstract: While the generation of document layouts has been extensively explored, comprehensive document generation encompassing both layout and content presents a more complex challenge. This paper delves into this advanced domain, proposing a novel approach called DocSynthv2 through the development of a simple yet effective autoregressive structured model. Our model, distinct in its integration of both la… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Spotlight (Oral) Acceptance to CVPR 2024 Workshop for Graphic Design Understanding and Generation (GDUG)

  22. arXiv:2406.05344  [pdf, other

    cs.CL

    MemeGuard: An LLM and VLM-based Framework for Advancing Content Moderation via Meme Intervention

    Authors: Prince Jha, Raghav Jain, Konika Mandal, Aman Chadha, Sriparna Saha, Pushpak Bhattacharyya

    Abstract: In the digital world, memes present a unique challenge for content moderation due to their potential to spread harmful content. Although detection methods have improved, proactive solutions such as intervention are still limited, with current research focusing mostly on text-based content, neglecting the widespread influence of multimodal content like memes. Addressing this gap, we present \textit… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  23. arXiv:2406.02634  [pdf, other

    astro-ph.HE nucl-ex

    Nuclear Data to Quantify Urca Cooling in Accreting Neutron Stars

    Authors: Rahul Jain

    Abstract: Neutron stars in Low Mass X-ray Binaries (LMXBs) can accrete matter onto their surface from the companion star. Transiently accreting neutron stars go through alternating phases of active accretion outbursts and quiescence. X-ray observations during the quiescence phase show a drop in X-ray luminosity with the time in quiescence. This is also inferred as the drop in surface temperature or the cool… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: PhD Dissertation submitted to Michigan State University

  24. arXiv:2406.00833  [pdf, other

    cs.CY cs.AI

    Harvard Undergraduate Survey on Generative AI

    Authors: Shikoh Hirabayashi, Rishab Jain, Nikola Jurković, Gabriel Wu

    Abstract: How has generative AI impacted the experiences of college students? We study the influence of AI on the study habits, class choices, and career prospects of Harvard undergraduates (n=326), finding that almost 90% of students use generative AI. For roughly 25% of these students, AI has begun to substitute for attending office hours and completing required readings. Half of students are concerned th… ▽ More

    Submitted 7 August, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

  25. arXiv:2405.20648  [pdf, other

    cs.CV cs.CL cs.LG

    Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization

    Authors: Richard Luo, Austin Peng, Adithya Vasudev, Rishabh Jain

    Abstract: Video is an increasingly prominent and information-dense medium, yet it poses substantial challenges for language models. A typical video consists of a sequence of shorter segments, or shots, that collectively form a coherent narrative. Each shot is analogous to a word in a sentence where multiple data streams of information (such as visual and auditory data) must be processed simultaneously. Comp… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  26. arXiv:2405.17046  [pdf, other

    quant-ph

    Modified Six State Cryptographic Protocol with Entangled Ancilla Component States

    Authors: Rashi Jain, Satyabrata Adhikari

    Abstract: In a realistic situation, it is very difficult to communicate securely between two distant parties without introducing any disturbances. These disturbances might occur either due to external noise or may be due to the interference of an eavesdropper sitting in between the sender and the receiver. In this work, we probe here the existence of the possibility of the situation of generation of a secre… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 10 pages, 2 figures

  27. arXiv:2405.15090  [pdf, other

    cs.LG stat.ML

    Pure Exploration for Constrained Best Mixed Arm Identification with a Fixed Budget

    Authors: Dengwang Tang, Rahul Jain, Ashutosh Nayyar, Pierluigi Nuzzo

    Abstract: In this paper, we introduce the constrained best mixed arm identification (CBMAI) problem with a fixed budget. This is a pure exploration problem in a stochastic finite armed bandit model. Each arm is associated with a reward and multiple types of costs from unknown distributions. Unlike the unconstrained best arm identification problem, the optimal solution for the CBMAI problem may be a randomiz… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 7 pages, 5 figures, 1 table

  28. arXiv:2405.13333  [pdf

    cs.NI

    Service Mesh: Architectures, Applications, and Implementations

    Authors: Behrooz Farkiani, Raj Jain

    Abstract: The scalability and flexibility of microservice architecture have led to major changes in cloud-native application architectures. However, the complexity of managing thousands of small services written in different languages and handling the exchange of data between them have caused significant management challenges. Service mesh is a promising solution that could mitigate these problems by introd… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 20 pages

  29. arXiv:2405.04777  [pdf, other

    cs.CL

    Empathy Through Multimodality in Conversational Interfaces

    Authors: Mahyar Abbasian, Iman Azimi, Mohammad Feli, Amir M. Rahmani, Ramesh Jain

    Abstract: Agents represent one of the most emerging applications of Large Language Models (LLMs) and Generative AI, with their effectiveness hinging on multimodal capabilities to navigate complex user environments. Conversational Health Agents (CHAs), a prime example of this, are redefining healthcare by offering nuanced support that transcends textual analysis to incorporate emotional intelligence. This pa… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 7 pages, 2 figures, 2 tables, conference paper

  30. arXiv:2404.18353  [pdf, other

    cs.CR cs.AI cs.PL

    Do Neutral Prompts Produce Insecure Code? FormAI-v2 Dataset: Labelling Vulnerabilities in Code Generated by Large Language Models

    Authors: Norbert Tihanyi, Tamas Bisztray, Mohamed Amine Ferrag, Ridhi Jain, Lucas C. Cordeiro

    Abstract: This study provides a comparative analysis of state-of-the-art large language models (LLMs), analyzing how likely they generate vulnerabilities when writing simple C programs using a neutral zero-shot prompt. We address a significant gap in the literature concerning the security properties of code produced by these models without specific directives. N. Tihanyi et al. introduced the FormAI dataset… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  31. arXiv:2404.16870  [pdf, ps, other

    cs.CR cs.AI cs.LG

    LEMDA: A Novel Feature Engineering Method for Intrusion Detection in IoT Systems

    Authors: Ali Ghubaish, Zebo Yang, Aiman Erbad, Raj Jain

    Abstract: Intrusion detection systems (IDS) for the Internet of Things (IoT) systems can use AI-based models to ensure secure communications. IoT systems tend to have many connected devices producing massive amounts of data with high dimensionality, which requires complex models. Complex models have notorious problems such as overfitting, low interpretability, and high computational complexity. Adding model… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  32. arXiv:2404.16725  [pdf, ps, other

    cs.DS

    Approximation Algorithms for Hop Constrained and Buy-at-Bulk Network Design via Hop Constrained Oblivious Routing

    Authors: Chandra Chekuri, Rhea Jain

    Abstract: We consider two-cost network design models in which edges of the input graph have an associated cost and length. We build upon recent advances in hop-constrained oblivious routing to obtain two sets of results. We address multicommodity buy-at-bulk network design in the nonuniform setting. Existing poly-logarithmic approximations are based on the junction tree approach [CHKS09,KN11]. We obtain a… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  33. arXiv:2404.11283  [pdf, ps, other

    quant-ph

    Robust and composable device-independent quantum protocols for oblivious transfer and bit commitment

    Authors: Rishabh Batra, Sayantan Chakraborty, Rahul Jain, Upendra Kapshikar

    Abstract: We present robust and composable device-independent quantum protocols for oblivious transfer (OT) and bit commitment (BC) using Magic Square devices. We assume there is no long-term quantum memory, that is, after a finite time interval, referred to as \textbf{DELAY}, the states stored in the devices decohere. By robustness, which is a highlight of our protocols, we mean that the protocols are corr… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  34. arXiv:2404.06768  [pdf, ps, other

    cs.IT math.RA

    A new approach to construct minimal linear codes over $\mathbb{F}_{3}$

    Authors: Wajid M. Shaikh, Rupali S. Jain, B. Surendranath Reddy, Bhagyashri S. Patil, Sahar M. A. Maqbol

    Abstract: In this article, we present two new approaches to construct minimal linear codes of dimension $n+1$ over $\mathbb{F}_{3}$ using characteristic and ternary functions. We also obtain the weight distributions of these constructed minimal linear codes. We further show that a specific class of these codes violates Ashikhmin-Barg condition.

    Submitted 10 April, 2024; originally announced April 2024.

    Journal ref: MJMS-2024-0154

  35. arXiv:2404.03220  [pdf, ps, other

    quant-ph cs.CR

    Commitments are equivalent to one-way state generators

    Authors: Rishabh Batra, Rahul Jain

    Abstract: One-way state generators (OWSG) are natural quantum analogs to classical one-way functions. We show that $O\left(\frac{n}{\log(n)}\right)$-copy OWSGs ($n$ represents the input length) are equivalent to $poly(n)$-copy OWSG and to quantum commitments. Since known results show that $o\left(\frac{n}{\log(n)}\right)$-copy OWSG cannot imply commitments, this shows that $O\left(\frac{n}{\log(n)}\right)$-… ▽ More

    Submitted 17 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: minor changes to previous version

  36. arXiv:2404.03150  [pdf, other

    cs.CL cs.AI

    NLP at UC Santa Cruz at SemEval-2024 Task 5: Legal Answer Validation using Few-Shot Multi-Choice QA

    Authors: Anish Pahilajani, Samyak Rajesh Jain, Devasha Trivedi

    Abstract: This paper presents our submission to the SemEval 2024 Task 5: The Legal Argument Reasoning Task in Civil Procedure. We present two approaches to solving the task of legal answer validation, given an introduction to the case, a question and an answer candidate. Firstly, we fine-tuned pre-trained BERT-based models and found that models trained on domain knowledge perform better. Secondly, we perfor… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  37. arXiv:2404.00477  [pdf, other

    cs.LG cs.AR

    DE-HNN: An effective neural model for Circuit Netlist representation

    Authors: Zhishang Luo, Truong Son Hy, Puoya Tabaghi, Donghyeon Koh, Michael Defferrard, Elahe Rezaei, Ryan Carey, Rhett Davis, Rajeev Jain, Yusu Wang

    Abstract: The run-time for optimization tools used in chip design has grown with the complexity of designs to the point where it can take several days to go through one design cycle which has become a bottleneck. Designers want fast tools that can quickly give feedback on a design. Using the input and output data of the tools from past designs, one can attempt to build a machine learning model that predicts… ▽ More

    Submitted 16 April, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

  38. arXiv:2403.16466  [pdf, ps, other

    quant-ph

    One-Shot Non-Catalytic Distributed Purity Distillation

    Authors: Sayantan Chakraborty, Rahul Jain, Pranab Sen

    Abstract: Pure states are an important resource in many quantum information processing protocols. However, even making a fixed pure state, say $|0\rangle$, in the laboratory requires a considerable amount of effort. Often one ends up with a mixed state $ρ$ whose classical description is nevertheless known. Hence it is important to develop protocols that extract a fixed pure state from a known mixed state. I… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  39. arXiv:2403.15547  [pdf, ps, other

    cs.DS

    Approximation Algorithms for Network Design in Non-Uniform Fault Models

    Authors: Chandra Chekuri, Rhea Jain

    Abstract: The Survivable Network Design problem (SNDP) is a well-studied problem, motivated by the design of networks that are robust to faults under the assumption that any subset of edges up to a specific number can fail. We consider non-uniform fault models where the subset of edges that fail can be specified in different ways. Our primary interest is in the flexible graph connectivity model, in which th… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: A preliminary version of this paper appeared in the Proc. of ICALP 2023 (10.4230/LIPIcs.ICALP.2023.36), which combines and extends results from two earlier versions: arXiv:2209.12273 (for the first set of results) and arXiv:2211.08324 (for the second set of results)

  40. arXiv:2403.14416  [pdf, other

    quant-ph cs.IT

    Quantum Channel Simulation in Fidelity is no more difficult than State Splitting

    Authors: Michael X. Cao, Rahul Jain, Marco Tomamichel

    Abstract: Characterizing the minimal communication needed for the quantum channel simulation is a fundamental task in the quantum information theory. In this paper, we show that, in fidelity, the quantum channel simulation can be directly achieved via quantum state splitting without using a technique known as the de~Finetti reduction, and thus provide a pair of tighter one-shot bounds. Using the bounds, we… ▽ More

    Submitted 24 June, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

  41. arXiv:2403.13350  [pdf, ps, other

    cs.IT math.RA

    Construction of Minimal Binary Linear Codes of dimension $n+3$

    Authors: Wajid M. Shaikh, Rupali S. Jain, B. Surendranath Reddy, Bhagyashri S. Patil

    Abstract: In this paper, we will give the generic construction of a binary linear code of dimension $n+3$ and derive the necessary and sufficient conditions for the constructed code to be minimal. Using generic construction, a new family of minimal binary linear code will be constructed from a special class of Boolean functions violating the Ashikhmin-Barg condition. We also obtain the weight distribution o… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    MSC Class: 94B05; 94C10; 94A60

  42. arXiv:2403.13106  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    Knowing Your Nonlinearities: Shapley Interactions Reveal the Underlying Structure of Data

    Authors: Divyansh Singhvi, Andrej Erkelens, Raghav Jain, Diganta Misra, Naomi Saphra

    Abstract: Measuring nonlinear feature interaction is an established approach to understanding complex patterns of attribution in many models. In this paper, we use Shapley Taylor interaction indices (STII) to analyze the impact of underlying data structure on model representations in a variety of modalities, tasks, and architectures. Considering linguistic structure in masked and auto-regressive language mo… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  43. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1110 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 8 August, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  44. arXiv:2403.01317  [pdf, other

    cs.LG cs.AR

    Less is More: Hop-Wise Graph Attention for Scalable and Generalizable Learning on Circuits

    Authors: Chenhui Deng, Zichao Yue, Cunxi Yu, Gokce Sarar, Ryan Carey, Rajeev Jain, Zhiru Zhang

    Abstract: While graph neural networks (GNNs) have gained popularity for learning circuit representations in various electronic design automation (EDA) tasks, they face challenges in scalability when applied to large graphs and exhibit limited generalizability to new designs. These limitations make them less practical for addressing large-scale, complex circuit problems. In this work we propose HOGA, a novel… ▽ More

    Submitted 10 April, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

    Comments: Published as a conference paper at Design Automation Conference (DAC) 2024

  45. arXiv:2403.00781  [pdf, other

    cs.IR cs.AI cs.LG cs.MM

    ChatDiet: Empowering Personalized Nutrition-Oriented Food Recommender Chatbots through an LLM-Augmented Framework

    Authors: Zhongqi Yang, Elahe Khatibi, Nitish Nagesh, Mahyar Abbasian, Iman Azimi, Ramesh Jain, Amir M. Rahmani

    Abstract: The profound impact of food on health necessitates advanced nutrition-oriented food recommendation services. Conventional methods often lack the crucial elements of personalization, explainability, and interactivity. While Large Language Models (LLMs) bring interpretability and explainability, their standalone use falls short of achieving true personalization. In this paper, we introduce ChatDiet,… ▽ More

    Submitted 16 March, 2024; v1 submitted 18 February, 2024; originally announced March 2024.

    Comments: Accepted by The IEEE/ACM international conference on Connected Health: Applications, Systems and Engineering Technologies (CHASE) 2024

  46. arXiv:2403.00141  [pdf, other

    cs.CL cs.AI

    EROS: Entity-Driven Controlled Policy Document Summarization

    Authors: Joykirat Singh, Sehban Fazili, Rohan Jain, Md Shad Akhtar

    Abstract: Privacy policy documents have a crucial role in educating individuals about the collection, usage, and protection of users' personal data by organizations. However, they are notorious for their lengthy, complex, and convoluted language especially involving privacy-related entities. Hence, they pose a significant challenge to users who attempt to comprehend organization's data usage policy. In this… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: Accepted in LREC-COLING 2024

  47. arXiv:2402.11477  [pdf, other

    cs.CY

    Studying Differential Mental Health Expressions in India

    Authors: Khushi Shelat, Sunny Rai, Devansh R Jain, Kishen Sivabalan, Young Min Cho, Maitreyi Redkar, Samindara Sawant, Sharath Chandra Guntuku

    Abstract: Psychosocial stressors and the symptomatology of mental disorders vary across cultures. However, current understandings of mental health expressions on social media are predominantly derived from studies in WEIRD (Western, Educated, Industrialized, Rich, and Democratic) contexts. In this paper, we analyze mental health posts on Reddit made by individuals in India, to identify variations in online… ▽ More

    Submitted 16 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  48. arXiv:2402.11257  [pdf, other

    math.RA

    Construction of Linear Codes from the Unit Graph $G(\mathbb{Z}_{n}\oplus \mathbb{Z}_{m})$

    Authors: Wajid M. Shaikh, Rupali S. Jain, B. Surendranath Reddy

    Abstract: In this paper, we develop the python code for generating unit graph $G(\mathbb{Z}_{n}\oplus\mathbb{Z}_{m})$, for any integers $m\ \& \ n$. For any prime $r$, we construct $r$-ary linear codes from the incidence matrix of the unit graph $G(\mathbb{Z}_{n}\oplus\mathbb{Z}_{m})$, where $n \ \& \ m$ are either power of prime or product of power of primes. We also prove the minimum distance of dual of t… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

    MSC Class: 94B05; 94C10; 94A60; 05C50; 05C38

  49. arXiv:2402.10400  [pdf, other

    cs.CL

    Chain of Logic: Rule-Based Reasoning with Large Language Models

    Authors: Sergio Servantez, Joe Barrow, Kristian Hammond, Rajiv Jain

    Abstract: Rule-based reasoning, a fundamental type of legal reasoning, enables us to draw conclusions by accurately applying a rule to a set of facts. We explore causal language models as rule-based reasoners, specifically with respect to compositional rules - rules consisting of multiple elements which form a complex logical expression. Reasoning about compositional rules is challenging because it requires… ▽ More

    Submitted 23 February, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  50. arXiv:2402.10153  [pdf, other

    cs.CL

    Knowledge-Infused LLM-Powered Conversational Health Agent: A Case Study for Diabetes Patients

    Authors: Mahyar Abbasian, Zhongqi Yang, Elahe Khatibi, Pengfei Zhang, Nitish Nagesh, Iman Azimi, Ramesh Jain, Amir M. Rahmani

    Abstract: Effective diabetes management is crucial for maintaining health in diabetic patients. Large Language Models (LLMs) have opened new avenues for diabetes management, facilitating their efficacy. However, current LLM-based approaches are limited by their dependence on general sources and lack of integration with domain-specific knowledge, leading to inaccurate responses. In this paper, we propose a k… ▽ More

    Submitted 28 February, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: 4 pages, 3 figures, and 2 tables, conference paper