Skip to main content

Showing 1–50 of 68 results for author: Divesh

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.05435  [pdf, other

    cs.DS

    Polynomial Time Algorithms for Integer Programming and Unbounded Subset Sum in the Total Regime

    Authors: Divesh Aggarwal, Antoine Joux, Miklos Santha, Karol Węgrzycki

    Abstract: The Unbounded Subset Sum (USS) problem is an NP-hard computational problem where the goal is to decide whether there exist non-negative integers $x_1, \ldots, x_n$ such that $x_1 a_1 + \ldots + x_n a_n = b$, where $a_1 < \cdots < a_n < b$ are distinct positive integers with $\text{gcd}(a_1, \ldots, a_n)$ dividing $b$. The problem can be solved in pseudopolynomial time, while specialized cases, suc… ▽ More

    Submitted 11 July, 2024; v1 submitted 7 July, 2024; originally announced July 2024.

    Comments: 12 pages

  2. arXiv:2405.13000  [pdf, other

    cs.CL cs.AI cs.IR

    RAGE Against the Machine: Retrieval-Augmented LLM Explanations

    Authors: Joel Rorseth, Parke Godfrey, Lukasz Golab, Divesh Srivastava, Jaroslaw Szlichta

    Abstract: This paper demonstrates RAGE, an interactive tool for explaining Large Language Models (LLMs) augmented with retrieval capabilities; i.e., able to query external sources and pull relevant information into their input context. Our explanations are counterfactual in the sense that they identify parts of the input context that, when removed, change the answer to the question posed to the LLM. RAGE in… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: Accepted by ICDE 2024 (Demonstration Track)

  3. arXiv:2404.07354  [pdf, other

    cs.DB cs.CY cs.LG

    FairEM360: A Suite for Responsible Entity Matching

    Authors: Nima Shahbazi, Mahdi Erfanian, Abolfazl Asudeh, Fatemeh Nargesian, Divesh Srivastava

    Abstract: Entity matching is one the earliest tasks that occur in the big data pipeline and is alarmingly exposed to unintentional biases that affect the quality of data. Identifying and mitigating the biases that exist in the data or are introduced by the matcher at this stage can contribute to promoting fairness in downstream tasks. This demonstration showcases FairEM360, a framework for 1) auditing the o… ▽ More

    Submitted 18 July, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

  4. arXiv:2403.00526  [pdf, other

    cs.DB

    Data Quality Assessment: Challenges and Opportunities

    Authors: Sedir Mohammed, Hazar Harmouch, Felix Naumann, Divesh Srivastava

    Abstract: Data-oriented applications, their users, and even the law require data of high quality. Research has broken down the rather vague notion of data quality into various dimensions, such as accuracy, consistency, and reputation, to name but a few. To achieve the goal of high data quality, many tools and techniques exist to clean and otherwise improve data. Yet, systematic research on actually assessin… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  5. arXiv:2402.14863  [pdf, other

    cs.CL

    Evaluation of a semi-autonomous attentive listening system with takeover prompting

    Authors: Haruki Kawai, Divesh Lala, Koji Inoue, Keiko Ochi, Tatsuya Kawahara

    Abstract: The handling of communication breakdowns and loss of engagement is an important aspect of spoken dialogue systems, particularly for chatting systems such as attentive listening, where the user is mostly speaking. We presume that a human is best equipped to handle this task and rescue the flow of conversation. To this end, we propose a semi-autonomous system, where a remote operator can take contro… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  6. arXiv:2402.12770  [pdf, other

    cs.CL

    Acknowledgment of Emotional States: Generating Validating Responses for Empathetic Dialogue

    Authors: Zi Haur Pang, Yahui Fu, Divesh Lala, Keiko Ochi, Koji Inoue, Tatsuya Kawahara

    Abstract: In the realm of human-AI dialogue, the facilitation of empathetic responses is important. Validation is one of the key communication techniques in psychology, which entails recognizing, understanding, and acknowledging others' emotional states, thoughts, and actions. This study introduces the first framework designed to engender empathetic dialogue with validating responses. Our approach incorpora… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: This paper has been accepted for presentation at International Workshop on Spoken Dialogue Systems Technology 2024 (IWSDS 2024)

  7. arXiv:2401.04867  [pdf, other

    cs.CL cs.AI cs.HC

    An Analysis of User Behaviors for Objectively Evaluating Spoken Dialogue Systems

    Authors: Koji Inoue, Divesh Lala, Keiko Ochi, Tatsuya Kawahara, Gabriel Skantze

    Abstract: Establishing evaluation schemes for spoken dialogue systems is important, but it can also be challenging. While subjective evaluations are commonly used in user experiments, objective evaluations are necessary for research comparison and reproducibility. To address this issue, we propose a framework for indirectly but objectively evaluating systems based on users' behaviors. In this paper, to this… ▽ More

    Submitted 23 January, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

    Comments: This paper has been accepted for presentation at International Workshop on Spoken Dialogue Systems Technology 2024 (IWSDS 2024) and represents the author's version of the work

  8. arXiv:2311.15064  [pdf, other

    cs.DS

    Recursive lattice reduction -- A framework for finding short lattice vectors

    Authors: Divesh Aggarwal, Thomas Espitau, Spencer Peters, Noah Stephens-Davidowitz

    Abstract: We propose a new framework called recursive lattice reduction for finding short non-zero vectors in a lattice or for finding dense sublattices of a lattice. At a high level, the framework works by recursively searching for dense sublattices of dense sublattices (or their duals). Eventually, the procedure encounters a recursive call on a lattice $\mathcal{L}$ with relatively low rank $k$, at which… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

  9. arXiv:2308.11020  [pdf, other

    cs.CL cs.HC cs.RO

    Towards Objective Evaluation of Socially-Situated Conversational Robots: Assessing Human-Likeness through Multimodal User Behaviors

    Authors: Koji Inoue, Divesh Lala, Keiko Ochi, Tatsuya Kawahara, Gabriel Skantze

    Abstract: This paper tackles the challenging task of evaluating socially situated conversational robots and presents a novel objective evaluation approach that relies on multimodal user behaviors. In this study, our main focus is on assessing the human-likeness of the robot as the primary evaluation metric. While previous research often relied on subjective evaluations from users, our approach aims to evalu… ▽ More

    Submitted 25 September, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted by 25th ACM International Conference on Multimodal Interaction (ICMI '23), Late-Breaking Results

  10. arXiv:2307.02726  [pdf, other

    cs.DB cs.CY cs.LG

    Through the Fairness Lens: Experimental Analysis and Evaluation of Entity Matching

    Authors: Nima Shahbazi, Nikola Danevski, Fatemeh Nargesian, Abolfazl Asudeh, Divesh Srivastava

    Abstract: Entity matching (EM) is a challenging problem studied by different communities for over half a century. Algorithmic fairness has also become a timely topic to address machine bias and its societal impacts. Despite extensive research on these two topics, little attention has been paid to the fairness of entity matching. Towards addressing this gap, we perform an extensive experimental evaluation… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: Accepted to VLDB'23

  11. CREDENCE: Counterfactual Explanations for Document Ranking

    Authors: Joel Rorseth, Parke Godfrey, Lukasz Golab, Mehdi Kargar, Divesh Srivastava, Jaroslaw Szlichta

    Abstract: Towards better explainability in the field of information retrieval, we present CREDENCE, an interactive tool capable of generating counterfactual explanations for document rankers. Embracing the unique properties of the ranking problem, we present counterfactual explanations in terms of document perturbations, query perturbations, and even other documents. Additionally, users may build and test t… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: Accepted by ICDE 2023 (Demonstration Track)

  12. arXiv:2211.11693  [pdf, other

    cs.CC cs.CR cs.DS

    Lattice Problems Beyond Polynomial Time

    Authors: Divesh Aggarwal, Huck Bennett, Zvika Brakerski, Alexander Golovnev, Rajendra Kumar, Zeyong Li, Spencer Peters, Noah Stephens-Davidowitz, Vinod Vaikuntanathan

    Abstract: We study the complexity of lattice problems in a world where algorithms, reductions, and protocols can run in superpolynomial time, revisiting four foundational results: two worst-case to average-case reductions and two protocols. We also show a novel protocol. 1. We prove that secret-key cryptography exists if $\widetilde{O}(\sqrt{n})$-approximate SVP is hard for $2^{\varepsilon n}$-time algori… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  13. arXiv:2211.08526  [pdf, other

    cs.RO

    Alzheimer's Dementia Detection through Spontaneous Dialogue with Proactive Robotic Listeners

    Authors: Yuanchao Li, Catherine Lai, Divesh Lala, Koji Inoue, Tatsuya Kawahara

    Abstract: As the aging of society continues to accelerate, Alzheimer's Disease (AD) has received more and more attention from not only medical but also other fields, such as computer science, over the past decade. Since speech is considered one of the effective ways to diagnose cognitive decline, AD detection from speech has emerged as a hot topic. Nevertheless, such approaches fail to tackle several key is… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: Accepted for HRI2022 Late-Breaking Report

  14. arXiv:2211.04568  [pdf, ps, other

    stat.AP cs.CY cs.LG

    Towards Algorithmic Fairness in Space-Time: Filling in Black Holes

    Authors: Cheryl Flynn, Aritra Guha, Subhabrata Majumdar, Divesh Srivastava, Zhengyi Zhou

    Abstract: New technologies and the availability of geospatial data have drawn attention to spatio-temporal biases present in society. For example: the COVID-19 pandemic highlighted disparities in the availability of broadband service and its role in the digital divide; the environmental justice movement in the United States has raised awareness to health implications for minority populations stemming from h… ▽ More

    Submitted 8 November, 2022; originally announced November 2022.

  15. arXiv:2211.04385  [pdf, other

    cs.CC cs.CR cs.DS

    Why we couldn't prove SETH hardness of the Closest Vector Problem for even norms!

    Authors: Divesh Aggarwal, Rajendra Kumar

    Abstract: Recent work [BGS17,ABGS19] has shown SETH hardness of CVP in the $\ell_p$ norm for any $p$ that is not an even integer. This result was shown by giving a Karp reduction from $k$-SAT on $n$ variables to CVP on a lattice of rank $n$. In this work, we show a barrier towards proving a similar result for CVP in the $\ell_p$ norm where $p$ is an even integer. We show that for any $c>0$, if for every… ▽ More

    Submitted 25 November, 2023; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: Added: Instance compression of exact-CVP

  16. arXiv:2203.12978  [pdf, other

    cs.DB cs.LG

    Effective Explanations for Entity Resolution Models

    Authors: Tommaso Teofili, Donatella Firmani, Nick Koudas, Vincenzo Martello, Paolo Merialdo, Divesh Srivastava

    Abstract: Entity resolution (ER) aims at matching records that refer to the same real-world entity. Although widely studied for the last 50 years, ER still represents a challenging data management problem, and several recent works have started to investigate the opportunity of applying deep learning (DL) techniques to solve this problem. In this paper, we study the fundamental problem of explainability of t… ▽ More

    Submitted 1 April, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

  17. arXiv:2202.13354  [pdf, ps, other

    cs.CR quant-ph

    Quantum secure non-malleable codes in the split-state model

    Authors: Divesh Aggarwal, Naresh Goud Boddu, Rahul Jain

    Abstract: Non-malleable-codes introduced by Dziembowski, Pietrzak and Wichs [DPW18] encode a classical message $S$ in a manner such that tampering the codeword results in the decoder either outputting the original message $S$ or a message that is unrelated/independent of $S$. Providing such non-malleable security for various tampering function families has received significant attention in recent years. We… ▽ More

    Submitted 8 June, 2023; v1 submitted 27 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: text overlap with arXiv:2109.03097. text overlap with arXiv:1611.09248 by other authors

  18. arXiv:2111.04157  [pdf, ps, other

    cs.IT cs.CR

    Extractors: Low Entropy Requirements Colliding With Non-Malleability

    Authors: Divesh Aggarwal, Eldon Chung, Maciej Obremski

    Abstract: The known constructions of negligible error (non-malleable) two-source extractors can be broadly classified in three categories: (1) Constructions where one source has min-entropy rate about $1/2$, the other source can have small min-entropy rate, but the extractor doesn't guarantee non-malleability. (2) Constructions where one source is uniform, and the other can have small min-entropy rate,… ▽ More

    Submitted 9 June, 2023; v1 submitted 7 November, 2021; originally announced November 2021.

  19. Real-World Trajectory Sharing with Local Differential Privacy

    Authors: Teddy Cunningham, Graham Cormode, Hakan Ferhatosmanoglu, Divesh Srivastava

    Abstract: Sharing trajectories is beneficial for many real-world applications, such as managing disease spread through contact tracing and tailoring public services to a population's travel patterns. However, public concern over privacy and data protection has limited the extent to which this data is shared. Local differential privacy enables data sharing in which users share a perturbed version of their da… ▽ More

    Submitted 4 August, 2021; originally announced August 2021.

    Journal ref: PVLDB, 14(11): 2283 - 2295, 2021

  20. arXiv:2106.02766  [pdf, ps, other

    cs.CR quant-ph

    Quantum Measurement Adversary

    Authors: Divesh Aggarwal, Naresh Goud Boddu, Rahul Jain, Maciej Obremski

    Abstract: Multi-source-extractors are functions that extract uniform randomness from multiple (weak) sources of randomness. Quantum multi-source-extractors were considered by Kasher and Kempe (for the quantum-independent-adversary and the quantum-bounded-storage-adversary), Chung, Li and Wu (for the general-entangled-adversary) and Arnon-Friedman, Portmann and Scholz (for the quantum-Markov-adversary). One… ▽ More

    Submitted 6 June, 2023; v1 submitted 4 June, 2021; originally announced June 2021.

  21. arXiv:2106.02325  [pdf, other

    cs.CL cs.HC

    ERICA: An Empathetic Android Companion for Covid-19 Quarantine

    Authors: Etsuko Ishii, Genta Indra Winata, Samuel Cahyawijaya, Divesh Lala, Tatsuya Kawahara, Pascale Fung

    Abstract: Over the past year, research in various domains, including Natural Language Processing (NLP), has been accelerated to fight against the COVID-19 pandemic, yet such research has just started on dialogue systems. In this paper, we introduce an end-to-end dialogue system which aims to ease the isolation of people under self-quarantine. We conduct a control simulation experiment to assess the effects… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: Accepted in SIGDIAL 2021

  22. arXiv:2105.06058  [pdf, other

    cs.DB

    DataExposer: Exposing Disconnect between Data and Systems

    Authors: Sainyam Galhotra, Anna Fariha, Raoni Lourenço, Juliana Freire, Alexandra Meliou, Divesh Srivastava

    Abstract: As data is a central component of many modern systems, the cause of a system malfunction may reside in the data, and, specifically, particular properties of the data. For example, a health-monitoring system that is designed under the assumption that weight is reported in imperial units (lbs) will malfunction when encountering weight reported in metric units (kilograms). Similar to software debuggi… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

  23. arXiv:2105.00403  [pdf, other

    cs.CL cs.AI cs.RO

    Intelligent Conversational Android ERICA Applied to Attentive Listening and Job Interview

    Authors: Tatsuya Kawahara, Koji Inoue, Divesh Lala

    Abstract: Following the success of spoken dialogue systems (SDS) in smartphone assistants and smart speakers, a number of communicative robots are developed and commercialized. Compared with the conventional SDSs designed as a human-machine interface, interaction with robots is expected to be in a closer manner to talking to a human because of the anthropomorphism and physical presence. The goal or task of… ▽ More

    Submitted 2 May, 2021; originally announced May 2021.

    Comments: 7 pages, 5 figures, 1 table

  24. arXiv:2104.06576  [pdf, ps, other

    cs.DS cs.CR

    Dimension-Preserving Reductions Between SVP and CVP in Different $p$-Norms

    Authors: Divesh Aggarwal, Yanlin Chen, Rajendra Kumar, Zeyong Li, Noah Stephens-Davidowitz

    Abstract: $ \newcommand{\SVP}{\textsf{SVP}} \newcommand{\CVP}{\textsf{CVP}} \newcommand{\eps}{\varepsilon} $We show a number of reductions between the Shortest Vector Problem and the Closest Vector Problem over lattices in different $\ell_p$ norms ($\SVP_p$ and $\CVP_p$ respectively). Specifically, we present the following $2^{\eps m}$-time reductions for $1 \leq p \leq q \leq \infty… ▽ More

    Submitted 13 April, 2021; originally announced April 2021.

  25. arXiv:2101.11259  [pdf, other

    cs.DB

    Alaska: A Flexible Benchmark for Data Integration Tasks

    Authors: Valter Crescenzi, Andrea De Angelis, Donatella Firmani, Maurizio Mazzei, Paolo Merialdo, Federico Piai, Divesh Srivastava

    Abstract: Data integration is a long-standing interest of the data management community and has many disparate applications, including business, science and government. We have recently witnessed impressive results in specific data integration tasks, such as Entity Resolution, thanks to the increasing availability of benchmarks. A limitation of such benchmarks is that they typically come with their own task… ▽ More

    Submitted 3 February, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

  26. arXiv:2101.02174  [pdf, other

    cs.DB

    Efficient Discovery of Approximate Order Dependencies

    Authors: Reza Karegar, Parke Godfrey, Lukasz Golab, Mehdi Kargar, Divesh Srivastava, Jaroslaw Szlichta

    Abstract: Order dependencies (ODs) capture relationships between ordered domains of attributes. Approximate ODs (AODs) capture such relationships even when there exist exceptions in the data. During automated discovery of ODs, validation is the process of verifying whether an OD holds. We present an algorithm for validating approximate ODs with significantly improved runtime performance over existing method… ▽ More

    Submitted 6 January, 2021; originally announced January 2021.

  27. arXiv:2012.04117  [pdf, other

    cs.CR

    Local Dampening: Differential Privacy for Non-numeric Queries via Local Sensitivity

    Authors: Victor A. E. Farias, Felipe T. Brito, Cheryl Flynn, Javam C. Machado, Subhabrata Majumdar, Divesh Srivastava

    Abstract: Differential privacy is the state-of-the-art formal definition for data release under strong privacy guarantees. A variety of mechanisms have been proposed in the literature for releasing the output of numeric queries (e.g., the Laplace mechanism and smooth sensitivity mechanism). Those mechanisms guarantee differential privacy by adding noise to the true query's output. The amount of noise added… ▽ More

    Submitted 14 April, 2022; v1 submitted 7 December, 2020; originally announced December 2020.

  28. arXiv:2010.08886  [pdf, other

    cs.PL cs.LG

    PPL Bench: Evaluation Framework For Probabilistic Programming Languages

    Authors: Sourabh Kulkarni, Kinjal Divesh Shah, Nimar Arora, Xiaoyan Wang, Yucen Lily Li, Nazanin Khosravani Tehrani, Michael Tingley, David Noursi, Narjes Torabi, Sepehr Akhavan Masouleh, Eric Lippert, Erik Meijer

    Abstract: We introduce PPL Bench, a new benchmark for evaluating Probabilistic Programming Languages (PPLs) on a variety of statistical models. The benchmark includes data generation and evaluation code for a number of models as well as implementations in some common PPLs. All of the benchmark code and PPL implementations are available on Github. We welcome contributions of new models and PPLs and as well a… ▽ More

    Submitted 17 October, 2020; originally announced October 2020.

    Comments: 6 pages, PROBPROG 2020

  29. arXiv:2007.09556  [pdf, ps, other

    cs.DS

    A $2^{n/2}$-Time Algorithm for $\sqrt{n}$-SVP and $\sqrt{n}$-Hermite SVP, and an Improved Time-Approximation Tradeoff for (H)SVP

    Authors: Divesh Aggarwal, Zeyong Li, Noah Stephens-Davidowitz

    Abstract: We show a $2^{n/2+o(n)}$-time algorithm that finds a (non-zero) vector in a lattice $\mathcal{L} \subset \mathbb{R}^n$ with norm at most $\tilde{O}(\sqrt{n})\cdot \min\{λ_1(\mathcal{L}), \det(\mathcal{L})^{1/n}\}$, where $λ_1(\mathcal{L})$ is the length of a shortest non-zero lattice vector and $\det(\mathcal{L})$ is the lattice determinant. Minkowski showed that… ▽ More

    Submitted 18 July, 2020; originally announced July 2020.

  30. Efficient and Effective ER with Progressive Blocking

    Authors: Sainyam Galhotra, Donatella Firmani, Barna Saha, Divesh Srivastava

    Abstract: Blocking is a mechanism to improve the efficiency of Entity Resolution (ER) which aims to quickly prune out all non-matching record pairs. However, depending on the distributions of entity cluster sizes, existing techniques can be either (a) too aggressive, such that they help scale but can adversely affect the ER effectiveness, or (b) too permissive, potentially harming ER efficiency. In this pap… ▽ More

    Submitted 16 March, 2021; v1 submitted 28 May, 2020; originally announced May 2020.

    Comments: Galhotra, S., Firmani, D., Saha, B. et al. Efficient and effective ER with progressive blocking. The VLDB Journal (2021)

  31. arXiv:2005.14068  [pdf, other

    cs.DB

    Discovering Domain Orders through Order Dependencies

    Authors: Reza Karegar, Melicaalsadat Mirsafian, Parke Godfrey, Lukasz Golab, Mehdi Kargar, Divesh Srivastava, Jaroslaw Szlichta

    Abstract: Much real-world data come with explicitly defined domain orders; e.g., lexicographic order for strings, numeric for integers, and chronological for time. Our goal is to discover implicit domain orders that we do not already know; for instance, that the order of months in the Chinese Lunar calendar is Corner < Apricot < Peach. To do so, we enhance data profiling methods by discovering implicit doma… ▽ More

    Submitted 7 September, 2021; v1 submitted 28 May, 2020; originally announced May 2020.

  32. arXiv:2005.11654  [pdf, ps, other

    cs.CC cs.DS

    A Note on the Concrete Hardness of the Shortest Independent Vectors Problem in Lattices

    Authors: Divesh Aggarwal, Eldon Chung

    Abstract: Blömer and Seifert showed that $\mathsf{SIVP}_2$ is NP-hard to approximate by giving a reduction from $\mathsf{CVP}_2$ to $\mathsf{SIVP}_2$ for constant approximation factors as long as the $\mathsf{CVP}$ instance has a certain property. In order to formally define this requirement on the $\mathsf{CVP}$ instance, we introduce a new computational problem called the Gap Closest Vector Problem with B… ▽ More

    Submitted 31 October, 2020; v1 submitted 24 May, 2020; originally announced May 2020.

  33. arXiv:2004.04908  [pdf, ps, other

    cs.CL

    Designing Precise and Robust Dialogue Response Evaluators

    Authors: Tianyu Zhao, Divesh Lala, Tatsuya Kawahara

    Abstract: Automatic dialogue response evaluator has been proposed as an alternative to automated metrics and human evaluation. However, existing automatic evaluators achieve only moderate correlation with human judgement and they are not robust. In this work, we propose to build a reference-free evaluator and exploit the power of semi-supervised training and pretrained (masked) language models. Experimental… ▽ More

    Submitted 24 April, 2020; v1 submitted 10 April, 2020; originally announced April 2020.

    Comments: Accepted at ACL 2020

  34. arXiv:2002.07955  [pdf, ps, other

    cs.DS cs.CR

    Improved Classical and Quantum Algorithms for the Shortest Vector Problem via Bounded Distance Decoding

    Authors: Divesh Aggarwal, Yanlin Chen, Rajendra Kumar, Yixin Shen

    Abstract: The most important computational problem on lattices is the Shortest Vector Problem (SVP). In this paper, we present new algorithms that improve the state-of-the-art for provable classical/quantum algorithms for SVP. We present the following results. $\bullet$ A new algorithm for SVP that provides a smooth tradeoff between time complexity and memory requirement. For any positive integer… ▽ More

    Submitted 10 May, 2022; v1 submitted 18 February, 2020; originally announced February 2020.

    Comments: Faster Quantum Algorithm for SVP in QRAM, 42 pages

  35. arXiv:2001.05567  [pdf, other

    cs.LG stat.ML

    Newtonian Monte Carlo: single-site MCMC meets second-order gradient methods

    Authors: Nimar S. Arora, Nazanin Khosravani Tehrani, Kinjal Divesh Shah, Michael Tingley, Yucen Lily Li, Narjes Torabi, David Noursi, Sepehr Akhavan Masouleh, Eric Lippert, Erik Meijer

    Abstract: Single-site Markov Chain Monte Carlo (MCMC) is a variant of MCMC in which a single coordinate in the state space is modified in each step. Structured relational models are a good candidate for this style of inference. In the single-site context, second order methods become feasible because the typical cubic costs associated with these methods is now restricted to the dimension of each coordinate.… ▽ More

    Submitted 15 January, 2020; originally announced January 2020.

    Comments: StarAI has a 6 page limit excluding references

  36. arXiv:1911.02440  [pdf, other

    cs.CC cs.DS

    Fine-grained hardness of CVP(P) -- Everything that we can prove (and nothing else)

    Authors: Divesh Aggarwal, Huck Bennett, Alexander Golovnev, Noah Stephens-Davidowitz

    Abstract: We show a number of fine-grained hardness results for the Closest Vector Problem in the $\ell_p$ norm ($\mathrm{CVP}_p$), and its approximate and non-uniform variants. First, we show that $\mathrm{CVP}_p$ cannot be solved in $2^{(1-\varepsilon)n}$ time for all $p \notin 2\mathbb{Z}$ and $\varepsilon > 0$, assuming the Strong Exponential Time Hypothesis (SETH). Second, we extend this by showing tha… ▽ More

    Submitted 7 August, 2021; v1 submitted 6 November, 2019; originally announced November 2019.

  37. arXiv:1910.08678  [pdf, other

    cs.DB

    Effective Discovery of Meaningful Outlier Relationships

    Authors: Aline Bessa, Juliana Freire, Divesh Srivastava, Tamraparni Dasu

    Abstract: We propose PODS (Predictable Outliers in Data-trendS), a method that, given a collection of temporal data sets, derives data-driven explanations for outliers by identifying meaningful relationships between them. First, we formalize the notion of meaningfulness, which so far has been informally framed in terms of explainability. Next, since outliers are rare and it is difficult to determine whether… ▽ More

    Submitted 8 April, 2020; v1 submitted 18 October, 2019; originally announced October 2019.

  38. arXiv:1909.02629  [pdf, other

    cs.DB cs.DS

    Random Sampling for Group-By Queries

    Authors: Trong Duc Nguyen, Ming-Hung Shih, Sai Sree Parvathaneni, Bojian Xu, Divesh Srivastava, Srikanta Tirthapura

    Abstract: Random sampling has been widely used in approximate query processing on large databases, due to its potential to significantly reduce resource usage and response times, at the cost of a small approximation error. We consider random sampling for answering the ubiquitous class of group-by queries, which first group data according to one or more attributes, and then aggregate within each group after… ▽ More

    Submitted 12 September, 2019; v1 submitted 5 September, 2019; originally announced September 2019.

  39. arXiv:1908.03724  [pdf, other

    cs.DS cs.CR

    Slide Reduction, Revisited---Filling the Gaps in SVP Approximation

    Authors: Divesh Aggarwal, Jianwei Li, Phong Q. Nguyen, Noah Stephens-Davidowitz

    Abstract: We show how to generalize Gama and Nguyen's slide reduction algorithm [STOC '08] for solving the approximate Shortest Vector Problem over lattices (SVP). As a result, we show the fastest provably correct algorithm for $δ$-approximate SVP for all approximation factors $n^{1/2+\varepsilon} \leq δ\leq n^{O(1)}$. This is the range of approximation factors most relevant for cryptography.

    Submitted 10 August, 2019; originally announced August 2019.

  40. arXiv:1905.11948  [pdf, other

    cs.DB

    ABC of Order Dependencies

    Authors: Pei Li, Michael Bohlen, Jaroslaw Szlichta, Divesh Srivastava

    Abstract: We enhance constrained-based data quality with approximate band conditional order dependencies (abcODs). Band ODs model the semantics of attributes that are monotonically related with small variations without there being an intrinsic violation of semantics. The class of abcODs generalizes band ODs to make them more relevant to real-world applications by relaxing them to hold approximately (abODs)… ▽ More

    Submitted 28 February, 2020; v1 submitted 28 May, 2019; originally announced May 2019.

  41. arXiv:1905.02010  [pdf, other

    cs.DB

    Errata Note: Discovering Order Dependencies through Order Compatibility

    Authors: Parke Godfrey, Lukasz Golab, Mehdi Kargar, Divesh Srivastava, Jaroslaw Szlichta

    Abstract: A number of extensions to the classical notion of functional dependencies have been proposed to express and enforce application semantics. One of these extensions is that of order dependencies (ODs), which express rules involving order. The article entitled "Discovering Order Dependencies through Order Compatibility" by Consonni et al., published in the EDBT conference proceedings in March 2019, i… ▽ More

    Submitted 6 May, 2019; originally announced May 2019.

    Comments: 5

  42. arXiv:1812.10942  [pdf, other

    cs.DB

    Answering Range Queries Under Local Differential Privacy

    Authors: Tejas Kulkarni, Graham Cormode, Divesh Srivastava

    Abstract: Counting the fraction of a population having an input within a specified interval i.e. a \emph{range query}, is a fundamental data analysis primitive. Range queries can also be used to compute other interesting statistics such as \emph{quantiles}, and to build prediction models. However, frequently the data is subject to privacy concerns when it is drawn from individuals, and relates for example t… ▽ More

    Submitted 31 December, 2018; v1 submitted 28 December, 2018; originally announced December 2018.

  43. arXiv:1801.09039  [pdf, other

    cs.DB

    Variance-Optimal Offline and Streaming Stratified Random Sampling

    Authors: Trong Duc Nguyen, Ming-Hung Shih, Divesh Srivastava, Srikanta Tirthapura, Bojian Xu

    Abstract: Stratified random sampling (SRS) is a fundamental sampling technique that provides accurate estimates for aggregate queries using a small size sample, and has been used widely for approximate query processing. A key question in SRS is how to partition a target sample size among different strata. While Neyman allocation provides a solution that minimizes the variance of an estimate using this sampl… ▽ More

    Submitted 20 February, 2018; v1 submitted 27 January, 2018; originally announced January 2018.

  44. arXiv:1801.02358  [pdf, ps, other

    cs.DS

    Improved algorithms for the Shortest Vector Problem and the Closest Vector Problem in the infinity norm

    Authors: Divesh Aggarwal, Priyanka Mukhopadhyay

    Abstract: Blomer and Naewe[BN09] modified the randomized sieving algorithm of Ajtai, Kumar and Sivakumar[AKS01] to solve the shortest vector problem (SVP). The algorithm starts with $N = 2^{O(n)}$ randomly chosen vectors in the lattice and employs a sieving procedure to iteratively obtain shorter vectors in the lattice. The running time of the sieving procedure is quadratic in $N$. We study this problem f… ▽ More

    Submitted 15 May, 2018; v1 submitted 8 January, 2018; originally announced January 2018.

    Comments: Changed the title

  45. (Gap/S)ETH Hardness of SVP

    Authors: Divesh Aggarwal, Noah Stephens-Davidowitz

    Abstract: $ \newcommand{\problem}[1]{\ensuremath{\mathrm{#1}} } \newcommand{\SVP}{\problem{SVP}} \newcommand{\ensuremath}[1]{#1} $We prove the following quantitative hardness results for the Shortest Vector Problem in the $\ell_p$ norm ($\SVP_p$), where $n$ is the rank of the input lattice. $\bullet$ For "almost all" $p > p_0 \approx 2.1397$, there no $2^{n/C_p}$-time algorithm for $\SVP_p… ▽ More

    Submitted 4 December, 2017; originally announced December 2017.

    Journal ref: STOC 2018

  46. arXiv:1711.02952  [pdf, other

    cs.DB

    Marginal Release Under Local Differential Privacy

    Authors: Tejas Kulkarni, Graham Cormode, Divesh Srivastava

    Abstract: Many analysis and machine learning tasks require the availability of marginal statistics on multidimensional datasets while providing strong privacy guarantees for the data subjects. Applications for these statistics range from finding correlations in the data to fitting sophisticated prediction models. In this paper, we provide a set of algorithms for materializing marginal statistics under the s… ▽ More

    Submitted 8 November, 2017; originally announced November 2017.

  47. arXiv:1710.00608  [pdf, other

    cs.DB cs.CR

    Constrained Differential Privacy for Count Data

    Authors: Graham Cormode, Tejas Kulkarni, Divesh Srivastava

    Abstract: Concern about how to aggregate sensitive user data without compromising individual privacy is a major barrier to greater availability of data. The model of differential privacy has emerged as an accepted model to release sensitive information while giving a statistical guarantee for privacy. Many different algorithms are possible to address different target functions. We focus on the core problem… ▽ More

    Submitted 2 October, 2017; originally announced October 2017.

  48. arXiv:1710.00557  [pdf, ps, other

    quant-ph cs.CR

    A Quantum-Proof Non-Malleable Extractor, With Application to Privacy Amplification against Active Quantum Adversaries

    Authors: Divesh Aggarwal, Kai-Min Chung, Han-Hsuan Lin, Thomas Vidick

    Abstract: In privacy amplification, two mutually trusted parties aim to amplify the secrecy of an initial shared secret $X$ in order to establish a shared private key $K$ by exchanging messages over an insecure communication channel. If the channel is authenticated the task can be solved in a single round of communication using a strong randomness extractor; choosing a quantum-proof extractor allows one to… ▽ More

    Submitted 14 February, 2018; v1 submitted 2 October, 2017; originally announced October 2017.

  49. arXiv:1709.10257  [pdf, other

    cs.HC cs.RO

    Detection of social signals for recognizing engagement in human-robot interaction

    Authors: Divesh Lala, Koji Inoue, Pierrick Milhorat, Tatsuya Kawahara

    Abstract: Detection of engagement during a conversation is an important function of human-robot interaction. The level of user engagement can influence the dialogue strategy of the robot. Our motivation in this work is to detect several behaviors which will be used as social signal inputs for a real-time engagement recognition model. These behaviors are nodding, laughter, verbal backchannels and eye gaze. W… ▽ More

    Submitted 29 September, 2017; originally announced September 2017.

    Comments: AAAI Fall Symposium on Natural Communication for Human-Robot Collaboration, 2017

  50. arXiv:1709.01535  [pdf, ps, other

    cs.DS

    Just Take the Average! An Embarrassingly Simple $2^n$-Time Algorithm for SVP (and CVP)

    Authors: Divesh Aggarwal, Noah Stephens-Davidowitz

    Abstract: We show a $2^{n+o(n)}$-time (and space) algorithm for the Shortest Vector Problem on lattices (SVP) that works by repeatedly running an embarrassingly simple "pair and average" sieving-like procedure on a list of lattice vectors. This matches the running time (and space) of the current fastest known algorithm, due to Aggarwal, Dadush, Regev, and Stephens-Davidowitz (ADRS, in STOC, 2015), with a fa… ▽ More

    Submitted 5 September, 2017; originally announced September 2017.

    Journal ref: SOSA 2018