Skip to main content

Showing 1–7 of 7 results for author: Daliri, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.07978  [pdf, other

    cs.DS cs.CL cs.LG

    Coupling without Communication and Drafter-Invariant Speculative Decoding

    Authors: Majid Daliri, Christopher Musco, Ananda Theertha Suresh

    Abstract: Suppose Alice has a distribution $P$ and Bob has a distribution $Q$. Alice wants to generate a sample $a\sim P$ and Bob a sample $b \sim Q$ such that $a = b$ with has as high of probability as possible. It is well-known that, by sampling from an optimal coupling between the distributions, Alice and Bob can achieve $Pr[a = b] = 1 - D_{TV}(P,Q)$, where $D_{TV}(P,Q)$ is the total variation distance.… ▽ More

    Submitted 19 August, 2024; v1 submitted 15 August, 2024; originally announced August 2024.

    Comments: 16 pages

  2. arXiv:2406.03482  [pdf, other

    cs.LG cs.AI cs.CL cs.PF

    QJL: 1-Bit Quantized JL Transform for KV Cache Quantization with Zero Overhead

    Authors: Amir Zandieh, Majid Daliri, Insu Han

    Abstract: Serving LLMs requires substantial memory due to the storage requirements of Key-Value (KV) embeddings in the KV cache, which grows with sequence length. An effective approach to compress KV cache is quantization. However, traditional quantization methods face significant memory overhead due to the need to store quantization constants (at least a zero point and a scale) in full precision per data b… ▽ More

    Submitted 18 July, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: 13 pages

  3. arXiv:2309.16157  [pdf, other

    cs.DB cs.DS

    Sampling Methods for Inner Product Sketching

    Authors: Majid Daliri, Juliana Freire, Christopher Musco, Aécio Santos, Haoxiang Zhang

    Abstract: Recently, Bessa et al. (PODS 2023) showed that sketches based on coordinated weighted sampling theoretically and empirically outperform popular linear sketching methods like Johnson-Lindentrauss projection and CountSketch for the ubiquitous problem of inner product estimation. We further develop this finding by introducing and analyzing two alternative sampling-based methods. In contrast to the co… ▽ More

    Submitted 22 August, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: 17 pages, 10 figures

  4. arXiv:2308.05907  [pdf, ps, other

    cs.DS cs.DB

    Simple Analysis of Priority Sampling

    Authors: Majid Daliri, Juliana Freire, Christopher Musco, Aécio Santos, Haoxiang Zhang

    Abstract: We prove a tight upper bound on the variance of the priority sampling method (aka sequential Poisson sampling). Our proof is significantly shorter and simpler than the original proof given by Mario Szegedy at STOC 2006, which resolved a conjecture by Duffield, Lund, and Thorup.

    Submitted 20 August, 2024; v1 submitted 10 August, 2023; originally announced August 2023.

    Comments: 7 pages

  5. arXiv:2302.02451  [pdf, other

    cs.LG cs.CV cs.DS

    KDEformer: Accelerating Transformers via Kernel Density Estimation

    Authors: Amir Zandieh, Insu Han, Majid Daliri, Amin Karbasi

    Abstract: Dot-product attention mechanism plays a crucial role in modern deep architectures (e.g., Transformer) for sequence modeling, however, naïve exact computation of this model incurs quadratic time and memory complexities in sequence length, hindering the training of long-sequence models. Critical bottlenecks are due to the computation of partition functions in the denominator of softmax function as w… ▽ More

    Submitted 29 June, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

    Comments: 26 pages, 7 figures

  6. arXiv:2301.05811  [pdf, other

    cs.DB cs.DS

    Weighted Minwise Hashing Beats Linear Sketching for Inner Product Estimation

    Authors: Aline Bessa, Majid Daliri, Juliana Freire, Cameron Musco, Christopher Musco, Aécio Santos, Haoxiang Zhang

    Abstract: We present a new approach for computing compact sketches that can be used to approximate the inner product between pairs of high-dimensional vectors. Based on the Weighted MinHash algorithm, our approach admits strong accuracy guarantees that improve on the guarantees of popular linear sketching approaches for inner product estimation, such as CountSketch and Johnson-Lindenstrauss projection. Spec… ▽ More

    Submitted 5 May, 2023; v1 submitted 13 January, 2023; originally announced January 2023.

    Comments: 23 pages, 6 figures

    Journal ref: In Proceedings of the ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems (PODS) 2023

  7. Brain Electrical Stimulation for Animal Navigation

    Authors: Amirmasoud Ahmadi, Sepideh Farakhor Seghinsara, Mohammad Reza Daliri, Vahid Shalchyan

    Abstract: The brain stimulation and its widespread use is one of the most important subjects in studies of neurophysiology. In brain electrical stimulation methods, following the surgery and electrode implantation, electrodes send electrical impulses to the specific targets in the brain. The use of this stimulation method is provided therapeutic benefits for treatment chronic pain, essential tremor, Parkins… ▽ More

    Submitted 1 December, 2018; originally announced January 2019.

    Comments: in Farsi

    Journal ref: Iranian Journal of Biomedical Engineering, 11(1), pp. 83-100