Showing 1–5 of 5 results for author: Tsan, B

Searching in archive cs.
  1. arXiv:2403.04026  [pdf, other]

    cs.DB

    Spanning Tree-based Query Plan Enumeration

    Authors: Yesdaulet Izenov, Asoke Datta, Brian Tsan, Abylay Amanbayev, Florin Rusu

    Abstract: In this work, we define the problem of finding an optimal query plan as finding spanning trees with low costs. This approach empowers the utilization of a series of spanning tree algorithms, thereby enabling systematic exploration of the plan search space over a join graph. Capitalizing on the polynomial time complexity of spanning tree algorithms, we present the Ensemble Spanning Tree Enumeration…

    Submitted 6 March, 2024; originally announced March 2024.

  2. arXiv:2311.17293  [pdf, other]

    cs.DB

    Analyzing Query Optimizer Performance in the Presence and Absence of Cardinality Estimates

    Authors: Asoke Datta, Brian Tsan, Yesdaulet Izenov, Florin Rusu

    Abstract: Most query optimizers rely on cardinality estimates to determine optimal execution plans. While traditional databases such as PostgreSQL, Oracle, and Db2 utilize many types of synopses -- including histograms, samples, and sketches -- recent main-memory databases like DuckDB and Heavy.AI often operate with minimal or no estimates, yet their performance does not necessarily suffer. To the best of o…

    Submitted 28 November, 2023; originally announced November 2023.

  3. arXiv:2210.01959  [pdf, other]

    cs.CL cs.AI cs.LG

    Detect, Retrieve, Comprehend: A Flexible Framework for Zero-Shot Document-Level Question Answering

    Authors: Tavish McDonald, Brian Tsan, Amar Saini, Juanita Ordonez, Luis Gutierrez, Phan Nguyen, Blake Mason, Brenda Ng

    Abstract: Researchers produce thousands of scholarly documents containing valuable technical knowledge. The community faces the laborious task of reading these documents to identify, extract, and synthesize information. To automate information gathering, document-level question answering (QA) offers a flexible framework where human-posed questions can be adapted to extract diverse knowledge. Finetuning QA s…

    Submitted 11 December, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

  4. arXiv:2111.00163  [pdf, other]

    cs.DB

    Simpli-Squared: A Very Simple Yet Unexpectedly Powerful Join Ordering Algorithm Without Cardinality Estimates

    Authors: Asoke Datta, Yesdaulet Izenov, Brian Tsan, Florin Rusu

    Abstract: The Join Order Benchmark (JOB) has become the de facto standard to assess the performance of relational database query optimizers due to its complexity and completeness. In order to compute the optimal execution plan -- join order -- existing solutions employ extensive data synopses and correlations -- functional dependencies -- between table attributes. These structures incur significant overhead…

    Submitted 29 October, 2021; originally announced November 2021.

  5. arXiv:2005.11093  [pdf, other]

    cs.AI cs.LG

    DJEnsemble: On the Selection of a Disjoint Ensemble of Deep Learning Black-Box Spatio-Temporal Models

    Authors: Yania Molina Souto, Rafael Pereira, Rocío Zorrilla, Anderson Chaves, Brian Tsan, Florin Rusu, Eduardo Ogasawara, Artur Ziviani, Fabio Porto

    Abstract: In this paper, we present a cost-based approach for the automatic selection and allocation of a disjoint ensemble of black-box predictors to answer predictive spatio-temporal queries. Our approach is divided into two parts -- offline and online. During the offline part, we preprocess the predictive domain data -- transforming it into a regular grid -- and the black-box models -- computing their sp…

    Submitted 17 November, 2020; v1 submitted 22 May, 2020; originally announced May 2020.