Showing 1–15 of 15 results for author: Paliwal, S

  1. arXiv:2408.11200  [pdf, other]

    cs.LG

    UKAN: Unbound Kolmogorov-Arnold Network Accompanied with Accelerated Library

    Authors: Alireza Moradzadeh, Lukasz Wawrzyniak, Miles Macklin, Saee G. Paliwal

    Abstract: In this work, we present a GPU-accelerated library for the underlying components of Kolmogorov-Arnold Networks (KANs), along with an algorithm to eliminate bounded grids in KANs. The GPU-accelerated library reduces the computational complexity of Basis Spline (B-spline) evaluation by a factor of $\mathcal{O}$(grid size) compared to existing codes, enabling batch computation for large-scale learnin…

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 10 pages, 7 figures, 4 tables

  2. arXiv:2406.07770  [pdf, other]

    cs.LG cs.AI q-bio.QM

    DualBind: A Dual-Loss Framework for Protein-Ligand Binding Affinity Prediction

    Authors: Meng Liu, Saee Gopal Paliwal

    Abstract: Accurate prediction of protein-ligand binding affinities is crucial for drug development. Recent advances in machine learning show promising results on this task. However, these methods typically rely heavily on labeled data, which can be scarce or unreliable, or they rely on assumptions like Boltzmann-distributed data that may not hold true in practice. Here, we present DualBind, a novel framewor…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Preprint, work in progress

  3. arXiv:2406.00855  [pdf, other]

    cs.LG cs.AI cs.SI

    LinkLogic: A New Method and Benchmark for Explainable Knowledge Graph Predictions

    Authors: Niraj Kumar-Singh, Gustavo Polleti, Saee Paliwal, Rachel Hodos-Nkhereanye

    Abstract: While there are a plethora of methods for link prediction in knowledge graphs, state-of-the-art approaches are often black box, obfuscating model reasoning and thereby limiting the ability of users to make informed decisions about model predictions. Recently, methods have emerged to generate prediction explanations for Knowledge Graph Embedding models, a widely-used class of methods for link predi…

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: 12 pages, 4 figures in main text. For code and data, see https://github.com/niraj17singh/LinkLogic

    ACM Class: I.2.4

  4. arXiv:2405.12842  [pdf, other]

    cs.RO cs.CV

    SmartFlow: Robotic Process Automation using LLMs

    Authors: Arushi Jain, Shubham Paliwal, Monika Sharma, Lovekesh Vig, Gautam Shroff

    Abstract: Robotic Process Automation (RPA) systems face challenges in handling complex processes and diverse screen layouts that require advanced human-like decision-making capabilities. These systems typically rely on pixel-level encoding through drag-and-drop or automation frameworks such as Selenium to create navigation workflows, rather than visual understanding of screen elements. In this context, we p…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 32nd ACM International Conference on Information and Knowledge Management

  5. arXiv:2405.12742  [pdf, other]

    cs.CV

    Multi-Subject Personalization

    Authors: Arushi Jain, Shubham Paliwal, Monika Sharma, Vikram Jamwal, Lovekesh Vig

    Abstract: Creative story illustration requires a consistent interplay of multiple characters or objects. However, conventional text-to-image models face significant challenges while producing images featuring multiple personalized subjects. For example, they distort the subject rendering, or the text descriptions fail to render coherent subject interactions. We present Multi-Subject Personalization (MSP) to…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 2023 Conference on Neural Information Processing Systems

  6. arXiv:2405.12531  [pdf, other]

    cs.CV cs.LG

    CustomText: Customized Textual Image Generation using Diffusion Models

    Authors: Shubham Paliwal, Arushi Jain, Monika Sharma, Vikram Jamwal, Lovekesh Vig

    Abstract: Textual image generation spans diverse fields like advertising, education, product packaging, social media, information visualization, and branding. Despite recent strides in language-guided image synthesis using diffusion models, current models excel in image generation but struggle with accurate text rendering and offer limited control over font attributes. In this paper, we aim to enhance the s…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: Accepted by AI for Content Creation (AI4CC) workshop at CVPR 2024

  7. arXiv:2212.03720  [pdf, other]

    cs.SI cs.LG stat.ML

    Pseudo-Riemannian Embedding Models for Multi-Relational Graph Representations

    Authors: Saee Paliwal, Angus Brayne, Benedek Fabian, Maciej Wiatrak, Aaron Sim

    Abstract: In this paper we generalize single-relation pseudo-Riemannian graph embedding models to multi-relational networks, and show that the typical approach of encoding relations as manifold transformations translates from the Riemannian to the pseudo-Riemannian case. In addition we construct a view of relations as separate spacetime submanifolds of multi-time manifolds, and consider an interpolation bet…

    Submitted 2 December, 2022; originally announced December 2022.

    Comments: 11 pages, 3 figures, AKBC 2022 conference

    Journal ref: 4th Conference on Automated Knowledge Base Construction 2022

  8. arXiv:2203.06873  [pdf, other]

    cs.CV

    TSR-DSAW: Table Structure Recognition via Deep Spatial Association of Words

    Authors: Arushi Jain, Shubham Paliwal, Monika Sharma, Lovekesh Vig

    Abstract: Existing methods for Table Structure Recognition (TSR) from camera-captured or scanned documents perform poorly on complex tables consisting of nested rows / columns, multi-line texts and missing cell data. This is because current data-driven methods work by simply training deep models on large volumes of data and fail to generalize when an unseen table structure is encountered. In this paper, we…

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: 6 pages, 1 figure, 1 table, ESANN 2021 proceedings, European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning. Online event, 6-8 October 2021, i6doc.com publ., ISBN 978287587082-7

    Journal ref: In ESANN 2021 proceedings, pages 257-262

  9. arXiv:2109.03849  [pdf, other]

    cs.CV

    OSSR-PID: One-Shot Symbol Recognition in P&ID Sheets using Path Sampling and GCN

    Authors: Shubham Paliwal, Monika Sharma, Lovekesh Vig

    Abstract: Piping and Instrumentation Diagrams (P&ID) are ubiquitous in several manufacturing, oil and gas enterprises for representing engineering schematics and equipment layout. There is an urgent need to extract and digitize information from P&IDs without the cost of annotating a varying set of symbols for each new use case. A robust one-shot learning approach for symbol recognition i.e., localization fo…

    Submitted 8 September, 2021; originally announced September 2021.

    Journal ref: International Joint Conference on Neural Network (IJCNN), 2021

  10. Digitize-PID: Automatic Digitization of Piping and Instrumentation Diagrams

    Authors: Shubham Paliwal, Arushi Jain, Monika Sharma, Lovekesh Vig

    Abstract: Digitization of scanned Piping and Instrumentation diagrams (P&ID), widely used in manufacturing or mechanical industries such as oil and gas over several decades, has become a critical bottleneck in dynamic inventory management and creation of smart P&IDs that are compatible with the latest CAD tools. Historically, P&ID sheets have been manually generated at the design stage, before being scanned…

    Submitted 8 September, 2021; originally announced September 2021.

    Comments: 13 pages

    Journal ref: Trends and Applications in Knowledge Discovery and Data Mining. 168-180, PAKDD 2021

  11. arXiv:2106.08678  [pdf, other]

    stat.ML cs.AI cs.LG

    Directed Graph Embeddings in Pseudo-Riemannian Manifolds

    Authors: Aaron Sim, Maciej Wiatrak, Angus Brayne, Páidí Creed, Saee Paliwal

    Abstract: The inductive biases of graph representation learning algorithms are often encoded in the background geometry of their embedding space. In this paper, we show that general directed graphs can be effectively represented by an embedding model that combines three components: a pseudo-Riemannian metric structure, a non-trivial global topology, and a unique likelihood function that explicitly incorpora…

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: Accepted at ICML 2021

  12. Latent Alignment of Procedural Concepts in Multimodal Recipes

    Authors: Hossein Rajaby Faghihi, Roshanak Mirzaee, Sudarshan Paliwal, Parisa Kordjamshidi

    Abstract: We propose a novel alignment mechanism to deal with procedural reasoning on a newly released multimodal QA dataset, named RecipeQA. Our model is solving the textual cloze task which is a reading comprehension on a recipe containing images and instructions. We exploit the power of attention networks, cross-modal representations, and a latent alignment space between instructions and candidate answer…

    Submitted 12 January, 2021; originally announced January 2021.

    Comments: Published in ALVR 2020, a workshop in ACL 2020

    ACM Class: I.2.7

    Journal ref: Proceedings of the First Workshop on Advances in Language and Vision Research 2020 (26-31)

  13. arXiv:2001.01469  [pdf, other]

    cs.CV cs.LG eess.IV

    TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images

    Authors: Shubham Paliwal, Vishwanath D, Rohit Rahul, Monika Sharma, Lovekesh Vig

    Abstract: With the widespread use of mobile phones and scanners to photograph and upload documents, the need for extracting the information trapped in unstructured document images such as retail receipts, insurance claim forms and financial invoices is becoming more acute. A major hurdle to this objective is that these images often contain information in the form of tables and extracting data from tabular s…

    Submitted 6 January, 2020; originally announced January 2020.

  14. arXiv:1901.11383  [pdf, other]

    cs.CV

    Automatic Information Extraction from Piping and Instrumentation Diagrams

    Authors: Rohit Rahul, Shubham Paliwal, Monika Sharma, Lovekesh Vig

    Abstract: One of the most common modes of representing engineering schematics are Piping and Instrumentation diagrams (P&IDs) that describe the layout of an engineering process flow along with the interconnected process equipment. Over the years, P&ID diagrams have been manually generated, scanned and stored as image files. These files need to be digitized for purposes of inventory management and updation,…

    Submitted 28 January, 2019; originally announced January 2019.

    Journal ref: IEEE ICPRAM 2019

  15. arXiv:1507.05398  [pdf, other]

    cs.IT cs.DC

    Generating Binary Optimal Codes Using Heterogeneous Parallel Computing

    Authors: Srajan Paliwal, Saurabh Tiwary, Bhaskar Chaudhury, Manish K. Gupta

    Abstract: Generation of optimal codes is a well known problem in coding theory. Many computational approaches exist in the literature for finding record breaking codes. However generating codes with long lengths $n$ using serial algorithms is computationally very expensive, for example the worst case time complexity of a Greedy algorithm is $\mathcal{O}(n\; 4^n)$. In order to improve the efficiency of gener…

    Submitted 20 July, 2015; originally announced July 2015.

    Comments: 8 pages, draft