Skip to main content

Showing 1–50 of 114 results for author: Patel, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.00611  [pdf, other

    cs.NE cs.LG

    Using CSNNs to Perform Event-based Data Processing & Classification on ASL-DVS

    Authors: Ria Patel, Sujit Tripathy, Zachary Sublett, Seoyoung An, Riya Patel

    Abstract: Recent advancements in bio-inspired visual sensing and neuromorphic computing have led to the development of various highly efficient bio-inspired solutions with real-world applications. One notable application integrates event-based cameras with spiking neural networks (SNNs) to process event-based sequences that are asynchronous and sparse, making them difficult to handle. In this project, we de… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: 8 pages, 14 figures

  2. arXiv:2407.21783  [pdf, other

    cs.AI cs.CL cs.CV

    The Llama 3 Herd of Models

    Authors: Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurelien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Roziere, Bethany Biron, Binh Tang , et al. (510 additional authors not shown)

    Abstract: Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical… ▽ More

    Submitted 15 August, 2024; v1 submitted 31 July, 2024; originally announced July 2024.

  3. arXiv:2406.09266  [pdf, other

    cs.MS

    SySTeC: A Symmetric Sparse Tensor Compiler

    Authors: Radha Patel, Willow Ahrens, Saman Amarasinghe

    Abstract: Symmetric and sparse tensors arise naturally in many domains including linear algebra, statistics, physics, chemistry, and graph theory. Symmetric tensors are equal to their transposes, so in the $n$-dimensional case we can save up to a factor of $n!$ by avoiding redundant operations. Sparse tensors, on the other hand, are mostly zero, and we can save asymptotically by processing only nonzeros. Un… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2404.17584  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Equivariant graph convolutional neural networks for the representation of homogenized anisotropic microstructural mechanical response

    Authors: Ravi Patel, Cosmin Safta, Reese E. Jones

    Abstract: Composite materials with different microstructural material symmetries are common in engineering applications where grain structure, alloying and particle/fiber packing are optimized via controlled manufacturing. In fact these microstructural tunings can be done throughout a part to achieve functional gradation and optimization at a structural level. To predict the performance of particular micros… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 23 pages, 10 figures

  5. arXiv:2404.16730  [pdf, other

    cs.MS

    Finch: Sparse and Structured Array Programming with Control Flow

    Authors: Willow Ahrens, Teodoro Fields Collin, Radha Patel, Kyle Deeds, Changwan Hong, Saman Amarasinghe

    Abstract: From FORTRAN to NumPy, arrays have revolutionized how we express computation. However, arrays in these, and almost all prominent systems, can only handle dense rectilinear integer grids. Real world arrays often contain underlying structure, such as sparsity, runs of repeated values, or symmetry. Support for structured data is fragmented and incomplete. Existing frameworks limit the array structure… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  6. arXiv:2404.14643  [pdf, other

    cs.CR cs.CY cs.GR cs.NI cs.SI

    Teaching Network Traffic Matrices in an Interactive Game Environment

    Authors: Chasen Milner, Hayden Jananthan, Jeremy Kepner, Vijay Gadepally, Michael Jones, Peter Michaleas, Ritesh Patel, Sandeep Pisharody, Gabriel Wachman, Alex Pentland

    Abstract: The Internet has become a critical domain for modern society that requires ongoing efforts for its improvement and protection. Network traffic matrices are a powerful tool for understanding and analyzing networks and are broadly taught in online graph theory educational resources. Network traffic matrix concepts are rarely available in online computer network and cybersecurity educational resource… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 9 pages, 10 figures, 52 references; accepted to IEEE GrAPL

  7. arXiv:2403.02482  [pdf, other

    cs.AI

    MORBDD: Multiobjective Restricted Binary Decision Diagrams by Learning to Sparsify

    Authors: Rahul Patel, Elias B. Khalil, David Bergman

    Abstract: In multicriteria decision-making, a user seeks a set of non-dominated solutions to a (constrained) multiobjective optimization problem, the so-called Pareto frontier. In this work, we seek to bring a state-of-the-art method for exact multiobjective integer linear programming into the heuristic realm. We focus on binary decision diagrams (BDDs) which first construct a graph that represents all feas… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  8. arXiv:2402.11179  [pdf, other

    cs.LG math.ST physics.comp-ph

    Uncertainty Quantification of Graph Convolution Neural Network Models of Evolving Processes

    Authors: Jeremiah Hauth, Cosmin Safta, Xun Huan, Ravi G. Patel, Reese E. Jones

    Abstract: The application of neural network models to scientific machine learning tasks has proliferated in recent years. In particular, neural network models have proved to be adept at modeling processes with spatial-temporal complexity. Nevertheless, these highly parameterized models have garnered skepticism in their ability to produce outputs with quantified error bounds over the regimes of interest. Hen… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 27 pages, 20 figures

  9. arXiv:2402.04068  [pdf, other

    cs.LG cs.CL

    Retrieve to Explain: Evidence-driven Predictions with Language Models

    Authors: Ravi Patel, Angus Brayne, Rogier Hintzen, Daniel Jaroslawicz, Georgiana Neculae, Dane Corneil

    Abstract: Language models hold incredible promise for enabling scientific discovery by synthesizing massive research corpora. Many complex scientific research questions have multiple plausible answers, each supported by evidence of varying strength. However, existing language models lack the capability to quantitatively and faithfully compare answer plausibility in terms of supporting evidence. To address t… ▽ More

    Submitted 18 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: ICML AI for Science 2024

  10. arXiv:2402.01704  [pdf, other

    cs.CL cs.AI cs.GT

    States as Strings as Strategies: Steering Language Models with Game-Theoretic Solvers

    Authors: Ian Gemp, Yoram Bachrach, Marc Lanctot, Roma Patel, Vibhavari Dasagi, Luke Marris, Georgios Piliouras, Siqi Liu, Karl Tuyls

    Abstract: Game theory is the study of mathematical models of strategic interactions among rational agents. Language is a key medium of interaction for humans, though it has historically proven difficult to model dialogue and its strategic motivations mathematically. A suitable model of the players, strategies, and payoffs associated with linguistic interactions (i.e., a binding to the conventional symbolic… ▽ More

    Submitted 6 February, 2024; v1 submitted 24 January, 2024; originally announced February 2024.

    Comments: 32 pages, 8 figures, code available @ https://github.com/google-deepmind/open_spiel/blob/master/open_spiel/python/games/chat_game.py

  11. arXiv:2401.02199  [pdf, other

    eess.SY cs.AI cs.LG cs.SE

    LADRI: LeArning-based Dynamic Risk Indicator in Automated Driving System

    Authors: Anil Ranjitbhai Patel, Peter Liggesmeyer

    Abstract: As the horizon of intelligent transportation expands with the evolution of Automated Driving Systems (ADS), ensuring paramount safety becomes more imperative than ever. Traditional risk assessment methodologies, primarily crafted for human-driven vehicles, grapple to adequately adapt to the multifaceted, evolving environments of ADS. This paper introduces a framework for real-time Dynamic Risk Ass… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: 2023 IEEE International Test Conference, 8th Edition of Automotive, Reliability, Test & Safety Workshop in Disneyland, Anaheim, CA

  12. arXiv:2401.00972  [pdf

    cs.LG cs.CY stat.AP

    Robust Meta-Model for Predicting the Need for Blood Transfusion in Non-traumatic ICU Patients

    Authors: Alireza Rafiei, Ronald Moore, Tilendra Choudhary, Curtis Marshall, Geoffrey Smith, John D. Roback, Ravi M. Patel, Cassandra D. Josephson, Rishikesan Kamaleswaran

    Abstract: Objective: Blood transfusions, crucial in managing anemia and coagulopathy in ICU settings, require accurate prediction for effective resource allocation and patient risk assessment. However, existing clinical decision support systems have primarily targeted a particular patient demographic with unique medical conditions and focused on a single type of blood transfusion. This study aims to develop… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

  13. arXiv:2312.17670  [pdf, other

    cs.CV cs.LG q-bio.QM q-bio.TO

    Benchmarking the CoW with the TopCoW Challenge: Topology-Aware Anatomical Segmentation of the Circle of Willis for CTA and MRA

    Authors: Kaiyuan Yang, Fabio Musio, Yihui Ma, Norman Juchler, Johannes C. Paetzold, Rami Al-Maskari, Luciano Höher, Hongwei Bran Li, Ibrahim Ethem Hamamci, Anjany Sekuboyina, Suprosanna Shit, Houjing Huang, Chinmay Prabhakar, Ezequiel de la Rosa, Diana Waldmannstetter, Florian Kofler, Fernando Navarro, Martin Menten, Ivan Ezhov, Daniel Rueckert, Iris Vos, Ynte Ruigrok, Birgitta Velthuis, Hugo Kuijf, Julien Hämmerli , et al. (59 additional authors not shown)

    Abstract: The Circle of Willis (CoW) is an important network of arteries connecting major circulations of the brain. Its vascular architecture is believed to affect the risk, severity, and clinical outcome of serious neuro-vascular diseases. However, characterizing the highly variable CoW anatomy is still a manual and time-consuming expert task. The CoW is usually imaged by two angiographic imaging modaliti… ▽ More

    Submitted 29 April, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

    Comments: 24 pages, 11 figures, 9 tables. Summary Paper for the MICCAI TopCoW 2023 Challenge

  14. arXiv:2312.14506  [pdf, other

    cs.CR cs.DC

    Concurrent Asynchronous Byzantine Agreement in Expected-Constant Rounds, Revisited

    Authors: Ran Cohen, Pouyan Forghani, Juan Garay, Rutvik Patel, Vassilis Zikas

    Abstract: It is well known that without randomization, Byzantine agreement (BA) requires a linear number of rounds in the synchronous setting, while it is flat out impossible in the asynchronous setting. The primitive which allows to bypass the above limitation is known as oblivious common coin (OCC). It allows parties to agree with constant probability on a random coin, where agreement is oblivious, i.e.,… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: A preliminary version of this work appeared in TCC 2023

  15. arXiv:2312.10048  [pdf

    cs.CL

    Knowledge Graph Enhanced Aspect-Level Sentiment Analysis

    Authors: Kavita Sharma, Ritu Patel, Sunita Iyer

    Abstract: In this paper, we propose a novel method to enhance sentiment analysis by addressing the challenge of context-specific word meanings. It combines the advantages of a BERT model with a knowledge graph based synonym data. This synergy leverages a dynamic attention mechanism to develop a knowledge-driven state vector. For classifying sentiments linked to specific aspects, the approach constructs a me… ▽ More

    Submitted 26 January, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

  16. arXiv:2312.06652  [pdf, other

    cs.AI cs.CL

    Building Domain-Specific LLMs Faithful To The Islamic Worldview: Mirage or Technical Possibility?

    Authors: Shabaz Patel, Hassan Kane, Rayhan Patel

    Abstract: Large Language Models (LLMs) have demonstrated remarkable performance across numerous natural language understanding use cases. However, this impressive performance comes with inherent limitations, such as the tendency to perpetuate stereotypical biases or fabricate non-existent facts. In the context of Islam and its representation, accurate and factual representation of its beliefs and teachings… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted for Muslims in ML workshop at the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  17. arXiv:2312.04470  [pdf, other

    cs.HC cs.CR

    GaitGuard: Towards Private Gait in Mixed Reality

    Authors: Diana Romero, Ruchi Jagdish Patel, Athina Markopoulou, Salma Elmalaki

    Abstract: Augmented/Mixed Reality (AR/MR) technologies offers a new era of immersive, collaborative experiences, distinctively setting them apart from conventional mobile systems. However, as we further investigate the privacy and security implications within these environments, the issue of gait privacy emerges as a critical yet underexplored concern. Given its uniqueness as a biometric identifier that can… ▽ More

    Submitted 4 June, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: 21 pages, 17 figures

  18. arXiv:2311.10856  [pdf

    cs.AI

    Exploring the Consistency, Quality and Challenges in Manual and Automated Coding of Free-text Diagnoses from Hospital Outpatient Letters

    Authors: Warren Del-Pinto, George Demetriou, Meghna Jani, Rikesh Patel, Leanne Gray, Alex Bulcock, Niels Peek, Andrew S. Kanter, William G Dixon, Goran Nenadic

    Abstract: Coding of unstructured clinical free-text to produce interoperable structured data is essential to improve direct care, support clinical communication and to enable clinical research.However, manual clinical coding is difficult and time consuming, which motivates the development and use of natural language processing for automated coding. This work evaluates the quality and consistency of both man… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  19. arXiv:2311.00408  [pdf, other

    cs.CL

    AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classification

    Authors: Yongxin Huang, Kexin Wang, Sourav Dutta, Raj Nath Patel, Goran Glavaš, Iryna Gurevych

    Abstract: Recent work has found that few-shot sentence classification based on pre-trained Sentence Encoders (SEs) is efficient, robust, and effective. In this work, we investigate strategies for domain-specialization in the context of few-shot sentence classification with SEs. We first establish that unsupervised Domain-Adaptive Pre-Training (DAPT) of a base Pre-trained Language Model (PLM) (i.e., not an S… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: Accepted at EMNLP 2023 Main

  20. arXiv:2310.00522  [pdf, other

    cs.SI

    Mapping of Internet "Coastlines" via Large Scale Anonymized Network Source Correlations

    Authors: Hayden Jananthan, Jeremy Kepner, Michael Jones, William Arcand, David Bestor, William Bergeron, Chansup Byun, Timothy Davis, Vijay Gadepally, Daniel Grant, Michael Houle, Matthew Hubbell, Anna Klein, Lauren Milechin, Guillermo Morales, Andrew Morris, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Tyler Trigg , et al. (3 additional authors not shown)

    Abstract: Expanding the scientific tools available to protect computer networks can be aided by a deeper understanding of the underlying statistical distributions of network traffic and their potential geometric interpretations. Analyses of large scale network observations provide a unique window into studying those underlying statistics. Newly developed GraphBLAS hypersparse matrices and D4M associative ar… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

    Comments: 9 pages, 7 figures, IEEE HPEC 2023 (accepted)

  21. Deployment of Real-Time Network Traffic Analysis using GraphBLAS Hypersparse Matrices and D4M Associative Arrays

    Authors: Michael Jones, Jeremy Kepner, Andrew Prout, Timothy Davis, William Arcand, David Bestor, William Bergeron, Chansup Byun, Vijay Gadepally, Micheal Houle, Matthew Hubbell, Hayden Jananthan, Anna Klein, Lauren Milechin, Guillermo Morales, Julie Mullen, Ritesh Patel, Sandeep Pisharody, Albert Reuther, Antonio Rosa, Siddharth Samsi, Charles Yee, Peter Michaleas

    Abstract: Matrix/array analysis of networks can provide significant insight into their behavior and aid in their operation and protection. Prior work has demonstrated the analytic, performance, and compression capabilities of GraphBLAS (graphblas.org) hypersparse matrices and D4M (d4m.mit.edu) associative arrays (a mathematical superset of matrices). Obtaining the benefits of these capabilities requires int… ▽ More

    Submitted 8 December, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE HPEC, 8 pages, 8 figures, 1 table, 69 references. arXiv admin note: text overlap with arXiv:2203.13934. text overlap with arXiv:2309.01806

  22. Focusing and Calibration of Large Scale Network Sensors using GraphBLAS Anonymized Hypersparse Matrices

    Authors: Jeremy Kepner, Michael Jones, Phil Dykstra, Chansup Byun, Timothy Davis, Hayden Jananthan, William Arcand, David Bestor, William Bergeron, Vijay Gadepally, Micheal Houle, Matthew Hubbell, Anna Klein, Lauren Milechin, Guillermo Morales, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Tyler Trigg, Charles Yee , et al. (1 additional authors not shown)

    Abstract: Defending community-owned cyber space requires community-based efforts. Large-scale network observations that uphold the highest regard for privacy are key to protecting our shared cyberspace. Deployment of the necessary network sensors requires careful sensor placement, focusing, and calibration with significant volumes of network observations. This paper demonstrates novel focusing and calibrati… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE HPEC, 9 pages, 12 figures, 1 table, 63 references, 2 appendices

  23. arXiv:2308.16139  [pdf, other

    cs.CV cs.DB cs.LG

    MedShapeNet -- A Large-Scale Dataset of 3D Medical Shapes for Computer Vision

    Authors: Jianning Li, Zongwei Zhou, Jiancheng Yang, Antonio Pepe, Christina Gsaxner, Gijs Luijten, Chongyu Qu, Tiezheng Zhang, Xiaoxi Chen, Wenxuan Li, Marek Wodzinski, Paul Friedrich, Kangxian Xie, Yuan Jin, Narmada Ambigapathy, Enrico Nasca, Naida Solak, Gian Marco Melito, Viet Duc Vu, Afaque R. Memon, Christopher Schlachta, Sandrine De Ribaupierre, Rajnikant Patel, Roy Eagleson, Xiaojun Chen , et al. (132 additional authors not shown)

    Abstract: Prior to the deep learning era, shape was commonly used to describe the objects. Nowadays, state-of-the-art (SOTA) algorithms in medical imaging are predominantly diverging from computer vision, where voxel grids, meshes, point clouds, and implicit surface models are used. This is seen from numerous shape-related publications in premier vision conferences as well as the growing popularity of Shape… ▽ More

    Submitted 12 December, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: 16 pages

    MSC Class: 68T01

  24. arXiv:2308.00009  [pdf

    eess.IV cs.LG

    A 3D deep learning classifier and its explainability when assessing coronary artery disease

    Authors: Wing Keung Cheung, Jeremy Kalindjian, Robert Bell, Arjun Nair, Leon J. Menezes, Riyaz Patel, Simon Wan, Kacy Chou, Jiahang Chen, Ryo Torii, Rhodri H. Davies, James C. Moon, Daniel C. Alexander, Joseph Jacob

    Abstract: Early detection and diagnosis of coronary artery disease (CAD) could save lives and reduce healthcare costs. In this study, we propose a 3D Resnet-50 deep learning model to directly classify normal subjects and CAD patients on computed tomography coronary angiography images. Our proposed method outperforms a 2D Resnet-50 model by 23.65%. Explainability is also provided by using a Grad-GAM. Further… ▽ More

    Submitted 29 July, 2023; originally announced August 2023.

  25. arXiv:2307.03171  [pdf, other

    cs.AI

    LEO: Learning Efficient Orderings for Multiobjective Binary Decision Diagrams

    Authors: Rahul Patel, Elias B. Khalil

    Abstract: Approaches based on Binary decision diagrams (BDDs) have recently achieved state-of-the-art results for multiobjective integer programming problems. The variable ordering used in constructing BDDs can have a significant impact on their size and on the quality of bounds derived from relaxed or restricted BDDs for single-objective optimization problems. We first showcase a similar impact of variable… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

  26. arXiv:2306.09946  [pdf, other

    cs.RO

    Tactile-Reactive Roller Grasper

    Authors: Shenli Yuan, Shaoxiong Wang, Radhen Patel, Megha Tippur, Connor Yako, Edward Adelson, Kenneth Salisbury

    Abstract: Manipulation of objects within a robot's hand is one of the most important challenges in achieving robot dexterity. The "Roller Graspers" refers to a family of non-anthropomorphic hands utilizing motorized, rolling fingertips to achieve in-hand manipulation. These graspers manipulate grasped objects by commanding the rollers to exert forces that propel the object in the desired motion directions.… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

  27. arXiv:2305.00385  [pdf

    eess.IV cs.CV

    Cross-Shaped Windows Transformer with Self-supervised Pretraining for Clinically Significant Prostate Cancer Detection in Bi-parametric MRI

    Authors: Yuheng Li, Jacob Wynne, Jing Wang, Richard L. J. Qiu, Justin Roper, Shaoyan Pan, Ashesh B. Jani, Tian Liu, Pretesh R. Patel, Hui Mao, Xiaofeng Yang

    Abstract: Biparametric magnetic resonance imaging (bpMRI) has demonstrated promising results in prostate cancer (PCa) detection using convolutional neural networks (CNNs). Recently, transformers have achieved competitive performance compared to CNNs in computer vision. Large scale transformers need abundant annotated data for training, which are difficult to obtain in medical imaging. Self-supervised learni… ▽ More

    Submitted 17 March, 2024; v1 submitted 30 April, 2023; originally announced May 2023.

  28. arXiv:2304.09750  [pdf, other

    q-fin.CP cs.CE cs.LG quant-ph

    Application of Tensor Neural Networks to Pricing Bermudan Swaptions

    Authors: Raj G. Patel, Tomas Dominguez, Mohammad Dib, Samuel Palmer, Andrea Cadarso, Fernando De Lope Contreras, Abdelkader Ratnani, Francisco Gomez Casanova, Senaida Hernández-Santana, Álvaro Díaz-Fernández, Eva Andrés, Jorge Luis-Hita, Escolástico Sánchez-Martínez, Samuel Mugel, Roman Orus

    Abstract: The Cheyette model is a quasi-Gaussian volatility interest rate model widely used to price interest rate derivatives such as European and Bermudan Swaptions for which Monte Carlo simulation has become the industry standard. In low dimensions, these approaches provide accurate and robust prices for European Swaptions but, even in this computationally simple setting, they are known to underestimate… ▽ More

    Submitted 10 March, 2024; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: 16 pages, 9 figures, 2 tables, minor changes

  29. arXiv:2212.14076  [pdf, other

    q-fin.PR cs.CE cs.LG quant-ph

    Quantum-Inspired Tensor Neural Networks for Option Pricing

    Authors: Raj G. Patel, Chia-Wei Hsing, Serkan Sahin, Samuel Palmer, Saeed S. Jahromi, Shivam Sharma, Tomas Dominguez, Kris Tziritas, Christophe Michel, Vincent Porte, Mustafa Abid, Stephane Aubert, Pierre Castellani, Samuel Mugel, Roman Orus

    Abstract: Recent advances in deep learning have enabled us to address the curse of dimensionality (COD) by solving problems in higher dimensions. A subset of such approaches of addressing the COD has led us to solving high-dimensional PDEs. This has resulted in opening doors to solving a variety of real-world problems ranging from mathematical finance to stochastic control for industrial applications. Altho… ▽ More

    Submitted 10 March, 2024; v1 submitted 28 December, 2022; originally announced December 2022.

    Comments: 11 pages, 8 figures, minor changes. arXiv admin note: substantial text overlap with arXiv:2208.02235

  30. arXiv:2211.08371  [pdf, other

    cs.CL

    Pragmatics in Language Grounding: Phenomena, Tasks, and Modeling Approaches

    Authors: Daniel Fried, Nicholas Tomlin, Jennifer Hu, Roma Patel, Aida Nematzadeh

    Abstract: People rely heavily on context to enrich meaning beyond what is literally said, enabling concise but effective communication. To interact successfully and naturally with people, user-facing artificial intelligence systems will require similar skills in pragmatics: relying on various types of context -- from shared linguistic goals and conventions, to the visual and embodied world -- to use languag… ▽ More

    Submitted 21 November, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: Findings of EMNLP 2023

  31. arXiv:2211.06247  [pdf, other

    eess.IV cs.CV

    Joint Deep Learning for Improved Myocardial Scar Detection from Cardiac MRI

    Authors: Jiarui Xing, Shuo Wang, Kenneth C. Bilchick, Amit R. Patel, Miaomiao Zhang

    Abstract: Automated identification of myocardial scar from late gadolinium enhancement cardiac magnetic resonance images (LGE-CMR) is limited by image noise and artifacts such as those related to motion and partial volume effect. This paper presents a novel joint deep learning (JDL) framework that improves such tasks by utilizing simultaneously learned myocardium segmentations to eliminate negative effects… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

  32. arXiv:2211.06238  [pdf, other

    eess.IV cs.CV

    Multitask Learning for Improved Late Mechanical Activation Detection of Heart from Cine DENSE MRI

    Authors: Jiarui Xing, Shuo Wang, Kenneth C. Bilchick, Frederick H. Epstein, Amit R. Patel, Miaomiao Zhang

    Abstract: The selection of an optimal pacing site, which is ideally scar-free and late activated, is critical to the response of cardiac resynchronization therapy (CRT). Despite the success of current approaches formulating the detection of such late mechanical activation (LMA) regions as a problem of activation time regression, their accuracy remains unsatisfactory, particularly in cases where myocardial s… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

  33. arXiv:2210.08664  [pdf, other

    cs.RO eess.SY

    Design and Modeling of a Smart Torque-Adjustable Rotary Electroadhesive Clutch for Application in Human-Robot Interaction

    Authors: Navid Feizi, S. Farokh Atashzar, Mehrdad R. Kermani, Rajni V. Patel

    Abstract: The increasing need for sharing workspace and interactive physical tasks between robots and humans has raised concerns regarding safety of such operations. In this regard, controllable clutches have shown great potential for addressing important safety concerns at the hardware level by separating the high-impedance actuator from the end effector by providing the power transfer from electromagnetic… ▽ More

    Submitted 16 October, 2022; originally announced October 2022.

    Comments: submitted to IEEE T-MECH, 11 pages, 14 figures,

  34. arXiv:2210.07335  [pdf

    cs.RO cs.AI

    FOON Creation and Traversal for Recipe Generation

    Authors: Raj Patel

    Abstract: Task competition by robots is still off from being completely dependable and usable. One way a robot may decipher information given to it and accomplish tasks is by utilizing FOON, which stands for functional object-oriented network. The network first needs to be created by having a human creates action nodes as well as input and output nodes in a .txt file. After the network is sizeable, utilizat… ▽ More

    Submitted 15 November, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

  35. arXiv:2208.06448  [pdf, other

    cs.AI cs.LG

    RLang: A Declarative Language for Describing Partial World Knowledge to Reinforcement Learning Agents

    Authors: Rafael Rodriguez-Sanchez, Benjamin A. Spiegel, Jennifer Wang, Roma Patel, Stefanie Tellex, George Konidaris

    Abstract: We introduce RLang, a domain-specific language (DSL) for communicating domain knowledge to an RL agent. Unlike existing RL DSLs that ground to \textit{single} elements of a decision-making formalism (e.g., the reward function or policy), RLang can specify information about every element of a Markov decision process. We define precise syntax and grounding semantics for RLang, and provide a parser t… ▽ More

    Submitted 30 May, 2023; v1 submitted 12 August, 2022; originally announced August 2022.

  36. arXiv:2208.02235  [pdf, other

    cs.LG cond-mat.str-el cs.AI physics.comp-ph quant-ph

    Quantum-Inspired Tensor Neural Networks for Partial Differential Equations

    Authors: Raj Patel, Chia-Wei Hsing, Serkan Sahin, Saeed S. Jahromi, Samuel Palmer, Shivam Sharma, Christophe Michel, Vincent Porte, Mustafa Abid, Stephane Aubert, Pierre Castellani, Chi-Guhn Lee, Samuel Mugel, Roman Orus

    Abstract: Partial Differential Equations (PDEs) are used to model a variety of dynamical systems in science and engineering. Recent advances in deep learning have enabled us to solve them in a higher dimension by addressing the curse of dimensionality in new ways. However, deep learning methods are constrained by training time and memory. To tackle these shortcomings, we implement Tensor Neural Networks (TN… ▽ More

    Submitted 10 August, 2022; v1 submitted 3 August, 2022; originally announced August 2022.

    Comments: 14 pages, 11 figures, minimal changes

  37. arXiv:2207.02891  [pdf, other

    cs.LG cs.AI

    Don't overfit the history -- Recursive time series data augmentation

    Authors: Amine Mohamed Aboussalah, Min-Jae Kwon, Raj G Patel, Cheng Chi, Chi-Guhn Lee

    Abstract: Time series observations can be seen as realizations of an underlying dynamical system governed by rules that we typically do not know. For time series learning tasks, we need to understand that we fit our model on available data, which is a unique realized history. Training on a single realization often induces severe overfitting lacking generalization. To address this issue, we introduce a gener… ▽ More

    Submitted 28 January, 2023; v1 submitted 6 July, 2022; originally announced July 2022.

    Comments: Accepted to ICLR 2023 Resubmitted here due to major change in proofs following conference submission

  38. arXiv:2206.02571  [pdf, other

    eess.SY cs.CE physics.comp-ph

    Wigner-Smith Time Delay Matrix for Electromagnetics: Guiding and Periodic Systems with Evanescent Modes

    Authors: Yiqian Mao, Utkarsh R. Patel, Eric Michielssen

    Abstract: The Wigner-Smith (WS) time delay matrix relates an electromagnetic system's scattering matrix and its frequency derivative. Previous work showed that the entries of WS time delay matrices of systems excited by propagating waves consist of volume integrals of energy-like field quantities. This paper introduces a generalized WS relationship that applies to systems excited by mixtures of propagating… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

  39. arXiv:2206.01403  [pdf, other

    physics.comp-ph cs.CE

    Wigner-Smith Time Delay Matrix for Electromagnetics: Systems with Material Dispersion and Losses

    Authors: Yiqian Mao, Utkarsh R. Patel, Eric Michielssen

    Abstract: The Wigner-Smith (WS) time delay matrix relates a system's scattering matrix to its frequency derivative and gives rise to so-called WS modes that experience well-defined group delays when interacting with the system. For systems composed of nondispersive and lossless materials, the WS time delay matrix previously was shown to consist of volume integrals of energy-like densities plus correction te… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

  40. arXiv:2205.12725  [pdf, other

    cs.CE

    Wigner-Smith Time Delay Matrix for Acoustic Scattering: Computational Aspects

    Authors: Utkarsh R. Patel, Yiqian Mao, Jack Hamel, Eric Michielssen

    Abstract: The Wigner-Smith (WS) time delay matrix relates an acoustic system's scattering matrix to its wavenumber derivative. The entries of the WS time delay matrix can be expressed in terms of energy density-like volume integrals, which cannot be efficiently evaluated in a boundary element method framework. This paper presents two schemes for efficiently populating the WS time delay matrix. The direct fo… ▽ More

    Submitted 22 May, 2022; originally announced May 2022.

    Comments: Submitted the Journal of the Acoustical Society of America

  41. arXiv:2205.12006  [pdf, other

    math.OC cs.AI cs.LG

    Neur2SP: Neural Two-Stage Stochastic Programming

    Authors: Justin Dumouchelle, Rahul Patel, Elias B. Khalil, Merve Bodur

    Abstract: Stochastic Programming is a powerful modeling framework for decision-making under uncertainty. In this work, we tackle two-stage stochastic programs (2SPs), the most widely used class of stochastic programming models. Solving 2SPs exactly requires optimizing over an expected value function that is computationally intractable. Having a mixed-integer linear program (MIP) or a nonlinear program (NLP)… ▽ More

    Submitted 12 October, 2022; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: To appear in the proceedings of NeurIPS 2022

  42. arXiv:2205.10267  [pdf, other

    astro-ph.IM astro-ph.HE cs.DC gr-qc

    Reproducibility of the First Image of a Black Hole in the Galaxy M87 from the Event Horizon Telescope (EHT) Collaboration

    Authors: Ria Patel, Brandan Roachell, Silvina Caino-Lores, Ross Ketron, Jacob Leonard, Nigel Tan, Duncan Brown, Ewa Deelman, Michela Taufer

    Abstract: This paper presents an interdisciplinary effort aiming to develop and share sustainable knowledge necessary to analyze, understand, and use published scientific results to advance reproducibility in multi-messenger astrophysics. Specifically, we target the breakthrough work associated with the generation of the first image of a black hole, called M87. The image was computed by the Event Horizon Te… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

  43. Wigner-Smith Time Delay Matrix for Acoustic Scattering: Theory and Phenomenology

    Authors: Utkarsh R. Patel, Yiqian Mao, Eric Michielssen

    Abstract: The Wigner-Smith (WS) time delay matrix relates a lossless system's scattering matrix to its frequency derivative. First proposed in the realm of quantum mechanics to characterize time delays experienced by particles during a collision, this article extends the use of WS time delay techniques to acoustic scattering problems governed by the Helmholtz equation. Expression for the entries of the WS t… ▽ More

    Submitted 15 May, 2022; originally announced May 2022.

    Comments: Submitted to The Journal of Acoustical Society of America

  44. arXiv:2204.10909  [pdf, other

    cs.LG stat.ML

    Error-in-variables modelling for operator learning

    Authors: Ravi G. Patel, Indu Manickam, Myoungkyu Lee, Mamikon Gulian

    Abstract: Deep operator learning has emerged as a promising tool for reduced-order modelling and PDE model discovery. Leveraging the expressive power of deep neural networks, especially in high dimensions, such methods learn the mapping between functional state variables. While proposed methods have assumed noise only in the dependent variables, experimental and numerical data for operator learning typicall… ▽ More

    Submitted 19 July, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: 23 pages, 10 figures

  45. arXiv:2204.04816  [pdf, other

    cs.CR

    Distributed Hardware Accelerated Secure Joint Computation on the COPA Framework

    Authors: Rushi Patel, Pouya Haghi, Shweta Jain, Andriy Kot, Venkata Krishnan, Mayank Varia, Martin Herbordt

    Abstract: Performance of distributed data center applications can be improved through use of FPGA-based SmartNICs, which provide additional functionality and enable higher bandwidth communication. Until lately, however, the lack of a simple approach for customizing SmartNICs to application requirements has limited the potential benefits. Intel's Configurable Network Protocol Accelerator (COPA) provides a cu… ▽ More

    Submitted 10 April, 2022; originally announced April 2022.

  46. arXiv:2110.05603  [pdf, other

    cs.CL cs.RO

    Generalizing to New Domains by Mapping Natural Language to Lifted LTL

    Authors: Eric Hsiung, Hiloni Mehta, Junchi Chu, Xinyu Liu, Roma Patel, Stefanie Tellex, George Konidaris

    Abstract: Recent work on using natural language to specify commands to robots has grounded that language to LTL. However, mapping natural language task specifications to LTL task specifications using language models require probability distributions over finite vocabulary. Existing state-of-the-art methods have extended this finite vocabulary to include unseen terms from the input sequence to improve output… ▽ More

    Submitted 9 March, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: 7 pages (6 + 1 references page), 3 figures, 2 tables. Accepted to ICRA 2022. To appear in Proceedings of the 2022 International Conference on Robotics and Automation, May 2022

  47. arXiv:2107.09247  [pdf, ps, other

    cs.GT

    Prior-Free Clock Auctions for Bidders with Interdependent Values

    Authors: Vasilis Gkatzelis, Rishi Patel, Emmanouil Pountourakis, Daniel Schoepflin

    Abstract: We study the problem of selling a good to a group of bidders with interdependent values in a prior-free setting. Each bidder has a signal that can take one of $k$ different values, and her value for the good is a weakly increasing function of all the bidders' signals. The bidders are partitioned into $\ell$ expertise-groups, based on how their signal can impact the values for the good, and we prov… ▽ More

    Submitted 19 July, 2021; originally announced July 2021.

    Comments: To appear in the 14th International Symposium on Algorithmic Game Theory (SAGT)

  48. arXiv:2107.07341  [pdf

    cs.HC cs.AI cs.DC cs.LG cs.NE cs.SI

    Utilizing a digital swarm intelligence platform to improve consensus among radiologists and exploring its applications

    Authors: Rutwik Shah, Bruno Astuto, Tyler Gleason, Will Fletcher, Justin Banaga, Kevin Sweetwood, Allen Ye, Rina Patel, Kevin McGill, Thomas Link, Jason Crane, Valentina Pedoia, Sharmila Majumdar

    Abstract: Radiologists today play a key role in making diagnostic decisions and labeling images for training A.I. algorithms. Low inter-reader reliability (IRR) can be seen between experts when interpreting challenging cases. While teams-based decisions are known to outperform individual decisions, inter-personal biases often creep up in group interactions which limit non-dominant participants from expressi… ▽ More

    Submitted 6 September, 2021; v1 submitted 26 June, 2021; originally announced July 2021.

    Comments: 29 pages, 3 tables, 7 figures

  49. arXiv:2105.12564  [pdf

    cs.CV cs.AI cs.LG eess.IV

    Predicting invasive ductal carcinoma using a Reinforcement Sample Learning Strategy using Deep Learning

    Authors: Rushabh Patel

    Abstract: Invasive ductal carcinoma is a prevalent, potentially deadly disease associated with a high rate of morbidity and mortality. Its malignancy is the second leading cause of death from cancer in women. The mammogram is an extremely useful resource for mass detection and invasive ductal carcinoma diagnosis. We are proposing a method for Invasive ductal carcinoma that will use convolutional neural netw… ▽ More

    Submitted 7 November, 2021; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: Updated References

  50. arXiv:2105.11989  [pdf

    cs.AI cs.LG cs.NE

    Graph Based Link Prediction between Human Phenotypes and Genes

    Authors: Rushabh Patel, Yanhui Guo

    Abstract: Background: The learning of genotype-phenotype associations and history of human disease by doing detailed and precise analysis of phenotypic abnormalities can be defined as deep phenotyping. To understand and detect this interaction between phenotype and genotype is a fundamental step when translating precision medicine to clinical practice. The recent advances in the field of machine learning is… ▽ More

    Submitted 1 June, 2021; v1 submitted 25 May, 2021; originally announced May 2021.