Showing 1–50 of 80 results for author: Banerjee, P

Searching in archive cs.
  1. arXiv:2407.00480  [pdf]

    cs.CV

    Development of an interactive GUI using MATLAB for the detection of type and stage of Breast Tumor

    Authors: Poulmi Banerjee, Satadal Saha

    Abstract: Breast cancer is one of the most common types of cancer and is diagnosed mainly in women. Comparing males and females, breast cancer has been found to be more likely in females than in males. Breast lumps are classified mainly into two groups: cancerous and non-cancerous. When we say that the lump in the breast is cancerous, it means…

    Submitted 29 June, 2024; originally announced July 2024.

  2. arXiv:2406.14290  [pdf, ps, other]

    cs.CY cs.SI

    Examining the Implications of Deepfakes for Election Integrity

    Authors: Hriday Ranka, Mokshit Surana, Neel Kothari, Veer Pariawala, Pratyay Banerjee, Aditya Surve, Sainath Reddy Sankepally, Raghav Jain, Jhagrut Lalwani, Swapneel Mehta

    Abstract: It is becoming cheaper to launch disinformation operations at scale using AI-generated content, in particular 'deepfake' technology. We have observed instances of deepfakes in political campaigns, where generated content is employed to both bolster the credibility of certain narratives (reinforcing outcomes) and manipulate public perception to the detriment of targeted candidates or causes (advers…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted at the AAAI 2024 conference, AI for Credible Elections Workshop-AI4CE 2024

  3. arXiv:2406.09598  [pdf, other]

    cs.CV

    Introducing HOT3D: An Egocentric Dataset for 3D Hand and Object Tracking

    Authors: Prithviraj Banerjee, Sindi Shkodrani, Pierre Moulon, Shreyas Hampali, Fan Zhang, Jade Fountain, Edward Miller, Selen Basol, Richard Newcombe, Robert Wang, Jakob Julian Engel, Tomas Hodan

    Abstract: We introduce HOT3D, a publicly available dataset for egocentric hand and object tracking in 3D. The dataset offers over 833 minutes (more than 3.7M images) of multi-view RGB/monochrome image streams showing 19 subjects interacting with 33 diverse rigid objects, multi-modal signals such as eye gaze or scene point clouds, as well as comprehensive ground truth annotations including 3D poses of object…

    Submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2405.10431  [pdf, other]

    cs.CL

    Thinking Fair and Slow: On the Efficacy of Structured Prompts for Debiasing Language Models

    Authors: Shaz Furniturewala, Surgan Jandial, Abhinav Java, Pragyan Banerjee, Simra Shahid, Sumit Bhatia, Kokil Jaidka

    Abstract: Existing debiasing techniques are typically training-based or require access to the model's internals and output distributions, so they are inaccessible to end-users looking to adapt LLM outputs for their particular needs. In this study, we examine whether structured prompting techniques can offer opportunities for fair text generation. We evaluate a comprehensive end-user-focused iterative framew…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: The first two authors have equal contribution

  5. arXiv:2403.15924  [pdf, other]

    cs.HC eess.SY

    Perception and Control of Surfing in Virtual Reality using a 6-DoF Motion Platform

    Authors: Premankur Banerjee, Jason Cherin, Jayati Upadhyay, Jason Kutch, Heather Culbertson

    Abstract: The paper presents a system for simulating surfing in Virtual Reality (VR), emphasizing the recreation of aquatic motions and user-initiated propulsive forces using a 6-Degree of Freedom (DoF) motion platform. We present an algorithmic approach to accurately render surfboard kinematics and interactive paddling dynamics, validated through experimental evaluation with \(N=17\) participants. Results…

    Submitted 23 March, 2024; originally announced March 2024.

  6. arXiv:2403.13672  [pdf, other]

    cs.LG physics.flu-dyn

    Machine Learning Optimized Approach for Parameter Selection in MESHFREE Simulations

    Authors: Paulami Banerjee, Mohan Padmanabha, Chaitanya Sanghavi, Isabel Michel, Simone Gramsch

    Abstract: Meshfree simulation methods are emerging as compelling alternatives to conventional mesh-based approaches, particularly in the fields of Computational Fluid Dynamics (CFD) and continuum mechanics. In this publication, we provide a comprehensive overview of our research combining Machine Learning (ML) and Fraunhofer's MESHFREE software (www.meshfree.eu), a powerful tool utilizing a numerical point…

    Submitted 20 March, 2024; originally announced March 2024.

  7. arXiv:2402.00295  [pdf]

    cs.CV stat.AP

    Comparative Evaluation of Traditional and Deep Learning-Based Segmentation Methods for Spoil Pile Delineation Using UAV Images

    Authors: Sureka Thiruchittampalam, Bikram P. Banerjee, Nancy F. Glenn, Simit Raval

    Abstract: The stability of mine dumps is contingent upon the precise arrangement of spoil piles, taking into account their geological and geotechnical attributes. Yet, on-site characterisation of individual piles poses a formidable challenge. The utilisation of image-based techniques for spoil pile characterisation, employing remotely acquired data through unmanned aerial systems, is a promising complementa…

    Submitted 31 January, 2024; originally announced February 2024.

  8. arXiv:2311.11214  [pdf]

    cs.CV

    Infrared image identification method of substation equipment fault under weak supervision

    Authors: Anjali Sharma, Priya Banerjee, Nikhil Singh

    Abstract: This study presents a weakly supervised method for identifying faults in infrared images of substation equipment. It utilizes the Faster RCNN model for equipment identification, enhancing detection accuracy through modifications to the model's network structure and parameters. The method is exemplified through the analysis of infrared images captured by inspection robots at substations. Performanc…

    Submitted 18 November, 2023; originally announced November 2023.

  9. arXiv:2311.05451  [pdf, other]

    cs.CL cs.CY cs.LG

    All Should Be Equal in the Eyes of Language Models: Counterfactually Aware Fair Text Generation

    Authors: Pragyan Banerjee, Abhinav Java, Surgan Jandial, Simra Shahid, Shaz Furniturewala, Balaji Krishnamurthy, Sumit Bhatia

    Abstract: Fairness in Language Models (LMs) remains a longstanding challenge, given the inherent biases in training data that can be perpetuated by models and affect the downstream tasks. Recent methods employ expensive retraining or attempt debiasing during inference by constraining model outputs to contrast from a reference set of biased templates or exemplars. Regardless, they don't address the primary go…

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: The first four authors contributed equally to the work

  10. arXiv:2310.20174  [pdf, other]

    cs.AI

    GraphTransformers for Geospatial Forecasting of Hurricane Trajectories

    Authors: Pallavi Banerjee, Satyaki Chakraborty

    Abstract: In this paper we introduce a novel framework for trajectory prediction of geospatial sequences using GraphTransformers. When viewed across several sequences, we observed that a graph structure automatically emerges between different geospatial points that is often not taken into account for such sequence modeling tasks. We show that by leveraging this graph structure explicitly, geospatial traject…

    Submitted 26 November, 2023; v1 submitted 31 October, 2023; originally announced October 2023.

  11. arXiv:2310.00836  [pdf, other]

    cs.CL cs.AI

    Towards LogiGLUE: A Brief Survey and A Benchmark for Analyzing Logical Reasoning Capabilities of Language Models

    Authors: Man Luo, Shrinidhi Kumbhar, Ming Shen, Mihir Parmar, Neeraj Varshney, Pratyay Banerjee, Somak Aditya, Chitta Baral

    Abstract: Logical reasoning is fundamental for humans yet presents a substantial challenge in the domain of Artificial Intelligence. Initially, researchers used Knowledge Representation and Reasoning (KR) systems that did not scale and required non-trivial manual effort. Recently, the emergence of large language models (LLMs) has demonstrated the ability to overcome various limitations of formal Knowledge R…

    Submitted 30 March, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

    Comments: Work in progress

  12. arXiv:2308.04407  [pdf, other]

    cs.CR cs.DS

    Chrisimos: A useful Proof-of-Work for finding Minimal Dominating Set of a graph

    Authors: Diptendu Chatterjee, Prabal Banerjee, Subhra Mazumdar

    Abstract: Hash-based Proof-of-Work (PoW) used in the Bitcoin Blockchain leads to high energy consumption and resource wastage. In this paper, we aim to re-purpose the energy by replacing the hash function with real-life problems having commercial utility. We propose Chrisimos, a useful Proof-of-Work where miners are required to find a minimal dominating set for real-life graph instances. A miner who is able…

    Submitted 13 September, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: 20 pages, 3 figures. An abridged version of the paper got accepted in The International Symposium on Intelligent and Trustworthy Computing, Communications, and Networking (ITCCN-2023) held in conjunction with the 22nd IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom-2023)

  13. arXiv:2304.03805  [pdf, other]

    cs.LG

    Correcting Model Misspecification via Generative Adversarial Networks

    Authors: Pronoma Banerjee, Manasi V Gude, Rajvi J Sampat, Sharvari M Hedaoo, Soma Dhavala, Snehanshu Saha

    Abstract: Machine learning models are often misspecified in the likelihood, which leads to a lack of robustness in the predictions. In this paper, we introduce a framework for correcting likelihood misspecifications in several paradigm agnostic noisy prior models and test the model's ability to remove the misspecification. The "ABC-GAN" framework introduced is a novel generative modeling paradigm, which com…

    Submitted 7 April, 2023; originally announced April 2023.

  14. arXiv:2303.14994  [pdf, other]

    cs.DS

    Analysis of DNA sequences through local distribution of nucleotides in strategic neighborhoods

    Authors: Probir Mondal, Pratyay Banerjee, Krishnendu Basuli

    Abstract: We propose a new alignment-free algorithm by constructing a compact vector representation on $\mathbb{R}^{24}$ of a DNA sequence of arbitrary length. Each component of this vector is obtained from a representative sequence, the elements of which are the values realized by a function $Γ$. $Γ$ acts on neighborhoods of arbitrary radius that are located at strategic positions within the DNA sequence a…

    Submitted 16 January, 2024; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: 12 pages, 5 figures

  15. arXiv:2301.10165  [pdf, other]

    cs.CL cs.AI

    Lexi: Self-Supervised Learning of the UI Language

    Authors: Pratyay Banerjee, Shweti Mahajan, Kushal Arora, Chitta Baral, Oriana Riva

    Abstract: Humans can learn to operate the user interface (UI) of an application by reading an instruction manual or how-to guide. Along with text, these resources include visual content such as UI screenshots and images of application icons referenced in the text. We explore how to leverage this data to learn generic visio-linguistic representations of UI screens and their components. These representations…

    Submitted 23 January, 2023; originally announced January 2023.

    Comments: EMNLP (Findings) 2022

  16. arXiv:2212.03866  [pdf, other]

    cs.CV

    Learning Action-Effect Dynamics for Hypothetical Vision-Language Reasoning Task

    Authors: Shailaja Keyur Sampat, Pratyay Banerjee, Yezhou Yang, Chitta Baral

    Abstract: 'Actions' play a vital role in how humans interact with the world. Thus, autonomous agents that would assist us in everyday tasks also require the capability to perform 'Reasoning about Actions & Change' (RAC). This has been an important research direction in Artificial Intelligence (AI) in general, but the study of RAC with visual and linguistic inputs is relatively recent. The CLEVR_HYP (Sampat…

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: 11 pages, 9 figures; Accepted at Findings of EMNLP 2022. arXiv admin note: substantial text overlap with arXiv:2212.03433

  17. arXiv:2212.03433  [pdf, other]

    cs.CV

    Learning Action-Effect Dynamics from Pairs of Scene-graphs

    Authors: Shailaja Keyur Sampat, Pratyay Banerjee, Yezhou Yang, Chitta Baral

    Abstract: 'Actions' play a vital role in how humans interact with the world. Thus, autonomous agents that would assist us in everyday tasks also require the capability to perform 'Reasoning about Actions & Change' (RAC). Recently, there has been growing interest in the study of RAC with visual and linguistic inputs. Graphs are often used to represent semantic structure of the visual content (i.e. objects, t…

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: 5 pages, 6 figures; Accepted at 3rd Workshop on Graphs and more Complex structures for Learning and Reasoning (GCLR) workshop, AAAI 2023

  18. A review of laser scanning for geological and geotechnical applications in underground mining

    Authors: Sarvesh Kumar Singh, Bikram Pratap Banerjee, Simit Raval

    Abstract: Laser scanning can provide timely assessments of mine sites despite adverse challenges in the operational environment. Although there are several published articles on laser scanning, there is a need to review them in the context of underground mining applications. To this end, a holistic review of laser scanning is presented including progress in 3D scanning systems, data capture/processing techn…

    Submitted 20 November, 2022; originally announced November 2022.

  19. arXiv:2210.11790  [pdf, other]

    cs.LG stat.ML

    FoSR: First-order spectral rewiring for addressing oversquashing in GNNs

    Authors: Kedar Karhadkar, Pradeep Kr. Banerjee, Guido Montúfar

    Abstract: Graph neural networks (GNNs) are able to leverage the structure of graph data by passing messages along the edges of the graph. While this allows GNNs to learn features depending on the graph structure, for certain graph topologies it leads to inefficient information propagation and a problem known as oversquashing. This has recently been linked with the curvature and spectral gap of the graph. On…

    Submitted 15 February, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: 21 pages, accepted to ICLR 2023

  20. arXiv:2208.03471  [pdf, other]

    cs.LG cs.IT

    Oversquashing in GNNs through the lens of information contraction and graph expansion

    Authors: Pradeep Kr. Banerjee, Kedar Karhadkar, Yu Guang Wang, Uri Alon, Guido Montúfar

    Abstract: The quality of signal propagation in message-passing graph neural networks (GNNs) strongly influences their expressivity as has been observed in recent works. In particular, for prediction tasks relying on long-range interactions, recursive aggregation of node features can lead to an undesired phenomenon called "oversquashing". We present a framework for analyzing oversquashing based on informatio…

    Submitted 6 August, 2022; originally announced August 2022.

    Comments: 8 pages, 5 figures; Accepted at the 58th Annual Allerton Conference on Communication, Control, and Computing

  21. arXiv:2204.10982  [pdf, ps, other]

    cs.IT

    Continuity and Additivity Properties of Information Decompositions

    Authors: Johannes Rauh, Pradeep Kr. Banerjee, Eckehard Olbrich, Guido Montúfar, Jürgen Jost

    Abstract: Information decompositions quantify how the Shannon information about a given random variable is distributed among several other random variables. Various requirements have been proposed that such a decomposition should satisfy, leading to different candidate solutions. Curiously, however, only two of the original requirements that determined the Shannon information have been considered, namely mo…

    Submitted 9 July, 2023; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: 17 pages

    MSC Class: 94A15; 94A17

    Journal ref: International Journal of Approximate Reasoning, 2023

  22. arXiv:2204.10869  [pdf, other]

    cs.CV

    Identity Preserving Loss for Learned Image Compression

    Authors: Jiuhong Xiao, Lavisha Aggarwal, Prithviraj Banerjee, Manoj Aggarwal, Gerard Medioni

    Abstract: Deep learning model inference on embedded devices is challenging due to the limited availability of computation resources. A popular alternative is to perform model inference on the cloud, which requires transmitting images from the embedded device to the cloud. Image compression techniques are commonly employed in such cloud-based architectures to reduce transmission latency over low bandwidth ne…

    Submitted 26 April, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: Accepted by CVPR 2022 Workshop on New Trends in Image Restoration and Enhancement and Challenges

  23. arXiv:2203.16682  [pdf, other]

    cs.CV cs.CL cs.LG

    To Find Waldo You Need Contextual Cues: Debiasing Who's Waldo

    Authors: Yiran Luo, Pratyay Banerjee, Tejas Gokhale, Yezhou Yang, Chitta Baral

    Abstract: We present a debiased dataset for the Person-centric Visual Grounding (PCVG) task first proposed by Cui et al. (2021) in the Who's Waldo dataset. Given an image and a caption, PCVG requires pairing up a person's name mentioned in a caption with a bounding box that points to the person in the image. We find that the original Who's Waldo dataset compiled for this task contains a large number of bias…

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: Accepted at ACL 2022 (Short Paper)

  24. Strategic Analysis of Griefing Attack in Lightning Network

    Authors: Subhra Mazumdar, Prabal Banerjee, Abhinandan Sinha, Sushmita Ruj, Bimal Roy

    Abstract: Hashed Timelock Contract (HTLC) in Lightning Network is susceptible to a griefing attack. An attacker can block several channels and stall payments by mounting this attack. A state-of-the-art countermeasure, Hashed Timelock Contract with Griefing-Penalty (HTLC-GP) is found to work under the classical assumption of participants being either honest or malicious but fails for rational participants. T…

    Submitted 20 December, 2022; v1 submitted 20 March, 2022; originally announced March 2022.

    Comments: 17 pages, Accepted in IEEE Transactions on Network and Service Management (Special Issue Advances on Blockchain)

  25. arXiv:2201.04933  [pdf, other]

    cond-mat.mtrl-sci cs.LG

    Machine Learning-enhanced Efficient Spectroscopic Ellipsometry Modeling

    Authors: Ayush Arunachalam, S. Novia Berriel, Parag Banerjee, Kanad Basu

    Abstract: Over the recent years, there has been an extensive adoption of Machine Learning (ML) in a plethora of real-world applications, ranging from computer vision to data mining and drug discovery. In this paper, we utilize ML to facilitate efficient film fabrication, specifically Atomic Layer Deposition (ALD). In order to make advances in ALD process development, which is utilized to generate thin films…

    Submitted 8 February, 2022; v1 submitted 1 January, 2022; originally announced January 2022.

  26. arXiv:2110.12231  [pdf, other]

    cs.LG

    Learning curves for Gaussian process regression with power-law priors and targets

    Authors: Hui Jin, Pradeep Kr. Banerjee, Guido Montúfar

    Abstract: We characterize the power-law asymptotics of learning curves for Gaussian process regression (GPR) under the assumption that the eigenspectrum of the prior and the eigenexpansion coefficients of the target function follow a power law. Under similar assumptions, we leverage the equivalence between GPR and kernel ridge regression (KRR) to show the generalization error of KRR. Infinitely wide neural…

    Submitted 27 November, 2021; v1 submitted 23 October, 2021; originally announced October 2021.

    Comments: 76 pages, 7 tables, 6 figures

  27. arXiv:2110.08438  [pdf, other]

    cs.CL cs.AI cs.LG

    Unsupervised Natural Language Inference Using PHL Triplet Generation

    Authors: Neeraj Varshney, Pratyay Banerjee, Tejas Gokhale, Chitta Baral

    Abstract: Transformer-based models achieve impressive performance on numerous Natural Language Inference (NLI) benchmarks when trained on respective training datasets. However, in certain cases, training samples may not be available or collecting them could be time-consuming and resource-intensive. In this work, we address the above challenge and present an explorative study on unsupervised NLI, a paradigm…

    Submitted 15 March, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: ACL 2022 Findings

  28. arXiv:2110.07165  [pdf, other]

    cs.CV cs.CL

    Semantically Distributed Robust Optimization for Vision-and-Language Inference

    Authors: Tejas Gokhale, Abhishek Chaudhary, Pratyay Banerjee, Chitta Baral, Yezhou Yang

    Abstract: Analysis of vision-and-language models has revealed their brittleness under linguistic phenomena such as paraphrasing, negation, textual entailment, and word substitutions with synonyms or antonyms. While data augmentation techniques have been designed to mitigate against these failure modes, methods that can integrate this knowledge into the training pipeline remain under-explored. In this paper,…

    Submitted 14 March, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: Findings of ACL 2022; code available at https://github.com/ASU-APG/VLI_SDRO

  29. arXiv:2109.04014  [pdf, other]

    cs.CL

    Weakly-Supervised Visual-Retriever-Reader for Knowledge-based Question Answering

    Authors: Man Luo, Yankai Zeng, Pratyay Banerjee, Chitta Baral

    Abstract: Knowledge-based visual question answering (VQA) requires answering questions with external knowledge in addition to the content of images. One dataset that is mostly used in evaluating knowledge-based VQA is OK-VQA, but it lacks a gold standard knowledge corpus for retrieval. Existing work leverages different knowledge bases (e.g., ConceptNet and Wikipedia) to obtain external knowledge. Because of…

    Submitted 8 September, 2021; originally announced September 2021.

    Comments: accepted at EMNLP 2021

  30. arXiv:2109.01934  [pdf, other]

    cs.CV cs.CL cs.LG

    Weakly Supervised Relative Spatial Reasoning for Visual Question Answering

    Authors: Pratyay Banerjee, Tejas Gokhale, Yezhou Yang, Chitta Baral

    Abstract: Vision-and-language (V&L) reasoning necessitates perception of visual concepts such as objects and actions, understanding semantics and language grounding, and reasoning about the interplay between the two modalities. One crucial aspect of visual reasoning is spatial understanding, which involves understanding relative locations of objects, i.e. implicitly learning the geometry of the scene. In…

    Submitted 4 September, 2021; originally announced September 2021.

    Comments: Accepted to ICCV 2021. PaperId : ICCV2021-10857 Copyright transferred to IEEE ICCV. DOI will be updated later

  31. arXiv:2105.14357  [pdf, other]

    cs.CL cs.AI cs.CR

    Constructing Flow Graphs from Procedural Cybersecurity Texts

    Authors: Kuntal Kumar Pal, Kazuaki Kashihara, Pratyay Banerjee, Swaroop Mishra, Ruoyu Wang, Chitta Baral

    Abstract: Following procedural texts written in natural languages is challenging. We must read the whole text to identify the relevant information or identify the instruction flows to complete a task, which is prone to failures. If such texts are structured, we can readily visualize instruction-flows, reason or infer a particular step, or even build automated systems to help novice agents achieve a goal. Ho…

    Submitted 29 May, 2021; originally announced May 2021.

    Comments: 13 pages, 5 pages, accepted in the Findings of ACL 2021

  32. arXiv:2105.12392  [pdf, other]

    cs.CL

    Unsupervised Pronoun Resolution via Masked Noun-Phrase Prediction

    Authors: Ming Shen, Pratyay Banerjee, Chitta Baral

    Abstract: In this work, we propose Masked Noun-Phrase Prediction (MNPP), a pre-training strategy to tackle pronoun resolution in a fully unsupervised setting. Firstly, we evaluate our pre-trained model on various pronoun resolution datasets without any finetuning. Our method outperforms all previous unsupervised methods on all datasets by large margins. Secondly, we proceed to a few-shot setting where we fi…

    Submitted 28 May, 2021; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: Accepted to ACL2021

  33. Information Complexity and Generalization Bounds

    Authors: Pradeep Kr. Banerjee, Guido Montúfar

    Abstract: We present a unifying picture of PAC-Bayesian and mutual information-based upper bounds on the generalization error of randomized learning algorithms. As we show, Tong Zhang's information exponential inequality (IEI) gives a general recipe for constructing bounds of both flavors. We show that several important results in the literature can be obtained as simple corollaries of the IEI under differe…

    Submitted 23 October, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

    Comments: To appear in 2021 IEEE International Symposium on Information Theory (ISIT); 23 pages

    MSC Class: 68Q32; 68T05; 94A15

    ACM Class: I.2.6; G.3

  34. arXiv:2103.12801  [pdf, other]

    cs.LG cs.CL cs.CR

    Variable Name Recovery in Decompiled Binary Code using Constrained Masked Language Modeling

    Authors: Pratyay Banerjee, Kuntal Kumar Pal, Fish Wang, Chitta Baral

    Abstract: Decompilation is the procedure of transforming binary programs into a high-level representation, such as source code, for human analysts to examine. While modern decompilers can reconstruct and recover much information that is discarded during compilation, inferring variable names is still extremely difficult. Inspired by recent advances in natural language processing, we propose a novel solution…

    Submitted 23 March, 2021; originally announced March 2021.

    Comments: Work In Progress

  35. arXiv:2103.11263  [pdf, other]

    cs.CL cs.LG

    Self-Supervised Test-Time Learning for Reading Comprehension

    Authors: Pratyay Banerjee, Tejas Gokhale, Chitta Baral

    Abstract: Recent work on unsupervised question answering has shown that models can be trained with procedurally generated question-answer pairs and can achieve performance competitive with supervised methods. In this work, we consider the task of unsupervised reading comprehension and present a method that performs "test-time learning" (TTL) on a given context (text passage), without requiring training on l…

    Submitted 20 March, 2021; originally announced March 2021.

    Comments: Accepted to NAACL 2021

  36. Three dimensional unique identifier based automated georeferencing and coregistration of point clouds in underground environment

    Authors: Sarvesh Kumar Singh, Bikram Pratap Banerjee, Simit Raval

    Abstract: Spatially and geometrically accurate laser scans are essential in modelling infrastructure for applications in civil, mining and transportation. Monitoring of underground or indoor environments such as mines or tunnels is challenging due to unavailability of a sensor positioning framework, complicated structurally symmetric layouts, repetitive features and occlusions. Current practices largely inc…

    Submitted 21 February, 2021; originally announced February 2021.

    Comments: 26 pages, 10 figures

    ACM Class: I.4.9

    Journal ref: Remote Sensing. 2021; 13(16):3145

  37. arXiv:2012.09938  [pdf, other]

    cs.CL cs.AI

    Can Transformers Reason About Effects of Actions?

    Authors: Pratyay Banerjee, Chitta Baral, Man Luo, Arindam Mitra, Kuntal Pal, Tran C. Son, Neeraj Varshney

    Abstract: A recent work has shown that transformers are able to "reason" with facts and rules in a limited setting where the rules are natural language expressions of conjunctions of conditions implying a conclusion. Since this suggests that transformers may be used for reasoning with knowledge given in natural language, we do a rigorous evaluation of this with respect to a common form of knowledge and its…

    Submitted 17 December, 2020; originally announced December 2020.

  38. arXiv:2012.03354  [pdf, other]

    cs.SI cs.AI cs.IT

    Maximizing Social Welfare in a Competitive Diffusion Model

    Authors: Prithu Banerjee, Wei Chen, Laks V. S. Lakshmanan

    Abstract: Influence maximization (IM) has garnered a lot of attention in the literature owing to applications such as viral marketing and infection containment. It aims to select a small number of seed users to adopt an item such that adoption propagates to a large number of users in the network. Competitive IM focuses on the propagation of competing items in the network. Existing works on competitive IM ha…

    Submitted 6 December, 2020; originally announced December 2020.

  39. arXiv:2012.02356  [pdf, other]

    cs.CV cs.CL

    WeaQA: Weak Supervision via Captions for Visual Question Answering

    Authors: Pratyay Banerjee, Tejas Gokhale, Yezhou Yang, Chitta Baral

    Abstract: Methodologies for training visual question answering (VQA) models assume the availability of datasets with human-annotated Image-Question-Answer (I-Q-A) triplets. This has led to heavy reliance on datasets and a lack of generalization to new types of questions and scenes. Linguistic priors along with biases and errors due to annotator subjectivity have been shown to percolate into VQA mod…

    Submitted 28 May, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: Accepted in Findings of ACL 2021

  40. arXiv:2010.04833  [pdf, other]

    cs.CY cs.SI econ.GN

    Pandemic Lessons -- Devising an assessment framework to analyse policies for sustainability

    Authors: Pradipta Banerjee, Subhrabrata Choudhury

    Abstract: COVID-19 pandemic has sharply projected the globally persistent multi-dimensional fundamental challenges in securing general socio-economic wellbeing of the society. The problems intensify with increasing population densities and also vary with several socio-economic-geo-cultural activity parameters. These problems directly highlight the urgent need for accomplishing the interdependent United Nati…

    Submitted 24 May, 2021; v1 submitted 9 October, 2020; originally announced October 2020.

    Comments: 11 pages

  41. arXiv:2010.03677  [pdf, other]

    cs.CY cs.SI econ.GN

    Agent Based Computational Model Aided Approach to Improvise the Inequality-Adjusted Human Development Index (IHDI) for Greater Parity in Real Scenario Assessments

    Authors: Pradipta Banerjee, Subhrabrata Choudhury

    Abstract: To design, evaluate and tune policies for all-inclusive human development, the primary requisite is to assess the true state of affairs of the society. Statistical indices like GDP, Gini Coefficients have been developed to accomplish the evaluation of the socio-economic systems. They have remained prevalent in the conventional economic theories but little do they have in the offing regarding true…

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: 8 pages, 4 figures

  42. arXiv:2009.11033  [pdf, other]

    cs.CR cs.DC

    Reliable, Fair and Decentralized Marketplace for Content Sharing Using Blockchain

    Authors: Prabal Banerjee, Chander Govindarajan, Praveen Jayachandran, Sushmita Ruj

    Abstract: Content sharing platforms such as YouTube and Vimeo have promoted pay-per-view models for artists to monetize their content. Yet, artists remain at the mercy of centralized platforms that control content listing and advertisement, with little transparency and fairness in terms of number of views or revenue. On the other hand, consumers are distanced from the publishers and cannot authenticate orig…

    Submitted 23 September, 2020; originally announced September 2020.

  43. arXiv:2009.08566  [pdf, other]

    cs.CV cs.CL

    MUTANT: A Training Paradigm for Out-of-Distribution Generalization in Visual Question Answering

    Authors: Tejas Gokhale, Pratyay Banerjee, Chitta Baral, Yezhou Yang

    Abstract: While progress has been made on the visual question answering leaderboards, models often utilize spurious correlations and priors in datasets under the i.i.d. setting. As such, evaluation on out-of-distribution (OOD) test samples has emerged as a proxy for generalization. In this paper, we present MUTANT, a training paradigm that exposes the model to perceptually similar, yet semantically distinct…

    Submitted 15 October, 2020; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: Accepted to EMNLP 2020, Long Papers

  44. arXiv:2005.09327  [pdf, other]

    cs.CR

    Griefing-Penalty: Countermeasure for Griefing Attack in Lightning Network

    Authors: Subhra Mazumdar, Prabal Banerjee, Sushmita Ruj

    Abstract: The Lightning Network can execute an unlimited number of off-chain payments without incurring the cost of recording each of them in the blockchain. However, conditional payments in such networks are susceptible to the Griefing Attack. In this attack, an adversary does not resolve the payment, with the intention of blocking channel capacity of the network. We propose an efficient countermeasure for the attack,…

    Submitted 16 June, 2021; v1 submitted 19 May, 2020; originally announced May 2020.

    Comments: 29 pages, 20 figures, 2 tables. A preliminary version of the paper was accepted in the proceedings of the 19th IEEE International Conference on Trust, Security and Privacy in Computing and Communications (IEEE TrustCom 2020), DOI Bookmark: 10.1109/TrustCom50675.2020.00138

  45. arXiv:2005.04612  [pdf, other]

    cs.LG stat.ML

    A machine learning based heuristic to predict the efficacy of online sale

    Authors: Aditya Vikram Singhania, Saronyo Lal Mukherjee, Ritajit Majumdar, Akash Mehta, Priyanka Banerjee, Debasmita Bhoumik

    Abstract: It is difficult to decide upon the efficacy of an online sale simply from the discount offered on commodities. Different features influence the price of a product differently, which must be taken into consideration when determining the significance of a discount. In this paper we propose a machine learning based heuristic to quantify the "significance" of the discount offered o…

    Submitted 10 May, 2020; originally announced May 2020.

    Comments: Paper selected for Oral presentation at the 2nd International Conference on Emerging Technologies in Data Mining and Information Security (IEMIS 2020). Will appear in Springer Advances in Intelligent Systems and Computing (AISC) Series

  46. arXiv:2005.00316  [pdf, other]

    cs.CL cs.AI cs.LG

    Self-supervised Knowledge Triplet Learning for Zero-shot Question Answering

    Authors: Pratyay Banerjee, Chitta Baral

    Abstract: The aim of all Question Answering (QA) systems is to be able to generalize to unseen questions. Current supervised methods are reliant on expensive data annotation. Moreover, such annotations can introduce unintended annotator bias which makes systems focus more on the bias than the actual task. In this work, we propose Knowledge Triplet Learning (KTL), a self-supervised task over knowledge graphs…

    Submitted 17 September, 2020; v1 submitted 1 May, 2020; originally announced May 2020.

    Comments: Accepted to EMNLP 2020 Long Papers

  47. arXiv:2004.03101  [pdf, other]

    cs.CL cs.AI cs.LG

    Knowledge Fusion and Semantic Knowledge Ranking for Open Domain Question Answering

    Authors: Pratyay Banerjee, Chitta Baral

    Abstract: Open Domain Question Answering requires systems to retrieve external knowledge and perform multi-hop reasoning by composing knowledge spread over multiple sentences. In the recently introduced open domain question answering challenge datasets, QASC and OpenBookQA, we need to perform retrieval of facts and compose facts to correctly answer questions. In our work, we learn a semantic knowledge ranki…

    Submitted 17 April, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: 9 pages, 4 figures, 4 tables

  48. arXiv:2003.05162  [pdf, other]

    cs.CV cs.CL

    Video2Commonsense: Generating Commonsense Descriptions to Enrich Video Captioning

    Authors: Zhiyuan Fang, Tejas Gokhale, Pratyay Banerjee, Chitta Baral, Yezhou Yang

    Abstract: Captioning is a crucial and challenging task for video understanding. In videos that involve active agents such as humans, the agent's actions can bring about myriad changes in the scene. Observable changes such as movements, manipulations, and transformations of the objects in the scene are reflected in conventional video captioning. Unlike images, actions in videos are also inherently linked to…

    Submitted 7 January, 2023; v1 submitted 11 March, 2020; originally announced March 2020.

    Comments: EMNLP 2020. V2C Website: https://asu-apg.github.io/Video2Commonsense/

  49. arXiv:2003.03446  [pdf, other]

    cs.CL cs.AI cs.LG

    Natural Language QA Approaches using Reasoning with External Knowledge

    Authors: Chitta Baral, Pratyay Banerjee, Kuntal Kumar Pal, Arindam Mitra

    Abstract: Question answering (QA) in natural language (NL) has been an important aspect of AI from its early days. Winograd's "councilmen" example in his 1972 paper and McCarthy's Mr. Hug example of 1976 highlight the role of external knowledge in NL understanding. While Machine Learning has been the go-to approach in NL processing as well as NL question answering (NLQA) for the last 30 years, recently t…

    Submitted 6 March, 2020; originally announced March 2020.

    Comments: 6 pages, 3 figures, Work in Progress

  50. arXiv:2002.08325  [pdf, other]

    cs.CV cs.CL

    VQA-LOL: Visual Question Answering under the Lens of Logic

    Authors: Tejas Gokhale, Pratyay Banerjee, Chitta Baral, Yezhou Yang

    Abstract: Logical connectives and their implications on the meaning of a natural language sentence are a fundamental aspect of understanding. In this paper, we investigate whether visual question answering (VQA) systems trained to answer a question about an image are able to answer the logical composition of multiple such questions. When put under this "Lens of Logic", state-of-the-art VQA models ha…

    Submitted 15 July, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: Accepted to ECCV 2020