Skip to main content

Showing 1–17 of 17 results for author: Stan, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.03118  [pdf, other

    cs.CV

    LVLM-Interpret: An Interpretability Tool for Large Vision-Language Models

    Authors: Gabriela Ben Melech Stan, Estelle Aflalo, Raanan Yehezkel Rohekar, Anahita Bhiwandiwalla, Shao-Yen Tseng, Matthew Lyle Olson, Yaniv Gurwicz, Chenfei Wu, Nan Duan, Vasudev Lal

    Abstract: In the rapidly evolving landscape of artificial intelligence, multi-modal large language models are emerging as a significant area of interest. These models, which combine various forms of data input, are becoming increasingly popular. However, understanding their internal mechanisms remains a complex task. Numerous advancements have been made in the field of explainability tools and mechanisms, y… ▽ More

    Submitted 24 June, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

  2. arXiv:2404.01197  [pdf, other

    cs.CV

    Getting it Right: Improving Spatial Consistency in Text-to-Image Models

    Authors: Agneet Chatterjee, Gabriela Ben Melech Stan, Estelle Aflalo, Sayak Paul, Dhruba Ghosh, Tejas Gokhale, Ludwig Schmidt, Hannaneh Hajishirzi, Vasudev Lal, Chitta Baral, Yezhou Yang

    Abstract: One of the key shortcomings in current text-to-image (T2I) models is their inability to consistently generate images which faithfully follow the spatial relationships specified in the text prompt. In this paper, we offer a comprehensive investigation of this limitation, while also developing datasets and methods that achieve state-of-the-art performance. First, we find that current vision-language… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: project webpage : https://spright-t2i.github.io/

  3. Hot-LEGO: Architect Microfluidic Cooling Equipped 3DICs with Pre-RTL Thermal Simulation

    Authors: Runxi Wang, Jun-Han Han, Mircea Stan, Xinfei Guo

    Abstract: Microfluidic cooling has been recognized as one of the most promising solutions to achieve efficient thermal management for three-dimensional integrated circuits (3DICs). It enables more opportunities to architect 3DICs with different die configurations. It becomes increasingly important to perform thermal analysis in the early design phases to validate the architectural design decisions. This is… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Journal ref: The 14th international Green and Sustainable Computing Conference (IGSC'23), Oct 2023

  4. arXiv:2401.00955  [pdf, other

    cs.NE cs.LG

    Learning Long Sequences in Spiking Neural Networks

    Authors: Matei Ioan Stan, Oliver Rhodes

    Abstract: Spiking neural networks (SNNs) take inspiration from the brain to enable energy-efficient computations. Since the advent of Transformers, SNNs have struggled to compete with artificial networks on modern sequential tasks, as they inherit limitations from recurrent neural networks (RNNs), with the added challenge of training with non-differentiable binary spiking activations. However, a recent rene… ▽ More

    Submitted 14 December, 2023; originally announced January 2024.

    Comments: 18 pages, 10 Figures/Tables

  5. arXiv:2311.03226  [pdf, other

    cs.CV cs.AI

    LDM3D-VR: Latent Diffusion Model for 3D VR

    Authors: Gabriela Ben Melech Stan, Diana Wofk, Estelle Aflalo, Shao-Yen Tseng, Zhipeng Cai, Michael Paulitsch, Vasudev Lal

    Abstract: Latent diffusion models have proven to be state-of-the-art in the creation and manipulation of visual outputs. However, as far as we know, the generation of depth maps jointly with RGB is still limited. We introduce LDM3D-VR, a suite of diffusion models targeting virtual reality development that includes LDM3D-pano and LDM3D-SR. These models enable the generation of panoramic RGBD based on textual… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted to Workshop on Diffusion Models, NeurIPS 2023

  6. arXiv:2305.10853  [pdf, other

    cs.CV

    LDM3D: Latent Diffusion Model for 3D

    Authors: Gabriela Ben Melech Stan, Diana Wofk, Scottie Fox, Alex Redden, Will Saxton, Jean Yu, Estelle Aflalo, Shao-Yen Tseng, Fabio Nonato, Matthias Muller, Vasudev Lal

    Abstract: This research paper proposes a Latent Diffusion Model for 3D (LDM3D) that generates both image and depth map data from a given text prompt, allowing users to generate RGBD images from text prompts. The LDM3D model is fine-tuned on a dataset of tuples containing an RGB image, depth map and caption, and validated through extensive experiments. We also develop an application called DepthFusion, which… ▽ More

    Submitted 21 May, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

  7. arXiv:2208.11553  [pdf, other

    cs.CV

    MuMUR : Multilingual Multimodal Universal Retrieval

    Authors: Avinash Madasu, Estelle Aflalo, Gabriela Ben Melech Stan, Shachar Rosenman, Shao-Yen Tseng, Gedas Bertasius, Vasudev Lal

    Abstract: Multi-modal retrieval has seen tremendous progress with the development of vision-language models. However, further improving these models require additional labelled data which is a huge manual effort. In this paper, we propose a framework MuMUR, that utilizes knowledge transfer from a multilingual model to boost the performance of multi-modal (image and video) retrieval. We first use state-of-th… ▽ More

    Submitted 19 September, 2023; v1 submitted 24 August, 2022; originally announced August 2022.

    Comments: This is an extension of the previous MKTVR paper (for which you can find a reference here : https://dl.acm.org/doi/abs/10.1007/978-3-031-28244-7_42 or in a previous version on arxiv). This version was published to the Information Retrieval Journal

  8. arXiv:2011.08673  [pdf

    cs.LG cs.CV

    Flame Stability Analysis of Flame Spray Pyrolysis by Artificial Intelligence

    Authors: Jessica Pan, Joseph A. Libera, Noah H. Paulson, Marius Stan

    Abstract: Flame spray pyrolysis (FSP) is a process used to synthesize nanoparticles through the combustion of an atomized precursor solution; this process has applications in catalysts, battery materials, and pigments. Current limitations revolve around understanding how to consistently achieve a stable flame and the reliable production of nanoparticles. Machine learning and artificial intelligence algorith… ▽ More

    Submitted 22 October, 2020; originally announced November 2020.

    Comments: 25 pages, 8 figures. International Journal of Advanced Manufacturing Technology 2020

    ACM Class: I.2.10; I.4; I.5

  9. Towards Online Steering of Flame Spray Pyrolysis Nanoparticle Synthesis

    Authors: Maksim Levental, Ryan Chard, Joseph A. Libera, Kyle Chard, Aarthi Koripelly, Jakob R. Elias, Marcus Schwarting, Ben Blaiszik, Marius Stan, Santanu Chaudhuri, Ian Foster

    Abstract: Flame Spray Pyrolysis (FSP) is a manufacturing technique to mass produce engineered nanoparticles for applications in catalysis, energy materials, composites, and more. FSP instruments are highly dependent on a number of adjustable parameters, including fuel injection rate, fuel-oxygen mixtures, and temperature, which can greatly affect the quality, quantity, and properties of the yielded nanopart… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

  10. arXiv:2009.04045  [pdf

    cond-mat.mtrl-sci cs.LG physics.comp-ph

    An Experimentally Driven Automated Machine Learned lnter-Atomic Potential for a Refractory Oxide

    Authors: Ganesh Sivaraman, Leighanne Gallington, Anand Narayanan Krishnamoorthy, Marius Stan, Gabor Csanyi, Alvaro Vazquez-Mayagoitia, Chris J. Benmore

    Abstract: Understanding the structure and properties of refractory oxides are critical for high temperature applications. In this work, a combined experimental and simulation approach uses an automated closed loop via an active-learner, which is initialized by X-ray and neutron diffraction measurements, and sequentially improves a machine-learning model until the experimentally predetermined phase space is… ▽ More

    Submitted 8 September, 2020; originally announced September 2020.

    Journal ref: Phys. Rev. Lett. 126, 156002 (2021)

  11. arXiv:2005.10704  [pdf, other

    physics.app-ph cond-mat.mes-hall cs.ET

    Temporal Memory with Magnetic Racetracks

    Authors: Hamed Vakili, Mohammad Nazmus Sakib, Samiran Ganguly, Mircea Stan, Matthew W. Daniels, Advait Madhavan, Mark D. Stiles, Avik W. Ghosh

    Abstract: Race logic is a relative timing code that represents information in a wavefront of digital edges on a set of wires in order to accelerate dynamic programming and machine learning algorithms. Skyrmions, bubbles, and domain walls are mobile magnetic configurations (solitons) with applications for Boolean data storage. We propose to use current-induced displacement of these solitons on magnetic racet… ▽ More

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: 9 pages, 3 figures, submitted for review

  12. Memristive Learning Cellular Automata: Theory and Applications

    Authors: Rafailia-Eleni Karamani, Iosif-Angelos Fyrigos, Vasileios Ntinas, Orestis Liolis, Giorgos Dimitrakopoulos, Mustafa Altun, Andrew Adamatzky, Mircea R. Stan, Georgios Ch. Sirakoulis

    Abstract: Memristors are novel non volatile devices that manage to combine storing and processing capabilities in the same physical place.Their nanoscale dimensions and low power consumption enable the further design of various nanoelectronic processing circuits and corresponding computing architectures, like neuromorhpic, in memory, unconventional, etc.One of the possible ways to exploit the memristor's ad… ▽ More

    Submitted 15 March, 2020; originally announced March 2020.

  13. arXiv:1910.10254  [pdf

    cond-mat.mtrl-sci cs.LG physics.comp-ph

    Machine Learning Inter-Atomic Potentials Generation Driven by Active Learning: A Case Study for Amorphous and Liquid Hafnium dioxide

    Authors: Ganesh Sivaraman, Anand Narayanan Krishnamoorthy, Matthias Baur, Christian Holm, Marius Stan, Gabor Csányi, Chris Benmore, Álvaro Vázquez-Mayagoitia

    Abstract: We propose a novel active learning scheme for automatically sampling a minimum number of uncorrelated configurations for fitting the Gaussian Approximation Potential (GAP). Our active learning scheme consists of an unsupervised machine learning (ML) scheme coupled to Bayesian optimization technique that evaluates the GAP model. We apply this scheme to a Hafnium dioxide (HfO2) dataset generated fro… ▽ More

    Submitted 22 October, 2019; originally announced October 2019.

    Comments: to be submitted NPJ Computational Materials

    Journal ref: npj Computational Materials 6 (2020) 1-8

  14. arXiv:1809.02651  [pdf

    cs.CV cs.ET cs.NE

    Reservoir Computing based Neural Image Filters

    Authors: Samiran Ganguly, Yunfei Gu, Yunkun Xie, Mircea R. Stan, Avik W. Ghosh, Nibir K. Dhar

    Abstract: Clean images are an important requirement for machine vision systems to recognize visual features correctly. However, the environment, optics, electronics of the physical imaging systems can introduce extreme distortions and noise in the acquired images. In this work, we explore the use of reservoir computing, a dynamical neural network model inspired from biological systems, in creating dynamic i… ▽ More

    Submitted 7 September, 2018; originally announced September 2018.

    Comments: 5 pages, 4 figures, To appear in Conference Proceedings of The 44th Annual Conference of IEEE Industrial Electronics Society (2018): Special Session on Machine Vision, Control and Navigation

  15. arXiv:1803.08635  [pdf, other

    cs.CV cs.ET cs.NE

    Hardware based Spatio-Temporal Neural Processing Backend for Imaging Sensors: Towards a Smart Camera

    Authors: Samiran Ganguly, Yunfei Gu, Mircea R. Stan, Avik W. Ghosh

    Abstract: In this work we show how we can build a technology platform for cognitive imaging sensors using recent advances in recurrent neural network architectures and training methods inspired from biology. We demonstrate learning and processing tasks specific to imaging sensors, including enhancement of sensitivity and signal-to-noise ratio (SNR) purely through neural filtering beyond the fundamental limi… ▽ More

    Submitted 22 March, 2018; originally announced March 2018.

    Comments: 11 pages, 5 figures. To be presented in SPIE DCS 2018: Image Sensing Technologies: Materials, Devices, Systems, and Applications V

  16. Tolerating Soft Errors in Processor Cores Using CLEAR (Cross-Layer Exploration for Architecting Resilience)

    Authors: Eric Cheng, Shahrzad Mirkhani, Lukasz G. Szafaryn, Chen-Yong Cher, Hyungmin Cho, Kevin Skadron, Mircea R. Stan, Klas Lilja, Jacob A. Abraham, Pradip Bose, Subhasish Mitra

    Abstract: We present CLEAR (Cross-Layer Exploration for Architecting Resilience), a first of its kind framework which overcomes a major challenge in the design of digital systems that are resilient to reliability failures: achieve desired resilience targets at minimal costs (energy, power, execution time, area) by combining resilience techniques across various layers of the system stack (circuit, logic, arc… ▽ More

    Submitted 28 September, 2017; originally announced September 2017.

    Comments: Unedited version of paper published in Transactions on Computer-Aided Design of Integrated Circuits and Systems

  17. CLEAR: Cross-Layer Exploration for Architecting Resilience - Combining Hardware and Software Techniques to Tolerate Soft Errors in Processor Cores

    Authors: Eric Cheng, Shahrzad Mirkhani, Lukasz G. Szafaryn, Chen-Yong Cher, Hyungmin Cho, Kevin Skadron, Mircea R. Stan, Klas Lilja, Jacob A. Abraham, Pradip Bose, Subhasish Mitra

    Abstract: We present a first of its kind framework which overcomes a major challenge in the design of digital systems that are resilient to reliability failures: achieve desired resilience targets at minimal costs (energy, power, execution time, area) by combining resilience techniques across various layers of the system stack (circuit, logic, architecture, software, algorithm). This is also referred to as… ▽ More

    Submitted 23 June, 2016; v1 submitted 11 April, 2016; originally announced April 2016.

    Comments: Extended version of paper published in Proceedings of the 53rd Annual Design Automation Conference

    ACM Class: B.8.1