Search | arXiv e-print repository

arXiv:2407.18892 [pdf, other]

SHANGUS: Deep Reinforcement Learning Meets Heuristic Optimization for Speedy Frontier-Based Exploration of Autonomous Vehicles in Unknown Spaces

Authors: Seunghyeop Nam, Tuan Anh Nguyen, Eunmi Choi, Dugki Min

Abstract: This paper introduces SHANGUS, an advanced framework combining Deep Reinforcement Learning (DRL) with heuristic optimization to improve frontier-based exploration efficiency in unknown environments, particularly for intelligent vehicles in autonomous air services, search and rescue operations, and space exploration robotics. SHANGUS harnesses DRL's adaptability and heuristic prioritization, marked… ▽ More This paper introduces SHANGUS, an advanced framework combining Deep Reinforcement Learning (DRL) with heuristic optimization to improve frontier-based exploration efficiency in unknown environments, particularly for intelligent vehicles in autonomous air services, search and rescue operations, and space exploration robotics. SHANGUS harnesses DRL's adaptability and heuristic prioritization, markedly enhancing exploration efficiency, reducing completion time, and minimizing travel distance. The strategy involves a frontier selection node to identify unexplored areas and a DRL navigation node using the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm for robust path planning and dynamic obstacle avoidance. Extensive experiments in ROS2 and Gazebo simulation environments show SHANGUS surpasses representative traditional methods like the Nearest Frontier (NF), Novel Frontier-Based Exploration Algorithm (CFE), and Goal-Driven Autonomous Exploration (GDAE) algorithms, especially in complex scenarios, excelling in completion time, travel distance, and exploration rate. This scalable solution is suitable for real-time autonomous navigation in fields such as industrial automation, autonomous driving, household robotics, and space exploration. Future research will integrate additional sensory inputs and refine heuristic functions to further boost SHANGUS's efficiency and robustness. △ Less

Submitted 26 July, 2024; originally announced July 2024.

arXiv:2407.09779 [pdf, other]

Layout-and-Retouch: A Dual-stage Framework for Improving Diversity in Personalized Image Generation

Authors: Kangyeol Kim, Wooseok Seo, Sehyun Nam, Bodam Kim, Suhyeon Jeong, Wonwoo Cho, Jaegul Choo, Youngjae Yu

Abstract: Personalized text-to-image (P-T2I) generation aims to create new, text-guided images featuring the personalized subject with a few reference images. However, balancing the trade-off relationship between prompt fidelity and identity preservation remains a critical challenge. To address the issue, we propose a novel P-T2I method called Layout-and-Retouch, consisting of two stages: 1) layout generati… ▽ More Personalized text-to-image (P-T2I) generation aims to create new, text-guided images featuring the personalized subject with a few reference images. However, balancing the trade-off relationship between prompt fidelity and identity preservation remains a critical challenge. To address the issue, we propose a novel P-T2I method called Layout-and-Retouch, consisting of two stages: 1) layout generation and 2) retouch. In the first stage, our step-blended inference utilizes the inherent sample diversity of vanilla T2I models to produce diversified layout images, while also enhancing prompt fidelity. In the second stage, multi-source attention swapping integrates the context image from the first stage with the reference image, leveraging the structure from the context image and extracting visual features from the reference image. This achieves high prompt fidelity while preserving identity characteristics. Through our extensive experiments, we demonstrate that our method generates a wide variety of images with diverse layouts while maintaining the unique identity features of the personalized objects, even with challenging text prompts. This versatility highlights the potential of our framework to handle complex conditions, significantly enhancing the diversity and applicability of personalized image synthesis. △ Less

Submitted 13 July, 2024; originally announced July 2024.

arXiv:2407.01034 [pdf, other]

Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert

Authors: Han EunGi, Oh Hyun-Bin, Kim Sung-Bin, Corentin Nivelet Etcheberry, Suekyeong Nam, Janghoon Joo, Tae-Hyun Oh

Abstract: Speech-driven 3D facial animation has recently garnered attention due to its cost-effective usability in multimedia production. However, most current advances overlook the intelligibility of lip movements, limiting the realism of facial expressions. In this paper, we introduce a method for speech-driven 3D facial animation to generate accurate lip movements, proposing an audio-visual multimodal pe… ▽ More Speech-driven 3D facial animation has recently garnered attention due to its cost-effective usability in multimedia production. However, most current advances overlook the intelligibility of lip movements, limiting the realism of facial expressions. In this paper, we introduce a method for speech-driven 3D facial animation to generate accurate lip movements, proposing an audio-visual multimodal perceptual loss. This loss provides guidance to train the speech-driven 3D facial animators to generate plausible lip motions aligned with the spoken transcripts. Furthermore, to incorporate the proposed audio-visual perceptual loss, we devise an audio-visual lip reading expert leveraging its prior knowledge about correlations between speech and lip motions. We validate the effectiveness of our approach through broad experiments, showing noticeable improvements in lip synchronization and lip readability performance. Codes are available at https://3d-talking-head-avguide.github.io/. △ Less

Submitted 1 July, 2024; originally announced July 2024.

Comments: INTERSPEECH 2024

arXiv:2406.14272 [pdf, other]

MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset

Authors: Kim Sung-Bin, Lee Chae-Yeon, Gihun Son, Oh Hyun-Bin, Janghoon Ju, Suekyeong Nam, Tae-Hyun Oh

Abstract: Recent studies in speech-driven 3D talking head generation have achieved convincing results in verbal articulations. However, generating accurate lip-syncs degrades when applied to input speech in other languages, possibly due to the lack of datasets covering a broad spectrum of facial movements across languages. In this work, we introduce a novel task to generate 3D talking heads from speeches of… ▽ More Recent studies in speech-driven 3D talking head generation have achieved convincing results in verbal articulations. However, generating accurate lip-syncs degrades when applied to input speech in other languages, possibly due to the lack of datasets covering a broad spectrum of facial movements across languages. In this work, we introduce a novel task to generate 3D talking heads from speeches of diverse languages. We collect a new multilingual 2D video dataset comprising over 420 hours of talking videos in 20 languages. With our proposed dataset, we present a multilingually enhanced model that incorporates language-specific style embeddings, enabling it to capture the unique mouth movements associated with each language. Additionally, we present a metric for assessing lip-sync accuracy in multilingual settings. We demonstrate that training a 3D talking head model with our proposed dataset significantly enhances its multilingual performance. Codes and datasets are available at https://multi-talk.github.io/. △ Less

Submitted 20 June, 2024; originally announced June 2024.

Comments: Interspeech 2024

arXiv:2406.13251 [pdf, other]

Freq-Mip-AA : Frequency Mip Representation for Anti-Aliasing Neural Radiance Fields

Authors: Youngin Park, Seungtae Nam, Cheul-hee Hahm, Eunbyung Park

Abstract: Neural Radiance Fields (NeRF) have shown remarkable success in representing 3D scenes and generating novel views. However, they often struggle with aliasing artifacts, especially when rendering images from different camera distances from the training views. To address the issue, Mip-NeRF proposed using volumetric frustums to render a pixel and suggested integrated positional encoding (IPE). While… ▽ More Neural Radiance Fields (NeRF) have shown remarkable success in representing 3D scenes and generating novel views. However, they often struggle with aliasing artifacts, especially when rendering images from different camera distances from the training views. To address the issue, Mip-NeRF proposed using volumetric frustums to render a pixel and suggested integrated positional encoding (IPE). While effective, this approach requires long training times due to its reliance on MLP architecture. In this work, we propose a novel anti-aliasing technique that utilizes grid-based representations, usually showing significantly faster training time. In addition, we exploit frequency-domain representation to handle the aliasing problem inspired by the sampling theorem. The proposed method, FreqMipAA, utilizes scale-specific low-pass filtering (LPF) and learnable frequency masks. Scale-specific low-pass filters (LPF) prevent aliasing and prioritize important image details, and learnable masks effectively remove problematic high-frequency elements while retaining essential information. By employing a scale-specific LPF and trainable masks, FreqMipAA can effectively eliminate the aliasing factor while retaining important details. We validated the proposed technique by incorporating it into a widely used grid-based method. The experimental results have shown that the FreqMipAA effectively resolved the aliasing issues and achieved state-of-the-art results in the multi-scale Blender dataset. Our code is available at https://github.com/yi0109/FreqMipAA . △ Less

Submitted 19 June, 2024; originally announced June 2024.

Comments: Accepted to ICIP 2024, 7 pages, 3 figures

arXiv:2406.12904 [pdf, other]

Meent: Differentiable Electromagnetic Simulator for Machine Learning

Authors: Yongha Kim, Anthony W. Jung, Sanmun Kim, Kevin Octavian, Doyoung Heo, Chaejin Park, Jeongmin Shin, Sunghyun Nam, Chanhyung Park, Juho Park, Sangjun Han, Jinmyoung Lee, Seolho Kim, Min Seok Jang, Chan Y. Park

Abstract: Electromagnetic (EM) simulation plays a crucial role in analyzing and designing devices with sub-wavelength scale structures such as solar cells, semiconductor devices, image sensors, future displays and integrated photonic devices. Specifically, optics problems such as estimating semiconductor device structures and designing nanophotonic devices provide intriguing research topics with far-reachin… ▽ More Electromagnetic (EM) simulation plays a crucial role in analyzing and designing devices with sub-wavelength scale structures such as solar cells, semiconductor devices, image sensors, future displays and integrated photonic devices. Specifically, optics problems such as estimating semiconductor device structures and designing nanophotonic devices provide intriguing research topics with far-reaching real world impact. Traditional algorithms for such tasks require iteratively refining parameters through simulations, which often yield sub-optimal results due to the high computational cost of both the algorithms and EM simulations. Machine learning (ML) emerged as a promising candidate to mitigate these challenges, and optics research community has increasingly adopted ML algorithms to obtain results surpassing classical methods across various tasks. To foster a synergistic collaboration between the optics and ML communities, it is essential to have an EM simulation software that is user-friendly for both research communities. To this end, we present Meent, an EM simulation software that employs rigorous coupled-wave analysis (RCWA). Developed in Python and equipped with automatic differentiation (AD) capabilities, Meent serves as a versatile platform for integrating ML into optics research and vice versa. To demonstrate its utility as a research platform, we present three applications of Meent: 1) generating a dataset for training neural operator, 2) serving as an environment for the reinforcement learning of nanophotonic device optimization, and 3) providing a solution for inverse problems with gradient-based optimizers. These applications highlight Meent's potential to advance both EM simulation and ML methodologies. The code is available at https://github.com/kc-ml2/meent with the MIT license to promote the cross-polinations of ideas among academic researchers and industry practitioners. △ Less

Submitted 11 June, 2024; originally announced June 2024.

Comments: under review

arXiv:2406.04661 [pdf, other]

doi 10.1038/s41467-022-29376-4

Quantum channel correction outperforming direct transmission

Authors: Sergei Slussarenko, Morgan M. Weston, Lynden K. Shalm, Varun B. Verma, Sae-Woo Nam, Sacha Kocsis, Timothy C. Ralph, Geoff J. Pryde

Abstract: Long-distance optical quantum channels are necessarily lossy, leading to errors in transmitted quantum information, entanglement degradation and, ultimately, poor protocol performance. Quantum states carrying information in the channel can be probabilistically amplified to compensate for loss, but are destroyed when amplification fails. Quantum correction of the channel itself is therefore require… ▽ More Long-distance optical quantum channels are necessarily lossy, leading to errors in transmitted quantum information, entanglement degradation and, ultimately, poor protocol performance. Quantum states carrying information in the channel can be probabilistically amplified to compensate for loss, but are destroyed when amplification fails. Quantum correction of the channel itself is therefore required, but break-even performance -- where arbitrary states can be better transmitted through a corrected channel than an uncorrected one -- has so far remained out of reach. Here we perform distillation by heralded amplification to improve a noisy entanglement channel. We subsequently employ entanglement swapping to demonstrate that arbitrary quantum information transmission is unconditionally improved -- i.e. without relying on postselection or post-processing of data -- compared to the uncorrected channel. In this way, it represents realisation of a genuine quantum relay. Our channel correction for single-mode quantum states will find use in quantum repeater, communication and metrology applications. △ Less

Submitted 7 June, 2024; originally announced June 2024.

Comments: 11 pages, 6 figures, supplementary included

Journal ref: Nature Communications 13, 1832 (2022)

arXiv:2405.15017 [pdf, ps, other]

Kinetic inductance current sensor for visible to near-infrared wavelength transition-edge sensor readout

Authors: Paul Szypryt, Douglas A. Bennett, Ian Fogarty Florang, Joseph W. Fowler, Andrea Giachero, Ruslan Hummatov, Adriana E. Lita, John A. B. Mates, Sae Woo Nam, Galen C. O'Neil, Daniel S. Swetz, Joel N. Ullom, Michael R. Vissers, Jordan Wheeler, Jiansong Gao

Abstract: Single-photon detectors based on the superconducting transition-edge sensor (TES) are used in a number of visible to near-infrared (VNIR) applications, particularly for photon-number-resolving measurements in quantum information science. To be practical for large-scale photonic quantum computing or for future spectroscopic imaging applications in astronomy, the size of VNIR TES arrays must be incr… ▽ More Single-photon detectors based on the superconducting transition-edge sensor (TES) are used in a number of visible to near-infrared (VNIR) applications, particularly for photon-number-resolving measurements in quantum information science. To be practical for large-scale photonic quantum computing or for future spectroscopic imaging applications in astronomy, the size of VNIR TES arrays must be increased from a few pixels to many thousands. Historically, TES arrays have been read out with multiplexed superconducting quantum interference devices (SQUIDs), but the microsecond-duration pulse signals of VNIR TESs are notoriously difficult to multiplex. In this manuscript, we introduce the kinetic inductance current sensor (KICS), a more readily scalable readout technology that exploits the nonlinear kinetic inductance in a superconducting resonator to make sensitive current measurements. KICS devices can replace SQUIDs for many applications because of their ability to measure fast, high slew-rate signals, their compatibility with standard microwave frequency-division multiplexing techniques, and their relatively simple fabrication. Here, we demonstrate the readout of a VNIR TES using a KICS with $3.7$ $\text{MHz}$ of bandwidth. We measure a readout noise of $1.4$ $\text{pA}/\sqrt{\text{Hz}}$, considerably below the TES noise at frequencies of interest, and a TES energy resolution of $(0.137 \pm 0.001)$ $\text{eV}$ at $0.8$ $\text{eV}$, comparable to resolutions observed with non-multiplexed SQUID readouts. △ Less

Submitted 23 May, 2024; originally announced May 2024.

Comments: 14 pages, 8 figures

arXiv:2405.05867 [pdf, ps, other]

Quasisymmetric Schur $Q$-functions and peak Young quasisymmetric Schur functions

Authors: Seung-Il Choi, Sun-Young Nam, Young-Tak Oh

Abstract: In this paper, we explore the relationship between quasisymmetric Schur $Q$-functions and peak Young quasisymmetric Schur functions. We introduce a bijection on $\mathsf{SPIT}(α)$ such that $\{\mathrm{w}_{\rm c}(T) \mid T \in \mathsf{SPIT}(α)\}$ and $\{\mathrm{w}_{\rm r}(T) \mid T \in \mathsf{SPIT}(α)\}$ share identical descent distributions. Here, $\mathsf{SPIT}(α)$ is the set of standard peak im… ▽ More In this paper, we explore the relationship between quasisymmetric Schur $Q$-functions and peak Young quasisymmetric Schur functions. We introduce a bijection on $\mathsf{SPIT}(α)$ such that $\{\mathrm{w}_{\rm c}(T) \mid T \in \mathsf{SPIT}(α)\}$ and $\{\mathrm{w}_{\rm r}(T) \mid T \in \mathsf{SPIT}(α)\}$ share identical descent distributions. Here, $\mathsf{SPIT}(α)$ is the set of standard peak immaculate tableaux of shape $α$, and $\mathrm{w}_{\rm c}$ and $\mathrm{w}_{\rm r}$ denote column reading and row reading, respectively. By combining this equidistribution with the algorithm developed by Allen, Hallam, and Mason, we demonstrate that the transition matrix from the basis of quasisymmetric Schur $Q$-functions to the basis of peak Young quasisymmetric Schur functions is upper triangular, with entries being non-negative integers. Furthermore, we provide explicit descriptions of the expansion of peak Young quasisymmetric Schur functions in specific cases, in terms of quasisymmetric Schur $Q$-functions. We also investigate the combinatorial properties of standard peak immaculate tableaux, standard Young composition tableaux, and standard peak Young composition tableaux. We provide a hook length formula for $\mathsf{SPIT}(α)$ and show that standard Young composition tableaux and standard peak Young composition tableaux can be bijectively mapped to specific words in a familiar form. Especially, cases of compositions with rectangular shape are examined in detail. △ Less

Submitted 9 May, 2024; originally announced May 2024.

Comments: 51 pages

MSC Class: 20C08; 05E05; 05E10

arXiv:2405.02116 [pdf, other]

Intriguing aspects of light baryon resonances

Authors: K. P. Khemchandani, A. Martinez Torres, Sang-Ho Kim, Seung-il Nam, A. Hosaka, H. Nagahiro

Abstract: We discuss that some light baryon resonances exhibit properties which cannot be described when attributing a three-valence quark structure to them. Besides pointing out the hadron resonances which clearly require description beyond the quark model, we focus on the third $s_{11},~ N^*$ state and its decay to final states consisting of the lightest hyperon resonances which have a partial width compa… ▽ More We discuss that some light baryon resonances exhibit properties which cannot be described when attributing a three-valence quark structure to them. Besides pointing out the hadron resonances which clearly require description beyond the quark model, we focus on the third $s_{11},~ N^*$ state and its decay to final states consisting of the lightest hyperon resonances which have a partial width comparable to that for the decay to $πN$. Such properties of the mentioned nucleon resonance get manifested in the cross sections and other observables related to processes producing the lightest hyperon resonances. We show that all these findings arise from the strong association of the baryon resonances to the dynamics among the ground-state hadrons. △ Less

Submitted 3 May, 2024; originally announced May 2024.

Comments: Proceedings for the XLV Symposium on Nuclear Physics, held in Cocoyoc, Morelos, Mexico

arXiv:2404.11810 [pdf, other]

Holographic Parallax Improves 3D Perceptual Realism

Authors: Dongyeon Kim, Seung-Woo Nam, Suyeon Choi, Jong-Mo Seo, Gordon Wetzstein, Yoonchan Jeong

Abstract: Holographic near-eye displays are a promising technology to solve long-standing challenges in virtual and augmented reality display systems. Over the last few years, many different computer-generated holography (CGH) algorithms have been proposed that are supervised by different types of target content, such as 2.5D RGB-depth maps, 3D focal stacks, and 4D light fields. It is unclear, however, what… ▽ More Holographic near-eye displays are a promising technology to solve long-standing challenges in virtual and augmented reality display systems. Over the last few years, many different computer-generated holography (CGH) algorithms have been proposed that are supervised by different types of target content, such as 2.5D RGB-depth maps, 3D focal stacks, and 4D light fields. It is unclear, however, what the perceptual implications are of the choice of algorithm and target content type. In this work, we build a perceptual testbed of a full-color, high-quality holographic near-eye display. Under natural viewing conditions, we examine the effects of various CGH supervision formats and conduct user studies to assess their perceptual impacts on 3D realism. Our results indicate that CGH algorithms designed for specific viewpoints exhibit noticeable deficiencies in achieving 3D realism. In contrast, holograms incorporating parallax cues consistently outperform other formats across different viewing conditions, including the center of the eyebox. This finding is particularly interesting and suggests that the inclusion of parallax cues in CGH rendering plays a crucial role in enhancing the overall quality of the holographic experience. This work represents an initial stride towards delivering a perceptually realistic 3D experience with holographic near-eye displays. △ Less

Submitted 17 April, 2024; originally announced April 2024.

Comments: 33 pages, 34 figures

arXiv:2404.04078 [pdf, other]

Strangeness $+1$ light multiquark baryons: a jinx?

Authors: Brenda B. Malabarba, K. P. Khemchandani, A. Martinez Torres, Seung-il Nam

Abstract: In view of the renewing experimental interest for searching strangeness $+1$ baryons at J-PARC, we study the existence of light baryon resonances with strangeness +1 generated in the $K$-$(N^*/Δ^*)$ system, where $N^*$ represents either $N^*(1535)$/$N^*(1650)$/$N^*(1700)$, and $Δ^*$ corresponds to $Δ(1620)$. The description of the properties of the aforementioned states requires considering the dy… ▽ More In view of the renewing experimental interest for searching strangeness $+1$ baryons at J-PARC, we study the existence of light baryon resonances with strangeness +1 generated in the $K$-$(N^*/Δ^*)$ system, where $N^*$ represents either $N^*(1535)$/$N^*(1650)$/$N^*(1700)$, and $Δ^*$ corresponds to $Δ(1620)$. The description of the properties of the aforementioned states requires considering the dynamics involved in the coupled pseudoscalar-baryon and vector-baryon systems with strangeness $S=0$ in the s-wave. For the purpose of our current study, we consider the pseudoscalar-baryon (PB) and vector-baryon channels (VB) to which the mentioned $N^*$ and $Δ^*$ resonances couple and solve the Faddeev equations for the coupled channel system $K$-$\text{PB}$, $K$-$\text{VB}$, with all interactions being in the s-wave. Despite some strong attraction present in two of the subsystems, we do not find clear evidence supporting the formation of strangeness +1 states, with spin-parity $J^P=1/2^+$, in the energy region $2000-2200$ MeV. However, the case of spin-parity $J^P=3/2^+$ seems more promising, showing the formation of a resonance with a mass around 2167 MeV, with a width of 90-100 MeV. We suggest that a signal of such a state could be found in processes with final states like $KN$, $K^*(892) N$. △ Less

Submitted 5 April, 2024; originally announced April 2024.

arXiv:2404.02079 [pdf, other]

Coherent Control of an Optical Quantum Dot Using Phonons and Photons

Authors: Ryan A DeCrescent, Zixuan Wang, Joseph T Bush, Poolad Imany, Alex Kwiatkowski, Dileep V Reddy, Sae Woo Nam, Richard P Mirin, Kevin L Silverman

Abstract: Genuine quantum-mechanical effects are readily observable in modern optomechanical systems comprising bosonic ("classical") optical resonators. Here we describe unique features and advantages of optical two-level systems, or qubits, for optomechanics. The qubit state can be coherently controlled using both phonons and resonant or detuned photons. We experimentally demonstrate this using charge-con… ▽ More Genuine quantum-mechanical effects are readily observable in modern optomechanical systems comprising bosonic ("classical") optical resonators. Here we describe unique features and advantages of optical two-level systems, or qubits, for optomechanics. The qubit state can be coherently controlled using both phonons and resonant or detuned photons. We experimentally demonstrate this using charge-controlled InAs quantum dots (QDs) in surface-acoustic-wave resonators. Time-correlated single-photon counting measurements reveal the control of QD population dynamics using engineered optical pulses and mechanical motion. As a first example, at moderate acoustic drive strengths, we demonstrate the potential of this technique to maximize fidelity in quantum microwave-to-optical transduction. Specifically, we tailor the scheme so that mechanically assisted photon scattering is enhanced over the direct detuned photon scattering from the QD. Spectral analysis reveals distinct scattering channels related to Rayleigh scattering and luminescence in our pulsed excitation measurements which lead to time-dependent scattering spectra. Quantum-mechanical calculations show good agreement with our experimental results, together providing a comprehensive description of excitation, scattering and emission in a coupled QD-phonon optomechanical system. △ Less

Submitted 16 May, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

Comments: 19 pages, 4 main figures, 7 supplementary figures

arXiv:2404.01878 [pdf, other]

Real, fake and synthetic faces -- does the coin have three sides?

Authors: Shahzeb Naeem, Ramzi Al-Sharawi, Muhammad Riyyan Khan, Usman Tariq, Abhinav Dhall, Hasan Al-Nashash

Abstract: With the ever-growing power of generative artificial intelligence, deepfake and artificially generated (synthetic) media have continued to spread online, which creates various ethical and moral concerns regarding their usage. To tackle this, we thus present a novel exploration of the trends and patterns observed in real, deepfake and synthetic facial images. The proposed analysis is done in two pa… ▽ More With the ever-growing power of generative artificial intelligence, deepfake and artificially generated (synthetic) media have continued to spread online, which creates various ethical and moral concerns regarding their usage. To tackle this, we thus present a novel exploration of the trends and patterns observed in real, deepfake and synthetic facial images. The proposed analysis is done in two parts: firstly, we incorporate eight deep learning models and analyze their performances in distinguishing between the three classes of images. Next, we look to further delve into the similarities and differences between these three sets of images by investigating their image properties both in the context of the entire image as well as in the context of specific regions within the image. ANOVA test was also performed and provided further clarity amongst the patterns associated between the images of the three classes. From our findings, we observe that the investigated deeplearning models found it easier to detect synthetic facial images, with the ViT Patch-16 model performing best on this task with a class-averaged sensitivity, specificity, precision, and accuracy of 97.37%, 98.69%, 97.48%, and 98.25%, respectively. This observation was supported by further analysis of various image properties. We saw noticeable differences across the three category of images. This analysis can help us build better algorithms for facial image generation, and also shows that synthetic, deepfake and real face images are indeed three different classes. △ Less

Submitted 2 April, 2024; originally announced April 2024.

arXiv:2404.01745 [pdf, other]

Unleash the Potential of CLIP for Video Highlight Detection

Authors: Donghoon Han, Seunghyeon Seo, Eunhwan Park, Seong-Uk Nam, Nojun Kwak

Abstract: Multimodal and large language models (LLMs) have revolutionized the utilization of open-world knowledge, unlocking novel potentials across various tasks and applications. Among these domains, the video domain has notably benefited from their capabilities. In this paper, we present Highlight-CLIP (HL-CLIP), a method designed to excel in the video highlight detection task by leveraging the pre-train… ▽ More Multimodal and large language models (LLMs) have revolutionized the utilization of open-world knowledge, unlocking novel potentials across various tasks and applications. Among these domains, the video domain has notably benefited from their capabilities. In this paper, we present Highlight-CLIP (HL-CLIP), a method designed to excel in the video highlight detection task by leveraging the pre-trained knowledge embedded in multimodal models. By simply fine-tuning the multimodal encoder in combination with our innovative saliency pooling technique, we have achieved the state-of-the-art performance in the highlight detection task, the QVHighlight Benchmark, to the best of our knowledge. △ Less

Submitted 2 April, 2024; originally announced April 2024.

arXiv:2404.01438 [pdf]

Generation and Detection of Sign Language Deepfakes -- A Linguistic and Visual Analysis

Authors: Shahzeb Naeem, Muhammad Riyyan Khan, Usman Tariq, Abhinav Dhall, Carlos Ivan Colon, Hasan Al-Nashash

Abstract: A question in the realm of deepfakes is slowly emerging pertaining to whether we can go beyond facial deepfakes and whether it would be beneficial to society. Therefore, this research presents a positive application of deepfake technology in upper body generation, while performing sign-language for the Deaf and Hard of Hearing (DHoH) community. The resulting videos are later vetted with a sign lan… ▽ More A question in the realm of deepfakes is slowly emerging pertaining to whether we can go beyond facial deepfakes and whether it would be beneficial to society. Therefore, this research presents a positive application of deepfake technology in upper body generation, while performing sign-language for the Deaf and Hard of Hearing (DHoH) community. The resulting videos are later vetted with a sign language expert. This is particularly helpful, given the intricate nature of sign language, a scarcity of sign language experts, and potential benefits for health and education. The objectives of this work encompass constructing a reliable deepfake dataset, evaluating its technical and visual credibility through computer vision and natural language processing models, and assessing the plausibility of the generated content. With over 1200 videos, featuring both previously seen and unseen individuals for the generation model, using the help of a sign language expert, we establish a deepfake dataset in sign language that can further be utilized to detect fake videos that may target certain people of determination. △ Less

Submitted 1 April, 2024; originally announced April 2024.

Comments: 13 pages, 13 figures, Computer Vision and Image Understanding Journal

arXiv:2404.00559 [pdf, other]

Hierarchical Climate Control Strategy for Electric Vehicles with Door-Opening Consideration

Authors: Sanghyeon Nam, Hyejin Lee, Youngki Kim, Kyoung hyun Kwak, Kyoungseok Han

Abstract: This study proposes a novel climate control strategy for electric vehicles (EVs) by addressing door-opening interruptions, an overlooked aspect in EV thermal management. We create and validate an EV simulation model that incorporates door-opening scenarios. Three controllers are compared using the simulation model: (i) a hierarchical non-linear model predictive control (NMPC) with a unique coolant… ▽ More This study proposes a novel climate control strategy for electric vehicles (EVs) by addressing door-opening interruptions, an overlooked aspect in EV thermal management. We create and validate an EV simulation model that incorporates door-opening scenarios. Three controllers are compared using the simulation model: (i) a hierarchical non-linear model predictive control (NMPC) with a unique coolant dividing layer and a component for cabin air inflow regulation based on door-opening signals; (ii) a single MPC controller; and (iii) a rule-based controller. The hierarchical controller outperforms, reducing door-opening temperature drops by 46.96% and 51.33% compared to single layer MPC and rule-based methods in the relevant section. Additionally, our strategy minimizes the maximum temperature gaps between the sections during recovery by 86.4% and 78.7%, surpassing single layer MPC and rule-based approaches, respectively. We believe that this result opens up future possibilities for incorporating the thermal comfort of passengers across all sections within the vehicle. △ Less

Submitted 31 March, 2024; originally announced April 2024.

Comments: This paper, intended for presentation at the IEEE Intelligent Vehicles Symposium (IV) 2024, comprises six pages and includes eight figures

arXiv:2403.19739 [pdf, other]

Detecting Light Dark Matter with Kinetic Inductance Detectors

Authors: Jiansong Gao, Yonit Hochberg, Benjamin V. Lehmann, Sae Woo Nam, Paul Szypryt, Michael R. Vissers, Tao Xu

Abstract: Superconducting detectors are a promising technology for probing dark matter at extremely low masses, where dark matter interactions are currently unconstrained. Realizing the potential of such detectors requires new readout technologies to achieve the lowest possible thresholds for deposited energy. Here we perform a prototype search for dark matter--electron interactions with kinetic inductance… ▽ More Superconducting detectors are a promising technology for probing dark matter at extremely low masses, where dark matter interactions are currently unconstrained. Realizing the potential of such detectors requires new readout technologies to achieve the lowest possible thresholds for deposited energy. Here we perform a prototype search for dark matter--electron interactions with kinetic inductance detectors (KIDs), a class of superconducting detector originally designed for infrared astronomy applications. We demonstrate that existing KIDs can achieve effective thresholds as low as 0.2 eV, and we use existing data to set new dark matter constraints. The relative maturity of the technology underlying KIDs means that this platform can be scaled significantly with existing tools, enabling powerful new searches in the coming years. △ Less

Submitted 28 March, 2024; originally announced March 2024.

Comments: 6+6 pages, 4+3 figures

Report number: MIT-CTP/5654

arXiv:2403.19254 [pdf, other]

Imperceptible Protection against Style Imitation from Diffusion Models

Authors: Namhyuk Ahn, Wonhyuk Ahn, KiYoon Yoo, Daesik Kim, Seung-Hun Nam

Abstract: Recent progress in diffusion models has profoundly enhanced the fidelity of image generation. However, this has raised concerns about copyright infringements. While prior methods have introduced adversarial perturbations to prevent style imitation, most are accompanied by the degradation of artworks' visual quality. Recognizing the importance of maintaining this, we develop a visually improved pro… ▽ More Recent progress in diffusion models has profoundly enhanced the fidelity of image generation. However, this has raised concerns about copyright infringements. While prior methods have introduced adversarial perturbations to prevent style imitation, most are accompanied by the degradation of artworks' visual quality. Recognizing the importance of maintaining this, we develop a visually improved protection method that preserves its protection capability. To this end, we create a perceptual map to identify areas most sensitive to human eyes. We then adjust the protection intensity guided by an instance-aware refinement. We also integrate a perceptual constraints bank to further improve the imperceptibility. Results show that our method substantially elevates the quality of the protected image without compromising on protection efficacy. △ Less

Submitted 28 March, 2024; originally announced March 2024.

arXiv:2403.14264 [pdf, other]

A Framework for Portrait Stylization with Skin-Tone Awareness and Nudity Identification

Authors: Seungkwon Kim, Sangyeon Kim, Seung-Hun Nam

Abstract: Portrait stylization is a challenging task involving the transformation of an input portrait image into a specific style while preserving its inherent characteristics. The recent introduction of Stable Diffusion (SD) has significantly improved the quality of outcomes in this field. However, a practical stylization framework that can effectively filter harmful input content and preserve the distinc… ▽ More Portrait stylization is a challenging task involving the transformation of an input portrait image into a specific style while preserving its inherent characteristics. The recent introduction of Stable Diffusion (SD) has significantly improved the quality of outcomes in this field. However, a practical stylization framework that can effectively filter harmful input content and preserve the distinct characteristics of an input, such as skin-tone, while maintaining the quality of stylization remains lacking. These challenges have hindered the wide deployment of such a framework. To address these issues, this study proposes a portrait stylization framework that incorporates a nudity content identification module (NCIM) and a skin-tone-aware portrait stylization module (STAPSM). In experiments, NCIM showed good performance in enhancing explicit content filtering, and STAPSM accurately represented a diverse range of skin tones. Our proposed framework has been successfully deployed in practice, and it has effectively satisfied critical requirements of real-world applications. △ Less

Submitted 21 March, 2024; originally announced March 2024.

Comments: Accepted to ICASSP 2024

arXiv:2403.01191 [pdf, other]

Strangeness plus-one ($S=+1$) resonance-state $P^{+*}_0$ via $K^+n\to K^{*0}p$

Authors: Dayoung Lee, Seung-il Nam

Abstract: In our current study, we delve into the peak-like structure observed during the reaction process of $K^+n\to K^{0}p$ at approximately $\sqrt{s}\sim2.5$ GeV. Our focus centers on exploring the potential $S=+1$ resonance $P^{+*}_0\equiv P^*_0$ as an excited state within the extended vector-meson and baryon ($VB$) antidecuplet. To achieve this aim, we employ the effective Lagrangian method in conjunc… ▽ More In our current study, we delve into the peak-like structure observed during the reaction process of $K^+n\to K^{0}p$ at approximately $\sqrt{s}\sim2.5$ GeV. Our focus centers on exploring the potential $S=+1$ resonance $P^{+*}_0\equiv P^*_0$ as an excited state within the extended vector-meson and baryon ($VB$) antidecuplet. To achieve this aim, we employ the effective Lagrangian method in conjunction with the $(u,t)$-channel Regge approach, operating within the tree-level Born approximation. We thoroughly examine various spin-parity quantum numbers for the resonance, resulting in a compelling description of the data, where $M_{P^*_0}\approx2.5$ GeV and $Γ_{P^*_0}\approx100$ MeV. Furthermore, we propose an experimental technique to amplify the signal-to-noise ratio ($S/N$) for accurately measuring the resonance. Notably, our findings reveal that background interference diminishes significantly within the $K^*$ forward-scattering region in the center-of-mass frame when the $K^*$ is perpendicularly polarized to the reaction plane. Additionally, we explore the recoil-proton spin asymmetry to definitively determine the spin and parity of the resonance. This study stands to serve as a valuable reference for designing experimental setups aimed at investigating and comprehending exotic phenomena in QCD. Specifically, our insights will inform future J-PARC experiments, particularly those employing higher kaon beam energies. △ Less

Submitted 2 March, 2024; originally announced March 2024.

Comments: 9 pages, 5 figures

Report number: PKNU-NuHaTh-2024

arXiv:2402.14196 [pdf, other]

Mip-Grid: Anti-aliased Grid Representations for Neural Radiance Fields

Authors: Seungtae Nam, Daniel Rho, Jong Hwan Ko, Eunbyung Park

Abstract: Despite the remarkable achievements of neural radiance fields (NeRF) in representing 3D scenes and generating novel view images, the aliasing issue, rendering "jaggies" or "blurry" images at varying camera distances, remains unresolved in most existing approaches. The recently proposed mip-NeRF has addressed this challenge by rendering conical frustums instead of rays. However, it relies on MLP ar… ▽ More Despite the remarkable achievements of neural radiance fields (NeRF) in representing 3D scenes and generating novel view images, the aliasing issue, rendering "jaggies" or "blurry" images at varying camera distances, remains unresolved in most existing approaches. The recently proposed mip-NeRF has addressed this challenge by rendering conical frustums instead of rays. However, it relies on MLP architecture to represent the radiance fields, missing out on the fast training speed offered by the latest grid-based methods. In this work, we present mip-Grid, a novel approach that integrates anti-aliasing techniques into grid-based representations for radiance fields, mitigating the aliasing artifacts while enjoying fast training time. The proposed method generates multi-scale grids by applying simple convolution operations over a shared grid representation and uses the scale-aware coordinate to retrieve features at different scales from the generated multi-scale grids. To test the effectiveness, we integrated the proposed method into the two recent representative grid-based methods, TensoRF and K-Planes. Experimental results demonstrate that mip-Grid greatly improves the rendering performance of both methods and even outperforms mip-NeRF on multi-scale datasets while achieving significantly faster training time. For code and demo videos, please see https://stnamjef.github.io/mipgrid.github.io/. △ Less

Submitted 21 February, 2024; originally announced February 2024.

Comments: Accepted to NeurIPS 2023

arXiv:2402.13526 [pdf, other]

Pion-nucleus elastic scatterings incorporating medium effects within the Eikonal-Glauber mode

Authors: Hyeon-dong Han, Parada T. P. Hutauruk, Seung-il Nam

Abstract: In this present investigation, we explore the elastic scattering of pions with nuclei ($π$-$A$), primarily influenced by the $Δ$(1232) resonance, within the Eikonal-Glauber model. The medium effects are incorporated by considering nuclear-density ($ρ_A$) dependent masses of baryons and strong coupling constants. These dependencies are computed and parameterized up to $\mathcal{O}(ρ_A^2)$ based on… ▽ More In this present investigation, we explore the elastic scattering of pions with nuclei ($π$-$A$), primarily influenced by the $Δ$(1232) resonance, within the Eikonal-Glauber model. The medium effects are incorporated by considering nuclear-density ($ρ_A$) dependent masses of baryons and strong coupling constants. These dependencies are computed and parameterized up to $\mathcal{O}(ρ_A^2)$ based on the quark-meson coupling (QMC) model. The Wood-Saxon type density profile is utilized for the bound nucleons within finite nuclei. The element $π^+$-$N$ scattering cross section for the Glauber approach is determined using the conventional effective Lagrangian method. Subsequently, we analyze the total cross sections for elastic scattering with $^4$He and $^{12}$C targets. Our numerical results demonstrate a favorable agreement with JINR data for the $^4$He target, accurately reproducing the total cross-section. However, when considering the $^{12}$C target, deviations of approximately $\lesssim10\%$. We also consider the multiple-scattering effects inside the nucleus approximately, using the single-channel meson-baryon Bethe-Salpeter equation, resulting in the effective width broadening of the $Δ$ resonance to reproduce the data better. △ Less

Submitted 20 February, 2024; originally announced February 2024.

Comments: 8 pages, 4 figures

Report number: PKNU-NuHaTh-2024

arXiv:2402.11597 [pdf, other]

Multi-Task Inference: Can Large Language Models Follow Multiple Instructions at Once?

Authors: Guijin Son, Sangwon Baek, Sangdae Nam, Ilgyun Jeong, Seungone Kim

Abstract: Large language models (LLMs) are typically prompted to follow a single instruction per inference call. In this work, we analyze whether LLMs also hold the capability to handle multiple instructions simultaneously, denoted as Multi-Task Inference. For this purpose, we introduce the MTI Bench(Multi-Task Inference Benchmark), a comprehensive evaluation benchmark encompassing 5,000 instances across 25… ▽ More Large language models (LLMs) are typically prompted to follow a single instruction per inference call. In this work, we analyze whether LLMs also hold the capability to handle multiple instructions simultaneously, denoted as Multi-Task Inference. For this purpose, we introduce the MTI Bench(Multi-Task Inference Benchmark), a comprehensive evaluation benchmark encompassing 5,000 instances across 25 tasks. Each task in the MTI Bench involves 2 to 3 sub-tasks. As expected, we first demonstrate that Multi-Task Inference reduces the total inference time by 1.46 times in average since it does not require multiple inference calls. Interestingly, contrary to the expectation that LLMs would perform better when tasks are divided, we find that state-of-the-art LLMs, such as Llama-2-Chat-70B and GPT-4, show up to 7.3% and 12.4% improved performance with Multi-Task Inference compared to Single-Task Inference on the MTI Bench. We release the MTI Bench dataset and our code at this link https://github.com/guijinSON/MTI-Bench. △ Less

Submitted 6 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

Comments: acl 2024 (main)

arXiv:2402.09289 [pdf, other]

Study of quasi-projectile properties at Fermi energies in 48Ca projectile systems

Authors: S. Upadhyaya, K. Mazurek, T. Kozik, D. Gruyer, G. Casini, S. Piantelli, L. Baldesi, S. Barlini, B. Borderie, R. Bougault, A. Camaiani, C. Ciampi, M. Cicerchia, M. Ciemala, D. Dell Aquila, J. A. Duenas, Q. Fable, J. D. Frankland, F. Gramegna, M. Henri, B. Hong, A. Kordyasz, M. J. Kweon, N. Le Neindre, I. Lombardo , et al. (10 additional authors not shown)

Abstract: The emission of the pre-equilibrium particles during nuclear collisions at moderate beam energies is still an open question. This influences the properties of the compound nucleus but also changes the interpretation of the quasi-fission process. A systematic analysis of the data obtained by the FAZIA collaboration during a recent experiment with a neutron rich projectile is presented. The full ran… ▽ More The emission of the pre-equilibrium particles during nuclear collisions at moderate beam energies is still an open question. This influences the properties of the compound nucleus but also changes the interpretation of the quasi-fission process. A systematic analysis of the data obtained by the FAZIA collaboration during a recent experiment with a neutron rich projectile is presented. The full range of charged particles detected in the experiment is within the limit of isotopic resolution of the FAZIA detector. Quasi-projectile (QP) fragments were detected in majority thanks to the forward angular acceptance of the experimental setup which was confirmed by introducing cuts based on the HIPSE event generator calculations. The main goal was to compare the experimental results with the HIPSE simulations after introducing these cuts to investigate the influence of the n-rich entrance channel on the QP fragment properties. More specifically, the lowering of N/Z of QP fragments with beam energy was found to be present since the initial phase of the reaction. Thus, pre-equilibrium emissions might be a possible candidate to explain such an effect. △ Less

Submitted 14 February, 2024; originally announced February 2024.

Comments: 10 pages, 10 figures

arXiv:2402.07392 [pdf, other]

Study on the $φ$-meson photoproduction off the proton target with the pentaquark-like $K^*Σ$ bound state $P_s$

Authors: Sang in Shim, Yongsun Kim, Seung-il Nam

Abstract: We utilize the effective Lagrangian method within the tree-level Born approximation to explore $φ$-meson photoproduction, i.e., $γp \to φp$. Our analysis encompasses contributions from various sources, including the Pomeron, $f_1$-Regge, pseudoscalar particles ($π$, $η$), scalar particles ($a_0$, $f_0$), protons, and three-nucleon resonance states. In addition, we consider a possible pentaquark-li… ▽ More We utilize the effective Lagrangian method within the tree-level Born approximation to explore $φ$-meson photoproduction, i.e., $γp \to φp$. Our analysis encompasses contributions from various sources, including the Pomeron, $f_1$-Regge, pseudoscalar particles ($π$, $η$), scalar particles ($a_0$, $f_0$), protons, and three-nucleon resonance states. In addition, we consider a possible pentaquark-like $K^* Σ$-bound state $P_s$. The findings indicate that, apart from the region near the threshold, contributions other than the Pomeron generally have a limited impact on the total cross section. However, at specific angles, alternative contributions become crucial, particularly at smaller values of ${\rm cos}\,θ$. The incorporation of $P_s$ and other nucleon resonances proves essential to elucidate the bump observed near $W \sim 2.15$ GeV at very forward angles and behaviors within the range of $W=(2.0-2.3)$ GeV. Furthermore, in the region with $W\ge 2.5$ GeV, where nucleon resonances become negligible, contributions from the t-channel mesons become pivotal. Our calculations for spin density matrix components, examined in various frames, exhibit improvement when considering all contributions. This comprehensive approach successfully reproduces the observed bump by including $P_s$. We also briefly estimate the $P_s$ production via $φ$-meson photoproduction in the future Electron-Ion Collider (EIC), resulting in the luminosity of 10 fb$^{-1}$ per month. △ Less

Submitted 11 February, 2024; originally announced February 2024.

Comments: 12 pages, 6 figures

Report number: PKNU-NuHaTh-2024

arXiv:2402.00863 [pdf, other]

Geometry Transfer for Stylizing Radiance Fields

Authors: Hyunyoung Jung, Seonghyeon Nam, Nikolaos Sarafianos, Sungjoo Yoo, Alexander Sorkine-Hornung, Rakesh Ranjan

Abstract: Shape and geometric patterns are essential in defining stylistic identity. However, current 3D style transfer methods predominantly focus on transferring colors and textures, often overlooking geometric aspects. In this paper, we introduce Geometry Transfer, a novel method that leverages geometric deformation for 3D style transfer. This technique employs depth maps to extract a style guide, subseq… ▽ More Shape and geometric patterns are essential in defining stylistic identity. However, current 3D style transfer methods predominantly focus on transferring colors and textures, often overlooking geometric aspects. In this paper, we introduce Geometry Transfer, a novel method that leverages geometric deformation for 3D style transfer. This technique employs depth maps to extract a style guide, subsequently applied to stylize the geometry of radiance fields. Moreover, we propose new techniques that utilize geometric cues from the 3D scene, thereby enhancing aesthetic expressiveness and more accurately reflecting intended styles. Our extensive experiments show that Geometry Transfer enables a broader and more expressive range of stylizations, thereby significantly expanding the scope of 3D style transfer. △ Less

Submitted 6 April, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

Comments: CVPR 2024. Project page: https://hyblue.github.io/geo-srf/

arXiv:2401.15313 [pdf, other]

Multi-Robot Relative Pose Estimation in SE(2) with Observability Analysis: A Comparison of Extended Kalman Filtering and Robust Pose Graph Optimization

Authors: Kihoon Shin, Hyunjae Sim, Seungwon Nam, Yonghee Kim, Jae Hu, Kwang-Ki K. Kim

Abstract: In this study, we address multi-robot localization issues, with a specific focus on cooperative localization and observability analysis of relative pose estimation. Cooperative localization involves enhancing each robot's information through a communication network and message passing. If odometry data from a target robot can be transmitted to the ego robot, observability of their relative pose es… ▽ More In this study, we address multi-robot localization issues, with a specific focus on cooperative localization and observability analysis of relative pose estimation. Cooperative localization involves enhancing each robot's information through a communication network and message passing. If odometry data from a target robot can be transmitted to the ego robot, observability of their relative pose estimation can be achieved through range-only or bearing-only measurements, provided both robots have non-zero linear velocities. In cases where odometry data from a target robot are not directly transmitted but estimated by the ego robot, both range and bearing measurements are necessary to ensure observability of relative pose estimation. For ROS/Gazebo simulations, we explore four sensing and communication structures. We compare extended Kalman filtering (EKF) and pose graph optimization (PGO) estimation using different robust loss functions (filtering and smoothing with varying batch sizes of sliding windows) in terms of estimation accuracy. In hardware experiments, two Turtlebot3 equipped with UWB modules are used for real-world inter-robot relative pose estimation, applying both EKF and PGO and comparing their performance. △ Less

Submitted 4 February, 2024; v1 submitted 27 January, 2024; originally announced January 2024.

Comments: 20 pages, 21 figures

MSC Class: 93C85; 93E11; 93E24; 90C26; 93E10; 62M20;

arXiv:2401.03079 [pdf, other]

Integrating Open-World Shared Control in Immersive Avatars

Authors: Patrick Naughton, James Seungbum Nam, Andrew Stratton, Kris Hauser

Abstract: Teleoperated avatar robots allow people to transport their manipulation skills to environments that may be difficult or dangerous to work in. Current systems are able to give operators direct control of many components of the robot to immerse them in the remote environment, but operators still struggle to complete tasks as competently as they could in person. We present a framework for incorporati… ▽ More Teleoperated avatar robots allow people to transport their manipulation skills to environments that may be difficult or dangerous to work in. Current systems are able to give operators direct control of many components of the robot to immerse them in the remote environment, but operators still struggle to complete tasks as competently as they could in person. We present a framework for incorporating open-world shared control into avatar robots to combine the benefits of direct and shared control. This framework preserves the fluency of our avatar interface by minimizing obstructions to the operator's view and using the same interface for direct, shared, and fully autonomous control. In a human subjects study (N=19), we find that operators using this framework complete a range of tasks significantly more quickly and reliably than those that do not. △ Less

Submitted 10 July, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

arXiv:2312.10215 [pdf, other]

Gated InAs quantum dots embedded in surface acoustic wave cavities for low-noise optomechanics

Authors: Zixuan Wang, Ryan A. DeCrescent, Poolad Imany, Joseph T. Bush, Dileep V. Reddy, Sae Woo Nam, Richard P. Mirin, Kevin L. Silverman

Abstract: Self-assembled InAs quantum dots (QDs) are promising optomechanical elements due to their excellent photonic properties and sensitivity to local strain fields. Microwave-frequency modulation of photons scattered from these efficient quantum emitters has been recently demonstrated using surface acoustic wave (SAW) cavities. However, for optimal performance, a gate structure is required to determini… ▽ More Self-assembled InAs quantum dots (QDs) are promising optomechanical elements due to their excellent photonic properties and sensitivity to local strain fields. Microwave-frequency modulation of photons scattered from these efficient quantum emitters has been recently demonstrated using surface acoustic wave (SAW) cavities. However, for optimal performance, a gate structure is required to deterministically control the charge state and reduce charge noise of the QDs. Here, we integrate gated QDs and SAW cavities using molecular beam epitaxy and nanofabrication. We demonstrate that with careful design of the substrate layer structure, integration of the two systems can be accomplished while retaining the optimal performance of each subsystem. These results mark a critical step toward efficient and low-noise optomechanical systems for microwave-to-optical quantum transduction. △ Less

Submitted 15 December, 2023; originally announced December 2023.

arXiv:2312.01763 [pdf, other]

doi 10.1103/PhysRevC.109.064605

Isospin diffusion from $^{40,48}$Ca$+^{40,48}$Ca experimental data at Fermi energies: Direct comparisons with transport model calculations

Authors: Q. Fable, L. Baldesi, S. Barlini, Eric Bonnet, Bernard Borderie, Remi Bougault, A. Camaiani, G. Casini, A. Chbihi, Caterina Ciampi, J. A. Dueñas, J. D. Frankland, T. Genard, Diego D. Gruyer, Maxime Henri, Byungsik Hong, S. Kim, A. J. Kordyasz, T. Kozik, Arnaud Le Fèvre, Nicolas Le Neindre, Ivano Lombardo, Olivier Lopez, T. Marchi, Paola Marini , et al. (8 additional authors not shown)

Abstract: This article presents an investigation of isospin equilibration in cross-bombarding $^{40,48}$Ca$+^{40,48}$Ca reactions at 35 MeV/nucleon, by comparing experimental data with filtered transport model calculations. Isospin diffusion is studied using the evolution of the isospin transport ratio with centrality. The asymmetry parameter $δ=(N-Z)/A$ of the quasiprojectile (QP) residue is used as isospi… ▽ More This article presents an investigation of isospin equilibration in cross-bombarding $^{40,48}$Ca$+^{40,48}$Ca reactions at 35 MeV/nucleon, by comparing experimental data with filtered transport model calculations. Isospin diffusion is studied using the evolution of the isospin transport ratio with centrality. The asymmetry parameter $δ=(N-Z)/A$ of the quasiprojectile (QP) residue is used as isospin-sensitive observable, while a recent method for impact parameter reconstruction is used for centrality sorting. A benchmark of global observables is proposed to assess the relevance of the antisymmetrized molecular dynamics (AMD) model, coupled to GEMINI++, in the study of dissipative collisions. Our results demonstrate the importance of considering cluster formation to reproduce observables used for isospin transport and centrality studies. Within the AMD model, we prove the applicability of the impact parameter reconstruction method, enabling a direct comparison to the experimental data for the investigation of isospin diffusion. For both, we evidence a tendency to isospin equilibration with an impact parameter decreasing from 9 to 3 fm, while the full equilibration is not reached. A weak sensitivity to the stiffness of the equation of state employed in the model is also observed, with a better reproduction of the experimental trend for the neutron-rich reactions. △ Less

Submitted 6 June, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

Journal ref: Physical Review C, 109 (064605)

arXiv:2311.14993 [pdf, other]

Coordinate-Aware Modulation for Neural Fields

Authors: Joo Chan Lee, Daniel Rho, Seungtae Nam, Jong Hwan Ko, Eunbyung Park

Abstract: Neural fields, mapping low-dimensional input coordinates to corresponding signals, have shown promising results in representing various signals. Numerous methodologies have been proposed, and techniques employing MLPs and grid representations have achieved substantial success. MLPs allow compact and high expressibility, yet often suffer from spectral bias and slow convergence speed. On the other h… ▽ More Neural fields, mapping low-dimensional input coordinates to corresponding signals, have shown promising results in representing various signals. Numerous methodologies have been proposed, and techniques employing MLPs and grid representations have achieved substantial success. MLPs allow compact and high expressibility, yet often suffer from spectral bias and slow convergence speed. On the other hand, methods using grids are free from spectral bias and achieve fast training speed, however, at the expense of high spatial complexity. In this work, we propose a novel way for exploiting both MLPs and grid representations in neural fields. Unlike the prevalent methods that combine them sequentially (extract features from the grids first and feed them to the MLP), we inject spectral bias-free grid representations into the intermediate features in the MLP. More specifically, we suggest a Coordinate-Aware Modulation (CAM), which modulates the intermediate features using scale and shift parameters extracted from the grid representations. This can maintain the strengths of MLPs while mitigating any remaining potential biases, facilitating the rapid learning of high-frequency components. In addition, we empirically found that the feature normalizations, which have not been successful in neural filed literature, proved to be effective when applied in conjunction with the proposed CAM. Experimental results demonstrate that CAM enhances the performance of neural representation and improves learning stability across a range of signals. Especially in the novel view synthesis task, we achieved state-of-the-art performance with the least number of parameters and fast training speed for dynamic scenes and the best performance under 1MB memory for static scenes. CAM also outperforms the best-performing video compression methods using neural fields by a large margin. △ Less

Submitted 25 November, 2023; originally announced November 2023.

Comments: Project page: http://maincold2.github.io/cam/

arXiv:2311.05881 [pdf, other]

Programmable Superconducting Optoelectronic Single-Photon Synapses with Integrated Multi-State Memory

Authors: Bryce A. Primavera, Saeed Khan, Richard P. Mirin, Sae Woo Nam, Jeffrey M. Shainline

Abstract: The co-location of memory and processing is a core principle of neuromorphic computing. A local memory device for synaptic weight storage has long been recognized as an enabling element for large-scale, high-performance neuromorphic hardware. In this work, we demonstrate programmable superconducting synapses with integrated memories for use in superconducting optoelectronic neural systems. Superco… ▽ More The co-location of memory and processing is a core principle of neuromorphic computing. A local memory device for synaptic weight storage has long been recognized as an enabling element for large-scale, high-performance neuromorphic hardware. In this work, we demonstrate programmable superconducting synapses with integrated memories for use in superconducting optoelectronic neural systems. Superconducting nanowire single-photon detectors and Josephson junctions are combined into programmable synaptic circuits that exhibit single-photon sensitivity, memory cells with more than 400 internal states, leaky integration of input spike events, and 0.4 fJ programming energies (including cooling power). These results are attractive for implementing a variety of supervised and unsupervised learning algorithms and lay the foundation for a new hardware platform optimized for large-scale spiking network accelerators. △ Less

Submitted 10 November, 2023; originally announced November 2023.

Comments: 16 pages, 11 figures

arXiv:2311.00994 [pdf, other]

LaughTalk: Expressive 3D Talking Head Generation with Laughter

Authors: Kim Sung-Bin, Lee Hyun, Da Hye Hong, Suekyeong Nam, Janghoon Ju, Tae-Hyun Oh

Abstract: Laughter is a unique expression, essential to affirmative social interactions of humans. Although current 3D talking head generation methods produce convincing verbal articulations, they often fail to capture the vitality and subtleties of laughter and smiles despite their importance in social context. In this paper, we introduce a novel task to generate 3D talking heads capable of both articulate… ▽ More Laughter is a unique expression, essential to affirmative social interactions of humans. Although current 3D talking head generation methods produce convincing verbal articulations, they often fail to capture the vitality and subtleties of laughter and smiles despite their importance in social context. In this paper, we introduce a novel task to generate 3D talking heads capable of both articulate speech and authentic laughter. Our newly curated dataset comprises 2D laughing videos paired with pseudo-annotated and human-validated 3D FLAME parameters and vertices. Given our proposed dataset, we present a strong baseline with a two-stage training scheme: the model first learns to talk and then acquires the ability to express laughter. Extensive experiments demonstrate that our method performs favorably compared to existing approaches in both talking head generation and expressing laughter signals. We further explore potential applications on top of our proposed method for rigging realistic avatars. △ Less

Submitted 2 November, 2023; originally announced November 2023.

Comments: Accepted to WACV2024

arXiv:2310.13107 [pdf, other]

Monolithic Integration of Superconducting-Nanowire Single-Photon Detectors with Josephson Junctions for Scalable Single-photon Sensing

Authors: Saeed Khan, Bryce A. Primavera, Richard P. Mirin, Sae Woo Nam, Jeffrey M. Shainline

Abstract: We demonstrate superconducting single-photon detectors that integrate signals locally at each pixel. This capability is realized by the monolithic integration of superconducting-nanowire single-photon detectors with Josephson electronics. The motivation is to realize superconducting sensor elements with integrating capabilities similar to their CMOS-sensor counterparts. The pixels can operate in s… ▽ More We demonstrate superconducting single-photon detectors that integrate signals locally at each pixel. This capability is realized by the monolithic integration of superconducting-nanowire single-photon detectors with Josephson electronics. The motivation is to realize superconducting sensor elements with integrating capabilities similar to their CMOS-sensor counterparts. The pixels can operate in several modes. First, we demonstrate that photons can be counted individually, with each detection event adding an identical amount of supercurrent to an integrating element. Second, we demonstrate an active gain control option, in which the signal added per detection event can be dynamically adjusted to account for variable light conditions. Additionally, the pixels can either retain signal indefinitely to record all counts incurred over an integration period, or the pixels can record a fading signal of detection events within a decay time constant. We describe additional semiconductor readout circuitry that will be used in future work to realize scalable, large-format sensor arrays of superconducting single photon detectors compatible with CMOS array readout architectures. △ Less

Submitted 19 October, 2023; originally announced October 2023.

arXiv:2310.11005 [pdf, ps, other]

Optimal Private Discrete Distribution Estimation with One-bit Communication

Authors: Seung-Hyun Nam, Vincent Y. F. Tan, Si-Hyeon Lee

Abstract: We consider a private discrete distribution estimation problem with one-bit communication constraint. The privacy constraints are imposed with respect to the local differential privacy and the maximal leakage. The estimation error is quantified by the worst-case mean squared error. We completely characterize the first-order asymptotics of this privacy-utility trade-off under the one-bit communicat… ▽ More We consider a private discrete distribution estimation problem with one-bit communication constraint. The privacy constraints are imposed with respect to the local differential privacy and the maximal leakage. The estimation error is quantified by the worst-case mean squared error. We completely characterize the first-order asymptotics of this privacy-utility trade-off under the one-bit communication constraint for both types of privacy constraints by using ideas from local asymptotic normality and the resolution of a block design mechanism. These results demonstrate the optimal dependence of the privacy-utility trade-off under the one-bit communication constraint in terms of the parameters of the privacy constraint and the size of the alphabet of the discrete distribution. △ Less

Submitted 17 October, 2023; originally announced October 2023.

Comments: 13 pages, 5 figures, and 1 page of supplementary material

arXiv:2310.03205 [pdf, other]

A Large-Scale 3D Face Mesh Video Dataset via Neural Re-parameterized Optimization

Authors: Kim Youwang, Lee Hyun, Kim Sung-Bin, Suekyeong Nam, Janghoon Ju, Tae-Hyun Oh

Abstract: We propose NeuFace, a 3D face mesh pseudo annotation method on videos via neural re-parameterized optimization. Despite the huge progress in 3D face reconstruction methods, generating reliable 3D face labels for in-the-wild dynamic videos remains challenging. Using NeuFace optimization, we annotate the per-view/-frame accurate and consistent face meshes on large-scale face videos, called the NeuFa… ▽ More We propose NeuFace, a 3D face mesh pseudo annotation method on videos via neural re-parameterized optimization. Despite the huge progress in 3D face reconstruction methods, generating reliable 3D face labels for in-the-wild dynamic videos remains challenging. Using NeuFace optimization, we annotate the per-view/-frame accurate and consistent face meshes on large-scale face videos, called the NeuFace-dataset. We investigate how neural re-parameterization helps to reconstruct image-aligned facial details on 3D meshes via gradient analysis. By exploiting the naturalness and diversity of 3D faces in our dataset, we demonstrate the usefulness of our dataset for 3D face-related tasks: improving the reconstruction accuracy of an existing 3D face reconstruction model and learning 3D facial motion prior. Code and datasets will be available at https://neuface-dataset.github.io. △ Less

Submitted 6 October, 2023; v1 submitted 4 October, 2023; originally announced October 2023.

Comments: 9 pages, 7 figures, and 3 tables for the main paper. 8 pages, 6 figures and 3 tables for the appendix

arXiv:2310.01625 [pdf, other]

Monolithic Polarizing Circular Dielectric Gratings on Bulk Substrates for Improved Photon Collection from InAs Quantum Dots

Authors: Ryan A. DeCrescent, Zixuan Wang, Poolad Imany, Sae Woo Nam, Richard P. Mirin, Kevin L. Silverman

Abstract: III-V semiconductor quantum dots (QDs) are near-ideal and versatile single-photon sources. Because of the capacity for monolithic integration with photonic structures as well as optoelectronic and optomechanical systems, they are proving useful in an increasingly broad application space. Here, we develop monolithic circular dielectric gratings on bulk substrates -- as opposed to suspended or wafer… ▽ More III-V semiconductor quantum dots (QDs) are near-ideal and versatile single-photon sources. Because of the capacity for monolithic integration with photonic structures as well as optoelectronic and optomechanical systems, they are proving useful in an increasingly broad application space. Here, we develop monolithic circular dielectric gratings on bulk substrates -- as opposed to suspended or wafer-bonded substrates -- for greatly improved photon collection from InAs quantum dots. The structures utilize a unique two-tiered distributed Bragg reflector (DBR) structure for vertical electric field confinement over a broad angular range. Opposing ``openings" in the cavities induce strongly polarized QD luminescence without harming collection efficiencies. We describe how measured enhancements depend critically on the choice of collection optics. This is important to consider when evaluating the performance of any photonic structure that concentrates farfield emission intensity. Our cavity designs are useful for integrating QDs with other quantum systems that require bulk substrates, such as surface acoustic wave phonons. △ Less

Submitted 6 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

arXiv:2309.16890 [pdf, other]

doi 10.1063/5.0178931

A 64-pixel mid-infrared single-photon imager based on superconducting nanowire detectors

Authors: Benedikt Hampel, Richard P. Mirin, Sae Woo Nam, Varun B. Verma

Abstract: A large-format mid-infrared single-photon imager with very low dark count rates would enable a broad range of applications in fields like astronomy and chemistry. Superconducting nanowire single-photon detectors (SNSPDs) are a mature photon-counting technology as demonstrated by their figures of merit. However, scaling SNSPDs to large array sizes for mid-infrared applications requires sophisticate… ▽ More A large-format mid-infrared single-photon imager with very low dark count rates would enable a broad range of applications in fields like astronomy and chemistry. Superconducting nanowire single-photon detectors (SNSPDs) are a mature photon-counting technology as demonstrated by their figures of merit. However, scaling SNSPDs to large array sizes for mid-infrared applications requires sophisticated readout architectures in addition to superconducting materials development. In this work, an SNSPD array design that combines a thermally coupled row-column multiplexing architecture with a thermally coupled time-of-flight transmission line was developed for mid-infrared applications. The design requires only six cables and can be scaled to larger array sizes. The demonstration of a 64-pixel array shows promising results for wavelengths between $\mathrm{3.4\,μm}$ and $\mathrm{10\,μm}$, which will enable the use of this single-photon detector technology for a broad range of new applications. △ Less

Submitted 28 September, 2023; originally announced September 2023.

Comments: 7 pages, 3 figures, 1 page supplementary material. The following article has been submitted to Applied Physics Letters

Journal ref: Appl. Phys. Lett. 124, 042602 (2024)

arXiv:2309.14668 [pdf]

Depolarized Holography with Polarization-multiplexing Metasurface

Authors: Seung-Woo Nam, Youngjin Kim, Dongyeon Kim, Yoonchan Jeong

Abstract: The evolution of computer-generated holography (CGH) algorithms has prompted significant improvements in the performances of holographic displays. Nonetheless, they start to encounter a limited degree of freedom in CGH optimization and physical constraints stemming from the coherent nature of holograms. To surpass the physical limitations, we consider polarization as a new degree of freedom by uti… ▽ More The evolution of computer-generated holography (CGH) algorithms has prompted significant improvements in the performances of holographic displays. Nonetheless, they start to encounter a limited degree of freedom in CGH optimization and physical constraints stemming from the coherent nature of holograms. To surpass the physical limitations, we consider polarization as a new degree of freedom by utilizing a novel optical platform called metasurface. Polarization-multiplexing metasurfaces enable incoherent-like behavior in holographic displays due to the mutual incoherence of orthogonal polarization states. We leverage this unique characteristic of a metasurface by integrating it into a holographic display and exploiting polarization diversity to bring an additional degree of freedom for CGH algorithms. To minimize the speckle noise while maximizing the image quality, we devise a fully differentiable optimization pipeline by taking into account the metasurface proxy model, thereby jointly optimizing spatial light modulator phase patterns and geometric parameters of metasurface nanostructures. We evaluate the metasurface-enabled depolarized holography through simulations and experiments, demonstrating its ability to reduce speckle noise and enhance image quality. △ Less

Submitted 26 September, 2023; originally announced September 2023.

Comments: 15 pages, 13 figures, to be published in SIGGRAPH Asia 2023

arXiv:2309.07029 [pdf, ps, other]

Local Calabi-Yau 3-folds for some rank 2 shrinkable surfaces

Authors: Sungwoo Nam

Abstract: Motivated by 5d rank 2 SCFTs, we construct a smooth, non-compact Calabi-Yau 3-fold $X$ containing a rank 2 shrinkable surface $S=S_1\cup S_2$ glued over a smooth curve. This construction will be a generalization of the construction of a local surface for a smooth surface $S$ Motivated by 5d rank 2 SCFTs, we construct a smooth, non-compact Calabi-Yau 3-fold $X$ containing a rank 2 shrinkable surface $S=S_1\cup S_2$ glued over a smooth curve. This construction will be a generalization of the construction of a local surface for a smooth surface $S$ △ Less

Submitted 13 September, 2023; originally announced September 2023.

Comments: 16 pages, comments welcome

arXiv:2309.06933 [pdf, other]

DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models

Authors: Namhyuk Ahn, Junsoo Lee, Chunggi Lee, Kunhee Kim, Daesik Kim, Seung-Hun Nam, Kibeom Hong

Abstract: Recent progresses in large-scale text-to-image models have yielded remarkable accomplishments, finding various applications in art domain. However, expressing unique characteristics of an artwork (e.g. brushwork, colortone, or composition) with text prompts alone may encounter limitations due to the inherent constraints of verbal description. To this end, we introduce DreamStyler, a novel framewor… ▽ More Recent progresses in large-scale text-to-image models have yielded remarkable accomplishments, finding various applications in art domain. However, expressing unique characteristics of an artwork (e.g. brushwork, colortone, or composition) with text prompts alone may encounter limitations due to the inherent constraints of verbal description. To this end, we introduce DreamStyler, a novel framework designed for artistic image synthesis, proficient in both text-to-image synthesis and style transfer. DreamStyler optimizes a multi-stage textual embedding with a context-aware text prompt, resulting in prominent image quality. In addition, with content and style guidance, DreamStyler exhibits flexibility to accommodate a range of style references. Experimental results demonstrate its superior performance across multiple scenarios, suggesting its promising potential in artistic product creation. △ Less

Submitted 18 December, 2023; v1 submitted 13 September, 2023; originally announced September 2023.

Comments: AAAI 2024

arXiv:2308.15077 [pdf, ps, other]

Quasiprojectile breakup and isospin equilibration at Fermi energies: an indication of longer projectile-target contact times?

Authors: C. Ciampi, S. Piantelli, G. Casini, A. Ono, J. D. Frankland, L. Baldesi, S. Barlini, B. Borderie, R. Bougault, A. Camaiani, A. Chbihi, J. A. Dueñas, Q. Fable, D. Fabris, C. Frosin, T. Génard, F. Gramegna, D. Gruyer, M. Henri, B. Hong, S. Kim, A. Kordyasz, T. Kozik, M. J. Kweon, N. Le Neindre , et al. (16 additional authors not shown)

Abstract: An investigation of the quasiprojectile breakup channel in semiperipheral and peripheral collisions of $^{58,64}$Ni+$^{58,64}$Ni at 32 and 52 MeV/nucleon is presented. Data have been acquired in the first experimental campaign of the INDRA-FAZIA apparatus in GANIL. The effect of isospin diffusion between projectile and target in the two asymmetric reactions has been highlighted by means of the iso… ▽ More An investigation of the quasiprojectile breakup channel in semiperipheral and peripheral collisions of $^{58,64}$Ni+$^{58,64}$Ni at 32 and 52 MeV/nucleon is presented. Data have been acquired in the first experimental campaign of the INDRA-FAZIA apparatus in GANIL. The effect of isospin diffusion between projectile and target in the two asymmetric reactions has been highlighted by means of the isospin transport ratio technique, exploiting the neutron-to-proton ratio of the quasiprojectile reconstructed from the two breakup fragments. We found evidence that, for the same reaction centrality, a higher degree of relaxation of the initial isospin imbalance is achieved in the breakup channel with respect to the more populated binary output, possibly indicating the indirect selection of specific dynamical features. We have proposed an interpretation based on different average projectile-target contact times related to the two exit channels under investigation, with a longer interaction for the breakup channel. The time information has been extracted from AMD simulations of the studied systems coupled to GEMINI++: the model calculations support the hypothesis hereby presented. △ Less

Submitted 29 August, 2023; originally announced August 2023.

arXiv:2308.02296 [pdf, other]

Scalable multiparty steering based on a single pair of entangled qubits

Authors: Alex Pepper, Travis. J. Baker, Yuanlong Wang, Qiu-Cheng Song, Lynden. K. Shalm, Varun. B. Varma, Sae Woo Nam, Nora Tischler, Sergei Slussarenko, Howard. M. Wiseman, Geoff. J. Pryde

Abstract: The distribution and verification of quantum nonlocality across a network of users is essential for future quantum information science and technology applications. However, beyond simple point-to-point protocols, existing methods struggle with increasingly complex state preparation for a growing number of parties. Here, we show that, surprisingly, multiparty loophole-free quantum steering, where o… ▽ More The distribution and verification of quantum nonlocality across a network of users is essential for future quantum information science and technology applications. However, beyond simple point-to-point protocols, existing methods struggle with increasingly complex state preparation for a growing number of parties. Here, we show that, surprisingly, multiparty loophole-free quantum steering, where one party simultaneously steers arbitrarily many spatially separate parties, is achievable by constructing a quantum network from a set of qubits of which only one pair is entangled. Using these insights, we experimentally demonstrate this type of steering between three parties with the detection loophole closed. With its modest and fixed entanglement requirements, this work introduces a scalable approach to rigorously verify quantum nonlocality across multiple parties, thus providing a practical tool towards developing the future quantum internet. △ Less

Submitted 4 August, 2023; originally announced August 2023.

arXiv:2307.10734 [pdf, other]

DC-DFT for Open Shells: How to Deal with Spin Contamination

Authors: Hayoung Yu, Suhwan Song, Seungsoo Nam, Kieron Burke, Eunji Sim

Abstract: Density functional theory (DFT) is widely used to predict chemical properties, but its accuracy is limited by functional approximations and their approximate self-consistent densities. Density-corrected DFT (DC-DFT) is the study of the errors due to densities and Hartree-Fock DFT (HF-DFT) uses HF densities to improve energetics. With increasing use of HF-DFT, the question of how to address strong… ▽ More Density functional theory (DFT) is widely used to predict chemical properties, but its accuracy is limited by functional approximations and their approximate self-consistent densities. Density-corrected DFT (DC-DFT) is the study of the errors due to densities and Hartree-Fock DFT (HF-DFT) uses HF densities to improve energetics. With increasing use of HF-DFT, the question of how to address strong spin contamination in the HF calculation becomes increasingly important. We compare two different open-shell HF densities across 13 different DFT functionals and two DC-DFT methods. For significant spin contamination, ROHF densities outperform UHF densities by as much as a factor of 3, depending on the energy functional, and ROHF-DFT improves over self-consistent DFT for most of the tested functionals. We refine the DC(HF)-DFT algorithm, recommending ROHF-DFT in cases of severe spin contamination. △ Less

Submitted 20 July, 2023; originally announced July 2023.

Comments: 10 pages, 4 figures, 2 tables

arXiv:2307.09038 [pdf, other]

Effects of Symmetry Energy on the Equation of State for Hybrid Neutron Stars

Authors: Parada T. P. Hutauruk, Hana Gil, Seung-il Nam, Chang Ho Hyun

Abstract: In this paper, the implications of the symmetry energy on the hadron and quark phase transitions in the compact star, including the properties of the possible configurations of the quark-hadron hybrid stars, are investigated in the frameworks of the energy-density functional (EDF) models and the flavor SU(2) Nambu--Jona-Lasinio (NJL) model with the help of the Schwinger's covariant proper-time reg… ▽ More In this paper, the implications of the symmetry energy on the hadron and quark phase transitions in the compact star, including the properties of the possible configurations of the quark-hadron hybrid stars, are investigated in the frameworks of the energy-density functional (EDF) models and the flavor SU(2) Nambu--Jona-Lasinio (NJL) model with the help of the Schwinger's covariant proper-time regularization (PTR) scheme. In this {theoretical setup}, the equations of states (EoSs) of hadronic matter for various values of symmetry energies obtained from the EDF models are employed to describe the hadronic matter, and the {flavor} SU(2) NJL model with various repulsive-vector interaction strengths are used to describe the quark matter. We then observe the obtained EoS in the mass-radius properties of the hybrid star configurations for various vector interactions and nuclear symmetry energies by solving the Tolman-Oppenheimer-Volkoff equation. We obtain that the critical density at which the phase transition occurs varies over the density (3.6--6.7)$ρ_0$ depending on the symmetry energy and the strength of the vector coupling $G_v$. The maximum mass of the neutron star (NS) is susceptible to $G_v$. When there is no repulsive force, the NS maximum mass is only about $1.5M_\odot$, but it becomes larger than $2.0M_\odot$ when the vector coupling constant is about half of the {attractive} scalar coupling constant. Surprisingly, the presence of the quark matter does not affect the canonical mass of NS ($1.4M_\odot$), so observing the canonical mass of NSs can provide unique constraints to the EoS of hadronic matter at high densities. △ Less

Submitted 18 July, 2023; originally announced July 2023.

Comments: 20 pages, 5 figures, 1 table

arXiv:2307.08610 [pdf, other]

Search for Light Dark Photon in the Forward Experiments at the LHC

Authors: Yeong Gyun Kim, Kang Young Lee, Soo-hyeon Nam

Abstract: We investigate detection possibility of light dark photon in the forward experiments at the LHC, such as the SND@LHC and the FASER experiments. We assume that the dark photon mass is smaller than twice of the electron mass. Then the dark photon is long-lived and copiously produced through a neutral pion decay. Such dark photons would easily pass through 100 m of rock in front of the forward experi… ▽ More We investigate detection possibility of light dark photon in the forward experiments at the LHC, such as the SND@LHC and the FASER experiments. We assume that the dark photon mass is smaller than twice of the electron mass. Then the dark photon is long-lived and copiously produced through a neutral pion decay. Such dark photons would easily pass through 100 m of rock in front of the forward experiments and the detector targets, but some portion of them could be converted into an electron-positron pair inside the detector leaving an isolated electromagnetic shower as a new physics signature of the dark photon. Our estimation shows that in the range of kinetic mixing parameter $4\times10^{-5} \lesssim ε\lesssim 2\times10^{-1}$, more than 10 signal events of the dark photon can be produced assuming 150 fb$^{-1}$ integrated luminosity. △ Less

Submitted 17 July, 2023; originally announced July 2023.

Comments: 8 pages, 2 figures

arXiv:2307.03962 [pdf, ps, other]

Achieving the Exactly Optimal Privacy-Utility Trade-Off with Low Communication Cost via Shared Randomness

Authors: Seung-Hyun Nam, Hyun-Young Park, Si-Hyeon Lee

Abstract: We consider a discrete distribution estimation problem under a local differential privacy (LDP) constraint in the presence of shared randomness. By exploiting the shared randomness, we suggest a new method for constructing LDP schemes which achieve the exactly optimal privacy-utility trade-off (PUT) with the communication cost of less than or equal to the input data size for any privacy regime. Th… ▽ More We consider a discrete distribution estimation problem under a local differential privacy (LDP) constraint in the presence of shared randomness. By exploiting the shared randomness, we suggest a new method for constructing LDP schemes which achieve the exactly optimal privacy-utility trade-off (PUT) with the communication cost of less than or equal to the input data size for any privacy regime. The main idea is to decompose a block design scheme by Park et al. (2023), based on the combinatorial concept called resolution. The LDP scheme decomposed from a block design scheme is called a resolution of the block design scheme, and it achieves the same PUT as the original block design scheme while requiring a less communication cost. We provide two resolutions of an exactly PUT-optimal block design scheme, called the Baranyai's resolution and the cyclic shift resolution, both requiring the communication cost of less than or equal to the input data size. In particular, we show that the Baranyai's resolution achieves the minimum communication cost among all the PUT-optimal resolutions of block design schemes. One drawback of the Baranyai's resolution is that it can be obtained through a recursive algorithm in general. In contrast, the cyclic shift resolution has an explicit structure, but its communication cost can be larger than that of Baranyai's resolution. To complement this, we also suggest resolutions of other block design schemes achieving the optimal PUT for some privacy budgets, which require the minimum communication cost as the Baranyai's resolution and have explicit structures as the cyclic shift resolution. △ Less

Submitted 8 July, 2023; originally announced July 2023.

Comments: 11 pages and 1 figure. This manuscript was submitted to IEEE Transactions on Information Theory

arXiv:2306.15969 [pdf, other]

Separable Physics-Informed Neural Networks

Authors: Junwoo Cho, Seungtae Nam, Hyunmo Yang, Seok-Bae Yun, Youngjoon Hong, Eunbyung Park

Abstract: Physics-informed neural networks (PINNs) have recently emerged as promising data-driven PDE solvers showing encouraging results on various PDEs. However, there is a fundamental limitation of training PINNs to solve multi-dimensional PDEs and approximate highly complex solution functions. The number of training points (collocation points) required on these challenging PDEs grows substantially, but… ▽ More Physics-informed neural networks (PINNs) have recently emerged as promising data-driven PDE solvers showing encouraging results on various PDEs. However, there is a fundamental limitation of training PINNs to solve multi-dimensional PDEs and approximate highly complex solution functions. The number of training points (collocation points) required on these challenging PDEs grows substantially, but it is severely limited due to the expensive computational costs and heavy memory overhead. To overcome this issue, we propose a network architecture and training algorithm for PINNs. The proposed method, separable PINN (SPINN), operates on a per-axis basis to significantly reduce the number of network propagations in multi-dimensional PDEs unlike point-wise processing in conventional PINNs. We also propose using forward-mode automatic differentiation to reduce the computational cost of computing PDE residuals, enabling a large number of collocation points (>10^7) on a single commodity GPU. The experimental results show drastically reduced computational costs (62x in wall-clock time, 1,394x in FLOPs given the same number of collocation points) in multi-dimensional PDEs while achieving better accuracy. Furthermore, we present that SPINN can solve a chaotic (2+1)-d Navier-Stokes equation significantly faster than the best-performing prior method (9 minutes vs 10 hours in a single GPU), maintaining accuracy. Finally, we showcase that SPINN can accurately obtain the solution of a highly nonlinear and multi-dimensional PDE, a (3+1)-d Navier-Stokes equation. For visualized results and code, please see https://jwcho5576.github.io/spinn.github.io/. △ Less

Submitted 31 October, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

Comments: To appear in NeurIPS 2023 (28 pages, 13 figures). workshop paper: arXiv:2211.08761

arXiv:2306.09473 [pdf, other]

doi 10.1038/s41586-023-06550-2

A superconducting-nanowire single-photon camera with 400,000 pixels

Authors: Bakhrom G. Oripov, Dana S. Rampini, Jason Allmaras, Matthew D. Shaw, Sae Woo Nam, Boris Korzh, Adam N. McCaughan

Abstract: For the last 50 years, superconducting detectors have offered exceptional sensitivity and speed for detecting faint electromagnetic signals in a wide range of applications. These detectors operate at very low temperatures and generate a minimum of excess noise, making them ideal for testing the non-local nature of reality, investigating dark matter, mapping the early universe, and performing quant… ▽ More For the last 50 years, superconducting detectors have offered exceptional sensitivity and speed for detecting faint electromagnetic signals in a wide range of applications. These detectors operate at very low temperatures and generate a minimum of excess noise, making them ideal for testing the non-local nature of reality, investigating dark matter, mapping the early universe, and performing quantum computation and communication. Despite their appealing properties, however, there are currently no large-scale superconducting cameras - even the largest demonstrations have never exceeded 20 thousand pixels. This is especially true for one of the most promising detector technologies, the superconducting nanowire single-photon detector (SNSPD). These detectors have been demonstrated with system detection efficiencies of 98.0%, sub-3-ps timing jitter, sensitivity from the ultraviolet (250nm) to the mid-infrared (10um), and dark count rates below 6.2e-6 counts per second (cps), but despite more than two decades of development they have never achieved an array size larger than a kilopixel. Here, we report on the implementation and characterization of a 400,000 pixel SNSPD camera, a factor of 400 improvement over the previous state-of-the-art. The array spanned an area 4x2.5 mm with a 5x5um resolution, reached unity quantum efficiency at wavelengths of 370 nm and 635 nm, counted at a rate of 1.1e5 cps, and had a dark count rate of 1e-4 cps per detector (corresponding to 0.13 cps over the whole array). The imaging area contains no ancillary circuitry and the architecture is scalable well beyond the current demonstration, paving the way for large-format superconducting cameras with 100% fill factors and near-unity detection efficiencies across a vast range of the electromagnetic spectrum. △ Less

Submitted 15 June, 2023; originally announced June 2023.

Showing 1–50 of 589 results for author: Nam, S