Search | arXiv e-print repository

arXiv:2408.12606 [pdf, other]

Towards Non-invasive and Personalized Management of Breast Cancer Patients from Multiparametric MRI via A Large Mixture-of-Modality-Experts Model

Authors: Luyang Luo, Mingxiang Wu, Mei Li, Yi Xin, Qiong Wang, Varut Vardhanabhuti, Winnie CW Chu, Zhenhui Li, Juan Zhou, Pranav Rajpurkar, Hao Chen

Abstract: Breast magnetic resonance imaging (MRI) is the imaging technique with the highest sensitivity for detecting breast cancer and is routinely used for women at high risk. Despite the comprehensive multiparametric protocol of breast MRI, existing artificial intelligence-based studies predominantly rely on single sequences and have limited validation. Here we report a large mixture-of-modality-experts… ▽ More Breast magnetic resonance imaging (MRI) is the imaging technique with the highest sensitivity for detecting breast cancer and is routinely used for women at high risk. Despite the comprehensive multiparametric protocol of breast MRI, existing artificial intelligence-based studies predominantly rely on single sequences and have limited validation. Here we report a large mixture-of-modality-experts model (MOME) that integrates multiparametric MRI information within a unified structure, offering a noninvasive method for personalized breast cancer management. We have curated the largest multiparametric breast MRI dataset, involving 5,205 patients from three hospitals in the north, southeast, and southwest of China, for the development and extensive evaluation of our model. MOME demonstrated accurate and robust identification of breast cancer. It achieved comparable performance for malignancy recognition to that of four senior radiologists and significantly outperformed a junior radiologist, with 0.913 AUROC, 0.948 AUPRC, 0.905 F1 score, and 0.723 MCC. Our findings suggest that MOME could reduce the need for biopsies in BI-RADS 4 patients with a ratio of 7.3%, classify triple-negative breast cancer with an AUROC of 0.709, and predict pathological complete response to neoadjuvant chemotherapy with an AUROC of 0.694. The model further supports scalable and interpretable inference, adapting to missing modalities and providing decision explanations by highlighting lesions and measuring modality contributions. MOME exemplifies a discriminative, robust, scalable, and interpretable multimodal model, paving the way for noninvasive, personalized management of breast cancer patients based on multiparametric breast imaging data. △ Less

Submitted 8 August, 2024; originally announced August 2024.

Comments: 27 pages, 8 figures, 10 tables

arXiv:2408.11046 [pdf, other]

Inside the Black Box: Detecting Data Leakage in Pre-trained Language Encoders

Authors: Yuan Xin, Zheng Li, Ning Yu, Dingfan Chen, Mario Fritz, Michael Backes, Yang Zhang

Abstract: Despite being prevalent in the general field of Natural Language Processing (NLP), pre-trained language models inherently carry privacy and copyright concerns due to their nature of training on large-scale web-scraped data. In this paper, we pioneer a systematic exploration of such risks associated with pre-trained language encoders, specifically focusing on the membership leakage of pre-training… ▽ More Despite being prevalent in the general field of Natural Language Processing (NLP), pre-trained language models inherently carry privacy and copyright concerns due to their nature of training on large-scale web-scraped data. In this paper, we pioneer a systematic exploration of such risks associated with pre-trained language encoders, specifically focusing on the membership leakage of pre-training data exposed through downstream models adapted from pre-trained language encoders-an aspect largely overlooked in existing literature. Our study encompasses comprehensive experiments across four types of pre-trained encoder architectures, three representative downstream tasks, and five benchmark datasets. Intriguingly, our evaluations reveal, for the first time, the existence of membership leakage even when only the black-box output of the downstream model is exposed, highlighting a privacy risk far greater than previously assumed. Alongside, we present in-depth analysis and insights toward guiding future researchers and practitioners in addressing the privacy considerations in developing pre-trained language models. △ Less

Submitted 20 August, 2024; originally announced August 2024.

Comments: ECAI24

arXiv:2408.10299 [pdf, other]

RV measurements of directly imaged brown dwarf GQ Lup B to search for exo-satellites

Authors: Katelyn Horstman, Jean-Baptiste Ruffio, Konstantin Batygin, Dimitri Mawet, Ashley Baker, Chih-Chun Hsu, Jason J. Wang, Ji Wang, Sarah Blunt, Jerry W. Xuan, Yinzi Xin, Joshua Liberman, Shubh Agrawal, Quinn M. Konopacky, Geoffrey A. Blake, Clarissa R. Do O, Randall Bartos, Charlotte Z. Bond, Benjamin Calvin, Sylvain Cetre, Jacques-Robert Delorme, Greg Doppmann, Daniel Echeverri, Luke Finnerty, Michael P. Fitzgerald , et al. (13 additional authors not shown)

Abstract: GQ Lup B is one of the few substellar companions with a detected cicumplanetary disk, or CPD. Observations of the CPD suggest the presence of a cavity, possibly formed by an exo-satellite. Using the Keck Planet Imager and Characterizer (KPIC), a high contrast imaging suite that feeds a high resolution spectrograph (1.9-2.5 microns, R$\sim$35,000), we present the first dedicated radial velocity (RV… ▽ More GQ Lup B is one of the few substellar companions with a detected cicumplanetary disk, or CPD. Observations of the CPD suggest the presence of a cavity, possibly formed by an exo-satellite. Using the Keck Planet Imager and Characterizer (KPIC), a high contrast imaging suite that feeds a high resolution spectrograph (1.9-2.5 microns, R$\sim$35,000), we present the first dedicated radial velocity (RV) observations around a high-contrast, directly imaged substellar companion, GQ Lup B, to search for exo-satellites. Over 11 epochs, we find a best and median RV error of 400-1000 m/s, most likely limited by systematic fringing in the spectra due to transmissive optics within KPIC. With this RV precision, KPIC is sensitive to exomoons 0.6-2.8% the mass of GQ Lup B ($\sim 30 M_{\text{Jup}}$) at separations between the Roche limit and $65 R_{\text{Jup}}$, or the extent of the cavity inferred within the CPD detected around GQ Lup B. Using simulations of HISPEC, a high resolution infrared spectrograph planned to debut at W.M. Keck Observatory in 2026, we estimate future exomoon sensitivity to increase by over an order of magnitude, providing sensitivity to less massive satellites potentially formed within the CPD itself. Additionally, we run simulations to estimate the amount of material that different masses of satellites could clear in a CPD to create the observed cavity. We find satellite-to-planet mass ratios of $q > 2 \times 10^{-4}$ can create observable cavities and report a maximum cavity size of $\sim 51 \, R_{\text{Jup}}$ carved from a satellite. △ Less

Submitted 19 August, 2024; originally announced August 2024.

Comments: 15 pages, 5 figures

arXiv:2408.06973 [pdf, other]

Deepest limits on scattered light emission from the Epsilon Eridani inner debris disk with HST/STIS

Authors: Sai Krishanth P. M., Ewan S. Douglas, Ramya M. Anche, Justin Hom, Kerri L. Cahoy, John H. Debes, Hannah Jang-Condell, Isabel Rebollido, Bin B. Ren, Christopher C. Stark, Robert Thompson, Yinzi Xin

Abstract: Epsilon Eridani ($ε$ Eri) is one of the first debris disk systems detected by the Infrared Astronomical Satellite (IRAS). However, the system has thus far eluded detection in scattered light with no components having been directly imaged. Its similarity to a relatively young Solar System combined with its proximity makes it an excellent candidate to further our understanding of planetary system ev… ▽ More Epsilon Eridani ($ε$ Eri) is one of the first debris disk systems detected by the Infrared Astronomical Satellite (IRAS). However, the system has thus far eluded detection in scattered light with no components having been directly imaged. Its similarity to a relatively young Solar System combined with its proximity makes it an excellent candidate to further our understanding of planetary system evolution. We present a set of coronagraphic images taken using the Space Telescope Imaging Spectrograph (STIS) coronagraph on the Hubble space telescope at a small inner working angle to detect a predicted warm inner debris disk inside 1". We used three different post-processing approaches; Non-negative Matrix Factorization (NMF), Karhunen-Lo`eve Image Processing (KLIP), and Classical reference differential imaging (RDI), to best optimize reference star subtraction, and find that NMF performed the best overall while KLIP produced the absolute best contrast inside 1". We present limits on scattered light from warm dust, with constraints on surface brightness at 6 mJy/as$^2$ at our inner working angle of 0.6". We also place a constraint of 0.5 mJy/as$^2$ outside 1", which gives us an upper limit on the brightness for outer disks and substellar companions. Finally, we calculated an upper limit on the dust albedo at $ω<$ 0.487. △ Less

Submitted 14 August, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

Comments: 13+2 pages, 7+2 figures; Accepted for publication in the Astronomical Journal

arXiv:2407.19198 [pdf, other]

Towards the Dynamics of a DNN Learning Symbolic Interactions

Authors: Qihan Ren, Yang Xu, Junpeng Zhang, Yue Xin, Dongrui Liu, Quanshi Zhang

Abstract: This study proves the two-phase dynamics of a deep neural network (DNN) learning interactions. Despite the long disappointing view of the faithfulness of post-hoc explanation of a DNN, in recent years, a series of theorems have been proven to show that given an input sample, a small number of interactions between input variables can be considered as primitive inference patterns, which can faithful… ▽ More This study proves the two-phase dynamics of a deep neural network (DNN) learning interactions. Despite the long disappointing view of the faithfulness of post-hoc explanation of a DNN, in recent years, a series of theorems have been proven to show that given an input sample, a small number of interactions between input variables can be considered as primitive inference patterns, which can faithfully represent every detailed inference logic of the DNN on this sample. Particularly, it has been observed that various DNNs all learn interactions of different complexities with two-phase dynamics, and this well explains how a DNN's generalization power changes from under-fitting to over-fitting. Therefore, in this study, we prove the dynamics of a DNN gradually encoding interactions of different complexities, which provides a theoretically grounded mechanism for the over-fitting of a DNN. Experiments show that our theory well predicts the real learning dynamics of various DNNs on different tasks. △ Less

Submitted 27 July, 2024; originally announced July 2024.

arXiv:2407.04942 [pdf, other]

FOSP: Fine-tuning Offline Safe Policy through World Models

Authors: Chenyang Cao, Yucheng Xin, Silang Wu, Longxiang He, Zichen Yan, Junbo Tan, Xueqian Wang

Abstract: Model-based Reinforcement Learning (RL) has shown its high training efficiency and capability of handling high-dimensional tasks. Regarding safety issues, safe model-based RL can achieve nearly zero-cost performance and effectively manage the trade-off between performance and safety. Nevertheless, prior works still pose safety challenges due to the online exploration in real-world deployment. To a… ▽ More Model-based Reinforcement Learning (RL) has shown its high training efficiency and capability of handling high-dimensional tasks. Regarding safety issues, safe model-based RL can achieve nearly zero-cost performance and effectively manage the trade-off between performance and safety. Nevertheless, prior works still pose safety challenges due to the online exploration in real-world deployment. To address this, some offline RL methods have emerged as solutions, which learn from a static dataset in a safe way by avoiding interactions with the environment. In this paper, we aim to further enhance safety during the deployment stage for vision-based robotic tasks by fine-tuning an offline-trained policy. We incorporate in-sample optimization, model-based policy expansion, and reachability guidance to construct a safe offline-to-online framework. Moreover, our method proves to improve the generalization of offline policy in unseen safety-constrained scenarios. Finally, the efficiency of our method is validated on simulation benchmarks with five vision-only tasks and a real robot by solving some deployment problems using limited data. △ Less

Submitted 5 July, 2024; originally announced July 2024.

Comments: 21 pages

arXiv:2407.02757 [pdf, other]

Evolution of High-energy Electron Distribution in Pulsar Wind Nebulae

Authors: Yi-Ming Liu, Hou-Dun Zeng, Yu-Liang Xin, Si-Ming Liu, Yi Zhang

Abstract: In this paper, we analyze the spectral energy distributions (SEDs) of 17 powerful (with a spin-down luminosity greater than $10^{35}$ erg s$^{-1}$) young (with an age less than 15000 yrs) pulsar wind nebulae (PWNe) using a simple time-independent one-zone emission model. Our aim is to investigate correlations between model parameters and the ages of the corresponding PWNe, thereby revealing the ev… ▽ More In this paper, we analyze the spectral energy distributions (SEDs) of 17 powerful (with a spin-down luminosity greater than $10^{35}$ erg s$^{-1}$) young (with an age less than 15000 yrs) pulsar wind nebulae (PWNe) using a simple time-independent one-zone emission model. Our aim is to investigate correlations between model parameters and the ages of the corresponding PWNe, thereby revealing the evolution of high-energy electron distributions within PWNe. Our findings are as follows: (1) The electron distributions in PWNe can be characterized by a double power-law with a superexponential cutoff; (2) As PWNe evolve, the high-energy end of the electron distribution spectrum becomes harder with the index decreasing from approximately 3.5 to 2.5, while the low-energy end spectrum index remains constant near 1.5; (3) There is no apparent correlation between the break energy or cutoff energy and the age of PWNe. (4) The average magnetic field within PWNe decreases with age, leading to a positive correlation between the energy loss timescale of electrons at the break energy or the high-energy cutoff, and the age of the PWN. (5) The total electron energy within PWNe remains constant near $2 \times 10^{48}$ erg, while the total magnetic energy decreases with age. △ Less

Submitted 2 July, 2024; originally announced July 2024.

Comments: 19 pages, 20 figures, 1 table accepted for publication in RAA

arXiv:2406.15028 [pdf, other]

The high-contrast performance of the Keck Planet Imager and Characterizer

Authors: Jason J. Wang, Dimitri Mawet, Jerry W. Xuan, Chih-Chun Hsu, Jean-Baptiste Ruffio, Katelyn Horstman, Yinzi Xin, Jacques-Robert Delorme, Nemanja Jovanovic, Yapeng Zhang, Luke Finnerty, Ashley Baker, Randall Bartos, Geoffrey A. Blake, Benjamin Calvin, Sylvain Cetre, Gregory W. Doppmann, Daniel Echeverri, Michael P. Fitzgerald, Joshua Liberman, Ronald Lopez, Evan Morris, Jacklyn Pezzato-Rovner, Ben Sappey, Tobias Schofield , et al. (3 additional authors not shown)

Abstract: The Keck Planet Imager and Characterizer (KPIC), a series of upgrades to the Keck II Adaptive Optics System and Instrument Suite, aims to demonstrate high-resolution spectroscopy of faint exoplanets that are spatially resolved from their host stars. In this paper, we measure KPIC's sensitivity to companions as a function of separation (i.e., the contrast curve) using on-sky data collected over fou… ▽ More The Keck Planet Imager and Characterizer (KPIC), a series of upgrades to the Keck II Adaptive Optics System and Instrument Suite, aims to demonstrate high-resolution spectroscopy of faint exoplanets that are spatially resolved from their host stars. In this paper, we measure KPIC's sensitivity to companions as a function of separation (i.e., the contrast curve) using on-sky data collected over four years of operation. We show that KPIC is able to reach contrasts of $1.3 \times 10^{-4}$ at 90 mas and $9.2 \times 10^{-6}$ at 420 mas separation from the star, and that KPIC can reach planet-level sensitivities at angular separations within the inner working angle of coronagraphic instruments such as GPI and SPHERE. KPIC is also able to achieve more extreme contrasts than other medium-/high-resolution spectrographs that are not as optimized for high-contrast performance. We decompose the KPIC performance budget into individual noise terms and discuss limiting factors. The fringing that results from combining a high-contrast imaging system with a high-resolution spectrograph is identified as an important source of systematic noise. After mitigation and correction, KPIC is able to reach within a factor of 2 of the photon noise limit at separations < 200 mas. At large separations, KPIC is limited by the background noise performance of NIRSPEC. △ Less

Submitted 21 June, 2024; originally announced June 2024.

Comments: 16 pages, 6 figures, submitted to the proceedings of SPIE Astronomical Telescopes + Instrumentation 2024, 13096-69

arXiv:2406.13035 [pdf, other]

D2O: Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models

Authors: Zhongwei Wan, Xinjian Wu, Yu Zhang, Yi Xin, Chaofan Tao, Zhihong Zhu, Xin Wang, Siqi Luo, Jing Xiong, Mi Zhang

Abstract: Efficient inference in Large Language Models (LLMs) is impeded by the growing memory demands of key-value (KV) caching, especially for longer sequences. Traditional KV cache eviction strategies, which prioritize less critical KV-pairs based on attention scores, often degrade generation quality, leading to issues such as context loss or hallucinations. To address this, we introduce Dynamic Discrimi… ▽ More Efficient inference in Large Language Models (LLMs) is impeded by the growing memory demands of key-value (KV) caching, especially for longer sequences. Traditional KV cache eviction strategies, which prioritize less critical KV-pairs based on attention scores, often degrade generation quality, leading to issues such as context loss or hallucinations. To address this, we introduce Dynamic Discriminative Operations (D2O), a novel method that utilizes two-level discriminative strategies to optimize KV cache size without fine-tuning, while preserving essential context. Initially, by observing varying densities of attention weights between shallow and deep layers, we use this insight to determine which layers should avoid excessive eviction to minimize information loss. Subsequently, for the eviction strategy in each layer, D2O innovatively incorporates a compensation mechanism that maintains a similarity threshold to re-discriminate the importance of previously discarded tokens, determining whether they should be recalled and merged with similar tokens. Our approach not only achieves significant memory savings and enhances inference throughput by more than 3 times but also maintains high-quality long-text generation. Extensive experiments across various benchmarks and LLM architectures have demonstrated that D2O significantly enhances performance with a constrained KV cache budget. △ Less

Submitted 23 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

arXiv:2406.08698 [pdf, other]

Constraints on Ultra Heavy Dark Matter Properties from Dwarf Spheroidal Galaxies with LHAASO Observations

Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

Abstract: In this work we try to search for signals generated by ultra-heavy dark matter at the Large High Altitude Air Shower Observatory (LHAASO) data. We look for possible gamma-ray by dark matter annihilation or decay from 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter which have low fluxes… ▽ More In this work we try to search for signals generated by ultra-heavy dark matter at the Large High Altitude Air Shower Observatory (LHAASO) data. We look for possible gamma-ray by dark matter annihilation or decay from 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter which have low fluxes of astrophysical $γ$-ray background while large amount of dark matter. By analyzing more than 700 days observational data at LHAASO, no significant dark matter signal from 1 TeV to 1 EeV is detected. Accordingly we derive the most stringent constraints on the ultra-heavy dark matter annihilation cross-section up to EeV. The constraints on the lifetime of dark matter in decay mode are also derived. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: 17 pages, 12 figures, accepted by PRL

arXiv:2406.07809 [pdf, other]

Did Harold Zuercher Have Time-Separable Preferences?

Authors: Jay Lu, Yao Luo, Kota Saito, Yi Xin

Abstract: This paper proposes an empirical model of dynamic discrete choice to allow for non-separable time preferences, generalizing the well-known Rust (1987) model. Under weak conditions, we show the existence of value functions and hence well-defined optimal choices. We construct a contraction mapping of the value function and propose an estimation method similar to Rust's nested fixed point algorithm.… ▽ More This paper proposes an empirical model of dynamic discrete choice to allow for non-separable time preferences, generalizing the well-known Rust (1987) model. Under weak conditions, we show the existence of value functions and hence well-defined optimal choices. We construct a contraction mapping of the value function and propose an estimation method similar to Rust's nested fixed point algorithm. Finally, we apply the framework to the bus engine replacement data. We improve the fit of the data with our general model and reject the null hypothesis that Harold Zuercher has separable time preferences. Misspecifying an agent's preference as time-separable when it is not leads to biased inferences about structure parameters (such as the agent's risk attitudes) and misleading policy recommendations. △ Less

Submitted 11 June, 2024; originally announced June 2024.

arXiv:2406.07476 [pdf, other]

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Authors: Zesen Cheng, Sicong Leng, Hang Zhang, Yifei Xin, Xin Li, Guanzheng Chen, Yongxin Zhu, Wenqi Zhang, Ziyang Luo, Deli Zhao, Lidong Bing

Abstract: In this paper, we present the VideoLLaMA 2, a set of Video Large Language Models (Video-LLMs) designed to enhance spatial-temporal modeling and audio understanding in video and audio-oriented tasks. Building upon its predecessor, VideoLLaMA 2 incorporates a tailor-made Spatial-Temporal Convolution (STC) connector, which effectively captures the intricate spatial and temporal dynamics of video data… ▽ More In this paper, we present the VideoLLaMA 2, a set of Video Large Language Models (Video-LLMs) designed to enhance spatial-temporal modeling and audio understanding in video and audio-oriented tasks. Building upon its predecessor, VideoLLaMA 2 incorporates a tailor-made Spatial-Temporal Convolution (STC) connector, which effectively captures the intricate spatial and temporal dynamics of video data. Additionally, we integrate an Audio Branch into the model through joint training, thereby enriching the multimodal understanding capabilities of the model by seamlessly incorporating audio cues. Comprehensive evaluations on multiple-choice video question answering (MC-VQA), open-ended video question answering (OE-VQA), and video captioning (VC) tasks demonstrate that VideoLLaMA 2 consistently achieves competitive results among open-source models and even gets close to some proprietary models on several benchmarks. Furthermore, VideoLLaMA 2 exhibits reasonable improvements in audio-only and audio-video question-answering (AQA & OE-AVQA) benchmarks over existing models. These advancements underline VideoLLaMA 2's superior performance in multimodal comprehension, setting a new standard for intelligent video analysis systems. All models are public to facilitate further research. △ Less

Submitted 17 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

Comments: ZC, SL, HZ, YX, and XL contributed equally to this project

arXiv:2406.02632 [pdf, other]

Redefining DDoS Attack Detection Using A Dual-Space Prototypical Network-Based Approach

Authors: Fernando Martinez, Mariyam Mapkar, Ali Alfatemi, Mohamed Rahouti, Yufeng Xin, Kaiqi Xiong, Nasir Ghani

Abstract: Distributed Denial of Service (DDoS) attacks pose an increasingly substantial cybersecurity threat to organizations across the globe. In this paper, we introduce a new deep learning-based technique for detecting DDoS attacks, a paramount cybersecurity challenge with evolving complexity and scale. Specifically, we propose a new dual-space prototypical network that leverages a unique dual-space loss… ▽ More Distributed Denial of Service (DDoS) attacks pose an increasingly substantial cybersecurity threat to organizations across the globe. In this paper, we introduce a new deep learning-based technique for detecting DDoS attacks, a paramount cybersecurity challenge with evolving complexity and scale. Specifically, we propose a new dual-space prototypical network that leverages a unique dual-space loss function to enhance detection accuracy for various attack patterns through geometric and angular similarity measures. This approach capitalizes on the strengths of representation learning within the latent space (a lower-dimensional representation of data that captures complex patterns for machine learning analysis), improving the model's adaptability and sensitivity towards varying DDoS attack vectors. Our comprehensive evaluation spans multiple training environments, including offline training, simulated online training, and prototypical network scenarios, to validate the model's robustness under diverse data abundance and scarcity conditions. The Multilayer Perceptron (MLP) with Attention, trained with our dual-space prototypical design over a reduced training set, achieves an average accuracy of 94.85% and an F1-Score of 94.71% across our tests, showcasing its effectiveness in dynamic and constrained real-world scenarios. △ Less

Submitted 3 June, 2024; originally announced June 2024.

Comments: 9 pages, The 33rd International Conference on Computer Communications and Networks (ICCCN 2024)

arXiv:2406.00233 [pdf, other]

Plug-in UL-CSI-Assisted Precoder Upsampling Approach in Cellular FDD Systems

Authors: Yu-Chien Lin, Yan Xin, Ta-Sung Lee, Charlie, Zhang, Yibo Ma, Zhi Ding

Abstract: Acquiring downlink channel state information (CSI) is crucial for optimizing performance in massive Multiple Input Multiple Output (MIMO) systems operating under Frequency-Division Duplexing (FDD). Most cellular wireless communication systems employ codebook-based precoder designs, which offer advantages such as simpler, more efficient feedback mechanisms and reduced feedback overhead. Common code… ▽ More Acquiring downlink channel state information (CSI) is crucial for optimizing performance in massive Multiple Input Multiple Output (MIMO) systems operating under Frequency-Division Duplexing (FDD). Most cellular wireless communication systems employ codebook-based precoder designs, which offer advantages such as simpler, more efficient feedback mechanisms and reduced feedback overhead. Common codebook-based approaches include Type II and eType II precoding methods defined in the 3GPP standards. Feedback in these systems is typically standardized per subband (SB), allowing user equipment (UE) to select the optimal precoder from the codebook for each SB, thereby reducing feedback overhead. However, this subband-level feedback resolution may not suffice for frequency-selective channels. This paper addresses this issue by introducing an uplink CSI-assisted precoder upsampling module deployed at the gNodeB. This module upsamples SB-level precoders to resource block (RB)-level precoders, acting as a plug-in compatible with existing gNodeB or base stations. △ Less

Submitted 31 May, 2024; originally announced June 2024.

arXiv:2405.17125 [pdf]

Strategies to enhance THz harmonic generation combining multilayered, gated, and metamaterial-based architectures

Authors: Ali Maleki, Moritz B. Heindl, Yongbao Xin, Robert W. Boyd, Georg Herink, Jean-Michel Ménard

Abstract: Graphene has unique properties paving the way for groundbreaking future applications. Its large optical nonlinearity and ease of integration in devices notably makes it an ideal candidate to become a key component for all-optical switching and frequency conversion applications. In the terahertz (THz) region, various approaches have been independently demonstrated to optimize the nonlinear effects… ▽ More Graphene has unique properties paving the way for groundbreaking future applications. Its large optical nonlinearity and ease of integration in devices notably makes it an ideal candidate to become a key component for all-optical switching and frequency conversion applications. In the terahertz (THz) region, various approaches have been independently demonstrated to optimize the nonlinear effects in graphene, addressing a critical limitation arising from the atomically thin interaction length. Here, we demonstrate sample architectures that combine strategies to enhance THz nonlinearities in graphene-based structures. We achieve this by increasing the interaction length through a multilayered design, controlling carrier density with an electrical gate, and modulating the THz field spatial distribution with a metallic metasurface substrate. Our study specifically investigates third harmonic generation (THG) using a table-top high-field THz source. We measure THG enhancement factors exceeding thirty and propose architectures capable of achieving a two-order-of-magnitude increase. These findings highlight the potential of engineered graphene-based samples in advancing THz frequency conversion technologies for signal processing and wireless communication applications. △ Less

Submitted 27 May, 2024; originally announced May 2024.

Comments: 13 pages (4 Figures) + 5 pages Supplementary Information (4 Figures)

arXiv:2405.15330 [pdf, other]

Towards Understanding the Working Mechanism of Text-to-Image Diffusion Model

Authors: Mingyang Yi, Aoxue Li, Yi Xin, Zhenguo Li

Abstract: Recently, the strong latent Diffusion Probabilistic Model (DPM) has been applied to high-quality Text-to-Image (T2I) generation (e.g., Stable Diffusion), by injecting the encoded target text prompt into the gradually denoised diffusion image generator. Despite the success of DPM in practice, the mechanism behind it remains to be explored. To fill this blank, we begin by examining the intermediate… ▽ More Recently, the strong latent Diffusion Probabilistic Model (DPM) has been applied to high-quality Text-to-Image (T2I) generation (e.g., Stable Diffusion), by injecting the encoded target text prompt into the gradually denoised diffusion image generator. Despite the success of DPM in practice, the mechanism behind it remains to be explored. To fill this blank, we begin by examining the intermediate statuses during the gradual denoising generation process in DPM. The empirical observations indicate, the shape of image is reconstructed after the first few denoising steps, and then the image is filled with details (e.g., texture). The phenomenon is because the low-frequency signal (shape relevant) of the noisy image is not corrupted until the final stage in the forward process (initial stage of generation) of adding noise in DPM. Inspired by the observations, we proceed to explore the influence of each token in the text prompt during the two stages. After a series of experiments of T2I generations conditioned on a set of text prompts. We conclude that in the earlier generation stage, the image is mostly decided by the special token [\texttt{EOS}] in the text prompt, and the information in the text prompt is already conveyed in this stage. After that, the diffusion model completes the details of generated images by information from themselves. Finally, we propose to apply this observation to accelerate the process of T2I generation by properly removing text guidance, which finally accelerates the sampling up to 25\%+. △ Less

Submitted 24 May, 2024; originally announced May 2024.

arXiv:2405.14700 [pdf, other]

Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference

Authors: Ting Liu, Xuyang Liu, Siteng Huang, Liangtao Shi, Zunnan Xu, Yi Xin, Quanjun Yin, Xiaohong Liu

Abstract: Parameter-efficient fine-tuning (PEFT) has emerged as a popular solution for adapting pre-trained Vision Transformer (ViT) models to downstream applications. While current PEFT methods have achieved parameter efficiency, they overlook the efficiency of computation and GPU memory during both fine-tuning and inference, falling short of practical requirements. In this paper, we propose \textbf{Sparse… ▽ More Parameter-efficient fine-tuning (PEFT) has emerged as a popular solution for adapting pre-trained Vision Transformer (ViT) models to downstream applications. While current PEFT methods have achieved parameter efficiency, they overlook the efficiency of computation and GPU memory during both fine-tuning and inference, falling short of practical requirements. In this paper, we propose \textbf{Sparse-Tuning}, a novel PEFT method that accounts for the information redundancy in images and videos to boost the above efficiency. By sparsely preserving the semantic-relevant tokens and merging irrelevant ones, Sparse-Tuning minimizes the quantity of tokens processed at each layer, leading to a quadratic reduction in computational and memory overhead. To align our token sparsification strategy suitably with fine-tuning purposes, we further design Dense Adapters that establish dense connections from shallow layers to deeper layers. These Dense Adapters integrate multi-level local features to enrich the current tokens, improving both token preservation and model adaptation. Empirical results on VTAB-1K, three image datasets, and two video datasets show that our Sparse-Tuning reduces GFLOPs to \textbf{62\%-70\%} of the original ViT-B while achieving state-of-the-art performance. Source code is available at \url{https://github.com/liuting20/Sparse-Tuning}. △ Less

Submitted 29 August, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

arXiv:2405.11826 [pdf, other]

Data quality control system and long-term performance monitor of the LHAASO-KM2A

Authors: Zhen Cao, F. Aharonian, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, H. X. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen , et al. (263 additional authors not shown)

Abstract: The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To… ▽ More The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To ensure the reliability of the LHAASO-KM2A data, a three-level quality control system has been established. It is used to monitor the status of detector units, stability of reconstructed parameters and the performance of the array based on observations of the Crab Nebula and Moon shadow. This paper will introduce the control system and its application on the LHAASO-KM2A data collected from August 2021 to July 2023. During this period, the pointing and angular resolution of the array were stable. From the observations of the Moon shadow and Crab Nebula, the results achieved using the two methods are consistent with each other. According to the observation of the Crab Nebula at energies from 25 TeV to 100 TeV, the time averaged pointing errors are estimated to be $-0.003^{\circ} \pm 0.005^{\circ}$ and $0.001^{\circ} \pm 0.006^{\circ}$ in the R.A. and Dec directions, respectively. △ Less

Submitted 13 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

Comments: 15 pages, 9 figures

arXiv:2405.08312 [pdf, other]

doi 10.3847/1538-4357/ad58d3

Rotation and Abundances of the Benchmark Brown Dwarf HD 33632 Ab from Keck/KPIC High-resolution Spectroscopy

Authors: Chih-Chun Hsu, Jason J. Wang, Jerry W. Xuan, Jean-Baptiste Ruffio, Daniel Echeverri, Yinzi Xin, Joshua Liberman, Luke Finnerty, Evan Morris, Katelyn Horstman, Ben Sappey, Gregory W. Doppmann, Dimitri Mawet, Nemanja Jovanovic, Michael P. Fitzgerald, Jacques-Robert Delorme, J. Kent Wallace, Ashley Baker, Randall Bartos, Geoffrey A. Blake, Benjamin Calvin, Sylvain Cetre, Ronald A. López, Jacklyn Pezzato, Tobias Schofield , et al. (2 additional authors not shown)

Abstract: We present the projected rotational velocity and molecular abundances for HD 33632 Ab obtained via Keck Planet Imager and Characterizer high-resolution spectroscopy. HD 33632 Ab is a nearby benchmark brown dwarf companion at a separation of $\sim$20 au that straddles the L/T transition. Using a forward-modeling framework with on-axis host star spectra, self-consistent substellar atmospheric and re… ▽ More We present the projected rotational velocity and molecular abundances for HD 33632 Ab obtained via Keck Planet Imager and Characterizer high-resolution spectroscopy. HD 33632 Ab is a nearby benchmark brown dwarf companion at a separation of $\sim$20 au that straddles the L/T transition. Using a forward-modeling framework with on-axis host star spectra, self-consistent substellar atmospheric and retrieval models for HD 33632 Ab, we derive a projected rotational velocity of 53 $\pm$ 3 km/s and carbon/water mass fractions of log CO = $-$2.3 $\pm$ 0.3 and log H$_2$O = $-$2.7 $\pm$ 0.2. The inferred carbon-to-oxygen ratio (C/O = 0.58 $\pm$ 0.14), molecular abundances, and metallicity ([C/H] = 0.0 $\pm$ 0.2 dex) of HD 33632 Ab are consistent with its host star. Although detectable methane opacities are expected in L/T transition objects, we did not recover methane in our KPIC spectra, partly due to the high $v\sin{i}$ and to disequilibrium chemistry at the pressures we are sensitive to. We parameterize the spin as the ratio of rotation over break-up velocity, and compare HD 33632 Ab to a compilation of >200 very low-mass objects (M$\lesssim$0.1 M$_{\odot}$) that have spin measurements in the literature. There appears to be no clear trend for the isolated field low-mass objects versus mass, but a tentative trend is identified for low-mass companions and directly imaged exoplanets, similar to previous findings. A larger sample of close-in gas giant exoplanets and brown dwarfs will critically examine our understanding of their formation and evolution through rotation and chemical abundance measurements. △ Less

Submitted 18 June, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

Comments: Accepted for publication in the Astrophysical Journal. 36 pages, 15 figures, 5 tables

arXiv:2405.07691 [pdf, other]

Discovery of Very-high-energy Gamma-ray Emissions from the Low Luminosity AGN NGC 4278 by LHAASO

Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

Abstract: The first source catalog of Large High Altitude Air Shower Observatory reported the detection of a very-high-energy gamma ray source, 1LHAASO J1219+2915. In this paper a further detailed study of the spectral and temporal behavior of this point-like source have been carried. The best-fit position of the TeV source ($\rm{RA}=185.05^{\circ}\pm0.04^{\circ}$, $\rm{Dec}=29.25^{\circ}\pm0.03^{\circ}$) i… ▽ More The first source catalog of Large High Altitude Air Shower Observatory reported the detection of a very-high-energy gamma ray source, 1LHAASO J1219+2915. In this paper a further detailed study of the spectral and temporal behavior of this point-like source have been carried. The best-fit position of the TeV source ($\rm{RA}=185.05^{\circ}\pm0.04^{\circ}$, $\rm{Dec}=29.25^{\circ}\pm0.03^{\circ}$) is compatible with NGC 4278 within $\sim0.03$ degree. Variation analysis shows an indication of the variability at a few months level in the TeV band, which is consistent with low frequency observations. Based on these observations, we report the detection of TeV $γ$-ray emissions from this low-luminosity AGN NGC 4278. The observations by LHAASO-WCDA during active period has a significance level of 8.8\,$σ$ with best-fit photon spectral index $\varGamma=2.56\pm0.14$ and a flux $f_{1-10\,\rm{TeV}}=(7.0\pm1.1_{\rm{sta}}\pm0.35_{\rm{syst}})\times10^{-13}\,\rm{photons\,cm^{-2}\,s^{-1}}$, or approximately $5\%$ of the Crab Nebula. The discovery of VHE from NGC 4278 indicates that the compact, weak radio jet can efficiently accelerate particles and emit TeV photons. △ Less

Submitted 13 May, 2024; originally announced May 2024.

Comments: 11 pages, 5 figures

arXiv:2405.06262 [pdf, other]

Creating cyclo-N$_5$$^{+}$ cation and assembling N$_5$$^{+}$N$_5$$^{-}$ salt via electronegativity co-matching in tailored ionic compounds

Authors: Bi Zhang, Yu Xin, Meiling Xu, Yiming Zhang, Yinwei Li, Yanchao Wang, Changfeng Chen

Abstract: The recent discovery of crystalline pentazolates marks a major advance in polynitrogen science and raises prospects of making the long-touted potent propellant N$_5$$^{+}$N$_5$$^{-}$ salt. However, despite the synthesis of cyclo-N$_5$$^{-}$ anion in pentazolates, counter cation cyclo-N$_5$$^{+}$ remains elusive due to the strong oxidizing power of pentazole ion; moreover, pure N$_5$$^{+}$N$_5$… ▽ More The recent discovery of crystalline pentazolates marks a major advance in polynitrogen science and raises prospects of making the long-touted potent propellant N$_5$$^{+}$N$_5$$^{-}$ salt. However, despite the synthesis of cyclo-N$_5$$^{-}$ anion in pentazolates, counter cation cyclo-N$_5$$^{+}$ remains elusive due to the strong oxidizing power of pentazole ion; moreover, pure N$_5$$^{+}$N$_5$$^{-}$ salt is known to be unstable. Here, we devise a new strategy for making rare cyclo-N$_5$$^{+}$ cation and assembling the long-sought N$_5$$^{+}$N$_5$$^{-}$ salt in tailored ionic compounds, wherein the negative/positive host ions act as oxidizing/reducing agents to form cyclo-N$_5$$^{+}$/N$_5$$^{-}$ species. This strategy is implemented via an advanced computational crystal structure search, which identifies XN$_5$N$_5$F (X = Li, Na, K) compounds that stabilize at high pressures and remain viable at ambient pressure-temperature conditions based on \textit{ab initio} molecular dynamics simulations. This finding opens an avenue for creating and stabilizing N$_5$$^{+}$N$_5$$^{-}$ salt assembly in ionic compounds, where cyclo-N$_5$ species are oxidized/reduced via co-matching with host ions of high/low electronegativity. The present results demonstrate novel polynitrogen chemistry, and these findings offer new insights and prospects in the design and synthesis of diverse chemical species that exhibit unusual charge states, bonding structures, and superior functionality. △ Less

Submitted 10 May, 2024; originally announced May 2024.

Comments: 6 pages, 5 figures

arXiv:2405.00579 [pdf, other]

LEAP: Optimization Hierarchical Federated Learning on Non-IID Data with Coalition Formation Game

Authors: Jianfeng Lu, Yue Chen, Shuqin Cao, Longbiao Chen, Wei Wang, Yun Xin

Abstract: Although Hierarchical Federated Learning (HFL) utilizes edge servers (ESs) to alleviate communication burdens, its model performance will be degraded by non-IID data and limited communication resources. Current works often assume that data is uniformly distributed, which however contradicts the heterogeneity of IoT. Solutions of additional model training to check the data distribution inevitably i… ▽ More Although Hierarchical Federated Learning (HFL) utilizes edge servers (ESs) to alleviate communication burdens, its model performance will be degraded by non-IID data and limited communication resources. Current works often assume that data is uniformly distributed, which however contradicts the heterogeneity of IoT. Solutions of additional model training to check the data distribution inevitably increases computational costs and the risk of privacy leakage. The challenges in solving these issues are how to reduce the impact of non-IID data without involving raw data and how to rationalize the communication resource allocation for addressing straggler problem. To tackle these challenges, we propose a novel optimization method based on coaLition formation gamE and grAdient Projection, called LEAP. Specifically, we combine edge data distribution with coalition formation game innovatively to adjust the correlations between clients and ESs dynamically, which ensures optimal correlations. We further capture the client heterogeneity to achieve the rational bandwidth allocation from coalition perception and determine the optimal transmission power within specified delay constraints at client level. Experimental results on four real datasets show that LEAP is able to achieve 20.62% improvement in model accuracy compared to the state-of-the-art baselines. Moreover, LEAP effectively reduce transmission energy consumption by at least about 2.24 times. △ Less

Submitted 1 May, 2024; originally announced May 2024.

arXiv:2405.00456 [pdf, other]

Counterfactual Explanations for Deep Learning-Based Traffic Forecasting

Authors: Rushan Wang, Yanan Xin, Yatao Zhang, Fernando Perez-Cruz, Martin Raubal

Abstract: Deep learning models are widely used in traffic forecasting and have achieved state-of-the-art prediction accuracy. However, the black-box nature of those models makes the results difficult to interpret by users. This study aims to leverage an Explainable AI approach, counterfactual explanations, to enhance the explainability and usability of deep learning-based traffic forecasting models. Specifi… ▽ More Deep learning models are widely used in traffic forecasting and have achieved state-of-the-art prediction accuracy. However, the black-box nature of those models makes the results difficult to interpret by users. This study aims to leverage an Explainable AI approach, counterfactual explanations, to enhance the explainability and usability of deep learning-based traffic forecasting models. Specifically, the goal is to elucidate relationships between various input contextual features and their corresponding predictions. We present a comprehensive framework that generates counterfactual explanations for traffic forecasting and provides usable insights through the proposed scenario-driven counterfactual explanations. The study first implements a deep learning model to predict traffic speed based on historical traffic data and contextual variables. Counterfactual explanations are then used to illuminate how alterations in these input variables affect predicted outcomes, thereby enhancing the transparency of the deep learning model. We investigated the impact of contextual features on traffic speed prediction under varying spatial and temporal conditions. The scenario-driven counterfactual explanations integrate two types of user-defined constraints, directional and weighting constraints, to tailor the search for counterfactual explanations to specific use cases. These tailored explanations benefit machine learning practitioners who aim to understand the model's learning mechanisms and domain experts who seek insights for real-world applications. The results showcase the effectiveness of counterfactual explanations in revealing traffic patterns learned by deep learning models, showing its potential for interpreting black-box deep learning models used for spatiotemporal predictions in general. △ Less

Submitted 1 May, 2024; originally announced May 2024.

Comments: 24 pages

arXiv:2404.08728 [pdf, other]

Keck Primary Mirror Closed-Loop Segment Control using a Vector-Zernike Wavefront Sensor

Authors: Maissa Salama, Charlotte Guthery, Vincent Chambouleyron, Rebecca Jensen-Clem, J. Kent Wallace, Jacques-Robert Delorme, Mitchell Troy, Tobias Wenger, Daniel Echeverri, Luke Finnerty, Nemanja Jovanovic, Joshua Liberman, Ronald A. Lopez, Dimitri Mawet, Evan C. Morris, Maaike van Kooten, Jason J. Wang, Peter Wizinowich, Yinzi Xin, Jerry Xuan

Abstract: We present the first on-sky segmented primary mirror closed-loop piston control using a Zernike wavefront sensor (ZWFS) installed on the Keck II telescope. Segment co-phasing errors are a primary contributor to contrast limits on Keck and will be necessary to correct for the next generation of space missions and ground-based extremely large telescopes (ELTs), which will all have segmented primary… ▽ More We present the first on-sky segmented primary mirror closed-loop piston control using a Zernike wavefront sensor (ZWFS) installed on the Keck II telescope. Segment co-phasing errors are a primary contributor to contrast limits on Keck and will be necessary to correct for the next generation of space missions and ground-based extremely large telescopes (ELTs), which will all have segmented primary mirrors. The goal of the ZWFS installed on Keck is to monitor and correct primary mirror co-phasing errors in parallel with science observations. The ZWFS is ideal for measuring phase discontinuities such as segment co-phasing errors and is one of the most sensitive WFS, but has limited dynamic range. The vector-ZWFS at Keck works on the adaptive optics (AO) corrected wavefront and consists of a metasurface focal plane mask which imposes two different phase shifts on the core of the point spread function (PSF) to two orthogonal light polarizations, producing two pupil images. This design extends the dynamic range compared with the scalar ZWFS. The primary mirror segment pistons were controlled in closed-loop using the ZWFS, improving the Strehl ratio on the NIRC2 science camera by up to 10 percentage points. We analyze the performance of the closed-loop tests, the impact on NIRC2 science data, and discuss the ZWFS measurements. △ Less

Submitted 12 April, 2024; originally announced April 2024.

Comments: Accepted for publication in the Astrophysical Journal (ApJ). 17 pages, 16 figures

arXiv:2404.04801 [pdf, ps, other]

doi 10.1007/s41605-024-00467-8

LHAASO-KM2A detector simulation using Geant4

Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (254 additional authors not shown)

Abstract: KM2A is one of the main sub-arrays of LHAASO, working on gamma ray astronomy and cosmic ray physics at energies above 10 TeV. Detector simulation is the important foundation for estimating detector performance and data analysis. It is a big challenge to simulate the KM2A detector in the framework of Geant4 due to the need to track numerous photons from a large number of detector units (>6000) with… ▽ More KM2A is one of the main sub-arrays of LHAASO, working on gamma ray astronomy and cosmic ray physics at energies above 10 TeV. Detector simulation is the important foundation for estimating detector performance and data analysis. It is a big challenge to simulate the KM2A detector in the framework of Geant4 due to the need to track numerous photons from a large number of detector units (>6000) with large altitude difference (30 m) and huge coverage (1.3 km^2). In this paper, the design of the KM2A simulation code G4KM2A based on Geant4 is introduced. The process of G4KM2A is optimized mainly in memory consumption to avoid memory overffow. Some simpliffcations are used to signiffcantly speed up the execution of G4KM2A. The running time is reduced by at least 30 times compared to full detector simulation. The particle distributions and the core/angle resolution comparison between simulation and experimental data of the full KM2A array are also presented, which show good agreement. △ Less

Submitted 7 April, 2024; originally announced April 2024.

arXiv:2404.01426 [pdf, other]

Laboratory demonstration of a Photonic Lantern Nuller in monochromatic and broadband light

Authors: Yinzi Xin, Daniel Echeverri, Nemanja Jovanovic, Dimitri Mawet, Sergio Leon-Saval, Rodrigo Amezcua-Correa, Stephanos Yerolatsitis, Michael P. Fitzgerald, Pradip Gatkine, Yoo Jung Kim, Jonathan Lin, Barnaby Norris, Garreth Ruane, Steph Sallum

Abstract: Photonic lantern nulling (PLN) is a method for enabling the detection and characterization of close-in exoplanets by exploiting the symmetries of the ports of a mode-selective photonic lantern (MSPL) to cancel out starlight. A six-port MSPL provides four ports where on-axis starlight is suppressed, while off-axis planet light is coupled with efficiencies that vary as a function of the planet's spa… ▽ More Photonic lantern nulling (PLN) is a method for enabling the detection and characterization of close-in exoplanets by exploiting the symmetries of the ports of a mode-selective photonic lantern (MSPL) to cancel out starlight. A six-port MSPL provides four ports where on-axis starlight is suppressed, while off-axis planet light is coupled with efficiencies that vary as a function of the planet's spatial position. We characterize the properties of a six-port MSPL in the laboratory and perform the first testbed demonstration of the PLN in monochromatic light (1569 nm) and in broadband light (1450 nm to 1625 nm), each using two orthogonal polarizations. We compare the measured spatial throughput maps with those predicted by simulations using the lantern's modes. We find that the morphologies of the measured throughput maps are reproduced by the simulations, though the real lantern is lossy and has lower throughputs overall. The measured ratios of on-axis stellar leakage to peak off-axis throughput are around 10^(-2), likely limited by testbed wavefront errors. These null-depths are already sufficient for observing young gas giants at the diffraction limit using ground-based observatories. Future work includes using wavefront control to further improve the nulls, as well as testing and validating the PLN on-sky. △ Less

Submitted 1 April, 2024; originally announced April 2024.

Comments: 30 pages, 12 figures

arXiv:2403.17295 [pdf, other]

Vortex Fiber Nulling for Exoplanet Observations: First Direct Detection of M Dwarf Companions around HIP 21543, HIP 94666, and HIP 50319

Authors: Daniel Echeverri, Jerry W. Xuan, John D. Monnier, Jacques-Robert Delorme, Jason J. Wang, Nemanja Jovanovic, Katelyn Horstman, Garreth Ruane, Bertrand Mennesson, Eugene Serabyn, Dimitri Mawet, J. Kent Wallace, Sofia Hillman, Ashley Baker, Randall Bartos, Benjamin Calvin, Sylvain Cetre, Greg Doppmann, Luke Finnerty, Michael P. Fitzgerald, Chih-Chun Hsu, Joshua Liberman, Ronald Lopez, Maxwell Millar-Blanchaer, Evan Morris , et al. (13 additional authors not shown)

Abstract: Vortex fiber nulling (VFN) is a technique for detecting and characterizing faint companions at small separations from their host star. A near-infrared ($\sim2.3 μ$m) VFN demonstrator mode was deployed on the Keck Planet Imager and Characterizer (KPIC) instrument at the Keck Observatory and presented earlier. In this paper, we present the first VFN companion detections. Three targets, HIP 21543 Ab,… ▽ More Vortex fiber nulling (VFN) is a technique for detecting and characterizing faint companions at small separations from their host star. A near-infrared ($\sim2.3 μ$m) VFN demonstrator mode was deployed on the Keck Planet Imager and Characterizer (KPIC) instrument at the Keck Observatory and presented earlier. In this paper, we present the first VFN companion detections. Three targets, HIP 21543 Ab, HIP 94666 Ab, and HIP 50319 B, were detected with host-companion flux ratios between 70 and 430 at and within one diffraction beamwidth ($λ/D$). We complement the spectra from KPIC VFN with flux ratio and position measurements from the CHARA Array to validate the VFN results and provide a more complete characterization of the targets. This paper reports the first direct detection of these three M dwarf companions, yielding their first spectra and flux ratios. Our observations provide measurements of bulk properties such as effective temperatures, radial velocities, and v$\sin{i}$, and verify the accuracy of the published orbits. These detections corroborate earlier predictions of the KPIC VFN performance, demonstrating that the instrument mode is ready for science observations. △ Less

Submitted 25 March, 2024; originally announced March 2024.

Comments: 13 pages, 2 figures; Accepted to ApJ Letters

arXiv:2403.10010 [pdf, other]

doi 10.1103/PhysRevLett.132.131002

Measurements of All-Particle Energy Spectrum and Mean Logarithmic Mass of Cosmic Rays from 0.3 to 30 PeV with LHAASO-KM2A

Authors: The LHAASO Collaboration, Zhen Cao, F. Aharonian, Q. An, A. Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen , et al. (256 additional authors not shown)

Abstract: We present the measurements of all-particle energy spectrum and mean logarithmic mass of cosmic rays in the energy range of 0.3-30 PeV using data collected from LHAASO-KM2A between September 2021 and December 2022, which is based on a nearly composition-independent energy reconstruction method, achieving unprecedented accuracy. Our analysis reveals the position of the knee at… ▽ More We present the measurements of all-particle energy spectrum and mean logarithmic mass of cosmic rays in the energy range of 0.3-30 PeV using data collected from LHAASO-KM2A between September 2021 and December 2022, which is based on a nearly composition-independent energy reconstruction method, achieving unprecedented accuracy. Our analysis reveals the position of the knee at $3.67 \pm 0.05 \pm 0.15$ PeV. Below the knee, the spectral index is found to be -$2.7413 \pm 0.0004 \pm 0.0050$, while above the knee, it is -$3.128 \pm 0.005 \pm 0.027$, with the sharpness of the transition measured with a statistical error of 2%. The mean logarithmic mass of cosmic rays is almost heavier than helium in the whole measured energy range. It decreases from 1.7 at 0.3 PeV to 1.3 at 3 PeV, representing a 24% decline following a power law with an index of -$0.1200 \pm 0.0003 \pm 0.0341$. This is equivalent to an increase in abundance of light components. Above the knee, the mean logarithmic mass exhibits a power law trend towards heavier components, which is reversal to the behavior observed in the all-particle energy spectrum. Additionally, the knee position and the change in power-law index are approximately the same. These findings suggest that the knee observed in the all-particle spectrum corresponds to the knee of the light component, rather than the medium-heavy components. △ Less

Submitted 26 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

Comments: 8 pages, 3 figures

Journal ref: Physical Review Letters 132, 131002 (2024)

arXiv:2403.08133 [pdf, other]

Physics-Inspired Deep Learning Anti-Aliasing Framework in Efficient Channel State Feedback

Authors: Yu-Chien Lin, Yan Xin, Ta-Sung Lee, Charlie, Zhang, Zhi Ding

Abstract: Acquiring downlink channel state information (CSI) at the base station is vital for optimizing performance in massive Multiple input multiple output (MIMO) Frequency-Division Duplexing (FDD) systems. While deep learning architectures have been successful in facilitating UE-side CSI feedback and gNB-side recovery, the undersampling issue prior to CSI feedback is often overlooked. This issue, which… ▽ More Acquiring downlink channel state information (CSI) at the base station is vital for optimizing performance in massive Multiple input multiple output (MIMO) Frequency-Division Duplexing (FDD) systems. While deep learning architectures have been successful in facilitating UE-side CSI feedback and gNB-side recovery, the undersampling issue prior to CSI feedback is often overlooked. This issue, which arises from low density pilot placement in current standards, results in significant aliasing effects in outdoor channels and consequently limits CSI recovery performance. To this end, this work introduces a new CSI upsampling framework at the gNB as a post-processing solution to address the gaps caused by undersampling. Leveraging the physical principles of discrete Fourier transform shifting theorem and multipath reciprocity, our framework effectively uses uplink CSI to mitigate aliasing effects. We further develop a learning-based method that integrates the proposed algorithm with the Iterative Shrinkage-Thresholding Algorithm Net (ISTA-Net) architecture, enhancing our approach for non-uniform sampling recovery. Our numerical results show that both our rule-based and deep learning methods significantly outperform traditional interpolation techniques and current state-of-the-art approaches in terms of performance. △ Less

Submitted 12 March, 2024; originally announced March 2024.

arXiv:2402.11880 [pdf, other]

Revision of the GeV $γ$-ray Emission in the Region of HESS J1813-178 with Fermi-LAT

Authors: Xiaolei Guo, Yuliang Xin

Abstract: HESS J1813-178 is one of the brightest and most compact TeV $γ$-ray sources, and whether its $γ$-ray emission is associated with supernova remnant (SNR), pulsar wind nebula (PWN) or young stellar cluster (YSC) is still under debate. By analysing the GeV $γ$-ray data in the field of HESS J1813-178 using 14 years of PASS 8 data recorded by the Fermi Large Area Telescope (Fermi-LAT), we report the di… ▽ More HESS J1813-178 is one of the brightest and most compact TeV $γ$-ray sources, and whether its $γ$-ray emission is associated with supernova remnant (SNR), pulsar wind nebula (PWN) or young stellar cluster (YSC) is still under debate. By analysing the GeV $γ$-ray data in the field of HESS J1813-178 using 14 years of PASS 8 data recorded by the Fermi Large Area Telescope (Fermi-LAT), we report the discovery of three different sources with different spectra in this region. The hard source with a power law spectral index of 2.11 $\pm$ 0.08 has a small size extension, which is spatially and spectrally coincident with the TeV $γ$-ray emission from HESS J1813-178. CO observations display the dense molecular clouds surrounding HESS J1813-178 in the velocity range of 45-60 km s$^{\rm -1}$. The possible origins of the $γ$-ray emission from HESS J1813-178 are discussed, including SNR G12.82-0.02, the PWN driven by the energetic X-ray pulsar PSR J1813-1749, and YSC Cl 1813-178. However, none of them can be ruled out clearly. Note that the maximum energy of protons in the hadronic model should exceed a few hundred TeV, which makes HESS J1813-178 to be a promising PeVatron. The detailed LHAASO data analysis about the morphology and spectrum would be helpful to investigate the origin of the $γ$-ray emission in this region and test its PeVatron nature. △ Less

Submitted 19 February, 2024; originally announced February 2024.

Comments: 15 pages, 7 figures, 4 tables, accepted for publication in ApJ

arXiv:2402.08158 [pdf, other]

Coherent Imaging with Photonic Lanterns

Authors: Yoo Jung Kim, Michael P. Fitzgerald, Jonathan Lin, Steph Sallum, Yinzi Xin, Nemanja Jovanovic, Sergio Leon-Saval

Abstract: Photonic Lanterns (PLs) are tapered waveguides that gradually transition from a multi-mode fiber geometry to a bundle of single-mode fibers (SMFs). They can efficiently couple multi-mode telescope light into a multi-mode fiber entrance at the focal plane and convert it into multiple single-mode beams. Thus, each SMF samples its unique mode (lantern principal mode) of the telescope light in the pup… ▽ More Photonic Lanterns (PLs) are tapered waveguides that gradually transition from a multi-mode fiber geometry to a bundle of single-mode fibers (SMFs). They can efficiently couple multi-mode telescope light into a multi-mode fiber entrance at the focal plane and convert it into multiple single-mode beams. Thus, each SMF samples its unique mode (lantern principal mode) of the telescope light in the pupil, analogous to subapertures in aperture masking interferometry (AMI). Coherent imaging with PLs can be enabled by interfering SMF outputs and applying phase modulation, which can be achieved using a photonic chip beam combiner at the backend (e.g., the ABCD beam combiner). In this study, we investigate the potential of coherent imaging by interfering SMF outputs of a PL with a single telescope. We demonstrate that the visibilities that can be measured from a PL are mutual intensities incident on the pupil weighted by the cross-correlation of a pair of lantern modes. From numerically simulated lantern principal modes of a 6-port PL, we find that interferometric observables using a PL behave similarly to separated-aperture visibilities for simple models on small angular scales ($<λ/D$) but with greater sensitivity to symmetries and capability to break phase angle degeneracies. Furthermore, we present simulated observations with wavefront errors and compare them to AMI. Despite the redundancy caused by extended lantern principal modes, spatial filtering offers stability to wavefront errors. Our simulated observations suggest that PLs may offer significant benefits in the photon noise-limited regime and in resolving small angular scales at low contrast regime. △ Less

Submitted 12 February, 2024; originally announced February 2024.

Comments: Accepted for publication in ApJ

arXiv:2402.07485 [pdf, other]

MINT: Boosting Audio-Language Model via Multi-Target Pre-Training and Instruction Tuning

Authors: Hang Zhao, Yifei Xin, Zhesong Yu, Bilei Zhu, Lu Lu, Zejun Ma

Abstract: In the realm of audio-language pre-training (ALP), the challenge of achieving cross-modal alignment is significant. Moreover, the integration of audio inputs with diverse distributions and task variations poses challenges in developing generic audio-language models. In this study, we present MINT, a novel ALP framework boosting audio-language models through multi-target pre-training and instructio… ▽ More In the realm of audio-language pre-training (ALP), the challenge of achieving cross-modal alignment is significant. Moreover, the integration of audio inputs with diverse distributions and task variations poses challenges in developing generic audio-language models. In this study, we present MINT, a novel ALP framework boosting audio-language models through multi-target pre-training and instruction tuning. MINT leverages the strength of frozen pre-trained audio encoders and large language models (LLM) to improve audio-language pre-training, enabling effective transferablility to both audio-text understanding and generation tasks. To address the modality gap, we introduce Bridge-Net, a trainable module that enhances cross-modality alignment and the model's ability to follow instructions for a variety of audio-text tasks. Bridge-Net is pivotal within MINT, initially enhancing audio-language representation learning through a multi-target pre-training approach. Subsequently, Bridge-Net further boosts audio-to-language generative learning by integrating a frozen language model with instruction tuning. This integration empowers MINT to extract features in a flexible and effective manner, specifically tailored to the provided instructions for diverse tasks. Experimental results demonstrate that MINT attains superior performance across various audio-language understanding and generation tasks, highlighting its robust generalization capabilities even in zero-shot scenarios. △ Less

Submitted 11 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

arXiv:2402.02242 [pdf, other]

Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey

Authors: Yi Xin, Siqi Luo, Haodi Zhou, Junlong Du, Xiaohong Liu, Yue Fan, Qing Li, Yuntao Du

Abstract: Large-scale pre-trained vision models (PVMs) have shown great potential for adaptability across various downstream vision tasks. However, with state-of-the-art PVMs growing to billions or even trillions of parameters, the standard full fine-tuning paradigm is becoming unsustainable due to high computational and storage demands. In response, researchers are exploring parameter-efficient fine-tuning… ▽ More Large-scale pre-trained vision models (PVMs) have shown great potential for adaptability across various downstream vision tasks. However, with state-of-the-art PVMs growing to billions or even trillions of parameters, the standard full fine-tuning paradigm is becoming unsustainable due to high computational and storage demands. In response, researchers are exploring parameter-efficient fine-tuning (PEFT), which seeks to exceed the performance of full fine-tuning with minimal parameter modifications. This survey provides a comprehensive overview and future directions for visual PEFT, offering a systematic review of the latest advancements. First, we provide a formal definition of PEFT and discuss model pre-training methods. We then categorize existing methods into three categories: addition-based, partial-based, and unified-based. Finally, we introduce the commonly used datasets and applications and suggest potential future research challenges. A comprehensive collection of resources is available at https://github.com/synbol/Awesome-Parameter-Efficient-Transfer-Learning. △ Less

Submitted 8 February, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

Comments: 9 pages, 3 figures, 2 tables

arXiv:2401.15953 [pdf, other]

Masked Audio Modeling with CLAP and Multi-Objective Learning

Authors: Yifei Xin, Xiulian Peng, Yan Lu

Abstract: Most existing masked audio modeling (MAM) methods learn audio representations by masking and reconstructing local spectrogram patches. However, the reconstruction loss mainly accounts for the signal-level quality of the reconstructed spectrogram and is still limited in extracting high-level audio semantics. In this paper, we propose to enhance the semantic modeling of MAM by distilling cross-modal… ▽ More Most existing masked audio modeling (MAM) methods learn audio representations by masking and reconstructing local spectrogram patches. However, the reconstruction loss mainly accounts for the signal-level quality of the reconstructed spectrogram and is still limited in extracting high-level audio semantics. In this paper, we propose to enhance the semantic modeling of MAM by distilling cross-modality knowledge from contrastive language-audio pretraining (CLAP) representations for both masked and unmasked regions (MAM-CLAP) and leveraging a multi-objective learning strategy with a supervised classification branch (SupMAM), thereby providing more semantic knowledge for MAM and enabling it to effectively learn global features from labels. Experiments show that our methods significantly improve the performance on multiple downstream tasks. Furthermore, by combining our MAM-CLAP with SupMAM, we can achieve new state-of-the-art results on various audio and speech classification tasks, exceeding previous self-supervised learning and supervised pretraining methods. △ Less

Submitted 29 January, 2024; originally announced January 2024.

Comments: Accepted by Interspeech2023

arXiv:2401.08125 [pdf, other]

Anomalous Proximitized Transport in Metal/Quantum Magnet Heterostructure $\rm{Bi_{2}Ir_{2}O_{7}/Yb_{2}Ti_{2}O_{7}}$

Authors: Chengkun Xing, Shu Zhang, Weiliang Yao, Dapeng Cui, Qing Huang, Junyi Yang, Shashi Pandey, Dongliang Gong, Lukas Horák, Yan Xin, Eun Sang Choi, Yang Zhang, Haidong Zhou, Jian Liu

Abstract: Fluctuations of quantum spins play a crucial role in the emergence of exotic magnetic phases and excitations. The lack of the charge degree of freedom in insulating quantum magnets, however, precludes such fluctuations from mediating electronic transport. Here we show that the quantum fluctuations of a localized frustrated magnet induce strong proximitized charge transport of the conduction electr… ▽ More Fluctuations of quantum spins play a crucial role in the emergence of exotic magnetic phases and excitations. The lack of the charge degree of freedom in insulating quantum magnets, however, precludes such fluctuations from mediating electronic transport. Here we show that the quantum fluctuations of a localized frustrated magnet induce strong proximitized charge transport of the conduction electrons in a synthetic heterostructure comprising an epitaxial $\rm{Bi_{2}Ir_{2}O_{7}}$ ultrathin film on the single crystal of $\rm{Yb_{2}Ti_{2}O_{7}}$. The proximity effects are evidenced by the scaling behavior of the $\rm{Bi_{2}Ir_{2}O_{7}}$ resistance in correspondance with the dynamic scaling of the dynamic spin correlation function of $\rm{Yb_{2}Ti_{2}O_{7}}$, which is a result of quantum fluctuations near a multi-phase quantum critical point. The proximitized transport in $\rm{Bi_{2}Ir_{2}O_{7}}$ can be effectively tuned by magnetic field through suppressing the quantum spin fluctuations as well as inducing transitions via magnetic anisotropy in $\rm{Yb_{2}Ti_{2}O_{7}}$. Our work establishes a new pathway for harnessing quantum spin fluctuations in magnetic insulators with electric transport, offering exciting prospects for potential applications in the realm of quantum spintronics. △ Less

Submitted 16 January, 2024; originally announced January 2024.

Comments: 8 pages, 5 figures

arXiv:2401.04332 [pdf, other]

Flexible filtrations for multiparameter persistent homology detect digital images

Authors: Jiaxing He, Bingzhe Hou, Tieru Wu, Yue Xin

Abstract: Two important problems in the field of Topological Data Analysis are defining practical multifiltrations on objects and showing ability of TDA to detect the geometry. Motivated by the problems, we constuct three multifiltrations named multi-GENEO, multi-DGENEO and mix-GENEO, and prove the stability of both the interleaving distance and multiparameter persistence landscape of multi-GENEO with respe… ▽ More Two important problems in the field of Topological Data Analysis are defining practical multifiltrations on objects and showing ability of TDA to detect the geometry. Motivated by the problems, we constuct three multifiltrations named multi-GENEO, multi-DGENEO and mix-GENEO, and prove the stability of both the interleaving distance and multiparameter persistence landscape of multi-GENEO with respect to the pseudometric of the subspace of bounded functions. We also give the estimations of upper bound for multi-DGENEO and mix-GENEO. Finally, we provide experiment results on MNIST dataset to demonstrate our bifiltrations have ability to detect geometric and topological differences of digital images. △ Less

Submitted 1 April, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

arXiv:2401.04269 [pdf, other]

Coronagraphic Data Post-processing Using Projections on Instrumental Modes

Authors: Yinzi Xin, Laurent Pueyo, Romain Laugier, Leonid Pogorelyuk, Ewan S. Douglas, Benjamin J. S. Pope, Kerri L. Cahoy

Abstract: Directly observing exoplanets with coronagraphs is impeded by the presence of speckles from aberrations in the optical path, which can be mitigated in hardware with wavefront control as well as in post-processing. This work explores using an instrument model in post-processing to separate astrophysical signals from residual aberrations in coronagraphic data. The effect of wavefront error (WFE) on… ▽ More Directly observing exoplanets with coronagraphs is impeded by the presence of speckles from aberrations in the optical path, which can be mitigated in hardware with wavefront control as well as in post-processing. This work explores using an instrument model in post-processing to separate astrophysical signals from residual aberrations in coronagraphic data. The effect of wavefront error (WFE) on the coronagraphic intensity consists of a linear contribution and a quadratic contribution. When either of the terms is much larger than the other, the instrument response can be approximated by a transfer matrix mapping WFE to detector plane intensity. From this transfer matrix, a useful projection onto instrumental modes that removes the dominant error modes can be derived. We apply this projection to synthetically generated Roman Space Telescope hybrid Lyot coronagraph (HLC) data to extract "robust observables," which can be used instead of raw data for applications such as detection testing. The projection improves planet flux ratio detection limits by about 28% in the linear regime and by over a factor of 2 in the quadratic regime, illustrating that robust observables can increase sensitivity to astrophysical signals and improve the scientific yield from coronagraphic data. While this approach does not require additional information such as observations of reference stars or modulations of a deformable mirror, it can and should be combined with these other techniques, acting as a model-informed prior in an overall post-processing strategy. △ Less

Submitted 8 January, 2024; originally announced January 2024.

Comments: 19 pages, 10 figures

arXiv:2401.03116 [pdf, other]

Advancing DDoS Attack Detection: A Synergistic Approach Using Deep Residual Neural Networks and Synthetic Oversampling

Authors: Ali Alfatemi, Mohamed Rahouti, Ruhul Amin, Sarah ALJamal, Kaiqi Xiong, Yufeng Xin

Abstract: Distributed Denial of Service (DDoS) attacks pose a significant threat to the stability and reliability of online systems. Effective and early detection of such attacks is pivotal for safeguarding the integrity of networks. In this work, we introduce an enhanced approach for DDoS attack detection by leveraging the capabilities of Deep Residual Neural Networks (ResNets) coupled with synthetic overs… ▽ More Distributed Denial of Service (DDoS) attacks pose a significant threat to the stability and reliability of online systems. Effective and early detection of such attacks is pivotal for safeguarding the integrity of networks. In this work, we introduce an enhanced approach for DDoS attack detection by leveraging the capabilities of Deep Residual Neural Networks (ResNets) coupled with synthetic oversampling techniques. Because of the inherent class imbalance in many cyber-security datasets, conventional methods often struggle with false negatives, misclassifying subtle DDoS patterns as benign. By applying the Synthetic Minority Over-sampling Technique (SMOTE) to the CICIDS dataset, we balance the representation of benign and malicious data points, enabling the model to better discern intricate patterns indicative of an attack. Our deep residual network, tailored for this specific task, further refines the detection process. Experimental results on a real-world dataset demonstrate that our approach achieves an accuracy of 99.98%, significantly outperforming traditional methods. This work underscores the potential of combining advanced data augmentation techniques with deep learning models to bolster cyber-security defenses. △ Less

Submitted 5 January, 2024; originally announced January 2024.

Comments: 8 pages, 3 figures

arXiv:2312.13381 [pdf, other]

Real-time experimental demonstrations of a photonic lantern wavefront sensor

Authors: Jonathan W. Lin, Michael P. Fitzgerald, Yinzi Xin, Yoo Jung Kim, Olivier Guyon, Barnaby Norris, Christopher Betters, Sergio Leon-Saval, Kyohoon Ahn, Vincent Deo, Julien Lozi, Sébastien Vievard, Daniel Levinstein, Steph Sallum, Nemanja Jovanovic

Abstract: The direct imaging of an Earth-like exoplanet will require sub-nanometric wavefront control across large light-collecting apertures, to reject host starlight and detect the faint planetary signal. Current adaptive optics (AO) systems, which use wavefront sensors that reimage the telescope pupil, face two challenges that prevent this level of control: non-common-path aberrations (NCPAs), caused by… ▽ More The direct imaging of an Earth-like exoplanet will require sub-nanometric wavefront control across large light-collecting apertures, to reject host starlight and detect the faint planetary signal. Current adaptive optics (AO) systems, which use wavefront sensors that reimage the telescope pupil, face two challenges that prevent this level of control: non-common-path aberrations (NCPAs), caused by differences between the sensing and science arms of the instrument; and petaling modes: discontinuous phase aberrations caused by pupil fragmentation, especially relevant for the upcoming 30-m class telescopes. Such aberrations drastically impact the capabilities of high-contrast instruments. To address these issues, we can add a second-stage wavefront sensor to the science focal plane. One promising architecture uses the photonic lantern (PL): a waveguide that efficiently couples aberrated light into single-mode fibers (SMFs). In turn, SMF-confined light can be stably injected into high-resolution spectrographs, enabling direct exoplanet characterization and precision radial velocity measurements; simultaneously, the PL can be used for focal-plane wavefront sensing. We present a real-time experimental demonstration of the PL wavefront sensor on the Subaru/SCExAO testbed. Our system is stable out to around ~400 nm of low-order Zernike wavefront error, and can correct petaling modes. When injecting ~30 nm RMS of low order time-varying error, we achieve ~10x rejection at 1 s timescales; further refinements to the control law and lantern fabrication process should make sub-nanometric wavefront control possible. In the future, novel sensors like the PLWFS may prove to be critical in resolving the wavefront control challenges posed by exoplanet direct imaging. △ Less

Submitted 20 December, 2023; originally announced December 2023.

Comments: Accepted to ApJL

arXiv:2312.11189 [pdf, other]

Energy-Dependent Analyses of the Gamma-Ray Emission from HESS J1857+026 with Fermi-LAT

Authors: Xiaolei Guo, Xi Liu, Yuliang Xin

Abstract: We report the discovery of energy-dependent morphology for the GeV gamma-ray emission from HESS J1857+026 with more than 13 years of {\it Fermi} Large Area Telescope (LAT) data. The GeV gamma-ray emission from this region is composed of two extended components. The hard component with an index of $1.74 \pm 0.07$ in the energy range of 0.5-500 GeV is spatially coincident with HESS J1857+026, and it… ▽ More We report the discovery of energy-dependent morphology for the GeV gamma-ray emission from HESS J1857+026 with more than 13 years of {\it Fermi} Large Area Telescope (LAT) data. The GeV gamma-ray emission from this region is composed of two extended components. The hard component with an index of $1.74 \pm 0.07$ in the energy range of 0.5-500 GeV is spatially coincident with HESS J1857+026, and its 68\% containment radius varies from $\sim 0.44^\circ$ below 40 GeV to $\sim 0.30^\circ$ above 140 GeV. The hard GeV gamma-ray spectrum and the energy-dependent morphology of HESS J1857+026 make it favor a PWN origin, which is associated with the energetic pulsar, PSR J1856+0245. The soft component with an index of $2.70 \pm 0.16$ and another extended gamma-ray source detected in this region, 4FGL J1857.9+0313e with an index of $2.55 \pm 0.07$, are spatially coincidence with two molecular clumps in the northeast and southwest of HESS J1857+026, which favors the hadronic process, and the protons could be accelerated by the hypothetical SNR associated with PSR J1856+0245. △ Less

Submitted 18 December, 2023; originally announced December 2023.

Comments: 10 pages, 5 figures, 2 tables, accepted by ApJ

arXiv:2312.08733 [pdf, other]

VMT-Adapter: Parameter-Efficient Transfer Learning for Multi-Task Dense Scene Understanding

Authors: Yi Xin, Junlong Du, Qiang Wang, Zhiwen Lin, Ke Yan

Abstract: Large-scale pre-trained models have achieved remarkable success in various computer vision tasks. A standard approach to leverage these models is to fine-tune all model parameters for downstream tasks, which poses challenges in terms of computational and storage costs. Recently, inspired by Natural Language Processing (NLP), parameter-efficient transfer learning has been successfully applied to vi… ▽ More Large-scale pre-trained models have achieved remarkable success in various computer vision tasks. A standard approach to leverage these models is to fine-tune all model parameters for downstream tasks, which poses challenges in terms of computational and storage costs. Recently, inspired by Natural Language Processing (NLP), parameter-efficient transfer learning has been successfully applied to vision tasks. However, most existing techniques primarily focus on single-task adaptation, and despite limited research on multi-task adaptation, these methods often exhibit suboptimal training and inference efficiency. In this paper, we first propose an once-for-all Vision Multi-Task Adapter (VMT-Adapter), which strikes approximately O(1) training and inference efficiency w.r.t task number. Concretely, VMT-Adapter shares the knowledge from multiple tasks to enhance cross-task interaction while preserves task-specific knowledge via independent knowledge extraction modules. Notably, since task-specific modules require few parameters, VMT-Adapter can handle an arbitrary number of tasks with a negligible increase of trainable parameters. We also propose VMT-Adapter-Lite, which further reduces the trainable parameters by learning shared parameters between down- and up-projections. Extensive experiments on four dense scene understanding tasks demonstrate the superiority of VMT-Adapter(-Lite), achieving a 3.96%(1.34%) relative improvement compared to single-task full fine-tuning, while utilizing merely ~1% (0.36%) trainable parameters of the pre-trained model. △ Less

Submitted 15 December, 2023; v1 submitted 14 December, 2023; originally announced December 2023.

Comments: Accepted to AAAI2024

arXiv:2312.08636 [pdf, other]

MmAP : Multi-modal Alignment Prompt for Cross-domain Multi-task Learning

Authors: Yi Xin, Junlong Du, Qiang Wang, Ke Yan, Shouhong Ding

Abstract: Multi-Task Learning (MTL) is designed to train multiple correlated tasks simultaneously, thereby enhancing the performance of individual tasks. Typically, a multi-task network structure consists of a shared backbone and task-specific decoders. However, the complexity of the decoders increases with the number of tasks. To tackle this challenge, we integrate the decoder-free vision-language model CL… ▽ More Multi-Task Learning (MTL) is designed to train multiple correlated tasks simultaneously, thereby enhancing the performance of individual tasks. Typically, a multi-task network structure consists of a shared backbone and task-specific decoders. However, the complexity of the decoders increases with the number of tasks. To tackle this challenge, we integrate the decoder-free vision-language model CLIP, which exhibits robust zero-shot generalization capability. Recently, parameter-efficient transfer learning methods have been extensively explored with CLIP for adapting to downstream tasks, where prompt tuning showcases strong potential. Nevertheless, these methods solely fine-tune a single modality (text or visual), disrupting the modality structure of CLIP. In this paper, we first propose Multi-modal Alignment Prompt (MmAP) for CLIP, which aligns text and visual modalities during fine-tuning process. Building upon MmAP, we develop an innovative multi-task prompt learning framework. On the one hand, to maximize the complementarity of tasks with high similarity, we utilize a gradient-driven task grouping method that partitions tasks into several disjoint groups and assign a group-shared MmAP to each group. On the other hand, to preserve the unique characteristics of each task, we assign an task-specific MmAP to each task. Comprehensive experiments on two large multi-task learning datasets demonstrate that our method achieves significant performance improvements compared to full fine-tuning while only utilizing approximately 0.09% of trainable parameters. △ Less

Submitted 13 December, 2023; originally announced December 2023.

Comments: Accepted by AAAI2024

arXiv:2312.02297 [pdf, other]

Validation of elemental and isotopic abundances in late-M spectral types with the benchmark HIP 55507 AB system

Authors: Jerry W. Xuan, Jason J. Wang, Luke Finnerty, Katelyn Horstman, Simon Grimm, Anne Peck, Eric L. Nielsen, Heather A. Knutson, Dimitri Mawet, Howard Isaacson, Andrew W. Howard, Michael C. Liu, Sam Walker, Mark Phillips, Geoffrey Blake, Jean-Baptiste Ruffio, Yapeng Zhang, Julie Inglis, Nicole L. Wallack, Aniket Sanghi, Erica Gonzales, Fei Dai, Ashley Baker, Randall Bartos, Charlotte Bond , et al. (26 additional authors not shown)

Abstract: M dwarfs are common host stars to exoplanets but often lack atmospheric abundance measurements. Late-M dwarfs are also good analogs to the youngest substellar companions, which share similar $T_{\rm eff}\sim2300-2800~K$. We present atmospheric analyses for the M7.5 companion HIP 55507 B and its K6V primary star with Keck/KPIC high-resolution ($R\sim35,000$) $K$ band spectroscopy. First, by includi… ▽ More M dwarfs are common host stars to exoplanets but often lack atmospheric abundance measurements. Late-M dwarfs are also good analogs to the youngest substellar companions, which share similar $T_{\rm eff}\sim2300-2800~K$. We present atmospheric analyses for the M7.5 companion HIP 55507 B and its K6V primary star with Keck/KPIC high-resolution ($R\sim35,000$) $K$ band spectroscopy. First, by including KPIC relative radial velocities between the primary and secondary in the orbit fit, we improve the dynamical mass precision by 60% and find $M_B=88.0_{-3.2}^{+3.4}$ $M_{\rm Jup}$, putting HIP 55507 B above the stellar-substellar boundary. We also find that HIP 55507 B orbits its K6V primary star with $a=38^{+4}_{-3}$ AU and $e=0.40\pm0.04$. From atmospheric retrievals of HIP 55507 B, we measure $\rm [C/H]=0.24\pm0.13$, $\rm [O/H]=0.15\pm0.13$, and $\rm C/O=0.67\pm0.04$. Moreover, we strongly detect $\rm ^{13}CO$ ($7.8σ$ significance) and tentatively detect $\rm H_2^{18}O$ ($3.7σ$ significance) in companion's atmosphere, and measure $\rm ^{12}CO/^{13}CO=98_{-22}^{+28}$ and $\rm H_2^{16}O/H_2^{18}O=240_{-80}^{+145}$ after accounting for systematic errors. From a simplified retrieval analysis of HIP 55507 A, we measure $\rm ^{12}CO/^{13}CO=79_{-16}^{+21}$ and $\rm C^{16}O/C^{18}O=288_{-70}^{+125}$ for the primary star. These results demonstrate that HIP 55507 A and B have consistent $\rm ^{12} C/^{13}C$ and $\rm ^{16}O/^{18}O$ to the $<1σ$ level, as expected for a chemically homogeneous binary system. Given the similar flux ratios and separations between HIP 55507 AB and systems with young, substellar companions, our results open the door to systematically measuring $\rm ^{13}CO$ and $\rm H_2^{18}O$ abundances in the atmospheres of substellar or even planetary-mass companions with similar spectral types. △ Less

Submitted 4 December, 2023; originally announced December 2023.

Comments: Accepted to ApJ, 28 pages, 14 figures

arXiv:2312.00221 [pdf, other]

Spectroastrometry and Imaging Science with Photonic Lanterns on Extremely Large Telescopes

Authors: Yoo Jung Kim, Michael P. Fitzgerald, Jonathan Lin, Steph Sallum, Yinzi Xin, Nemanja Jovanovic, Sergio Leon-Saval, Christopher Betters, Pradip Gatkine, Olivier Guyon, Julien Lozi, Dimitri Mawet, Barnaby Norris, Sébastien Vievard

Abstract: Photonic lanterns (PLs) are tapered waveguides that gradually transition from a multi-mode fiber geometry to a bundle of single-mode fibers. In astronomical applications, PLs can efficiently couple multi-mode telescope light into a multi-mode fiber entrance and convert it into multiple single-mode beams. The output beams are highly stable and suitable for feeding into high-resolution spectrographs… ▽ More Photonic lanterns (PLs) are tapered waveguides that gradually transition from a multi-mode fiber geometry to a bundle of single-mode fibers. In astronomical applications, PLs can efficiently couple multi-mode telescope light into a multi-mode fiber entrance and convert it into multiple single-mode beams. The output beams are highly stable and suitable for feeding into high-resolution spectrographs or photonic chip beam combiners. For instance, by using relative intensities in the output cores as a function of wavelength, PLs can enable spectroastrometry. In addition, by interfering beams in the output cores with a beam combiner in the backend, PLs can be used for high-throughput interferometric imaging. When used on an Extremely Large Telescope (ELT), with its increased sensitivity and angular resolution, the imaging and spectroastrometric capabilities of PLs will be extended to higher contrast and smaller angular scales. We study the potential spectroastrometry and imaging science cases of PLs on ELTs, including study of exomoons, broad-line regions of quasars, and inner circumstellar disks. △ Less

Submitted 30 November, 2023; originally announced December 2023.

Comments: AO4ELT7 conference proceedings 2023

arXiv:2312.00141 [pdf, other]

Atmospheric metallicity and C/O of HD 189733 b from high-resolution spectroscopy

Authors: Luke Finnerty, Jerry W. Xuan, Yinzi Xin, Joshua Liberman, Tobias Schofield, Michael P. Fitzgerald, Shubh Agrawal, Ashley Baker, Randall Bartos, Geoffrey A. Blake, Benjamin Calvin, Sylvain Cetre, Jacques-Robert Delorme, Greg Doppman, Daniel Echeverri, Chih-Chun Hsu, Nemanja Jovanovic, Ronald A. López, Emily C. Martin, Dimitri Mawet, Evan Morris, Jacklyn Pezzato, Jean-Baptiste Ruffio, Ben Sappey, Andrew Skemer , et al. (5 additional authors not shown)

Abstract: We present high-resolution $K$-band emission spectra of the quintessential hot Jupiter HD 189733 b from the Keck Planet Imager and Characterizer (KPIC). Using a Bayesian retrieval framework, we fit the dayside pressure-temperature profile, orbital kinematics, mass-mixing ratios of H$_2$O, CO, CH$_4$, NH$_3$, HCN, and H$_2$S, and the $\rm ^{13}CO/^{12}CO$ ratio. We measure mass fractions of… ▽ More We present high-resolution $K$-band emission spectra of the quintessential hot Jupiter HD 189733 b from the Keck Planet Imager and Characterizer (KPIC). Using a Bayesian retrieval framework, we fit the dayside pressure-temperature profile, orbital kinematics, mass-mixing ratios of H$_2$O, CO, CH$_4$, NH$_3$, HCN, and H$_2$S, and the $\rm ^{13}CO/^{12}CO$ ratio. We measure mass fractions of $\rm \log H_2O = -2.0^{+0.4}_{-0.4}$ and $\rm \log CO = -2.2^{+0.5}_{-0.5}$, and place upper limits on the remaining species. Notably, we find $\rm \log CH_4 < -4.5$ at 99\% confidence, despite its anticipated presence at the equilibrium temperature of HD 189733 b assuming local thermal equilibrium. We make a tentative ($\sim3σ$) detection of $\rm ^{13}CO$, and the retrieved posteriors suggest a $\rm ^{12}C/^{13}C$ ratio similar to or substantially less than the local interstellar value. The possible $\rm ^{13}C$ enrichment would be consistent with accretion of fractionated material in ices or in the protoplanetary disk midplane. The retrieved abundances correspond to a substantially sub-stellar atmospheric $\rm C/O = 0.3\pm0.1$, while the carbon and oxygen abundances are stellar to slightly super-stellar, consistent with core-accretion models which predict an inverse correlation between C/O and metallicity. The specific combination of low C/O and high metallicity suggests significant accretion of solid material may have occurred late in the formation process of HD 189733 b. △ Less

Submitted 30 November, 2023; originally announced December 2023.

Comments: 17 pages, 7 figures, 2 tables, accepted in AJ

arXiv:2312.00006 [pdf, other]

Enhancing ML-Based DoS Attack Detection Through Combinatorial Fusion Analysis

Authors: Evans Owusu, Mohamed Rahouti, D. Frank Hsu, Kaiqi Xiong, Yufeng Xin

Abstract: Mitigating Denial-of-Service (DoS) attacks is vital for online service security and availability. While machine learning (ML) models are used for DoS attack detection, new strategies are needed to enhance their performance. We suggest an innovative method, combinatorial fusion, which combines multiple ML models using advanced algorithms. This includes score and rank combinations, weighted techniqu… ▽ More Mitigating Denial-of-Service (DoS) attacks is vital for online service security and availability. While machine learning (ML) models are used for DoS attack detection, new strategies are needed to enhance their performance. We suggest an innovative method, combinatorial fusion, which combines multiple ML models using advanced algorithms. This includes score and rank combinations, weighted techniques, and diversity strength of scoring systems. Through rigorous evaluations, we demonstrate the effectiveness of this fusion approach, considering metrics like precision, recall, and F1-score. We address the challenge of low-profiled attack classification by fusing models to create a comprehensive solution. Our findings emphasize the potential of this approach to improve DoS attack detection and contribute to stronger defense mechanisms. △ Less

Submitted 1 October, 2023; originally announced December 2023.

Comments: 6 pages, 3 figures, IEEE CNS

arXiv:2311.16290 [pdf, other]

Lightcone Hamiltonian for Ising Field Theory I: T < T_c

Authors: A. Liam Fitzpatrick, Emanuel Katz, Yuan Xin

Abstract: We study 2d Ising Field Theory (IFT) in the low-temperature phase in lightcone quantization, and show that integrating out zero modes generates a very compact form for the effective lightcone interaction that depends on the finite volume vacuum expectation value of the $σ$ operator. This form is most naturally understood in a conformal basis for the lightcone Hilbert space. We further verify that… ▽ More We study 2d Ising Field Theory (IFT) in the low-temperature phase in lightcone quantization, and show that integrating out zero modes generates a very compact form for the effective lightcone interaction that depends on the finite volume vacuum expectation value of the $σ$ operator. This form is most naturally understood in a conformal basis for the lightcone Hilbert space. We further verify that this simple form reproduces to high accuracy results for the spectra, the $c$-function, and the form-factors from integrability methods for the magnetic deformation of IFT. For generic non-integrable values of parameters we also compute the above observables and compare our numeric results to those of equal-time truncation. In particular, we report on new measurements of various bound-state form-factors as well as the stress-tensor spectral density. We find that the stress tensor spectral density provides additional evidence that certain resonances of IFT are surprisingly narrow, even at generic strong coupling. Explicit example code for constructing the effective Hamiltonian is included in an appendix. △ Less

Submitted 27 November, 2023; originally announced November 2023.

Comments: 55 pages, 10 figures

arXiv:2311.12400 [pdf, ps, other]

Curvature estimates of ancient solutions to the mean curvature flow of higher codimension with convex Gauss image

Authors: Hongbing Qiu, Y. L. Xin

Abstract: By carrying out refined curvature estimates, we prove better rigidity theorems of complete noncompact ancient solutions to the mean curvature flow in higher codimension under various Gauss image restriction. By carrying out refined curvature estimates, we prove better rigidity theorems of complete noncompact ancient solutions to the mean curvature flow in higher codimension under various Gauss image restriction. △ Less

Submitted 21 November, 2023; originally announced November 2023.

arXiv:2311.11749 [pdf, other]

A causal intervention framework for synthesizing mobility data and evaluating predictive neural networks

Authors: Ye Hong, Yanan Xin, Simon Dirmeier, Fernando Perez-Cruz, Martin Raubal

Abstract: Deep neural networks are increasingly utilized in mobility prediction tasks, yet their intricate internal workings pose challenges for interpretability, especially in comprehending how various aspects of mobility behavior affect predictions. This study introduces a causal intervention framework to assess the impact of mobility-related factors on neural networks designed for next location predictio… ▽ More Deep neural networks are increasingly utilized in mobility prediction tasks, yet their intricate internal workings pose challenges for interpretability, especially in comprehending how various aspects of mobility behavior affect predictions. This study introduces a causal intervention framework to assess the impact of mobility-related factors on neural networks designed for next location prediction -- a task focusing on predicting the immediate next location of an individual. To achieve this, we employ individual mobility models to synthesize location visit sequences and control behavior dynamics by intervening in their data generation process. We evaluate the interventional location sequences using mobility metrics and input them into well-trained networks to analyze performance variations. The results demonstrate the effectiveness in producing location sequences with distinct mobility behaviors, thereby facilitating the simulation of diverse yet realistic spatial and temporal changes. These changes result in performance fluctuations in next location prediction networks, revealing impacts of critical mobility behavior factors, including sequential patterns in location transitions, proclivity for exploring new locations, and preferences in location choices at population and individual levels. The gained insights hold value for the real-world application of mobility prediction networks, and the framework is expected to promote the use of causal inference to enhance the interpretability and robustness of neural networks in mobility applications. △ Less

Submitted 1 August, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

Comments: 34 pages, 8 figures

arXiv:2311.09732 [pdf, other]

Source Prompt: Coordinated Pre-training of Language Models on Diverse Corpora from Multiple Sources

Authors: Yipei Xu, Dakuan Lu, Jiaqing Liang, Xintao Wang, Yipeng Geng, Yingsi Xin, Hengkui Wu, Ken Chen, ruiji zhang, Yanghua Xiao

Abstract: Pre-trained language models (PLMs) have established the new paradigm in the field of NLP. For more powerful PLMs, one of the most popular and successful way is to continuously scale up sizes of the models and the pre-training corpora. These large corpora are generally obtained by converging smaller ones from multiple sources, they are thus growing increasingly diverse. However, the side-effects of… ▽ More Pre-trained language models (PLMs) have established the new paradigm in the field of NLP. For more powerful PLMs, one of the most popular and successful way is to continuously scale up sizes of the models and the pre-training corpora. These large corpora are generally obtained by converging smaller ones from multiple sources, they are thus growing increasingly diverse. However, the side-effects of these colossal converged corpora remain understudied. In this paper, we identify the disadvantage of heterogeneous corpora from multiple sources for pre-training PLMs. Towards coordinated pre-training on diverse corpora, we further propose source prompts (SP), which explicitly prompt the model of the data source at the pre-training and fine-tuning stages. Results of extensive experiments demonstrate that PLMs pre-trained with SP on diverse corpora gain significant improvement in various downstream tasks. △ Less

Submitted 16 November, 2023; originally announced November 2023.

Showing 1–50 of 313 results for author: Xin, Y