-
Towards Non-invasive and Personalized Management of Breast Cancer Patients from Multiparametric MRI via A Large Mixture-of-Modality-Experts Model
Authors:
Luyang Luo,
Mingxiang Wu,
Mei Li,
Yi Xin,
Qiong Wang,
Varut Vardhanabhuti,
Winnie CW Chu,
Zhenhui Li,
Juan Zhou,
Pranav Rajpurkar,
Hao Chen
Abstract:
Breast magnetic resonance imaging (MRI) is the imaging technique with the highest sensitivity for detecting breast cancer and is routinely used for women at high risk. Despite the comprehensive multiparametric protocol of breast MRI, existing artificial intelligence-based studies predominantly rely on single sequences and have limited validation. Here we report a large mixture-of-modality-experts…
▽ More
Breast magnetic resonance imaging (MRI) is the imaging technique with the highest sensitivity for detecting breast cancer and is routinely used for women at high risk. Despite the comprehensive multiparametric protocol of breast MRI, existing artificial intelligence-based studies predominantly rely on single sequences and have limited validation. Here we report a large mixture-of-modality-experts model (MOME) that integrates multiparametric MRI information within a unified structure, offering a noninvasive method for personalized breast cancer management. We have curated the largest multiparametric breast MRI dataset, involving 5,205 patients from three hospitals in the north, southeast, and southwest of China, for the development and extensive evaluation of our model. MOME demonstrated accurate and robust identification of breast cancer. It achieved comparable performance for malignancy recognition to that of four senior radiologists and significantly outperformed a junior radiologist, with 0.913 AUROC, 0.948 AUPRC, 0.905 F1 score, and 0.723 MCC. Our findings suggest that MOME could reduce the need for biopsies in BI-RADS 4 patients with a ratio of 7.3%, classify triple-negative breast cancer with an AUROC of 0.709, and predict pathological complete response to neoadjuvant chemotherapy with an AUROC of 0.694. The model further supports scalable and interpretable inference, adapting to missing modalities and providing decision explanations by highlighting lesions and measuring modality contributions. MOME exemplifies a discriminative, robust, scalable, and interpretable multimodal model, paving the way for noninvasive, personalized management of breast cancer patients based on multiparametric breast imaging data.
△ Less
Submitted 8 August, 2024;
originally announced August 2024.
-
Inside the Black Box: Detecting Data Leakage in Pre-trained Language Encoders
Authors:
Yuan Xin,
Zheng Li,
Ning Yu,
Dingfan Chen,
Mario Fritz,
Michael Backes,
Yang Zhang
Abstract:
Despite being prevalent in the general field of Natural Language Processing (NLP), pre-trained language models inherently carry privacy and copyright concerns due to their nature of training on large-scale web-scraped data. In this paper, we pioneer a systematic exploration of such risks associated with pre-trained language encoders, specifically focusing on the membership leakage of pre-training…
▽ More
Despite being prevalent in the general field of Natural Language Processing (NLP), pre-trained language models inherently carry privacy and copyright concerns due to their nature of training on large-scale web-scraped data. In this paper, we pioneer a systematic exploration of such risks associated with pre-trained language encoders, specifically focusing on the membership leakage of pre-training data exposed through downstream models adapted from pre-trained language encoders-an aspect largely overlooked in existing literature. Our study encompasses comprehensive experiments across four types of pre-trained encoder architectures, three representative downstream tasks, and five benchmark datasets. Intriguingly, our evaluations reveal, for the first time, the existence of membership leakage even when only the black-box output of the downstream model is exposed, highlighting a privacy risk far greater than previously assumed. Alongside, we present in-depth analysis and insights toward guiding future researchers and practitioners in addressing the privacy considerations in developing pre-trained language models.
△ Less
Submitted 20 August, 2024;
originally announced August 2024.
-
RV measurements of directly imaged brown dwarf GQ Lup B to search for exo-satellites
Authors:
Katelyn Horstman,
Jean-Baptiste Ruffio,
Konstantin Batygin,
Dimitri Mawet,
Ashley Baker,
Chih-Chun Hsu,
Jason J. Wang,
Ji Wang,
Sarah Blunt,
Jerry W. Xuan,
Yinzi Xin,
Joshua Liberman,
Shubh Agrawal,
Quinn M. Konopacky,
Geoffrey A. Blake,
Clarissa R. Do O,
Randall Bartos,
Charlotte Z. Bond,
Benjamin Calvin,
Sylvain Cetre,
Jacques-Robert Delorme,
Greg Doppmann,
Daniel Echeverri,
Luke Finnerty,
Michael P. Fitzgerald
, et al. (13 additional authors not shown)
Abstract:
GQ Lup B is one of the few substellar companions with a detected cicumplanetary disk, or CPD. Observations of the CPD suggest the presence of a cavity, possibly formed by an exo-satellite. Using the Keck Planet Imager and Characterizer (KPIC), a high contrast imaging suite that feeds a high resolution spectrograph (1.9-2.5 microns, R$\sim$35,000), we present the first dedicated radial velocity (RV…
▽ More
GQ Lup B is one of the few substellar companions with a detected cicumplanetary disk, or CPD. Observations of the CPD suggest the presence of a cavity, possibly formed by an exo-satellite. Using the Keck Planet Imager and Characterizer (KPIC), a high contrast imaging suite that feeds a high resolution spectrograph (1.9-2.5 microns, R$\sim$35,000), we present the first dedicated radial velocity (RV) observations around a high-contrast, directly imaged substellar companion, GQ Lup B, to search for exo-satellites. Over 11 epochs, we find a best and median RV error of 400-1000 m/s, most likely limited by systematic fringing in the spectra due to transmissive optics within KPIC. With this RV precision, KPIC is sensitive to exomoons 0.6-2.8% the mass of GQ Lup B ($\sim 30 M_{\text{Jup}}$) at separations between the Roche limit and $65 R_{\text{Jup}}$, or the extent of the cavity inferred within the CPD detected around GQ Lup B. Using simulations of HISPEC, a high resolution infrared spectrograph planned to debut at W.M. Keck Observatory in 2026, we estimate future exomoon sensitivity to increase by over an order of magnitude, providing sensitivity to less massive satellites potentially formed within the CPD itself. Additionally, we run simulations to estimate the amount of material that different masses of satellites could clear in a CPD to create the observed cavity. We find satellite-to-planet mass ratios of $q > 2 \times 10^{-4}$ can create observable cavities and report a maximum cavity size of $\sim 51 \, R_{\text{Jup}}$ carved from a satellite.
△ Less
Submitted 19 August, 2024;
originally announced August 2024.
-
Deepest limits on scattered light emission from the Epsilon Eridani inner debris disk with HST/STIS
Authors:
Sai Krishanth P. M.,
Ewan S. Douglas,
Ramya M. Anche,
Justin Hom,
Kerri L. Cahoy,
John H. Debes,
Hannah Jang-Condell,
Isabel Rebollido,
Bin B. Ren,
Christopher C. Stark,
Robert Thompson,
Yinzi Xin
Abstract:
Epsilon Eridani ($ε$ Eri) is one of the first debris disk systems detected by the Infrared Astronomical Satellite (IRAS). However, the system has thus far eluded detection in scattered light with no components having been directly imaged. Its similarity to a relatively young Solar System combined with its proximity makes it an excellent candidate to further our understanding of planetary system ev…
▽ More
Epsilon Eridani ($ε$ Eri) is one of the first debris disk systems detected by the Infrared Astronomical Satellite (IRAS). However, the system has thus far eluded detection in scattered light with no components having been directly imaged. Its similarity to a relatively young Solar System combined with its proximity makes it an excellent candidate to further our understanding of planetary system evolution. We present a set of coronagraphic images taken using the Space Telescope Imaging Spectrograph (STIS) coronagraph on the Hubble space telescope at a small inner working angle to detect a predicted warm inner debris disk inside 1". We used three different post-processing approaches; Non-negative Matrix Factorization (NMF), Karhunen-Lo`eve Image Processing (KLIP), and Classical reference differential imaging (RDI), to best optimize reference star subtraction, and find that NMF performed the best overall while KLIP produced the absolute best contrast inside 1". We present limits on scattered light from warm dust, with constraints on surface brightness at 6 mJy/as$^2$ at our inner working angle of 0.6". We also place a constraint of 0.5 mJy/as$^2$ outside 1", which gives us an upper limit on the brightness for outer disks and substellar companions. Finally, we calculated an upper limit on the dust albedo at $ω<$ 0.487.
△ Less
Submitted 14 August, 2024; v1 submitted 13 August, 2024;
originally announced August 2024.
-
Towards the Dynamics of a DNN Learning Symbolic Interactions
Authors:
Qihan Ren,
Yang Xu,
Junpeng Zhang,
Yue Xin,
Dongrui Liu,
Quanshi Zhang
Abstract:
This study proves the two-phase dynamics of a deep neural network (DNN) learning interactions. Despite the long disappointing view of the faithfulness of post-hoc explanation of a DNN, in recent years, a series of theorems have been proven to show that given an input sample, a small number of interactions between input variables can be considered as primitive inference patterns, which can faithful…
▽ More
This study proves the two-phase dynamics of a deep neural network (DNN) learning interactions. Despite the long disappointing view of the faithfulness of post-hoc explanation of a DNN, in recent years, a series of theorems have been proven to show that given an input sample, a small number of interactions between input variables can be considered as primitive inference patterns, which can faithfully represent every detailed inference logic of the DNN on this sample. Particularly, it has been observed that various DNNs all learn interactions of different complexities with two-phase dynamics, and this well explains how a DNN's generalization power changes from under-fitting to over-fitting. Therefore, in this study, we prove the dynamics of a DNN gradually encoding interactions of different complexities, which provides a theoretically grounded mechanism for the over-fitting of a DNN. Experiments show that our theory well predicts the real learning dynamics of various DNNs on different tasks.
△ Less
Submitted 27 July, 2024;
originally announced July 2024.
-
FOSP: Fine-tuning Offline Safe Policy through World Models
Authors:
Chenyang Cao,
Yucheng Xin,
Silang Wu,
Longxiang He,
Zichen Yan,
Junbo Tan,
Xueqian Wang
Abstract:
Model-based Reinforcement Learning (RL) has shown its high training efficiency and capability of handling high-dimensional tasks. Regarding safety issues, safe model-based RL can achieve nearly zero-cost performance and effectively manage the trade-off between performance and safety. Nevertheless, prior works still pose safety challenges due to the online exploration in real-world deployment. To a…
▽ More
Model-based Reinforcement Learning (RL) has shown its high training efficiency and capability of handling high-dimensional tasks. Regarding safety issues, safe model-based RL can achieve nearly zero-cost performance and effectively manage the trade-off between performance and safety. Nevertheless, prior works still pose safety challenges due to the online exploration in real-world deployment. To address this, some offline RL methods have emerged as solutions, which learn from a static dataset in a safe way by avoiding interactions with the environment. In this paper, we aim to further enhance safety during the deployment stage for vision-based robotic tasks by fine-tuning an offline-trained policy. We incorporate in-sample optimization, model-based policy expansion, and reachability guidance to construct a safe offline-to-online framework. Moreover, our method proves to improve the generalization of offline policy in unseen safety-constrained scenarios. Finally, the efficiency of our method is validated on simulation benchmarks with five vision-only tasks and a real robot by solving some deployment problems using limited data.
△ Less
Submitted 5 July, 2024;
originally announced July 2024.
-
Evolution of High-energy Electron Distribution in Pulsar Wind Nebulae
Authors:
Yi-Ming Liu,
Hou-Dun Zeng,
Yu-Liang Xin,
Si-Ming Liu,
Yi Zhang
Abstract:
In this paper, we analyze the spectral energy distributions (SEDs) of 17 powerful (with a spin-down luminosity greater than $10^{35}$ erg s$^{-1}$) young (with an age less than 15000 yrs) pulsar wind nebulae (PWNe) using a simple time-independent one-zone emission model. Our aim is to investigate correlations between model parameters and the ages of the corresponding PWNe, thereby revealing the ev…
▽ More
In this paper, we analyze the spectral energy distributions (SEDs) of 17 powerful (with a spin-down luminosity greater than $10^{35}$ erg s$^{-1}$) young (with an age less than 15000 yrs) pulsar wind nebulae (PWNe) using a simple time-independent one-zone emission model. Our aim is to investigate correlations between model parameters and the ages of the corresponding PWNe, thereby revealing the evolution of high-energy electron distributions within PWNe. Our findings are as follows: (1) The electron distributions in PWNe can be characterized by a double power-law with a superexponential cutoff; (2) As PWNe evolve, the high-energy end of the electron distribution spectrum becomes harder with the index decreasing from approximately 3.5 to 2.5, while the low-energy end spectrum index remains constant near 1.5; (3) There is no apparent correlation between the break energy or cutoff energy and the age of PWNe. (4) The average magnetic field within PWNe decreases with age, leading to a positive correlation between the energy loss timescale of electrons at the break energy or the high-energy cutoff, and the age of the PWN. (5) The total electron energy within PWNe remains constant near $2 \times 10^{48}$ erg, while the total magnetic energy decreases with age.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
The high-contrast performance of the Keck Planet Imager and Characterizer
Authors:
Jason J. Wang,
Dimitri Mawet,
Jerry W. Xuan,
Chih-Chun Hsu,
Jean-Baptiste Ruffio,
Katelyn Horstman,
Yinzi Xin,
Jacques-Robert Delorme,
Nemanja Jovanovic,
Yapeng Zhang,
Luke Finnerty,
Ashley Baker,
Randall Bartos,
Geoffrey A. Blake,
Benjamin Calvin,
Sylvain Cetre,
Gregory W. Doppmann,
Daniel Echeverri,
Michael P. Fitzgerald,
Joshua Liberman,
Ronald Lopez,
Evan Morris,
Jacklyn Pezzato-Rovner,
Ben Sappey,
Tobias Schofield
, et al. (3 additional authors not shown)
Abstract:
The Keck Planet Imager and Characterizer (KPIC), a series of upgrades to the Keck II Adaptive Optics System and Instrument Suite, aims to demonstrate high-resolution spectroscopy of faint exoplanets that are spatially resolved from their host stars. In this paper, we measure KPIC's sensitivity to companions as a function of separation (i.e., the contrast curve) using on-sky data collected over fou…
▽ More
The Keck Planet Imager and Characterizer (KPIC), a series of upgrades to the Keck II Adaptive Optics System and Instrument Suite, aims to demonstrate high-resolution spectroscopy of faint exoplanets that are spatially resolved from their host stars. In this paper, we measure KPIC's sensitivity to companions as a function of separation (i.e., the contrast curve) using on-sky data collected over four years of operation. We show that KPIC is able to reach contrasts of $1.3 \times 10^{-4}$ at 90 mas and $9.2 \times 10^{-6}$ at 420 mas separation from the star, and that KPIC can reach planet-level sensitivities at angular separations within the inner working angle of coronagraphic instruments such as GPI and SPHERE. KPIC is also able to achieve more extreme contrasts than other medium-/high-resolution spectrographs that are not as optimized for high-contrast performance. We decompose the KPIC performance budget into individual noise terms and discuss limiting factors. The fringing that results from combining a high-contrast imaging system with a high-resolution spectrograph is identified as an important source of systematic noise. After mitigation and correction, KPIC is able to reach within a factor of 2 of the photon noise limit at separations < 200 mas. At large separations, KPIC is limited by the background noise performance of NIRSPEC.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
D2O: Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models
Authors:
Zhongwei Wan,
Xinjian Wu,
Yu Zhang,
Yi Xin,
Chaofan Tao,
Zhihong Zhu,
Xin Wang,
Siqi Luo,
Jing Xiong,
Mi Zhang
Abstract:
Efficient inference in Large Language Models (LLMs) is impeded by the growing memory demands of key-value (KV) caching, especially for longer sequences. Traditional KV cache eviction strategies, which prioritize less critical KV-pairs based on attention scores, often degrade generation quality, leading to issues such as context loss or hallucinations. To address this, we introduce Dynamic Discrimi…
▽ More
Efficient inference in Large Language Models (LLMs) is impeded by the growing memory demands of key-value (KV) caching, especially for longer sequences. Traditional KV cache eviction strategies, which prioritize less critical KV-pairs based on attention scores, often degrade generation quality, leading to issues such as context loss or hallucinations. To address this, we introduce Dynamic Discriminative Operations (D2O), a novel method that utilizes two-level discriminative strategies to optimize KV cache size without fine-tuning, while preserving essential context. Initially, by observing varying densities of attention weights between shallow and deep layers, we use this insight to determine which layers should avoid excessive eviction to minimize information loss. Subsequently, for the eviction strategy in each layer, D2O innovatively incorporates a compensation mechanism that maintains a similarity threshold to re-discriminate the importance of previously discarded tokens, determining whether they should be recalled and merged with similar tokens. Our approach not only achieves significant memory savings and enhances inference throughput by more than 3 times but also maintains high-quality long-text generation. Extensive experiments across various benchmarks and LLM architectures have demonstrated that D2O significantly enhances performance with a constrained KV cache budget.
△ Less
Submitted 23 June, 2024; v1 submitted 18 June, 2024;
originally announced June 2024.
-
Constraints on Ultra Heavy Dark Matter Properties from Dwarf Spheroidal Galaxies with LHAASO Observations
Authors:
Zhen Cao,
F. Aharonian,
Q. An,
Axikegu,
Y. X. Bai,
Y. W. Bao,
D. Bastieri,
X. J. Bi,
Y. J. Bi,
J. T. Cai,
Q. Cao,
W. Y. Cao,
Zhe Cao,
J. Chang,
J. F. Chang,
A. M. Chen,
E. S. Chen,
Liang Chen,
Lin Chen,
Long Chen,
M. J. Chen,
M. L. Chen,
Q. H. Chen,
S. H. Chen,
S. Z. Chen
, et al. (255 additional authors not shown)
Abstract:
In this work we try to search for signals generated by ultra-heavy dark matter at the Large High Altitude Air Shower Observatory (LHAASO) data. We look for possible gamma-ray by dark matter annihilation or decay from 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter which have low fluxes…
▽ More
In this work we try to search for signals generated by ultra-heavy dark matter at the Large High Altitude Air Shower Observatory (LHAASO) data. We look for possible gamma-ray by dark matter annihilation or decay from 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter which have low fluxes of astrophysical $γ$-ray background while large amount of dark matter. By analyzing more than 700 days observational data at LHAASO, no significant dark matter signal from 1 TeV to 1 EeV is detected. Accordingly we derive the most stringent constraints on the ultra-heavy dark matter annihilation cross-section up to EeV. The constraints on the lifetime of dark matter in decay mode are also derived.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
Did Harold Zuercher Have Time-Separable Preferences?
Authors:
Jay Lu,
Yao Luo,
Kota Saito,
Yi Xin
Abstract:
This paper proposes an empirical model of dynamic discrete choice to allow for non-separable time preferences, generalizing the well-known Rust (1987) model. Under weak conditions, we show the existence of value functions and hence well-defined optimal choices. We construct a contraction mapping of the value function and propose an estimation method similar to Rust's nested fixed point algorithm.…
▽ More
This paper proposes an empirical model of dynamic discrete choice to allow for non-separable time preferences, generalizing the well-known Rust (1987) model. Under weak conditions, we show the existence of value functions and hence well-defined optimal choices. We construct a contraction mapping of the value function and propose an estimation method similar to Rust's nested fixed point algorithm. Finally, we apply the framework to the bus engine replacement data. We improve the fit of the data with our general model and reject the null hypothesis that Harold Zuercher has separable time preferences. Misspecifying an agent's preference as time-separable when it is not leads to biased inferences about structure parameters (such as the agent's risk attitudes) and misleading policy recommendations.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
Authors:
Zesen Cheng,
Sicong Leng,
Hang Zhang,
Yifei Xin,
Xin Li,
Guanzheng Chen,
Yongxin Zhu,
Wenqi Zhang,
Ziyang Luo,
Deli Zhao,
Lidong Bing
Abstract:
In this paper, we present the VideoLLaMA 2, a set of Video Large Language Models (Video-LLMs) designed to enhance spatial-temporal modeling and audio understanding in video and audio-oriented tasks. Building upon its predecessor, VideoLLaMA 2 incorporates a tailor-made Spatial-Temporal Convolution (STC) connector, which effectively captures the intricate spatial and temporal dynamics of video data…
▽ More
In this paper, we present the VideoLLaMA 2, a set of Video Large Language Models (Video-LLMs) designed to enhance spatial-temporal modeling and audio understanding in video and audio-oriented tasks. Building upon its predecessor, VideoLLaMA 2 incorporates a tailor-made Spatial-Temporal Convolution (STC) connector, which effectively captures the intricate spatial and temporal dynamics of video data. Additionally, we integrate an Audio Branch into the model through joint training, thereby enriching the multimodal understanding capabilities of the model by seamlessly incorporating audio cues. Comprehensive evaluations on multiple-choice video question answering (MC-VQA), open-ended video question answering (OE-VQA), and video captioning (VC) tasks demonstrate that VideoLLaMA 2 consistently achieves competitive results among open-source models and even gets close to some proprietary models on several benchmarks. Furthermore, VideoLLaMA 2 exhibits reasonable improvements in audio-only and audio-video question-answering (AQA & OE-AVQA) benchmarks over existing models. These advancements underline VideoLLaMA 2's superior performance in multimodal comprehension, setting a new standard for intelligent video analysis systems. All models are public to facilitate further research.
△ Less
Submitted 17 June, 2024; v1 submitted 11 June, 2024;
originally announced June 2024.
-
Redefining DDoS Attack Detection Using A Dual-Space Prototypical Network-Based Approach
Authors:
Fernando Martinez,
Mariyam Mapkar,
Ali Alfatemi,
Mohamed Rahouti,
Yufeng Xin,
Kaiqi Xiong,
Nasir Ghani
Abstract:
Distributed Denial of Service (DDoS) attacks pose an increasingly substantial cybersecurity threat to organizations across the globe. In this paper, we introduce a new deep learning-based technique for detecting DDoS attacks, a paramount cybersecurity challenge with evolving complexity and scale. Specifically, we propose a new dual-space prototypical network that leverages a unique dual-space loss…
▽ More
Distributed Denial of Service (DDoS) attacks pose an increasingly substantial cybersecurity threat to organizations across the globe. In this paper, we introduce a new deep learning-based technique for detecting DDoS attacks, a paramount cybersecurity challenge with evolving complexity and scale. Specifically, we propose a new dual-space prototypical network that leverages a unique dual-space loss function to enhance detection accuracy for various attack patterns through geometric and angular similarity measures. This approach capitalizes on the strengths of representation learning within the latent space (a lower-dimensional representation of data that captures complex patterns for machine learning analysis), improving the model's adaptability and sensitivity towards varying DDoS attack vectors. Our comprehensive evaluation spans multiple training environments, including offline training, simulated online training, and prototypical network scenarios, to validate the model's robustness under diverse data abundance and scarcity conditions. The Multilayer Perceptron (MLP) with Attention, trained with our dual-space prototypical design over a reduced training set, achieves an average accuracy of 94.85% and an F1-Score of 94.71% across our tests, showcasing its effectiveness in dynamic and constrained real-world scenarios.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Plug-in UL-CSI-Assisted Precoder Upsampling Approach in Cellular FDD Systems
Authors:
Yu-Chien Lin,
Yan Xin,
Ta-Sung Lee,
Charlie,
Zhang,
Yibo Ma,
Zhi Ding
Abstract:
Acquiring downlink channel state information (CSI) is crucial for optimizing performance in massive Multiple Input Multiple Output (MIMO) systems operating under Frequency-Division Duplexing (FDD). Most cellular wireless communication systems employ codebook-based precoder designs, which offer advantages such as simpler, more efficient feedback mechanisms and reduced feedback overhead. Common code…
▽ More
Acquiring downlink channel state information (CSI) is crucial for optimizing performance in massive Multiple Input Multiple Output (MIMO) systems operating under Frequency-Division Duplexing (FDD). Most cellular wireless communication systems employ codebook-based precoder designs, which offer advantages such as simpler, more efficient feedback mechanisms and reduced feedback overhead. Common codebook-based approaches include Type II and eType II precoding methods defined in the 3GPP standards. Feedback in these systems is typically standardized per subband (SB), allowing user equipment (UE) to select the optimal precoder from the codebook for each SB, thereby reducing feedback overhead. However, this subband-level feedback resolution may not suffice for frequency-selective channels. This paper addresses this issue by introducing an uplink CSI-assisted precoder upsampling module deployed at the gNodeB. This module upsamples SB-level precoders to resource block (RB)-level precoders, acting as a plug-in compatible with existing gNodeB or base stations.
△ Less
Submitted 31 May, 2024;
originally announced June 2024.
-
Strategies to enhance THz harmonic generation combining multilayered, gated, and metamaterial-based architectures
Authors:
Ali Maleki,
Moritz B. Heindl,
Yongbao Xin,
Robert W. Boyd,
Georg Herink,
Jean-Michel Ménard
Abstract:
Graphene has unique properties paving the way for groundbreaking future applications. Its large optical nonlinearity and ease of integration in devices notably makes it an ideal candidate to become a key component for all-optical switching and frequency conversion applications. In the terahertz (THz) region, various approaches have been independently demonstrated to optimize the nonlinear effects…
▽ More
Graphene has unique properties paving the way for groundbreaking future applications. Its large optical nonlinearity and ease of integration in devices notably makes it an ideal candidate to become a key component for all-optical switching and frequency conversion applications. In the terahertz (THz) region, various approaches have been independently demonstrated to optimize the nonlinear effects in graphene, addressing a critical limitation arising from the atomically thin interaction length. Here, we demonstrate sample architectures that combine strategies to enhance THz nonlinearities in graphene-based structures. We achieve this by increasing the interaction length through a multilayered design, controlling carrier density with an electrical gate, and modulating the THz field spatial distribution with a metallic metasurface substrate. Our study specifically investigates third harmonic generation (THG) using a table-top high-field THz source. We measure THG enhancement factors exceeding thirty and propose architectures capable of achieving a two-order-of-magnitude increase. These findings highlight the potential of engineered graphene-based samples in advancing THz frequency conversion technologies for signal processing and wireless communication applications.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
Towards Understanding the Working Mechanism of Text-to-Image Diffusion Model
Authors:
Mingyang Yi,
Aoxue Li,
Yi Xin,
Zhenguo Li
Abstract:
Recently, the strong latent Diffusion Probabilistic Model (DPM) has been applied to high-quality Text-to-Image (T2I) generation (e.g., Stable Diffusion), by injecting the encoded target text prompt into the gradually denoised diffusion image generator. Despite the success of DPM in practice, the mechanism behind it remains to be explored. To fill this blank, we begin by examining the intermediate…
▽ More
Recently, the strong latent Diffusion Probabilistic Model (DPM) has been applied to high-quality Text-to-Image (T2I) generation (e.g., Stable Diffusion), by injecting the encoded target text prompt into the gradually denoised diffusion image generator. Despite the success of DPM in practice, the mechanism behind it remains to be explored. To fill this blank, we begin by examining the intermediate statuses during the gradual denoising generation process in DPM. The empirical observations indicate, the shape of image is reconstructed after the first few denoising steps, and then the image is filled with details (e.g., texture). The phenomenon is because the low-frequency signal (shape relevant) of the noisy image is not corrupted until the final stage in the forward process (initial stage of generation) of adding noise in DPM. Inspired by the observations, we proceed to explore the influence of each token in the text prompt during the two stages. After a series of experiments of T2I generations conditioned on a set of text prompts. We conclude that in the earlier generation stage, the image is mostly decided by the special token [\texttt{EOS}] in the text prompt, and the information in the text prompt is already conveyed in this stage. After that, the diffusion model completes the details of generated images by information from themselves. Finally, we propose to apply this observation to accelerate the process of T2I generation by properly removing text guidance, which finally accelerates the sampling up to 25\%+.
△ Less
Submitted 24 May, 2024;
originally announced May 2024.
-
Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference
Authors:
Ting Liu,
Xuyang Liu,
Siteng Huang,
Liangtao Shi,
Zunnan Xu,
Yi Xin,
Quanjun Yin,
Xiaohong Liu
Abstract:
Parameter-efficient fine-tuning (PEFT) has emerged as a popular solution for adapting pre-trained Vision Transformer (ViT) models to downstream applications. While current PEFT methods have achieved parameter efficiency, they overlook the efficiency of computation and GPU memory during both fine-tuning and inference, falling short of practical requirements. In this paper, we propose \textbf{Sparse…
▽ More
Parameter-efficient fine-tuning (PEFT) has emerged as a popular solution for adapting pre-trained Vision Transformer (ViT) models to downstream applications. While current PEFT methods have achieved parameter efficiency, they overlook the efficiency of computation and GPU memory during both fine-tuning and inference, falling short of practical requirements. In this paper, we propose \textbf{Sparse-Tuning}, a novel PEFT method that accounts for the information redundancy in images and videos to boost the above efficiency. By sparsely preserving the semantic-relevant tokens and merging irrelevant ones, Sparse-Tuning minimizes the quantity of tokens processed at each layer, leading to a quadratic reduction in computational and memory overhead. To align our token sparsification strategy suitably with fine-tuning purposes, we further design Dense Adapters that establish dense connections from shallow layers to deeper layers. These Dense Adapters integrate multi-level local features to enrich the current tokens, improving both token preservation and model adaptation. Empirical results on VTAB-1K, three image datasets, and two video datasets show that our Sparse-Tuning reduces GFLOPs to \textbf{62\%-70\%} of the original ViT-B while achieving state-of-the-art performance. Source code is available at \url{https://github.com/liuting20/Sparse-Tuning}.
△ Less
Submitted 29 August, 2024; v1 submitted 23 May, 2024;
originally announced May 2024.
-
Data quality control system and long-term performance monitor of the LHAASO-KM2A
Authors:
Zhen Cao,
F. Aharonian,
Axikegu,
Y. X. Bai,
Y. W. Bao,
D. Bastieri,
X. J. Bi,
Y. J. Bi,
W. Bian,
A. V. Bukevich,
Q. Cao,
W. Y. Cao,
Zhe Cao,
J. Chang,
J. F. Chang,
A. M. Chen,
E. S. Chen,
H. X. Chen,
Liang Chen,
Lin Chen,
Long Chen,
M. J. Chen,
M. L. Chen,
Q. H. Chen,
S. Chen
, et al. (263 additional authors not shown)
Abstract:
The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To…
▽ More
The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To ensure the reliability of the LHAASO-KM2A data, a three-level quality control system has been established. It is used to monitor the status of detector units, stability of reconstructed parameters and the performance of the array based on observations of the Crab Nebula and Moon shadow. This paper will introduce the control system and its application on the LHAASO-KM2A data collected from August 2021 to July 2023. During this period, the pointing and angular resolution of the array were stable. From the observations of the Moon shadow and Crab Nebula, the results achieved using the two methods are consistent with each other. According to the observation of the Crab Nebula at energies from 25 TeV to 100 TeV, the time averaged pointing errors are estimated to be $-0.003^{\circ} \pm 0.005^{\circ}$ and $0.001^{\circ} \pm 0.006^{\circ}$ in the R.A. and Dec directions, respectively.
△ Less
Submitted 13 June, 2024; v1 submitted 20 May, 2024;
originally announced May 2024.
-
Rotation and Abundances of the Benchmark Brown Dwarf HD 33632 Ab from Keck/KPIC High-resolution Spectroscopy
Authors:
Chih-Chun Hsu,
Jason J. Wang,
Jerry W. Xuan,
Jean-Baptiste Ruffio,
Daniel Echeverri,
Yinzi Xin,
Joshua Liberman,
Luke Finnerty,
Evan Morris,
Katelyn Horstman,
Ben Sappey,
Gregory W. Doppmann,
Dimitri Mawet,
Nemanja Jovanovic,
Michael P. Fitzgerald,
Jacques-Robert Delorme,
J. Kent Wallace,
Ashley Baker,
Randall Bartos,
Geoffrey A. Blake,
Benjamin Calvin,
Sylvain Cetre,
Ronald A. López,
Jacklyn Pezzato,
Tobias Schofield
, et al. (2 additional authors not shown)
Abstract:
We present the projected rotational velocity and molecular abundances for HD 33632 Ab obtained via Keck Planet Imager and Characterizer high-resolution spectroscopy. HD 33632 Ab is a nearby benchmark brown dwarf companion at a separation of $\sim$20 au that straddles the L/T transition. Using a forward-modeling framework with on-axis host star spectra, self-consistent substellar atmospheric and re…
▽ More
We present the projected rotational velocity and molecular abundances for HD 33632 Ab obtained via Keck Planet Imager and Characterizer high-resolution spectroscopy. HD 33632 Ab is a nearby benchmark brown dwarf companion at a separation of $\sim$20 au that straddles the L/T transition. Using a forward-modeling framework with on-axis host star spectra, self-consistent substellar atmospheric and retrieval models for HD 33632 Ab, we derive a projected rotational velocity of 53 $\pm$ 3 km/s and carbon/water mass fractions of log CO = $-$2.3 $\pm$ 0.3 and log H$_2$O = $-$2.7 $\pm$ 0.2. The inferred carbon-to-oxygen ratio (C/O = 0.58 $\pm$ 0.14), molecular abundances, and metallicity ([C/H] = 0.0 $\pm$ 0.2 dex) of HD 33632 Ab are consistent with its host star. Although detectable methane opacities are expected in L/T transition objects, we did not recover methane in our KPIC spectra, partly due to the high $v\sin{i}$ and to disequilibrium chemistry at the pressures we are sensitive to. We parameterize the spin as the ratio of rotation over break-up velocity, and compare HD 33632 Ab to a compilation of >200 very low-mass objects (M$\lesssim$0.1 M$_{\odot}$) that have spin measurements in the literature. There appears to be no clear trend for the isolated field low-mass objects versus mass, but a tentative trend is identified for low-mass companions and directly imaged exoplanets, similar to previous findings. A larger sample of close-in gas giant exoplanets and brown dwarfs will critically examine our understanding of their formation and evolution through rotation and chemical abundance measurements.
△ Less
Submitted 18 June, 2024; v1 submitted 14 May, 2024;
originally announced May 2024.
-
Discovery of Very-high-energy Gamma-ray Emissions from the Low Luminosity AGN NGC 4278 by LHAASO
Authors:
Zhen Cao,
F. Aharonian,
Q. An,
Axikegu,
Y. X. Bai,
Y. W. Bao,
D. Bastieri,
X. J. Bi,
Y. J. Bi,
J. T. Cai,
Q. Cao,
W. Y. Cao,
Zhe Cao,
J. Chang,
J. F. Chang,
A. M. Chen,
E. S. Chen,
Liang Chen,
Lin Chen,
Long Chen,
M. J. Chen,
M. L. Chen,
Q. H. Chen,
S. H. Chen,
S. Z. Chen
, et al. (255 additional authors not shown)
Abstract:
The first source catalog of Large High Altitude Air Shower Observatory reported the detection of a very-high-energy gamma ray source, 1LHAASO J1219+2915. In this paper a further detailed study of the spectral and temporal behavior of this point-like source have been carried. The best-fit position of the TeV source ($\rm{RA}=185.05^{\circ}\pm0.04^{\circ}$, $\rm{Dec}=29.25^{\circ}\pm0.03^{\circ}$) i…
▽ More
The first source catalog of Large High Altitude Air Shower Observatory reported the detection of a very-high-energy gamma ray source, 1LHAASO J1219+2915. In this paper a further detailed study of the spectral and temporal behavior of this point-like source have been carried. The best-fit position of the TeV source ($\rm{RA}=185.05^{\circ}\pm0.04^{\circ}$, $\rm{Dec}=29.25^{\circ}\pm0.03^{\circ}$) is compatible with NGC 4278 within $\sim0.03$ degree. Variation analysis shows an indication of the variability at a few months level in the TeV band, which is consistent with low frequency observations. Based on these observations, we report the detection of TeV $γ$-ray emissions from this low-luminosity AGN NGC 4278. The observations by LHAASO-WCDA during active period has a significance level of 8.8\,$σ$ with best-fit photon spectral index $\varGamma=2.56\pm0.14$ and a flux $f_{1-10\,\rm{TeV}}=(7.0\pm1.1_{\rm{sta}}\pm0.35_{\rm{syst}})\times10^{-13}\,\rm{photons\,cm^{-2}\,s^{-1}}$, or approximately $5\%$ of the Crab Nebula. The discovery of VHE from NGC 4278 indicates that the compact, weak radio jet can efficiently accelerate particles and emit TeV photons.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Creating cyclo-N$_5$$^{+}$ cation and assembling N$_5$$^{+}$N$_5$$^{-}$ salt via electronegativity co-matching in tailored ionic compounds
Authors:
Bi Zhang,
Yu Xin,
Meiling Xu,
Yiming Zhang,
Yinwei Li,
Yanchao Wang,
Changfeng Chen
Abstract:
The recent discovery of crystalline pentazolates marks a major advance in polynitrogen science and raises prospects of making the long-touted potent propellant N$_5$$^{+}$N$_5$$^{-}$ salt. However, despite the synthesis of cyclo-N$_5$$^{-}$ anion in pentazolates, counter cation cyclo-N$_5$$^{+}$ remains elusive due to the strong oxidizing power of pentazole ion; moreover, pure N$_5$$^{+}$N$_5$…
▽ More
The recent discovery of crystalline pentazolates marks a major advance in polynitrogen science and raises prospects of making the long-touted potent propellant N$_5$$^{+}$N$_5$$^{-}$ salt. However, despite the synthesis of cyclo-N$_5$$^{-}$ anion in pentazolates, counter cation cyclo-N$_5$$^{+}$ remains elusive due to the strong oxidizing power of pentazole ion; moreover, pure N$_5$$^{+}$N$_5$$^{-}$ salt is known to be unstable. Here, we devise a new strategy for making rare cyclo-N$_5$$^{+}$ cation and assembling the long-sought N$_5$$^{+}$N$_5$$^{-}$ salt in tailored ionic compounds, wherein the negative/positive host ions act as oxidizing/reducing agents to form cyclo-N$_5$$^{+}$/N$_5$$^{-}$ species. This strategy is implemented via an advanced computational crystal structure search, which identifies XN$_5$N$_5$F (X = Li, Na, K) compounds that stabilize at high pressures and remain viable at ambient pressure-temperature conditions based on \textit{ab initio} molecular dynamics simulations. This finding opens an avenue for creating and stabilizing N$_5$$^{+}$N$_5$$^{-}$ salt assembly in ionic compounds, where cyclo-N$_5$ species are oxidized/reduced via co-matching with host ions of high/low electronegativity. The present results demonstrate novel polynitrogen chemistry, and these findings offer new insights and prospects in the design and synthesis of diverse chemical species that exhibit unusual charge states, bonding structures, and superior functionality.
△ Less
Submitted 10 May, 2024;
originally announced May 2024.
-
LEAP: Optimization Hierarchical Federated Learning on Non-IID Data with Coalition Formation Game
Authors:
Jianfeng Lu,
Yue Chen,
Shuqin Cao,
Longbiao Chen,
Wei Wang,
Yun Xin
Abstract:
Although Hierarchical Federated Learning (HFL) utilizes edge servers (ESs) to alleviate communication burdens, its model performance will be degraded by non-IID data and limited communication resources. Current works often assume that data is uniformly distributed, which however contradicts the heterogeneity of IoT. Solutions of additional model training to check the data distribution inevitably i…
▽ More
Although Hierarchical Federated Learning (HFL) utilizes edge servers (ESs) to alleviate communication burdens, its model performance will be degraded by non-IID data and limited communication resources. Current works often assume that data is uniformly distributed, which however contradicts the heterogeneity of IoT. Solutions of additional model training to check the data distribution inevitably increases computational costs and the risk of privacy leakage. The challenges in solving these issues are how to reduce the impact of non-IID data without involving raw data and how to rationalize the communication resource allocation for addressing straggler problem. To tackle these challenges, we propose a novel optimization method based on coaLition formation gamE and grAdient Projection, called LEAP. Specifically, we combine edge data distribution with coalition formation game innovatively to adjust the correlations between clients and ESs dynamically, which ensures optimal correlations. We further capture the client heterogeneity to achieve the rational bandwidth allocation from coalition perception and determine the optimal transmission power within specified delay constraints at client level. Experimental results on four real datasets show that LEAP is able to achieve 20.62% improvement in model accuracy compared to the state-of-the-art baselines. Moreover, LEAP effectively reduce transmission energy consumption by at least about 2.24 times.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
Counterfactual Explanations for Deep Learning-Based Traffic Forecasting
Authors:
Rushan Wang,
Yanan Xin,
Yatao Zhang,
Fernando Perez-Cruz,
Martin Raubal
Abstract:
Deep learning models are widely used in traffic forecasting and have achieved state-of-the-art prediction accuracy. However, the black-box nature of those models makes the results difficult to interpret by users. This study aims to leverage an Explainable AI approach, counterfactual explanations, to enhance the explainability and usability of deep learning-based traffic forecasting models. Specifi…
▽ More
Deep learning models are widely used in traffic forecasting and have achieved state-of-the-art prediction accuracy. However, the black-box nature of those models makes the results difficult to interpret by users. This study aims to leverage an Explainable AI approach, counterfactual explanations, to enhance the explainability and usability of deep learning-based traffic forecasting models. Specifically, the goal is to elucidate relationships between various input contextual features and their corresponding predictions. We present a comprehensive framework that generates counterfactual explanations for traffic forecasting and provides usable insights through the proposed scenario-driven counterfactual explanations. The study first implements a deep learning model to predict traffic speed based on historical traffic data and contextual variables. Counterfactual explanations are then used to illuminate how alterations in these input variables affect predicted outcomes, thereby enhancing the transparency of the deep learning model. We investigated the impact of contextual features on traffic speed prediction under varying spatial and temporal conditions. The scenario-driven counterfactual explanations integrate two types of user-defined constraints, directional and weighting constraints, to tailor the search for counterfactual explanations to specific use cases. These tailored explanations benefit machine learning practitioners who aim to understand the model's learning mechanisms and domain experts who seek insights for real-world applications. The results showcase the effectiveness of counterfactual explanations in revealing traffic patterns learned by deep learning models, showing its potential for interpreting black-box deep learning models used for spatiotemporal predictions in general.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
Keck Primary Mirror Closed-Loop Segment Control using a Vector-Zernike Wavefront Sensor
Authors:
Maissa Salama,
Charlotte Guthery,
Vincent Chambouleyron,
Rebecca Jensen-Clem,
J. Kent Wallace,
Jacques-Robert Delorme,
Mitchell Troy,
Tobias Wenger,
Daniel Echeverri,
Luke Finnerty,
Nemanja Jovanovic,
Joshua Liberman,
Ronald A. Lopez,
Dimitri Mawet,
Evan C. Morris,
Maaike van Kooten,
Jason J. Wang,
Peter Wizinowich,
Yinzi Xin,
Jerry Xuan
Abstract:
We present the first on-sky segmented primary mirror closed-loop piston control using a Zernike wavefront sensor (ZWFS) installed on the Keck II telescope. Segment co-phasing errors are a primary contributor to contrast limits on Keck and will be necessary to correct for the next generation of space missions and ground-based extremely large telescopes (ELTs), which will all have segmented primary…
▽ More
We present the first on-sky segmented primary mirror closed-loop piston control using a Zernike wavefront sensor (ZWFS) installed on the Keck II telescope. Segment co-phasing errors are a primary contributor to contrast limits on Keck and will be necessary to correct for the next generation of space missions and ground-based extremely large telescopes (ELTs), which will all have segmented primary mirrors. The goal of the ZWFS installed on Keck is to monitor and correct primary mirror co-phasing errors in parallel with science observations. The ZWFS is ideal for measuring phase discontinuities such as segment co-phasing errors and is one of the most sensitive WFS, but has limited dynamic range. The vector-ZWFS at Keck works on the adaptive optics (AO) corrected wavefront and consists of a metasurface focal plane mask which imposes two different phase shifts on the core of the point spread function (PSF) to two orthogonal light polarizations, producing two pupil images. This design extends the dynamic range compared with the scalar ZWFS. The primary mirror segment pistons were controlled in closed-loop using the ZWFS, improving the Strehl ratio on the NIRC2 science camera by up to 10 percentage points. We analyze the performance of the closed-loop tests, the impact on NIRC2 science data, and discuss the ZWFS measurements.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
LHAASO-KM2A detector simulation using Geant4
Authors:
Zhen Cao,
F. Aharonian,
Q. An,
Axikegu,
Y. X. Bai,
Y. W. Bao,
D. Bastieri,
X. J. Bi,
Y. J. Bi,
J. T. Cai,
Q. Cao,
W. Y. Cao,
Zhe Cao,
J. Chang,
J. F. Chang,
A. M. Chen,
E. S. Chen,
Liang Chen,
Lin Chen,
Long Chen,
M. J. Chen,
M. L. Chen,
Q. H. Chen,
S. H. Chen,
S. Z. Chen
, et al. (254 additional authors not shown)
Abstract:
KM2A is one of the main sub-arrays of LHAASO, working on gamma ray astronomy and cosmic ray physics at energies above 10 TeV. Detector simulation is the important foundation for estimating detector performance and data analysis. It is a big challenge to simulate the KM2A detector in the framework of Geant4 due to the need to track numerous photons from a large number of detector units (>6000) with…
▽ More
KM2A is one of the main sub-arrays of LHAASO, working on gamma ray astronomy and cosmic ray physics at energies above 10 TeV. Detector simulation is the important foundation for estimating detector performance and data analysis. It is a big challenge to simulate the KM2A detector in the framework of Geant4 due to the need to track numerous photons from a large number of detector units (>6000) with large altitude difference (30 m) and huge coverage (1.3 km^2). In this paper, the design of the KM2A simulation code G4KM2A based on Geant4 is introduced. The process of G4KM2A is optimized mainly in memory consumption to avoid memory overffow. Some simpliffcations are used to signiffcantly speed up the execution of G4KM2A. The running time is reduced by at least 30 times compared to full detector simulation. The particle distributions and the core/angle resolution comparison between simulation and experimental data of the full KM2A array are also presented, which show good agreement.
△ Less
Submitted 7 April, 2024;
originally announced April 2024.
-
Laboratory demonstration of a Photonic Lantern Nuller in monochromatic and broadband light
Authors:
Yinzi Xin,
Daniel Echeverri,
Nemanja Jovanovic,
Dimitri Mawet,
Sergio Leon-Saval,
Rodrigo Amezcua-Correa,
Stephanos Yerolatsitis,
Michael P. Fitzgerald,
Pradip Gatkine,
Yoo Jung Kim,
Jonathan Lin,
Barnaby Norris,
Garreth Ruane,
Steph Sallum
Abstract:
Photonic lantern nulling (PLN) is a method for enabling the detection and characterization of close-in exoplanets by exploiting the symmetries of the ports of a mode-selective photonic lantern (MSPL) to cancel out starlight. A six-port MSPL provides four ports where on-axis starlight is suppressed, while off-axis planet light is coupled with efficiencies that vary as a function of the planet's spa…
▽ More
Photonic lantern nulling (PLN) is a method for enabling the detection and characterization of close-in exoplanets by exploiting the symmetries of the ports of a mode-selective photonic lantern (MSPL) to cancel out starlight. A six-port MSPL provides four ports where on-axis starlight is suppressed, while off-axis planet light is coupled with efficiencies that vary as a function of the planet's spatial position. We characterize the properties of a six-port MSPL in the laboratory and perform the first testbed demonstration of the PLN in monochromatic light (1569 nm) and in broadband light (1450 nm to 1625 nm), each using two orthogonal polarizations. We compare the measured spatial throughput maps with those predicted by simulations using the lantern's modes. We find that the morphologies of the measured throughput maps are reproduced by the simulations, though the real lantern is lossy and has lower throughputs overall. The measured ratios of on-axis stellar leakage to peak off-axis throughput are around 10^(-2), likely limited by testbed wavefront errors. These null-depths are already sufficient for observing young gas giants at the diffraction limit using ground-based observatories. Future work includes using wavefront control to further improve the nulls, as well as testing and validating the PLN on-sky.
△ Less
Submitted 1 April, 2024;
originally announced April 2024.
-
Vortex Fiber Nulling for Exoplanet Observations: First Direct Detection of M Dwarf Companions around HIP 21543, HIP 94666, and HIP 50319
Authors:
Daniel Echeverri,
Jerry W. Xuan,
John D. Monnier,
Jacques-Robert Delorme,
Jason J. Wang,
Nemanja Jovanovic,
Katelyn Horstman,
Garreth Ruane,
Bertrand Mennesson,
Eugene Serabyn,
Dimitri Mawet,
J. Kent Wallace,
Sofia Hillman,
Ashley Baker,
Randall Bartos,
Benjamin Calvin,
Sylvain Cetre,
Greg Doppmann,
Luke Finnerty,
Michael P. Fitzgerald,
Chih-Chun Hsu,
Joshua Liberman,
Ronald Lopez,
Maxwell Millar-Blanchaer,
Evan Morris
, et al. (13 additional authors not shown)
Abstract:
Vortex fiber nulling (VFN) is a technique for detecting and characterizing faint companions at small separations from their host star. A near-infrared ($\sim2.3 μ$m) VFN demonstrator mode was deployed on the Keck Planet Imager and Characterizer (KPIC) instrument at the Keck Observatory and presented earlier. In this paper, we present the first VFN companion detections. Three targets, HIP 21543 Ab,…
▽ More
Vortex fiber nulling (VFN) is a technique for detecting and characterizing faint companions at small separations from their host star. A near-infrared ($\sim2.3 μ$m) VFN demonstrator mode was deployed on the Keck Planet Imager and Characterizer (KPIC) instrument at the Keck Observatory and presented earlier. In this paper, we present the first VFN companion detections. Three targets, HIP 21543 Ab, HIP 94666 Ab, and HIP 50319 B, were detected with host-companion flux ratios between 70 and 430 at and within one diffraction beamwidth ($λ/D$). We complement the spectra from KPIC VFN with flux ratio and position measurements from the CHARA Array to validate the VFN results and provide a more complete characterization of the targets. This paper reports the first direct detection of these three M dwarf companions, yielding their first spectra and flux ratios. Our observations provide measurements of bulk properties such as effective temperatures, radial velocities, and v$\sin{i}$, and verify the accuracy of the published orbits. These detections corroborate earlier predictions of the KPIC VFN performance, demonstrating that the instrument mode is ready for science observations.
△ Less
Submitted 25 March, 2024;
originally announced March 2024.
-
Measurements of All-Particle Energy Spectrum and Mean Logarithmic Mass of Cosmic Rays from 0.3 to 30 PeV with LHAASO-KM2A
Authors:
The LHAASO Collaboration,
Zhen Cao,
F. Aharonian,
Q. An,
A. Axikegu,
Y. X. Bai,
Y. W. Bao,
D. Bastieri,
X. J. Bi,
Y. J. Bi,
J. T. Cai,
Q. Cao,
W. Y. Cao,
Zhe Cao,
J. Chang,
J. F. Chang,
A. M. Chen,
E. S. Chen,
Liang Chen,
Lin Chen,
Long Chen,
M. J. Chen,
M. L. Chen,
Q. H. Chen,
S. H. Chen
, et al. (256 additional authors not shown)
Abstract:
We present the measurements of all-particle energy spectrum and mean logarithmic mass of cosmic rays in the energy range of 0.3-30 PeV using data collected from LHAASO-KM2A between September 2021 and December 2022, which is based on a nearly composition-independent energy reconstruction method, achieving unprecedented accuracy. Our analysis reveals the position of the knee at…
▽ More
We present the measurements of all-particle energy spectrum and mean logarithmic mass of cosmic rays in the energy range of 0.3-30 PeV using data collected from LHAASO-KM2A between September 2021 and December 2022, which is based on a nearly composition-independent energy reconstruction method, achieving unprecedented accuracy. Our analysis reveals the position of the knee at $3.67 \pm 0.05 \pm 0.15$ PeV. Below the knee, the spectral index is found to be -$2.7413 \pm 0.0004 \pm 0.0050$, while above the knee, it is -$3.128 \pm 0.005 \pm 0.027$, with the sharpness of the transition measured with a statistical error of 2%. The mean logarithmic mass of cosmic rays is almost heavier than helium in the whole measured energy range. It decreases from 1.7 at 0.3 PeV to 1.3 at 3 PeV, representing a 24% decline following a power law with an index of -$0.1200 \pm 0.0003 \pm 0.0341$. This is equivalent to an increase in abundance of light components. Above the knee, the mean logarithmic mass exhibits a power law trend towards heavier components, which is reversal to the behavior observed in the all-particle energy spectrum. Additionally, the knee position and the change in power-law index are approximately the same. These findings suggest that the knee observed in the all-particle spectrum corresponds to the knee of the light component, rather than the medium-heavy components.
△ Less
Submitted 26 March, 2024; v1 submitted 15 March, 2024;
originally announced March 2024.
-
Physics-Inspired Deep Learning Anti-Aliasing Framework in Efficient Channel State Feedback
Authors:
Yu-Chien Lin,
Yan Xin,
Ta-Sung Lee,
Charlie,
Zhang,
Zhi Ding
Abstract:
Acquiring downlink channel state information (CSI) at the base station is vital for optimizing performance in massive Multiple input multiple output (MIMO) Frequency-Division Duplexing (FDD) systems. While deep learning architectures have been successful in facilitating UE-side CSI feedback and gNB-side recovery, the undersampling issue prior to CSI feedback is often overlooked. This issue, which…
▽ More
Acquiring downlink channel state information (CSI) at the base station is vital for optimizing performance in massive Multiple input multiple output (MIMO) Frequency-Division Duplexing (FDD) systems. While deep learning architectures have been successful in facilitating UE-side CSI feedback and gNB-side recovery, the undersampling issue prior to CSI feedback is often overlooked. This issue, which arises from low density pilot placement in current standards, results in significant aliasing effects in outdoor channels and consequently limits CSI recovery performance. To this end, this work introduces a new CSI upsampling framework at the gNB as a post-processing solution to address the gaps caused by undersampling. Leveraging the physical principles of discrete Fourier transform shifting theorem and multipath reciprocity, our framework effectively uses uplink CSI to mitigate aliasing effects. We further develop a learning-based method that integrates the proposed algorithm with the Iterative Shrinkage-Thresholding Algorithm Net (ISTA-Net) architecture, enhancing our approach for non-uniform sampling recovery. Our numerical results show that both our rule-based and deep learning methods significantly outperform traditional interpolation techniques and current state-of-the-art approaches in terms of performance.
△ Less
Submitted 12 March, 2024;
originally announced March 2024.
-
Revision of the GeV $γ$-ray Emission in the Region of HESS J1813-178 with Fermi-LAT
Authors:
Xiaolei Guo,
Yuliang Xin
Abstract:
HESS J1813-178 is one of the brightest and most compact TeV $γ$-ray sources, and whether its $γ$-ray emission is associated with supernova remnant (SNR), pulsar wind nebula (PWN) or young stellar cluster (YSC) is still under debate. By analysing the GeV $γ$-ray data in the field of HESS J1813-178 using 14 years of PASS 8 data recorded by the Fermi Large Area Telescope (Fermi-LAT), we report the di…
▽ More
HESS J1813-178 is one of the brightest and most compact TeV $γ$-ray sources, and whether its $γ$-ray emission is associated with supernova remnant (SNR), pulsar wind nebula (PWN) or young stellar cluster (YSC) is still under debate. By analysing the GeV $γ$-ray data in the field of HESS J1813-178 using 14 years of PASS 8 data recorded by the Fermi Large Area Telescope (Fermi-LAT), we report the discovery of three different sources with different spectra in this region. The hard source with a power law spectral index of 2.11 $\pm$ 0.08 has a small size extension, which is spatially and spectrally coincident with the TeV $γ$-ray emission from HESS J1813-178. CO observations display the dense molecular clouds surrounding HESS J1813-178 in the velocity range of 45-60 km s$^{\rm -1}$. The possible origins of the $γ$-ray emission from HESS J1813-178 are discussed, including SNR G12.82-0.02, the PWN driven by the energetic X-ray pulsar PSR J1813-1749, and YSC Cl 1813-178. However, none of them can be ruled out clearly. Note that the maximum energy of protons in the hadronic model should exceed a few hundred TeV, which makes HESS J1813-178 to be a promising PeVatron. The detailed LHAASO data analysis about the morphology and spectrum would be helpful to investigate the origin of the $γ$-ray emission in this region and test its PeVatron nature.
△ Less
Submitted 19 February, 2024;
originally announced February 2024.
-
Coherent Imaging with Photonic Lanterns
Authors:
Yoo Jung Kim,
Michael P. Fitzgerald,
Jonathan Lin,
Steph Sallum,
Yinzi Xin,
Nemanja Jovanovic,
Sergio Leon-Saval
Abstract:
Photonic Lanterns (PLs) are tapered waveguides that gradually transition from a multi-mode fiber geometry to a bundle of single-mode fibers (SMFs). They can efficiently couple multi-mode telescope light into a multi-mode fiber entrance at the focal plane and convert it into multiple single-mode beams. Thus, each SMF samples its unique mode (lantern principal mode) of the telescope light in the pup…
▽ More
Photonic Lanterns (PLs) are tapered waveguides that gradually transition from a multi-mode fiber geometry to a bundle of single-mode fibers (SMFs). They can efficiently couple multi-mode telescope light into a multi-mode fiber entrance at the focal plane and convert it into multiple single-mode beams. Thus, each SMF samples its unique mode (lantern principal mode) of the telescope light in the pupil, analogous to subapertures in aperture masking interferometry (AMI). Coherent imaging with PLs can be enabled by interfering SMF outputs and applying phase modulation, which can be achieved using a photonic chip beam combiner at the backend (e.g., the ABCD beam combiner). In this study, we investigate the potential of coherent imaging by interfering SMF outputs of a PL with a single telescope. We demonstrate that the visibilities that can be measured from a PL are mutual intensities incident on the pupil weighted by the cross-correlation of a pair of lantern modes. From numerically simulated lantern principal modes of a 6-port PL, we find that interferometric observables using a PL behave similarly to separated-aperture visibilities for simple models on small angular scales ($<λ/D$) but with greater sensitivity to symmetries and capability to break phase angle degeneracies. Furthermore, we present simulated observations with wavefront errors and compare them to AMI. Despite the redundancy caused by extended lantern principal modes, spatial filtering offers stability to wavefront errors. Our simulated observations suggest that PLs may offer significant benefits in the photon noise-limited regime and in resolving small angular scales at low contrast regime.
△ Less
Submitted 12 February, 2024;
originally announced February 2024.
-
MINT: Boosting Audio-Language Model via Multi-Target Pre-Training and Instruction Tuning
Authors:
Hang Zhao,
Yifei Xin,
Zhesong Yu,
Bilei Zhu,
Lu Lu,
Zejun Ma
Abstract:
In the realm of audio-language pre-training (ALP), the challenge of achieving cross-modal alignment is significant. Moreover, the integration of audio inputs with diverse distributions and task variations poses challenges in developing generic audio-language models. In this study, we present MINT, a novel ALP framework boosting audio-language models through multi-target pre-training and instructio…
▽ More
In the realm of audio-language pre-training (ALP), the challenge of achieving cross-modal alignment is significant. Moreover, the integration of audio inputs with diverse distributions and task variations poses challenges in developing generic audio-language models. In this study, we present MINT, a novel ALP framework boosting audio-language models through multi-target pre-training and instruction tuning. MINT leverages the strength of frozen pre-trained audio encoders and large language models (LLM) to improve audio-language pre-training, enabling effective transferablility to both audio-text understanding and generation tasks. To address the modality gap, we introduce Bridge-Net, a trainable module that enhances cross-modality alignment and the model's ability to follow instructions for a variety of audio-text tasks. Bridge-Net is pivotal within MINT, initially enhancing audio-language representation learning through a multi-target pre-training approach. Subsequently, Bridge-Net further boosts audio-to-language generative learning by integrating a frozen language model with instruction tuning. This integration empowers MINT to extract features in a flexible and effective manner, specifically tailored to the provided instructions for diverse tasks. Experimental results demonstrate that MINT attains superior performance across various audio-language understanding and generation tasks, highlighting its robust generalization capabilities even in zero-shot scenarios.
△ Less
Submitted 11 June, 2024; v1 submitted 12 February, 2024;
originally announced February 2024.
-
Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey
Authors:
Yi Xin,
Siqi Luo,
Haodi Zhou,
Junlong Du,
Xiaohong Liu,
Yue Fan,
Qing Li,
Yuntao Du
Abstract:
Large-scale pre-trained vision models (PVMs) have shown great potential for adaptability across various downstream vision tasks. However, with state-of-the-art PVMs growing to billions or even trillions of parameters, the standard full fine-tuning paradigm is becoming unsustainable due to high computational and storage demands. In response, researchers are exploring parameter-efficient fine-tuning…
▽ More
Large-scale pre-trained vision models (PVMs) have shown great potential for adaptability across various downstream vision tasks. However, with state-of-the-art PVMs growing to billions or even trillions of parameters, the standard full fine-tuning paradigm is becoming unsustainable due to high computational and storage demands. In response, researchers are exploring parameter-efficient fine-tuning (PEFT), which seeks to exceed the performance of full fine-tuning with minimal parameter modifications. This survey provides a comprehensive overview and future directions for visual PEFT, offering a systematic review of the latest advancements. First, we provide a formal definition of PEFT and discuss model pre-training methods. We then categorize existing methods into three categories: addition-based, partial-based, and unified-based. Finally, we introduce the commonly used datasets and applications and suggest potential future research challenges. A comprehensive collection of resources is available at https://github.com/synbol/Awesome-Parameter-Efficient-Transfer-Learning.
△ Less
Submitted 8 February, 2024; v1 submitted 3 February, 2024;
originally announced February 2024.
-
Masked Audio Modeling with CLAP and Multi-Objective Learning
Authors:
Yifei Xin,
Xiulian Peng,
Yan Lu
Abstract:
Most existing masked audio modeling (MAM) methods learn audio representations by masking and reconstructing local spectrogram patches. However, the reconstruction loss mainly accounts for the signal-level quality of the reconstructed spectrogram and is still limited in extracting high-level audio semantics. In this paper, we propose to enhance the semantic modeling of MAM by distilling cross-modal…
▽ More
Most existing masked audio modeling (MAM) methods learn audio representations by masking and reconstructing local spectrogram patches. However, the reconstruction loss mainly accounts for the signal-level quality of the reconstructed spectrogram and is still limited in extracting high-level audio semantics. In this paper, we propose to enhance the semantic modeling of MAM by distilling cross-modality knowledge from contrastive language-audio pretraining (CLAP) representations for both masked and unmasked regions (MAM-CLAP) and leveraging a multi-objective learning strategy with a supervised classification branch (SupMAM), thereby providing more semantic knowledge for MAM and enabling it to effectively learn global features from labels. Experiments show that our methods significantly improve the performance on multiple downstream tasks. Furthermore, by combining our MAM-CLAP with SupMAM, we can achieve new state-of-the-art results on various audio and speech classification tasks, exceeding previous self-supervised learning and supervised pretraining methods.
△ Less
Submitted 29 January, 2024;
originally announced January 2024.
-
Anomalous Proximitized Transport in Metal/Quantum Magnet Heterostructure $\rm{Bi_{2}Ir_{2}O_{7}/Yb_{2}Ti_{2}O_{7}}$
Authors:
Chengkun Xing,
Shu Zhang,
Weiliang Yao,
Dapeng Cui,
Qing Huang,
Junyi Yang,
Shashi Pandey,
Dongliang Gong,
Lukas Horák,
Yan Xin,
Eun Sang Choi,
Yang Zhang,
Haidong Zhou,
Jian Liu
Abstract:
Fluctuations of quantum spins play a crucial role in the emergence of exotic magnetic phases and excitations. The lack of the charge degree of freedom in insulating quantum magnets, however, precludes such fluctuations from mediating electronic transport. Here we show that the quantum fluctuations of a localized frustrated magnet induce strong proximitized charge transport of the conduction electr…
▽ More
Fluctuations of quantum spins play a crucial role in the emergence of exotic magnetic phases and excitations. The lack of the charge degree of freedom in insulating quantum magnets, however, precludes such fluctuations from mediating electronic transport. Here we show that the quantum fluctuations of a localized frustrated magnet induce strong proximitized charge transport of the conduction electrons in a synthetic heterostructure comprising an epitaxial $\rm{Bi_{2}Ir_{2}O_{7}}$ ultrathin film on the single crystal of $\rm{Yb_{2}Ti_{2}O_{7}}$. The proximity effects are evidenced by the scaling behavior of the $\rm{Bi_{2}Ir_{2}O_{7}}$ resistance in correspondance with the dynamic scaling of the dynamic spin correlation function of $\rm{Yb_{2}Ti_{2}O_{7}}$, which is a result of quantum fluctuations near a multi-phase quantum critical point. The proximitized transport in $\rm{Bi_{2}Ir_{2}O_{7}}$ can be effectively tuned by magnetic field through suppressing the quantum spin fluctuations as well as inducing transitions via magnetic anisotropy in $\rm{Yb_{2}Ti_{2}O_{7}}$. Our work establishes a new pathway for harnessing quantum spin fluctuations in magnetic insulators with electric transport, offering exciting prospects for potential applications in the realm of quantum spintronics.
△ Less
Submitted 16 January, 2024;
originally announced January 2024.
-
Flexible filtrations for multiparameter persistent homology detect digital images
Authors:
Jiaxing He,
Bingzhe Hou,
Tieru Wu,
Yue Xin
Abstract:
Two important problems in the field of Topological Data Analysis are defining practical multifiltrations on objects and showing ability of TDA to detect the geometry. Motivated by the problems, we constuct three multifiltrations named multi-GENEO, multi-DGENEO and mix-GENEO, and prove the stability of both the interleaving distance and multiparameter persistence landscape of multi-GENEO with respe…
▽ More
Two important problems in the field of Topological Data Analysis are defining practical multifiltrations on objects and showing ability of TDA to detect the geometry. Motivated by the problems, we constuct three multifiltrations named multi-GENEO, multi-DGENEO and mix-GENEO, and prove the stability of both the interleaving distance and multiparameter persistence landscape of multi-GENEO with respect to the pseudometric of the subspace of bounded functions. We also give the estimations of upper bound for multi-DGENEO and mix-GENEO. Finally, we provide experiment results on MNIST dataset to demonstrate our bifiltrations have ability to detect geometric and topological differences of digital images.
△ Less
Submitted 1 April, 2024; v1 submitted 8 January, 2024;
originally announced January 2024.
-
Coronagraphic Data Post-processing Using Projections on Instrumental Modes
Authors:
Yinzi Xin,
Laurent Pueyo,
Romain Laugier,
Leonid Pogorelyuk,
Ewan S. Douglas,
Benjamin J. S. Pope,
Kerri L. Cahoy
Abstract:
Directly observing exoplanets with coronagraphs is impeded by the presence of speckles from aberrations in the optical path, which can be mitigated in hardware with wavefront control as well as in post-processing. This work explores using an instrument model in post-processing to separate astrophysical signals from residual aberrations in coronagraphic data. The effect of wavefront error (WFE) on…
▽ More
Directly observing exoplanets with coronagraphs is impeded by the presence of speckles from aberrations in the optical path, which can be mitigated in hardware with wavefront control as well as in post-processing. This work explores using an instrument model in post-processing to separate astrophysical signals from residual aberrations in coronagraphic data. The effect of wavefront error (WFE) on the coronagraphic intensity consists of a linear contribution and a quadratic contribution. When either of the terms is much larger than the other, the instrument response can be approximated by a transfer matrix mapping WFE to detector plane intensity. From this transfer matrix, a useful projection onto instrumental modes that removes the dominant error modes can be derived. We apply this projection to synthetically generated Roman Space Telescope hybrid Lyot coronagraph (HLC) data to extract "robust observables," which can be used instead of raw data for applications such as detection testing. The projection improves planet flux ratio detection limits by about 28% in the linear regime and by over a factor of 2 in the quadratic regime, illustrating that robust observables can increase sensitivity to astrophysical signals and improve the scientific yield from coronagraphic data. While this approach does not require additional information such as observations of reference stars or modulations of a deformable mirror, it can and should be combined with these other techniques, acting as a model-informed prior in an overall post-processing strategy.
△ Less
Submitted 8 January, 2024;
originally announced January 2024.
-
Advancing DDoS Attack Detection: A Synergistic Approach Using Deep Residual Neural Networks and Synthetic Oversampling
Authors:
Ali Alfatemi,
Mohamed Rahouti,
Ruhul Amin,
Sarah ALJamal,
Kaiqi Xiong,
Yufeng Xin
Abstract:
Distributed Denial of Service (DDoS) attacks pose a significant threat to the stability and reliability of online systems. Effective and early detection of such attacks is pivotal for safeguarding the integrity of networks. In this work, we introduce an enhanced approach for DDoS attack detection by leveraging the capabilities of Deep Residual Neural Networks (ResNets) coupled with synthetic overs…
▽ More
Distributed Denial of Service (DDoS) attacks pose a significant threat to the stability and reliability of online systems. Effective and early detection of such attacks is pivotal for safeguarding the integrity of networks. In this work, we introduce an enhanced approach for DDoS attack detection by leveraging the capabilities of Deep Residual Neural Networks (ResNets) coupled with synthetic oversampling techniques. Because of the inherent class imbalance in many cyber-security datasets, conventional methods often struggle with false negatives, misclassifying subtle DDoS patterns as benign. By applying the Synthetic Minority Over-sampling Technique (SMOTE) to the CICIDS dataset, we balance the representation of benign and malicious data points, enabling the model to better discern intricate patterns indicative of an attack. Our deep residual network, tailored for this specific task, further refines the detection process. Experimental results on a real-world dataset demonstrate that our approach achieves an accuracy of 99.98%, significantly outperforming traditional methods. This work underscores the potential of combining advanced data augmentation techniques with deep learning models to bolster cyber-security defenses.
△ Less
Submitted 5 January, 2024;
originally announced January 2024.
-
Real-time experimental demonstrations of a photonic lantern wavefront sensor
Authors:
Jonathan W. Lin,
Michael P. Fitzgerald,
Yinzi Xin,
Yoo Jung Kim,
Olivier Guyon,
Barnaby Norris,
Christopher Betters,
Sergio Leon-Saval,
Kyohoon Ahn,
Vincent Deo,
Julien Lozi,
Sébastien Vievard,
Daniel Levinstein,
Steph Sallum,
Nemanja Jovanovic
Abstract:
The direct imaging of an Earth-like exoplanet will require sub-nanometric wavefront control across large light-collecting apertures, to reject host starlight and detect the faint planetary signal. Current adaptive optics (AO) systems, which use wavefront sensors that reimage the telescope pupil, face two challenges that prevent this level of control: non-common-path aberrations (NCPAs), caused by…
▽ More
The direct imaging of an Earth-like exoplanet will require sub-nanometric wavefront control across large light-collecting apertures, to reject host starlight and detect the faint planetary signal. Current adaptive optics (AO) systems, which use wavefront sensors that reimage the telescope pupil, face two challenges that prevent this level of control: non-common-path aberrations (NCPAs), caused by differences between the sensing and science arms of the instrument; and petaling modes: discontinuous phase aberrations caused by pupil fragmentation, especially relevant for the upcoming 30-m class telescopes. Such aberrations drastically impact the capabilities of high-contrast instruments. To address these issues, we can add a second-stage wavefront sensor to the science focal plane. One promising architecture uses the photonic lantern (PL): a waveguide that efficiently couples aberrated light into single-mode fibers (SMFs). In turn, SMF-confined light can be stably injected into high-resolution spectrographs, enabling direct exoplanet characterization and precision radial velocity measurements; simultaneously, the PL can be used for focal-plane wavefront sensing. We present a real-time experimental demonstration of the PL wavefront sensor on the Subaru/SCExAO testbed. Our system is stable out to around ~400 nm of low-order Zernike wavefront error, and can correct petaling modes. When injecting ~30 nm RMS of low order time-varying error, we achieve ~10x rejection at 1 s timescales; further refinements to the control law and lantern fabrication process should make sub-nanometric wavefront control possible. In the future, novel sensors like the PLWFS may prove to be critical in resolving the wavefront control challenges posed by exoplanet direct imaging.
△ Less
Submitted 20 December, 2023;
originally announced December 2023.
-
Energy-Dependent Analyses of the Gamma-Ray Emission from HESS J1857+026 with Fermi-LAT
Authors:
Xiaolei Guo,
Xi Liu,
Yuliang Xin
Abstract:
We report the discovery of energy-dependent morphology for the GeV gamma-ray emission from HESS J1857+026 with more than 13 years of {\it Fermi} Large Area Telescope (LAT) data. The GeV gamma-ray emission from this region is composed of two extended components. The hard component with an index of $1.74 \pm 0.07$ in the energy range of 0.5-500 GeV is spatially coincident with HESS J1857+026, and it…
▽ More
We report the discovery of energy-dependent morphology for the GeV gamma-ray emission from HESS J1857+026 with more than 13 years of {\it Fermi} Large Area Telescope (LAT) data. The GeV gamma-ray emission from this region is composed of two extended components. The hard component with an index of $1.74 \pm 0.07$ in the energy range of 0.5-500 GeV is spatially coincident with HESS J1857+026, and its 68\% containment radius varies from $\sim 0.44^\circ$ below 40 GeV to $\sim 0.30^\circ$ above 140 GeV. The hard GeV gamma-ray spectrum and the energy-dependent morphology of HESS J1857+026 make it favor a PWN origin, which is associated with the energetic pulsar, PSR J1856+0245. The soft component with an index of $2.70 \pm 0.16$ and another extended gamma-ray source detected in this region, 4FGL J1857.9+0313e with an index of $2.55 \pm 0.07$, are spatially coincidence with two molecular clumps in the northeast and southwest of HESS J1857+026, which favors the hadronic process, and the protons could be accelerated by the hypothetical SNR associated with PSR J1856+0245.
△ Less
Submitted 18 December, 2023;
originally announced December 2023.
-
VMT-Adapter: Parameter-Efficient Transfer Learning for Multi-Task Dense Scene Understanding
Authors:
Yi Xin,
Junlong Du,
Qiang Wang,
Zhiwen Lin,
Ke Yan
Abstract:
Large-scale pre-trained models have achieved remarkable success in various computer vision tasks. A standard approach to leverage these models is to fine-tune all model parameters for downstream tasks, which poses challenges in terms of computational and storage costs. Recently, inspired by Natural Language Processing (NLP), parameter-efficient transfer learning has been successfully applied to vi…
▽ More
Large-scale pre-trained models have achieved remarkable success in various computer vision tasks. A standard approach to leverage these models is to fine-tune all model parameters for downstream tasks, which poses challenges in terms of computational and storage costs. Recently, inspired by Natural Language Processing (NLP), parameter-efficient transfer learning has been successfully applied to vision tasks. However, most existing techniques primarily focus on single-task adaptation, and despite limited research on multi-task adaptation, these methods often exhibit suboptimal training and inference efficiency. In this paper, we first propose an once-for-all Vision Multi-Task Adapter (VMT-Adapter), which strikes approximately O(1) training and inference efficiency w.r.t task number. Concretely, VMT-Adapter shares the knowledge from multiple tasks to enhance cross-task interaction while preserves task-specific knowledge via independent knowledge extraction modules. Notably, since task-specific modules require few parameters, VMT-Adapter can handle an arbitrary number of tasks with a negligible increase of trainable parameters. We also propose VMT-Adapter-Lite, which further reduces the trainable parameters by learning shared parameters between down- and up-projections. Extensive experiments on four dense scene understanding tasks demonstrate the superiority of VMT-Adapter(-Lite), achieving a 3.96%(1.34%) relative improvement compared to single-task full fine-tuning, while utilizing merely ~1% (0.36%) trainable parameters of the pre-trained model.
△ Less
Submitted 15 December, 2023; v1 submitted 14 December, 2023;
originally announced December 2023.
-
MmAP : Multi-modal Alignment Prompt for Cross-domain Multi-task Learning
Authors:
Yi Xin,
Junlong Du,
Qiang Wang,
Ke Yan,
Shouhong Ding
Abstract:
Multi-Task Learning (MTL) is designed to train multiple correlated tasks simultaneously, thereby enhancing the performance of individual tasks. Typically, a multi-task network structure consists of a shared backbone and task-specific decoders. However, the complexity of the decoders increases with the number of tasks. To tackle this challenge, we integrate the decoder-free vision-language model CL…
▽ More
Multi-Task Learning (MTL) is designed to train multiple correlated tasks simultaneously, thereby enhancing the performance of individual tasks. Typically, a multi-task network structure consists of a shared backbone and task-specific decoders. However, the complexity of the decoders increases with the number of tasks. To tackle this challenge, we integrate the decoder-free vision-language model CLIP, which exhibits robust zero-shot generalization capability. Recently, parameter-efficient transfer learning methods have been extensively explored with CLIP for adapting to downstream tasks, where prompt tuning showcases strong potential. Nevertheless, these methods solely fine-tune a single modality (text or visual), disrupting the modality structure of CLIP. In this paper, we first propose Multi-modal Alignment Prompt (MmAP) for CLIP, which aligns text and visual modalities during fine-tuning process. Building upon MmAP, we develop an innovative multi-task prompt learning framework. On the one hand, to maximize the complementarity of tasks with high similarity, we utilize a gradient-driven task grouping method that partitions tasks into several disjoint groups and assign a group-shared MmAP to each group. On the other hand, to preserve the unique characteristics of each task, we assign an task-specific MmAP to each task. Comprehensive experiments on two large multi-task learning datasets demonstrate that our method achieves significant performance improvements compared to full fine-tuning while only utilizing approximately 0.09% of trainable parameters.
△ Less
Submitted 13 December, 2023;
originally announced December 2023.
-
Validation of elemental and isotopic abundances in late-M spectral types with the benchmark HIP 55507 AB system
Authors:
Jerry W. Xuan,
Jason J. Wang,
Luke Finnerty,
Katelyn Horstman,
Simon Grimm,
Anne Peck,
Eric L. Nielsen,
Heather A. Knutson,
Dimitri Mawet,
Howard Isaacson,
Andrew W. Howard,
Michael C. Liu,
Sam Walker,
Mark Phillips,
Geoffrey Blake,
Jean-Baptiste Ruffio,
Yapeng Zhang,
Julie Inglis,
Nicole L. Wallack,
Aniket Sanghi,
Erica Gonzales,
Fei Dai,
Ashley Baker,
Randall Bartos,
Charlotte Bond
, et al. (26 additional authors not shown)
Abstract:
M dwarfs are common host stars to exoplanets but often lack atmospheric abundance measurements. Late-M dwarfs are also good analogs to the youngest substellar companions, which share similar $T_{\rm eff}\sim2300-2800~K$. We present atmospheric analyses for the M7.5 companion HIP 55507 B and its K6V primary star with Keck/KPIC high-resolution ($R\sim35,000$) $K$ band spectroscopy. First, by includi…
▽ More
M dwarfs are common host stars to exoplanets but often lack atmospheric abundance measurements. Late-M dwarfs are also good analogs to the youngest substellar companions, which share similar $T_{\rm eff}\sim2300-2800~K$. We present atmospheric analyses for the M7.5 companion HIP 55507 B and its K6V primary star with Keck/KPIC high-resolution ($R\sim35,000$) $K$ band spectroscopy. First, by including KPIC relative radial velocities between the primary and secondary in the orbit fit, we improve the dynamical mass precision by 60% and find $M_B=88.0_{-3.2}^{+3.4}$ $M_{\rm Jup}$, putting HIP 55507 B above the stellar-substellar boundary. We also find that HIP 55507 B orbits its K6V primary star with $a=38^{+4}_{-3}$ AU and $e=0.40\pm0.04$. From atmospheric retrievals of HIP 55507 B, we measure $\rm [C/H]=0.24\pm0.13$, $\rm [O/H]=0.15\pm0.13$, and $\rm C/O=0.67\pm0.04$. Moreover, we strongly detect $\rm ^{13}CO$ ($7.8σ$ significance) and tentatively detect $\rm H_2^{18}O$ ($3.7σ$ significance) in companion's atmosphere, and measure $\rm ^{12}CO/^{13}CO=98_{-22}^{+28}$ and $\rm H_2^{16}O/H_2^{18}O=240_{-80}^{+145}$ after accounting for systematic errors. From a simplified retrieval analysis of HIP 55507 A, we measure $\rm ^{12}CO/^{13}CO=79_{-16}^{+21}$ and $\rm C^{16}O/C^{18}O=288_{-70}^{+125}$ for the primary star. These results demonstrate that HIP 55507 A and B have consistent $\rm ^{12} C/^{13}C$ and $\rm ^{16}O/^{18}O$ to the $<1σ$ level, as expected for a chemically homogeneous binary system. Given the similar flux ratios and separations between HIP 55507 AB and systems with young, substellar companions, our results open the door to systematically measuring $\rm ^{13}CO$ and $\rm H_2^{18}O$ abundances in the atmospheres of substellar or even planetary-mass companions with similar spectral types.
△ Less
Submitted 4 December, 2023;
originally announced December 2023.
-
Spectroastrometry and Imaging Science with Photonic Lanterns on Extremely Large Telescopes
Authors:
Yoo Jung Kim,
Michael P. Fitzgerald,
Jonathan Lin,
Steph Sallum,
Yinzi Xin,
Nemanja Jovanovic,
Sergio Leon-Saval,
Christopher Betters,
Pradip Gatkine,
Olivier Guyon,
Julien Lozi,
Dimitri Mawet,
Barnaby Norris,
Sébastien Vievard
Abstract:
Photonic lanterns (PLs) are tapered waveguides that gradually transition from a multi-mode fiber geometry to a bundle of single-mode fibers. In astronomical applications, PLs can efficiently couple multi-mode telescope light into a multi-mode fiber entrance and convert it into multiple single-mode beams. The output beams are highly stable and suitable for feeding into high-resolution spectrographs…
▽ More
Photonic lanterns (PLs) are tapered waveguides that gradually transition from a multi-mode fiber geometry to a bundle of single-mode fibers. In astronomical applications, PLs can efficiently couple multi-mode telescope light into a multi-mode fiber entrance and convert it into multiple single-mode beams. The output beams are highly stable and suitable for feeding into high-resolution spectrographs or photonic chip beam combiners. For instance, by using relative intensities in the output cores as a function of wavelength, PLs can enable spectroastrometry. In addition, by interfering beams in the output cores with a beam combiner in the backend, PLs can be used for high-throughput interferometric imaging. When used on an Extremely Large Telescope (ELT), with its increased sensitivity and angular resolution, the imaging and spectroastrometric capabilities of PLs will be extended to higher contrast and smaller angular scales. We study the potential spectroastrometry and imaging science cases of PLs on ELTs, including study of exomoons, broad-line regions of quasars, and inner circumstellar disks.
△ Less
Submitted 30 November, 2023;
originally announced December 2023.
-
Atmospheric metallicity and C/O of HD 189733 b from high-resolution spectroscopy
Authors:
Luke Finnerty,
Jerry W. Xuan,
Yinzi Xin,
Joshua Liberman,
Tobias Schofield,
Michael P. Fitzgerald,
Shubh Agrawal,
Ashley Baker,
Randall Bartos,
Geoffrey A. Blake,
Benjamin Calvin,
Sylvain Cetre,
Jacques-Robert Delorme,
Greg Doppman,
Daniel Echeverri,
Chih-Chun Hsu,
Nemanja Jovanovic,
Ronald A. López,
Emily C. Martin,
Dimitri Mawet,
Evan Morris,
Jacklyn Pezzato,
Jean-Baptiste Ruffio,
Ben Sappey,
Andrew Skemer
, et al. (5 additional authors not shown)
Abstract:
We present high-resolution $K$-band emission spectra of the quintessential hot Jupiter HD 189733 b from the Keck Planet Imager and Characterizer (KPIC). Using a Bayesian retrieval framework, we fit the dayside pressure-temperature profile, orbital kinematics, mass-mixing ratios of H$_2$O, CO, CH$_4$, NH$_3$, HCN, and H$_2$S, and the $\rm ^{13}CO/^{12}CO$ ratio. We measure mass fractions of…
▽ More
We present high-resolution $K$-band emission spectra of the quintessential hot Jupiter HD 189733 b from the Keck Planet Imager and Characterizer (KPIC). Using a Bayesian retrieval framework, we fit the dayside pressure-temperature profile, orbital kinematics, mass-mixing ratios of H$_2$O, CO, CH$_4$, NH$_3$, HCN, and H$_2$S, and the $\rm ^{13}CO/^{12}CO$ ratio. We measure mass fractions of $\rm \log H_2O = -2.0^{+0.4}_{-0.4}$ and $\rm \log CO = -2.2^{+0.5}_{-0.5}$, and place upper limits on the remaining species. Notably, we find $\rm \log CH_4 < -4.5$ at 99\% confidence, despite its anticipated presence at the equilibrium temperature of HD 189733 b assuming local thermal equilibrium. We make a tentative ($\sim3σ$) detection of $\rm ^{13}CO$, and the retrieved posteriors suggest a $\rm ^{12}C/^{13}C$ ratio similar to or substantially less than the local interstellar value. The possible $\rm ^{13}C$ enrichment would be consistent with accretion of fractionated material in ices or in the protoplanetary disk midplane. The retrieved abundances correspond to a substantially sub-stellar atmospheric $\rm C/O = 0.3\pm0.1$, while the carbon and oxygen abundances are stellar to slightly super-stellar, consistent with core-accretion models which predict an inverse correlation between C/O and metallicity. The specific combination of low C/O and high metallicity suggests significant accretion of solid material may have occurred late in the formation process of HD 189733 b.
△ Less
Submitted 30 November, 2023;
originally announced December 2023.
-
Enhancing ML-Based DoS Attack Detection Through Combinatorial Fusion Analysis
Authors:
Evans Owusu,
Mohamed Rahouti,
D. Frank Hsu,
Kaiqi Xiong,
Yufeng Xin
Abstract:
Mitigating Denial-of-Service (DoS) attacks is vital for online service security and availability. While machine learning (ML) models are used for DoS attack detection, new strategies are needed to enhance their performance. We suggest an innovative method, combinatorial fusion, which combines multiple ML models using advanced algorithms. This includes score and rank combinations, weighted techniqu…
▽ More
Mitigating Denial-of-Service (DoS) attacks is vital for online service security and availability. While machine learning (ML) models are used for DoS attack detection, new strategies are needed to enhance their performance. We suggest an innovative method, combinatorial fusion, which combines multiple ML models using advanced algorithms. This includes score and rank combinations, weighted techniques, and diversity strength of scoring systems. Through rigorous evaluations, we demonstrate the effectiveness of this fusion approach, considering metrics like precision, recall, and F1-score. We address the challenge of low-profiled attack classification by fusing models to create a comprehensive solution. Our findings emphasize the potential of this approach to improve DoS attack detection and contribute to stronger defense mechanisms.
△ Less
Submitted 1 October, 2023;
originally announced December 2023.
-
Lightcone Hamiltonian for Ising Field Theory I: T < T_c
Authors:
A. Liam Fitzpatrick,
Emanuel Katz,
Yuan Xin
Abstract:
We study 2d Ising Field Theory (IFT) in the low-temperature phase in lightcone quantization, and show that integrating out zero modes generates a very compact form for the effective lightcone interaction that depends on the finite volume vacuum expectation value of the $σ$ operator. This form is most naturally understood in a conformal basis for the lightcone Hilbert space. We further verify that…
▽ More
We study 2d Ising Field Theory (IFT) in the low-temperature phase in lightcone quantization, and show that integrating out zero modes generates a very compact form for the effective lightcone interaction that depends on the finite volume vacuum expectation value of the $σ$ operator. This form is most naturally understood in a conformal basis for the lightcone Hilbert space. We further verify that this simple form reproduces to high accuracy results for the spectra, the $c$-function, and the form-factors from integrability methods for the magnetic deformation of IFT. For generic non-integrable values of parameters we also compute the above observables and compare our numeric results to those of equal-time truncation. In particular, we report on new measurements of various bound-state form-factors as well as the stress-tensor spectral density. We find that the stress tensor spectral density provides additional evidence that certain resonances of IFT are surprisingly narrow, even at generic strong coupling. Explicit example code for constructing the effective Hamiltonian is included in an appendix.
△ Less
Submitted 27 November, 2023;
originally announced November 2023.
-
Curvature estimates of ancient solutions to the mean curvature flow of higher codimension with convex Gauss image
Authors:
Hongbing Qiu,
Y. L. Xin
Abstract:
By carrying out refined curvature estimates, we prove better rigidity theorems of complete noncompact ancient solutions to the mean curvature flow in higher codimension under various Gauss image restriction.
By carrying out refined curvature estimates, we prove better rigidity theorems of complete noncompact ancient solutions to the mean curvature flow in higher codimension under various Gauss image restriction.
△ Less
Submitted 21 November, 2023;
originally announced November 2023.
-
A causal intervention framework for synthesizing mobility data and evaluating predictive neural networks
Authors:
Ye Hong,
Yanan Xin,
Simon Dirmeier,
Fernando Perez-Cruz,
Martin Raubal
Abstract:
Deep neural networks are increasingly utilized in mobility prediction tasks, yet their intricate internal workings pose challenges for interpretability, especially in comprehending how various aspects of mobility behavior affect predictions. This study introduces a causal intervention framework to assess the impact of mobility-related factors on neural networks designed for next location predictio…
▽ More
Deep neural networks are increasingly utilized in mobility prediction tasks, yet their intricate internal workings pose challenges for interpretability, especially in comprehending how various aspects of mobility behavior affect predictions. This study introduces a causal intervention framework to assess the impact of mobility-related factors on neural networks designed for next location prediction -- a task focusing on predicting the immediate next location of an individual. To achieve this, we employ individual mobility models to synthesize location visit sequences and control behavior dynamics by intervening in their data generation process. We evaluate the interventional location sequences using mobility metrics and input them into well-trained networks to analyze performance variations. The results demonstrate the effectiveness in producing location sequences with distinct mobility behaviors, thereby facilitating the simulation of diverse yet realistic spatial and temporal changes. These changes result in performance fluctuations in next location prediction networks, revealing impacts of critical mobility behavior factors, including sequential patterns in location transitions, proclivity for exploring new locations, and preferences in location choices at population and individual levels. The gained insights hold value for the real-world application of mobility prediction networks, and the framework is expected to promote the use of causal inference to enhance the interpretability and robustness of neural networks in mobility applications.
△ Less
Submitted 1 August, 2024; v1 submitted 20 November, 2023;
originally announced November 2023.
-
Source Prompt: Coordinated Pre-training of Language Models on Diverse Corpora from Multiple Sources
Authors:
Yipei Xu,
Dakuan Lu,
Jiaqing Liang,
Xintao Wang,
Yipeng Geng,
Yingsi Xin,
Hengkui Wu,
Ken Chen,
ruiji zhang,
Yanghua Xiao
Abstract:
Pre-trained language models (PLMs) have established the new paradigm in the field of NLP. For more powerful PLMs, one of the most popular and successful way is to continuously scale up sizes of the models and the pre-training corpora. These large corpora are generally obtained by converging smaller ones from multiple sources, they are thus growing increasingly diverse. However, the side-effects of…
▽ More
Pre-trained language models (PLMs) have established the new paradigm in the field of NLP. For more powerful PLMs, one of the most popular and successful way is to continuously scale up sizes of the models and the pre-training corpora. These large corpora are generally obtained by converging smaller ones from multiple sources, they are thus growing increasingly diverse. However, the side-effects of these colossal converged corpora remain understudied. In this paper, we identify the disadvantage of heterogeneous corpora from multiple sources for pre-training PLMs. Towards coordinated pre-training on diverse corpora, we further propose source prompts (SP), which explicitly prompt the model of the data source at the pre-training and fine-tuning stages. Results of extensive experiments demonstrate that PLMs pre-trained with SP on diverse corpora gain significant improvement in various downstream tasks.
△ Less
Submitted 16 November, 2023;
originally announced November 2023.