-
Dust and Power: Unravelling the merger -- AGN connection in the second half of the cosmic history
Authors:
A. La Marca,
B. Margalef-Bentabol,
L. Wang,
F. Gao,
A. D. Goulding,
G. Martin,
V. Rodriguez-Gomez,
S. C. Trager,
G. Yang,
R. Davé,
Y. Dubois
Abstract:
Galaxy mergers represent a fundamental physical process under hierarchical structure formation, but their role in triggering AGNs is still unclear. We aim to investigate the merger-AGN connection using state-of-the-art observations and novel methods in detecting mergers and AGNs. We selected stellar mass-limited samples at redshift z<1 from KiDS, focusing on the KiDS-N-W2 field with a wide range o…
▽ More
Galaxy mergers represent a fundamental physical process under hierarchical structure formation, but their role in triggering AGNs is still unclear. We aim to investigate the merger-AGN connection using state-of-the-art observations and novel methods in detecting mergers and AGNs. We selected stellar mass-limited samples at redshift z<1 from KiDS, focusing on the KiDS-N-W2 field with a wide range of multi-wavelength data. Three AGN types, selected in the MIR, X-ray, and via SED modelling, were analysed. To identify mergers, we used convolutional neural networks trained on two cosmological simulations. We created mass and redshift-matched control samples of non-mergers and non-AGNs. We observe a clear AGN excess (a factor of 2-3) in mergers with respect to non-mergers for the MIR AGNs, and a mild excess for the X-ray and SED AGNs, indicating that mergers could trigger all 3 types but are more connected with the MIR AGNs. About half of the MIR AGNs are in mergers but it is unclear whether mergers are the main trigger. For the X-ray and SED AGNs, mergers are unlikely to be the dominant trigger. We also explore the relation using the continuous AGN fraction $f_{AGN}$ parameter. Mergers exhibit a clear excess of high $f_{AGN}$ values relative to non-mergers, for all AGNs. We unveil the first merger fraction $f_{merg}-f_{AGN}$ relation with two distinct regimes. When the AGN is not dominant, the relation is only mildly increasing or even flat, with the MIR AGNs showing the highest $f_{merg}$. In the regime of very dominant AGNs ($f_{AGN}\geq0.8$), $f_{merg}$ shows a steeply rising trend with increasing $f_{AGN}$ for all AGN types. These trends are also seen when plotted against AGN bolometric luminosity. We conclude that mergers are most connected with dust-obscured AGNs (linked to a fast-growing phase of the SMBH) and are the main or even the sole fuelling mechanism of the most powerful AGNs.
△ Less
Submitted 25 July, 2024;
originally announced July 2024.
-
XS-VID: An Extremely Small Video Object Detection Dataset
Authors:
Jiahao Guo,
Ziyang Xu,
Lianjun Wu,
Fei Gao,
Wenyu Liu,
Xinggang Wang
Abstract:
Small Video Object Detection (SVOD) is a crucial subfield in modern computer vision, essential for early object discovery and detection. However, existing SVOD datasets are scarce and suffer from issues such as insufficiently small objects, limited object categories, and lack of scene diversity, leading to unitary application scenarios for corresponding methods. To address this gap, we develop the…
▽ More
Small Video Object Detection (SVOD) is a crucial subfield in modern computer vision, essential for early object discovery and detection. However, existing SVOD datasets are scarce and suffer from issues such as insufficiently small objects, limited object categories, and lack of scene diversity, leading to unitary application scenarios for corresponding methods. To address this gap, we develop the XS-VID dataset, which comprises aerial data from various periods and scenes, and annotates eight major object categories. To further evaluate existing methods for detecting extremely small objects, XS-VID extensively collects three types of objects with smaller pixel areas: extremely small (\textit{es}, $0\sim12^2$), relatively small (\textit{rs}, $12^2\sim20^2$), and generally small (\textit{gs}, $20^2\sim32^2$). XS-VID offers unprecedented breadth and depth in covering and quantifying minuscule objects, significantly enriching the scene and object diversity in the dataset. Extensive validations on XS-VID and the publicly available VisDrone2019VID dataset show that existing methods struggle with small object detection and significantly underperform compared to general object detectors. Leveraging the strengths of previous methods and addressing their weaknesses, we propose YOLOFT, which enhances local feature associations and integrates temporal motion features, significantly improving the accuracy and stability of SVOD. Our datasets and benchmarks are available at \url{https://gjhhust.github.io/XS-VID/}.
△ Less
Submitted 25 July, 2024;
originally announced July 2024.
-
Baryogenesis and first-order QCD transition with gravitational waves from a large lepton asymmetry
Authors:
Fei Gao,
Julia Harz,
Chandan Hati,
Yi Lu,
Isabel M. Oldengott,
Graham White
Abstract:
A large primordial lepton asymmetry can lead to successful baryogenesis by preventing the restoration of electroweak symmetry at high temperatures, thereby suppressing the sphaleron rate. This asymmetry can also lead to a first-order cosmic QCD transition, accompanied by detectable gravitational wave (GW) signals. By employing next-to-leading order dimensional reduction we determine that the neces…
▽ More
A large primordial lepton asymmetry can lead to successful baryogenesis by preventing the restoration of electroweak symmetry at high temperatures, thereby suppressing the sphaleron rate. This asymmetry can also lead to a first-order cosmic QCD transition, accompanied by detectable gravitational wave (GW) signals. By employing next-to-leading order dimensional reduction we determine that the necessary lepton asymmetry is approximately one order of magnitude smaller than previously estimated. Incorporating an updated QCD equation of state that harmonizes lattice and functional QCD outcomes, we pinpoint the range of lepton flavor asymmetries capable of inducing a first-order cosmic QCD transition. To maintain consistency with observational constraints from the Cosmic Microwave Background and Big Bang Nucleosynthesis, achieving the correct baryon asymmetry requires entropy dilution by approximately a factor of ten. However, the first-order QCD transition itself can occur independently of entropy dilution. We propose that the sphaleron freeze-in mechanism can be investigated through forthcoming GW experiments such as $μ$Ares.
△ Less
Submitted 24 July, 2024;
originally announced July 2024.
-
Distinct moiré trions in a twisted semiconductor homobilayer
Authors:
Zhida Liu,
Haonan Wang,
Xiaohui Liu,
Yue Ni,
Frank Gao,
Saba Arash,
Dong Seob Kim,
Xiangcheng Liu,
Yongxin Zeng,
Jiamin Quan,
Di Huang,
Kenji Watanabe,
Takashi Taniguchi,
Edoardo Baldini,
Allan H. MacDonald,
Chih-Kang Shih,
Li Yang,
Xiaoqin Li
Abstract:
Many fascinating properties discovered in graphene and transition metal dichalcogenide (TMD) moiré superlattices originate from flat bands and enhanced many-body effects. Here, we discover new many-electron excited states in TMD homobilayers. As optical resonances evolve with twist angle and doping in MoSe$_2$ bilayers, a unique type of ``charge-transfer" trions is observed when gradual changes in…
▽ More
Many fascinating properties discovered in graphene and transition metal dichalcogenide (TMD) moiré superlattices originate from flat bands and enhanced many-body effects. Here, we discover new many-electron excited states in TMD homobilayers. As optical resonances evolve with twist angle and doping in MoSe$_2$ bilayers, a unique type of ``charge-transfer" trions is observed when gradual changes in atomic alignment between the layers occur. In real space, the optically excited electron-hole pair mostly resides in a different site from the doped hole in a moiré supercell. In momentum space, the electron-hole pair forms in the single-particle-band $K$-valley, while the hole occupies the $Γ$-valley. The rich internal structure of this trion resonance arises from the ultra-flatness of the first valence band and the distinct influence of moiré potential modulation on holes and excitons. Our findings open new routes to realizing photon-spin transduction or implementing moiré quantum simulators with independently tunable fermion and boson densities.
△ Less
Submitted 24 July, 2024;
originally announced July 2024.
-
WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation
Authors:
Zirui Shao,
Feiyu Gao,
Hangdi Xing,
Zepeng Zhu,
Zhi Yu,
Jiajun Bu,
Qi Zheng,
Cong Yao
Abstract:
In the era of content creation revolution propelled by advancements in generative models, the field of web design remains unexplored despite its critical role in modern digital communication. The web design process is complex and often time-consuming, especially for those with limited expertise. In this paper, we introduce Web Rendering Parameters Generation (WebRPG), a new task that aims at autom…
▽ More
In the era of content creation revolution propelled by advancements in generative models, the field of web design remains unexplored despite its critical role in modern digital communication. The web design process is complex and often time-consuming, especially for those with limited expertise. In this paper, we introduce Web Rendering Parameters Generation (WebRPG), a new task that aims at automating the generation for visual presentation of web pages based on their HTML code. WebRPG would contribute to a faster web development workflow. Since there is no existing benchmark available, we develop a new dataset for WebRPG through an automated pipeline. Moreover, we present baseline models, utilizing VAE to manage numerous elements and rendering parameters, along with custom HTML embedding for capturing essential semantic and hierarchical information from HTML. Extensive experiments, including customized quantitative evaluations for this specific task, are conducted to evaluate the quality of the generated results.
△ Less
Submitted 22 July, 2024;
originally announced July 2024.
-
Adapted Swin Transformer-based Real-Time Plasma Shape Detection and Control in HL-3
Authors:
Qianyun Dong,
Zhengwei Chen,
Rongpeng Li,
Zongyu Yang,
Feng Gao,
Yihang Chen,
Fan Xia,
Wulyu Zhong,
Zhifeng Zhao
Abstract:
In the field of magnetic confinement plasma control, the accurate feedback of plasma position and shape primarily relies on calculations derived from magnetic measurements through equilibrium reconstruction or matrix mapping method. However, under harsh conditions like high-energy neutron radiation and elevated temperatures, the installation of magnetic probes within the device becomes challenging…
▽ More
In the field of magnetic confinement plasma control, the accurate feedback of plasma position and shape primarily relies on calculations derived from magnetic measurements through equilibrium reconstruction or matrix mapping method. However, under harsh conditions like high-energy neutron radiation and elevated temperatures, the installation of magnetic probes within the device becomes challenging. Relying solely on external magnetic probes can compromise the precision of EFIT in determining the plasma shape. To tackle this issue, we introduce a real-time, non-magnetic measurement method on the HL-3 tokamak, which diagnoses the plasma position and shape via imaging. Particularly, we put forward an adapted Swin Transformer model, the Poolformer Swin Transformer (PST), to accurately and fastly interpret the plasma shape from the Charge-Coupled Device Camera (CCD) images. By adopting multi-task learning and knowledge distillation techniques, the model is capable of robustly detecting six shape parameters under disruptive conditions such as a divertor shape and gas injection, circumventing global brightness changes and cumbersome manual labeling. Specifically, the well-trained PST model capably infers R and Z within the mean average error below 1.1 cm and 1.8 cm, respectively, while requiring less than 2 ms for end-to-end feedback, an 80 improvement over the smallest Swin Transformer model, laying the foundation for real-time control. Finally, we deploy the PST model in the Plasma Control System (PCS) using TensorRT, and achieve 500 ms stable PID feedback control based on the PST-computed horizontal displacement information. In conclusion, this research opens up new avenues for the practical application of image-computing plasma shape diagnostic methods in the realm of real-time feedback control.
△ Less
Submitted 19 July, 2024;
originally announced July 2024.
-
Visual Text Generation in the Wild
Authors:
Yuanzhi Zhu,
Jiawei Liu,
Feiyu Gao,
Wenyu Liu,
Xinggang Wang,
Peng Wang,
Fei Huang,
Cong Yao,
Zhibo Yang
Abstract:
Recently, with the rapid advancements of generative models, the field of visual text generation has witnessed significant progress. However, it is still challenging to render high-quality text images in real-world scenarios, as three critical criteria should be satisfied: (1) Fidelity: the generated text images should be photo-realistic and the contents are expected to be the same as specified in…
▽ More
Recently, with the rapid advancements of generative models, the field of visual text generation has witnessed significant progress. However, it is still challenging to render high-quality text images in real-world scenarios, as three critical criteria should be satisfied: (1) Fidelity: the generated text images should be photo-realistic and the contents are expected to be the same as specified in the given conditions; (2) Reasonability: the regions and contents of the generated text should cohere with the scene; (3) Utility: the generated text images can facilitate related tasks (e.g., text detection and recognition). Upon investigation, we find that existing methods, either rendering-based or diffusion-based, can hardly meet all these aspects simultaneously, limiting their application range. Therefore, we propose in this paper a visual text generator (termed SceneVTG), which can produce high-quality text images in the wild. Following a two-stage paradigm, SceneVTG leverages a Multimodal Large Language Model to recommend reasonable text regions and contents across multiple scales and levels, which are used by a conditional diffusion model as conditions to generate text images. Extensive experiments demonstrate that the proposed SceneVTG significantly outperforms traditional rendering-based methods and recent diffusion-based methods in terms of fidelity and reasonability. Besides, the generated images provide superior utility for tasks involving text detection and text recognition. Code and datasets are available at AdvancedLiterateMachinery.
△ Less
Submitted 19 July, 2024;
originally announced July 2024.
-
Wavelet-based Bi-dimensional Aggregation Network for SAR Image Change Detection
Authors:
Jiangwei Xie,
Feng Gao,
Xiaowei Zhou,
Junyu Dong
Abstract:
Synthetic aperture radar (SAR) image change detection is critical in remote sensing image analysis. Recently, the attention mechanism has been widely used in change detection tasks. However, existing attention mechanisms often employ down-sampling operations such as average pooling on the Key and Value components to enhance computational efficiency. These irreversible operations result in the loss…
▽ More
Synthetic aperture radar (SAR) image change detection is critical in remote sensing image analysis. Recently, the attention mechanism has been widely used in change detection tasks. However, existing attention mechanisms often employ down-sampling operations such as average pooling on the Key and Value components to enhance computational efficiency. These irreversible operations result in the loss of high-frequency components and other important information. To address this limitation, we develop Wavelet-based Bi-dimensional Aggregation Network (WBANet) for SAR image change detection. We design a wavelet-based self-attention block that includes discrete wavelet transform and inverse discrete wavelet transform operations on Key and Value components. Hence, the feature undergoes downsampling without any loss of information, while simultaneously enhancing local contextual awareness through an expanded receptive field. Additionally, we have incorporated a bi-dimensional aggregation module that boosts the non-linear representation capability by merging spatial and channel information via broadcast mechanism. Experimental results on three SAR datasets demonstrate that our WBANet significantly outperforms contemporary state-of-the-art methods. Specifically, our WBANet achieves 98.33\%, 96.65\%, and 96.62\% of percentage of correct classification (PCC) on the respective datasets, highlighting its superior performance. Source codes are available at \url{https://github.com/summitgao/WBANet}.
△ Less
Submitted 18 July, 2024;
originally announced July 2024.
-
Supernova Pointing Capabilities of DUNE
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
B. Aimard,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
A. Alton,
R. Alvarez,
T. Alves,
H. Amar,
P. Amedo,
J. Anderson,
D. A. Andrade
, et al. (1340 additional authors not shown)
Abstract:
The determination of the direction of a stellar core collapse via its neutrino emission is crucial for the identification of the progenitor for a multimessenger follow-up. A highly effective method of reconstructing supernova directions within the Deep Underground Neutrino Experiment (DUNE) is introduced. The supernova neutrino pointing resolution is studied by simulating and reconstructing electr…
▽ More
The determination of the direction of a stellar core collapse via its neutrino emission is crucial for the identification of the progenitor for a multimessenger follow-up. A highly effective method of reconstructing supernova directions within the Deep Underground Neutrino Experiment (DUNE) is introduced. The supernova neutrino pointing resolution is studied by simulating and reconstructing electron-neutrino charged-current absorption on $^{40}$Ar and elastic scattering of neutrinos on electrons. Procedures to reconstruct individual interactions, including a newly developed technique called ``brems flipping'', as well as the burst direction from an ensemble of interactions are described. Performance of the burst direction reconstruction is evaluated for supernovae happening at a distance of 10 kpc for a specific supernova burst flux model. The pointing resolution is found to be 3.4 degrees at 68% coverage for a perfect interaction-channel classification and a fiducial mass of 40 kton, and 6.6 degrees for a 10 kton fiducial mass respectively. Assuming a 4% rate of charged-current interactions being misidentified as elastic scattering, DUNE's burst pointing resolution is found to be 4.3 degrees (8.7 degrees) at 68% coverage.
△ Less
Submitted 14 July, 2024;
originally announced July 2024.
-
WING: Wheel-Inertial Neural Odometry with Ground Manifold Constraints
Authors:
Chenxing Jiang,
Kunyi Zhang,
Sheng Yang,
Shaojie Shen,
Chao Xu,
Fei Gao
Abstract:
In this paper, we propose an interoceptive-only odometry system for ground robots with neural network processing and soft constraints based on the assumption of a globally continuous ground manifold. Exteroceptive sensors such as cameras, GPS and LiDAR may encounter difficulties in scenarios with poor illumination, indoor environments, dusty areas and straight tunnels. Therefore, improving the pos…
▽ More
In this paper, we propose an interoceptive-only odometry system for ground robots with neural network processing and soft constraints based on the assumption of a globally continuous ground manifold. Exteroceptive sensors such as cameras, GPS and LiDAR may encounter difficulties in scenarios with poor illumination, indoor environments, dusty areas and straight tunnels. Therefore, improving the pose estimation accuracy only using interoceptive sensors is important to enhance the reliability of navigation system even in degrading scenarios mentioned above. However, interoceptive sensors like IMU and wheel encoders suffer from large drift due to noisy measurements. To overcome these challenges, the proposed system trains deep neural networks to correct the measurements from IMU and wheel encoders, while considering their uncertainty. Moreover, because ground robots can only travel on the ground, we model the ground surface as a globally continuous manifold using a dual cubic B-spline manifold to further improve the estimation accuracy by this soft constraint. A novel space-based sliding-window filtering framework is proposed to fully exploit the $C^2$ continuity of ground manifold soft constraints and fuse all the information from raw measurements and neural networks in a yaw-independent attitude convention. Extensive experiments demonstrate that our proposed approach can outperform state-of-the-art learning-based interoceptive-only odometry methods.
△ Less
Submitted 23 July, 2024; v1 submitted 14 July, 2024;
originally announced July 2024.
-
6D Motion Parameters Estimation in Monostatic Integrated Sensing and Communications System
Authors:
Hongliang Luo,
Feifei Gao,
Fan Liu,
Shi Jin
Abstract:
In this paper, we propose a novel scheme to estimate the six dimensional (6D) motion parameters of dynamic target for monostatic integrated sensing and communications (ISAC) system. We first provide a generic ISAC framework for dynamic target sensing based on massive multiple input and multiple output (MIMO) array. Next, we derive the relationship between the sensing channel of ISAC base station (…
▽ More
In this paper, we propose a novel scheme to estimate the six dimensional (6D) motion parameters of dynamic target for monostatic integrated sensing and communications (ISAC) system. We first provide a generic ISAC framework for dynamic target sensing based on massive multiple input and multiple output (MIMO) array. Next, we derive the relationship between the sensing channel of ISAC base station (BS) and the 6D motion parameters of dynamic target. Then, we employ the array signal processing methods to estimate the horizontal angle, pitch angle, distance, and virtual velocity of dynamic target. Since the virtual velocities observed by different antennas are different, we adopt plane fitting to estimate the dynamic target's radial velocity, horizontal angular velocity, and pitch angular velocity from these virtual velocities. Simulation results demonstrate the effectiveness of the proposed 6D motion parameters estimation scheme, which also confirms a new finding that one single BS with massive MIMO array is capable of estimating the horizontal angular velocity and pitch angular velocity of dynamic target.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
OFDM Achieves the Lowest Ranging Sidelobe Under Random ISAC Signaling
Authors:
Fan Liu,
Ying Zhang,
Yifeng Xiong,
Shuangyang Li,
Weijie Yuan,
Feifei Gao,
Shi Jin,
Giuseppe Caire
Abstract:
This paper aims to answer a fundamental question in the area of Integrated Sensing and Communications (ISAC): What is the optimal communication-centric ISAC waveform for ranging? Towards that end, we first established a generic framework to analyze the sensing performance of communication-centric ISAC waveforms built upon orthonormal signaling bases and random data symbols. Then, we evaluated thei…
▽ More
This paper aims to answer a fundamental question in the area of Integrated Sensing and Communications (ISAC): What is the optimal communication-centric ISAC waveform for ranging? Towards that end, we first established a generic framework to analyze the sensing performance of communication-centric ISAC waveforms built upon orthonormal signaling bases and random data symbols. Then, we evaluated their ranging performance by adopting both the periodic and aperiodic auto-correlation functions (P-ACF and A-ACF), and defined the expectation of the integrated sidelobe level (EISL) as a sensing performance metric. On top of that, we proved that among all communication waveforms with cyclic prefix (CP), the orthogonal frequency division multiplexing (OFDM) modulation is the only globally optimal waveform that achieves the lowest ranging sidelobe for quadrature amplitude modulation (QAM) and phase shift keying (PSK) constellations, in terms of both the EISL and the sidelobe level at each individual lag of the P-ACF. As a step forward, we proved that among all communication waveforms without CP, OFDM is a locally optimal waveform for QAM/PSK in the sense that it achieves a local minimum of the EISL of the A-ACF. Finally, we demonstrated by numerical results that under QAM/PSK constellations, there is no other orthogonal communication-centric waveform that achieves a lower ranging sidelobe level than that of the OFDM, in terms of both P-ACF and A-ACF cases.
△ Less
Submitted 9 July, 2024;
originally announced July 2024.
-
Implementation of Composite Photon Blockade Based on Four-wave Mixing System
Authors:
Hongyu Lina,
Zhi-Hai Yao,
Xiao-Qian Wang,
Feng Gao
Abstract:
A high-quality single-photon blockade system can effectively enhance the quality of single-photon sources. Conventional photon blockade(CPB) suffers from low single-photon purity and high requirements for system nonlinearity, while unconventional photon blockade(UPB) has the disadvantage of low brightness. Recent research by [Laser Photon.Rev 14,1900279,2020] demonstrates that UPB can be used to e…
▽ More
A high-quality single-photon blockade system can effectively enhance the quality of single-photon sources. Conventional photon blockade(CPB) suffers from low single-photon purity and high requirements for system nonlinearity, while unconventional photon blockade(UPB) has the disadvantage of low brightness. Recent research by [Laser Photon.Rev 14,1900279,2020] demonstrates that UPB can be used to enhance the strength of CPB, thereby improving the purity of single-photon sources. Research by [Opt. Express 30(12),21787,2022] shows that there is an intersection point between CPB and UPB in certain nonlinear systems, where the performance of single photons is better. In this study, we investigated the phenomenon of photon blockade in a non-degenerate four-wave mixing system, where CPB and UPB can occur simultaneously within the same parameter range. We refer to this phenomenon as composite photon blockade. Particularly, when the system achieves composite photon blockade, the value of g(2)(0) is smaller, and there are more single photons. We conducted analytical analysis and numerical calculations to study the conditions for the realization of CPB, UPB, and 2PB in the system, and discussed in detail the influence of system parameters on various blockade effects.
△ Less
Submitted 8 July, 2024;
originally announced July 2024.
-
Active-Controlled Trial Design for HIV Prevention Trials with a Counterfactual Placebo
Authors:
Fei Gao,
Holly Janes,
Susan Buchbinder,
Deborah Donnell
Abstract:
In the quest for enhanced HIV prevention methods, the advent of antiretroviral drugs as pre-exposure prophylaxis (PrEP) has marked a significant stride forward. However, the ethical challenges in conducting placebo-controlled trials for new PrEP agents against a backdrop of highly effective existing PrEP options necessitates innovative approaches. This manuscript delves into the design and impleme…
▽ More
In the quest for enhanced HIV prevention methods, the advent of antiretroviral drugs as pre-exposure prophylaxis (PrEP) has marked a significant stride forward. However, the ethical challenges in conducting placebo-controlled trials for new PrEP agents against a backdrop of highly effective existing PrEP options necessitates innovative approaches. This manuscript delves into the design and implementation of active-controlled trials that incorporate a counterfactual placebo estimate - a theoretical estimate of what HIV incidence would have been without effective prevention. We introduce a novel statistical framework for regulatory approval of new PrEP agents, predicated on the assumption of an available and consistent counterfactual placebo estimate. Our approach aims to assess the absolute efficacy (i.e., against placebo) of the new PrEP agent relative to the absolute efficacy of the active control. We propose a two-step procedure for hypothesis testing and further develop an approach that addresses potential biases inherent in non-randomized comparison to counterfactual placebos. By exploring different scenarios with moderately and highly effective active controls and counterfactual placebo estimates from various sources, we demonstrate how our design can significantly reduce sample sizes compared to traditional non-inferiority trials and offer a robust framework for evaluating new PrEP agents. This work contributes to the methodological repertoire for HIV prevention trials and underscores the importance of adaptability in the face of ethical and practical challenges.
△ Less
Submitted 5 July, 2024;
originally announced July 2024.
-
A quantitative analysis of Gravitational Wave spectrum sourced from First-Order Chiral Phase Transition of QCD
Authors:
Hui-wen Zheng,
Fei Gao,
Ligong Bian,
Si-xue Qin,
Yu-xin Liu
Abstract:
We investigate the cosmological first-order chiral phase transition of QCD, and for the first time calculate its parameters which can fully determine the gravitational wave spectrum. With the state-of-the-art calculation from the functional QCD method, we found that the large chemical potential of QCD phase transition results in very weak and fast first-order phase transitions at the temperature l…
▽ More
We investigate the cosmological first-order chiral phase transition of QCD, and for the first time calculate its parameters which can fully determine the gravitational wave spectrum. With the state-of-the-art calculation from the functional QCD method, we found that the large chemical potential of QCD phase transition results in very weak and fast first-order phase transitions at the temperature lower than $\mathcal{O}(10^2)$ MeV. These results further suggest that the GW signals of NANOGrav are very unlikely sourced from the chiral phase transition of QCD.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
Limited-View Photoacoustic Imaging Reconstruction Via High-quality Self-supervised Neural Representation
Authors:
Youshen xiao,
Yuting Shen,
Bowei Yao,
Xiran Cai,
Yuyao Zhang,
Fei Gao
Abstract:
In practical applications within the human body, it is often challenging to fully encompass the target tissue or organ, necessitating the use of limited-view arrays, which can lead to the loss of crucial information. Addressing the reconstruction of photoacoustic sensor signals in limited-view detection spaces has become a focal point of current research. In this study, we introduce a self-supervi…
▽ More
In practical applications within the human body, it is often challenging to fully encompass the target tissue or organ, necessitating the use of limited-view arrays, which can lead to the loss of crucial information. Addressing the reconstruction of photoacoustic sensor signals in limited-view detection spaces has become a focal point of current research. In this study, we introduce a self-supervised network termed HIgh-quality Self-supervised neural representation (HIS), which tackles the inverse problem of photoacoustic imaging to reconstruct high-quality photoacoustic images from sensor data acquired under limited viewpoints. We regard the desired reconstructed photoacoustic image as an implicit continuous function in 2D image space, viewing the pixels of the image as sparse discrete samples. The HIS's objective is to learn the continuous function from limited observations by utilizing a fully connected neural network combined with Fourier feature position encoding. By simply minimizing the error between the network's predicted sensor data and the actual sensor data, HIS is trained to represent the observed continuous model. The results indicate that the proposed HIS model offers superior image reconstruction quality compared to three commonly used methods for photoacoustic image reconstruction.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
Electromagnetic Property Sensing Based on Diffusion Model in ISAC System
Authors:
Yuhua Jiang,
Feifei Gao,
Shi Jin,
Tie Jun Cui
Abstract:
Integrated sensing and communications (ISAC) has opened up numerous game-changing opportunities for future wireless systems. In this paper, we develop a novel ISAC scheme that utilizes the diffusion model to sense the electromagnetic (EM) property of the target in a predetermined sensing area. Specifically, we first estimate the sensing channel by using both the communications and the sensing sign…
▽ More
Integrated sensing and communications (ISAC) has opened up numerous game-changing opportunities for future wireless systems. In this paper, we develop a novel ISAC scheme that utilizes the diffusion model to sense the electromagnetic (EM) property of the target in a predetermined sensing area. Specifically, we first estimate the sensing channel by using both the communications and the sensing signals echoed back from the target. Then we employ the diffusion model to generate the point cloud that represents the target and thus enables 3D visualization of the target's EM property distribution. In order to minimize the mean Chamfer distance (MCD) between the ground truth and the estimated point clouds, we further design the communications and sensing beamforming matrices under the constraint of a maximum transmit power and a minimum communications achievable rate for each user equipment (UE). Simulation results demonstrate the efficacy of the proposed method in achieving high-quality reconstruction of the target's shape, relative permittivity, and conductivity. Besides, the proposed method can sense the EM property of the target effectively in any position of the sensing area.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
NOEMA formIng Cluster survEy (NICE): Characterizing eight massive galaxy groups at $1.5 < z < 4$ in the COSMOS field
Authors:
Nikolaj B. Sillassen,
Shuowen Jin,
Georgios E. Magdis,
Emanuele Daddi,
Tao Wang,
Shiying Lu,
Hanwen Sun,
Vinod Arumugam,
Daizhong Liu,
Malte Brinch,
Chiara D'Eugenio,
Raphael Gobat,
Carlos Gómez-Guijarro,
Michael Rich,
Eva Schinnerer,
Veronica Strazzullo,
Qinghua Tan,
Francesco Valentino,
Yijun Wang,
Mengyuan Xiao,
Luwenjia Zhou,
David Blánquez-Sesé,
Zheng Cai,
Yanmei Chen,
Laure Ciesla
, et al. (19 additional authors not shown)
Abstract:
The NOEMA formIng Cluster survEy (NICE) is a large program targeting 69 massive galaxy group candidates at $z>2$ in six deep fields. We report spectroscopic confirmation of eight groups at $1.65\leq z\leq3.61$ in COSMOS. Homogeneously selected as significant overdensities of red IRAC sources with red Herschel colors, four groups are confirmed by CO and [CI] with NOEMA 3mm observations, three are c…
▽ More
The NOEMA formIng Cluster survEy (NICE) is a large program targeting 69 massive galaxy group candidates at $z>2$ in six deep fields. We report spectroscopic confirmation of eight groups at $1.65\leq z\leq3.61$ in COSMOS. Homogeneously selected as significant overdensities of red IRAC sources with red Herschel colors, four groups are confirmed by CO and [CI] with NOEMA 3mm observations, three are confirmed with ALMA, and one is confirmed by H$α$ from Subaru/FMOS. We constructed the integrated FIR SEDs for the eight groups, obtaining total IR SFR $=260-1300~{\rm M_\odot}$~yr$^{-1}$. We adopted six methods to estimate the dark matter masses, including stellar mass to halo mass relations, overdensity with galaxy bias, and NFW profile fitting to radial stellar mass density. We found the radial stellar mass density are consistent with a NFW profile, supporting that they are collapsed structures hosted by a single dark matter halo. The best halo mass estimates are $\log(M_{\rm h}/{\rm M_\odot})=12.8-13.7$ with uncertainty of 0.3 dex. From halo mass estimates, we derive baryonic accretion rate ${\rm BAR}=(1-8)\times10^{3}\,{\rm M_{\odot}/yr}$ for this sample. We find a quasi-linear correlation between the integrated SFR/BAR and the theoretical halo mass limit for cold streams, $M_{\rm stream}/M_{\rm h}$, with ${\rm SFR/BAR}=10^{-0.46\pm0.22}\left({M_{\rm stream}/M_{\rm h}}\right)^{0.71\pm0.16}$ with a scatter of $0.40\,{\rm dex}$. Further, we compare halo masses and stellar masses with simulations, and find all structures are consistent with being progenitors of $M_{\rm h}(z=0)>10^{14}\,{\rm M_{\odot}}$ galaxy clusters, and the most massive central galaxies have stellar masses consistent with brightest cluster galaxies (BCGs) progenitors in the TNG300 simulation. The results strongly suggest these structures are forming massive galaxy clusters via baryonic and dark matter accretion.
△ Less
Submitted 5 July, 2024; v1 submitted 3 July, 2024;
originally announced July 2024.
-
Aligning Human Motion Generation with Human Perceptions
Authors:
Haoru Wang,
Wentao Zhu,
Luyi Miao,
Yishu Xu,
Feng Gao,
Qi Tian,
Yizhou Wang
Abstract:
Human motion generation is a critical task with a wide range of applications. Achieving high realism in generated motions requires naturalness, smoothness, and plausibility. Despite rapid advancements in the field, current generation methods often fall short of these goals. Furthermore, existing evaluation metrics typically rely on ground-truth-based errors, simple heuristics, or distribution dist…
▽ More
Human motion generation is a critical task with a wide range of applications. Achieving high realism in generated motions requires naturalness, smoothness, and plausibility. Despite rapid advancements in the field, current generation methods often fall short of these goals. Furthermore, existing evaluation metrics typically rely on ground-truth-based errors, simple heuristics, or distribution distances, which do not align well with human perceptions of motion quality. In this work, we propose a data-driven approach to bridge this gap by introducing a large-scale human perceptual evaluation dataset, MotionPercept, and a human motion critic model, MotionCritic, that capture human perceptual preferences. Our critic model offers a more accurate metric for assessing motion quality and could be readily integrated into the motion generation pipeline to enhance generation quality. Extensive experiments demonstrate the effectiveness of our approach in both evaluating and improving the quality of generated human motions by aligning with human perceptions. Code and data are publicly available at https://motioncritic.github.io/.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
Preserving Relative Localization of FoV-Limited Drone Swarm via Active Mutual Observation
Authors:
Lianjie Guo,
Zaitian Gongye,
Ziyi Xu,
Yingjian Wang,
Xin Zhou,
Jinni Zhou,
Fei Gao
Abstract:
Relative state estimation is crucial for vision-based swarms to estimate and compensate for the unavoidable drift of visual odometry. For autonomous drones equipped with the most compact sensor setting -- a stereo camera that provides a limited field of view (FoV), the demand for mutual observation for relative state estimation conflicts with the demand for environment observation. To balance the…
▽ More
Relative state estimation is crucial for vision-based swarms to estimate and compensate for the unavoidable drift of visual odometry. For autonomous drones equipped with the most compact sensor setting -- a stereo camera that provides a limited field of view (FoV), the demand for mutual observation for relative state estimation conflicts with the demand for environment observation. To balance the two demands for FoV limited swarms by acquiring mutual observations with a safety guarantee, this paper proposes an active localization correction system, which plans camera orientations via a yaw planner during the flight. The yaw planner manages the contradiction by calculating suitable timing and yaw angle commands based on the evaluation of localization uncertainty estimated by the Kalman Filter. Simulation validates the scalability of our algorithm. In real-world experiments, we reduce positioning drift by up to 65% and managed to maintain a given formation in both indoor and outdoor GPS-denied flight, from which the accuracy, efficiency, and robustness of the proposed system are verified.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
UniQuad: A Unified and Versatile Quadrotor Platform Series for UAV Research and Application
Authors:
Yichen Zhang,
Xinyi Chen,
Peize Liu,
Junzhe Wang,
Hetai Zou,
Neng Pan,
Fei Gao,
Shaojie Shen
Abstract:
As quadrotors take on an increasingly diverse range of roles, researchers often need to develop new hardware platforms tailored for specific tasks, introducing significant engineering overhead. In this article, we introduce the UniQuad series, a unified and versatile quadrotor platform series that offers high flexibility to adapt to a wide range of common tasks, excellent customizability for advan…
▽ More
As quadrotors take on an increasingly diverse range of roles, researchers often need to develop new hardware platforms tailored for specific tasks, introducing significant engineering overhead. In this article, we introduce the UniQuad series, a unified and versatile quadrotor platform series that offers high flexibility to adapt to a wide range of common tasks, excellent customizability for advanced demands, and easy maintenance in case of crashes. This project is fully open-source at https://hkust-aerial-robotics.github.io/UniQuad.
△ Less
Submitted 4 July, 2024; v1 submitted 29 June, 2024;
originally announced July 2024.
-
Hyper-sampling imaging
Authors:
Ze Zhang,
Hemeng Xue,
Mingtao Shang,
Hongfei Yu,
Jinchao Liang,
Meiling Guan,
Chengming Sun,
Huahua Wang,
Shufeng Wang,
Zhengyu Ye,
Feng Gao,
Lu Gao
Abstract:
In our research, we have developed a novel mechanism that allows for a significant reduction in the smallest sampling unit of digital image sensors (DIS) to as small as 1/16th of a pixel, through measuring the intra-pixel quantum efficiency for the first time and recomputing the image. Employing our method, the physical sampling resolution of DIS can be enhanced by 16 times. The method has undergo…
▽ More
In our research, we have developed a novel mechanism that allows for a significant reduction in the smallest sampling unit of digital image sensors (DIS) to as small as 1/16th of a pixel, through measuring the intra-pixel quantum efficiency for the first time and recomputing the image. Employing our method, the physical sampling resolution of DIS can be enhanced by 16 times. The method has undergone rigorous testing in real-world imaging scenarios.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
PharmaGPT: Domain-Specific Large Language Models for Bio-Pharmaceutical and Chemistry
Authors:
Linqing Chen,
Weilei Wang,
Zilong Bai,
Peng Xu,
Yan Fang,
Jie Fang,
Wentao Wu,
Lizhi Zhou,
Ruiji Zhang,
Yubin Xia,
Chaobo Xu,
Ran Hu,
Licong Xu,
Qijun Cai,
Haoran Hua,
Jing Sun,
Jin Liu,
Tian Qiu,
Haowen Liu,
Meng Hu,
Xiuwen Li,
Fei Gao,
Yufu Wang,
Lin Tie,
Chaochao Wang
, et al. (11 additional authors not shown)
Abstract:
Large language models (LLMs) have revolutionized Natural Language Processing (NLP) by minimizing the need for complex feature engineering. However, the application of LLMs in specialized domains like biopharmaceuticals and chemistry remains largely unexplored. These fields are characterized by intricate terminologies, specialized knowledge, and a high demand for precision areas where general purpo…
▽ More
Large language models (LLMs) have revolutionized Natural Language Processing (NLP) by minimizing the need for complex feature engineering. However, the application of LLMs in specialized domains like biopharmaceuticals and chemistry remains largely unexplored. These fields are characterized by intricate terminologies, specialized knowledge, and a high demand for precision areas where general purpose LLMs often fall short. In this study, we introduce PharmaGPT, a suite of domain specilized LLMs with 13 billion and 70 billion parameters, specifically trained on a comprehensive corpus tailored to the Bio-Pharmaceutical and Chemical domains. Our evaluation shows that PharmaGPT surpasses existing general models on specific-domain benchmarks such as NAPLEX, demonstrating its exceptional capability in domain-specific tasks. Remarkably, this performance is achieved with a model that has only a fraction, sometimes just one-tenth-of the parameters of general-purpose large models. This advancement establishes a new benchmark for LLMs in the bio-pharmaceutical and chemical fields, addressing the existing gap in specialized language modeling. It also suggests a promising path for enhanced research and development, paving the way for more precise and effective NLP applications in these areas.
△ Less
Submitted 9 July, 2024; v1 submitted 25 June, 2024;
originally announced June 2024.
-
Sparse-view Signal-domain Photoacoustic Tomography Reconstruction Method Based on Neural Representation
Authors:
Bowei Yao,
Yi Zeng,
Haizhao Dai,
Qing Wu,
Youshen Xiao,
Fei Gao,
Yuyao Zhang,
Jingyi Yu,
Xiran Cai
Abstract:
Photoacoustic tomography is a hybrid biomedical technology, which combines the advantages of acoustic and optical imaging. However, for the conventional image reconstruction method, the image quality is affected obviously by artifacts under the condition of sparse sampling. in this paper, a novel model-based sparse reconstruction method via implicit neural representation was proposed for improving…
▽ More
Photoacoustic tomography is a hybrid biomedical technology, which combines the advantages of acoustic and optical imaging. However, for the conventional image reconstruction method, the image quality is affected obviously by artifacts under the condition of sparse sampling. in this paper, a novel model-based sparse reconstruction method via implicit neural representation was proposed for improving the image quality reconstructed from sparse data. Specially, the initial acoustic pressure distribution was modeled as a continuous function of spatial coordinates, and parameterized by a multi-layer perceptron. The weights of multi-layer perceptron were determined by training the network in self-supervised manner. And the total variation regularization term was used to offer the prior knowledge. We compared our result with some ablation studies, and the results show that out method outperforms existing methods on simulation and experimental data. Under the sparse sampling condition, our method can suppress the artifacts and avoid the ill-posed problem effectively, which reconstruct images with higher signal-to-noise ratio and contrast-to-noise ratio than traditional methods. The high-quality results for sparse data make the proposed method hold the potential for further decreasing the hardware cost of photoacoustic tomography system.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
Exploring Cross-Domain Few-Shot Classification via Frequency-Aware Prompting
Authors:
Tiange Zhang,
Qing Cai,
Feng Gao,
Lin Qi,
Junyu Dong
Abstract:
Cross-Domain Few-Shot Learning has witnessed great stride with the development of meta-learning. However, most existing methods pay more attention to learning domain-adaptive inductive bias (meta-knowledge) through feature-wise manipulation or task diversity improvement while neglecting the phenomenon that deep networks tend to rely more on high-frequency cues to make the classification decision,…
▽ More
Cross-Domain Few-Shot Learning has witnessed great stride with the development of meta-learning. However, most existing methods pay more attention to learning domain-adaptive inductive bias (meta-knowledge) through feature-wise manipulation or task diversity improvement while neglecting the phenomenon that deep networks tend to rely more on high-frequency cues to make the classification decision, which thus degenerates the robustness of learned inductive bias since high-frequency information is vulnerable and easy to be disturbed by noisy information. Hence in this paper, we make one of the first attempts to propose a Frequency-Aware Prompting method with mutual attention for Cross-Domain Few-Shot classification, which can let networks simulate the human visual perception of selecting different frequency cues when facing new recognition tasks. Specifically, a frequency-aware prompting mechanism is first proposed, in which high-frequency components of the decomposed source image are switched either with normal distribution sampling or zeroing to get frequency-aware augment samples. Then, a mutual attention module is designed to learn generalizable inductive bias under CD-FSL settings. More importantly, the proposed method is a plug-and-play module that can be directly applied to most off-the-shelf CD-FLS methods. Experimental results on CD-FSL benchmarks demonstrate the effectiveness of our proposed method as well as robustly improve the performance of existing CD-FLS methods. Resources at https://github.com/tinkez/FAP_CDFSC.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Research on Flight Accidents Prediction based Back Propagation Neural Network
Authors:
Haoxing Liu,
Fangzhou Shen,
Haoshen Qin and,
Fanru Gao
Abstract:
With the rapid development of civil aviation and the significant improvement of people's living standards, taking an air plane has become a common and efficient way of travel. However, due to the flight characteris-tics of the aircraft and the sophistication of the fuselage structure, flight de-lays and flight accidents occur from time to time. In addition, the life risk factor brought by aircraft…
▽ More
With the rapid development of civil aviation and the significant improvement of people's living standards, taking an air plane has become a common and efficient way of travel. However, due to the flight characteris-tics of the aircraft and the sophistication of the fuselage structure, flight de-lays and flight accidents occur from time to time. In addition, the life risk factor brought by aircraft after an accident is also the highest among all means of transportation. In this work, a model based on back-propagation neural network was used to predict flight accidents. By collecting historical flight data, including a variety of factors such as meteorological conditions, aircraft technical condition, and pilot experience, we trained a backpropaga-tion neural network model to identify potential accident risks. In the model design, a multi-layer perceptron structure is used to optimize the network performance by adjusting the number of hidden layer nodes and the learning rate. Experimental analysis shows that the model can effectively predict flight accidents with high accuracy and reliability.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
XENONnT WIMP Search: Signal & Background Modeling and Statistical Inference
Authors:
XENON Collaboration,
E. Aprile,
J. Aalbers,
K. Abe,
S. Ahmed Maouloud,
L. Althueser,
B. Andrieu,
E. Angelino,
D. Antón Martin,
F. Arneodo,
L. Baudis,
M. Bazyk,
L. Bellagamba,
R. Biondi,
A. Bismark,
K. Boese,
A. Brown,
G. Bruno,
R. Budnik,
J. M. R. Cardoso,
A. P. Cimental Chávez,
A. P. Colijn,
J. Conrad,
J. J. Cuenca-García,
V. D'Andrea
, et al. (139 additional authors not shown)
Abstract:
The XENONnT experiment searches for weakly-interacting massive particle (WIMP) dark matter scattering off a xenon nucleus. In particular, XENONnT uses a dual-phase time projection chamber with a 5.9-tonne liquid xenon target, detecting both scintillation and ionization signals to reconstruct the energy, position, and type of recoil. A blind search for nuclear recoil WIMPs with an exposure of 1.1 t…
▽ More
The XENONnT experiment searches for weakly-interacting massive particle (WIMP) dark matter scattering off a xenon nucleus. In particular, XENONnT uses a dual-phase time projection chamber with a 5.9-tonne liquid xenon target, detecting both scintillation and ionization signals to reconstruct the energy, position, and type of recoil. A blind search for nuclear recoil WIMPs with an exposure of 1.1 tonne-years yielded no signal excess over background expectations, from which competitive exclusion limits were derived on WIMP-nucleon elastic scatter cross sections, for WIMP masses ranging from 6 GeV/$c^2$ up to the TeV/$c^2$ scale. This work details the modeling and statistical methods employed in this search. By means of calibration data, we model the detector response, which is then used to derive background and signal models. The construction and validation of these models is discussed, alongside additional purely data-driven backgrounds. We also describe the statistical inference framework, including the definition of the likelihood function and the construction of confidence intervals.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Demonstration of neutron identification in neutrino interactions in the MicroBooNE liquid argon time projection chamber
Authors:
MicroBooNE collaboration,
P. Abratenko,
O. Alterkait,
D. Andrade Aldana,
L. Arellano,
J. Asaadi,
A. Ashkenazi,
S. Balasubramanian,
B. Baller,
A. Barnard,
G. Barr,
D. Barrow,
J. Barrow,
V. Basque,
J. Bateman,
O. Benevides Rodrigues,
S. Berkman,
A. Bhanderi,
A. Bhat,
M. Bhattacharya,
M. Bishai,
A. Blake,
B. Bogart,
T. Bolton,
J. Y. Book
, et al. (165 additional authors not shown)
Abstract:
A significant challenge in measurements of neutrino oscillations is reconstructing the incoming neutrino energies. While modern fully-active tracking calorimeters such as liquid argon time projection chambers in principle allow the measurement of all final state particles above some detection threshold, undetected neutrons remain a considerable source of missing energy with little to no data const…
▽ More
A significant challenge in measurements of neutrino oscillations is reconstructing the incoming neutrino energies. While modern fully-active tracking calorimeters such as liquid argon time projection chambers in principle allow the measurement of all final state particles above some detection threshold, undetected neutrons remain a considerable source of missing energy with little to no data constraining their production rates and kinematics. We present the first demonstration of tagging neutrino-induced neutrons in liquid argon time projection chambers using secondary protons emitted from neutron-argon interactions in the MicroBooNE detector. We describe the method developed to identify neutrino-induced neutrons and demonstrate its performance using neutrons produced in muon-neutrino charged current interactions. The method is validated using a small subset of MicroBooNE's total dataset. The selection yields a sample with $60\%$ of selected tracks corresponding to neutron-induced secondary protons.
△ Less
Submitted 15 June, 2024;
originally announced June 2024.
-
Improving neutrino energy estimation of charged-current interaction events with recurrent neural networks in MicroBooNE
Authors:
MicroBooNE collaboration,
P. Abratenko,
O. Alterkait,
D. Andrade Aldana,
L. Arellano,
J. Asaadi,
A. Ashkenazi,
S. Balasubramanian,
B. Baller,
A. Barnard,
G. Barr,
D. Barrow,
J. Barrow,
V. Basque,
J. Bateman,
O. Benevides Rodrigues,
S. Berkman,
A. Bhanderi,
A. Bhat,
M. Bhattacharya,
M. Bishai,
A. Blake,
B. Bogart,
T. Bolton,
J. Y. Book
, et al. (164 additional authors not shown)
Abstract:
We present a deep learning-based method for estimating the neutrino energy of charged-current neutrino-argon interactions. We employ a recurrent neural network (RNN) architecture for neutrino energy estimation in the MicroBooNE experiment, utilizing liquid argon time projection chamber (LArTPC) detector technology. Traditional energy estimation approaches in LArTPCs, which largely rely on reconstr…
▽ More
We present a deep learning-based method for estimating the neutrino energy of charged-current neutrino-argon interactions. We employ a recurrent neural network (RNN) architecture for neutrino energy estimation in the MicroBooNE experiment, utilizing liquid argon time projection chamber (LArTPC) detector technology. Traditional energy estimation approaches in LArTPCs, which largely rely on reconstructing and summing visible energies, often experience sizable biases and resolution smearing because of the complex nature of neutrino interactions and the detector response. The estimation of neutrino energy can be improved after considering the kinematics information of reconstructed final-state particles. Utilizing kinematic information of reconstructed particles, the deep learning-based approach shows improved resolution and reduced bias for the muon neutrino Monte Carlo simulation sample compared to the traditional approach. In order to address the common concern about the effectiveness of this method on experimental data, the RNN-based energy estimator is further examined and validated with dedicated data-simulation consistency tests using MicroBooNE data. We also assess its potential impact on a neutrino oscillation study after accounting for all statistical and systematic uncertainties and show that it enhances physics sensitivity. This method has good potential to improve the performance of other physics analyses.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Low-Overhead Channel Estimation via 3D Extrapolation for TDD mmWave Massive MIMO Systems Under High-Mobility Scenarios
Authors:
Binggui Zhou,
Xi Yang,
Shaodan Ma,
Feifei Gao,
Guanghua Yang
Abstract:
In TDD mmWave massive MIMO systems, the downlink CSI can be attained through uplink channel estimation thanks to the uplink-downlink channel reciprocity. However, the channel aging issue is significant under high-mobility scenarios and thus necessitates frequent uplink channel estimation. In addition, large amounts of antennas and subcarriers lead to high-dimensional CSI matrices, aggravating the…
▽ More
In TDD mmWave massive MIMO systems, the downlink CSI can be attained through uplink channel estimation thanks to the uplink-downlink channel reciprocity. However, the channel aging issue is significant under high-mobility scenarios and thus necessitates frequent uplink channel estimation. In addition, large amounts of antennas and subcarriers lead to high-dimensional CSI matrices, aggravating the pilot training overhead. To systematically reduce the pilot overhead, a spatial, frequency, and temporal domain (3D) channel extrapolation framework is proposed in this paper. Considering the marginal effects of pilots in the spatial and frequency domains and the effectiveness of traditional knowledge-driven channel estimation methods, we first propose a knowledge-and-data driven spatial-frequency channel extrapolation network (KDD-SFCEN) for uplink channel estimation by exploiting the least square estimator for coarse channel estimation and joint spatial-frequency channel extrapolation to reduce the spatial-frequency domain pilot overhead. Then, resorting to the uplink-downlink channel reciprocity and temporal domain dependencies of downlink channels, a temporal uplink-downlink channel extrapolation network (TUDCEN) is proposed for slot-level channel extrapolation, aiming to enlarge the pilot signal period and thus reduce the temporal domain pilot overhead under high-mobility scenarios. Specifically, we propose the spatial-frequency sampling embedding module to reduce the representation dimension and consequent computational complexity, and we propose to exploit the autoregressive generative Transformer for generating downlink channels autoregressively. Numerical results demonstrate the superiority of the proposed framework in significantly reducing the pilot training overhead by more than 16 times and improving the system's spectral efficiency under high-mobility scenarios.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
FlightBench: A Comprehensive Benchmark of Spatial Planning Methods for Quadrotors
Authors:
Shu-Ang Yu,
Chao Yu,
Feng Gao,
Yi Wu,
Yu Wang
Abstract:
Spatial planning in cluttered environments is crucial for mobile systems, particularly agile quadrotors. Existing methods, both optimization-based and learning-based, often focus only on success rates in specific environments and lack a unified platform with tasks of varying difficulty. To address this, we introduce FlightBench, the first comprehensive open-source benchmark for 3D spatial planning…
▽ More
Spatial planning in cluttered environments is crucial for mobile systems, particularly agile quadrotors. Existing methods, both optimization-based and learning-based, often focus only on success rates in specific environments and lack a unified platform with tasks of varying difficulty. To address this, we introduce FlightBench, the first comprehensive open-source benchmark for 3D spatial planning on quadrotors, comparing classical optimization-based methods with emerging learning-based approaches. We also develop a suite of task difficulty metrics and evaluation metrics to quantify the characteristics of tasks and the performance of planning algorithms. Extensive experiments demonstrate the significant advantages of learning-based methods for high-speed flight and real-time planning, while highlighting the need for improvements in complex conditions, such as navigating large corners or dealing with view occlusion. We also conduct analytical experiments to justify the effectiveness of our proposed metrics. Additionally, we show that latency randomization effectively enhances performance in real-world deployments. The source code is available at \url{https://github.com/thu-uav/FlightBench}.
△ Less
Submitted 9 June, 2024;
originally announced June 2024.
-
Dual-Stream Attention Network for Hyperspectral Image Unmixing
Authors:
Yufang Wang,
Wenmin Wu,
Lin Qi,
Feng Gao
Abstract:
Hyperspectral image (HSI) contains abundant spatial and spectral information, making it highly valuable for unmixing. In this paper, we propose a Dual-Stream Attention Network (DSANet) for HSI unmixing. The endmembers and abundance of a pixel in HSI have high correlations with its adjacent pixels. Therefore, we adopt a "many to one" strategy to estimate the abundance of the central pixel. In addit…
▽ More
Hyperspectral image (HSI) contains abundant spatial and spectral information, making it highly valuable for unmixing. In this paper, we propose a Dual-Stream Attention Network (DSANet) for HSI unmixing. The endmembers and abundance of a pixel in HSI have high correlations with its adjacent pixels. Therefore, we adopt a "many to one" strategy to estimate the abundance of the central pixel. In addition, we adopt multiview spectral method, dividing spectral bands into multiple partitions with low correlations to estimate abundances. To aggregate the estimated abundances for complementary from the two branches, we design a cross-fusion attention network to enhance valuable information. Extensive experiments have been conducted on two real datasets, which demonstrate the effectiveness of our DSANet.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Sparse Focus Network for Multi-Source Remote Sensing Data Classification
Authors:
Xuepeng Jin,
Junyan Lin,
Feng Gao,
Lin Qi,
Yang Zhou
Abstract:
Multi-source remote sensing data classification has emerged as a prominent research topic with the advancement of various sensors. Existing multi-source data classification methods are susceptible to irrelevant information interference during multi-source feature extraction and fusion. To solve this issue, we propose a sparse focus network for multi-source data classification. Sparse attention is…
▽ More
Multi-source remote sensing data classification has emerged as a prominent research topic with the advancement of various sensors. Existing multi-source data classification methods are susceptible to irrelevant information interference during multi-source feature extraction and fusion. To solve this issue, we propose a sparse focus network for multi-source data classification. Sparse attention is employed in Transformer block for HSI and SAR/LiDAR feature extraction, thereby the most useful self-attention values are maintained for better feature aggregation. Furthermore, cross-attention is used to enhance multi-source feature interactions, and further improves the efficiency of cross-modal feature fusion. Experimental results on the Berlin and Houston2018 datasets highlight the effectiveness of SF-Net, outperforming existing state-of-the-art methods.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Arctic Sea Ice Image Super-Resolution Based on Multi-Scale Convolution and Dual-Gating Mechanism
Authors:
Zhaomin Fang,
Wankun Chen,
Feng Gao,
Yanhai Gan,
Junyu Dong,
Yang Zhou
Abstract:
Arctic Sea Ice Concentration (SIC) is the ratio of ice-covered area to the total sea area of the Arctic Ocean, which is a key indicator for maritime activities. Nowadays, we often use passive microwave images to display SIC, but it has low spatial resolution, and most of the existing super-resolution methods of Arctic SIC don't take the integration of spatial and channel features into account and…
▽ More
Arctic Sea Ice Concentration (SIC) is the ratio of ice-covered area to the total sea area of the Arctic Ocean, which is a key indicator for maritime activities. Nowadays, we often use passive microwave images to display SIC, but it has low spatial resolution, and most of the existing super-resolution methods of Arctic SIC don't take the integration of spatial and channel features into account and can't effectively integrate the multi-scale feature. To overcome the aforementioned issues, we propose MFM-Net for Arctic SIC super-resolution, which concurrently aggregates multi-scale information while integrating spatial and channel features. Extensive experiments on Arctic SIC dataset from the AMSR-E/AMSR-2 SIC DT-ASI products from Ocean University of China validate the effectiveness of porposed MFM-Net.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Boosting Spatial-Spectral Masked Auto-Encoder Through Mining Redundant Spectra for HSI-SAR/LiDAR Classification
Authors:
Junyan Lin,
Xuepeng Jin,
Feng Gao,
Junyu Dong,
Hui Yu
Abstract:
Although recent masked image modeling (MIM)-based HSI-LiDAR/SAR classification methods have gradually recognized the importance of the spectral information, they have not adequately addressed the redundancy among different spectra, resulting in information leakage during the pretraining stage. This issue directly impairs the representation ability of the model. To tackle the problem, we propose a…
▽ More
Although recent masked image modeling (MIM)-based HSI-LiDAR/SAR classification methods have gradually recognized the importance of the spectral information, they have not adequately addressed the redundancy among different spectra, resulting in information leakage during the pretraining stage. This issue directly impairs the representation ability of the model. To tackle the problem, we propose a new strategy, named Mining Redundant Spectra (MRS). Unlike randomly masking spectral bands, MRS selectively masks them by similarity to increase the reconstruction difficulty. Specifically, a random spectral band is chosen during pretraining, and the selected and highly similar bands are masked. Experimental results demonstrate that employing the MRS strategy during the pretraining stage effectively improves the accuracy of existing MIM-based methods on the Berlin and Houston 2018 datasets.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
LSKSANet: A Novel Architecture for Remote Sensing Image Semantic Segmentation Leveraging Large Selective Kernel and Sparse Attention Mechanism
Authors:
Miao Fu,
Feng Gao,
Ruzhuang Hua,
Yanhai Gan,
Xiaowei Zhou,
Yang Zhou
Abstract:
In this paper, we proposed large selective kernel and sparse attention network (LSKSANet) for remote sensing image semantic segmentation. The LSKSANet is a lightweight network that effectively combines convolution with sparse attention mechanisms. Specifically, we design large selective kernel module to decomposing the large kernel into a series of depth-wise convolutions with progressively increa…
▽ More
In this paper, we proposed large selective kernel and sparse attention network (LSKSANet) for remote sensing image semantic segmentation. The LSKSANet is a lightweight network that effectively combines convolution with sparse attention mechanisms. Specifically, we design large selective kernel module to decomposing the large kernel into a series of depth-wise convolutions with progressively increasing dilation rates, thereby expanding the receptive field without significantly increasing the computational burden. In addition, we introduce the sparse attention to keep the most useful self-attention values for better feature aggregation. Experimental results on the Vaihingen and Postdam datasets demonstrate the superior performance of the proposed LSKSANet over state-of-the-art methods.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Confidence-Based Task Prediction in Continual Disease Classification Using Probability Distribution
Authors:
Tanvi Verma,
Lukas Schwemer,
Mingrui Tan,
Fei Gao,
Yong Liu,
Huazhu Fu
Abstract:
Deep learning models are widely recognized for their effectiveness in identifying medical image findings in disease classification. However, their limitations become apparent in the dynamic and ever-changing clinical environment, characterized by the continuous influx of newly annotated medical data from diverse sources. In this context, the need for continual learning becomes particularly paramou…
▽ More
Deep learning models are widely recognized for their effectiveness in identifying medical image findings in disease classification. However, their limitations become apparent in the dynamic and ever-changing clinical environment, characterized by the continuous influx of newly annotated medical data from diverse sources. In this context, the need for continual learning becomes particularly paramount, not only to adapt to evolving medical scenarios but also to ensure the privacy of healthcare data. In our research, we emphasize the utilization of a network comprising expert classifiers, where a new expert classifier is added each time a new task is introduced. We present CTP, a task-id predictor that utilizes confidence scores, leveraging the probability distribution (logits) of the classifier to accurately determine the task-id at inference time. Logits are adjusted to ensure that classifiers yield a high-entropy distribution for data associated with tasks other than their own. By defining a noise region in the distribution and computing confidence scores, CTP achieves superior performance when compared to other relevant continual learning methods. Additionally, the performance of CTP can be further improved by providing it with a continuum of data at the time of inference.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Cross-Dimensional Medical Self-Supervised Representation Learning Based on a Pseudo-3D Transformation
Authors:
Fei Gao,
Siwen Wang,
Fandong Zhang,
Hong-Yu Zhou,
Yizhou Wang,
Churan Wang,
Gang Yu,
Yizhou Yu
Abstract:
Medical image analysis suffers from a shortage of data, whether annotated or not. This becomes even more pronounced when it comes to 3D medical images. Self-Supervised Learning (SSL) can partially ease this situation by using unlabeled data. However, most existing SSL methods can only make use of data in a single dimensionality (e.g. 2D or 3D), and are incapable of enlarging the training dataset b…
▽ More
Medical image analysis suffers from a shortage of data, whether annotated or not. This becomes even more pronounced when it comes to 3D medical images. Self-Supervised Learning (SSL) can partially ease this situation by using unlabeled data. However, most existing SSL methods can only make use of data in a single dimensionality (e.g. 2D or 3D), and are incapable of enlarging the training dataset by using data with differing dimensionalities jointly. In this paper, we propose a new cross-dimensional SSL framework based on a pseudo-3D transformation (CDSSL-P3D), that can leverage both 2D and 3D data for joint pre-training. Specifically, we introduce an image transformation based on the im2col algorithm, which converts 2D images into a format consistent with 3D data. This transformation enables seamless integration of 2D and 3D data, and facilitates cross-dimensional self-supervised learning for 3D medical image analysis. We run extensive experiments on 13 downstream tasks, including 2D and 3D classification and segmentation. The results indicate that our CDSSL-P3D achieves superior performance, outperforming other advanced SSL methods.
△ Less
Submitted 4 July, 2024; v1 submitted 2 June, 2024;
originally announced June 2024.
-
Scalable Distance-based Multi-Agent Relative State Estimation via Block Multiconvex Optimization
Authors:
Tianyue Wu,
Gongye Zaitian,
Qianhao Wang,
Fei Gao
Abstract:
This paper explores the distance-based relative state estimation problem in large-scale systems, which is hard to solve effectively due to its high-dimensionality and non-convexity. In this paper, we alleviate this inherent hardness to simultaneously achieve scalability and robustness of inference on this problem. Our idea is launched from a universal geometric formulation, called \emph{generalize…
▽ More
This paper explores the distance-based relative state estimation problem in large-scale systems, which is hard to solve effectively due to its high-dimensionality and non-convexity. In this paper, we alleviate this inherent hardness to simultaneously achieve scalability and robustness of inference on this problem. Our idea is launched from a universal geometric formulation, called \emph{generalized graph realization}, for the distance-based relative state estimation problem. Based on this formulation, we introduce two collaborative optimization models, one of which is convex and thus globally solvable, and the other enables fast searching on non-convex landscapes to refine the solution offered by the convex one. Importantly, both models enjoy \emph{multiconvex} and \emph{decomposable} structures, allowing efficient and safe solutions using \emph{block coordinate descent} that enjoys scalability and a distributed nature. The proposed algorithms collaborate to demonstrate superior or comparable solution precision to the current centralized convex relaxation-based methods, which are known for their high optimality. Distinctly, the proposed methods demonstrate scalability beyond the reach of previous convex relaxation-based methods. We also demonstrate that the combination of the two proposed algorithms achieves a more robust pipeline than deploying the local search method alone in a continuous-time scenario.
△ Less
Submitted 31 May, 2024;
originally announced May 2024.
-
Integrated Sensing and Communications Framework for 6G Networks
Authors:
Hongliang Luo,
Tengyu Zhang,
Chuanbin Zhao,
Yucong Wang,
Bo Lin,
Yuhua Jiang,
Dongqi Luo,
Feifei Gao
Abstract:
In this paper, we propose a novel integrated sensing and communications (ISAC) framework for the sixth generation (6G) mobile networks, in which we decompose the real physical world into static environment, dynamic targets, and various object materials. The ubiquitous static environment occupies the vast majority of the physical world, for which we design static environment reconstruction (SER) sc…
▽ More
In this paper, we propose a novel integrated sensing and communications (ISAC) framework for the sixth generation (6G) mobile networks, in which we decompose the real physical world into static environment, dynamic targets, and various object materials. The ubiquitous static environment occupies the vast majority of the physical world, for which we design static environment reconstruction (SER) scheme to obtain the layout and point cloud information of static buildings. The dynamic targets floating in static environments create the spatiotemporal transition of the physical world, for which we design comprehensive dynamic target sensing (DTS) scheme to detect, estimate, track, image and recognize the dynamic targets in real-time. The object materials enrich the electromagnetic laws of the physical world, for which we develop object material recognition (OMR) scheme to estimate the electromagnetic coefficient of the objects. Besides, to integrate these sensing functions into existing communications systems, we discuss the interference issues and corresponding solutions for ISAC cellular networks. Furthermore, we develop an ISAC hardware prototype platform that can reconstruct the environmental maps and sense the dynamic targets while maintaining communications services. With all these designs, the proposed ISAC framework can support multifarious emerging applications, such as digital twins, low altitude economy, internet of vehicles, marine management, deformation monitoring, etc.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching
Authors:
Yasi Zhang,
Peiyu Yu,
Yaxuan Zhu,
Yingshan Chang,
Feng Gao,
Ying Nian Wu,
Oscar Leong
Abstract:
Generative models based on flow matching have attracted significant attention for their simplicity and superior performance in high-resolution image synthesis. By leveraging the instantaneous change-of-variables formula, one can directly compute image likelihoods from a learned flow, making them enticing candidates as priors for downstream tasks such as inverse problems. In particular, a natural a…
▽ More
Generative models based on flow matching have attracted significant attention for their simplicity and superior performance in high-resolution image synthesis. By leveraging the instantaneous change-of-variables formula, one can directly compute image likelihoods from a learned flow, making them enticing candidates as priors for downstream tasks such as inverse problems. In particular, a natural approach would be to incorporate such image probabilities in a maximum-a-posteriori (MAP) estimation problem. A major obstacle, however, lies in the slow computation of the log-likelihood, as it requires backpropagating through an ODE solver, which can be prohibitively slow for high-dimensional problems. In this work, we propose an iterative algorithm to approximate the MAP estimator efficiently to solve a variety of linear inverse problems. Our algorithm is mathematically justified by the observation that the MAP objective can be approximated by a sum of $N$ ``local MAP'' objectives, where $N$ is the number of function evaluations. By leveraging Tweedie's formula, we show that we can perform gradient steps to sequentially optimize these objectives. We validate our approach for various linear inverse problems, such as super-resolution, deblurring, inpainting, and compressed sensing, and demonstrate that we can outperform other methods based on flow matching.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Atlas3D: Physically Constrained Self-Supporting Text-to-3D for Simulation and Fabrication
Authors:
Yunuo Chen,
Tianyi Xie,
Zeshun Zong,
Xuan Li,
Feng Gao,
Yin Yang,
Ying Nian Wu,
Chenfanfu Jiang
Abstract:
Existing diffusion-based text-to-3D generation methods primarily focus on producing visually realistic shapes and appearances, often neglecting the physical constraints necessary for downstream tasks. Generated models frequently fail to maintain balance when placed in physics-based simulations or 3D printed. This balance is crucial for satisfying user design intentions in interactive gaming, embod…
▽ More
Existing diffusion-based text-to-3D generation methods primarily focus on producing visually realistic shapes and appearances, often neglecting the physical constraints necessary for downstream tasks. Generated models frequently fail to maintain balance when placed in physics-based simulations or 3D printed. This balance is crucial for satisfying user design intentions in interactive gaming, embodied AI, and robotics, where stable models are needed for reliable interaction. Additionally, stable models ensure that 3D-printed objects, such as figurines for home decoration, can stand on their own without requiring additional supports. To fill this gap, we introduce Atlas3D, an automatic and easy-to-implement method that enhances existing Score Distillation Sampling (SDS)-based text-to-3D tools. Atlas3D ensures the generation of self-supporting 3D models that adhere to physical laws of stability under gravity, contact, and friction. Our approach combines a novel differentiable simulation-based loss function with physically inspired regularization, serving as either a refinement or a post-processing module for existing frameworks. We verify Atlas3D's efficacy through extensive generation tasks and validate the resulting 3D models in both simulated and real-world environments.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
Probabilistic and progressive deblended far-infrared and sub-millimetre point source catalogues I. Methodology and first application in the COSMOS field
Authors:
Lingyu Wang,
Antonio La Marca,
Fangyou Gao,
William J. Pearson,
Berta Margalef-Bentabol,
Matthieu Béthermin,
Longji Bing,
James Donnellan,
Peter D. Hurley,
Seb J. Oliver,
Catherine L. Hale,
Matt J. Jarvis,
Lucia Marchetti,
Mattia Vaccari,
Imogen H. Whittam
Abstract:
Single-dish far-infrared (far-IR) and sub-millimetre (sub-mm) point source catalogues and their connections with catalogues at other wavelengths are of paramount importance. However, due to the large mismatch in spatial resolution, cross-matching galaxies at different wavelengths is challenging. This work aims to develop the next-generation deblended far-IR and sub-mm catalogues and present the fi…
▽ More
Single-dish far-infrared (far-IR) and sub-millimetre (sub-mm) point source catalogues and their connections with catalogues at other wavelengths are of paramount importance. However, due to the large mismatch in spatial resolution, cross-matching galaxies at different wavelengths is challenging. This work aims to develop the next-generation deblended far-IR and sub-mm catalogues and present the first application in the COSMOS field. Our progressive deblending used the Bayesian probabilistic framework known as XID+. The deblending started from the Spitzer/MIPS 24 micron data, using an initial prior list composed of sources selected from the COSMOS2020 catalogue and radio catalogues from the VLA and the MeerKAT surveys, based on spectral energy distribution modelling which predicts fluxes of the known sources at the deblending wavelength. To speed up flux prediction, we made use of a neural network-based emulator. After deblending the 24 micron data, we proceeded to the Herschel PACS (100 & 160 micron) and SPIRE wavebands (250, 350 & 500 micron). Each time we constructed a tailor-made prior list based on the predicted fluxes of the known sources. Using simulated far-IR and sub-mm sky, we detailed the performance of our deblending pipeline. After validation with simulations, we then deblended the real observations from 24 to 500 micron and compared with blindly extracted catalogues and previous versions of deblended catalogues. As an additional test, we deblended the SCUBA-2 850 micron map and compared our deblended fluxes with ALMA measurements, which demonstrates a higher level of flux accuracy compared to previous results.We publicly release our XID+ deblended point source catalogues. These deblended long-wavelength data are crucial for studies such as deriving the fraction of dust-obscured star formation and better separation of quiescent galaxies from dusty star-forming galaxies.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
SSLChange: A Self-supervised Change Detection Framework Based on Domain Adaptation
Authors:
Yitao Zhao,
Turgay Celik,
Nanqing Liu,
Feng Gao,
Heng-Chao Li
Abstract:
In conventional remote sensing change detection (RS CD) procedures, extensive manual labeling for bi-temporal images is first required to maintain the performance of subsequent fully supervised training. However, pixel-level labeling for CD tasks is very complex and time-consuming. In this paper, we explore a novel self-supervised contrastive framework applicable to the RS CD task, which promotes…
▽ More
In conventional remote sensing change detection (RS CD) procedures, extensive manual labeling for bi-temporal images is first required to maintain the performance of subsequent fully supervised training. However, pixel-level labeling for CD tasks is very complex and time-consuming. In this paper, we explore a novel self-supervised contrastive framework applicable to the RS CD task, which promotes the model to accurately capture spatial, structural, and semantic information through domain adapter and hierarchical contrastive head. The proposed SSLChange framework accomplishes self-learning only by taking a single-temporal sample and can be flexibly transferred to main-stream CD baselines. With self-supervised contrastive learning, feature representation pre-training can be performed directly based on the original data even without labeling. After a certain amount of labels are subsequently obtained, the pre-trained features will be aligned with the labels for fully supervised fine-tuning. Without introducing any additional data or labels, the performance of downstream baselines will experience a significant enhancement. Experimental results on 2 entire datasets and 6 diluted datasets show that our proposed SSLChange improves the performance and stability of CD baseline in data-limited situations. The code of SSLChange will be released at \url{https://github.com/MarsZhaoYT/SSLChange}
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
JUNO Sensitivity to Invisible Decay Modes of Neutrons
Authors:
JUNO Collaboration,
Angel Abusleme,
Thomas Adam,
Kai Adamowicz,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Weidong Bai,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato,
Marco Beretta,
Antonio Bergnoli,
Daniel Bick
, et al. (635 additional authors not shown)
Abstract:
We explore the bound neutrons decay into invisible particles (e.g., $n\rightarrow 3 ν$ or $nn \rightarrow 2 ν$) in the JUNO liquid scintillator detector. The invisible decay includes two decay modes: $ n \rightarrow { inv} $ and $ nn \rightarrow { inv} $. The invisible decays of $s$-shell neutrons in $^{12}{\rm C}$ will leave a highly excited residual nucleus. Subsequently, some de-excitation mode…
▽ More
We explore the bound neutrons decay into invisible particles (e.g., $n\rightarrow 3 ν$ or $nn \rightarrow 2 ν$) in the JUNO liquid scintillator detector. The invisible decay includes two decay modes: $ n \rightarrow { inv} $ and $ nn \rightarrow { inv} $. The invisible decays of $s$-shell neutrons in $^{12}{\rm C}$ will leave a highly excited residual nucleus. Subsequently, some de-excitation modes of the excited residual nuclei can produce a time- and space-correlated triple coincidence signal in the JUNO detector. Based on a full Monte Carlo simulation informed with the latest available data, we estimate all backgrounds, including inverse beta decay events of the reactor antineutrino $\barν_e$, natural radioactivity, cosmogenic isotopes and neutral current interactions of atmospheric neutrinos. Pulse shape discrimination and multivariate analysis techniques are employed to further suppress backgrounds. With two years of exposure, JUNO is expected to give an order of magnitude improvement compared to the current best limits. After 10 years of data taking, the JUNO expected sensitivities at a 90% confidence level are $τ/B( n \rightarrow { inv} ) > 5.0 \times 10^{31} \, {\rm yr}$ and $τ/B( nn \rightarrow { inv} ) > 1.4 \times 10^{32} \, {\rm yr}$.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
Microsaccade-inspired Event Camera for Robotics
Authors:
Botao He,
Ze Wang,
Yuan Zhou,
Jingxi Chen,
Chahat Deep Singh,
Haojia Li,
Yuman Gao,
Shaojie Shen,
Kaiwei Wang,
Yanjun Cao,
Chao Xu,
Yiannis Aloimonos,
Fei Gao,
Cornelia Fermuller
Abstract:
Neuromorphic vision sensors or event cameras have made the visual perception of extremely low reaction time possible, opening new avenues for high-dynamic robotics applications. These event cameras' output is dependent on both motion and texture. However, the event camera fails to capture object edges that are parallel to the camera motion. This is a problem intrinsic to the sensor and therefore c…
▽ More
Neuromorphic vision sensors or event cameras have made the visual perception of extremely low reaction time possible, opening new avenues for high-dynamic robotics applications. These event cameras' output is dependent on both motion and texture. However, the event camera fails to capture object edges that are parallel to the camera motion. This is a problem intrinsic to the sensor and therefore challenging to solve algorithmically. Human vision deals with perceptual fading using the active mechanism of small involuntary eye movements, the most prominent ones called microsaccades. By moving the eyes constantly and slightly during fixation, microsaccades can substantially maintain texture stability and persistence. Inspired by microsaccades, we designed an event-based perception system capable of simultaneously maintaining low reaction time and stable texture. In this design, a rotating wedge prism was mounted in front of the aperture of an event camera to redirect light and trigger events. The geometrical optics of the rotating wedge prism allows for algorithmic compensation of the additional rotational motion, resulting in a stable texture appearance and high informational output independent of external motion. The hardware device and software solution are integrated into a system, which we call Artificial MIcrosaccade-enhanced EVent camera (AMI-EV). Benchmark comparisons validate the superior data quality of AMI-EV recordings in scenarios where both standard cameras and event cameras fail to deliver. Various real-world experiments demonstrate the potential of the system to facilitate robotics perception both for low-level and high-level vision tasks.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
Environment Sensing-aided Beam Prediction with Transfer Learning for Smart Factory
Authors:
Yuan Feng,
Chuanbing Zhao,
Feifei Gao,
Yong Zhang,
Shaodan Ma
Abstract:
In this paper, we propose an environment sensing-aided beam prediction model for smart factory that can be transferred from given environments to a new environment. In particular, we first design a pre-training model that predicts the optimal beam by sensing the present environmental information. When encountering a new environment, it generally requires collecting a large amount of new training d…
▽ More
In this paper, we propose an environment sensing-aided beam prediction model for smart factory that can be transferred from given environments to a new environment. In particular, we first design a pre-training model that predicts the optimal beam by sensing the present environmental information. When encountering a new environment, it generally requires collecting a large amount of new training data to retrain the model, whose cost severely impedes the application of the designed pre-training model. Therefore, we next design a transfer learning strategy that fine-tunes the pre-trained model by limited labeled data of the new environment. Simulation results show that when the pre-trained model is fine-tuned by 30\% of labeled data from the new environment, the Top-10 beam prediction accuracy reaches 94\%. Moreover, compared with the way to completely re-training the prediction model, the amount of training data and the time cost of the proposed transfer learning strategy reduce 70\% and 75\% respectively.
△ Less
Submitted 24 May, 2024;
originally announced May 2024.
-
Flow and Heat Transfer in a Rotating Disc Cavity With Axial Throughflow at High Speed Conditions
Authors:
Ruonan Wang,
John W. Chew,
Feng Gao,
Olaf Marxen
Abstract:
Flow and heat transfer in a compressor rotating disc cavity with axial throughflow is investigated using wall-modelled large-eddy simulations (WMLES). These are compared to measurements from recently published experiments and used to investigate high Reynolds number effects. The simulations use an open-source CFD solver with high parallel efficiency and employ the Boussinesq approximation for cent…
▽ More
Flow and heat transfer in a compressor rotating disc cavity with axial throughflow is investigated using wall-modelled large-eddy simulations (WMLES). These are compared to measurements from recently published experiments and used to investigate high Reynolds number effects. The simulations use an open-source CFD solver with high parallel efficiency and employ the Boussinesq approximation for centrifugal buoyancy. Kinetic energy effects (characterised by Eckert number) are accounted for by scaling the thermal boundary conditions from static temperature to rotary stagnation temperature. The WMLES shows very encouraging agreement with experiments up to the highest Reynolds number tested, $Re_φ=3.0\times10^6$. A further simulation at $Re_φ=10^7$ extends the investigation to an operating condition more representative of aero engine high pressure compressors. The results support the scaling of shroud heat transfer found at lower $Re_φ$, but disc heat transfer is higher than expected from a simple extrapolation of lower $Re_φ$ results. This is associated with transition to turbulence in the disc Ekman layers and is consistent with the boundary layer Reynolds numbers at this condition. The introduction of swirl in the axial throughflow, as may occur at engine conditions, could reduce the boundary layer Reynolds numbers and delay the transition.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
GarmentDreamer: 3DGS Guided Garment Synthesis with Diverse Geometry and Texture Details
Authors:
Boqian Li,
Xuan Li,
Ying Jiang,
Tianyi Xie,
Feng Gao,
Huamin Wang,
Yin Yang,
Chenfanfu Jiang
Abstract:
Traditional 3D garment creation is labor-intensive, involving sketching, modeling, UV mapping, and texturing, which are time-consuming and costly. Recent advances in diffusion-based generative models have enabled new possibilities for 3D garment generation from text prompts, images, and videos. However, existing methods either suffer from inconsistencies among multi-view images or require addition…
▽ More
Traditional 3D garment creation is labor-intensive, involving sketching, modeling, UV mapping, and texturing, which are time-consuming and costly. Recent advances in diffusion-based generative models have enabled new possibilities for 3D garment generation from text prompts, images, and videos. However, existing methods either suffer from inconsistencies among multi-view images or require additional processes to separate cloth from the underlying human model. In this paper, we propose GarmentDreamer, a novel method that leverages 3D Gaussian Splatting (GS) as guidance to generate wearable, simulation-ready 3D garment meshes from text prompts. In contrast to using multi-view images directly predicted by generative models as guidance, our 3DGS guidance ensures consistent optimization in both garment deformation and texture synthesis. Our method introduces a novel garment augmentation module, guided by normal and RGBA information, and employs implicit Neural Texture Fields (NeTF) combined with Score Distillation Sampling (SDS) to generate diverse geometric and texture details. We validate the effectiveness of our approach through comprehensive qualitative and quantitative experiments, showcasing the superior performance of GarmentDreamer over state-of-the-art alternatives. Our project page is available at: https://xuan-li.github.io/GarmentDreamerDemo/.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
GS-Planner: A Gaussian-Splatting-based Planning Framework for Active High-Fidelity Reconstruction
Authors:
Rui Jin,
Yuman Gao,
Yingjian Wang,
Haojian Lu,
Fei Gao
Abstract:
Active reconstruction technique enables robots to autonomously collect scene data for full coverage, relieving users from tedious and time-consuming data capturing process. However, designed based on unsuitable scene representations, existing methods show unrealistic reconstruction results or the inability of online quality evaluation. Due to the recent advancements in explicit radiance field tech…
▽ More
Active reconstruction technique enables robots to autonomously collect scene data for full coverage, relieving users from tedious and time-consuming data capturing process. However, designed based on unsuitable scene representations, existing methods show unrealistic reconstruction results or the inability of online quality evaluation. Due to the recent advancements in explicit radiance field technology, online active high-fidelity reconstruction has become achievable. In this paper, we propose GS-Planner, a planning framework for active high-fidelity reconstruction using 3D Gaussian Splatting. With improvement on 3DGS to recognize unobserved regions, we evaluate the reconstruction quality and completeness of 3DGS map online to guide the robot. Then we design a sampling-based active reconstruction strategy to explore the unobserved areas and improve the reconstruction geometric and textural quality. To establish a complete robot active reconstruction system, we choose quadrotor as the robotic platform for its high agility. Then we devise a safety constraint with 3DGS to generate executable trajectories for quadrotor navigation in the 3DGS map. To validate the effectiveness of our method, we conduct extensive experiments and ablation studies in highly realistic simulation scenes.
△ Less
Submitted 24 May, 2024; v1 submitted 16 May, 2024;
originally announced May 2024.