Search | arXiv e-print repository

LLM4Drive: A Survey of Large Language Models for Autonomous Driving

Authors: Zhenjie Yang, Xiaosong Jia, Hongyang Li, Junchi Yan

Abstract: Autonomous driving technology, a catalyst for revolutionizing transportation and urban mobility, has the tendency to transition from rule-based systems to data-driven strategies. Traditional module-based systems are constrained by cumulative errors among cascaded modules and inflexible pre-set rules. In contrast, end-to-end autonomous driving systems have the potential to avoid error accumulation due to their fully data-driven training process, although they often lack transparency due to their "black box" nature, complicating the validation and traceability of decisions. Recently, large language models (LLMs) have demonstrated abilities including understanding context, logical reasoning, and generating answers. A natural thought is to utilize these abilities to empower autonomous driving. Combining LLMs with foundation vision models could open the door to open-world understanding, reasoning, and few-shot learning, which current autonomous driving systems are lacking. In this paper, we systematically review a research line about Large Language Models for Autonomous Driving (LLM4AD). This study evaluates the current state of technological advancements, distinctly outlining the principal challenges and prospective directions for the field. For the convenience of researchers in academia and industry, we provide real-time updates on the latest advances in the field as well as relevant open-source resources via the designated link: https://github.com/Thinklab-SJTU/Awesome-LLM4AD.

Submitted 29 December, 2023; v1 submitted 2 November, 2023; originally announced November 2023.

Comments: GitHub Repo: https://github.com/Thinklab-SJTU/Awesome-LLM4AD

arXiv:2311.01003 [pdf, other]

Minimum Snap Trajectory Generation and Control for an Under-actuated Flapping Wing Aerial Vehicle

Authors: Chen Qian, Rui Chen, Peiyao Shen, Yongchun Fang, Jifu Yan, Tiefeng Li

Abstract: This paper presents both the trajectory generation and tracking control strategies for an underactuated flapping wing aerial vehicle (FWAV). First, the FWAV dynamics are analyzed from a practical perspective. Then, based on these analyses, we demonstrate the differential flatness of the FWAV system, and develop a general-purpose trajectory generation strategy. Subsequently, the trajectory tracking controller is developed with the help of robust control and switch control techniques. After that, the overall system asymptotic stability is guaranteed by Lyapunov stability analysis. To make the controller applicable in real flight, we also provide several instructions. Finally, a series of experimental results demonstrates the successful implementation of the proposed trajectory generation strategy and tracking control strategy. This work is the first to achieve the closed-loop integration of trajectory generation and control for real 3-dimensional flight of an underactuated FWAV at a practical level.

Submitted 2 November, 2023; originally announced November 2023.

arXiv:2311.00939 [pdf, other]

Accelerated Data-Driven Discovery and Screening of Two-Dimensional Magnets Using Graph Neural Networks

Authors: Ahmed Elrashidy, James Della-Giustina, Jia-An Yan

Abstract: In this study, we employ Graph Neural Networks (GNNs) to accelerate the discovery of novel 2D magnetic materials which have transformative potential in spintronics applications. Using data from the Materials Project database and the Computational 2D materials database (C2DB), we train three GNN architectures on a dataset of 1190 magnetic monolayers with energy above the convex hull ($E_{\text{hull}}$) less than 0.3 eV/atom. Our Crystal Diffusion Variational Auto Encoder (CDVAE) generates 11,100 candidate crystals. Subsequent training on two Atomistic Line Graph Neural Networks (ALIGNN) achieves a 93$\%$ accuracy in predicting magnetic monolayers and a mean average error of 0.039 eV/atom for $E_{\text{hull}}$ predictions. After narrowing down candidates based on magnetic likelihood and predicted energy, constraining the atom count in the monolayers to five or fewer, and performing dimensionality checks, we identify 190 candidates. These are validated using Density-Functional Theory (DFT) to confirm their magnetic and energetic favorability resulting in 167 magnetic monolayers with $E_{\text{hull}} < 0.3$ eV/atom and a total magnetization of $\geq$ $0.5 μ_{B}$. Our methodology offers a way to accelerate exploring and predicting potential 2D magnetic materials, contributing to the ongoing computational and experimental efforts aimed at the discovery of new 2D magnets.

Submitted 5 February, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

Comments: 44 pages, 12 Figures

arXiv:2311.00447 [pdf, other]

On the Opportunities of Green Computing: A Survey

Authors: You Zhou, Xiujing Lin, Xiang Zhang, Maolin Wang, Gangwei Jiang, Huakang Lu, Yupeng Wu, Kai Zhang, Zhe Yang, Kehang Wang, Yongduo Sui, Fengwei Jia, Zuoli Tang, Yao Zhao, Hongxuan Zhang, Tiannuo Yang, Weibo Chen, Yunong Mao, Yi Li, De Bao, Yu Li, Hongrui Liao, Ting Liu, Jingwen Liu, Jinchi Guo, et al. (16 additional authors not shown)

Abstract: Artificial Intelligence (AI) has achieved significant advancements in technology and research over several decades of development, and is widely used in many areas including computer vision, natural language processing, time-series analysis, speech synthesis, etc. During the age of deep learning, especially with the rise of Large Language Models, much of researchers' attention is devoted to pursuing new state-of-the-art (SOTA) results, resulting in ever-increasing model size and computational complexity. The need for high computing power brings higher carbon emissions and undermines research fairness by preventing small or medium-sized research institutions and companies with limited funding from participating in research. To tackle the challenges of computing resources and environmental impact of AI, Green Computing has become a hot research topic. In this survey, we give a systematic overview of the technologies used in Green Computing. We propose the framework of Green Computing and divide it into four key components: (1) Measures of Greenness, (2) Energy-Efficient AI, (3) Energy-Efficient Computing Systems and (4) AI Use Cases for Sustainability. For each component, we discuss the research progress made and the commonly used techniques to optimize AI efficiency. We conclude that this new research direction has the potential to address the conflicts between resource constraints and AI development. We encourage more researchers to pay attention to this direction and make AI more environmentally friendly.

Submitted 8 November, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

Comments: 113 pages, 18 figures

arXiv:2311.00110 [pdf, ps, other]

Degree sequences of triangular multigraphs

Authors: John Talbot, Jun Yan

Abstract: A simple graph is triangular if every edge is contained in a triangle. A sequence of integers is graphical if it is the degree sequence of a simple graph. Egan and Nikolayevsky recently conjectured that every graphical sequence whose terms are all at least 4 is the degree sequence of a triangular simple graph, and proved this in some special cases. In this paper we state and prove the analogous version of this conjecture for multigraphs.

Submitted 31 October, 2023; originally announced November 2023.

Comments: 8 pages, 3 figures. Submitted to The Electronic Journal of Combinatorics

MSC Class: 05C07

arXiv:2311.00078 [pdf, other]

Experimental Evidence for Non-spherical Magnetic Form Factor in Ru$^{3+}$

Authors: Colin L. Sarkis, John W. Villanova, Casey Eichstaedt, Adolfo G. Eguiluz, Jaime A. Fernandez-Baca, Masaaki Matsuda, Jiaqiang Yan, Christian Balz, Arnab Banerjee, D. Alan Tennant, Tom Berlijn, Stephen E. Nagler

Abstract: The Mott insulator $α$-RuCl$_3$ has generated great interest in the community due to its possible field-induced Kitaev quantum spin liquid state. Despite enormous effort spent trying to obtain the form of the low energy Hamiltonian, there is currently no agreed upon set of parameters which is able to explain all of the data. A key piece of missing information lies in the determination of the magnetic form factor of Ru$^{3+}$, particularly for a true quantitative treatment of inelastic neutron scattering data. Here we present the experimentally derived magnetic form factor of Ru$^{3+}$ in the low spin 4$d^5$ state using polarized neutron diffraction within the paramagnetic regime on high quality single crystals of $α$-RuCl$_3$. We observe strong evidence of an anisotropic form factor, expected of the spin-orbit coupled $j_{\textrm{eff}} = \frac{1}{2}$ ground state. We model the static magnetization density in increasing complexity from simple isotropic cases, to a multipolar expansion, and finally \emph{ab initio} calculations of the generalized $j_{\textrm{eff}} = \frac{1}{2}$ ground state. Comparison of both single ion models and inclusion of Cl$^-$ anions support the presence of hybridization of Ru$^{3+}$ with the surrounding Cl$^{-}$ ligands.

Submitted 31 October, 2023; originally announced November 2023.

arXiv:2310.20461 [pdf, ps, other]

Ramsey numbers of bounded degree trees versus general graphs

Authors: Richard Montgomery, Matías Pavez-Signé, Jun Yan

Abstract: For every $k\ge 2$ and $Δ$, we prove that there exists a constant $C_{Δ,k}$ such that the following holds. For every graph $H$ with $χ(H)=k$ and every tree with at least $C_{Δ,k}|H|$ vertices and maximum degree at most $Δ$, the Ramsey number $R(T,H)$ is $(k-1)(|T|-1)+σ(H)$, where $σ(H)$ is the size of a smallest colour class across all proper $k$-colourings of $H$. This is tight up to the value of $C_{Δ,k}$, and confirms a conjecture of Balla, Pokrovskiy, and Sudakov.

Submitted 31 October, 2023; originally announced October 2023.

arXiv:2310.19735 [pdf, other]

Magnetic Stability, Fermi Surface Topology, and Spin-Correlated Dielectric Response in Monolayer 1T-CrTe2

Authors: Ahmed Elrashidy, Jia-An Yan

Abstract: We have carried out density-functional theory (DFT) calculations to study the magnetic stability of both ferromagnetic (FM) and anti-ferromagnetic (AFM) states in monolayer 1T-CrTe2. Our results show that the AFM order is lower in energy and thus is the ground state. By tuning the lattice parameters, the AFM order can transition to the FM order, in good agreement with experimental observation. We observe a commensurate SDW alongside the previously predicted CDW, and attribute the AFM order to the SDW. This results in distinct hole and electron Fermi pockets and a pronounced optical anisotropy, suggesting quasi-one-dimensional behavior in this material.

Submitted 30 October, 2023; originally announced October 2023.

arXiv:2310.19651 [pdf, other]

Dynamics of Instruction Tuning: Each Ability of Large Language Models Has Its Own Growth Pace

Authors: Chiyu Song, Zhanchao Zhou, Jianhao Yan, Yuejiao Fei, Zhenzhong Lan, Yue Zhang

Abstract: Instruction tuning is a burgeoning method to elicit the general intelligence of Large Language Models (LLMs). However, the creation of instruction data is still largely heuristic, leading to significant variation in quantity and quality across existing datasets. While some research advocates for expanding the number of instructions, others suggest that a small set of well-chosen examples is adequate. To better understand data construction guidelines, our research provides a granular analysis of how data volume, parameter size, and data construction methods influence the development of each underlying ability of LLM, such as creative writing, code generation, and logical reasoning. We present a meticulously curated dataset with over 40k instances across ten abilities and examine instruction-tuned models with 7b to 33b parameters. Our study reveals three primary findings: (i) Despite the models' overall performance being tied to data and parameter scale, individual abilities have different sensitivities to these factors. (ii) Human-curated data strongly outperforms synthetic data from GPT-4 in efficiency and can constantly enhance model performance with volume increases, but is unachievable with synthetic data. (iii) Instruction data brings powerful cross-ability generalization, as evidenced by out-of-domain evaluations. Furthermore, we demonstrate how these findings can guide more efficient data constructions, leading to practical performance improvements on two public benchmarks.

Submitted 22 February, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

arXiv:2310.19417 [pdf, other]

doi 10.1051/0004-6361/202347622

Multiwavelength observation of 1A 0535+262=HD 245770 from 2010 to 2021

Authors: Wei Liu, Jingzhi Yan, Guangcheng Xiao, Xiukun Li, Bo Gao, Qingzhong Liu

Abstract: Context. 1A 0535+262 is a high-mass X-ray binary that went into a giant X-ray outburst in 2020. During this event, the X-ray luminosity reached the highest value measured over the last 30 years. Aims. Our aim is to study the long-term variability of 1A 0535+262 before and after the 2020 major X-ray outburst and to uncover the mechanism that led to the X-ray outburst. Methods. We used the long-term photometric light curve and the equivalent widths of the H$α$ and He I $λ$6678 lines to monitor the state of the Be star's circumstellar disk. The H$α$ line profiles show evidence for V/R variability, which we revealed by fitting the H$α$ spectral line profiles with two Gaussian functions. In addition, we divided our data into four periods according to the intensity of the X-ray, optical, and infrared emission. Results. The H$α$ line profiles show single-peaked profiles in most cases. This is consistent with the previously reported orbital inclination of ${i}$ = $37^{\circ} \pm 2^{\circ}$. Unlike the H$α$ lines, the He I $\lambda6678$ lines show a maximal intensity in October 2020, which is one month before the giant X-ray outburst in 2020. Based on the behavior of the equivalent widths of the H$α$ and He I $\lambda6678$ lines, and the ${V}$-band magnitude, we find two mass ejection processes from the Be star to the Be disk on MJD 55820 and MJD 56600. The V/R quasi-period is about two years during 2011--2015, which is different from 1994 to 1995. Furthermore, the periods I$\to$II$\to$III$\to$IV in the $(B-V)$ color index versus $V$-band magnitude diagram constitute a cycle. From the behavior of the V/R ratio of H$α$ lines, and the variability of the $V$ band, we believe that the precession of the density perturbation inside the disk is retrograde.

Submitted 30 October, 2023; originally announced October 2023.

Comments: 15 pages, 7 figures. arXiv admin note: text overlap with arXiv:2208.00151

Journal ref: A&A 681, A10 (2024)

arXiv:2310.18444 [pdf, other]

M3C: A Framework towards Convergent, Flexible, and Unsupervised Learning of Mixture Graph Matching and Clustering

Authors: Jiaxin Lu, Zetian Jiang, Tianzhe Wang, Junchi Yan

Abstract: Existing graph matching methods typically assume that there are similar structures between graphs and they are matchable. However, these assumptions do not align with real-world applications. This work addresses a more realistic scenario where graphs exhibit diverse modes, requiring graph grouping before or along with matching, a task termed mixture graph matching and clustering. We introduce Minorize-Maximization Matching and Clustering (M3C), a learning-free algorithm that guarantees theoretical convergence through the Minorize-Maximization framework and offers enhanced flexibility via relaxed clustering. Building on M3C, we develop UM3C, an unsupervised model that incorporates novel edge-wise affinity learning and pseudo label selection. Extensive experimental results on public benchmarks demonstrate that our method outperforms state-of-the-art graph matching and mixture graph matching and clustering approaches in both accuracy and efficiency. Source code will be made publicly available.

Submitted 27 October, 2023; originally announced October 2023.

Comments: 26 pages, 10 figures

arXiv:2310.17082 [pdf, ps, other]

Does or did the supernova remnant Cassiopeia A operate as a PeVatron?

Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (255 additional authors not shown)

Abstract: For decades, supernova remnants (SNRs) have been considered the prime sources of Galactic Cosmic rays (CRs). But whether SNRs can accelerate CR protons to PeV energies and thus dominate CR flux up to the knee is currently under intensive theoretical and phenomenological debate. The direct test of the ability of SNRs to operate as CR PeVatrons can be provided by ultrahigh-energy (UHE; $E_γ\geq 100$ TeV) $γ$-rays. In this context, the historical SNR Cassiopeia A (Cas A) is considered one of the most promising targets for UHE observations. This paper presents the observation of Cas A and its vicinity by the LHAASO KM2A detector. The exceptional sensitivity of LHAASO KM2A in the UHE band, combined with the young age of Cas A, enabled us to derive stringent model-independent limits on the energy budget of UHE protons and nuclei accelerated by Cas A at any epoch after the explosion. The results challenge the prevailing paradigm that Cas A-type SNRs are major suppliers of PeV CRs in the Milky Way.

Submitted 25 October, 2023; originally announced October 2023.

Comments: 11 pages, 3 figures, Accepted by the APJL

arXiv:2310.16567 [pdf, ps, other]

Inertia of partial transpose of positive semidefinite matrices

Authors: Yixuan Liang, Jiahao Yan, Dongran Si, Lin Chen

Abstract: We show that the partial transpose of $9\times 9$ positive semidefinite matrices does not have inertia (4,1,4) and (3,2,4). This solves an open problem in "LINEAR AND MULTILINEAR ALGEBRA, Changchun Feng et al, 2022". We apply our results to construct some inertia, as well as present the list of all possible inertia of partial transpose of $12\times 12$ positive semidefinite matrices.

Submitted 25 October, 2023; originally announced October 2023.

Comments: 20 pages, comments are welcome

arXiv:2310.14769 [pdf]

An introduction to radar Automatic Target Recognition (ATR) technology in ground-based radar systems

Authors: Jiangkun Gong, Jun Yan, Deyong Kong, Deren Li

Abstract: This paper presents a brief examination of Automatic Target Recognition (ATR) technology within ground-based radar systems. It offers a lucid comprehension of the ATR concept, delves into its historical milestones, and categorizes ATR methods according to different scattering regions. By incorporating ATR solutions into radar systems, this study demonstrates the expansion of radar detection ranges and the enhancement of tracking capabilities, leading to superior situational awareness. Drawing insights from the Russo-Ukrainian War, the paper highlights three pressing radar applications that urgently necessitate ATR technology: detecting stealth aircraft, countering small drones, and implementing anti-jamming measures. Anticipating the next wave of radar ATR research, the study predicts a surge in cognitive radar and machine learning (ML)-driven algorithms. These emerging methodologies aspire to confront challenges associated with system adaptation, real-time recognition, and environmental adaptability. Ultimately, ATR stands poised to revolutionize conventional radar systems, ushering in an era of 4D sensing capabilities.

Submitted 23 October, 2023; originally announced October 2023.

arXiv:2310.13903 [pdf, ps, other]

Time periodic and almost periodic viscosity solutions of contact Hamilton-Jacobi equations on $\mathbb{T}^n$

Authors: Kaizhi Wang, Jun Yan, Kai Zhao

Abstract: This paper is concerned with the time periodic viscosity solution problem for a class of evolutionary contact Hamilton-Jacobi equations with time independent Hamiltonians on the torus $\mathbb{T}^n$. Under certain suitable assumptions we show that the equation has a non-trivial $T$-periodic viscosity solution if and only if $T\in D$, where $D$ is a dense subset of $[0,+\infty)$. Moreover, we clarify the structure of $D$. As a consequence, we also study the existence of Bohr almost periodic viscosity solutions.

Submitted 20 October, 2023; originally announced October 2023.

arXiv:2310.10979 [pdf, ps, other]

A New Gauge-Theoretic Construction of 4-Dimensional Hyperkähler ALE Spaces

Authors: Jiajun Yan

Abstract: Non-compact hyperkähler spaces arise frequently in gauge theory. The 4-dimensional hyperkähler ALE spaces are a special class of non-compact hyperkähler spaces. They are in one-to-one correspondence with the finite subgroups of SU(2) and have interesting connections with representation theory and singularity theory, captured by the McKay Correspondence. The 4-dimensional hyperkähler ALE spaces were first classified by Peter Kronheimer via a finite-dimensional hyperkähler reduction. In this paper, we give a new gauge-theoretic construction of these spaces. More specifically, we realize each 4-dimensional hyperkähler ALE space as a moduli space of solutions to a system of equations for a pair consisting of a connection and a section of a vector bundle over an orbifold Riemann surface, modulo a gauge group action. The construction given in this paper parallels Kronheimer's original construction and hence can also be thought of as a gauge-theoretic interpretation of Kronheimer's construction of these spaces.

Submitted 16 October, 2023; originally announced October 2023.

Comments: 32 pages

arXiv:2310.10190 [pdf, other]

Battle of the Large Language Models: Dolly vs LLaMA vs Vicuna vs Guanaco vs Bard vs ChatGPT -- A Text-to-SQL Parsing Comparison

Authors: Shuo Sun, Yuchen Zhang, Jiahuan Yan, Yuze Gao, Donovan Ong, Bin Chen, Jian Su

Abstract: The success of ChatGPT has ignited an AI race, with researchers striving to develop new large language models (LLMs) that can match or surpass the language understanding and generation abilities of commercial ones. In recent times, a number of models have emerged, claiming performance near that of GPT-3.5 or GPT-4 through various instruction-tuning methods. As practitioners of Text-to-SQL parsing, we are grateful for their valuable contributions to open-source research. However, it is important to approach these claims with a sense of scrutiny and ascertain the actual effectiveness of these models. Therefore, we pit six popular large language models against each other, systematically evaluating their Text-to-SQL parsing capability on nine benchmark datasets with five different prompting strategies, covering both zero-shot and few-shot scenarios. Regrettably, the open-sourced models fell significantly short of the performance achieved by closed-source models like GPT-3.5, highlighting the need for further work to bridge the performance gap between these models.

Submitted 16 October, 2023; originally announced October 2023.

Comments: Findings of EMNLP 2023

arXiv:2310.10039 [pdf, other]

TpopT: Efficient Trainable Template Optimization on Low-Dimensional Manifolds

Authors: Jingkai Yan, Shiyu Wang, Xinyu Rain Wei, Jimmy Wang, Zsuzsanna Márka, Szabolcs Márka, John Wright

Abstract: In scientific and engineering scenarios, a recurring task is the detection of low-dimensional families of signals or patterns. A classic family of approaches, exemplified by template matching, aims to cover the search space with a dense template bank. While simple and highly interpretable, it suffers from poor computational efficiency due to unfavorable scaling in the signal space dimensionality. In this work, we study TpopT (TemPlate OPTimization) as an alternative scalable framework for detecting low-dimensional families of signals which maintains high interpretability. We provide a theoretical analysis of the convergence of Riemannian gradient descent for TpopT, and prove that it has a superior dimension scaling to covering. We also propose a practical TpopT framework for nonparametric signal sets, which incorporates techniques of embedding and kernel interpolation, and is further configurable into a trainable network architecture by unrolled optimization. The proposed trainable TpopT exhibits significantly improved efficiency-accuracy tradeoffs for gravitational wave detection, where matched filtering is currently a method of choice. We further illustrate the general applicability of this approach with experiments on handwritten digit data.

Submitted 15 October, 2023; originally announced October 2023.

arXiv:2310.08845 [pdf, other]

doi 10.1126/sciadv.adj2778

Very high energy gamma-ray emission beyond 10 TeV from GRB 221009A

Authors: Zhen Cao, F. Aharonian, Q. An, A. Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

Abstract: The highest energy gamma-rays from gamma-ray bursts (GRBs) have important implications for their radiation mechanism. Here we report for the first time the detection of gamma-rays up to 13 TeV from the brightest GRB 221009A by the Large High Altitude Air-shower Observatory (LHAASO). The LHAASO-KM2A detector registered more than 140 gamma-rays with energies above 3 TeV during 230$-$900 s after the trigger. The intrinsic energy spectrum of gamma-rays can be described by a power-law after correcting for extragalactic background light (EBL) absorption. Such a hard spectrum challenges the synchrotron self-Compton (SSC) scenario of relativistic electrons for the afterglow emission above several TeV. Observations of gamma-rays up to 13 TeV from a source with a measured redshift of z=0.151 hint at more transparency in intergalactic space than previously expected. Alternatively, one may invoke new physics such as Lorentz Invariance Violation (LIV) or an axion origin of very high energy (VHE) signals.

Submitted 22 November, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

Comments: 49 pages, 11 figures

Journal ref: Science Advances, 9, eadj2778 (2023) 15 November 2023

arXiv:2310.06756 [pdf, other]

Going Beyond Neural Network Feature Similarity: The Network Feature Complexity and Its Interpretation Using Category Theory

Authors: Yiting Chen, Zhanpeng Zhou, Junchi Yan

Abstract: The behavior of neural networks still remains opaque, and a recently widely noted phenomenon is that networks often achieve similar performance when initialized with different random parameters. This phenomenon has attracted significant attention in measuring the similarity between features learned by distinct networks. However, feature similarity could be vague in describing the same feature since equivalent features hardly exist. In this paper, we expand the concept of equivalent feature and provide the definition of what we call functionally equivalent features. These features produce equivalent output under certain transformations. Using this definition, we aim to derive a more intrinsic metric for the so-called feature complexity regarding the redundancy of features learned by a neural network at each layer. We offer a formal interpretation of our approach through the lens of category theory, a well-developed area in mathematics. To quantify the feature complexity, we further propose an efficient algorithm named Iterative Feature Merging (IFM). Our experimental results validate our ideas and theories from various perspectives. We empirically demonstrate that functional equivalence widely exists among different features learned by the same neural network and that we could reduce the number of parameters of the network without affecting the performance. The IFM shows great potential as a data-agnostic model pruning method. We have also drawn several interesting empirical findings regarding the defined feature complexity.

Submitted 26 November, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

arXiv:2310.06594 [pdf, other]

On the Evaluation and Refinement of Vision-Language Instruction Tuning Datasets

Authors: Ning Liao, Shaofeng Zhang, Renqiu Xia, Min Cao, Yu Qiao, Junchi Yan

Abstract: There is an emerging line of research on multimodal instruction tuning, and a line of benchmarks has been proposed for evaluating these models recently. Instead of evaluating the models directly, in this paper, we try to evaluate the Vision-Language Instruction-Tuning (VLIT) datasets. Also, we seek a way of building a dataset for developing an all-powerful VLIT model, which we believe could also be of utility for establishing a grounded protocol for benchmarking VLIT models. For the effective evaluation of VLIT datasets, which remains an open question, we propose a tune-cross-evaluation paradigm: tuning on one dataset and evaluating on the others in turn. For each single tune-evaluation experiment set, we define the Meta Quality (MQ) as the mean score obtained by a set of caption metrics including BLEU, METEOR, and ROUGE-L to quantify the quality of a certain dataset or a sample. On this basis, to evaluate the comprehensiveness of a dataset, we develop the Dataset Quality (DQ) covering all tune-evaluation sets. To lay the foundation for building a comprehensive dataset and developing an all-powerful model for practical applications, we define the Sample Quality (SQ) to quantify the all-sided quality of each sample. Extensive experiments validate the rationality of the proposed evaluation paradigm. Based on the holistic evaluation, we build a new dataset, REVO-LION (REfining VisiOn-Language InstructiOn tuNing), by collecting samples with higher SQ from each dataset. Remarkably, even with only half of the complete data, the model trained on REVO-LION can achieve performance comparable to simply adding all VLIT datasets up. Furthermore, REVO-LION not only facilitates the development of a powerful model but also incorporates an evaluation set, which is designed to serve as a convenient benchmark for future research in the field.

Submitted 29 December, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

arXiv:2310.06417 [pdf, other]

Advective Diffusion Transformers for Topological Generalization in Graph Learning

Authors: Qitian Wu, Chenxiao Yang, Kaipeng Zeng, Fan Nie, Michael Bronstein, Junchi Yan

Abstract: Graph diffusion equations are intimately related to graph neural networks (GNNs) and have recently attracted attention as a principled framework for analyzing GNN dynamics, formalizing their expressive power, and justifying architectural choices. One key open question in graph learning is the generalization capability of GNNs. A major limitation of current approaches hinges on the assumption that the graph topologies in the training and test sets come from the same distribution. In this paper, we make steps towards understanding the generalization of GNNs by exploring how graph diffusion equations extrapolate and generalize in the presence of varying graph topologies. We first show deficiencies in the generalization capability of existing models built upon local diffusion on graphs, stemming from the exponential sensitivity to topology variation. Our subsequent analysis reveals the promise of non-local diffusion, which advocates for feature propagation over fully-connected latent graphs, under the assumption of a specific data-generating condition. In addition to these findings, we propose a novel graph encoder backbone, Advective Diffusion Transformer (ADiT), inspired by advective graph diffusion equations that have a closed-form solution backed up with theoretical guarantees of desired generalization under topological distribution shifts. The new model, functioning as a versatile graph Transformer, demonstrates superior performance across a wide range of graph learning tasks.

Submitted 10 October, 2023; originally announced October 2023.

Comments: 39 pages

arXiv:2310.05999 [pdf]

Two stage Robust Nash Bargaining based Energy Trading between Hydrogen-enriched Gas and Active Distribution Networks

Authors: Wenwen Zhang, Gao Qiu, Hongjun Gao, Tingjian Liu, Junyong Liu, Yaping Li, Shengchun Yang, Jiahao Yan, Wenbo Mao

Abstract: Integration of emerging hydrogen-enriched compressed natural gas (HCNG) distribution network with active distribution network (ADN) provides huge latent flexibility on consuming renewable energies. However, paucity of energy trading mechanism risks the stable earnings of the flexibility for both entities, especially when rising highly-efficient solid oxide fuel cells (SOFCs) are pioneered to interface gas and electricity. To fill the gap, a two-stage robust Nash bargaining strategy is proposed. In the first stage, a privacy-preserved Nash Bargaining based on the ADMM is applied to clear energy trading between the two autonomous entities, i.e., ADN and gas distribution network (GDN). Via robust dispatch of configured energy storage in ADN, the next stage de-risks ADN profit collapse from transaction biases, caused by forecasting errors of distributed energy resources. C&CG is finally utilized to loop the two stages. The convergence of the entire energy trading strategy is theoretically proved. As such, sustainable returns from the integration of ADN and GDN bridged by SOFC and HCNG are facilitated. Numerical studies indicate that the proposed cooperative strategy reaps a stable social welfare of nearly 1.6% to total cost, and benefit-steady situations for both ADN and GDN, even in the worst case.

Submitted 22 May, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

arXiv:2310.05105 [pdf, other]

How Graph Neural Networks Learn: Lessons from Training Dynamics

Authors: Chenxiao Yang, Qitian Wu, David Wipf, Ruoyu Sun, Junchi Yan

Abstract: A long-standing goal in deep learning has been to characterize the learning behavior of black-box models in a more interpretable manner. For graph neural networks (GNNs), considerable advances have been made in formalizing what functions they can represent, but whether GNNs will learn desired functions during the optimization process remains less clear. To fill this gap, we study their training dynamics in function space. In particular, we find that the gradient descent optimization of GNNs implicitly leverages the graph structure to update the learned function, as can be quantified by a phenomenon which we call \emph{kernel-graph alignment}. We provide theoretical explanations for the emergence of this phenomenon in the overparameterized regime and empirically validate it on real-world GNNs. This finding offers new interpretable insights into when and why the learned GNN functions generalize, highlighting their limitations in heterophilic graphs. Practically, we propose a parameter-free algorithm that directly uses a sparse matrix (i.e. graph adjacency) to update the learned function. We demonstrate that this embarrassingly simple approach can be as effective as GNNs while being orders-of-magnitude faster.

Submitted 18 June, 2024; v1 submitted 8 October, 2023; originally announced October 2023.

Comments: Accepted to ICML 2024

arXiv:2310.03917 [pdf, other]

Anisotropy of thermal conductivity oscillations in relation to the Kitaev spin liquid phase

Authors: Heda Zhang, Hu Miao, Thomas Z Ward, David G Mandrus, Stephen E Nagler, Michael A McGuire, Jiaqiang Yan

Abstract: In the presence of an external magnetic field, the Kitaev model could host either gapped topological anyons or gapless Majorana fermions. In $α$-RuCl$_3$, the gapped and gapless cases are only separated by a thirty-degree rotation of the in-plane magnetic field vector. The presence/absence of the spectral gap is key for understanding the thermal transport behavior in $α$-RuCl$_3$. Here, we study the anisotropy of the oscillatory features of thermal conductivity in $α$-RuCl$_3$. We examine the oscillatory features of thermal conductivities (k//a, k//b) with fixed external fields and find distinct behavior for the gapped (B//a) and gapless (B//b) scenarios. Furthermore, we track the evolution of thermal resistivity ($λ_{a}$) and its oscillatory features with the rotation of in-plane magnetic fields from B//b to B//a. The thermal resistivity $λ(B,θ)$ displays distinct rotational symmetries before and after the emergence of the field induced Kitaev spin liquid phase. These experimental data suggest close correlations between the oscillatory features of thermal conductivity, the underlying Kitaev spin liquid phase and the fermionic excitation it holds.

Submitted 5 October, 2023; originally announced October 2023.

arXiv:2310.02965 [pdf, other]

Structure transition and zigzag magnetic order in Ir/Rh-substituted honeycomb lattice RuCl3

Authors: Zachary Morgan, Iris Ye, Colin L. Sarkis, Xiaoping Wang, Stephen Nagler, Jiaqiang Yan

Abstract: We report magnetization and neutron diffraction studies on crystal and magnetic structures of Ir- and Rh-substituted honeycomb lattice $α$-RuCl$_3$. The iridium or rhodium atoms are distributed at the Ru site with little structural modification. Both systems undergo a room-temperature monoclinic $C2/m$ to low-temperature trigonal $R\bar{3}$ phase transformation with a large recoverable hysteresis. At low temperature, a zigzag spin order is observed with the same characteristic wavevector $(0,0.5,1)$ as in the parent $α$-RuCl$_3$. Detailed magnetic structure refinement reveals an ordered moment of $\rm 0.32(5) μ_B/Ru$ and an upper boundary of canting angle of $15(4)^\circ$ away from the basal plane at 5~K for the 10\% Ir-substituted $α$-RuCl$_3$, which is different from the 0.45-0.73~$\rm μ_B/Ru$ and $32^\circ$-$48^\circ$ canting angle reported in the parent compound $α$-RuCl$_3$. The observation of unchanged RuCl$_6$ local octahedral environment, reduced magnetic moment size and canting angle highlights the potential to study quantum spin liquid behavior through non-magnetic ion doping.

Submitted 4 October, 2023; originally announced October 2023.

Comments: 10 pages, 9 figures

arXiv:2310.00297 [pdf, other]

Understanding In-Context Learning from Repetitions

Authors: Jianhao Yan, Jin Xu, Chiyu Song, Chenming Wu, Yafu Li, Yue Zhang

Abstract: This paper explores the elusive mechanism underpinning in-context learning in Large Language Models (LLMs). Our work provides a novel perspective by examining in-context learning via the lens of surface repetitions. We quantitatively investigate the role of surface features in text generation, and empirically establish the existence of \emph{token co-occurrence reinforcement}, a principle that strengthens the relationship between two tokens based on their contextual co-occurrences. By investigating the dual impacts of these features, our research illuminates the internal workings of in-context learning and expounds on the reasons for its failures. This paper provides an essential contribution to the understanding of in-context learning and its potential limitations, providing a fresh perspective on this exciting capability.

Submitted 21 February, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

Comments: Accepted by ICLR 2024. Updated with new experiments and results

arXiv:2309.17378 [pdf]

doi 10.1103/PhysRevX.14.011040

Pressure-induced superconductivity in polycrystalline La3Ni2O7

Authors: Gang Wang, Ningning Wang, Jun Hou, Liang Ma, Lifen Shi, Zhian Ren, Yadong Gu, Xiaoling Shen, Hanming Ma, Pengtao Yang, Ziyi Liu, Haizhong Guo, Jianping Sun, Guangming Zhang, Jiaqiang Yan, Bosen Wang, Yoshiya Uwatoko, Jinguang Cheng

Abstract: We synthesized polycrystalline La3Ni2O7 samples by using the sol-gel method without post-annealing under high oxygen pressure, and then measured temperature-dependent resistivity under various hydrostatic pressures up to 14.5 GPa in a cubic anvil cell apparatus. We find that the density-wave-like anomaly in resistivity is progressively suppressed with increasing pressure and the resistivity drop corresponding to the onset of superconductivity emerges at pressure as low as 7 GPa. Zero resistivity is achieved at 9 GPa below 6.6 K, which increases quickly with pressure to 35.6 K at 14.5 GPa. The observation of zero-resistance state in the polycrystalline La3Ni2O7 samples under high pressures not only corroborates the recent report of superconductivity in the pressurized La3Ni2O7 crystals but also facilitates further studies on this emerging family of nickelate high-Tc superconductors.

Submitted 3 October, 2023; v1 submitted 29 September, 2023; originally announced September 2023.

Comments: 12 pages, 4 figures

Report number: Physical Review X (Featured in Physics) 14, 011040 (2024)

Journal ref: Physical Review X 14, 011040 (2024)

arXiv:2309.17283 [pdf, other]

The Blessings of Multiple Treatments and Outcomes in Treatment Effect Estimation

Authors: Yong Wu, Mingzhou Liu, Jing Yan, Yanwei Fu, Shouyan Wang, Yizhou Wang, Xinwei Sun

Abstract: Assessing causal effects in the presence of unobserved confounding is a challenging problem. Existing studies leveraged proxy variables or multiple treatments to adjust for the confounding bias. In particular, the latter approach attributes the impact on a single outcome to multiple treatments, allowing estimating latent variables for confounding control. Nevertheless, these methods primarily focus on a single outcome, whereas in many real-world scenarios, there is greater interest in studying the effects on multiple outcomes. Besides, these outcomes are often coupled with multiple treatments. Examples include the intensive care unit (ICU), where health providers evaluate the effectiveness of therapies on multiple health indicators. To accommodate these scenarios, we consider a new setting dubbed multiple treatments and multiple outcomes. We then show that parallel studies of multiple outcomes involved in this setting can assist each other in causal identification, in the sense that we can exploit other treatments and outcomes as proxies for each treatment effect under study. We proceed with a causal discovery method that can effectively identify such proxies for causal estimation. The utility of our method is demonstrated in synthetic data and sepsis disease.

Submitted 14 October, 2023; v1 submitted 29 September, 2023; originally announced September 2023.

Comments: Preprint, under review

arXiv:2309.15415 [pdf]

Formation Wing-Beat Modulation (FWM): A Tool for Quantifying Bird Flocks Using Radar Micro-Doppler Signals

Authors: Jiangkun Gong, Jun Yan, Deyong Kong, Ruizhi Chen, Deren Li

Abstract: Radar echoes from bird flocks contain modulation signals, which we find are produced by the flapping gaits of birds in the flock, resulting in a group of spectral peaks with similar amplitudes spaced at a specific interval. We call this the formation wing-beat modulation (FWM) effect. FWM signals are micro-Doppler modulated by flapping wings and are related to the bird number, wing-beat frequency, and flight phasing strategy. Our X-band radar data show that FWM signals exist in radar signals of a seagull flock, providing tools for quantifying the bird number and estimating the mean wingbeat rate of birds. This new finding could aid in research on the quantification of bird migration numbers and estimation of bird flight behavior in radar ornithology and aero-ecology.

Submitted 27 September, 2023; originally announced September 2023.

arXiv:2309.11268 [pdf, other]

StructChart: Perception, Structuring, Reasoning for Visual Chart Understanding

Authors: Renqiu Xia, Bo Zhang, Haoyang Peng, Hancheng Ye, Xiangchao Yan, Peng Ye, Botian Shi, Yu Qiao, Junchi Yan

Abstract: Charts are common in literature across different scientific fields, conveying rich information easily accessible to readers. Current chart-related tasks focus on either chart perception, which refers to extracting information from the visual charts, or performing reasoning given the extracted data, e.g. in a tabular form. In this paper, we aim to establish a unified and label-efficient learning paradigm for joint perception and reasoning tasks, which can be generally applicable to different downstream tasks, beyond the question-answering task as specifically studied in peer works. Specifically, StructChart first reformulates the chart information from the popular tabular form (specifically linearized CSV) to the proposed Structured Triplet Representations (STR), which is more friendly for reducing the task gap between chart perception and reasoning due to the employed structured information extraction for charts. We then propose a Structuring Chart-oriented Representation Metric (SCRM) to quantitatively evaluate the performance for the chart perception task. To enrich the dataset for training, we further explore the possibility of leveraging the Large Language Model (LLM), enhancing the chart diversity in terms of both chart visual style and its statistical information. Extensive experiments are conducted on various chart-related tasks, demonstrating the effectiveness and promising potential for a unified chart perception-reasoning paradigm to push the frontier of chart understanding.

Submitted 18 February, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

Comments: SimChart9K is available for downloading at: https://github.com/UniModal4Reasoning/SimChart9K; 26 pages, 15 figures

arXiv:2309.11042 [pdf, other]

Making Small Language Models Better Multi-task Learners with Mixture-of-Task-Adapters

Authors: Yukang Xie, Chengyu Wang, Junbing Yan, Jiyong Zhou, Feiqi Deng, Jun Huang

Abstract: Recently, Large Language Models (LLMs) have achieved amazing zero-shot learning performance over a variety of Natural Language Processing (NLP) tasks, especially for text generative tasks. Yet, the large size of LLMs often leads to the high computational cost of model training and online deployment. In our work, we present ALTER, a system that effectively builds the multi-tAsk Learners with mixTure-of-task-adaptERs upon small language models (with <1B parameters) to address multiple NLP tasks simultaneously, capturing the commonalities and differences between tasks, in order to support domain-specific applications. Specifically, in ALTER, we propose the Mixture-of-Task-Adapters (MTA) module as an extension to the transformer architecture for the underlying model to capture the intra-task and inter-task knowledge. A two-stage training method is further proposed to optimize the collaboration between adapters at a small computational cost. Experimental results over a mixture of NLP tasks show that our proposed MTA architecture and the two-stage training method achieve good performance. Based on ALTER, we have also produced MTA-equipped language models for various domains.

Submitted 19 September, 2023; originally announced September 2023.

arXiv:2309.10527 [pdf, other]

SPOT: Scalable 3D Pre-training via Occupancy Prediction for Learning Transferable 3D Representations

Authors: Xiangchao Yan, Runjian Chen, Bo Zhang, Hancheng Ye, Renqiu Xia, Jiakang Yuan, Hongbin Zhou, Xinyu Cai, Botian Shi, Wenqi Shao, Ping Luo, Yu Qiao, Tao Chen, Junchi Yan

Abstract: Annotating 3D LiDAR point clouds for perception tasks is fundamental for many applications, e.g., autonomous driving, yet it still remains notoriously labor-intensive. The pretraining-finetuning approach can alleviate the labeling burden by fine-tuning a pre-trained backbone across various downstream datasets as well as tasks. In this paper, we propose SPOT, namely Scalable Pre-training via Occupancy prediction for learning Transferable 3D representations under such a label-efficient fine-tuning paradigm. SPOT achieves effectiveness on various public datasets with different downstream tasks, showcasing its general representation power, cross-domain robustness and data scalability, which are three key factors for real-world application. Specifically, we both theoretically and empirically show, for the first time, that general representation learning can be achieved through the task of occupancy prediction. Then, to address the domain gap caused by different LiDAR sensors and annotation methods, we develop a beam re-sampling technique for point cloud augmentation combined with a class-balancing strategy. Furthermore, scalable pre-training is observed, that is, the downstream performance across all the experiments gets better with more pre-training data. Additionally, such a pre-training strategy also remains compatible with unlabeled data. The hope is that our findings will facilitate the understanding of LiDAR points and pave the way for future advancements in LiDAR pre-training.

Submitted 25 July, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

Comments: 15 pages, 8 figures, Code is available at https://github.com/PJLab-ADG/3DTrans

arXiv:2309.09512 [pdf]

Extrinsic nonlinear Kerr rotation in topological materials under a magnetic field

Authors: Shuang Wu, Zaiyao Fei, Zeyuan Sun, Yangfan Yi, Wei Xia, Dayu Yan, Yanfeng Guo, Youguo Shi, Jiaqiang Yan, David H. Cobden, Wei-Tao Liu, Xiaodong Xu, Shiwei Wu

Abstract: Topological properties in quantum materials are often governed by symmetry and tuned by crystal structure and external fields, and hence symmetry-sensitive nonlinear optical measurements in a magnetic field are a valuable probe. Here we report nonlinear magneto-optical second harmonic generation (SHG) studies of non-magnetic topological materials including bilayer WTe2, monolayer WSe2 and bulk TaAs. The polarization-resolved patterns of optical SHG under magnetic field show nonlinear Kerr rotation in these time-reversal symmetric materials. For materials with three-fold rotational symmetric lattice structure, the SHG polarization pattern rotates just slightly in a magnetic field, whereas in those with mirror or two-fold rotational symmetry the SHG polarization pattern rotates greatly and distorts. These different magneto-SHG characters can be understood by considering the superposition of the magnetic field-induced time-noninvariant nonlinear optical tensor and the crystal-structure-based time-invariant counterpart. The situation is further clarified by scrutinizing the Faraday rotation, whose subtle interplay with crystal symmetry accounts for the diverse behavior of the extrinsic nonlinear Kerr rotation in different materials. Our work illustrates the application of magneto-SHG techniques to directly probe nontrivial topological properties, and underlines the importance of minimizing extrinsic nonlinear Kerr rotation in polarization-resolved magneto-optical studies.

Submitted 18 September, 2023; originally announced September 2023.

Comments: 25 pages, 6 figures

arXiv:2309.08888 [pdf, other]

GCL: Gradient-Guided Contrastive Learning for Medical Image Segmentation with Multi-Perspective Meta Labels

Authors: Yixuan Wu, Jintai Chen, Jiahuan Yan, Yiheng Zhu, Danny Z. Chen, Jian Wu

Abstract: Since annotating medical images for segmentation tasks commonly incurs expensive costs, it is highly desirable to design an annotation-efficient method to alleviate the annotation burden. Recently, contrastive learning has exhibited a great potential in learning robust representations to boost downstream tasks with limited labels. In medical imaging scenarios, ready-made meta labels (i.e., specific attribute information of medical images) inherently reveal semantic relationships among images, which have been used to define positive pairs in previous work. However, the multi-perspective semantics revealed by various meta labels are usually incompatible and can incur intractable "semantic contradiction" when combining different meta labels. In this paper, we tackle the issue of "semantic contradiction" in a gradient-guided manner using our proposed Gradient Mitigator method, which systematically unifies multi-perspective meta labels to enable a pre-trained model to attain a better high-level semantic recognition ability. Moreover, we emphasize that the fine-grained discrimination ability is vital for segmentation-oriented pre-training, and develop a novel method called Gradient Filter to dynamically screen pixel pairs with the most discriminating power based on the magnitude of gradients. Comprehensive experiments on four medical image segmentation datasets verify that our new method GCL: (1) learns informative image representations and considerably boosts segmentation performance with limited labels, and (2) shows promising generalizability on out-of-distribution datasets.

Submitted 16 September, 2023; originally announced September 2023.

arXiv:2309.08276

A New Adaptive Phase-locked Loop for Synchronization of a Grid-Connected Voltage Source Converter: Simulation and Experimental Results

Authors: Wei He, Jiachen Yan, Romeo Ortega, Daniele Zonetti, Wangping Zhou

Abstract: In [1] a new adaptive phase-locked loop scheme for synchronization of a grid connected voltage source converter with guaranteed (almost) global stability properties was reported. To guarantee a suitable synchronization with the angle of the three-phase grid voltage we design an adaptive observer for such a signal requiring measurements only at the point of common coupling. An interesting feature of this scheme is the ability to synchronize in the challenging condition of connection with a grid with reduced short-circuit ratio. In this paper we present some simulation and experimental illustration of the excellent performance of the proposed solution.

Submitted 30 October, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

Comments: Something needs to be modified so that this paper is more clear

arXiv:2309.07394 [pdf, other]

doi 10.1109/TMI.2023.3309971

Nucleus-aware Self-supervised Pretraining Using Unpaired Image-to-image Translation for Histopathology Images

Authors: Zhiyun Song, Penghui Du, Junpeng Yan, Kailu Li, Jianzhong Shou, Maode Lai, Yubo Fan, Yan Xu

Abstract: Self-supervised pretraining attempts to enhance model performance by obtaining effective features from unlabeled data, and has demonstrated its effectiveness in the field of histopathology images. Despite its success, few works concentrate on the extraction of nucleus-level information, which is essential for pathologic analysis. In this work, we propose a novel nucleus-aware self-supervised pretraining framework for histopathology images. The framework aims to capture the nuclear morphology and distribution information through unpaired image-to-image translation between histopathology images and pseudo mask images. The generation process is modulated by both conditional and stochastic style representations, ensuring the reality and diversity of the generated histopathology images for pretraining. Further, an instance segmentation guided strategy is employed to capture instance-level information. The experiments on 7 datasets show that the proposed pretraining method outperforms supervised ones on Kather classification, multiple instance learning, and 5 dense-prediction tasks with the transfer learning protocol, and yields superior results than other self-supervised approaches on 8 semi-supervised tasks. Our project is publicly available at https://github.com/zhiyuns/UNITPathSSL.

Submitted 13 September, 2023; originally announced September 2023.

arXiv:2309.05656 [pdf]

Atomistic Control in Molecular Beam Epitaxy Growth of Intrinsic Magnetic Topological Insulator MnBi2Te4

Authors: Hyunsue Kim, Mengke Liu, Lisa Frammolino, Yanxing Li, Fan Zhang, Woojoo Lee, Chengye Dong, Yi-Fan Zhao, Guan-Yu Chen, Pin-Jui Hsu, Cui-Zu Chang, Joshua Robinson, Jiaqiang Yan, Xiaoqin Li, Allan H. MacDonald, Chih-Kang Shih

Abstract: Intrinsic magnetic topological insulators have emerged as a promising platform to study the interplay between topological surface states and ferromagnetism. This unique interplay can give rise to a variety of exotic quantum phenomena, including the quantum anomalous Hall effect and axion insulating states. Here, utilizing molecular beam epitaxy (MBE), we present a comprehensive study of the growth of high-quality MnBi2Te4 thin films on Si (111), epitaxial graphene, and highly ordered pyrolytic graphite substrates. By combining a suite of in-situ characterization techniques, we obtain critical insights into the atomic-level control of MnBi2Te4 epitaxial growth. First, we extract the free energy landscape for the epitaxial relationship as a function of the in-plane angular distribution. Then, by employing an optimized layer-by-layer growth, we determine the chemical potential and Dirac point of the thin film at different thicknesses. Overall, these results establish a foundation for understanding the growth dynamics of MnBi2Te4 and pave the way for the future applications of MBE in emerging topological quantum materials.

Submitted 11 September, 2023; originally announced September 2023.

Comments: 20 pages, 4 figures

arXiv:2309.05606 [pdf, ps, other]

Distribution of colours in rainbow H-free colourings

Authors: Zhuo Wu, Jun Yan

Abstract: An edge colouring of $K_n$ with $k$ colours is a Gallai $k$-colouring if it does not contain any rainbow triangle. Gyárfás, Pálvölgyi, Patkós and Wales proved that there exists a number $g(k)$ such that $n\geq g(k)$ if and only if for any colour distribution sequence $(e_1,\cdots,e_k)$ with $\sum_{i=1}^ke_i=\binom{n}{2}$, there exist a Gallai $k$-colouring of $K_n$ with $e_i$ edges having colour $i$. They also showed that $Ω(k)=g(k)=O(k^2)$ and posed the problem of determining the exact order of magnitude of $g(k)$. Feffer, Fu and Yan improved both bounds significantly by proving $Ω(k^{1.5}/\log k)=g(k)=O(k^{1.5})$. We resolve this problem by showing $g(k)=Θ(k^{1.5}/(\log k)^{0.5})$. Moreover, we generalise these definitions by considering rainbow $H$-free colourings of $K_n$ for any general graph $H$, and the natural corresponding quantity $g(H,k)$. We prove that $g(H,k)$ is finite for every $k$ if and only if $H$ is not a forest, and determine the order of $g(H,k)$ when $H$ contains a subgraph with minimum degree at least 3.

Submitted 11 September, 2023; originally announced September 2023.

Comments: 15 pages. Submitted to SIAM Journal on Discrete Mathematics

MSC Class: 05C15

arXiv:2309.05527 [pdf, other]

ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation

Authors: Bo Zhang, Xinyu Cai, Jiakang Yuan, Donglin Yang, Jianfei Guo, Xiangchao Yan, Renqiu Xia, Botian Shi, Min Dou, Tao Chen, Si Liu, Junchi Yan, Yu Qiao

Abstract: Domain shifts such as sensor type changes and geographical situation variations are prevalent in Autonomous Driving (AD), which poses a challenge since AD model relying on the previous domain knowledge can be hardly directly deployed to a new domain without additional costs. In this paper, we provide a new perspective and approach of alleviating the domain shifts, by proposing a Reconstruction-Simulation-Perception (ReSimAD) scheme. Specifically, the implicit reconstruction process is based on the knowledge from the previous old domain, aiming to convert the domain-related knowledge into domain-invariant representations, e.g., 3D scene-level meshes. Besides, the point clouds simulation process of multiple new domains is conditioned on the above reconstructed 3D meshes, where the target-domain-like simulation samples can be obtained, thus reducing the cost of collecting and annotating new-domain data for the subsequent perception process. For experiments, we consider different cross-domain situations such as Waymo-to-KITTI, Waymo-to-nuScenes, Waymo-to-ONCE, etc, to verify the zero-shot target-domain perception using ReSimAD. Results demonstrate that our method is beneficial to boost the domain generalization ability, even promising for 3D pre-training.

Submitted 25 January, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

Comments: Accepted by ICLR 2024. Code and simulated points are available at https://github.com/PJLab-ADG/3DTrans#resimad

arXiv:2309.04342 [pdf, other]

Revealing the preference for correcting separated aberrations in joint optic-image design

Authors: Jingwen Zhou, Shiqi Chen, Zheng Ren, Wenguan Zhang, Jiapu Yan, Huajun Feng, Qi Li, Yueting Chen

Abstract: The joint design of the optical system and the downstream algorithm is a challenging and promising task. Due to the demand for balancing the global optimal of imaging systems and the computational cost of physical simulation, existing methods cannot achieve efficient joint design of complex systems such as smartphones and drones. In this work, starting from the perspective of the optical design, we characterize the optics with separated aberrations. Additionally, to bridge the hardware and software without gradients, an image simulation system is presented to reproduce the genuine imaging procedure of lenses with large field-of-views. As for aberration correction, we propose a network to perceive and correct the spatially varying aberrations and validate its superiority over state-of-the-art methods. Comprehensive experiments reveal that the preference for correcting separated aberrations in joint design is as follows: longitudinal chromatic aberration, lateral chromatic aberration, spherical aberration, field curvature, and coma, with astigmatism coming last. Drawing from the preference, a 10% reduction in the total track length of the consumer-level mobile phone lens module is accomplished. Moreover, this procedure spares more space for manufacturing deviations, realizing extreme-quality enhancement of computational photography. The optimization paradigm provides innovative insight into the practical joint design of sophisticated optical systems and post-processing algorithms.

Submitted 20 November, 2023; v1 submitted 8 September, 2023; originally announced September 2023.

Comments: 19 pages

arXiv:2309.04104 [pdf]

Design of multifunctional color routers with Kerker switching using generative adversarial networks

Authors: Jiahao Yan, Dayu Zhu, Yanjun Bao, Qin Chen, Baojun Li, Wenshan Cai

Abstract: To achieve optoelectronic devices with high resolution and efficiency, there is a pressing need for optical structural units that possess an ultrasmall footprint yet exhibit strong controllability in both the frequency and spatial domains. For dielectric nanoparticles, the overlap of electric and magnetic dipole moments can scatter light completely forward or backward, which is called Kerker theory. This effect can expand to any multipoles and any directions, re-named as generalized Kerker effect, and realize controllable light manipulation at full space and full spectrum using well-designed dielectric structures. However, the complex situations of multipole couplings make it difficult to achieve structural design. Here, generative artificial intelligence (AI) is utilized to facilitate multi-objective-oriented structural design, wherein we leverage the concept of "combined spectra" that consider both spectra and direction ratios as labels. The proposed generative adversarial network (GAN) is named as DDGAN (double-discriminator GAN) which discriminates both images and spectral labels. Using trained networks, we achieve the simultaneous design for scattering color and directivities, RGB color routers, as well as narrowband light routers. Notably, all generated structures possess a footprint less than 600x600 nm indicating their potential applications in optoelectronic devices with ultrahigh resolution.

Submitted 7 September, 2023; originally announced September 2023.

arXiv:2308.16138 [pdf, other]

doi 10.1021/acs.chemmater.3c02289

Evolution of highly anisotropic magnetism in the titanium-based kagome metals LnTi$_3$Bi$_4$ (Ln: La...Gd$^{3+}$, Eu$^{2+}$, Yb$^{2+}$)

Authors: Brenden R. Ortiz, Hu Miao, David S. Parker, Fazhi Yang, German D. Samolyuk, Eleanor M. Clements, Anil Rajapitamahuni, Turgut Yilmaz, Elio Vescovo, Jiaqiang Yan, Andrew F. May, Michael A. McGuire

Abstract: Here we present the family of titanium-based kagome metals of the form LnTi$_3$Bi$_4$ (Ln: La...Gd$^{3+}$, Eu$^{2+}$, Yb$^{2+}$). Single crystal growth methods are presented alongside detailed magnetic and thermodynamic measurements. The orthorhombic (Fmmm) LnTi$_3$Bi$_4$ family of compounds exhibit slightly distorted titanium-based kagome nets interwoven with zig-zag lanthanide-based (Ln) chains. Crystals are easily exfoliated parallel to the kagome sheets and angular resolved photoemission (ARPES) measurements highlight the intricacy of the electronic structure in these compounds, with Dirac points existing at the Fermi level. The magnetic properties and the associated anisotropy emerge from the quasi-1D zig-zag chains of Ln, and impart a wide array of magnetic ground states ranging from anisotropic ferromagnetism to complex antiferromagnetism with a cascade of metamagnetic transitions. Kagome metals continue to provide a rich direction for the exploration of magnetic, topologic, and highly correlated behavior. Our work here introduces the LnTi$_3$Bi$_4$ compounds to augment the continuously expanding suite of complex and interesting kagome materials.

Submitted 6 September, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

arXiv:2308.13634 [pdf, other]

doi 10.1103/PhysRevResearch.5.043026

Helical magnetic state in the vicinity of the pressure-induced superconducting phase in MnP

Authors: S. E. Dissanayake, M. Matsuda, K. Yoshimi, S. Kasamatsu, F. Ye, S. Chi, W. Steinhardt, G. Fabbris, S. Haravifard, J. -G. Cheng, J. -Q. Yan, J. Gouchi, Y. Uwatoko

Abstract: MnP is a metal that shows successive magnetic transitions from paramagnetic to ferromagnetic and helical magnetic phases at ambient pressure with decreasing temperature. With applied pressure, the magnetic transition temperatures decrease and superconductivity appears around 8 GPa where the magnetic order is fully suppressed and the quantum critical behavior is observed. These results suggest that MnP is an unconventional superconductor in which magnetic fluctuations may be relevant to the superconducting pairing mechanism. In order to elucidate the magnetic ground state adjacent to the superconducting phase first discovered in Mn-based materials, high-pressure neutron diffraction measurements have been performed in hydrostatic pressure up to 7.5 GPa. The helical magnetic structure with the propagation vector along the $b$ axis, reported previously at 3.8 GPa, was found to be robust up to 7.5 GPa. First principles and classical Monte Carlo calculations have also been performed to understand how the pressure-driven magnetic phase transitions are coupled with change of the exchange interactions. The calculations, which qualitatively reproduce the magnetic structures as a function of pressure, suggest that the exchange interactions change drastically with applied pressure and the further-neighbor interactions become more influential at high pressures. Combining the experimental and theoretical results, we describe the detail of exchange interactions in the vicinity of the superconducting phase which is critical to understand the pairing mechanism of the unconventional superconductivity in MnP.

Submitted 25 August, 2023; originally announced August 2023.

Comments: 15 pages, 10 figures

Journal ref: Physical Review Research 5, 043026 (2023)

arXiv:2308.13241 [pdf, other]

WSTac: Interactive Surface Perception based on Whisker-Inspired and Self-Illuminated Vision-Based Tactile Sensor

Authors: Kai Chong Lei, Kit Wa Sou, Wang Sing Chan, Jiayi Yan, Siqi Ping, Dengfeng Peng, Wenbo Ding, Xiao-Ping Zhang

Abstract: Modern Visual-Based Tactile Sensors (VBTSs) use cost-effective cameras to track elastomer deformation, but struggle with ambient light interference. Solutions typically involve using internal LEDs and blocking external light, thus adding complexity. Creating a VBTS resistant to ambient light with just a camera and an elastomer remains a challenge. In this work, we introduce WStac, a self-illuminating VBTS comprising a mechanoluminescence (ML) whisker elastomer, camera, and 3D printed parts. The ML whisker elastomer, inspired by the touch sensitivity of vibrissae, offers both light isolation and high ML intensity under stress, thereby removing the necessity for additional LED modules. With the incorporation of machine learning, the sensor effectively utilizes the dynamic contact variations of 25 whiskers to successfully perform tasks like speed regression, directional identification, and texture classification. Videos are available at: https://sites.google.com/view/wstac/.

Submitted 25 August, 2023; originally announced August 2023.

arXiv:2308.13033 [pdf, other]

A Strength and Sparsity Preserving Algorithm for Generating Weighted, Directed Networks with Predetermined Assortativity

Authors: Yelie Yuan, Jun Yan, Panpan Zhang

Abstract: Degree-preserving rewiring is a widely used technique for generating unweighted networks with given assortativity, but for weighted networks, it is unclear how an analog would preserve the strengths and other critical network features such as sparsity level. This study introduces a novel approach for rewiring weighted networks to achieve desired directed assortativity. The method utilizes a mixed integer programming framework to establish a target network with predetermined assortativity coefficients, followed by an efficient rewiring algorithm termed "strength and sparsity preserving rewiring" (SSPR). SSPR retains the node strength distributions and network sparsity after rewiring. It is also possible to accommodate additional properties like edge weight distribution with extra computational cost. The optimization scheme can be used to determine feasible assortativity ranges for an initial network. The effectiveness of the proposed SSPR algorithm is demonstrated through its application to two classes of popular network models.

Submitted 24 August, 2023; originally announced August 2023.

arXiv:2308.09569 [pdf, other]

Cost-Intelligent Data Analytics in the Cloud

Authors: Huanchen Zhang, Yihao Liu, Jiaqi Yan

Abstract: For decades, database research has focused on optimizing performance under fixed resources. As more and more database applications move to the public cloud, we argue that it is time to make cost a first-class citizen when solving database optimization problems. In this paper, we introduce the concept of cost intelligence and envision the architecture of a cloud data warehouse designed for that. We investigate two critical challenges to achieving cost intelligence in an analytical system: automatic resource deployment and cost-oriented auto-tuning. We describe our system architecture with an emphasis on the components that are missing in today's cloud data warehouses. Each of these new components represents unique research opportunities in this much-needed research area.

Submitted 18 August, 2023; originally announced August 2023.

arXiv:2308.09269 [pdf, other]

doi 10.1063/5.0176310

Transition to anomalous dynamics in a simple random map

Authors: Jin Yan, Moitrish Majumdar, Stefano Ruffo, Yuzuru Sato, Christian Beck, Rainer Klages

Abstract: The famous Bernoulli shift (or dyadic transformation) is perhaps the simplest deterministic dynamical system exhibiting chaotic dynamics. It is a piecewise linear time-discrete map on the unit interval with a uniform slope larger than one, hence expanding, with a positive Lyapunov exponent and a uniform invariant density. If the slope is less than one the map becomes contracting, the Lyapunov exponent is negative, and the density trivially collapses onto a fixed point. Sampling from these two different types of maps at each time step by randomly selecting the expanding one with probability $p$, and the contracting one with probability $1-p$, gives a prototype of a random dynamical system. Here we calculate the invariant density of this simple random map, as well as its position autocorrelation function, analytically and numerically under variation of $p$. We find that the map exhibits a non-trivial transition from fully chaotic to completely regular dynamics by generating a long-time anomalous dynamics at a critical sampling probability $p_c$, defined by a zero Lyapunov exponent. This anomalous dynamics is characterised by an infinite invariant density, weak ergodicity breaking and power law correlation decay.

Submitted 26 April, 2024; v1 submitted 17 August, 2023; originally announced August 2023.

Comments: 22 pages, 6 figures

Journal ref: Chaos 34, 023128 (2024)

arXiv:2308.09242 [pdf, other]

ASAG: Building Strong One-Decoder-Layer Sparse Detectors via Adaptive Sparse Anchor Generation

Authors: Shenghao Fu, Junkai Yan, Yipeng Gao, Xiaohua Xie, Wei-Shi Zheng

Abstract: Recent sparse detectors with multiple, e.g. six, decoder layers achieve promising performance but much inference time due to complex heads. Previous works have explored using dense priors as initialization and built one-decoder-layer detectors. Although they gain remarkable acceleration, their performance still lags behind their six-decoder-layer counterparts by a large margin. In this work, we aim to bridge this performance gap while retaining fast speed. We find that the architecture discrepancy between dense and sparse detectors leads to feature conflict, hampering the performance of one-decoder-layer detectors. Thus we propose Adaptive Sparse Anchor Generator (ASAG) which predicts dynamic anchors on patches rather than grids in a sparse way so that it alleviates the feature conflict problem. For each image, ASAG dynamically selects which feature maps and which locations to predict, forming a fully adaptive way to generate image-specific anchors. Further, a simple and effective Query Weighting method eases the training instability from adaptiveness. Extensive experiments show that our method outperforms dense-initialized ones and achieves a better speed-accuracy trade-off. The code is available at https://github.com/iSEE-Laboratory/ASAG.

Submitted 17 August, 2023; originally announced August 2023.

Comments: Accepted to ICCV 2023

arXiv:2308.04744 [pdf, other]

doi 10.1038/s41467-024-50062-0

Wavelength-tunable high-fidelity entangled photon sources enabled by dual Stark effects

Authors: Chen Chen, Jun-Yong Yan, Hans-Georg Babin, Jiefei Wang, Xingqi Xu, Xing Lin, Qianqian Yu, Wei Fang, Run-Ze Liu, Yong-Heng Huo, Han Cai, Wei E. I. Sha, Jiaxiang Zhang, Christian Heyn, Andreas D. Wieck, Arne Ludwig, Da-Wei Wang, Chao-Yuan Jin, Feng Liu

Abstract: The construction of a large-scale quantum internet requires quantum repeaters containing multiple entangled photon sources with identical wavelengths. Semiconductor quantum dots can generate entangled photon pairs deterministically with high fidelity. However, realizing wavelength-matched quantum-dot entangled photon sources faces two difficulties: the non-uniformity of emission wavelength and exciton fine-structure splitting induced fidelity reduction. Typically, these two factors are not independently tunable, making it challenging to achieve simultaneous improvement. In this work, we demonstrate wavelength-tunable entangled photon sources based on droplet-etched GaAs quantum dots through the combined use of AC and quantum-confined Stark effects. The emission wavelength can be tuned by ~1 meV while preserving an entanglement fidelity f exceeding 0.955(1) in the entire tuning range. Based on this hybrid tuning scheme, we finally demonstrate multiple wavelength-matched entangled photon sources with f>0.919(3), paving a way towards robust and scalable on-demand entangled photon sources for quantum internet and integrated quantum optical circuits.

Submitted 21 April, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

Comments: Main text: 11 pages, 6 figures, Supplementary information: 7 pages, 6 figures, 2 tables

Showing 201–250 of 1,497 results for author: Yan, J