-
LLM4Drive: A Survey of Large Language Models for Autonomous Driving
Authors:
Zhenjie Yang,
Xiaosong Jia,
Hongyang Li,
Junchi Yan
Abstract:
Autonomous driving technology, a catalyst for revolutionizing transportation and urban mobility, is tending to transition from rule-based systems to data-driven strategies. Traditional module-based systems are constrained by cumulative errors among cascaded modules and inflexible pre-set rules. In contrast, end-to-end autonomous driving systems have the potential to avoid error accumulation due to their fully data-driven training process, although they often lack transparency due to their "black box" nature, complicating the validation and traceability of decisions. Recently, large language models (LLMs) have demonstrated abilities including understanding context, logical reasoning, and generating answers. A natural thought is to utilize these abilities to empower autonomous driving. Combining LLMs with foundation vision models could open the door to open-world understanding, reasoning, and few-shot learning, which current autonomous driving systems lack. In this paper, we systematically review a research line about \textit{Large Language Models for Autonomous Driving (LLM4AD)}. This study evaluates the current state of technological advancements, distinctly outlining the principal challenges and prospective directions for the field. For the convenience of researchers in academia and industry, we provide real-time updates on the latest advances in the field as well as relevant open-source resources via the designated link: https://github.com/Thinklab-SJTU/Awesome-LLM4AD.
Submitted 29 December, 2023; v1 submitted 2 November, 2023;
originally announced November 2023.
-
Minimum Snap Trajectory Generation and Control for an Under-actuated Flapping Wing Aerial Vehicle
Authors:
Chen Qian,
Rui Chen,
Peiyao Shen,
Yongchun Fang,
Jifu Yan,
Tiefeng Li
Abstract:
This paper presents both the trajectory generation and tracking control strategies for an underactuated flapping wing aerial vehicle (FWAV). First, the FWAV dynamics are analyzed from a practical perspective. Then, based on these analyses, we demonstrate the differential flatness of the FWAV system and develop a general-purpose trajectory generation strategy. Subsequently, the trajectory tracking controller is developed with the help of robust control and switch control techniques. After that, the asymptotic stability of the overall system is guaranteed by Lyapunov stability analysis. To make the controller applicable in real flight, we also provide several instructions. Finally, a series of experimental results demonstrates the successful implementation of the proposed trajectory generation and tracking control strategies. This work is the first to achieve closed-loop integration of trajectory generation and control for real 3-dimensional flight of an underactuated FWAV at a practical level.
Submitted 2 November, 2023;
originally announced November 2023.
-
Accelerated Data-Driven Discovery and Screening of Two-Dimensional Magnets Using Graph Neural Networks
Authors:
Ahmed Elrashidy,
James Della-Giustina,
Jia-An Yan
Abstract:
In this study, we employ Graph Neural Networks (GNNs) to accelerate the discovery of novel 2D magnetic materials which have transformative potential in spintronics applications. Using data from the Materials Project database and the Computational 2D materials database (C2DB), we train three GNN architectures on a dataset of 1190 magnetic monolayers with energy above the convex hull ($E_{\text{hull}}$) less than 0.3 eV/atom. Our Crystal Diffusion Variational Auto Encoder (CDVAE) generates 11,100 candidate crystals. Subsequent training on two Atomistic Line Graph Neural Networks (ALIGNN) achieves a 93$\%$ accuracy in predicting magnetic monolayers and a mean average error of 0.039 eV/atom for $E_{\text{hull}}$ predictions. After narrowing down candidates based on magnetic likelihood and predicted energy, constraining the atom count in the monolayers to five or fewer, and performing dimensionality checks, we identify 190 candidates. These are validated using Density-Functional Theory (DFT) to confirm their magnetic and energetic favorability resulting in 167 magnetic monolayers with $E_{\text{hull}} < 0.3$ eV/atom and a total magnetization of $\geq$ $0.5 μ_{B}$. Our methodology offers a way to accelerate exploring and predicting potential 2D magnetic materials, contributing to the ongoing computational and experimental efforts aimed at the discovery of new 2D magnets.
Submitted 5 February, 2024; v1 submitted 1 November, 2023;
originally announced November 2023.
-
On the Opportunities of Green Computing: A Survey
Authors:
You Zhou,
Xiujing Lin,
Xiang Zhang,
Maolin Wang,
Gangwei Jiang,
Huakang Lu,
Yupeng Wu,
Kai Zhang,
Zhe Yang,
Kehang Wang,
Yongduo Sui,
Fengwei Jia,
Zuoli Tang,
Yao Zhao,
Hongxuan Zhang,
Tiannuo Yang,
Weibo Chen,
Yunong Mao,
Yi Li,
De Bao,
Yu Li,
Hongrui Liao,
Ting Liu,
Jingwen Liu,
Jinchi Guo
, et al. (16 additional authors not shown)
Abstract:
Artificial Intelligence (AI) has achieved significant advancements in technology and research over several decades of development, and is widely used in many areas including computer vision, natural language processing, time-series analysis, speech synthesis, etc. During the age of deep learning, especially with the rise of Large Language Models, a large majority of researchers' attention is paid to pursuing new state-of-the-art (SOTA) results, resulting in ever-increasing model size and computational complexity. The need for high computing power brings higher carbon emissions and undermines research fairness by preventing small or medium-sized research institutions and companies with limited funding from participating in research. To tackle the challenges of computing resources and the environmental impact of AI, Green Computing has become a hot research topic. In this survey, we give a systematic overview of the technologies used in Green Computing. We propose a framework for Green Computing and divide it into four key components: (1) Measures of Greenness, (2) Energy-Efficient AI, (3) Energy-Efficient Computing Systems and (4) AI Use Cases for Sustainability. For each component, we discuss the research progress made and the commonly used techniques to optimize AI efficiency. We conclude that this new research direction has the potential to address the conflicts between resource constraints and AI development. We encourage more researchers to pay attention to this direction and make AI more environmentally friendly.
Submitted 8 November, 2023; v1 submitted 1 November, 2023;
originally announced November 2023.
-
Degree sequences of triangular multigraphs
Authors:
John Talbot,
Jun Yan
Abstract:
A simple graph is triangular if every edge is contained in a triangle. A sequence of integers is graphical if it is the degree sequence of a simple graph. Egan and Nikolayevsky recently conjectured that every graphical sequence whose terms are all at least 4 is the degree sequence of a triangular simple graph, and proved this in some special cases. In this paper we state and prove the analogous version of this conjecture for multigraphs.
Submitted 31 October, 2023;
originally announced November 2023.
-
Experimental Evidence for Non-spherical Magnetic Form Factor in Ru$^{3+}$
Authors:
Colin L. Sarkis,
John W. Villanova,
Casey Eichstaedt,
Adolfo G. Eguiluz,
Jaime A. Fernandez-Baca,
Masaaki Matsuda,
Jiaqiang Yan,
Christian Balz,
Arnab Banerjee,
D. Alan Tennant,
Tom Berlijn,
Stephen E. Nagler
Abstract:
The Mott insulator $α$-RuCl$_3$ has generated great interest in the community due to its possible field-induced Kitaev quantum spin liquid state. Despite enormous effort spent trying to obtain the form of the low energy Hamiltonian, there is currently no agreed upon set of parameters which is able to explain all of the data. A key piece of missing information lies in the determination of the magnetic form factor of Ru$^{3+}$, particularly for a true quantitative treatment of inelastic neutron scattering data. Here we present the experimentally derived magnetic form factor of Ru$^{3+}$ in the low spin 4$d^5$ state using polarized neutron diffraction within the paramagnetic regime on high quality single crystals of $α$-RuCl$_3$. We observe strong evidence of an anisotropic form factor, expected of the spin-orbit coupled $j_{\textrm{eff}} = \frac{1}{2}$ ground state. We model the static magnetization density in increasing complexity from simple isotropic cases, to a multipolar expansion, and finally \emph{ab initio} calculations of the generalized $j_{\textrm{eff}} = \frac{1}{2}$ ground state. Comparison of both single ion models and inclusion of Cl$^-$ anions support the presence of hybridization of Ru$^{3+}$ with the surrounding Cl$^{-}$ ligands.
Submitted 31 October, 2023;
originally announced November 2023.
-
Ramsey numbers of bounded degree trees versus general graphs
Authors:
Richard Montgomery,
Matías Pavez-Signé,
Jun Yan
Abstract:
For every $k\ge 2$ and $Δ$, we prove that there exists a constant $C_{Δ,k}$ such that the following holds. For every graph $H$ with $χ(H)=k$ and every tree $T$ with at least $C_{Δ,k}|H|$ vertices and maximum degree at most $Δ$, the Ramsey number $R(T,H)$ is $(k-1)(|T|-1)+σ(H)$, where $σ(H)$ is the size of a smallest colour class across all proper $k$-colourings of $H$. This is tight up to the value of $C_{Δ,k}$, and confirms a conjecture of Balla, Pokrovskiy, and Sudakov.
Submitted 31 October, 2023;
originally announced October 2023.
-
Magnetic Stability, Fermi Surface Topology, and Spin-Correlated Dielectric Response in Monolayer 1T-CrTe2
Authors:
Ahmed Elrashidy,
Jia-An Yan
Abstract:
We have carried out density-functional theory (DFT) calculations to study the magnetic stability of both ferromagnetic (FM) and anti-ferromagnetic (AFM) states in monolayer 1T-CrTe2. Our results show that the AFM order is lower in energy and thus is the ground state. By tuning the lattice parameters, the AFM order can transition to the FM order, in good agreement with experimental observation. We observe a commensurate spin density wave (SDW) alongside the previously predicted charge density wave (CDW), and attribute the AFM order to the SDW. This results in distinct hole and electron Fermi pockets and a pronounced optical anisotropy, suggesting quasi-one-dimensional behavior in this material.
Submitted 30 October, 2023;
originally announced October 2023.
-
Dynamics of Instruction Tuning: Each Ability of Large Language Models Has Its Own Growth Pace
Authors:
Chiyu Song,
Zhanchao Zhou,
Jianhao Yan,
Yuejiao Fei,
Zhenzhong Lan,
Yue Zhang
Abstract:
Instruction tuning is a burgeoning method to elicit the general intelligence of Large Language Models (LLMs). However, the creation of instruction data is still largely heuristic, leading to significant variation in quantity and quality across existing datasets. While some research advocates for expanding the number of instructions, others suggest that a small set of well-chosen examples is adequate. To better understand data construction guidelines, our research provides a granular analysis of how data volume, parameter size, and data construction methods influence the development of each underlying ability of LLMs, such as creative writing, code generation, and logical reasoning. We present a meticulously curated dataset with over 40k instances across ten abilities and examine instruction-tuned models with 7B to 33B parameters. Our study reveals three primary findings: (i) Although the models' overall performance is tied to data and parameter scale, individual abilities have different sensitivities to these factors. (ii) Human-curated data strongly outperforms synthetic data from GPT-4 in efficiency and can consistently enhance model performance as volume increases, which is unachievable with synthetic data. (iii) Instruction data brings powerful cross-ability generalization, as evidenced by out-of-domain evaluations. Furthermore, we demonstrate how these findings can guide more efficient data construction, leading to practical performance improvements on two public benchmarks.
Submitted 22 February, 2024; v1 submitted 30 October, 2023;
originally announced October 2023.
-
Multiwavelength observation of 1A 0535+262=HD 245770 from 2010 to 2021
Authors:
Wei Liu,
Jingzhi Yan,
Guangcheng Xiao,
Xiukun Li,
Bo Gao,
Qingzhong Liu
Abstract:
Context. 1A 0535+262 is a high-mass X-ray binary that went into a giant X-ray outburst in 2020. During this event, the X-ray luminosity reached the highest value measured over the last 30 years. Aims. Our aim is to study the long-term variability of 1A 0535+262 before and after the 2020 major X-ray outburst and to uncover the mechanism that led to the X-ray outburst. Methods. We used the long-term photometric light curve and the equivalent widths of the H$α$ and He I $λ$6678 lines to monitor the state of the Be star's circumstellar disk. The H$α$ line profiles show evidence for V/R variability, which we revealed by fitting the H$α$ spectral line profiles with two Gaussian functions. In addition, we divided our data into four periods according to the intensity of the X-ray, optical, and infrared emission. Results. The H$α$ line profiles show single-peaked profiles in most cases. This is consistent with the previously reported orbital inclination of $i = 37^{\circ} \pm 2^{\circ}$. Unlike the H$α$ lines, the He I $λ$6678 lines show a maximal intensity in October 2020, which is one month before the giant X-ray outburst in 2020. Based on the behavior of the equivalent widths of the H$α$ and He I $λ$6678 lines, and the $V$-band magnitude, we find two mass ejection processes from the Be star to the Be disk on MJD 55820 and MJD 56600. The V/R quasi-period is about two years during 2011-2015, which is different from 1994 to 1995. Furthermore, the periods I$\to$II$\to$III$\to$IV in the $(B-V)$ color index versus $V$-band magnitude diagram constitute a cycle. From the behavior of the V/R ratio of H$α$ lines, and the variability of the $V$ band, we believe that the precession of the density perturbation inside the disk is retrograde.
Submitted 30 October, 2023;
originally announced October 2023.
-
M3C: A Framework towards Convergent, Flexible, and Unsupervised Learning of Mixture Graph Matching and Clustering
Authors:
Jiaxin Lu,
Zetian Jiang,
Tianzhe Wang,
Junchi Yan
Abstract:
Existing graph matching methods typically assume that there are similar structures between graphs and they are matchable. However, these assumptions do not align with real-world applications. This work addresses a more realistic scenario where graphs exhibit diverse modes, requiring graph grouping before or along with matching, a task termed mixture graph matching and clustering. We introduce Minorize-Maximization Matching and Clustering (M3C), a learning-free algorithm that guarantees theoretical convergence through the Minorize-Maximization framework and offers enhanced flexibility via relaxed clustering. Building on M3C, we develop UM3C, an unsupervised model that incorporates novel edge-wise affinity learning and pseudo label selection. Extensive experimental results on public benchmarks demonstrate that our method outperforms state-of-the-art graph matching and mixture graph matching and clustering approaches in both accuracy and efficiency. Source code will be made publicly available.
Submitted 27 October, 2023;
originally announced October 2023.
-
Does or did the supernova remnant Cassiopeia A operate as a PeVatron?
Authors:
Zhen Cao,
F. Aharonian,
Q. An,
Axikegu,
Y. X. Bai,
Y. W. Bao,
D. Bastieri,
X. J. Bi,
Y. J. Bi,
J. T. Cai,
Q. Cao,
W. Y. Cao,
Zhe Cao,
J. Chang,
J. F. Chang,
A. M. Chen,
E. S. Chen,
Liang Chen,
Lin Chen,
Long Chen,
M. J. Chen,
M. L. Chen,
Q. H. Chen,
S. H. Chen,
S. Z. Chen
, et al. (255 additional authors not shown)
Abstract:
For decades, supernova remnants (SNRs) have been considered the prime sources of Galactic Cosmic rays (CRs). But whether SNRs can accelerate CR protons to PeV energies and thus dominate CR flux up to the knee is currently under intensive theoretical and phenomenological debate. The direct test of the ability of SNRs to operate as CR PeVatrons can be provided by ultrahigh-energy (UHE; $E_γ\geq 100$ TeV) $γ$-rays. In this context, the historical SNR Cassiopeia A (Cas A) is considered one of the most promising targets for UHE observations. This paper presents the observation of Cas A and its vicinity by the LHAASO KM2A detector. The exceptional sensitivity of LHAASO KM2A in the UHE band, combined with the young age of Cas A, enabled us to derive stringent model-independent limits on the energy budget of UHE protons and nuclei accelerated by Cas A at any epoch after the explosion. The results challenge the prevailing paradigm that Cas A-type SNRs are major suppliers of PeV CRs in the Milky Way.
Submitted 25 October, 2023;
originally announced October 2023.
-
Inertia of partial transpose of positive semidefinite matrices
Authors:
Yixuan Liang,
Jiahao Yan,
Dongran Si,
Lin Chen
Abstract:
We show that the partial transposes of $9\times 9$ positive semidefinite matrices do not have inertia (4,1,4) or (3,2,4). This solves an open problem in "LINEAR AND MULTILINEAR ALGEBRA, Changchun Feng et al, 2022". We apply our results to construct some inertias, and present the list of all possible inertias of partial transposes of $12\times 12$ positive semidefinite matrices.
Submitted 25 October, 2023;
originally announced October 2023.
-
An introduction to radar Automatic Target Recognition (ATR) technology in ground-based radar systems
Authors:
Jiangkun Gong,
Jun Yan,
Deyong Kong,
Deren Li
Abstract:
This paper presents a brief examination of Automatic Target Recognition (ATR) technology within ground-based radar systems. It offers a clear explanation of the ATR concept, delves into its historical milestones, and categorizes ATR methods according to different scattering regions. By incorporating ATR solutions into radar systems, this study demonstrates the expansion of radar detection ranges and the enhancement of tracking capabilities, leading to superior situational awareness. Drawing insights from the Russo-Ukrainian War, the paper highlights three pressing radar applications that urgently need ATR technology: detecting stealth aircraft, countering small drones, and implementing anti-jamming measures. Anticipating the next wave of radar ATR research, the study predicts a surge in cognitive radar and machine learning (ML)-driven algorithms. These emerging methodologies aim to confront challenges associated with system adaptation, real-time recognition, and environmental adaptability. Ultimately, ATR stands poised to revolutionize conventional radar systems, ushering in an era of 4D sensing capabilities.
Submitted 23 October, 2023;
originally announced October 2023.
-
Time periodic and almost periodic viscosity solutions of contact Hamilton-Jacobi equations on $\mathbb{T}^n$
Authors:
Kaizhi Wang,
Jun Yan,
Kai Zhao
Abstract:
This paper is concerned with the time periodic viscosity solution problem for a class of evolutionary contact Hamilton-Jacobi equations with time independent Hamiltonians on the torus $\mathbb{T}^n$. Under certain suitable assumptions we show that the equation has a non-trivial $T$-periodic viscosity solution if and only if $T\in D$, where $D$ is a dense subset of $[0,+\infty)$. Moreover, we clarify the structure of $D$. As a consequence, we also study the existence of Bohr almost periodic viscosity solutions.
Submitted 20 October, 2023;
originally announced October 2023.
-
A New Gauge-Theoretic Construction of 4-Dimensional Hyperkähler ALE Spaces
Authors:
Jiajun Yan
Abstract:
Non-compact hyperkähler spaces arise frequently in gauge theory. The 4-dimensional hyperkähler ALE spaces are a special class of non-compact hyperkähler spaces. They are in one-to-one correspondence with the finite subgroups of SU(2) and have interesting connections with representation theory and singularity theory, captured by the McKay Correspondence.
The 4-dimensional hyperkähler ALE spaces were first classified by Peter Kronheimer via a finite-dimensional hyperkähler reduction. In this paper, we give a new gauge-theoretic construction of these spaces. More specifically, we realize each 4-dimensional hyperkähler ALE space as a moduli space of solutions to a system of equations for a pair consisting of a connection and a section of a vector bundle over an orbifold Riemann surface, modulo a gauge group action. The construction given in this paper parallels Kronheimer's original construction and hence can also be thought of as a gauge-theoretic interpretation of Kronheimer's construction of these spaces.
Submitted 16 October, 2023;
originally announced October 2023.
-
Battle of the Large Language Models: Dolly vs LLaMA vs Vicuna vs Guanaco vs Bard vs ChatGPT -- A Text-to-SQL Parsing Comparison
Authors:
Shuo Sun,
Yuchen Zhang,
Jiahuan Yan,
Yuze Gao,
Donovan Ong,
Bin Chen,
Jian Su
Abstract:
The success of ChatGPT has ignited an AI race, with researchers striving to develop new large language models (LLMs) that can match or surpass the language understanding and generation abilities of commercial ones. In recent times, a number of models have emerged, claiming performance near that of GPT-3.5 or GPT-4 through various instruction-tuning methods. As practitioners of Text-to-SQL parsing, we are grateful for their valuable contributions to open-source research. However, it is important to approach these claims with a sense of scrutiny and ascertain the actual effectiveness of these models. Therefore, we pit six popular large language models against each other, systematically evaluating their Text-to-SQL parsing capability on nine benchmark datasets with five different prompting strategies, covering both zero-shot and few-shot scenarios. Regrettably, the open-sourced models fell significantly short of the performance achieved by closed-source models like GPT-3.5, highlighting the need for further work to bridge the performance gap between these models.
Submitted 16 October, 2023;
originally announced October 2023.
-
TpopT: Efficient Trainable Template Optimization on Low-Dimensional Manifolds
Authors:
Jingkai Yan,
Shiyu Wang,
Xinyu Rain Wei,
Jimmy Wang,
Zsuzsanna Márka,
Szabolcs Márka,
John Wright
Abstract:
In scientific and engineering scenarios, a recurring task is the detection of low-dimensional families of signals or patterns. A classic family of approaches, exemplified by template matching, aims to cover the search space with a dense template bank. While simple and highly interpretable, it suffers from poor computational efficiency due to unfavorable scaling in the signal space dimensionality. In this work, we study TpopT (TemPlate OPTimization) as an alternative scalable framework for detecting low-dimensional families of signals which maintains high interpretability. We provide a theoretical analysis of the convergence of Riemannian gradient descent for TpopT, and prove that it has a superior dimension scaling to covering. We also propose a practical TpopT framework for nonparametric signal sets, which incorporates techniques of embedding and kernel interpolation, and is further configurable into a trainable network architecture by unrolled optimization. The proposed trainable TpopT exhibits significantly improved efficiency-accuracy tradeoffs for gravitational wave detection, where matched filtering is currently a method of choice. We further illustrate the general applicability of this approach with experiments on handwritten digit data.
Submitted 15 October, 2023;
originally announced October 2023.
-
Very high energy gamma-ray emission beyond 10 TeV from GRB 221009A
Authors:
Zhen Cao,
F. Aharonian,
Q. An,
A. Axikegu,
Y. X. Bai,
Y. W. Bao,
D. Bastieri,
X. J. Bi,
Y. J. Bi,
J. T. Cai,
Q. Cao,
W. Y. Cao,
Zhe Cao,
J. Chang,
J. F. Chang,
A. M. Chen,
E. S. Chen,
Liang Chen,
Lin Chen,
Long Chen,
M. J. Chen,
M. L. Chen,
Q. H. Chen,
S. H. Chen,
S. Z. Chen
, et al. (255 additional authors not shown)
Abstract:
The highest energy gamma-rays from gamma-ray bursts (GRBs) have important implications for their radiation mechanism. Here we report for the first time the detection of gamma-rays up to 13 TeV from the brightest GRB 221009A by the Large High Altitude Air-shower Observatory (LHAASO). The LHAASO-KM2A detector registered more than 140 gamma-rays with energies above 3 TeV during 230$-$900 s after the trigger. The intrinsic energy spectrum of gamma-rays can be described by a power law after correcting for extragalactic background light (EBL) absorption. Such a hard spectrum challenges the synchrotron self-Compton (SSC) scenario of relativistic electrons for the afterglow emission above several TeV. Observations of gamma-rays up to 13 TeV from a source with a measured redshift of z=0.151 hint at more transparency in intergalactic space than previously expected. Alternatively, one may invoke new physics such as Lorentz Invariance Violation (LIV) or an axion origin of very high energy (VHE) signals.
Submitted 22 November, 2023; v1 submitted 13 October, 2023;
originally announced October 2023.
-
Going Beyond Neural Network Feature Similarity: The Network Feature Complexity and Its Interpretation Using Category Theory
Authors:
Yiting Chen,
Zhanpeng Zhou,
Junchi Yan
Abstract:
The behavior of neural networks remains opaque, and a recently widely noted phenomenon is that networks often achieve similar performance when initialized with different random parameters. This phenomenon has attracted significant attention in measuring the similarity between features learned by distinct networks. However, feature similarity could be vague in describing the same feature since equivalent features hardly exist. In this paper, we expand the concept of equivalent features and provide the definition of what we call functionally equivalent features. These features produce equivalent output under certain transformations. Using this definition, we aim to derive a more intrinsic metric for the so-called feature complexity regarding the redundancy of features learned by a neural network at each layer. We offer a formal interpretation of our approach through the lens of category theory, a well-developed area in mathematics. To quantify the feature complexity, we further propose an efficient algorithm named Iterative Feature Merging (IFM). Our experimental results validate our ideas and theories from various perspectives. We empirically demonstrate that functional equivalence widely exists among different features learned by the same neural network and that we could reduce the number of parameters of the network without affecting the performance. IFM shows great potential as a data-agnostic model pruning method. We have also drawn several interesting empirical findings regarding the defined feature complexity.
Submitted 26 November, 2023; v1 submitted 10 October, 2023;
originally announced October 2023.
-
On the Evaluation and Refinement of Vision-Language Instruction Tuning Datasets
Authors:
Ning Liao,
Shaofeng Zhang,
Renqiu Xia,
Min Cao,
Yu Qiao,
Junchi Yan
Abstract:
There is an emerging line of research on multimodal instruction tuning, and a line of benchmarks has recently been proposed for evaluating these models. Instead of evaluating the models directly, in this paper, we try to evaluate the Vision-Language Instruction-Tuning (VLIT) datasets. We also seek a way of building a dataset for developing an all-powerful VLIT model, which we believe could also be of utility for establishing a grounded protocol for benchmarking VLIT models. Since effective evaluation of VLIT datasets remains an open question, we propose a tune-cross-evaluation paradigm: tuning on one dataset and evaluating on the others in turn. For each single tune-evaluation experiment set, we define the Meta Quality (MQ) as the mean score obtained by a set of caption metrics including BLEU, METEOR, and ROUGE-L to quantify the quality of a certain dataset or a sample. On this basis, to evaluate the comprehensiveness of a dataset, we develop the Dataset Quality (DQ) covering all tune-evaluation sets. To lay the foundation for building a comprehensive dataset and developing an all-powerful model for practical applications, we define the Sample Quality (SQ) to quantify the all-sided quality of each sample. Extensive experiments validate the rationality of the proposed evaluation paradigm. Based on the holistic evaluation, we build a new dataset, REVO-LION (REfining VisiOn-Language InstructiOn tuNing), by collecting samples with higher SQ from each dataset. Remarkably, even with only half of the complete data, the model trained on REVO-LION can achieve performance comparable to simply adding all VLIT datasets up. Furthermore, REVO-LION not only facilitates the development of a powerful model but also incorporates an evaluation set, which is designed to serve as a convenient benchmark for future research in the field.
Submitted 29 December, 2023; v1 submitted 10 October, 2023;
originally announced October 2023.
-
Advective Diffusion Transformers for Topological Generalization in Graph Learning
Authors:
Qitian Wu,
Chenxiao Yang,
Kaipeng Zeng,
Fan Nie,
Michael Bronstein,
Junchi Yan
Abstract:
Graph diffusion equations are intimately related to graph neural networks (GNNs) and have recently attracted attention as a principled framework for analyzing GNN dynamics, formalizing their expressive power, and justifying architectural choices. One key open question in graph learning is the generalization capability of GNNs. A major limitation of current approaches hinges on the assumption that the graph topologies in the training and test sets come from the same distribution. In this paper, we make steps towards understanding the generalization of GNNs by exploring how graph diffusion equations extrapolate and generalize in the presence of varying graph topologies. We first show deficiencies in the generalization capability of existing models built upon local diffusion on graphs, stemming from the exponential sensitivity to topology variation. Our subsequent analysis reveals the promise of non-local diffusion, which advocates for feature propagation over fully-connected latent graphs, under the assumption of a specific data-generating condition. In addition to these findings, we propose a novel graph encoder backbone, Advective Diffusion Transformer (ADiT), inspired by advective graph diffusion equations that have a closed-form solution backed up with theoretical guarantees of desired generalization under topological distribution shifts. The new model, functioning as a versatile graph Transformer, demonstrates superior performance across a wide range of graph learning tasks.
Submitted 10 October, 2023;
originally announced October 2023.
-
Two stage Robust Nash Bargaining based Energy Trading between Hydrogen-enriched Gas and Active Distribution Networks
Authors:
Wenwen Zhang,
Gao Qiu,
Hongjun Gao,
Tingjian Liu,
Junyong Liu,
Yaping Li,
Shengchun Yang,
Jiahao Yan,
Wenbo Mao
Abstract:
Integration of emerging hydrogen-enriched compressed natural gas (HCNG) distribution network with active distribution network (ADN) provides huge latent flexibility on consuming renewable energies. However, paucity of energy trading mechanism risks the stable earnings of the flexibility for both entities, especially when rising highly-efficient solid oxide fuel cells (SOFCs) are pioneered to interface gas and electricity. To fill the gap, a two-stage robust Nash bargaining strategy is proposed. In the first stage, a privacy-preserved Nash Bargaining based on the ADMM is applied to clear energy trading between the two autonomous entities, i.e., ADN and gas distribution network (GDN). Via robust dispatch of configured energy storage in ADN, the next stage de-risks ADN profit collapse from transaction biases, caused by forecasting errors of distributed energy resources. C&CG is finally utilized to loop the two stages. The convergence of the entire energy trading strategy is theoretically proved. As such, sustainable returns from the integration of ADN and GDN bridged by SOFC and HCNG are facilitated. Numerical studies indicate that the proposed cooperative strategy reaps a stable social welfare of nearly 1.6% to total cost, and benefit-steady situations for both ADN and GDN, even in the worst case.
Submitted 22 May, 2024; v1 submitted 9 October, 2023;
originally announced October 2023.
-
How Graph Neural Networks Learn: Lessons from Training Dynamics
Authors:
Chenxiao Yang,
Qitian Wu,
David Wipf,
Ruoyu Sun,
Junchi Yan
Abstract:
A long-standing goal in deep learning has been to characterize the learning behavior of black-box models in a more interpretable manner. For graph neural networks (GNNs), considerable advances have been made in formalizing what functions they can represent, but whether GNNs will learn desired functions during the optimization process remains less clear. To fill this gap, we study their training dynamics in function space. In particular, we find that the gradient descent optimization of GNNs implicitly leverages the graph structure to update the learned function, as can be quantified by a phenomenon which we call \emph{kernel-graph alignment}. We provide theoretical explanations for the emergence of this phenomenon in the overparameterized regime and empirically validate it on real-world GNNs. This finding offers new interpretable insights into when and why the learned GNN functions generalize, highlighting their limitations in heterophilic graphs. Practically, we propose a parameter-free algorithm that directly uses a sparse matrix (i.e. graph adjacency) to update the learned function. We demonstrate that this embarrassingly simple approach can be as effective as GNNs while being orders-of-magnitude faster.
Submitted 18 June, 2024; v1 submitted 8 October, 2023;
originally announced October 2023.
-
Anisotropy of thermal conductivity oscillations in relation to the Kitaev spin liquid phase
Authors:
Heda Zhang,
Hu Miao,
Thomas Z Ward,
David G Mandrus,
Stephen E Nagler,
Michael A McGuire,
Jiaqiang Yan
Abstract:
In the presence of an external magnetic field, the Kitaev model can host either gapped topological anyons or gapless Majorana fermions. In $α$-RuCl$_3$, the gapped and gapless cases are only separated by a thirty-degree rotation of the in-plane magnetic field vector. The presence/absence of the spectral gap is key for understanding the thermal transport behavior in $α$-RuCl$_3$. Here, we study the anisotropy of the oscillatory features of thermal conductivity in $α$-RuCl$_3$. We examine the oscillatory features of thermal conductivities (k//a, k//b) with fixed external fields and find distinct behavior for the gapped (B//a) and gapless (B//b) scenarios. Furthermore, we track the evolution of thermal resistivity ($λ_{a}$) and its oscillatory features with the rotation of in-plane magnetic fields from B//b to B//a. The thermal resistivity $λ(B,θ)$ displays distinct rotational symmetries before and after the emergence of the field-induced Kitaev spin liquid phase. These experimental data suggest close correlations between the oscillatory features of thermal conductivity, the underlying Kitaev spin liquid phase and the fermionic excitations it hosts.
Submitted 5 October, 2023;
originally announced October 2023.
-
Structure transition and zigzag magnetic order in Ir/Rh-substituted honeycomb lattice RuCl3
Authors:
Zachary Morgan,
Iris Ye,
Colin L. Sarkis,
Xiaoping Wang,
Stephen Nagler,
Jiaqiang Yan
Abstract:
We report magnetization and neutron diffraction studies on crystal and magnetic structures of Ir- and Rh-substituted honeycomb lattice $α$-RuCl$_3$. The iridium or rhodium atoms are distributed at the Ru site with little structural modification. Both systems undergo a room-temperature monoclinic $C2/m$ to low-temperature trigonal $R\bar{3}$ phase transformation with a large recoverable hysteresis. At low temperature, a zigzag spin order is observed with the same characteristic wavevector $(0,0.5,1)$ as in the parent $α$-RuCl$_3$. Detailed magnetic structure refinement reveals an ordered moment of $\rm 0.32(5) μ_B/Ru$ and an upper boundary of canting angle of $15(4)^\circ$ away from the basal plane at 5~K for the 10\% Ir-substituted $α$-RuCl$_3$, which is different from the 0.45-0.73~$\rm μ_B/Ru$ and $32^\circ$-$48^\circ$ canting angle reported in the parent compound $α$-RuCl$_3$. The observation of an unchanged RuCl$_6$ local octahedral environment, reduced magnetic moment size and canting angle highlights the potential to study quantum spin liquid behavior through non-magnetic ion doping.
Submitted 4 October, 2023;
originally announced October 2023.
-
Understanding In-Context Learning from Repetitions
Authors:
Jianhao Yan,
Jin Xu,
Chiyu Song,
Chenming Wu,
Yafu Li,
Yue Zhang
Abstract:
This paper explores the elusive mechanism underpinning in-context learning in Large Language Models (LLMs). Our work provides a novel perspective by examining in-context learning via the lens of surface repetitions. We quantitatively investigate the role of surface features in text generation, and empirically establish the existence of \emph{token co-occurrence reinforcement}, a principle that strengthens the relationship between two tokens based on their contextual co-occurrences. By investigating the dual impacts of these features, our research illuminates the internal workings of in-context learning and expounds on the reasons for its failures. This paper provides an essential contribution to the understanding of in-context learning and its potential limitations, providing a fresh perspective on this exciting capability.
Submitted 21 February, 2024; v1 submitted 30 September, 2023;
originally announced October 2023.
-
Pressure-induced superconductivity in polycrystalline La3Ni2O7
Authors:
Gang Wang,
Ningning Wang,
Jun Hou,
Liang Ma,
Lifen Shi,
Zhian Ren,
Yadong Gu,
Xiaoling Shen,
Hanming Ma,
Pengtao Yang,
Ziyi Liu,
Haizhong Guo,
Jianping Sun,
Guangming Zhang,
Jiaqiang Yan,
Bosen Wang,
Yoshiya Uwatoko,
Jinguang Cheng
Abstract:
We synthesized polycrystalline La3Ni2O7 samples by using the sol-gel method without post-annealing under high oxygen pressure, and then measured temperature-dependent resistivity under various hydrostatic pressures up to 14.5 GPa in a cubic anvil cell apparatus. We find that the density-wave-like anomaly in resistivity is progressively suppressed with increasing pressure and the resistivity drop corresponding to the onset of superconductivity emerges at pressures as low as 7 GPa. Zero resistivity is achieved at 9 GPa below 6.6 K, which increases quickly with pressure to 35.6 K at 14.5 GPa. The observation of a zero-resistance state in the polycrystalline La3Ni2O7 samples under high pressures not only corroborates the recent report of superconductivity in the pressurized La3Ni2O7 crystals but also facilitates further studies on this emerging family of nickelate high-Tc superconductors.
Submitted 3 October, 2023; v1 submitted 29 September, 2023;
originally announced September 2023.
-
The Blessings of Multiple Treatments and Outcomes in Treatment Effect Estimation
Authors:
Yong Wu,
Mingzhou Liu,
Jing Yan,
Yanwei Fu,
Shouyan Wang,
Yizhou Wang,
Xinwei Sun
Abstract:
Assessing causal effects in the presence of unobserved confounding is a challenging problem. Existing studies leveraged proxy variables or multiple treatments to adjust for the confounding bias. In particular, the latter approach attributes the impact on a single outcome to multiple treatments, allowing estimation of latent variables for confounding control. Nevertheless, these methods primarily focus on a single outcome, whereas in many real-world scenarios, there is greater interest in studying the effects on multiple outcomes. Besides, these outcomes are often coupled with multiple treatments. Examples include the intensive care unit (ICU), where health providers evaluate the effectiveness of therapies on multiple health indicators. To accommodate these scenarios, we consider a new setting dubbed multiple treatments and multiple outcomes. We then show that parallel studies of multiple outcomes involved in this setting can assist each other in causal identification, in the sense that we can exploit other treatments and outcomes as proxies for each treatment effect under study. We proceed with a causal discovery method that can effectively identify such proxies for causal estimation. The utility of our method is demonstrated in synthetic data and sepsis disease.
Submitted 14 October, 2023; v1 submitted 29 September, 2023;
originally announced September 2023.
-
Formation Wing-Beat Modulation (FWM): A Tool for Quantifying Bird Flocks Using Radar Micro-Doppler Signals
Authors:
Jiangkun Gong,
Jun Yan,
Deyong Kong,
Ruizhi Chen,
Deren Li
Abstract:
Radar echoes from bird flocks contain modulation signals, which we find are produced by the flapping gaits of birds in the flock, resulting in a group of spectral peaks with similar amplitudes spaced at a specific interval. We call this the formation wing-beat modulation (FWM) effect. FWM signals are micro-Doppler modulated by flapping wings and are related to the bird number, wing-beat frequency, and flight phasing strategy. Our X-band radar data show that FWM signals exist in radar signals of a seagull flock, providing tools for quantifying the bird number and estimating the mean wingbeat rate of birds. This new finding could aid in research on the quantification of bird migration numbers and estimation of bird flight behavior in radar ornithology and aero-ecology.
Submitted 27 September, 2023;
originally announced September 2023.
-
StructChart: Perception, Structuring, Reasoning for Visual Chart Understanding
Authors:
Renqiu Xia,
Bo Zhang,
Haoyang Peng,
Hancheng Ye,
Xiangchao Yan,
Peng Ye,
Botian Shi,
Yu Qiao,
Junchi Yan
Abstract:
Charts are common in literature across different scientific fields, conveying rich information easily accessible to readers. Current chart-related tasks focus on either chart perception, which refers to extracting information from the visual charts, or performing reasoning given the extracted data, e.g. in a tabular form. In this paper, we aim to establish a unified and label-efficient learning paradigm for joint perception and reasoning tasks, which can be generally applicable to different downstream tasks, beyond the question-answering task as specifically studied in peer works. Specifically, StructChart first reformulates the chart information from the popular tabular form (specifically linearized CSV) to the proposed Structured Triplet Representations (STR), which is more friendly for reducing the task gap between chart perception and reasoning due to the employed structured information extraction for charts. We then propose a Structuring Chart-oriented Representation Metric (SCRM) to quantitatively evaluate the performance for the chart perception task. To enrich the dataset for training, we further explore the possibility of leveraging the Large Language Model (LLM), enhancing the chart diversity in terms of both chart visual style and its statistical information. Extensive experiments are conducted on various chart-related tasks, demonstrating the effectiveness and promising potential for a unified chart perception-reasoning paradigm to push the frontier of chart understanding.
Submitted 18 February, 2024; v1 submitted 20 September, 2023;
originally announced September 2023.
-
Making Small Language Models Better Multi-task Learners with Mixture-of-Task-Adapters
Authors:
Yukang Xie,
Chengyu Wang,
Junbing Yan,
Jiyong Zhou,
Feiqi Deng,
Jun Huang
Abstract:
Recently, Large Language Models (LLMs) have achieved amazing zero-shot learning performance over a variety of Natural Language Processing (NLP) tasks, especially for text generative tasks. Yet, the large size of LLMs often leads to the high computational cost of model training and online deployment. In our work, we present ALTER, a system that effectively builds the multi-tAsk Learners with mixTure-of-task-adaptERs upon small language models (with <1B parameters) to address multiple NLP tasks simultaneously, capturing the commonalities and differences between tasks, in order to support domain-specific applications. Specifically, in ALTER, we propose the Mixture-of-Task-Adapters (MTA) module as an extension to the transformer architecture for the underlying model to capture the intra-task and inter-task knowledge. A two-stage training method is further proposed to optimize the collaboration between adapters at a small computational cost. Experimental results over a mixture of NLP tasks show that our proposed MTA architecture and the two-stage training method achieve good performance. Based on ALTER, we have also produced MTA-equipped language models for various domains.
Submitted 19 September, 2023;
originally announced September 2023.
-
SPOT: Scalable 3D Pre-training via Occupancy Prediction for Learning Transferable 3D Representations
Authors:
Xiangchao Yan,
Runjian Chen,
Bo Zhang,
Hancheng Ye,
Renqiu Xia,
Jiakang Yuan,
Hongbin Zhou,
Xinyu Cai,
Botian Shi,
Wenqi Shao,
Ping Luo,
Yu Qiao,
Tao Chen,
Junchi Yan
Abstract:
Annotating 3D LiDAR point clouds for perception tasks is fundamental for many applications, e.g., autonomous driving, yet it remains notoriously labor-intensive. The pretraining-finetuning approach can alleviate the labeling burden by fine-tuning a pre-trained backbone across various downstream datasets as well as tasks. In this paper, we propose SPOT, namely Scalable Pre-training via Occupancy prediction for learning Transferable 3D representations under such a label-efficient fine-tuning paradigm. SPOT achieves effectiveness on various public datasets with different downstream tasks, showcasing its general representation power, cross-domain robustness and data scalability, which are three key factors for real-world application. Specifically, we both theoretically and empirically show, for the first time, that general representation learning can be achieved through the task of occupancy prediction. Then, to address the domain gap caused by different LiDAR sensors and annotation methods, we develop a beam re-sampling technique for point cloud augmentation combined with a class-balancing strategy. Furthermore, scalable pre-training is observed, that is, the downstream performance across all the experiments gets better with more pre-training data. Additionally, such a pre-training strategy also remains compatible with unlabeled data. The hope is that our findings will facilitate the understanding of LiDAR points and pave the way for future advancements in LiDAR pre-training.
△ Less
Submitted 25 July, 2024; v1 submitted 19 September, 2023;
originally announced September 2023.
-
Extrinsic nonlinear Kerr rotation in topological materials under a magnetic field
Authors:
Shuang Wu,
Zaiyao Fei,
Zeyuan Sun,
Yangfan Yi,
Wei Xia,
Dayu Yan,
Yanfeng Guo,
Youguo Shi,
Jiaqiang Yan,
David H. Cobden,
Wei-Tao Liu,
Xiaodong Xu,
Shiwei Wu
Abstract:
Topological properties in quantum materials are often governed by symmetry and tuned by crystal structure and external fields, and hence symmetry-sensitive nonlinear optical measurements in a magnetic field are a valuable probe. Here we report nonlinear magneto-optical second harmonic generation (SHG) studies of non-magnetic topological materials including bilayer WTe2, monolayer WSe2 and bulk TaAs. The polarization-resolved patterns of optical SHG under magnetic field show nonlinear Kerr rotation in these time-reversal symmetric materials. For materials with three-fold rotational symmetric lattice structure, the SHG polarization pattern rotates just slightly in a magnetic field, whereas in those with mirror or two-fold rotational symmetry the SHG polarization pattern rotates greatly and distorts. These different magneto-SHG characters can be understood by considering the superposition of the magnetic field-induced time-noninvariant nonlinear optical tensor and the crystal-structure-based time-invariant counterpart. The situation is further clarified by scrutinizing the Faraday rotation, whose subtle interplay with crystal symmetry accounts for the diverse behavior of the extrinsic nonlinear Kerr rotation in different materials. Our work illustrates the application of magneto-SHG techniques to directly probe nontrivial topological properties, and underlines the importance of minimizing extrinsic nonlinear Kerr rotation in polarization-resolved magneto-optical studies.
Submitted 18 September, 2023;
originally announced September 2023.
-
GCL: Gradient-Guided Contrastive Learning for Medical Image Segmentation with Multi-Perspective Meta Labels
Authors:
Yixuan Wu,
Jintai Chen,
Jiahuan Yan,
Yiheng Zhu,
Danny Z. Chen,
Jian Wu
Abstract:
Since annotating medical images for segmentation tasks commonly incurs expensive costs, it is highly desirable to design an annotation-efficient method to alleviate the annotation burden. Recently, contrastive learning has exhibited great potential in learning robust representations to boost downstream tasks with limited labels. In medical imaging scenarios, ready-made meta labels (i.e., specific attribute information of medical images) inherently reveal semantic relationships among images, which have been used to define positive pairs in previous work. However, the multi-perspective semantics revealed by various meta labels are usually incompatible and can incur intractable "semantic contradiction" when combining different meta labels. In this paper, we tackle the issue of "semantic contradiction" in a gradient-guided manner using our proposed Gradient Mitigator method, which systematically unifies multi-perspective meta labels to enable a pre-trained model to attain a better high-level semantic recognition ability. Moreover, we emphasize that the fine-grained discrimination ability is vital for segmentation-oriented pre-training, and develop a novel method called Gradient Filter to dynamically screen pixel pairs with the most discriminating power based on the magnitude of gradients. Comprehensive experiments on four medical image segmentation datasets verify that our new method GCL: (1) learns informative image representations and considerably boosts segmentation performance with limited labels, and (2) shows promising generalizability on out-of-distribution datasets.
Submitted 16 September, 2023;
originally announced September 2023.
-
A New Adaptive Phase-locked Loop for Synchronization of a Grid-Connected Voltage Source Converter: Simulation and Experimental Results
Authors:
Wei He,
Jiachen Yan,
Romeo Ortega,
Daniele Zonetti,
Wangping Zhou
Abstract:
In [1] a new adaptive phase-locked loop scheme for synchronization of a grid-connected voltage source converter with guaranteed (almost) global stability properties was reported. To guarantee suitable synchronization with the angle of the three-phase grid voltage, we design an adaptive observer for this signal that requires measurements only at the point of common coupling. An interesting feature of this scheme is its ability to synchronize in the challenging condition of connection to a grid with a reduced short-circuit ratio. In this paper we present simulation and experimental results illustrating the excellent performance of the proposed solution.
Submitted 30 October, 2023; v1 submitted 15 September, 2023;
originally announced September 2023.
-
Nucleus-aware Self-supervised Pretraining Using Unpaired Image-to-image Translation for Histopathology Images
Authors:
Zhiyun Song,
Penghui Du,
Junpeng Yan,
Kailu Li,
Jianzhong Shou,
Maode Lai,
Yubo Fan,
Yan Xu
Abstract:
Self-supervised pretraining attempts to enhance model performance by obtaining effective features from unlabeled data, and has demonstrated its effectiveness in the field of histopathology images. Despite its success, few works concentrate on the extraction of nucleus-level information, which is essential for pathologic analysis. In this work, we propose a novel nucleus-aware self-supervised pretraining framework for histopathology images. The framework aims to capture the nuclear morphology and distribution information through unpaired image-to-image translation between histopathology images and pseudo mask images. The generation process is modulated by both conditional and stochastic style representations, ensuring the realism and diversity of the generated histopathology images for pretraining. Further, an instance segmentation guided strategy is employed to capture instance-level information. Experiments on 7 datasets show that the proposed pretraining method outperforms supervised ones on Kather classification, multiple instance learning, and 5 dense-prediction tasks with the transfer learning protocol, and yields better results than other self-supervised approaches on 8 semi-supervised tasks. Our project is publicly available at https://github.com/zhiyuns/UNITPathSSL.
Submitted 13 September, 2023;
originally announced September 2023.
-
Atomistic Control in Molecular Beam Epitaxy Growth of Intrinsic Magnetic Topological Insulator MnBi2Te4
Authors:
Hyunsue Kim,
Mengke Liu,
Lisa Frammolino,
Yanxing Li,
Fan Zhang,
Woojoo Lee,
Chengye Dong,
Yi-Fan Zhao,
Guan-Yu Chen,
Pin-Jui Hsu,
Cui-Zu Chang,
Joshua Robinson,
Jiaqiang Yan,
Xiaoqin Li,
Allan H. MacDonald,
Chih-Kang Shih
Abstract:
Intrinsic magnetic topological insulators have emerged as a promising platform to study the interplay between topological surface states and ferromagnetism. This unique interplay can give rise to a variety of exotic quantum phenomena, including the quantum anomalous Hall effect and axion insulating states. Here, utilizing molecular beam epitaxy (MBE), we present a comprehensive study of the growth of high-quality MnBi2Te4 thin films on Si (111), epitaxial graphene, and highly ordered pyrolytic graphite substrates. By combining a suite of in-situ characterization techniques, we obtain critical insights into the atomic-level control of MnBi2Te4 epitaxial growth. First, we extract the free energy landscape for the epitaxial relationship as a function of the in-plane angular distribution. Then, by employing an optimized layer-by-layer growth, we determine the chemical potential and Dirac point of the thin film at different thicknesses. Overall, these results establish a foundation for understanding the growth dynamics of MnBi2Te4 and pave the way for the future applications of MBE in emerging topological quantum materials.
Submitted 11 September, 2023;
originally announced September 2023.
-
Distribution of colours in rainbow H-free colourings
Authors:
Zhuo Wu,
Jun Yan
Abstract:
An edge colouring of $K_n$ with $k$ colours is a Gallai $k$-colouring if it does not contain any rainbow triangle. Gyárfás, Pálvölgyi, Patkós and Wales proved that there exists a number $g(k)$ such that $n\geq g(k)$ if and only if for any colour distribution sequence $(e_1,\cdots,e_k)$ with $\sum_{i=1}^ke_i=\binom{n}{2}$, there exists a Gallai $k$-colouring of $K_n$ with $e_i$ edges having colour $i$. They also showed that $\Omega(k)=g(k)=O(k^2)$ and posed the problem of determining the exact order of magnitude of $g(k)$. Feffer, Fu and Yan improved both bounds significantly by proving $\Omega(k^{1.5}/\log k)=g(k)=O(k^{1.5})$. We resolve this problem by showing $g(k)=\Theta(k^{1.5}/(\log k)^{0.5})$.
Moreover, we generalise these definitions by considering rainbow $H$-free colourings of $K_n$ for any general graph $H$, and the natural corresponding quantity $g(H,k)$. We prove that $g(H,k)$ is finite for every $k$ if and only if $H$ is not a forest, and determine the order of $g(H,k)$ when $H$ contains a subgraph with minimum degree at least 3.
Submitted 11 September, 2023;
originally announced September 2023.
-
ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation
Authors:
Bo Zhang,
Xinyu Cai,
Jiakang Yuan,
Donglin Yang,
Jianfei Guo,
Xiangchao Yan,
Renqiu Xia,
Botian Shi,
Min Dou,
Tao Chen,
Si Liu,
Junchi Yan,
Yu Qiao
Abstract:
Domain shifts such as sensor type changes and geographical situation variations are prevalent in Autonomous Driving (AD), which poses a challenge since an AD model relying on knowledge of the previous domain can hardly be deployed directly to a new domain without additional costs. In this paper, we provide a new perspective and approach for alleviating domain shifts by proposing a Reconstruction-Simulation-Perception (ReSimAD) scheme. Specifically, the implicit reconstruction process is based on knowledge from the previous (old) domain, aiming to convert the domain-related knowledge into domain-invariant representations, e.g., 3D scene-level meshes. In addition, the point cloud simulation process for multiple new domains is conditioned on the reconstructed 3D meshes, from which target-domain-like simulation samples can be obtained, thus reducing the cost of collecting and annotating new-domain data for the subsequent perception process. For experiments, we consider different cross-domain situations such as Waymo-to-KITTI, Waymo-to-nuScenes, Waymo-to-ONCE, etc., to verify the zero-shot target-domain perception using ReSimAD. Results demonstrate that our method is beneficial for boosting domain generalization ability and is even promising for 3D pre-training.
Submitted 25 January, 2024; v1 submitted 11 September, 2023;
originally announced September 2023.
-
Revealing the preference for correcting separated aberrations in joint optic-image design
Authors:
Jingwen Zhou,
Shiqi Chen,
Zheng Ren,
Wenguan Zhang,
Jiapu Yan,
Huajun Feng,
Qi Li,
Yueting Chen
Abstract:
The joint design of the optical system and the downstream algorithm is a challenging and promising task. Due to the demand for balancing the global optimum of imaging systems and the computational cost of physical simulation, existing methods cannot achieve efficient joint design of complex systems such as smartphones and drones. In this work, starting from the perspective of optical design, we characterize the optics with separated aberrations. Additionally, to bridge the hardware and software without gradients, an image simulation system is presented to reproduce the genuine imaging procedure of lenses with large fields of view. As for aberration correction, we propose a network to perceive and correct the spatially varying aberrations and validate its superiority over state-of-the-art methods. Comprehensive experiments reveal that the preference for correcting separated aberrations in joint design is as follows: longitudinal chromatic aberration, lateral chromatic aberration, spherical aberration, field curvature, and coma, with astigmatism coming last. Drawing on this preference, a 10% reduction in the total track length of a consumer-level mobile phone lens module is accomplished. Moreover, this procedure spares more space for manufacturing deviations, realizing extreme-quality enhancement of computational photography. The optimization paradigm provides innovative insight into the practical joint design of sophisticated optical systems and post-processing algorithms.
Submitted 20 November, 2023; v1 submitted 8 September, 2023;
originally announced September 2023.
-
Design of multifunctional color routers with Kerker switching using generative adversarial networks
Authors:
Jiahao Yan,
Dayu Zhu,
Yanjun Bao,
Qin Chen,
Baojun Li,
Wenshan Cai
Abstract:
To achieve optoelectronic devices with high resolution and efficiency, there is a pressing need for optical structural units that possess an ultrasmall footprint yet exhibit strong controllability in both the frequency and spatial domains. For dielectric nanoparticles, the overlap of electric and magnetic dipole moments can scatter light completely forward or backward, which is described by Kerker theory. This effect can be extended to arbitrary multipoles and directions, where it is termed the generalized Kerker effect, and can realize controllable light manipulation over the full space and full spectrum using well-designed dielectric structures. However, the complexity of multipole couplings makes structural design difficult. Here, generative artificial intelligence (AI) is utilized to facilitate multi-objective-oriented structural design, wherein we leverage the concept of "combined spectra", which considers both spectra and direction ratios as labels. The proposed generative adversarial network (GAN) is named DDGAN (double-discriminator GAN) and discriminates both images and spectral labels. Using trained networks, we achieve the simultaneous design of scattering color and directivities, RGB color routers, as well as narrowband light routers. Notably, all generated structures possess a footprint of less than 600x600 nm, indicating their potential applications in optoelectronic devices with ultrahigh resolution.
Submitted 7 September, 2023;
originally announced September 2023.
-
Evolution of highly anisotropic magnetism in the titanium-based kagome metals LnTi$_3$Bi$_4$ (Ln: La...Gd$^{3+}$, Eu$^{2+}$, Yb$^{2+}$)
Authors:
Brenden R. Ortiz,
Hu Miao,
David S. Parker,
Fazhi Yang,
German D. Samolyuk,
Eleanor M. Clements,
Anil Rajapitamahuni,
Turgut Yilmaz,
Elio Vescovo,
Jiaqiang Yan,
Andrew F. May,
Michael A. McGuire
Abstract:
Here we present the family of titanium-based kagome metals of the form LnTi$_3$Bi$_4$ (Ln: La...Gd$^{3+}$, Eu$^{2+}$, Yb$^{2+}$). Single crystal growth methods are presented alongside detailed magnetic and thermodynamic measurements. The orthorhombic (Fmmm) LnTi$_3$Bi$_4$ family of compounds exhibits slightly distorted titanium-based kagome nets interwoven with zig-zag lanthanide-based (Ln) chains. Crystals are easily exfoliated parallel to the kagome sheets, and angle-resolved photoemission (ARPES) measurements highlight the intricacy of the electronic structure in these compounds, with Dirac points existing at the Fermi level. The magnetic properties and the associated anisotropy emerge from the quasi-1D zig-zag chains of Ln, and impart a wide array of magnetic ground states ranging from anisotropic ferromagnetism to complex antiferromagnetism with a cascade of metamagnetic transitions. Kagome metals continue to provide a rich direction for the exploration of magnetic, topological, and highly correlated behavior. Our work here introduces the LnTi$_3$Bi$_4$ compounds to augment the continuously expanding suite of complex and interesting kagome materials.
Submitted 6 September, 2023; v1 submitted 30 August, 2023;
originally announced August 2023.
-
Helical magnetic state in the vicinity of the pressure-induced superconducting phase in MnP
Authors:
S. E. Dissanayake,
M. Matsuda,
K. Yoshimi,
S. Kasamatsu,
F. Ye,
S. Chi,
W. Steinhardt,
G. Fabbris,
S. Haravifard,
J. -G. Cheng,
J. -Q. Yan,
J. Gouchi,
Y. Uwatoko
Abstract:
MnP is a metal that shows successive magnetic transitions from paramagnetic to ferromagnetic and helical magnetic phases at ambient pressure with decreasing temperature. With applied pressure, the magnetic transition temperatures decrease and superconductivity appears around 8 GPa, where the magnetic order is fully suppressed and quantum critical behavior is observed. These results suggest that MnP is an unconventional superconductor in which magnetic fluctuations may be relevant to the superconducting pairing mechanism. In order to elucidate the magnetic ground state adjacent to the superconducting phase first discovered in Mn-based materials, high-pressure neutron diffraction measurements have been performed under hydrostatic pressure up to 7.5 GPa. The helical magnetic structure with the propagation vector along the $b$ axis, reported previously at 3.8 GPa, was found to be robust up to 7.5 GPa. First-principles and classical Monte Carlo calculations have also been performed to understand how the pressure-driven magnetic phase transitions are coupled with changes in the exchange interactions. The calculations, which qualitatively reproduce the magnetic structures as a function of pressure, suggest that the exchange interactions change drastically with applied pressure and the further-neighbor interactions become more influential at high pressures. Combining the experimental and theoretical results, we describe in detail the exchange interactions in the vicinity of the superconducting phase, which is critical to understanding the pairing mechanism of the unconventional superconductivity in MnP.
Submitted 25 August, 2023;
originally announced August 2023.
-
WSTac: Interactive Surface Perception based on Whisker-Inspired and Self-Illuminated Vision-Based Tactile Sensor
Authors:
Kai Chong Lei,
Kit Wa Sou,
Wang Sing Chan,
Jiayi Yan,
Siqi Ping,
Dengfeng Peng,
Wenbo Ding,
Xiao-Ping Zhang
Abstract:
Modern Visual-Based Tactile Sensors (VBTSs) use cost-effective cameras to track elastomer deformation, but struggle with ambient light interference. Solutions typically involve using internal LEDs and blocking external light, thus adding complexity. Creating a VBTS resistant to ambient light with just a camera and an elastomer remains a challenge. In this work, we introduce WSTac, a self-illuminating VBTS comprising a mechanoluminescence (ML) whisker elastomer, camera, and 3D printed parts. The ML whisker elastomer, inspired by the touch sensitivity of vibrissae, offers both light isolation and high ML intensity under stress, thereby removing the necessity for additional LED modules. With the incorporation of machine learning, the sensor effectively utilizes the dynamic contact variations of 25 whiskers to successfully perform tasks like speed regression, directional identification, and texture classification. Videos are available at: https://sites.google.com/view/wstac/.
Submitted 25 August, 2023;
originally announced August 2023.
-
A Strength and Sparsity Preserving Algorithm for Generating Weighted, Directed Networks with Predetermined Assortativity
Authors:
Yelie Yuan,
Jun Yan,
Panpan Zhang
Abstract:
Degree-preserving rewiring is a widely used technique for generating unweighted networks with given assortativity, but for weighted networks, it is unclear how an analog would preserve the strengths and other critical network features such as sparsity level. This study introduces a novel approach for rewiring weighted networks to achieve desired directed assortativity. The method utilizes a mixed integer programming framework to establish a target network with predetermined assortativity coefficients, followed by an efficient rewiring algorithm termed "strength and sparsity preserving rewiring" (SSPR). SSPR retains the node strength distributions and network sparsity after rewiring. It is also possible to accommodate additional properties like edge weight distribution with extra computational cost. The optimization scheme can be used to determine feasible assortativity ranges for an initial network. The effectiveness of the proposed SSPR algorithm is demonstrated through its application to two classes of popular network models.
Submitted 24 August, 2023;
originally announced August 2023.
-
Cost-Intelligent Data Analytics in the Cloud
Authors:
Huanchen Zhang,
Yihao Liu,
Jiaqi Yan
Abstract:
For decades, database research has focused on optimizing performance under fixed resources. As more and more database applications move to the public cloud, we argue that it is time to make cost a first-class citizen when solving database optimization problems. In this paper, we introduce the concept of cost intelligence and envision the architecture of a cloud data warehouse designed for that. We investigate two critical challenges to achieving cost intelligence in an analytical system: automatic resource deployment and cost-oriented auto-tuning. We describe our system architecture with an emphasis on the components that are missing in today's cloud data warehouses. Each of these new components represents unique research opportunities in this much-needed research area.
Submitted 18 August, 2023;
originally announced August 2023.
-
Transition to anomalous dynamics in a simple random map
Authors:
Jin Yan,
Moitrish Majumdar,
Stefano Ruffo,
Yuzuru Sato,
Christian Beck,
Rainer Klages
Abstract:
The famous Bernoulli shift (or dyadic transformation) is perhaps the simplest deterministic dynamical system exhibiting chaotic dynamics. It is a piecewise linear time-discrete map on the unit interval with a uniform slope larger than one, hence expanding, with a positive Lyapunov exponent and a uniform invariant density. If the slope is less than one the map becomes contracting, the Lyapunov exponent is negative, and the density trivially collapses onto a fixed point. Sampling from these two different types of maps at each time step by randomly selecting the expanding one with probability $p$, and the contracting one with probability $1-p$, gives a prototype of a random dynamical system. Here we calculate the invariant density of this simple random map, as well as its position autocorrelation function, analytically and numerically under variation of $p$. We find that the map exhibits a non-trivial transition from fully chaotic to completely regular dynamics by generating a long-time anomalous dynamics at a critical sampling probability $p_c$, defined by a zero Lyapunov exponent. This anomalous dynamics is characterised by an infinite invariant density, weak ergodicity breaking and power law correlation decay.
Submitted 26 April, 2024; v1 submitted 17 August, 2023;
originally announced August 2023.
-
ASAG: Building Strong One-Decoder-Layer Sparse Detectors via Adaptive Sparse Anchor Generation
Authors:
Shenghao Fu,
Junkai Yan,
Yipeng Gao,
Xiaohua Xie,
Wei-Shi Zheng
Abstract:
Recent sparse detectors with multiple (e.g., six) decoder layers achieve promising performance but incur long inference time due to complex heads. Previous works have explored using dense priors as initialization and built one-decoder-layer detectors. Although they gain remarkable acceleration, their performance still lags behind their six-decoder-layer counterparts by a large margin. In this work, we aim to bridge this performance gap while retaining fast speed. We find that the architecture discrepancy between dense and sparse detectors leads to feature conflict, hampering the performance of one-decoder-layer detectors. Thus we propose Adaptive Sparse Anchor Generator (ASAG), which predicts dynamic anchors on patches rather than grids in a sparse way so that it alleviates the feature conflict problem. For each image, ASAG dynamically selects which feature maps and which locations to predict, forming a fully adaptive way to generate image-specific anchors. Further, a simple and effective Query Weighting method eases the training instability caused by adaptiveness. Extensive experiments show that our method outperforms dense-initialized ones and achieves a better speed-accuracy trade-off. The code is available at https://github.com/iSEE-Laboratory/ASAG.
Submitted 17 August, 2023;
originally announced August 2023.
-
Wavelength-tunable high-fidelity entangled photon sources enabled by dual Stark effects
Authors:
Chen Chen,
Jun-Yong Yan,
Hans-Georg Babin,
Jiefei Wang,
Xingqi Xu,
Xing Lin,
Qianqian Yu,
Wei Fang,
Run-Ze Liu,
Yong-Heng Huo,
Han Cai,
Wei E. I. Sha,
Jiaxiang Zhang,
Christian Heyn,
Andreas D. Wieck,
Arne Ludwig,
Da-Wei Wang,
Chao-Yuan Jin,
Feng Liu
Abstract:
The construction of a large-scale quantum internet requires quantum repeaters containing multiple entangled photon sources with identical wavelengths. Semiconductor quantum dots can generate entangled photon pairs deterministically with high fidelity. However, realizing wavelength-matched quantum-dot entangled photon sources faces two difficulties: the non-uniformity of emission wavelength and exciton fine-structure splitting induced fidelity reduction. Typically, these two factors are not independently tunable, making it challenging to achieve simultaneous improvement. In this work, we demonstrate wavelength-tunable entangled photon sources based on droplet-etched GaAs quantum dots through the combined use of AC and quantum-confined Stark effects. The emission wavelength can be tuned by ~1 meV while preserving an entanglement fidelity f exceeding 0.955(1) in the entire tuning range. Based on this hybrid tuning scheme, we finally demonstrate multiple wavelength-matched entangled photon sources with f>0.919(3), paving a way towards robust and scalable on-demand entangled photon sources for quantum internet and integrated quantum optical circuits.
Submitted 21 April, 2024; v1 submitted 9 August, 2023;
originally announced August 2023.