Search | arXiv e-print repository

Strongly nice property and Schur positivity of graphs

Authors: Ethan Y. H. Li, Grace M. X. Li, Arthur L. B. Yang, Zhong-Xue Zhang

Abstract: Motivated by the notion of nice graphs, we introduce the concept of strongly nice property, which can be used to study the Schur positivity of symmetric functions. We show that a graph and all its induced subgraphs are strongly nice if and only if it is claw-free, which strengthens a result of Stanley and provides further evidence for the well-known conjecture on the Schur positivity of claw-free graphs. As another application, we solve Wang and Wang's conjecture on the non-Schur positivity of squid graphs $Sq(2n-1;1^n)$ for $n \ge 3$ by proving that these graphs are not strongly nice.

Submitted 27 August, 2024; originally announced August 2024.

Comments: 12 pages, 4 figures

MSC Class: 05E05; 06A07

arXiv:2408.13127 [pdf, ps, other]

Stanley's conjecture on the Schur positivity of distributive lattices

Authors: Grace M. X. Li, Dun Qiu, Arthur L. B. Yang, Zhong-Xue Zhang

Abstract: In this paper we solve an open problem on distributive lattices, which was proposed by Stanley in 1998. This problem was motivated by a conjecture due to Griggs, which equivalently states that the incomparability graph of the boolean algebra $B_n$ is nice. Stanley introduced the idea of studying the nice property of a graph by investigating the Schur positivity of its corresponding chromatic symmetric functions. Since the boolean algebras form a special class of distributive lattices, Stanley raised the question of whether the incomparability graph of any distributive lattice is Schur positive. Stanley further noted that this seems quite unlikely. In this paper, we construct a family of distributive lattices which are not nice and hence not Schur positive. We also provide a family of distributive lattices which are nice but not Schur positive.

Submitted 23 August, 2024; originally announced August 2024.

arXiv:2408.12796 [pdf, other]

Real-Time Posture Monitoring and Risk Assessment for Manual Lifting Tasks Using MediaPipe and LSTM

Authors: Ereena Bagga, Ang Yang

Abstract: This research focuses on developing a real-time posture monitoring and risk assessment system for manual lifting tasks using advanced AI and computer vision technologies. Musculoskeletal disorders (MSDs) are a significant concern for workers involved in manual lifting, and traditional methods for posture correction are often inadequate due to delayed feedback and lack of personalized assessment. Our proposed solution integrates AI-driven posture detection, detailed keypoint analysis, risk level determination, and real-time feedback delivered through a user-friendly web interface. The system aims to improve posture, reduce the risk of MSDs, and enhance user engagement. The research involves comprehensive data collection, model training, and iterative development to ensure high accuracy and user satisfaction. The solution's effectiveness is evaluated against existing methodologies, demonstrating significant improvements in real-time feedback and risk assessment. This study contributes to the field by offering a novel approach to posture correction that addresses existing gaps and provides practical, immediate benefits to users.

Submitted 22 August, 2024; originally announced August 2024.

Comments: Proceedings of the 1st International Workshop on Multimedia Computing for Health and Medicine at ACM MM'24

arXiv:2408.08239 [pdf, other]

Strong Data Processing Inequalities and their Applications to Reliable Computation

Authors: Andrew K. Yang

Abstract: In 1952, von Neumann gave a series of groundbreaking lectures that proved it was possible for circuits consisting of 3-input majority gates that have a sufficiently small independent probability $δ> 0$ of malfunctioning to reliably compute Boolean functions. In 1999, Evans and Schulman used a strong data-processing inequality (SDPI) to establish the tightest known necessary condition $δ< \frac{1}{2} - \frac{1}{2\sqrt{k}}$ for reliable computation when the circuit consists of components that have at most $k$ inputs. In 2017, Polyanskiy and Wu distilled Evans and Schulman's SDPI argument to establish a general result on the contraction of mutual information in Bayesian networks. In this essay, we will first introduce the problem of reliable computation from unreliable components and establish the existence of noise thresholds. We will then provide an exposition of von Neumann's result with 3-input majority gates and extend it to minority gates. We will then provide an introduction to SDPIs, which have many applications, including in statistical mechanics, portfolio theory, and lower bounds on statistical estimation under privacy constraints. We will then use the introduced material to provide an exposition of Polyanskiy and Wu's 2017 result on Bayesian networks, from which the 1999 result of Evans-Schulman follows.

Submitted 15 August, 2024; originally announced August 2024.

arXiv:2408.07273 [pdf]

doi 10.1093/mnras/stae1910

SCOTCH Search for Clandestine Optically Thick Compact HII regions II

Authors: A. L. Patel, J. S. Urquhart, A. Y. Yang, T. Moore, M. A. Thompson, K. M. Menten, T. Csengeri

Abstract: In this study we present 18 to 24 GHz and high angular resolution radio wavelength Australia Telescope Compact Array follow-up observations towards a sample of 39 HC HII region candidates. These objects, taken from a sample hosting 6.7 GHz methanol masers, were chosen due to the compact and optically thick nature of their continuum emission. We have detected 27 compact radio sources and constructed their spectral energy distributions over the 5 to 24 GHz range to determine the young HII regions' physical properties, i.e., diameter, electron density $n_e$, emission measure, Lyman continuum flux $N_{Ly}$ and turnover frequency. The flux measurements are fitted for 20 objects assuming an ionisation-bounded HII region with uniform density model. For the remaining 7 objects that lack constraints spanning both their optically thick and thin regimes, we utilise relations from the literature to determine their physical properties. Comparing these determined parameters with those of known hypercompact and ultracompact HII regions, we have identified 13 HC HII regions, 6 intermediate objects that fall between HC HII and UC HII regions, 6 UC HII regions and one radio jet candidate, which increases the known population of HC HII regions by 50 per cent. All the young and compact HII regions are embedded in dusty and dense clumps and 80 per cent of the HC HII regions identified in this work are associated with various maser species. Four of our radio sources remain optically thick at 24 GHz; we consider these to be amongst the youngest HC HII regions.

Submitted 13 August, 2024; originally announced August 2024.

arXiv:2408.01112 [pdf, other]

Agentic LLM Workflows for Generating Patient-Friendly Medical Reports

Authors: Malavikha Sudarshan, Sophie Shih, Estella Yee, Alina Yang, John Zou, Cathy Chen, Quan Zhou, Leon Chen, Chinmay Singhal, George Shih

Abstract: The application of Large Language Models (LLMs) in healthcare is expanding rapidly, with one potential use case being the translation of formal medical reports into patient-legible equivalents. Currently, LLM outputs often need to be edited and evaluated by a human to ensure both factual accuracy and comprehensibility, and this is true for the above use case. We aim to minimize this step by proposing an agentic workflow with the Reflexion framework, which uses iterative self-reflection to correct outputs from an LLM. This pipeline was tested and compared to zero-shot prompting on 16 randomized radiology reports. In our multi-agent approach, reports had an accuracy rate of 94.94% when looking at verification of ICD-10 codes, compared to zero-shot prompted reports, which had an accuracy rate of 68.23%. Additionally, 81.25% of the final reflected reports required no corrections for accuracy or readability, while only 25% of zero-shot prompted reports met these criteria without needing modifications. These results indicate that our approach presents a feasible method for communicating clinical findings to patients in a quick, efficient and coherent manner whilst also retaining medical accuracy. The codebase is available for viewing at http://github.com/malavikhasudarshan/Multi-Agent-Patient-Letter-Generation.

Submitted 5 August, 2024; v1 submitted 2 August, 2024; originally announced August 2024.

Comments: 12 pages, 7 figures

arXiv:2407.21783 [pdf, other]

The Llama 3 Herd of Models

Authors: Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurelien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Roziere, Bethany Biron, Binh Tang, et al. (510 additional authors not shown)

Abstract: Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical evaluation of Llama 3. We find that Llama 3 delivers comparable quality to leading language models such as GPT-4 on a plethora of tasks. We publicly release Llama 3, including pre-trained and post-trained versions of the 405B parameter language model and our Llama Guard 3 model for input and output safety. The paper also presents the results of experiments in which we integrate image, video, and speech capabilities into Llama 3 via a compositional approach. We observe this approach performs competitively with the state-of-the-art on image, video, and speech recognition tasks. The resulting models are not yet being broadly released as they are still under development.

Submitted 15 August, 2024; v1 submitted 31 July, 2024; originally announced July 2024.

arXiv:2407.21774 [pdf, other]

Spurious Solar-Wind Effects on Acceleration Noise in LISA Pathfinder

Authors: Arnold Yang, Indie Desiderio-Sloane, Grant David Meadors

Abstract: Spurious solar-wind effects are a potential noise source in the measurements of the future Laser Interferometer Space Antenna (LISA). Comparative models are used to predict the possible impact of this noise factor and estimate spurious solar-wind effects on acceleration noise in LISA Pathfinder (LPF). Data from NASA's Advanced Composition Explorer (ACE), situated at the L1 Lagrange point, served as a reliable source of solar-wind data. The data sets were compared over the 114-day time period from March 1, 2016 to June 23, 2016. To evaluate these effects, the data from both satellites were formatted, gap-filled, and adapted for comparison, and a coherence plot was produced to compare the results of the Fast Fourier Transforms. The coherence plot suggested that the solar wind had a minuscule effect on LPF, and higher frequency coherence (LISA's main observing band) can be attributed to random chance correlation. This result indicates that measurable correlation due to solar-wind noise over 3-month timescales can be ruled out as a noise source. This is encouraging, although another source of noise from the sun, solar irradiance pressure, is estimated to have a more significant effect and has yet to be analyzed.

Submitted 31 July, 2024; originally announced July 2024.

Comments: 14 pages, 9 figures, will be submitted to Classical and Quantum Gravity

Report number: LA-UR-23-30801

arXiv:2407.19178 [pdf, other]

Power-LLaVA: Large Language and Vision Assistant for Power Transmission Line Inspection

Authors: Jiahao Wang, Mingxuan Li, Haichen Luo, Jinguo Zhu, Aijun Yang, Mingzhe Rong, Xiaohua Wang

Abstract: The inspection of power transmission lines has made notable progress in the past few years, primarily due to the integration of deep learning technology. However, current inspection approaches continue to encounter difficulties in generalization and intelligence, which restricts their further applicability. In this paper, we introduce Power-LLaVA, the first large language and vision assistant designed to offer professional and reliable inspection services for power transmission lines by engaging in dialogues with humans. Moreover, we also construct a large-scale and high-quality dataset specialized for the inspection task. By employing a two-stage training strategy on the constructed dataset, Power-LLaVA demonstrates exceptional performance at a comparatively low training cost. Extensive experiments further prove the great capabilities of Power-LLaVA within the realm of power transmission line inspection. Code shall be released.

Submitted 27 July, 2024; originally announced July 2024.

arXiv:2407.17867 [pdf, other]

Intrinsic Nonlinear Spin Hall Effect and Manipulation of Perpendicular Magnetization

Authors: Hui Wang, Huiying Liu, Xukun Feng, Jin Cao, Weikang Wu, Shen Lai, Weibo Gao, Cong Xiao, Shengyuan A. Yang

Abstract: We propose an intrinsic nonlinear spin Hall effect, which enables the generation of collinearly-polarized spin current in a large class of nonmagnetic materials with the corresponding linear response being symmetry-forbidden. This opens a new avenue for field-free switching of perpendicular magnetization, which is required for the next-generation information storage technology. We develop the microscopic theory of this effect, and clarify its quantum origin in band geometric quantities which can be enhanced by topological nodal features. Combined with first-principles calculations, we predict pronounced effects at room temperature in topological metals $\mathrm{PbTaSe_{2}}$ and PdGa. Our work establishes a fundamental nonlinear response in spin transport, and opens the door to exploring spintronic applications based on nonlinear spin Hall effect.

Submitted 25 July, 2024; originally announced July 2024.

arXiv:2407.13932 [pdf]

Excitation laser energy dependence of the gap-mode TERS spectra of WS$_2$ and MoS$_2$ on silver

Authors: Andrey Krayev, Eleonora Isotta, Lauren Hoang, Jerry A. Yang, Kathryn Neilson, Minyuan Wang, Noah Haughn, Eric Pop, Andrew Mannix, Oluwaseyi Balogun, Chih-Feng Wang

Abstract: We present a systematic study of the dependence of gap mode tip-enhanced Raman scattering (TERS) of mono- and bi-layer WS$_2$ and MoS$_2$ as a function of excitation laser energy. We collected consecutive TERS maps of mono- and bi-layer regions with 6 different excitation lasers. To decrease the acquisition time, we used for the first time concurrent excitation and collection with two lasers simultaneously. We found that the E$_{2g}$/A$_{1g}$ peak intensity ratio for bilayer WS$_2$@Ag and the A'/A$_{1g}$ peak intensity ratio of the out-of-plane modes for mono- and bilayer change in a significantly non-monotonous way with excitation laser energies from 1.58 to 2.62 eV. The former ratio increases at energies corresponding to A and B excitons in bilayer WS$_2$. The intensity of the A peak in the monolayer, and hence the A/A$_{1g}$ ratio, is surprisingly high at low excitation energies, dips dramatically at energy corresponding to the A exciton, and is restored partially in between A and B excitons, though still showing a descending trend with increasing energy. A similar picture was observed in mono- and bi-layer MoS$_2$, though the existing set of lasers did not match its excitonic profile as nicely as for WS$_2$. We attribute the observed behavior to intermediate (Fano resonance) or strong (Rabi splitting) coupling between the excitons in transition metal dichalcogenides (TMDs) and the plasmons in the tip-substrate nanocavity.
This is akin to the so-called Fano (Rabi) transparency experimentally observed in far field scattering from TMDs between two plasmonic metals. The possibility of intermediate/strong coupling between excitonic resonances in TMDs and the nanocavity re-evaluates the role of resonances in gap-mode TERS and should become an important factor to be considered by TERS practitioners when planning experiments. Finally, we propose the ideal substrate for efficient TERS and tip enhanced photoluminescence measurements.

Submitted 18 July, 2024; originally announced July 2024.

Comments: 21 pages, 10 figures

arXiv:2407.12585 [pdf, other]

A global view on star formation: The GLOSTAR Galactic plane survey. XI. Radio source catalog IV: $2^\circ < \ell < 28^\circ$, $36^\circ < \ell < 60^\circ$ and $|b| < 1^\circ$

Authors: S. -N. X. Medina, S. A. Dzib, J. S. Urquhart, A. Y. Yang, A. Brunthaler, K. M. Menten, F. Wyrowski, W. D. Cotton, A. Cheema, R. Dokara, Y. Gong, S. Khan, H. Nguyen, G. N. Ortiz-Leon, M. R. Rugel, V. S. Veena, H. Beuther, T. Csengeri, J. D. Pandian, N. Roy

Abstract: The GLOSTAR survey studies star formation with the VLA and the Effelsberg 100m telescope in the Galactic plane (-2d<l<60d; |b|<1d) and the Cygnus X region with unprecedented sensitivity in both flux density (~50uJy/beam) and the capability of detecting emission with angular scales in the range from 1" to the largest radio structures in the Galaxy. We provide a complete GLOSTAR-VLA D-configuration radio source catalog for the covered part of the Galactic disk. A catalog for the pilot region (28d<l<36d) has been published in a previous paper and here we present the complementary catalog for the area within 2d<l<28d, 36d<l<60d and |b|<1d. Observations were taken with the VLA in a 4-8GHz band to image 100 degrees$^2$ of the inner Galactic disk at a reference frequency of 5.8GHz, using 260h of telescope time. We determined spectral indices inside the observed band and in the frequency range 1.4-5.8GHz by complementing our results with those from the THOR survey (1-2GHz). The final images have an angular resolution of 18" and an average sensitivity of 123uJy/beam. The sensitivity is better (~60uJy/beam) in areas free of extended emission. The Galactic disk catalog presented in this work consists of 11211 radio sources. Of these, 1965 are known large-scale structure sources such as star-forming region complexes, well-known SNRs, SNR candidates or parts thereof. The remaining 9227 are discrete individual sources. Source parameters, namely flux densities, sizes, spectral indices, and classifications are reported.
We identify 769 HII region candidates, of which 359 are newly classified as such. The mean value of spectral indices of 225 HII regions is 0.14$\pm$0.02, consistent with most of them emitting optically thin thermal radio emission. Combining our results with the previously published catalog of the pilot region, the final GLOSTAR-VLA D-configuration catalog contains 12981 radio sources.

Submitted 8 August, 2024; v1 submitted 17 July, 2024; originally announced July 2024.

Comments: 21 pages, 18 figures, 7 tables, accepted to be published in the Astronomy & Astrophysics journal. V2 Includes language editor corrections

arXiv:2407.10671 [pdf, other]

Qwen2 Technical Report

Authors: An Yang, Baosong Yang, Binyuan Hui, Bo Zheng, Bowen Yu, Chang Zhou, Chengpeng Li, Chengyuan Li, Dayiheng Liu, Fei Huang, Guanting Dong, Haoran Wei, Huan Lin, Jialong Tang, Jialin Wang, Jian Yang, Jianhong Tu, Jianwei Zhang, Jianxin Ma, Jianxin Yang, Jin Xu, Jingren Zhou, Jinze Bai, Jinzheng He, Junyang Lin, et al. (37 additional authors not shown)

Abstract: This report introduces the Qwen2 series, the latest addition to our large language models and large multimodal models. We release a comprehensive suite of foundational and instruction-tuned language models, encompassing a parameter range from 0.5 to 72 billion, featuring dense models and a Mixture-of-Experts model. Qwen2 surpasses most prior open-weight models, including its predecessor Qwen1.5, and exhibits competitive performance relative to proprietary models across diverse benchmarks on language understanding, generation, multilingual proficiency, coding, mathematics, and reasoning. The flagship model, Qwen2-72B, showcases remarkable performance: 84.2 on MMLU, 37.9 on GPQA, 64.6 on HumanEval, 89.5 on GSM8K, and 82.4 on BBH as a base language model. The instruction-tuned variant, Qwen2-72B-Instruct, attains 9.1 on MT-Bench, 48.1 on Arena-Hard, and 35.7 on LiveCodeBench. Moreover, Qwen2 demonstrates robust multilingual capabilities, proficient in approximately 30 languages, spanning English, Chinese, Spanish, French, German, Arabic, Russian, Korean, Japanese, Thai, Vietnamese, and more, underscoring its versatility and global reach. To foster community innovation and accessibility, we have made the Qwen2 model weights openly available on Hugging Face and ModelScope, and the supplementary materials including example code on GitHub. These platforms also include resources for quantization, fine-tuning, and deployment, facilitating a wide range of applications and research endeavors.

Submitted 17 July, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

Comments: 25 pages, 1 figure

arXiv:2407.09037 [pdf]

Photonic quasicrystal of spin angular momentum

Authors: Min Lin, Xinxin Gou, Zhenwei Xie, Aiping Yang, Luping Du, Xiaocong Yuan

Abstract: Quasicrystals, characterized by long-range order without translational symmetry, have catalyzed transformative advances in various fields, including optics in terms of field quasicrystals. Here, we present the first demonstration of photonic quasicrystals formed by spin angular momentum, unveiling novel spin-orbit coupling effects absent in traditional field quasicrystals. A de Bruijn tiling-like theoretical framework was built, elucidating the formation mechanism of spin quasicrystals for diverse symmetries. Moreover, the configurations of these spin textures can be manipulated through adjustments of the wavefronts, among which phason-like discontinuous dynamics is observed and quantitatively measured. Unlike optical quasicrystals shaped by electromagnetic fields, these spin-governed quasicrystals exhibit quasi-periodic properties of kinematic parameters, extending their potential applications to other physical systems. These findings hold promise for novel advancements in optical trapping, quasicrystal fabrication, and optical encryption systems.

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.08255 [pdf, other]

GraphMamba: An Efficient Graph Structure Learning Vision Mamba for Hyperspectral Image Classification

Authors: Aitao Yang, Min Li, Yao Ding, Leyuan Fang, Yaoming Cai, Yujie He

Abstract: Efficient extraction of spectral sequences and geospatial information has always been a hot topic in hyperspectral image classification. In terms of spectral sequence feature capture, RNN and Transformer have become mainstream classification frameworks due to their long-range feature capture capabilities. In terms of spatial information aggregation, CNN enhances the receptive field to retain integrated spatial information as much as possible. However, the spectral feature-capturing architectures exhibit low computational efficiency, and CNNs lack the flexibility to perceive spatial contextual information. To address these issues, this paper proposes GraphMamba--an efficient graph structure learning vision Mamba classification framework that fully considers HSI characteristics to achieve deep spatial-spectral information mining. Specifically, we propose a novel hyperspectral visual GraphMamba processing paradigm (HVGM) that preserves spatial-spectral features by constructing spatial-spectral cubes and utilizes linear spectral encoding to enhance the operability of subsequent tasks. The core components of GraphMamba include the HyperMamba module for improving computational efficiency and the SpectralGCN module for adaptive spatial context awareness. The HyperMamba mitigates clutter interference by employing the global mask (GM) and introduces a parallel training inference architecture to alleviate computational bottlenecks.
The SpatialGCN incorporates weighted multi-hop aggregation (WMA) spatial encoding to focus on highly correlated spatial structural features, thus flexibly aggregating contextual information while mitigating spatial noise interference. Extensive experiments were conducted on three different scales of real HSI datasets, and compared with the state-of-the-art classification frameworks, GraphMamba achieved optimal performance.

Submitted 11 July, 2024; originally announced July 2024.

Comments: 13 pages, 10 figures

arXiv:2407.05770 [pdf, other]

A global view on star formation: The GLOSTAR Galactic plane survey X. Galactic HII region catalog using radio recombination lines

Authors: S. Khan, M. R. Rugel, A. Brunthaler, K. M. Menten, F. Wyrowski, J. S. Urquhart, Y. Gong, A. Y. Yang, H. Nguyen, R. Dokara, S. A. Dzib, S. -N. X. Medina, G. N. Ortiz-León, J. D. Pandian, H. Beuther, V. S. Veena, S. Neupane, A. Cheema, W. Reich, N. Roy

Abstract: Studies of Galactic HII regions are of crucial importance for studying star formation and the evolution of the interstellar medium. Gaining an insight into their physical characteristics contributes to a more comprehensive understanding of these phenomena. The GLOSTAR project aims to provide a GLObal view on STAR formation in the Milky Way by performing an unbiased and sensitive survey. This is achieved by using the extremely wideband (4-8 GHz) C-band receiver of the Karl G. Jansky Very Large Array and the Effelsberg 100 m telescope. Using radio recombination lines observed in the GLOSTAR survey with the VLA in D-configuration with a typical line sensitivity of 1σ $\sim$ 3.0 mJy beam$^{-1}$ at $\sim$ 5 km s$^{-1}$ and an angular resolution of 25", we cataloged 244 individual Galactic HII regions and derived their physical properties. We examined the mid-infrared (MIR) morphology of these HII regions and find that a significant portion of them exhibit a bubble-like morphology in the GLIMPSE 8 μm emission. We also searched for associations with the dust continuum and sources of methanol maser emission, other tracers of young stellar objects, and find that 48% and 14% of our HII regions, respectively, are coextensive with those. We measured the electron temperature for a large sample of HII regions within Galactocentric distances spanning from 1.6 to 13.1 kpc and derived the Galactic electron temperature gradient as $\sim$ 372 $\pm$ 28 K kpc$^{-1}$ with an intercept of 4248 $\pm$ 161 K, which is consistent with previous studies.

Submitted 8 July, 2024; originally announced July 2024.

Comments: Accepted for publication in A&A
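
The linear electron temperature relation quoted in the abstract above can be evaluated directly. The following is a minimal sketch that uses only the central values of the gradient (372 K kpc$^{-1}$) and intercept (4248 K) and ignores the quoted uncertainties; the function name and the range check are illustrative assumptions, not part of the paper.

```python
# Central values quoted in the abstract; uncertainties (+/-28, +/-161) are ignored here.
GRADIENT_K_PER_KPC = 372.0
INTERCEPT_K = 4248.0

# Galactocentric distance range covered by the sample, per the abstract.
R_MIN_KPC, R_MAX_KPC = 1.6, 13.1


def electron_temperature(r_gal_kpc: float) -> float:
    """Return the fitted electron temperature (K) at Galactocentric distance r_gal_kpc.

    Hypothetical helper: raises ValueError outside the fitted range, because
    extrapolating the linear fit is not supported by the abstract.
    """
    if not (R_MIN_KPC <= r_gal_kpc <= R_MAX_KPC):
        raise ValueError("distance outside the 1.6-13.1 kpc range of the sample")
    return INTERCEPT_K + GRADIENT_K_PER_KPC * r_gal_kpc


t_e_at_8_kpc = electron_temperature(8.0)  # 4248 + 372 * 8 = 7224 K
```
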

arXiv:2407.03064 [pdf, other]

Hilbert band complexes and their applications

Authors: Zeying Zhang, Y. X. Zhao, Yugui Yao, Shengyuan A. Yang

Abstract: The study of band connectivity is a fundamental problem in condensed matter physics. Here, we develop a new method for analyzing band connectivity, which completely solves the outstanding questions of the reducibility and decomposition of band complexes. By translating the symmetry conditions into a set of band balance equations, we show that all possible band structure solutions can be described by a positive affine monoid structure, which has a unique minimal set of generators, called the Hilbert basis. We show that the Hilbert basis completely determines whether a band complex is reducible and how it can be decomposed. The band complexes corresponding to Hilbert basis vectors, termed Hilbert band complexes (HBCs), can be regarded as elementary building blocks of band structures. We develop algorithms to construct HBCs, analyze their graph features, and merge them into large complexes. We find some interesting examples, such as HBCs corresponding to complete bipartite graphs, and complexes which can grow without bound by successively merging an HBC.

Submitted 3 July, 2024; originally announced July 2024.

Comments: 9 pages, 7 figures

arXiv:2406.17183 [pdf, other]

doi 10.21428/d82e957c.3da7f032

POPCat: Propagation of particles for complex annotation tasks

Authors: Adam Srebrnjak Yang, Dheeraj Khanna, John S. Zelek

Abstract: Novel dataset creation for all multi-object tracking, crowd-counting, and industrial-based videos is arduous and time-consuming when faced with a unique class that densely populates a video sequence. We propose a time-efficient method called POPCat that exploits the multi-target and temporal features of video data to produce a semi-supervised pipeline for segmentation or box-based video annotation. The method retains the accuracy level associated with human-level annotation while generating a large volume of semi-supervised annotations for greater generalization. The method capitalizes on temporal features through the use of a particle tracker to expand the domain of human-provided target points. The tracker reassociates the initial points to a set of images that follow the labeled frame. A YOLO model is then trained with this generated data, and then rapidly infers on the target video. Evaluations are conducted on GMOT-40, AnimalTrack, and Visdrone-2019 benchmarks. These multi-target video tracking/detection sets contain multiple similar-looking targets, camera movements, and other features that would commonly be seen in "wild" situations. We specifically choose these difficult datasets to demonstrate the efficacy of the pipeline and for comparison purposes. The method applied on GMOT-40, AnimalTrack, and Visdrone shows a margin of improvement on recall/mAP50/mAP over the best results by a value of 24.5%/9.6%/4.8%, -/43.1%/27.8%, and 7.5%/9.4%/7.5% where metrics were collected.

Submitted 24 June, 2024; originally announced June 2024.

Comments: 10 pages, 5 figures, Accepted in "Conference on Robots and Vision 2024"

arXiv:2406.14209 [pdf]

doi 10.1088/2515-7639/ad2083

2024 roadmap on 2D topological insulators

Authors: Bent Weber, Michael S Fuhrer, Xian-Lei Sheng, Shengyuan A Yang, Ronny Thomale, Saquib Shamim, Laurens W Molenkamp, David Cobden, Dmytro Pesin, Harold J W Zandvliet, Pantelis Bampoulis, Ralph Claessen, Fabian R Menges, Johannes Gooth, Claudia Felser, Chandra Shekhar, Anton Tadich, Mengting Zhao, Mark T Edmonds, Junxiang Jia, Maciej Bieniek, Jukka I Väyrynen, Dimitrie Culcer, Bhaskaran Muralidharan, Muhammad Nadeem

Abstract: 2D topological insulators promise novel approaches towards electronic, spintronic, and quantum device applications. This is owing to unique features of their electronic band structure, in which bulk-boundary correspondence enforces the existence of 1D spin-momentum locked metallic edge states - both helical and chiral - surrounding an electrically insulating bulk. Forty years since the first discoveries of topological phases in condensed matter, the abstract concept of band topology has sprung into realization with several materials now available in which sizable bulk energy gaps - up to a few hundred meV - promise to enable topology for applications even at room temperature. Further, the possibility of combining 2D TIs in heterostructures with functional materials such as multiferroics, ferromagnets, and superconductors, vastly extends the range of applicability beyond their intrinsic properties. While 2D TIs remain a unique testbed for questions of fundamental condensed matter physics, proposals seek to control the topologically protected bulk or boundary states electrically, or even induce topological phase transitions to engender switching functionality. Induction of superconducting pairing in 2D TIs strives to realize non-Abelian quasiparticles, promising avenues towards fault-tolerant topological quantum computing. This roadmap aims to present a status update of the field, reviewing recent advances and remaining challenges in theoretical understanding, materials synthesis, physical characterization and, ultimately, device perspectives.

Submitted 20 June, 2024; originally announced June 2024.

arXiv:2406.11180 [pdf, other]

Definition and Frequency Dependence of Intrinsic Nonlinear Current

Authors: Cong Xiao, Jin Cao, Qian Niu, Shengyuan A. Yang

Abstract: We show that the three commonly employed approaches that define the same intrinsic linear anomalous Hall response actually lead to different results for intrinsic nonlinear transport. The difference arises from an intrinsic anomalous distribution. It originates from scattering, but its value is completely independent of scattering, because it represents the local equilibration of electron wave packets with field-corrected energy. As a manifestation, we find that under ac driving, the intrinsic contributions in the rectified component and in the double-frequency component exhibit distinct frequency dependence, which can be probed in experiment. Using first-principles calculations, we estimate the signals that can be probed in antiferromagnetic CuMnAs.

Submitted 16 June, 2024; originally announced June 2024.

arXiv:2406.08893 [pdf, other]

Modeling Nonlinear Dynamics from Videos

Authors: Antony Yang, Joar Axås, Fanni Kádár, Gábor Stépán, George Haller

Abstract: We introduce a method for constructing reduced-order models directly from videos of dynamical systems. The method uses non-intrusive tracking to isolate the motion of a user-selected part in the video of an autonomous dynamical system. In the space of delayed observations of this motion, we reconstruct a low-dimensional attracting spectral submanifold (SSM) whose internal dynamics serves as a mathematically justified reduced-order model for nearby motions of the full system. We obtain this model in a simple polynomial form that allows explicit identification of important physical system parameters, such as natural frequencies, linear and nonlinear damping and nonlinear stiffness. Beyond faithfully reproducing attracting steady states and limit cycles, our SSM-reduced models can also uncover hidden motion not seen in the video, such as unstable fixed points and unstable limit cycles forming basin boundaries. We demonstrate all these features on experimental videos of five physical systems: a double pendulum, an inverted flag in counter-flow, water sloshing in a tank, a wing exhibiting aeroelastic flutter and a shimmying wheel.

Submitted 13 June, 2024; originally announced June 2024.

arXiv:2406.07626 [pdf, other]

Tailoring Bound State Geometry in High-Dimensional Non-Hermitian Systems

Authors: Ao Yang, Zixi Fang, Kai Zhang, Chen Fang

Abstract: It is generally believed that the non-Hermitian skin effect (NHSE), due to its non-reciprocal nature, creates barriers for the appearance of impurity bound states. In this paper, we find that in two and higher dimensions, the presence of the geometry-dependent skin effect eliminates this barrier such that even an infinitesimal impurity potential can confine bound states in this type of non-Hermitian system. By examining bound states around Bloch saddle points, we find that non-Hermiticity can disrupt the isotropy of bound states, resulting in concave dumbbell-shaped bound states. Our work reveals a geometry transition of bound states between concavity and convexity in high-dimensional non-Hermitian systems.

Submitted 11 June, 2024; originally announced June 2024.

Comments: 14 pages, 4 figures

arXiv:2406.07346 [pdf, other]

Few-Body Quantum Chaos, Localization, and Multi-Photon Entanglement in Optical Synthetic Frequency Dimension

Authors: Junlin Wang, Luojia Wang, Jinlou Ma, Ang Yang, Luqi Yuan, Lei Ying

Abstract: Generation and control of entanglement are fundamental tasks in quantum information processing. In this paper, we propose a novel approach to generate controllable frequency-entangled photons by using the concept of synthetic frequency dimension in an optical system. Such a system consists of a ring resonator made of a tailored third-order nonlinear medium to induce photon-photon interactions and a periodic modulator to manipulate coupling between different frequency modes. We show this system provides a unique platform for the exploration of distinct few- or many-body quantum phases including chaos, localization, and integrability in a highly integrable photonics platform. In particular, we develop a potential experimental method to calculate the spectral form factor, which characterizes the degree of chaos in the system and differentiates between these phases based on observable measurements. Interestingly, the transition signatures of each phase can lead to an efficient generation of frequency-entangled multi-photon states. This work is the first to explore rich and controllable quantum phases beyond single particle in a synthetic dimension.

Submitted 11 June, 2024; originally announced June 2024.

Comments: 15 pages, 8 figures

arXiv:2406.05892 [pdf, other]

Security Vulnerability Detection with Multitask Self-Instructed Fine-Tuning of Large Language Models

Authors: Aidan Z. H. Yang, Haoye Tian, He Ye, Ruben Martins, Claire Le Goues

Abstract: Software security vulnerabilities allow attackers to perform malicious activities to disrupt software operations. Recent Transformer-based language models have significantly advanced vulnerability detection, surpassing the capabilities of static-analysis-based deep learning models. However, language models trained solely on code tokens do not capture either the explanation of vulnerability type or the data flow structure information of code, both of which are crucial for vulnerability detection. We propose a novel technique that integrates a multitask sequence-to-sequence LLM with program control flow graphs encoded as a graph neural network to achieve sequence-to-classification vulnerability detection. We introduce MSIVD, multitask self-instructed fine-tuning for vulnerability detection, inspired by chain-of-thought prompting and LLM self-instruction. Our experiments demonstrate that MSIVD achieves superior performance, outperforming the highest LLM-based vulnerability detector baseline (LineVul), with an F1 score of 0.92 on the BigVul dataset, and 0.48 on the PreciseBugs dataset. By training LLMs and GNNs simultaneously using a combination of code and explanatory metrics of a vulnerable program, MSIVD represents a promising direction for advancing LLM-based vulnerability detection that generalizes to unseen data. Based on our findings, we further discuss the necessity for new labelled security vulnerability datasets, as recent LLMs have seen or memorized prior datasets' held-out evaluation data.

Submitted 9 June, 2024; originally announced June 2024.

arXiv:2406.04876 [pdf, other]

HateDebias: On the Diversity and Variability of Hate Speech Debiasing

Authors: Nankai Lin, Hongyan Wu, Zhengming Chen, Zijian Li, Lianxi Wang, Shengyi Jiang, Dong Zhou, Aimin Yang

Abstract: Hate speech on social media is ubiquitous and urgently needs to be controlled. Without detecting and mitigating the biases brought by hate speech, different types of ethical problems arise. While a number of datasets have been proposed to address the problem of hate speech detection, these datasets seldom consider the diversity and variability of bias, making them far from real-world scenarios. To fill this gap, we propose a benchmark, named HateDebias, to analyze the hate speech detection ability of models under continuous, changing environments. Specifically, to meet the diversity of biases, we collect existing hate speech detection datasets with different types of biases. To further meet the variability (i.e., the changing of bias attributes in datasets), we reorganize datasets to follow the continuous learning setting. We compare the detection accuracy of models trained on datasets with a single type of bias against their performance on HateDebias, where a significant performance drop is observed. To provide a potential direction for debiasing, we further propose a debiasing framework based on continuous learning and bias information regularization, as well as memory replay strategies to ensure the debiasing ability of the model. Experimental results on the proposed benchmark show that the aforementioned method can improve several baselines by a notable margin, highlighting its effectiveness in real-world applications.

Submitted 7 June, 2024; originally announced June 2024.

arXiv:2406.02128 [pdf, other]

Iteration Head: A Mechanistic Study of Chain-of-Thought

Authors: Vivien Cabannes, Charles Arnal, Wassim Bouaziz, Alice Yang, Francois Charton, Julia Kempe

Abstract: Chain-of-Thought (CoT) reasoning is known to improve Large Language Models both empirically and in terms of theoretical approximation power. However, our understanding of the inner workings of CoT capabilities and of the conditions under which they appear remains limited. This paper helps fill this gap by demonstrating how CoT reasoning emerges in transformers in a controlled and interpretable setting. In particular, we observe the appearance of a specialized attention mechanism dedicated to iterative reasoning, which we call "iteration heads". We track both the emergence and the precise working of these iteration heads down to the attention level, and measure the transferability of the CoT skills to which they give rise between tasks.

Submitted 4 June, 2024; originally announced June 2024.

arXiv:2405.20298 [pdf, other]

doi 10.1103/PhysRevB.110.054507

Representation Theory for Massless Quasiparticles in Bogoliubov-de Gennes Systems

Authors: Arist Zhenyuan Yang, Zheng-Xin Liu

Abstract: Gapless quasiparticles can exist in the Bogoliubov-de Gennes (BdG) Hamiltonians in the mean field description of superconductors (SCs), fermionic superfluids (SFs) and quantum spin liquids (QSLs). The mechanism of gapless quasiparticles in superconductors was studied in the literature based on homotopy theory or symmetry indicators. However, important properties of the gapless quasiparticles including the degeneracy, the energy-momentum dispersion and the responses to external probe fields need to be determined. In the present work, we investigate gapless quasiparticles in general BdG systems by using projective representation theory for the full 'symmetry' groups formed by combinations of lattice, spin and charge operations. We find that (I) charge conjugation (or effective charge conjugation) symmetry can yield gapless quasiparticles with linear, quadratic or higher order dispersions at high symmetry points of the Brillouin zone; (II) level crossings protected by different quantum numbers can give rise to zero modes along high symmetry lines; (III) combined spatial inversion and time reversal symmetry can protect zero modes appearing at generic $k$ points. To obtain the low energy properties of gapless quasiparticles, the $k\cdot p$ theory is provided using a highly efficient method, the Hamiltonian approach. Based on generalized band representation theory for BdG systems, several lattice models are constructed to illustrate the above results. Our theory provides a method to classify nodal SCs/SFs/QSLs with given symmetries, and sheds light on the realization of Majorana type massless quasiparticles in condensed matter physics.

Submitted 9 August, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

Comments: 27 pages, 11 figures

Journal ref: Physical Review B 110, 054507 (2024)

arXiv:2405.15682 [pdf, other]

The Road Less Scheduled

Authors: Aaron Defazio, Xingyu Alice Yang, Harsh Mehta, Konstantin Mishchenko, Ahmed Khaled, Ashok Cutkosky

Abstract: Existing learning rate schedules that do not require specification of the optimization stopping step T are greatly out-performed by learning rate schedules that depend on T. We propose an approach that avoids the need for this stopping time by eschewing the use of schedules entirely, while exhibiting state-of-the-art performance compared to schedules across a wide family of problems ranging from convex problems to large-scale deep learning problems. Our Schedule-Free approach introduces no additional hyper-parameters over standard optimizers with momentum. Our method is a direct consequence of a new theory we develop that unifies scheduling and iterate averaging. An open source implementation of our method is available (https://github.com/facebookresearch/schedule_free).

Submitted 7 August, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

arXiv:2405.14394 [pdf, other]

Instruction Tuning With Loss Over Instructions

Authors: Zhengyan Shi, Adam X. Yang, Bin Wu, Laurence Aitchison, Emine Yilmaz, Aldo Lipani

Abstract: Instruction tuning plays a crucial role in shaping the outputs of language models (LMs) to desired styles. In this work, we propose a simple yet effective method, Instruction Modelling (IM), which trains LMs by applying a loss function to the instruction and prompt part rather than solely to the output part. Through experiments across 21 diverse benchmarks, we show that, in many scenarios, IM can effectively improve the LM performance on both NLP tasks (e.g., MMLU, TruthfulQA, and HumanEval) and open-ended generation benchmarks (e.g., MT-Bench and AlpacaEval). Remarkably, in the most advantageous case, IM boosts model performance on AlpacaEval 1.0 by over 100%. We identify two key factors influencing the effectiveness of IM: (1) The ratio between instruction length and output length in the training data; and (2) The number of training examples. We observe that IM is especially beneficial when trained on datasets with lengthy instructions paired with brief outputs, or under the Superficial Alignment Hypothesis (SAH) where a small number of training examples are used for instruction tuning. Further analysis substantiates our hypothesis that the improvement can be attributed to reduced overfitting to instruction tuning datasets. Our work provides practical guidance for instruction tuning LMs, especially in low-resource scenarios.

Submitted 23 May, 2024; originally announced May 2024.

Comments: Code is available at https://github.com/ZhengxiangShi/InstructionModelling
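
The contrast the abstract above draws, between a loss on the output part only and a loss that also covers the instruction, can be illustrated with a token-level mask. This is a minimal sketch under stated assumptions: the per-token log-probabilities are made-up numbers, `masked_nll` is a hypothetical helper, and averaging over the selected tokens is an assumed normalization rather than the authors' stated choice.

```python
import math


def masked_nll(token_log_probs, loss_mask):
    """Average negative log-likelihood over positions where loss_mask is truthy."""
    selected = [-lp for lp, keep in zip(token_log_probs, loss_mask) if keep]
    if not selected:
        raise ValueError("loss_mask selects no tokens")
    return sum(selected) / len(selected)


# Made-up log-probabilities for one training sequence:
# the first three tokens are the instruction, the last two are the output.
log_probs = [math.log(p) for p in (0.5, 0.25, 0.5, 0.8, 0.4)]
is_output = [0, 0, 0, 1, 1]

# Conventional instruction tuning: only output tokens contribute to the loss.
output_only_loss = masked_nll(log_probs, is_output)

# Instruction Modelling as described in the abstract: instruction tokens contribute too.
instruction_modelling_loss = masked_nll(log_probs, [1] * len(log_probs))
```

With short outputs and long instructions, the second variant draws its training signal from many more tokens per example, which matches the setting the abstract reports as most favourable.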

arXiv:2405.13907 [pdf, other]

Just rephrase it! Uncertainty estimation in closed-source language models via multiple rephrased queries

Authors: Adam Yang, Chen Chen, Konstantinos Pitas

Abstract: State-of-the-art large language models are sometimes distributed as open-source software but are also increasingly provided as a closed-source service. These closed-source large language models typically see the widest usage by the public; however, they often do not provide an estimate of their uncertainty when responding to queries. As even the best models are prone to "hallucinating" false information with high confidence, a lack of a reliable estimate of uncertainty limits the applicability of these models in critical settings. We explore estimating the uncertainty of closed-source LLMs via multiple rephrasings of an original base query. Specifically, we ask the model multiple rephrased questions and use the similarity of the answers as an estimate of uncertainty. We diverge from previous work in i) providing rules for rephrasing that are simple to memorize and use in practice and ii) proposing a theoretical framework for why multiple rephrased queries obtain calibrated uncertainty estimates. Our method demonstrates significant improvements in the calibration of uncertainty estimates compared to the baseline and provides intuition as to how query strategies should be designed for optimal test calibration.

Submitted 16 June, 2024; v1 submitted 22 May, 2024; originally announced May 2024.
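
The idea in the abstract above, using the similarity of answers to rephrased queries as an uncertainty estimate, can be sketched as follows. This is an illustration only: exact-match agreement after light normalization is an assumed stand-in for whatever similarity measure the paper uses, the answers are made-up strings rather than model output, and `agreement_confidence` is a hypothetical helper.

```python
from collections import Counter


def agreement_confidence(answers):
    """Return the most common answer and the fraction of answers that agree with it.

    Answers are compared after stripping whitespace and lowercasing (an assumption).
    """
    if not answers:
        raise ValueError("need at least one answer")
    normalized = [a.strip().lower() for a in answers]
    top_answer, top_count = Counter(normalized).most_common(1)[0]
    return top_answer, top_count / len(normalized)


# Made-up answers that a model might return for four rephrasings of one question.
answers = ["Paris", "paris", "Lyon", "Paris "]
top_answer, confidence = agreement_confidence(answers)
```

A low agreement fraction signals that the answer changes under rephrasing, so it should be treated as less reliable.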

arXiv:2405.11703 [pdf, other]

QComp: A QSAR-Based Data Completion Framework for Drug Discovery

Authors: Bingjia Yang, Yunsie Chung, Archer Y. Yang, Bo Yuan, Xiang Yu

Abstract: In drug discovery, in vitro and in vivo experiments reveal biochemical activities related to the efficacy and toxicity of compounds. The experimental data accumulate into massive, ever-evolving, and sparse datasets. Quantitative Structure-Activity Relationship (QSAR) models, which predict biochemical activities using only the structural information of compounds, face challenges in integrating the evolving experimental data as studies progress. We develop QSAR-Complete (QComp), a data completion framework to address this issue. Based on pre-existing QSAR models, QComp utilizes the correlation inherent in experimental data to enhance prediction accuracy across various tasks. Moreover, QComp emerges as a promising tool for guiding the optimal sequence of experiments by quantifying the reduction in statistical uncertainty for specific endpoints, thereby aiding in rational decision-making throughout the drug discovery process.

Submitted 19 May, 2024; originally announced May 2024.

arXiv:2405.09792 [pdf]

CMOS-compatible Strain Engineering for High-Performance Monolayer Semiconductor Transistors

Authors: Marc Jaikissoon, Çağıl Köroğlu, Jerry A. Yang, Kathryn M. Neilson, Krishna C. Saraswat, Eric Pop

Abstract: Strain engineering has played a key role in modern silicon electronics, having been introduced as a mobility booster in the 1990s and commercialized in the early 2000s. Achieving similar advances with two-dimensional (2D) semiconductors in a CMOS (complementary metal oxide semiconductor) compatible manner would radically improve the industrial viability of 2D transistors. Here, we show silicon nitride capping layers can impart strain to monolayer MoS2 transistors on conventional silicon substrates, enhancing their electrical performance with a low thermal budget (350 °C), CMOS-compatible approach. Strained back-gated and dual-gated MoS2 transistors demonstrate median increases up to 60% and 45% in on-state current, respectively. The greatest improvements are found when both transistor channels and contacts are reduced to ~200 nm, reaching saturation currents of 488 μA/μm, higher than any previous reports at such short contact pitch. Simulations reveal that most benefits arise from tensile strain lowering the contact Schottky barriers, and that further reducing device dimensions (including contacts) will continue to offer increased strain and performance improvements.

Submitted 29 June, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

arXiv:2405.04029 [pdf, other]

Enabling Privacy-Preserving and Publicly Auditable Federated Learning

Authors: Huang Zeng, Anjia Yang, Jian Weng, Min-Rong Chen, Fengjun Xiao, Yi Liu, Ye Yao

Abstract: Federated learning (FL) has attracted widespread attention because it supports the joint training of models by multiple participants without moving private datasets. However, there are still many security issues in FL that deserve discussion. In this paper, we consider three major issues: 1) how to ensure that the training process can be publicly audited by any third party; 2) how to avoid the influence of malicious participants on training; 3) how to ensure that private gradients and models are not leaked to third parties. Many solutions have been proposed to address these issues, but solving the above three problems simultaneously is seldom considered. In this paper, we propose a publicly auditable and privacy-preserving federated learning scheme that is resistant to malicious participants uploading gradients with wrong directions and enables anyone to audit and verify the correctness of the training process. In particular, we design a robust aggregation algorithm capable of detecting gradients with wrong directions from malicious participants. Then, we design a random vector generation algorithm and combine it with zero sharing and blockchain technologies to make the joint training process publicly auditable, meaning anyone can verify the correctness of the training. Finally, we conduct a series of experiments, and the experimental results show that the model generated by the protocol is comparable in accuracy to the original FL approach while keeping security advantages.

Submitted 7 May, 2024; originally announced May 2024.

Comments: ICC 2024 - 2024 IEEE International Conference on Communications Conference Program

ACM Class: C.2.2; C.2.4; E.3

arXiv:2405.02630 [pdf, other]

cuTN-QSVM: cuTensorNet-accelerated Quantum Support Vector Machine with cuQuantum SDK

Authors: Kuan-Cheng Chen, Tai-Yue Li, Yun-Yuan Wang, Simon See, Chun-Chieh Wang, Robert Wille, Nan-Yow Chen, An-Cheng Yang, Chun-Yu Lin

Abstract: This paper investigates the application of Quantum Support Vector Machines (QSVMs) with an emphasis on the computational advancements enabled by NVIDIA's cuQuantum SDK, especially leveraging the cuTensorNet library. We present a simulation workflow that substantially diminishes computational overhead, as evidenced by our experiments, from exponential to quadratic cost. While state vector simulations become infeasible for qubit counts over 50, our evaluation demonstrates that cuTensorNet speeds up simulations to be completed within seconds on the NVIDIA A100 GPU, even for qubit counts approaching 784. By employing multi-GPU processing with Message Passing Interface (MPI), we document a marked decrease in computation times, effectively demonstrating the strong linear speedup of our approach for increasing data sizes. This enables QSVMs to operate efficiently on High-Performance Computing (HPC) systems, thereby opening a new window for researchers to explore complex quantum algorithms that have not yet been investigated. In accuracy assessments, our QSVM achieves up to 95% on challenging classifications within the MNIST dataset for training sets larger than 100 instances, surpassing the capabilities of classical SVMs. These advancements position cuTensorNet within the cuQuantum SDK as a pivotal tool for scaling quantum machine learning simulations and potentially signpost the seamless integration of such computational strategies as pivotal within the Quantum-HPC ecosystem.

Submitted 8 May, 2024; v1 submitted 4 May, 2024; originally announced May 2024.

Comments: 10 pages, 14 figures

arXiv:2404.18852 [pdf, other]

VERT: Verified Equivalent Rust Transpilation with Large Language Models as Few-Shot Learners

Authors: Aidan Z. H. Yang, Yoshiki Takashima, Brandon Paulsen, Josiah Dodds, Daniel Kroening

Abstract: Rust is a programming language that combines memory safety and low-level control, providing C-like performance while guaranteeing the absence of undefined behaviors by default. Rust's growing popularity has prompted research on safe and correct transpiling of existing code-bases to Rust. Existing work falls into two categories: rule-based and large language model (LLM)-based. While rule-based approaches can theoretically produce correct transpilations that maintain input-output equivalence to the original, they often yield unreadable Rust code that uses unsafe subsets of the Rust language. On the other hand, while LLM-based approaches typically produce more readable, maintainable, and safe code, they do not provide any guarantees about correctness. In this work, we present VERT, a tool that can produce readable Rust transpilations with formal guarantees of correctness. VERT's only requirement is that there is a Web Assembly compiler for the source language, which is true for most major languages. VERT first uses the Web Assembly compiler to obtain an oracle Rust program. In parallel, VERT uses an LLM to generate a readable candidate Rust program. This candidate is verified against the oracle, and if verification fails, we regenerate a new candidate transpilation until verification succeeds. We evaluate VERT by transpiling a suite of 1,394 programs taken from competitive programming style benchmarks. Combining Anthropic's Claude-2 and VERT increases Rust transpilations passing property-based testing from 31% to 54% and bounded model-checking from 1% to 42% compared to using Claude alone. In addition, we evaluate VERT's ability to generate non-trivial safe Rust on programs taken from real-world C projects that make significant use of pointers. Our results provide insights into the limitations of LLMs to write safe Rust.

Submitted 25 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

arXiv:2404.18703 [pdf, other]

Gravitational wave detection of DFSZ axion models

Authors: Aidi Yang, Fa Peng Huang

Abstract: As a promising dark matter candidate, the axion particle can naturally solve the strong CP problem of the Standard Model through the U(1) Peccei-Quinn symmetry breaking process. However, detecting this high-energy process poses a significant challenge for colliders. We precisely calculate phase transition gravitational wave signals of this symmetry breaking process in the popular DFSZ axion model. By comparing these signals with the expected sensitivity curves of the Cosmic Explorer and the Einstein Telescope, we demonstrate that Cosmic Explorer can detect this process with a signal-to-noise ratio higher than 8 for the benchmark points. Furthermore, by performing a Fisher analysis, we find that if signals are observed, the bubble wall velocity will be the first phase transition parameter to be determined. Future gravitational wave detectors are expected to provide a new approach for exploring concrete axion models.

Submitted 20 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

Comments: References updated, 37 pages, 4 figures, 5 tables, comments are welcome

arXiv:2404.15236 [pdf, other]

Revisiting Unnaturalness for Automated Program Repair in the Era of Large Language Models

Authors: Aidan Z. H. Yang, Sophia Kolak, Vincent J. Hellendoorn, Ruben Martins, Claire Le Goues

Abstract: Language models have improved by orders of magnitude with the recent emergence of Transformer-based Large Language Models (LLMs). LLMs have demonstrated their ability to generate natural code that is highly similar to code written by professional developers. One intermediate value an LLM can emit is entropy, which measures the naturalness of a token of code. We hypothesize that entropy can be used to improve the performance of Automated Program Repair (APR) tasks. While much progress has been made in APR, fault localization techniques suffer from a lack of diversity in ranking scores, patch generation tools tend to be inefficient as all tests need to run before determining if a patch is likely to be correct, and patch ranking often suffers from the test-suite over-fitting problem. However, using an LLM directly for APR introduces concerns for training data leakage. In this work, we introduce a novel way of using the entropy of LLMs in combination with prior APR tools to improve all stages of APR. We show that entropy is highly complementary with prior fault localization tools. Our proposed re-ranking method achieves a 50% Top-5 score improvement over SBFL. We propose a patch-naturalness measurement, entropy-delta, to improve the efficiency of template-based repair techniques by ranking plausible patches before undergoing testing. When using entropy-delta for patch ranking and classification, our proposed method can rank correct patches more effectively than state-of-the-art machine learning tools with a 49% improvement in Top-1. Our work suggests that LLMs can be an effective addition to complement prior APR tasks while minimizing both the test-suite overfitting problem and the LLM data leakage problem.

Submitted 23 April, 2024; originally announced April 2024.

arXiv:2404.13229 [pdf]

Preserving History through Augmented Reality

Authors: Annie Yang

Abstract: Extended reality can weave together the fabric of the past, present, and future. A two-day design hackathon was held to bring the community together through a love for history and a common goal to use technology for good. Through interviewing an influential community elder, Emile Pitre, and referencing his book Revolution to Evolution, my team developed an augmented reality artifact to tell his story and preserve one revolutionary's legacy that impacted the University of Washington's history forever.

Submitted 19 April, 2024; originally announced April 2024.

Comments: Presented at CHI 2024 arXiv:2404.05889

Report number: ARSJ/2024/11

arXiv:2404.12629 [pdf, other]

Spreading Code Optimization for Low-Earth Orbit Satellites via Mixed-Integer Convex Programming

Authors: Alan Yang, Tara Mina, Grace Gao

Abstract: Optimizing the correlation properties of spreading codes is critical for minimizing inter-channel interference in satellite navigation systems. By improving the codes' correlation sidelobes, we can enhance navigation performance while minimizing the required spreading code lengths. In the case of low earth orbit (LEO) satellite navigation, shorter code lengths (on the order of a hundred) are preferred due to their ability to achieve fast signal acquisition. Additionally, the relatively high signal-to-noise ratio (SNR) in LEO systems reduces the need for longer spreading codes to mitigate inter-channel interference. In this work, we propose a two-stage block coordinate descent (BCD) method which optimizes the codes' correlation properties while enforcing the autocorrelation sidelobe zero (ACZ) property. In each iteration of the BCD method, we solve a mixed-integer convex program (MICP) over a block of 25 binary variables. Our method is applicable to spreading code families of arbitrary sizes and lengths, and we demonstrate its effectiveness for a problem with 66 length-127 codes and a problem with 130 length-257 codes.

Submitted 19 April, 2024; originally announced April 2024.

arXiv:2404.12412 [pdf]

Alloyed Re$_x$Mo$_{1-x}$S$_2$ Nanoflakes with Enlarged Interlayer Distances for Hydrogen Evolution

Authors: Jing Li, René Hübner, Marielle Deconinck, Ankita Bora, Markus Göbel, Dana Schwarz, Guangbo Chen, Guangzhao Wang, Shengyuan A. Yang, Yana Vaynzof, Vladimir Lesnyak

Abstract: Molybdenum sulfide (MoS$_2$) has attracted significant attention due to its great potential as a low-cost and efficient catalyst for the hydrogen evolution reaction. Developing a facile, easily upscalable, and inexpensive approach to produce catalytically active nanostructured MoS$_2$ with a high yield would significantly advance its practical application. Colloidal synthesis offers several advantages over other preparation techniques to overcome the low reaction yield of exfoliation and drawbacks of expensive equipment and processes used in chemical vapor deposition. In this work, we report an efficient synthesis of alloyed Re$_x$Mo$_{1-x}$S$_2$ nanoflakes with an enlarged interlayer distance, among which the composition Re$_{0.55}$Mo$_{0.45}$S$_2$ exhibits excellent catalytic performance with overpotentials as low as 79 mV at 10 mA/cm$^2$ and a small Tafel slope of 42 mV/dec. Density functional theory calculations prove that enlarging the distance between layers in the Re$_x$Mo$_{1-x}$S$_2$ alloy can greatly improve its catalytic performance due to a significantly reduced free energy of hydrogen adsorption. The developed approach paves the way to design advanced transition metal dichalcogenide-based catalysts for hydrogen evolution and to promote their large-scale practical application.

Submitted 17 April, 2024; originally announced April 2024.

arXiv:2404.11001 [pdf]

Modulation of the Octahedral Structure and Potential Superconductivity of La$_3$Ni$_2$O$_7$ through Strain Engineering

Authors: Zihao Huo, Zhihui Luo, Peng Zhang, Aiqin Yang, Zhengtao Liu, Xiangru Tao, Zihan Zhang, Shumin Guo, Qiwen Jiang, Wenxuan Chen, Dao-Xin Yao, Defang Duan, Tian Cui

Abstract: Recent transport measurements of La$_3$Ni$_2$O$_7$ uncover a "right-triangle" shape of the superconducting dome in the pressure-temperature (P-T) phase diagram. Motivated by this, we perform theoretical first-principles studies of La$_3$Ni$_2$O$_7$ with the pressure ranging from 0 to 100 GPa. Notably, we reveal a pressure dependence of the Ni-$d_{z^2}$ electron density at the Fermi energy ($n_z^{EF}$) that highly coincides with such shape. On this basis, we further explore the electronic structure under uniaxial stress. By tracking the stress response of $n_z^{EF}$, we propose that superconductivity can be achieved by applying only about 2 GPa of compression along the c axis. The idea is further exemplified from the perspectives of lattice distortion, band structure, Fermi surface and superconducting phase coherence. We also discuss the possible charge modulation under the stress and provide an insight into the relation between $n_z^{EF}$ and the superconducting Tc in the La$_3$Ni$_2$O$_7$ system. Our study provides a helpful guide to future experiments.

Submitted 8 July, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

arXiv:2404.08687 [pdf, other]

A Survey of Reasoning for Substitution Relationships: Definitions, Methods, and Directions

Authors: Anxin Yang, Zhijuan Du, Tao Sun

Abstract: Substitute relationships are fundamental to people's daily lives across various domains. This study aims to comprehend and predict substitute relationships among products in diverse fields, extensively analyzing the application of machine learning algorithms, natural language processing, and other technologies. By comparing model methodologies across different domains, such as defining substitutes, representing and learning substitute relationships, and substitute reasoning, this study offers a methodological foundation for delving deeper into substitute relationships. Through ongoing research and innovation, we can further refine the personalization and accuracy of substitute recommendation systems, thus advancing the development and application of this field.

Submitted 9 April, 2024; originally announced April 2024.

arXiv:2404.08215 [pdf, other]

Stability and noncentered PT symmetry of real topological phases

Authors: S. J. Yue, Qing Liu, Shengyuan A. Yang, Y. X. Zhao

Abstract: Real topological phases protected by the spacetime inversion (PT) symmetry are a current research focus. The basis is that the PT symmetry endows a real structure in momentum space, which leads to $\mathbb{Z}_2$ topological classifications in 1D and 2D. Here, we provide solutions to two outstanding problems in the diagnosis of real topology. First, based on the stable equivalence in K-theory, we clarify that the 2D topological invariant remains well defined in the presence of a nontrivial 1D invariant, and we develop a general numerical approach for its evaluation, which was hitherto unavailable. Second, under the unit-cell convention, noncentered PT symmetries assume momentum dependence, which violates the presumption in previous methods for computing the topological invariants. We clarify the classifications for this case and formulate the invariants by introducing a twisted Wilson-loop operator for both 1D and 2D. A simple model on a rectangular lattice is constructed to demonstrate our theory, which can be readily realized using artificial crystals.

Submitted 16 April, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

arXiv:2404.07600 [pdf, other]

Implicit and Explicit Language Guidance for Diffusion-based Visual Perception

Authors: Hefeng Wang, Jiale Cao, Jin Xie, Aiping Yang, Yanwei Pang

Abstract: Text-to-image diffusion models have shown powerful ability on conditional image synthesis. With large-scale vision-language pre-training, diffusion models are able to generate high-quality images with rich texture and reasonable structure under different text prompts. However, it is an open problem to adapt the pre-trained diffusion model for visual perception. In this paper, we propose an implicit and explicit language guidance framework for diffusion-based perception, named IEDP. Our IEDP comprises an implicit language guidance branch and an explicit language guidance branch. The implicit branch employs a frozen CLIP image encoder to directly generate implicit text embeddings that are fed to the diffusion model, without using explicit text prompts. The explicit branch utilizes the ground-truth labels of corresponding images as text prompts to condition feature extraction of the diffusion model. During training, we jointly train the diffusion model by sharing the model weights of these two branches. As a result, the implicit and explicit branches can jointly guide feature learning. During inference, we only employ the implicit branch for final prediction, which does not require any ground-truth labels. Experiments are performed on two typical perception tasks, including semantic segmentation and depth estimation. Our IEDP achieves promising performance on both tasks. For semantic segmentation, our IEDP has the mIoU$^\text{ss}$ score of 55.9% on the ADE20K validation set, which outperforms the baseline method VPD by 2.2%. For depth estimation, our IEDP outperforms the baseline method VPD with a relative gain of 11.0%.

Submitted 15 August, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

Comments: Accepted by IEEE TMM

arXiv:2404.04393 [pdf, other]

Counting Like Transformers: Compiling Temporal Counting Logic Into Softmax Transformers

Authors: Andy Yang, David Chiang

Abstract: Deriving formal bounds on the expressivity of transformers, as well as studying transformers that are constructed to implement known algorithms, are both effective methods for better understanding the computational power of transformers. Towards both ends, we introduce the temporal counting logic $\textbf{K}_\text{t}$[#] alongside the RASP variant $\textbf{C-RASP}$. We show they are equivalent to each other, and that together they are the best-known lower bound on the formal expressivity of future-masked soft attention transformers with unbounded input size. We prove this by showing all $\textbf{K}_\text{t}$[#] formulas can be compiled into these transformers. As a case study, we demonstrate on paper how to use $\textbf{C-RASP}$ to construct simple transformer language models that, using greedy decoding, can only generate sentences that have given properties formally specified in $\textbf{K}_\text{t}$[#].

Submitted 5 April, 2024; originally announced April 2024.

arXiv:2404.04259 [pdf]

The prominent and heterogeneous gender disparities in scientific novelty: evidence from biomedical doctoral theses

Authors: Meijun Liu, Zihan Xie, Alex Jie Yang, Chao Yu, Jian Xu, Ying Ding, Yi Bu

Abstract: Scientific novelty is the essential driving force for research breakthroughs and innovation. However, little is known about how early-career scientists pursue novel research paths, and the gender disparities in this process. To address this research gap, this study investigates a comprehensive dataset of 279,424 doctoral theses in biomedical sciences authored by US Ph.D. graduates. Spanning from 1980 to 2016, the data originates from the ProQuest Dissertations & Theses Database. This study aims to shed light on Ph.D. students' pursuit of scientific novelty in their doctoral theses and assess gender-related differences in this process. Using a combinatorial approach and a pre-trained Bio-BERT model, we quantify the scientific novelty of doctoral theses based on bio-entities. Applying fractional logistic and quantile regression models, this study reveals a decreasing trend in scientific novelty over time and heterogeneous gender disparities in doctoral theses. Specifically, female students consistently exhibited lower scientific novelty levels than their male peers. When supervised by female advisors, students' theses are found to be less novel than those under male advisors. The significant interaction effect of female students and female advisors suggests that female advisors may amplify the gender disparity in scientific novelty. Moreover, heterogeneous gender disparities in scientific novelty are identified, with non-top-tier universities displaying more pronounced disparities, while the differences at higher percentile ranges were comparatively more minor. These findings indicate a potential underrepresentation of female scientists pursuing novel research during the early stages of their careers. Notably, the outcomes of this study hold significant policy implications for advancing the careers of female scientists.

Submitted 19 January, 2024; originally announced April 2024.

arXiv:2404.00504 [pdf, other]

NYC-Indoor-VPR: A Long-Term Indoor Visual Place Recognition Dataset with Semi-Automatic Annotation

Authors: Diwei Sheng, Anbang Yang, John-Ross Rizzo, Chen Feng

Abstract: Visual Place Recognition (VPR) in indoor environments is beneficial to humans and robots for better localization and navigation. It is challenging due to appearance changes at various frequencies, and difficulties of obtaining ground truth metric trajectories for training and evaluation. This paper introduces the NYC-Indoor-VPR dataset, a unique and rich collection of over 36,000 images compiled from 13 distinct crowded scenes in New York City taken under varying lighting conditions with appearance changes. Each scene has multiple revisits across a year. To establish the ground truth for VPR, we propose a semiautomatic annotation approach that computes the positional information of each image. Our method specifically takes pairs of videos as input and yields matched pairs of images along with their estimated relative locations. The accuracy of this matching is refined by human annotators, who utilize our annotation software to correlate the selected keyframes. Finally, we present a benchmark evaluation of several state-of-the-art VPR algorithms using our annotated dataset, revealing its challenge and thus value for VPR research.

Submitted 30 March, 2024; originally announced April 2024.

Comments: 7 pages, 7 figures, published in 2024 IEEE International Conference on Robotics and Automation (ICRA 2024)

arXiv:2403.18189 [pdf]

doi 10.1038/s41467-024-45318-8

Interfacial magnetic spin Hall effect in van der Waals Fe3GeTe2/MoTe2 heterostructure

Authors: Yudi Dai, Junlin Xiong, Yanfeng Ge, Bin Cheng, Lizheng Wang, Pengfei Wang, Zenglin Liu, Shengnan Yan, Cuiwei Zhang, Xianghan Xu, Youguo Shi, Sang-Wook Cheong, Cong Xiao, Shengyuan A. Yang, Shi-Jun Liang, Feng Miao

Abstract: The spin Hall effect (SHE) allows efficient generation of spin polarization or spin current through charge current and plays a crucial role in the development of spintronics. While SHE typically occurs in non-magnetic materials and is time-reversal even, exploring time-reversal-odd (T-odd) SHE, which couples SHE to magnetization in ferromagnetic materials, offers a new charge-spin conversion mechanism with new functionalities. Here, we report the observation of giant T-odd SHE in Fe3GeTe2/MoTe2 van der Waals heterostructure, representing a previously unidentified interfacial magnetic spin Hall effect (interfacial-MSHE). Through rigorous symmetry analysis and theoretical calculations, we attribute the interfacial-MSHE to a symmetry-breaking induced spin current dipole at the vdW interface. Furthermore, we show that this linear effect can be used for implementing multiply-accumulate operations and binary convolutional neural networks with cascaded multi-terminal devices. Our findings uncover an interfacial T-odd charge-spin conversion mechanism with promising potential for energy-efficient in-memory computing.

Submitted 26 March, 2024; originally announced March 2024.

Journal ref: Nature Communications 15, 1129 (2024)

arXiv:2403.17912 [pdf, other]

Emergent Anomalous Hydrodynamics at Infinite Temperature in a Long-Range XXZ Model

Authors: Ang Yang, Jinlou Ma, Lei Ying

Abstract: Conventional wisdom suggests that the transport of conserved quantities in non-integrable quantum many-body systems at high temperatures is diffusive. However, we discover a counterexample to this paradigm by uncovering anomalous hydrodynamics in a spin-1/2 XXZ chain with power-law couplings. This model, classified as non-integrable due to its Wigner-Dyson level-spacing statistics in the random matrix theory, exhibits a surprising superdiffusive-ballistic-superdiffusive transport transition by varying the power-law exponent of couplings for a fixed anisotropy. Our findings are verified by multiple observables, including the spin-spin autocorrelator, mean-square displacement, and spin conductivity. Interestingly, we further quantify the degree of quantum chaos using the Kullback-Leibler divergence between the entanglement entropy distributions of the model's eigenstates and a random state. Remarkably, an observed local maximum in the divergence near the transition boundary suggests a link between anomalous hydrodynamics and a suppression of quantum chaos. This work offers another deep understanding of emergent anomalous transport phenomena in a wider range of non-integrable quantum many-body systems.

Submitted 26 March, 2024; originally announced March 2024.

Comments: 12 pages, 10 figures

arXiv:2403.17560 [pdf, other]

Anomalous shift in Andreev reflection from side incidence

Authors: Runze Li, Chaoxi Cui, Ying Liu, Zhi-Ming Yu, Shengyuan A. Yang

Abstract: Andreev reflection at a normal-superconductor interface may be accompanied by an anomalous spatial shift. The studies so far are limited to the top incidence configuration. Here, we investigate this effect in the side incidence configuration, with the interface parallel to the principal axis of the superconductor. We find that the shift exhibits rich behaviors reflecting the character of the pair potential. It has two contributions: one from the $k$-dependent phase of the pair potential, and the other from the evanescent mode. For chiral $p$-wave pairing, the pairing phase contribution is proportional to the chirality of pairing and is independent of excitation energy, whereas the evanescent mode contribution is independent of chirality and is nonzero only for excitation energy below the gap. The two contributions also have opposite parity with respect to the incident angle. For $d_{x^{2}-y^{2}}$-wave pairing, only the evanescent mode contribution exists, and the shift exhibits suppressed zones in incident angles, manifesting the superconducting nodes. The dependence of the shift on other factors, such as the angle of the incident plane and Fermi surface anisotropy, is discussed.

Submitted 26 March, 2024; originally announced March 2024.

Showing 1–50 of 612 results for author: Yang, A