Skip to main content

Showing 1–50 of 304 results for author: He, G

.
  1. Not All Videos Become Outdated: Short-Video Recommendation by Learning to Deconfound Release Interval Bias

    Authors: Lulu Dong, Guoxiu He, Aixin Sun

    Abstract: Short-video recommender systems often exhibit a biased preference to recently released videos. However, not all videos become outdated; certain classic videos can still attract user's attention. Such bias along temporal dimension can be further aggravated by the matching model between users and videos, because the model learns from preexisting interactions. From real data, we observe that differen… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

    Journal ref: RecSys 2024

  2. arXiv:2408.09113  [pdf, other

    math.OC eess.SY

    Planning of Off-Grid Renewable Power to Ammonia Systems with Heterogeneous Flexibility: A Multistakeholder Equilibrium Perspective

    Authors: Yangjun Zeng, Yiwei Qiu, Jie Zhu, Shi Chen, Tianlei Zang, Buxiang Zhou, Ge He, Xu Ji

    Abstract: Off-grid renewable power to ammonia (ReP2A) systems present a promising pathway toward carbon neutrality in both the energy and chemical industries. However, due to chemical safety requirements, the limited flexibility of ammonia synthesis poses a challenge when attempting to align with the variable hydrogen flow produced from renewable power. This necessitates the optimal sizing of equipment capa… ▽ More

    Submitted 17 August, 2024; originally announced August 2024.

  3. arXiv:2408.08506  [pdf, other

    cs.CL cs.AI

    Ex3: Automatic Novel Writing by Extracting, Excelsior and Expanding

    Authors: Huang Lei, Jiaming Guo, Guanhua He, Xishan Zhang, Rui Zhang, Shaohui Peng, Shaoli Liu, Tianshi Chen

    Abstract: Generating long-term texts such as novels using artificial intelligence has always been a challenge. A common approach is to use large language models (LLMs) to construct a hierarchical framework that first plans and then writes. Despite the fact that the generated novels reach a sufficient length, they exhibit poor logical coherence and appeal in their plots and deficiencies in character and even… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

  4. arXiv:2408.01607  [pdf

    cs.CV cs.LG

    Deep Learning Meets OBIA: Tasks, Challenges, Strategies, and Perspectives

    Authors: Lei Ma, Ziyun Yan, Mengmeng Li, Tao Liu, Liqin Tan, Xuan Wang, Weiqiang He, Ruikun Wang, Guangjun He, Heng Lu, Thomas Blaschke

    Abstract: Deep learning has gained significant attention in remote sensing, especially in pixel- or patch-level applications. Despite initial attempts to integrate deep learning into object-based image analysis (OBIA), its full potential remains largely unexplored. In this article, as OBIA usage becomes more widespread, we conducted a comprehensive review and expansion of its task subdomains, with or withou… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

  5. arXiv:2407.19453  [pdf, other

    cs.CV

    FIND: Fine-tuning Initial Noise Distribution with Policy Optimization for Diffusion Models

    Authors: Changgu Chen, Libing Yang, Xiaoyan Yang, Lianggangxu Chen, Gaoqi He, CHangbo Wang, Yang Li

    Abstract: In recent years, large-scale pre-trained diffusion models have demonstrated their outstanding capabilities in image and video generation tasks. However, existing models tend to produce visual objects commonly found in the training dataset, which diverges from user input prompts. The underlying reason behind the inaccurate generated results lies in the model's difficulty in sampling from specific i… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

  6. arXiv:2407.13205  [pdf, ps, other

    cs.CL

    Transformer-based Single-Cell Language Model: A Survey

    Authors: Wei Lan, Guohang He, Mingyang Liu, Qingfeng Chen, Junyue Cao, Wei Peng

    Abstract: The transformers have achieved significant accomplishments in the natural language processing as its outstanding parallel processing capabilities and highly flexible attention mechanism. In addition, increasing studies based on transformers have been proposed to model single-cell data. In this review, we attempt to systematically summarize the single-cell language models and applications based on… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  7. arXiv:2407.12678  [pdf, other

    eess.IV

    Promptable Counterfactual Diffusion Model for Unified Brain Tumor Segmentation and Generation with MRIs

    Authors: Yiqing Shen, Guannan He, Mathias Unberath

    Abstract: Brain tumor analysis in Magnetic Resonance Imaging (MRI) is crucial for accurate diagnosis and treatment planning. However, the task remains challenging due to the complexity and variability of tumor appearances, as well as the scarcity of labeled data. Traditional approaches often address tumor segmentation and image generation separately, limiting their effectiveness in capturing the intricate r… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

  8. arXiv:2407.11292  [pdf

    cs.CV

    LoRA-PT: Low-Rank Adapting UNETR for Hippocampus Segmentation Using Principal Tensor Singular Values and Vectors

    Authors: Guanghua He, Wangang Cheng, Hancan Zhu, Gaohang Yu

    Abstract: The hippocampus is a crucial brain structure associated with various psychiatric disorders, and its automatic and precise segmentation is essential for studying these diseases. In recent years, deep learning-based methods have made significant progress in hippocampus segmentation. However, training deep neural network models requires substantial computational resources and time, as well as a large… ▽ More

    Submitted 18 July, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

  9. arXiv:2407.05587  [pdf, other

    cs.RO

    Flying Calligrapher: Contact-Aware Motion and Force Planning and Control for Aerial Manipulation

    Authors: Xiaofeng Guo, Guanqi He, Jiahe Xu, Mohammadreza Mousaei, Junyi Geng, Sebastian Scherer, Guanya Shi

    Abstract: Aerial manipulation has gained interest in completing high-altitude tasks that are challenging for human workers, such as contact inspection and defect detection, etc. Previous research has focused on maintaining static contact points or forces. This letter addresses a more general and dynamic task: simultaneously tracking time-varying contact force in the surface normal direction and motion traje… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 8 pages, 9 figures, 1 table

  10. arXiv:2407.04202  [pdf, other

    q-bio.NC

    Reverse Engineering the Fly Brain Using FlyCircuit Database

    Authors: Yu-Tai Ching, Chin-Ping Cho, Fu-Kai Tang, Yi-Chiun Chang, Chang-Chieh Cheng, Guan-Wei He, Ann-Shyn Chang, Chaochun Chuang

    Abstract: A method to reverse engineering of a fly brain using the {\it FlyCircuit} database is presented. This method was designed based on the assumption that similar neurons could serve identical functions. We thus cluster the neurons based on the similarity between neurons. The procedures are to partition the neurons in the database into groups, and then assemble the groups into potential modules. Some… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  11. arXiv:2406.16622  [pdf, other

    quant-ph physics.optics

    Simultaneous Generation of Quantum Frequency Combs across Distinct Modal Families in a Single $Si_3 N_4$ Whispering Gallery Mode Resonator

    Authors: Bo Ji, Nianqin Li, Guangqiang He

    Abstract: Quantum frequency combs (QFCs) are versatile resources for multi-mode entanglement, such as cluster states, crucial for quantum communication and computation. On-chip whispering gallery mode resonators (WGMRs) can generate these states at ultra-low threshold power. In this paper, we demonstrate the simultaneous generation of three QFCs using a single on-chip $Si_3N_4$ WGMR across distinct modal fa… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 14 pages, 8 figures

  12. arXiv:2406.11857  [pdf, other

    cs.CY cs.AI

    AI Royalties -- an IP Framework to Compensate Artists & IP Holders for AI-Generated Content

    Authors: Pablo Ducru, Jonathan Raiman, Ronaldo Lemos, Clay Garner, George He, Hanna Balcha, Gabriel Souto, Sergio Branco, Celina Bottino

    Abstract: This article investigates how AI-generated content can disrupt central revenue streams of the creative industries, in particular the collection of dividends from intellectual property (IP) rights. It reviews the IP and copyright questions related to the input and output of generative AI systems. A systematic method is proposed to assess whether AI-generated outputs, especially images, infringe pre… ▽ More

    Submitted 5 April, 2024; originally announced June 2024.

    Comments: 7 pages, 2 figures, submitted to AAAI

  13. arXiv:2406.06626  [pdf, other

    cs.LG cs.AI cs.HC eess.SP

    Benchmarking Neural Decoding Backbones towards Enhanced On-edge iBCI Applications

    Authors: Zhou Zhou, Guohang He, Zheng Zhang, Luziwei Leng, Qinghai Guo, Jianxing Liao, Xuan Song, Ran Cheng

    Abstract: Traditional invasive Brain-Computer Interfaces (iBCIs) typically depend on neural decoding processes conducted on workstations within laboratory settings, which prevents their everyday usage. Implementing these decoding processes on edge devices, such as the wearables, introduces considerable challenges related to computational demands, processing speed, and maintaining accuracy. This study seeks… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  14. arXiv:2405.15885  [pdf, other

    cs.LG stat.ML

    Diffusion Bridge Implicit Models

    Authors: Kaiwen Zheng, Guande He, Jianfei Chen, Fan Bao, Jun Zhu

    Abstract: Denoising diffusion bridge models (DDBMs) are a powerful variant of diffusion models for interpolating between two arbitrary paired distributions given as endpoints. Despite their promising performance in tasks like image translation, DDBMs require a computationally intensive sampling process that involves the simulation of a (stochastic) differential equation through hundreds of network evaluatio… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  15. arXiv:2405.06083  [pdf, other

    cond-mat.mes-hall quant-ph

    Exploring Entanglement Spectrum and Phase Diagram in multi-electron Quantum Dot Chains

    Authors: Guanjie He, Xin Wang

    Abstract: We investigate the entanglement properties in semiconductor quantum dot systems modeled by extended Hubbard model, focusing on the impact of potential energy variations and electron interactions within a four-site quantum dot spin chain. Our study explores local and pairwise entanglement across configurations with electron counts N=4 and N=6, under different potential energy settings. By adjusting… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 14 pages, 9 figures

  16. arXiv:2405.04233  [pdf, other

    cs.CV cs.LG

    Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models

    Authors: Fan Bao, Chendong Xiang, Gang Yue, Guande He, Hongzhou Zhu, Kaiwen Zheng, Min Zhao, Shilong Liu, Yaole Wang, Jun Zhu

    Abstract: We introduce Vidu, a high-performance text-to-video generator that is capable of producing 1080p videos up to 16 seconds in a single generation. Vidu is a diffusion model with U-ViT as its backbone, which unlocks the scalability and the capability for handling long videos. Vidu exhibits strong coherence and dynamism, and is capable of generating both realistic and imaginative videos, as well as un… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Project page at https://www.shengshu-ai.com/vidu

  17. arXiv:2404.15493  [pdf, other

    quant-ph cond-mat.mes-hall

    Temperature dependent spin-phonon coupling of boron-vacancy centers in hexagonal boron nitride

    Authors: Zhongyuan Liu, Ruotian Gong, Benchen Huang, Yu Jin, Xinyi Du, Guanghui He, Eli Janzen, Li Yang, Erik Henriksen, James Edgar, Giulia Galli, Chong Zu

    Abstract: The negatively charged boron-vacancy center ($\mathrm{V}_{\mathrm{B}}^-$) in hexagonal boron nitride (hBN) has recently emerged as a highly promising quantum sensor. Compared to the nitrogen-vacancy (NV) center in diamond, the change with temperature of the spin transition energy of $\mathrm{V}_{\mathrm{B}}^-$ is more than an order of magnitude larger, making it a potential nanoscale thermometer w… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 7 pages, 4 figures and 1 table in main. 9 pages and 5 figures in supplementary

  18. arXiv:2404.13640  [pdf, other

    cs.MM cs.CV eess.IV

    Beyond Alignment: Blind Video Face Restoration via Parsing-Guided Temporal-Coherent Transformer

    Authors: Kepeng Xu, Li Xu, Gang He, Wenxin Yu, Yunsong Li

    Abstract: Multiple complex degradations are coupled in low-quality video faces in the real world. Therefore, blind video face restoration is a highly challenging ill-posed problem, requiring not only hallucinating high-fidelity details but also enhancing temporal coherence across diverse pose variations. Restoring each frame independently in a naive manner inevitably introduces temporal incoherence and arti… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 9 pages

  19. arXiv:2404.07577  [pdf, other

    cs.LG eess.SP

    Generating Comprehensive Lithium Battery Charging Data with Generative AI

    Authors: Lidang Jiang, Changyan Hu, Sibei Ji, Hang Zhao, Junxiong Chen, Ge He

    Abstract: In optimizing performance and extending the lifespan of lithium batteries, accurate state prediction is pivotal. Traditional regression and classification methods have achieved some success in battery state prediction. However, the efficacy of these data-driven approaches heavily relies on the availability and quality of public datasets. Additionally, generating electrochemical data predominantly… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  20. arXiv:2404.03079  [pdf, other

    cs.DC

    vPALs: Towards Verified Performance-aware Learning System For Resource Management

    Authors: Guoliang He, Gingfung Yeung, Sheriffo Ceesay, Adam Barker

    Abstract: Accurately predicting task performance at runtime in a cluster is advantageous for a resource management system to determine whether a task should be migrated due to performance degradation caused by interference. This is beneficial for both cluster operators and service owners. However, deploying performance prediction systems with learning methods requires sophisticated safeguard mechanisms due… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: presented at Deployable AI Workshop at AAAI-2024

  21. arXiv:2404.00672  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    A General and Efficient Training for Transformer via Token Expansion

    Authors: Wenxuan Huang, Yunhang Shen, Jiao Xie, Baochang Zhang, Gaoqi He, Ke Li, Xing Sun, Shaohui Lin

    Abstract: The remarkable performance of Vision Transformers (ViTs) typically requires an extremely large training cost. Existing methods have attempted to accelerate the training of ViTs, yet typically disregard method universality with accuracy dropping. Meanwhile, they break the training consistency of the original transformers, including the consistency of hyper-parameters, architecture, and strategy, wh… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024. Code is available at https://github.com/Osilly/TokenExpansion

  22. arXiv:2403.17842  [pdf, other

    quant-ph cond-mat.str-el

    Experimental Realization of Discrete Time Quasi-Crystals

    Authors: Guanghui He, Bingtian Ye, Ruotian Gong, Changyu Yao, Zhongyuan Liu, Kater W. Murch, Norman Y. Yao, Chong Zu

    Abstract: Floquet (periodically driven) systems can give rise to unique non-equilibrium phases of matter without equilibrium analogs. The most prominent example is the realization of discrete time crystals. An intriguing question emerges: what other novel phases can manifest when the constraint of time periodicity is relaxed? In this study, we explore quantum systems subjected to a quasi-periodic drive. Lev… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 7+5 pages, 4+5 figures

  23. arXiv:2403.16863  [pdf, other

    cs.AR cs.AI

    SIP: Autotuning GPU Native Schedules via Stochastic Instruction Perturbation

    Authors: Guoliang He, Eiko Yoneki

    Abstract: Large language models (LLMs) have become a significant workload since their appearance. However, they are also computationally expensive as they have billions of parameters and are trained with massive amounts of data. Thus, recent works have developed dedicated CUDA kernels for LLM training and inference instead of relying on compilergenerated ones, so that hardware resources are as fully utilize… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: EuroMLSys 24, April 22, 2024, Athens, Greece

  24. arXiv:2403.12707  [pdf, other

    cs.CV

    Selective Domain-Invariant Feature for Generalizable Deepfake Detection

    Authors: Yingxin Lai, Guoqing Yang Yifan He, Zhiming Luo, Shaozi Li

    Abstract: With diverse presentation forgery methods emerging continually, detecting the authenticity of images has drawn growing attention. Although existing methods have achieved impressive accuracy in training dataset detection, they still perform poorly in the unseen domain and suffer from forgery of irrelevant information such as background and identity, affecting generalizability. To solve this problem… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Accepted by ICASSP 2024

  25. arXiv:2403.10913  [pdf, other

    cs.AR

    DEFA: Efficient Deformable Attention Acceleration via Pruning-Assisted Grid-Sampling and Multi-Scale Parallel Processing

    Authors: Yansong Xu, Dongxu Lyu, Zhenyu Li, Zilong Wang, Yuzhou Chen, Gang Wang, Zhican Wang, Haomin Li, Guanghui He

    Abstract: Multi-scale deformable attention (MSDeformAttn) has emerged as a key mechanism in various vision tasks, demonstrating explicit superiority attributed to multi-scale grid-sampling. However, this newly introduced operator incurs irregular data access and enormous memory requirement, leading to severe PE underutilization. Meanwhile, existing approaches for attention acceleration cannot be directly ap… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: Accepted to DAC 2024

  26. arXiv:2402.16051  [pdf, other

    hep-ph

    Two-body hadronic weak decays of bottomed hadrons

    Authors: Ying Zhang, Guangzhao He, Quanxing Ye, Da-Cheng Yan, Jun Hua, Qian Wang

    Abstract: The structure of light diquarks plays a crucial role in the formation of exotic hadrons beyond the conventional quark model, especially in their line shapes of bottomed hadron decays. We study the two-body hadronic weak decays of bottomed baryons and bottomed mesons to probe the light diquark structure and pin down the quark-quark correlations in the diquark picture. We find that the light diquark… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: accepted by Chinese Physics Letter

  27. arXiv:2402.15368  [pdf, other

    cs.RO cs.AI

    Safe Task Planning for Language-Instructed Multi-Robot Systems using Conformal Prediction

    Authors: Jun Wang, Guocheng He, Yiannis Kantaros

    Abstract: This paper addresses task planning problems for language-instructed robot teams. Tasks are expressed in natural language (NL), requiring the robots to apply their capabilities at various locations and semantic objects. Several recent works have addressed similar planning problems by leveraging pre-trained Large Language Models (LLMs) to design effective multi-robot plans. However, these approaches… ▽ More

    Submitted 24 June, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  28. arXiv:2402.14282  [pdf, other

    stat.ME

    Extention of Bagging MARS with Group LASSO for Heterogeneous Treatment Effect Estimation

    Authors: Guanwenqing He, Ke Wan, Kazushi Maruo, Toshio Shimokawa

    Abstract: Recent years, large scale clinical data like patient surveys and medical record data are playing an increasing role in medical data science. These large-scale clinical data, collectively referred to as "real-world data (RWD)". It is expected to be widely used in large-scale observational studies of specific diseases, personal medicine or precise medicine, finding the responder of drugs or treatmen… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 19 pages, 9 figures

  29. arXiv:2402.08874  [pdf, other

    cs.CL

    Tree-Based Hard Attention with Self-Motivation for Large Language Models

    Authors: Chenxi Lin, Jiayu Ren, Guoxiu He, Zhuoren Jiang, Haiyan Yu, Xiaomin Zhu

    Abstract: While large language models (LLMs) excel at understanding and generating plain text, they are not specifically tailored to handle hierarchical text structures. Extracting the task-desired property from their natural language responses typically necessitates additional processing steps. In fact, selectively comprehending the hierarchical structure of large-scale text is pivotal to understanding its… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  30. arXiv:2402.05369  [pdf, other

    cs.LG cs.CL

    Noise Contrastive Alignment of Language Models with Explicit Rewards

    Authors: Huayu Chen, Guande He, Lifan Yuan, Ganqu Cui, Hang Su, Jun Zhu

    Abstract: User intentions are typically formalized as evaluation rewards to be maximized when fine-tuning language models (LMs). Existing alignment methods, such as Direct Preference Optimization (DPO), are mainly tailored for pairwise preference data where rewards are implicitly defined rather than explicitly given. In this paper, we introduce a general framework for LM alignment, leveraging Noise Contrast… ▽ More

    Submitted 3 July, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  31. arXiv:2402.03708  [pdf, other

    cs.CV

    SISP: A Benchmark Dataset for Fine-grained Ship Instance Segmentation in Panchromatic Satellite Images

    Authors: Pengming Feng, Mingjie Xie, Hongning Liu, Xuanjia Zhao, Guangjun He, Xueliang Zhang, Jian Guan

    Abstract: Fine-grained ship instance segmentation in satellite images holds considerable significance for monitoring maritime activities at sea. However, existing datasets often suffer from the scarcity of fine-grained information or pixel-wise localization annotations, as well as the insufficient image diversity and variations, thus limiting the research of this task. To this end, we propose a benchmark da… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 14 pages, 9 figures

  32. arXiv:2402.02170  [pdf, ps, other

    gr-qc

    Gravitational losses for the binary systems induced by the next-to-leading spin-orbit coupling effects

    Authors: Hao Zhang, Wei Gao, Guansheng He, Siming Liu, Huanyu Jia, Wenbin Lin

    Abstract: The orbital energy and momentum of the compact binary systems will loss due to gravitational radiation. Based on the mass and mass-current multipole moments of the binary system with the spin vector defined by Bohé et al. [Class. Quantum Grav. 30, 075017 (2013)], we calculate the loss rates of energy, angular and linear momentum induced by the next-to-leading spin-orbit effects. For the case of ci… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: 18 pages

  33. arXiv:2402.01548  [pdf, other

    gr-qc

    Gravitational lensing of massive particles by a black-bounce-Schwarzschild black hole

    Authors: Guansheng He, Yi Xie, Chunhua Jiang, Wenbin Lin

    Abstract: We investigate in detail the weak-field gravitational lensing of a relativistic neutral massive particle induced by a regular black-bounce-Schwarzschild black hole proposed by Simpson and Visser. Starting with the calculation of the gravitational deflection of the massive particle up to the third post-Minkowskian order, the Virbhadra-Ellis lens equation is solved perturbatively beyond the weak-def… ▽ More

    Submitted 21 July, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted for publication in PRD

  34. arXiv:2401.17583  [pdf, other

    cs.RO cs.AI cs.CV cs.LG eess.SY

    Agile But Safe: Learning Collision-Free High-Speed Legged Locomotion

    Authors: Tairan He, Chong Zhang, Wenli Xiao, Guanqi He, Changliu Liu, Guanya Shi

    Abstract: Legged robots navigating cluttered environments must be jointly agile for efficient task execution and safe to avoid collisions with obstacles or humans. Existing studies either develop conservative controllers (< 1.0 m/s) to ensure safety, or focus on agility without considering potentially fatal collisions. This paper introduces Agile But Safe (ABS), a learning-based control framework that enabl… ▽ More

    Submitted 21 May, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: Published at RSS 2024, Project website: https://agile-but-safe.github.io/

  35. Flexible Parallel Neural Network Architecture Model for Early Prediction of Lithium Battery Life

    Authors: Lidang Jiang, Zhuoxiang Li, Changyan Hu, Qingsong Huang, Ge He

    Abstract: The early prediction of battery life (EPBL) is vital for enhancing the efficiency and extending the lifespan of lithium batteries. Traditional models with fixed architectures often encounter underfitting or overfitting issues due to the diverse data distributions in different EPBL tasks. An interpretable deep learning model of flexible parallel neural network (FPNN) is proposed, which includes an… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  36. arXiv:2401.14781  [pdf, other

    cond-mat.mtrl-sci

    Simulated TEM imaging of a heavily irradiated metal

    Authors: D. R. Mason, M. Boleininger, J. Haley, E. Prestat, G. He, F. Hofmann, S. L. Dudarev

    Abstract: We recast the Howie-Whelan equations for generating simulated transmission electron microscope (TEM) images, replacing the dependence on local atomic displacements with atomic positions only. This allows very rapid computation of simulated TEM images for arbitrarily complex atomistic configurations of lattice defects and dislocations in the dynamical two beam approximation. Large scale massively-o… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

  37. arXiv:2401.14734  [pdf, other

    cond-mat.str-el

    Anomalous electron-phonon coupling in kagome ferromagnetic Weyl semimetal Co$_3$Sn$_2$S$_2$

    Authors: G. He, M. Kute, Z. C. Xu, L. Peis, R. Stumberger, A. Baum, D. Jost, E. M. Been, B. Moritz, Y. G. Shi, T. P. Devereaux, R. Hackl

    Abstract: We present results of a Raman scattering study of the Kagome ferromagnet Co$_3$Sn$_2$S$_2$, with a focus on electronic and phononic excitations and their interplay. In addition, the electronic band structure is analyzed theoretically, enabling a semi-quantitative explanation of the spectra. A prominent feature in the electronic spectra is a redistribution of spectral weight from low to high energi… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 9 pages, 4 figures

  38. arXiv:2401.10150  [pdf, other

    cs.CV

    Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation

    Authors: Changgu Chen, Junwei Shu, Lianggangxu Chen, Gaoqi He, Changbo Wang, Yang Li

    Abstract: Recent large-scale pre-trained diffusion models have demonstrated a powerful generative ability to produce high-quality videos from detailed text descriptions. However, exerting control over the motion of objects in videos generated by any video diffusion model is a challenging problem. In this paper, we propose a novel zero-shot moving object trajectory control framework, Motion-Zero, to enable a… ▽ More

    Submitted 21 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: Preprint

  39. arXiv:2401.07369  [pdf, other

    cs.LG cs.RO

    CoVO-MPC: Theoretical Analysis of Sampling-based MPC and Optimal Covariance Design

    Authors: Zeji Yi, Chaoyi Pan, Guanqi He, Guannan Qu, Guanya Shi

    Abstract: Sampling-based Model Predictive Control (MPC) has been a practical and effective approach in many domains, notably model-based reinforcement learning, thanks to its flexibility and parallelizability. Despite its appealing empirical performance, the theoretical understanding, particularly in terms of convergence analysis and hyperparameter tuning, remains absent. In this paper, we characterize the… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

    Comments: 32 pages, 4 figures

  40. arXiv:2401.06418  [pdf, other

    physics.optics quant-ph

    Manipulating multiple optical parametric processes in photonic topological insulators

    Authors: Zhen Jiang, Bo Ji, Yanghe Chen, Chun Jiang, Guangqiang He

    Abstract: Topological quantum optics, an emerging area of study, holds the potential to bring about substantial enhancements for integrated quantum devices. Here we propose integrated topological quantum devices performing various functions including optical parametric amplification, frequency division, and frequency entangled biphoton generation. We show two distinct edge modes corresponding to different f… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: 18pages, 12 figures

  41. arXiv:2401.02797  [pdf, other

    cs.CL cs.AI

    PeFoMed: Parameter Efficient Fine-tuning of Multimodal Large Language Models for Medical Imaging

    Authors: Gang Liu, Jinlong He, Pengfei Li, Genrong He, Zhaolin Chen, Shenjun Zhong

    Abstract: Multimodal large language models (MLLMs) represent an evolutionary expansion in the capabilities of traditional large language models, enabling them to tackle challenges that surpass the scope of purely text-based applications. It leverages the knowledge previously encoded within these language models, thereby enhancing their applicability and functionality in the reign of multimodal contexts. Rec… ▽ More

    Submitted 16 April, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: 12 pages, 8 figures, 12 tables

  42. arXiv:2312.09556  [pdf, other

    physics.optics physics.app-ph

    Optical Ranging Using Coherent Kerr Soliton Dual-microcombs with Extended Ambiguity Distance

    Authors: Yuechen Yang, Yang Shen, Kailu Zhou, Chenhua Hu, Yuanzhuo Ding, Tinghao Jiang, Wei Li, Yudong Li, Liangsen Feng, Tengfei Wu, Guangqiang He

    Abstract: Optical ranging is a key technology in metrology. Optical frequency combs are shown to provide several advantages in light ranging, offering high precision with high acquisition rate. However, performance of traditional ranging systems based on microcombs is limited by the short ambiguity distance and non-real-time processing. Here, we show that dual-comb ranging system using coherent Kerr soliton… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 9 pages, 5 figures

  43. arXiv:2312.08200  [pdf, other

    cs.LG stat.ML

    SPD-DDPM: Denoising Diffusion Probabilistic Models in the Symmetric Positive Definite Space

    Authors: Yunchen Li, Zhou Yu, Gaoqi He, Yunhang Shen, Ke Li, Xing Sun, Shaohui Lin

    Abstract: Symmetric positive definite~(SPD) matrices have shown important value and applications in statistics and machine learning, such as FMRI analysis and traffic prediction. Previous works on SPD matrices mostly focus on discriminative models, where predictions are made directly on $E(X|y)$, where $y$ is a vector and $X$ is an SPD matrix. However, these methods are challenging to handle for large-scale… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: AAAI2024

  44. EasyVolcap: Accelerating Neural Volumetric Video Research

    Authors: Zhen Xu, Tao Xie, Sida Peng, Haotong Lin, Qing Shuai, Zhiyuan Yu, Guangzhao He, Jiaming Sun, Hujun Bao, Xiaowei Zhou

    Abstract: Volumetric video is a technology that digitally records dynamic events such as artistic performances, sporting events, and remote conversations. When acquired, such volumography can be viewed from any viewpoint and timestamp on flat screens, 3D displays, or VR headsets, enabling immersive viewing experiences and more flexible content creation in a variety of applications such as sports broadcastin… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: SIGGRAPH Asia 2023 Technical Communications. Source code: https://github.com/zju3dv/EasyVolcap

  45. arXiv:2312.06550  [pdf, other

    cs.CL cs.AI cs.LG

    LLM360: Towards Fully Transparent Open-Source LLMs

    Authors: Zhengzhong Liu, Aurick Qiao, Willie Neiswanger, Hongyi Wang, Bowen Tan, Tianhua Tao, Junbo Li, Yuqi Wang, Suqi Sun, Omkar Pangarkar, Richard Fan, Yi Gu, Victor Miller, Yonghao Zhuang, Guowei He, Haonan Li, Fajri Koto, Liping Tang, Nikhil Ranjan, Zhiqiang Shen, Xuguang Ren, Roberto Iriondo, Cun Mu, Zhiting Hu, Mark Schulze , et al. (3 additional authors not shown)

    Abstract: The recent surge in open-source Large Language Models (LLMs), such as LLaMA, Falcon, and Mistral, provides diverse options for AI practitioners and researchers. However, most LLMs have only released partial artifacts, such as the final model weights or inference code, and technical reports increasingly limit their scope to high-level design choices and surface statistics. These choices hinder prog… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  46. arXiv:2312.03491  [pdf, other

    cs.LG cs.SD eess.AS

    Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis

    Authors: Zehua Chen, Guande He, Kaiwen Zheng, Xu Tan, Jun Zhu

    Abstract: In text-to-speech (TTS) synthesis, diffusion models have achieved promising generation quality. However, because of the pre-defined data-to-noise diffusion process, their prior distribution is restricted to a noisy representation, which provides little information of the generation target. In this work, we present a novel TTS system, Bridge-TTS, making the first attempt to substitute the noisy Gau… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  47. Exchanging Dual Encoder-Decoder: A New Strategy for Change Detection with Semantic Guidance and Spatial Localization

    Authors: Sijie Zhao, Xueliang Zhang, Pengfeng Xiao, Guangjun He

    Abstract: Change detection is a critical task in earth observation applications. Recently, deep learning-based methods have shown promising performance and are quickly adopted in change detection. However, the widely used multiple encoder and single decoder (MESD) as well as dual encoder-decoder (DED) architectures still struggle to effectively handle change detection well. The former has problems of bitemp… ▽ More

    Submitted 19 November, 2023; originally announced November 2023.

    Journal ref: IEEE Transactions on Geoscience and Remote Sensing, vol. 61, pp. 1-16, 2023, Art no. 4508016

  48. arXiv:2311.09262  [pdf, other

    cs.SI cs.AI

    Disentangling the Potential Impacts of Papers into Diffusion, Conformity, and Contribution Values

    Authors: Zhikai Xue, Guoxiu He, Zhuoren Jiang, Sichen Gu, Yangyang Kang, Star Zhao, Wei Lu

    Abstract: The potential impact of an academic paper is determined by various factors, including its popularity and contribution. Existing models usually estimate original citation counts based on static graphs and fail to differentiate values from nuanced perspectives. In this study, we propose a novel graph neural network to Disentangle the Potential impacts of Papers into Diffusion, Conformity, and Contri… ▽ More

    Submitted 21 May, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: Update and correct some references. This paper is still in progress

  49. arXiv:2310.15629  [pdf, other

    physics.optics quant-ph

    On-chip topological transport of optical frequency combs in silicon-based valley photonic crystals

    Authors: Zhen Jiang, Hongwei Wang, Yuechen Yang, Yang Shen, Bo Ji, Yanghe Chen, Yong Zhang, Lu Sun, Zheng Wang, Chun Jiang, Yikai Su, Guangqiang He

    Abstract: The generation and control of optical frequency combs in integrated photonic systems enables complex, high-controllable, and large-scale devices. In parallel, harnessing topological physics in multipartite systems has allowed them with compelling features such as robustness against fabrication imperfections. Here we experimentally demonstrate on-chip topological transport for optical frequency com… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 20 pages,12 figures

  50. arXiv:2310.14718  [pdf, other

    cs.CV

    Rethinking Scale Imbalance in Semi-supervised Object Detection for Aerial Images

    Authors: Ruixiang Zhang, Chang Xu, Fang Xu, Wen Yang, Guangjun He, Huai Yu, Gui-Song Xia

    Abstract: This paper focuses on the scale imbalance problem of semi-supervised object detection(SSOD) in aerial images. Compared to natural images, objects in aerial images show smaller sizes and larger quantities per image, increasing the difficulty of manual annotation. Meanwhile, the advanced SSOD technique can train superior detectors by leveraging limited labeled data and massive unlabeled data, saving… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.