
Showing 1–50 of 413 results for author: Qian, C

  1. arXiv:2408.13529  [pdf, other]

    cs.RO

    Effects of fiber number and density on fiber jamming: Towards follow-the-leader deployment of a continuum robot

    Authors: Chen Qian, Tangyou Liu, Liao Wu

    Abstract: Fiber jamming modules (FJMs) offer flexibility and quick stiffness variation, making them suitable for follow-the-leader (FTL) motions in continuum robots, which is ideal for minimally invasive surgery (MIS). However, their potential has not been fully exploited, particularly in designing and manufacturing small-sized FJMs with high stiffness variation. Although existing research has focused on fa…

    Submitted 24 August, 2024; originally announced August 2024.

    Comments: 6 pages, 6 figures, accepted by IROS2024

  2. arXiv:2408.11799  [pdf, other]

    cs.CL

    Practical token pruning for foundation models in few-shot conversational virtual assistant systems

    Authors: Haode Qi, Cheng Qian, Jian Ni, Pratyush Singh, Reza Fazeli, Gengyu Wang, Zhongzheng Shu, Eric Wayne, Juergen Bross

    Abstract: In an enterprise Virtual Assistant (VA) system, intent classification is the crucial component that determines how a user input is handled based on what the user wants. The VA system is expected to be a cost-efficient SaaS service with low training and inference time while achieving high accuracy even with a small number of training samples. We pretrain a transformer-based sentence embedding model…

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: 6 pages, 3 figures

  3. arXiv:2408.03703  [pdf, other]

    cs.CV

    CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications

    Authors: Tianfang Zhang, Lei Li, Yang Zhou, Wentao Liu, Chen Qian, Xiangyang Ji

    Abstract: Vision Transformers (ViTs) mark a revolutionary advance in neural networks with their token mixer's powerful global context capability. However, the pairwise token affinity and complex matrix operations limit its deployment on resource-constrained scenarios and real-time applications, such as mobile devices, although considerable efforts have been made in previous works. In this paper, we introduc…

    Submitted 7 August, 2024; originally announced August 2024.

  4. arXiv:2408.01916  [pdf, other]

    cs.AI cs.SE

    MAO: A Framework for Process Model Generation with Multi-Agent Orchestration

    Authors: Leilei Lin, Yumeng Jin, Yingming Zhou, Wenlong Chen, Chen Qian

    Abstract: Process models are frequently used in software engineering to describe business requirements, guide software testing and control system improvement. However, traditional process modeling methods often require the participation of numerous experts, which is expensive and time-consuming. Therefore, the exploration of a more efficient and cost-effective automated modeling method has emerged as a foca…

    Submitted 7 August, 2024; v1 submitted 3 August, 2024; originally announced August 2024.

  5. arXiv:2407.18178  [pdf, other]

    cs.CV cs.AI cs.RO

    PianoMime: Learning a Generalist, Dexterous Piano Player from Internet Demonstrations

    Authors: Cheng Qian, Julen Urain, Kevin Zakka, Jan Peters

    Abstract: In this work, we introduce PianoMime, a framework for training a piano-playing agent using internet demonstrations. The internet is a promising source of large-scale demonstrations for training our robot agents. In particular, for the case of piano-playing, YouTube is full of videos of professional pianists playing a wide myriad of songs. In our work, we leverage these demonstrations to learn a ge…

    Submitted 25 July, 2024; originally announced July 2024.

  6. arXiv:2407.12344  [pdf, other]

    cs.CL cs.CY

    The Better Angels of Machine Personality: How Personality Relates to LLM Safety

    Authors: Jie Zhang, Dongrui Liu, Chen Qian, Ziyue Gan, Yong Liu, Yu Qiao, Jing Shao

    Abstract: Personality psychologists have analyzed the relationship between personality and safety behaviors in human society. Although Large Language Models (LLMs) demonstrate personality traits, the relationship between personality traits and safety abilities in LLMs still remains a mystery. In this paper, we discover that LLMs' personality traits are closely related to their safety abilities, i.e., toxici…

    Submitted 17 July, 2024; originally announced July 2024.

  7. arXiv:2407.12027  [pdf, ps, other]

    cs.AR cs.AI

    Idle is the New Sleep: Configuration-Aware Alternative to Powering Off FPGA-Based DL Accelerators During Inactivity

    Authors: Chao Qian, Christopher Cichiwskyj, Tianheng Ling, Gregor Schiele

    Abstract: In the rapidly evolving Internet of Things (IoT) domain, we concentrate on enhancing energy efficiency in Deep Learning accelerators on FPGA-based heterogeneous platforms, aligning with the principles of sustainable computing. Instead of focusing on the inference phase, we introduce innovative optimizations to minimize the overhead of the FPGA configuration phase. By fine-tuning configuration para…

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: Accepted by 37th GI/ITG International Conference on Architecture of Computing Systems (ARCS 2024)

  8. arXiv:2407.11321  [pdf, other]

    cs.CV

    TCFormer: Visual Recognition via Token Clustering Transformer

    Authors: Wang Zeng, Sheng Jin, Lumin Xu, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo, Xiaogang Wang

    Abstract: Transformers are widely used in computer vision areas and have achieved remarkable success. Most state-of-the-art approaches split images into regular grids and represent each grid region with a vision token. However, fixed token distribution disregards the semantic meaning of different image regions, resulting in sub-optimal performance. To address this issue, we propose the Token Clustering Tran…

    Submitted 15 July, 2024; originally announced July 2024.

  9. arXiv:2407.11042  [pdf, other]

    cs.LG cs.AI

    An Automated Approach to Collecting and Labeling Time Series Data for Event Detection Using Elastic Node Hardware

    Authors: Tianheng Ling, Islam Mansour, Chao Qian, Gregor Schiele

    Abstract: Recent advancements in IoT technologies have underscored the importance of using sensor data to understand environmental contexts effectively. This paper introduces a novel embedded system designed to autonomously label sensor data directly on IoT devices, thereby enhancing the efficiency of data collection methods. We present an integrated hardware and software solution equipped with specialized…

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: This paper is accepted by the 4th Workshop on Collaborative Technologies and Data Science in Smart City Applications (CODASSCA 2024)

  10. arXiv:2407.11041  [pdf, other]

    cs.LG cs.AI

    Integer-only Quantized Transformers for Embedded FPGA-based Time-series Forecasting in AIoT

    Authors: Tianheng Ling, Chao Qian, Gregor Schiele

    Abstract: This paper presents the design of a hardware accelerator for Transformers, optimized for on-device time-series forecasting in AIoT systems. It integrates integer-only quantization and Quantization-Aware Training with optimized hardware designs to realize 6-bit and 4-bit quantized Transformer models, which achieved precision comparable to 8-bit quantized models from related research. Utilizing a co…

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: The paper is accepted by 2024 IEEE Annual Congress on Artificial Intelligence of Things (IEEE AIoT)

  11. arXiv:2407.10125  [pdf, other]

    cs.CV

    When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset

    Authors: Yi Zhang, Wang Zeng, Sheng Jin, Chen Qian, Ping Luo, Wentao Liu

    Abstract: Recent years have witnessed increasing research attention towards pedestrian detection by taking the advantages of different sensor modalities (e.g. RGB, IR, Depth, LiDAR and Event). However, designing a unified generalist model that can effectively process diverse sensor modalities remains a challenge. This paper introduces MMPedestron, a novel generalist model for multimodal perception. Unlike p…

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV'2024

  12. arXiv:2407.09056  [pdf, other]

    quant-ph hep-ex

    A Novel Quantum Realization of Jet Clustering in High-Energy Physics Experiments

    Authors: Yongfeng Zhu, Weifeng Zhuang, Chen Qian, Yunheng Ma, Dong E. Liu, Manqi Ruan, Chen Zhou

    Abstract: Exploring the application of quantum technologies to fundamental sciences holds the key to fostering innovation for both sides. In high-energy particle collisions, quarks and gluons are produced and immediately form collimated particle sprays known as jets. Accurate jet clustering is crucial as it retains the information of the originating quark or gluon and forms the basis for studying properties…

    Submitted 12 July, 2024; originally announced July 2024.

  13. arXiv:2407.07061  [pdf, other]

    cs.CL

    Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence

    Authors: Weize Chen, Ziming You, Ran Li, Yitong Guan, Chen Qian, Chenyang Zhao, Cheng Yang, Ruobing Xie, Zhiyuan Liu, Maosong Sun

    Abstract: The rapid advancement of large language models (LLMs) has paved the way for the development of highly capable autonomous agents. However, existing multi-agent frameworks often struggle with integrating diverse capable third-party agents due to reliance on agents defined within their own ecosystems. They also face challenges in simulating distributed environments, as most frameworks are limited to…

    Submitted 10 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: work in progress

  14. arXiv:2407.05102  [pdf, other]

    eess.SP cs.AI

    Towards Auto-Building of Embedded FPGA-based Soft Sensors for Wastewater Flow Estimation

    Authors: Tianheng Ling, Chao Qian, Gregor Schiele

    Abstract: Executing flow estimation using Deep Learning (DL)-based soft sensors on resource-limited IoT devices has demonstrated promise in terms of reliability and energy efficiency. However, its application in the field of wastewater flow estimation remains underexplored due to: (1) a lack of available datasets, (2) inconvenient toolchains for on-device AI model development and deployment, and (3) hardwar…

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: This paper is accepted by 2024 IEEE Annual Congress on Artificial Intelligence of Things (IEEE AIoT)

  15. arXiv:2407.02818  [pdf, other]

    cs.SE cs.ET cs.PL

    WizardMerge -- Save Us From Merging Without Any Clues

    Authors: Qingyu Zhang, Junzhe Li, Jiayi Lin, Jie Ding, Lanteng Lin, Chenxiong Qian

    Abstract: Modern software development necessitates efficient version-oriented collaboration among developers. While Git is the most popular version control system, it generates unsatisfactory version merging results due to textual-based workflow, leading to potentially unexpected results in the merged version of the project. Although numerous merging tools have been proposed for improving merge results, dev…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 22 pages

    ACM Class: D.2; D.3

  16. arXiv:2406.16950  [pdf]

    physics.flu-dyn

    The influence of flame-pressure waves collisions on the development and evolution of tulip flames

    Authors: Chengeng Qian, Mikhail A. Liberman

    Abstract: The effects of pressure waves-flame collisions and tube aspect ratio on flame evolution and the formation of tulip and distorted tulip flames were investigated using numerical simulations of the fully compressible Navier-Stokes equations coupled with a detailed chemical model for a stoichiometric hydrogen-air mixture. It is shown that: (1) the rarefaction wave generated by the decelerating flame i…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 33 pages, 12 figures, Nordita preprint

    Report number: Preprint NORDITA 2024-020

  17. arXiv:2406.16360  [pdf, other]

    cs.CV cs.GR

    MIRReS: Multi-bounce Inverse Rendering using Reservoir Sampling

    Authors: Yuxin Dai, Qi Wang, Jingsen Zhu, Dianbing Xi, Yuchi Huo, Chen Qian, Ying He

    Abstract: We present MIRReS, a novel two-stage inverse rendering framework that jointly reconstructs and optimizes the explicit geometry, material, and lighting from multi-view images. Unlike previous methods that rely on implicit irradiance fields or simplified path tracing algorithms, our method extracts an explicit geometry (triangular mesh) in stage one, and introduces a more realistic physically-based…

    Submitted 24 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: 16 pages, 14 figures

  18. arXiv:2406.16298  [pdf, other]

    hep-ph quant-ph

    Bell nonlocality and entanglement in $e^{+}e^{-} \rightarrow Y\bar{Y}$ at BESIII

    Authors: Sihao Wu, Chen Qian, Qun Wang, Xiao-Rong Zhou

    Abstract: The Bell nonlocality and entanglement are two kinds of quantum correlations in quantum systems. Due to the recent upgrade in Beijing Spectrometer III (BESIII) experiment, it is possible to explore the nonlocality and entanglement in hyperon-antihyperon systems produced in electron-positron annihilation with high precision data. We provide a systematic method for studying quantum correlations in sp…

    Submitted 28 June, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

    Comments: 9 pages, 4 figures, 4 tables. We corrected a few errors in plotting figures from analytical formula. Some results in tables read from figures have also been corrected. A new table (Table III) was added for the maximum concurrence and their corresponding angles. A few references were added

  19. arXiv:2406.16116  [pdf, ps, other]

    cs.NE

    A First Running Time Analysis of the Strength Pareto Evolutionary Algorithm 2 (SPEA2)

    Authors: Shengjie Ren, Chao Bian, Miqing Li, Chao Qian

    Abstract: Evolutionary algorithms (EAs) have emerged as a predominant approach for addressing multi-objective optimization problems. However, the theoretical foundation of multi-objective EAs (MOEAs), particularly the fundamental aspects like running time analysis, remains largely underexplored. Existing theoretical studies mainly focus on basic MOEAs, with little attention given to practical MOEAs. In this…

    Submitted 23 June, 2024; originally announced June 2024.

  20. arXiv:2406.14928  [pdf, other]

    cs.AI cs.CL cs.HC cs.MA cs.SI

    Autonomous Agents for Collaborative Task under Information Asymmetry

    Authors: Wei Liu, Chenxi Wang, Yifei Wang, Zihao Xie, Rennai Qiu, Yufan Dang, Zhuoyun Du, Weize Chen, Cheng Yang, Chen Qian

    Abstract: Large Language Model Multi-Agent Systems (LLM-MAS) have achieved great progress in solving complex tasks. It performs communication among agents within the system to collaboratively solve tasks, under the premise of shared information. However, when agents' communication is leveraged to enhance human cooperation, a new challenge arises due to information asymmetry, since each agent can only access…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 16 pages, 8 figures, 5 tables, Work in progress

  21. arXiv:2406.12383  [pdf, other]

    cs.DS cs.NE

    Biased Pareto Optimization for Subset Selection with Dynamic Cost Constraints

    Authors: Dan-Xuan Liu, Chao Qian

    Abstract: Subset selection with cost constraints aims to select a subset from a ground set to maximize a monotone objective function without exceeding a given budget, which has various applications such as influence maximization and maximum coverage. In real-world scenarios, the budget, representing available resources, may change over time, which requires that algorithms must adapt quickly to new budgets.…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: This paper has appeared at PPSN'24

  22. arXiv:2406.11721  [pdf, other]

    cs.CL cs.AI cs.LG

    Zero-Shot Generalization during Instruction Tuning: Insights from Similarity and Granularity

    Authors: Bingxiang He, Ning Ding, Cheng Qian, Jia Deng, Ganqu Cui, Lifan Yuan, Huan-ang Gao, Huimin Chen, Zhiyuan Liu, Maosong Sun

    Abstract: Understanding alignment techniques begins with comprehending zero-shot generalization brought by instruction tuning, but little of the mechanism has been understood. Existing work has largely been confined to the task level, without considering that tasks are artificially defined and, to LLMs, merely consist of tokens and representations. This line of research has been limited to examining transfe…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 33 pages, 14 figures

  23. arXiv:2406.10539  [pdf, other]

    cs.CV

    Self-Supervised Vision Transformer for Enhanced Virtual Clothes Try-On

    Authors: Lingxiao Lu, Shengyi Wu, Haoxuan Sun, Junhong Gou, Jianlou Si, Chen Qian, Jianfu Zhang, Liqing Zhang

    Abstract: Virtual clothes try-on has emerged as a vital feature in online shopping, offering consumers a critical tool to visualize how clothing fits. In our research, we introduce an innovative approach for virtual clothes try-on, utilizing a self-supervised Vision Transformer (ViT) coupled with a diffusion model. Our method emphasizes detail enhancement by contrasting local clothing image embeddings, gene…

    Submitted 15 June, 2024; originally announced June 2024.

  24. arXiv:2406.09180  [pdf, other]

    cs.LG

    Detection-Rate-Emphasized Multi-objective Evolutionary Feature Selection for Network Intrusion Detection

    Authors: Zi-Hang Cheng, Haopu Shang, Chao Qian

    Abstract: Network intrusion detection is one of the most important issues in the field of cyber security, and various machine learning techniques have been applied to build intrusion detection systems. However, since the number of features to describe the network connections is often large, where some features are redundant or noisy, feature selection is necessary in such scenarios, which can both improve t…

    Submitted 13 June, 2024; originally announced June 2024.

  25. arXiv:2406.08979  [pdf, other]

    cs.CL cs.AI cs.MA cs.SE

    Multi-Agent Software Development through Cross-Team Collaboration

    Authors: Zhuoyun Du, Chen Qian, Wei Liu, Zihao Xie, Yifei Wang, Yufan Dang, Weize Chen, Cheng Yang

    Abstract: The latest breakthroughs in Large Language Models (LLMs), e.g., ChatDev, have catalyzed profound transformations, particularly through multi-agent collaboration for software development. LLM agents can collaborate in teams like humans, and follow the waterfall model to sequentially work on requirements analysis, development, review, testing, and other phases to perform autonomous software generatio…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Work in progress

  26. arXiv:2406.07155  [pdf, other]

    cs.AI cs.CL cs.MA cs.NI cs.SI

    Scaling Large-Language-Model-based Multi-Agent Collaboration

    Authors: Chen Qian, Zihao Xie, Yifei Wang, Wei Liu, Yufan Dang, Zhuoyun Du, Weize Chen, Cheng Yang, Zhiyuan Liu, Maosong Sun

    Abstract: Pioneering advancements in large language model-powered agents have underscored the design pattern of multi-agent collaboration, demonstrating that collective intelligence can surpass the capabilities of each individual. Inspired by the neural scaling law, which posits that increasing neurons leads to emergent abilities, this study investigates whether a similar principle applies to increasing age…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Work in progress; The code and data will be available at https://github.com/OpenBMB/ChatDev

  27. arXiv:2406.05743  [pdf, other]

    cs.NE q-bio.BM

    Peptide Vaccine Design by Evolutionary Multi-Objective Optimization

    Authors: Dan-Xuan Liu, Yi-Heng Xu, Chao Qian

    Abstract: Peptide vaccines are growing in significance for fighting diverse diseases. Machine learning has improved the identification of peptides that can trigger immune responses, and the main challenge of peptide vaccine design now lies in selecting an effective subset of peptides due to the allelic diversity among individuals. Previous works mainly formulated this task as a constrained optimization prob…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: This paper has appeared at IJCAI'24

  28. arXiv:2406.04745  [pdf, other]

    cs.LG cs.CV

    Confidence-aware Contrastive Learning for Selective Classification

    Authors: Yu-Chang Wu, Shen-Huan Lyu, Haopu Shang, Xiangyu Wang, Chao Qian

    Abstract: Selective classification enables models to make predictions only when they are sufficiently confident, aiming to enhance safety and reliability, which is important in high-stakes scenarios. Previous methods mainly use deep neural networks and focus on modifying the architecture of classification layers to enable the model to estimate the confidence of its prediction. This work provides a generaliz…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted by ICML 2024

  29. arXiv:2406.03731  [pdf, other]

    cs.LG cs.NE

    Quality-Diversity with Limited Resources

    Authors: Ren-Jian Wang, Ke Xue, Cong Guan, Chao Qian

    Abstract: Quality-Diversity (QD) algorithms have emerged as a powerful optimization paradigm with the aim of generating a set of high-quality and diverse solutions. To achieve such a challenging goal, QD algorithms require maintaining a large archive and a large population in each iteration, which brings two main issues, sample and resource efficiency. Most advanced QD algorithms focus on improving the samp…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  30. arXiv:2406.03722  [pdf, other]

    cs.LG cs.AI cs.NE

    Offline Multi-Objective Optimization

    Authors: Ke Xue, Rong-Xi Tan, Xiaobin Huang, Chao Qian

    Abstract: Offline optimization aims to maximize a black-box objective function with a static dataset and has wide applications. In addition to the objective function being black-box and expensive to evaluate, numerous complex real-world problems entail optimizing multiple conflicting objectives, i.e., multi-objective optimization (MOO). Nevertheless, offline MOO has not progressed as much as offline single-…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  31. arXiv:2406.02658  [pdf, other]

    cs.NE

    Maintaining Diversity Provably Helps in Evolutionary Multimodal Optimization

    Authors: Shengjie Ren, Zhijia Qiu, Chao Bian, Miqing Li, Chao Qian

    Abstract: In the real world, there exists a class of optimization problems in which multiple (local) optimal solutions in the solution space correspond to a single point in the objective space. In this paper, we theoretically show that for such multimodal problems, a simple method that considers the diversity of solutions in the solution space can benefit the search in evolutionary algorithms (EAs). Specifically…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2406.02118

  32. arXiv:2406.02118  [pdf, other]

    cs.NE

    An Archive Can Bring Provable Speed-ups in Multi-Objective Evolutionary Algorithms

    Authors: Chao Bian, Shengjie Ren, Miqing Li, Chao Qian

    Abstract: In the area of multi-objective evolutionary algorithms (MOEAs), there is a trend of using an archive to store non-dominated solutions generated during the search. This is because 1) MOEAs may easily end up with the final population containing inferior solutions that are dominated by other solutions discarded during the search process and 2) the population that has a commensurable size of the probl…

    Submitted 4 June, 2024; originally announced June 2024.

  33. arXiv:2405.20247  [pdf, other]

    cs.AI cs.CV cs.LG cs.SE

    KerasCV and KerasNLP: Vision and Language Power-Ups

    Authors: Matthew Watson, Divyashree Shivakumar Sreepathihalli, Francois Chollet, Martin Gorner, Kiranbir Sodhia, Ramesh Sampath, Tirth Patel, Haifeng Jin, Neel Kovelamudi, Gabriel Rasskin, Samaneh Saadat, Luke Wood, Chen Qian, Jonathan Bischof, Ian Stenbit, Abheesht Sharma, Anshuman Mishra

    Abstract: We present the Keras domain packages KerasCV and KerasNLP, extensions of the Keras API for Computer Vision and Natural Language Processing workflows, capable of running on either JAX, TensorFlow, or PyTorch. These domain packages are designed to enable fast experimentation, with a focus on ease-of-use and performance. We adopt a modular, layered design: at the library's lowest level of abstraction…

    Submitted 5 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: Submitted to Journal of Machine Learning Open Source Software

    ACM Class: I.2.5; I.2.7; I.2.10

  34. arXiv:2405.17311  [pdf, other]

    cs.LG

    Probabilistic Graph Rewiring via Virtual Nodes

    Authors: Chendi Qian, Andrei Manolache, Christopher Morris, Mathias Niepert

    Abstract: Message-passing graph neural networks (MPNNs) have emerged as a powerful paradigm for graph-based machine learning. Despite their effectiveness, MPNNs face challenges such as under-reaching and over-squashing, where limited receptive fields and structural bottlenecks hinder information flow in the graph. While graph transformers hold promise in addressing these issues, their scalability is limited…

    Submitted 7 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.02156

  35. arXiv:2405.13839  [pdf, other]

    cs.GR

    Diffusing Winding Gradients (DWG): A Parallel and Scalable Method for 3D Reconstruction from Unoriented Point Clouds

    Authors: Weizhou Liu, Jiaze Li, Xuhui Chen, Fei Hou, Shiqing Xin, Xingce Wang, Zhongke Wu, Chen Qian, Ying He

    Abstract: This paper presents a method for reconstructing watertight 3D surfaces from unoriented point clouds. Starting with randomly initialized normals, the method iteratively refines each normal by diffusing the gradient of the generalized winding number (GWN) field. Upon convergence, the target surface is extracted using the standard Marching Cubes algorithm. Our method is conceptually simple, easy to i…

    Submitted 22 May, 2024; originally announced May 2024.

  36. arXiv:2405.04219  [pdf, other]

    cs.CL cs.AI cs.MA cs.SE

    Iterative Experience Refinement of Software-Developing Agents

    Authors: Chen Qian, Jiahao Li, Yufan Dang, Wei Liu, YiFei Wang, Zihao Xie, Weize Chen, Cheng Yang, Yingli Zhang, Zhiyuan Liu, Maosong Sun

    Abstract: Autonomous agents powered by large language models (LLMs) show significant potential for achieving high autonomy in various scenarios such as software development. Recent research has shown that LLM agents can leverage past experiences to reduce errors and enhance efficiency. However, the static experience paradigm, reliant on a fixed collection of past experiences acquired heuristically, lacks it…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Work in progress

  37. arXiv:2404.19541  [pdf, other]

    cs.CV cs.AI cs.GR eess.SP

    Ultra Inertial Poser: Scalable Motion Capture and Tracking from Sparse Inertial Sensors and Ultra-Wideband Ranging

    Authors: Rayan Armani, Changlin Qian, Jiaxi Jiang, Christian Holz

    Abstract: While camera-based capture systems remain the gold standard for recording human motion, learning-based tracking systems based on sparse wearable sensors are gaining popularity. Most commonly, they use inertial sensors, whose propensity for drift and jitter have so far limited tracking accuracy. In this paper, we propose Ultra Inertial Poser, a novel 3D full body pose estimation method that constra…

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: Accepted by SIGGRAPH 2024, Code: https://github.com/eth-siplab/UltraInertialPoser

    MSC Class: 68T07; 68T45; 68U01 ACM Class: I.2; I.3; I.4; I.5

  38. arXiv:2404.19401  [pdf, other]

    cs.CV

    UniFS: Universal Few-shot Instance Perception with Point Representations

    Authors: Sheng Jin, Ruijie Yao, Lumin Xu, Wentao Liu, Chen Qian, Ji Wu, Ping Luo

    Abstract: Instance perception tasks (object detection, instance segmentation, pose estimation, counting) play a key role in industrial applications of visual models. As supervised learning methods suffer from high labeling cost, few-shot learning methods which effectively learn from a limited number of labeled examples are desired. Existing few-shot learning methods primarily focus on a restricted set of ta…

    Submitted 18 July, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: Accepted by ECCV 2024

  39. arXiv:2404.09927  [pdf, other]

    cs.RO cs.LG

    Autonomous Path Planning for Intercostal Robotic Ultrasound Imaging Using Reinforcement Learning

    Authors: Yuan Bi, Cheng Qian, Zhicheng Zhang, Nassir Navab, Zhongliang Jiang

    Abstract: Ultrasound (US) has been widely used in daily clinical practice for screening internal organs and guiding interventions. However, due to the acoustic shadow cast by the subcutaneous rib cage, the US examination for thoracic application is still challenging. To fully cover and reconstruct the region of interest in US for diagnosis, an intercostal scanning path is necessary. To tackle this challenge…

    Submitted 15 April, 2024; originally announced April 2024.

  40. arXiv:2403.15627  [pdf]

    cond-mat.mtrl-sci

    Nanoscale Imaging of Phonons and Reconfiguration in Topologically-Engineered, Self-Assembled Nanoparticle Lattice

    Authors: Chang Qian, Ethan Stanifer, Zhan Ma, Binbin Luo, Chang Liu, Lehan Yao, Wenxiao Pan, Xiaoming Mao, Qian Chen

    Abstract: Topologically-engineered mechanical frames are important model constructs for architecture, machine mechanisms, and metamaterials. Despite significant advances in macroscopically fashioned frames, realization and phonon imaging of nanoframes have remained challenging. Here we extend for the first time the principles of topologically-engineered mechanical frames to lattices self-assembled from nano…

    Submitted 22 March, 2024; originally announced March 2024.

  41. Experimental Quantum Byzantine Agreement on a Three-User Quantum Network with Integrated Photonics

    Authors: Xu Jing, Cheng Qian, Chen-Xun Weng, Bing-Hong Li, Zhe Chen, Chen-Quan Wang, Jie Tang, Xiao-Wen Gu, Yue-Chan Kong, Tang-Sheng Chen, Hua-Lei Yin, Dong Jiang, Bin Niu, Liang-Liang Lu

    Abstract: Quantum communication networks are crucial for both secure communication and cryptographic networked tasks. Building quantum communication networks in a scalable and cost-effective way is essential for their widespread adoption, among which a stable and miniaturized high-quality quantum light source is a key component. Here, we establish a complete polarization entanglement-based fully connected n…

    Submitted 27 August, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

    Journal ref: Science Advances 10, eadp2877 (2024)

  42. arXiv:2403.10319  [pdf, other]

    cs.NI cs.CR

    NetBench: A Large-Scale and Comprehensive Network Traffic Benchmark Dataset for Foundation Models

    Authors: Chen Qian, Xiaochang Li, Qineng Wang, Gang Zhou, Huajie Shao

    Abstract: In computer networking, network traffic refers to the amount of data transmitted in the form of packets between internetworked computers or Cyber-Physical Systems. Monitoring and analyzing network traffic is crucial for ensuring the performance, security, and reliability of a network. However, a significant challenge in network traffic analysis is to process diverse data packets including both cip…

    Submitted 18 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

  43. arXiv:2403.09338  [pdf, other]

    cs.CV cs.AI

    LocalMamba: Visual State Space Model with Windowed Selective Scan

    Authors: Tao Huang, Xiaohuan Pei, Shan You, Fei Wang, Chen Qian, Chang Xu

    Abstract: Recent advancements in state space models, notably Mamba, have demonstrated significant progress in modeling long sequences for tasks like language understanding. Yet, their application in vision tasks has not markedly surpassed the performance of traditional Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs). This paper posits that the key to enhancing Vision Mamba (ViM) lies in…

    Submitted 14 March, 2024; originally announced March 2024.

  44. arXiv:2403.08604  [pdf, other]

    cs.CL cs.SE

    DevBench: A Comprehensive Benchmark for Software Development

    Authors: Bowen Li, Wenhan Wu, Ziwei Tang, Lin Shi, John Yang, Jinyang Li, Shunyu Yao, Chen Qian, Binyuan Hui, Qicheng Zhang, Zhiyin Yu, He Du, Ping Yang, Dahua Lin, Chao Peng, Kai Chen

    Abstract: Recent advancements in large language models (LLMs) have significantly enhanced their coding capabilities. However, existing benchmarks predominantly focused on simplified or isolated aspects of programming, such as single-file code generation or repository issue debugging, falling short of measuring the full spectrum of challenges raised by real-world programming activities. To this end, we propo…

    Submitted 15 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: Our data and code are available at https://github.com/open-compass/DevBench

  45. arXiv:2403.05155  [pdf, other]

    cs.CV

    LanePtrNet: Revisiting Lane Detection as Point Voting and Grouping on Curves

    Authors: Jiayan Cao, Xueyu Zhu, Cheng Qian

    Abstract: Lane detection plays a critical role in the field of autonomous driving. Prevailing methods generally adopt basic concepts (anchors, key points, etc.) from object detection and segmentation tasks, while these approaches require manual adjustments for curved objects, involve exhaustive searches on predefined anchors, require complex post-processing steps, and may lack flexibility when applied to re…

    Submitted 8 March, 2024; originally announced March 2024.

  46. FlowPrecision: Advancing FPGA-Based Real-Time Fluid Flow Estimation with Linear Quantization

    Authors: Tianheng Ling, Julian Hoever, Chao Qian, Gregor Schiele

    Abstract: In industrial and environmental monitoring, achieving real-time and precise fluid flow measurement remains a critical challenge. This study applies linear quantization in FPGA-based soft sensors for fluid flow estimation, significantly enhancing Neural Network model precision by overcoming the limitations of traditional fixed-point quantization. Our approach achieves up to a 10.10% reduction in Me…

    Submitted 20 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: 6 pages, 3 figures, The 22nd International Conference on Pervasive Computing and Communications (PerCom 2024), PerConAI Workshop

  47. arXiv:2403.01740  [pdf, other]

    cs.CV

    DEMOS: Dynamic Environment Motion Synthesis in 3D Scenes via Local Spherical-BEV Perception

    Authors: Jingyu Gong, Min Wang, Wentao Liu, Chen Qian, Zhizhong Zhang, Yuan Xie, Lizhuang Ma

    Abstract: Motion synthesis in real-world 3D scenes has recently attracted much attention. However, the static environment assumption made by most current methods usually cannot be satisfied especially for real-time motion synthesis in scanned point cloud scenes, if multiple dynamic objects exist, e.g., moving persons or vehicles. To handle this problem, we propose the first Dynamic Environment MOtion Synthe…

    Submitted 4 March, 2024; originally announced March 2024.

  48. arXiv:2402.19465  [pdf, other]

    cs.CL cs.AI

    Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models

    Authors: Chen Qian, Jie Zhang, Wei Yao, Dongrui Liu, Zhenfei Yin, Yu Qiao, Yong Liu, Jing Shao

    Abstract: Ensuring the trustworthiness of large language models (LLMs) is crucial. Most studies concentrate on fully pre-trained LLMs to better understand and improve LLMs' trustworthiness. In this paper, to reveal the untapped potential of pre-training, we pioneer the exploration of LLMs' trustworthiness during this period, focusing on five key dimensions: reliability, privacy, toxicity, fairness, and robu…

    Submitted 29 February, 2024; originally announced February 2024.

  49. Take It, Leave It, or Fix It: Measuring Productivity and Trust in Human-AI Collaboration

    Authors: Crystal Qian, James Wexler

    Abstract: Although recent developments in generative AI have greatly enhanced the capabilities of conversational agents such as Google's Gemini (formerly Bard) or OpenAI's ChatGPT, it's unclear whether the usage of these agents aids users across various contexts. To better understand how access to conversational AI affects productivity and trust, we conducted a mixed-methods, task-based user study, observin…

    Submitted 1 April, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: 15 pages. Published in the 29th International Conference on Intelligent User Interfaces (IUI '24)

  50. arXiv:2402.18439  [pdf, other]

    cs.CL cs.AI

    Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication

    Authors: Weize Chen, Chenfei Yuan, Jiarui Yuan, Yusheng Su, Chen Qian, Cheng Yang, Ruobing Xie, Zhiyuan Liu, Maosong Sun

    Abstract: Natural language (NL) has long been the predominant format for human cognition and communication, and by extension, has been similarly pivotal in the development and application of Large Language Models (LLMs). Yet, besides NL, LLMs have seen various non-NL formats during pre-training, such as code and logical expression. NL's status as the optimal format for LLMs, particularly in single-LLM reaso…

    Submitted 18 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: Code release at https://github.com/thunlp/AutoForm