Skip to main content

Showing 51–100 of 758 results for author: Zheng, C

.
  1. arXiv:2405.04032  [pdf, other

    cs.CR cs.AI

    Locally Differentially Private In-Context Learning

    Authors: Chunyan Zheng, Keke Sun, Wenhao Zhao, Haibo Zhou, Lixin Jiang, Shaoyang Song, Chunlai Zhou

    Abstract: Large pretrained language models (LLMs) have shown surprising In-Context Learning (ICL) ability. An important application in deploying large language models is to augment LLMs with a private database for some specific task. The main problem with this promising commercial use is that LLMs have been shown to memorize their training data and their prompt data are vulnerable to membership inference at… ▽ More

    Submitted 8 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

    Comments: This paper was published at LREC-Coling 2024

  2. arXiv:2405.02863  [pdf, other

    astro-ph.SR astro-ph.EP astro-ph.HE

    Stellar X-ray activity and habitability revealed by ROSAT sky survey

    Authors: Henggeng Han, Song Wang, Chuanjie Zheng, Xue Li, Kai Xiao, Jifeng Liu

    Abstract: Using the homogeneous X-ray catalog from ROSAT observations, we conducted a comprehensive investigation into stellar X-ray activity-rotation relations for both single and binary stars. Generally, the relation for single stars consists of two distinct regions: a weak decay region, indicating a continued dependence of the magnetic dynamo on stellar rotation rather than a saturation regime with const… ▽ More

    Submitted 20 May, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

    Comments: 17 pages, 12 figures, ApJS accepted

  3. arXiv:2405.02354  [pdf

    cs.LG cs.AI q-bio.QM

    Heterogeneous network and graph attention auto-encoder for LncRNA-disease association prediction

    Authors: Jin-Xing Liu, Wen-Yu Xi, Ling-Yun Dai, Chun-Hou Zheng, Ying-Lian Gao

    Abstract: The emerging research shows that lncRNAs are associated with a series of complex human diseases. However, most of the existing methods have limitations in identifying nonlinear lncRNA-disease associations (LDAs), and it remains a huge challenge to predict new LDAs. Therefore, the accurate identification of LDAs is very important for the warning and treatment of diseases. In this work, multiple sou… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 10 pages, 8 figures

    ACM Class: I.2.4; I.2.6; I.2.m

  4. arXiv:2405.02287  [pdf, other

    cs.CL cs.AI cs.CV

    Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models

    Authors: Piotr Padlewski, Max Bain, Matthew Henderson, Zhongkai Zhu, Nishant Relan, Hai Pham, Donovan Ong, Kaloyan Aleksiev, Aitor Ormazabal, Samuel Phua, Ethan Yeo, Eugenie Lamprecht, Qi Liu, Yuqi Wang, Eric Chen, Deyu Fu, Lei Li, Che Zheng, Cyprien de Masson d'Autume, Dani Yogatama, Mikel Artetxe, Yi Tay

    Abstract: We introduce Vibe-Eval: a new open benchmark and framework for evaluating multimodal chat models. Vibe-Eval consists of 269 visual understanding prompts, including 100 of hard difficulty, complete with gold-standard responses authored by experts. Vibe-Eval is open-ended and challenging with dual objectives: (i) vibe checking multimodal chat models for day-to-day tasks and (ii) rigorously testing a… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  5. arXiv:2405.01053  [pdf, other

    cs.LG cs.AI

    Explicitly Modeling Universality into Self-Supervised Learning

    Authors: Jingyao Wang, Wenwen Qiang, Zeen Song, Lingyu Si, Jiangmeng Li, Changwen Zheng, Bing Su

    Abstract: The goal of universality in self-supervised learning (SSL) is to learn universal representations from unlabeled data and achieve excellent performance on all samples and tasks. However, these methods lack explicit modeling of the universality in the learning objective, and the related theoretical understanding remains limited. This may cause models to overfit in data-scarce situations and generali… ▽ More

    Submitted 23 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: 28 pages, submitted to ICML24 with 7766

  6. arXiv:2404.19620  [pdf, other

    cs.LG cs.IR stat.ML

    Be Aware of the Neighborhood Effect: Modeling Selection Bias under Interference

    Authors: Haoxuan Li, Chunyuan Zheng, Sihao Ding, Peng Wu, Zhi Geng, Fuli Feng, Xiangnan He

    Abstract: Selection bias in recommender system arises from the recommendation process of system filtering and the interactive process of user selection. Many previous studies have focused on addressing selection bias to achieve unbiased learning of the prediction model, but ignore the fact that potential outcomes for a given user-item pair may vary with the treatments assigned to other user-item pairs, name… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: ICLR 24

  7. arXiv:2404.19596  [pdf, other

    cs.IR cs.LG

    Debiased Collaborative Filtering with Kernel-Based Causal Balancing

    Authors: Haoxuan Li, Chunyuan Zheng, Yanghao Xiao, Peng Wu, Zhi Geng, Xu Chen, Peng Cui

    Abstract: Debiased collaborative filtering aims to learn an unbiased prediction model by removing different biases in observational datasets. To solve this problem, one of the simple and effective methods is based on the propensity score, which adjusts the observational sample distribution to the target one by reweighting observed instances. Ideally, propensity scores should be learned with causal balancing… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: ICLR 24 Spotlight

  8. arXiv:2404.16792  [pdf, other

    cs.LG cs.AI cs.CL

    Weak-to-Strong Extrapolation Expedites Alignment

    Authors: Chujie Zheng, Ziqi Wang, Heng Ji, Minlie Huang, Nanyun Peng

    Abstract: The open-source community is experiencing a surge in the release of large language models (LLMs) that are trained to follow instructions and align with human preference. However, further training to improve them still requires expensive computational resources and data annotations. Is it possible to bypass additional training and cost-effectively acquire better-aligned models? Inspired by the lite… ▽ More

    Submitted 22 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Add theoretical explanation and more evaluation results

  9. arXiv:2404.16484  [pdf, other

    cs.CV eess.IV

    Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey

    Authors: Marcos V. Conde, Zhijun Lei, Wen Li, Cosmin Stejerean, Ioannis Katsavounidis, Radu Timofte, Kihwan Yoon, Ganzorig Gankhuyag, Jiangtao Lv, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Zhiyuan Li, Hao Wei, Chenyang Ge, Dongyang Zhang, Tianle Liu, Huaian Chen, Yi Jin, Menghan Zhou, Yiqiang Yan, Si Gao, Biao Wu, Shaoli Liu , et al. (50 additional authors not shown)

    Abstract: This paper introduces a novel benchmark as part of the AIS 2024 Real-Time Image Super-Resolution (RTSR) Challenge, which aims to upscale compressed images from 540p to 4K resolution (4x factor) in real-time on commercial GPUs. For this, we use a diverse test set containing a variety of 4K images ranging from digital art to gaming and photography. The images are compressed using the modern AVIF cod… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: CVPR 2024, AI for Streaming (AIS) Workshop

  10. arXiv:2404.13026  [pdf, other

    cs.CV cs.AI

    PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation

    Authors: Tianyuan Zhang, Hong-Xing Yu, Rundi Wu, Brandon Y. Feng, Changxi Zheng, Noah Snavely, Jiajun Wu, William T. Freeman

    Abstract: Realistic object interactions are crucial for creating immersive virtual experiences, yet synthesizing realistic 3D object dynamics in response to novel interactions remains a significant challenge. Unlike unconditional or text-conditioned dynamics generation, action-conditioned dynamics requires perceiving the physical material properties of objects and grounding the 3D motion prediction on these… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: Project website at: https://physdreamer.github.io/

  11. arXiv:2404.12387  [pdf, other

    cs.CL cs.CV

    Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models

    Authors: Reka Team, Aitor Ormazabal, Che Zheng, Cyprien de Masson d'Autume, Dani Yogatama, Deyu Fu, Donovan Ong, Eric Chen, Eugenie Lamprecht, Hai Pham, Isaac Ong, Kaloyan Aleksiev, Lei Li, Matthew Henderson, Max Bain, Mikel Artetxe, Nishant Relan, Piotr Padlewski, Qi Liu, Ren Chen, Samuel Phua, Yazheng Yang, Yi Tay, Yuqi Wang, Zhongkai Zhu , et al. (1 additional authors not shown)

    Abstract: We introduce Reka Core, Flash, and Edge, a series of powerful multimodal language models trained from scratch by Reka. Reka models are able to process and reason with text, images, video, and audio inputs. This technical report discusses details of training some of these models and provides comprehensive evaluation results. We show that Reka Edge and Reka Flash are not only state-of-the-art but al… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  12. arXiv:2404.12024  [pdf, other

    cs.CV

    Meta-Auxiliary Learning for Micro-Expression Recognition

    Authors: Jingyao Wang, Yunhan Tian, Yuxuan Yang, Xiaoxin Chen, Changwen Zheng, Wenwen Qiang

    Abstract: Micro-expressions (MEs) are involuntary movements revealing people's hidden feelings, which has attracted numerous interests for its objectivity in emotion detection. However, despite its wide applications in various scenarios, micro-expression recognition (MER) remains a challenging problem in real life due to three reasons, including (i) data-level: lack of data and imbalanced classes, (ii) feat… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 10 pages, 7 figures, 3 tables

  13. arXiv:2404.10337  [pdf, other

    cs.AI

    Intriguing Properties of Positional Encoding in Time Series Forecasting

    Authors: Jianqi Zhang, Jingyao Wang, Wenwen Qiang, Fanjiang Xu, Changwen Zheng, Fuchun Sun, Hui Xiong

    Abstract: Transformer-based methods have made significant progress in time series forecasting (TSF). They primarily handle two types of tokens, i.e., temporal tokens that contain all variables of the same timestamp, and variable tokens that contain all input time points for a specific variable. Transformer-based methods rely on positional encoding (PE) to mark tokens' positions, facilitating the model to pe… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  14. arXiv:2404.07127  [pdf, other

    astro-ph.SR astro-ph.EP astro-ph.GA

    Searching for short-period variables in M31: method and catalogs

    Authors: Hongrui Gu, Haibo Yuan, Subo Dong, Chenfa Zheng, Shenzhe Cui, Yi Ren, Haozhu Fu, Yang Huang, Zhou Fan

    Abstract: Utilizing high-cadence and continuous g- and r-band data over three nights acquired from the 3.6-meter Canada France Hawaii Telescope (CFHT) aimed to find short-duration microlensing events, we conduct a systematic search for variables, transients, and asteroids across a $\sim1^\circ$ field of view of the Andromeda Galaxy (M 31). We present a catalog of 5859 variable stars, yielding the most exten… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  15. arXiv:2404.06988  [pdf, other

    quant-ph

    Quantum Network Tomography via Learning Isometries on Stiefel Manifold

    Authors: Ze-Tong Li, Xin-Lin He, Cong-Cong Zheng, Yu-Qian Dong, Tian Luan, Xu-Tao Yu, Zai-Chen Zhang

    Abstract: Explicit mathematical reconstructions of quantum networks play a significant role in developing quantum information science. However, tremendous parameter requirements and physical constraint implementations have become computationally non-ignorable encumbrances. In this work, we propose an efficient method for quantum network tomography by learning isometries on the Stiefel manifold. Tasks of rec… ▽ More

    Submitted 6 May, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

  16. arXiv:2404.05242  [pdf, other

    cs.RO

    Collision-Free Trajectory Optimization in Cluttered Environments with Sums-of-Squares Programming

    Authors: Yulin Li, Chunxin Zheng, Kai Chen, Yusen Xie, Xindong Tang, Michael Yu Wang, Jun Ma

    Abstract: In this work, we propose a trajectory optimization approach for robot navigation in cluttered 3D environments. We represent the robot's geometry as a semialgebraic set defined by polynomial inequalities such that robots with general shapes can be suitably characterized. To address the robot navigation task in obstacle-dense environments, we exploit the free space directly to construct a sequence o… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  17. arXiv:2404.04922  [pdf, other

    cs.CV cs.AI

    Efficient Learnable Collaborative Attention for Single Image Super-Resolution

    Authors: Yigang Zhao Chaowei Zheng, Jiannan Su, GuangyongChen, MinGan

    Abstract: Non-Local Attention (NLA) is a powerful technique for capturing long-range feature correlations in deep single image super-resolution (SR). However, NLA suffers from high computational complexity and memory consumption, as it requires aggregating all non-local feature information for each query response and recalculating the similarity weight distribution for different abstraction levels of featur… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  18. arXiv:2404.04835  [pdf, other

    astro-ph.SR astro-ph.HE

    A born ultramassive white dwarf-hot subdwarf super-Chandrasekhar candidate

    Authors: Changqing Luo, Jiao Li, Chuanjie Zheng, Dongdong Liu, Zhenwei Li, Yangping Luo, Peter Nemeth, Bo Zhang, Jianping Xiong, Bo Wang, Song Wang, Yu Bai, Qingzheng Li, Pei Wang, Zhanwen Han, Jifeng Liu, Yang Huang, Xuefei Chen, Chao Liu

    Abstract: Although supernovae is a well-known endpoint of an accreting white dwarf, alternative theoretical possibilities has been discussing broadly, such as the accretion-induced collapse (AIC) event as the endpoint of oxygen-neon (ONe) white dwarfs, either accreting up to or merging to excess the Chandrasekhar limit (the maximum mass of a stable white dwarf). AIC is an important channel to form neutron s… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 25 pages, 14 figures

  19. arXiv:2404.03229  [pdf, other

    astro-ph.HE

    Relation between the keV-MeV and TeV emission of GRB 221009A and its implications

    Authors: Yan-Qiu Zhang, Hao-Xiang Lin, Shao-Lin Xiong, Zhuo Li, Ming-Yu Ge, Chen-Wei Wang, Shu-Xu Yi, Zhen Zhang, Shuang-Nan Zhang, Li-Ming Song, Chao Zheng, Wang-Chen Xue, Jia-Cong Liu, Wen-Jun Tan, Yue Wang, Wen-Long Zhang

    Abstract: Gamma-ray bursts (GRBs) are believed to launch relativistic jets, which generate prompt emission by their internal processes and drive external shocks into surrounding medium, accounting for the long-lasting afterglow emission. However, how the jet powers the external shock is an open question. The unprecedented observations of the keV-MeV emission with GECAM and the TeV emission with LHAASO of so… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  20. arXiv:2404.02145  [pdf, other

    cs.CV

    Iterated Learning Improves Compositionality in Large Vision-Language Models

    Authors: Chenhao Zheng, Jieyu Zhang, Aniruddha Kembhavi, Ranjay Krishna

    Abstract: A fundamental characteristic common to both human vision and natural language is their compositional nature. Yet, despite the performance gains contributed by large vision and language pretraining, recent investigations find that most-if not all-our state-of-the-art vision-language models struggle at compositionality. They are unable to distinguish between images of " a girl in white facing a man… ▽ More

    Submitted 16 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: CVPR 2024

  21. arXiv:2404.00889  [pdf, other

    astro-ph.SR

    Transport of the magnetic flux away from a decaying sunspot via convective motions

    Authors: Chenxi Zheng, Thierry Roudier, Brigitte Schmieder, Guiping Ruan, Jean-Marie Malherbe, Yang Liu, Yao Chen, Wenda Cao

    Abstract: Aims. The aim of this paper is to consider relationship between the decay of sunspots and convection via the motion of the family of granules and how the diffusion mechanism of magnetic field operates in a decaying sunspot. Methods. We report the decay of a sunspot observed by the 1.6m Goode Solar Telescope (GST) with the TiO Broadband Filter Imager (BFI) and the Near-InfraRed Imaging Spectropolar… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  22. arXiv:2403.19586  [pdf, other

    cs.CV cs.GR

    TOGS: Gaussian Splatting with Temporal Opacity Offset for Real-Time 4D DSA Rendering

    Authors: Shuai Zhang, Huangxuan Zhao, Zhenghong Zhou, Guanjun Wu, Chuansheng Zheng, Xinggang Wang, Wenyu Liu

    Abstract: Four-dimensional Digital Subtraction Angiography (4D DSA) is a medical imaging technique that provides a series of 2D images captured at different stages and angles during the process of contrast agent filling blood vessels. It plays a significant role in the diagnosis of cerebrovascular diseases. Improving the rendering quality and speed under sparse sampling is important for observing the status… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  23. arXiv:2403.18296  [pdf, other

    cs.LG cs.AI eess.SP

    GeNet: A Graph Neural Network-based Anti-noise Task-Oriented Semantic Communication Paradigm

    Authors: Chunhang Zheng, Kechao Cai

    Abstract: Traditional approaches to semantic communication tasks rely on the knowledge of the signal-to-noise ratio (SNR) to mitigate channel noise. Moreover, these methods necessitate training under specific SNR conditions, entailing considerable time and computational resources. In this paper, we propose GeNet, a Graph Neural Network (GNN)-based paradigm for semantic communication aimed at combating noise… ▽ More

    Submitted 14 May, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

  24. arXiv:2403.16812  [pdf, other

    cs.HC cs.AI

    Towards Human-AI Deliberation: Design and Evaluation of LLM-Empowered Deliberative AI for AI-Assisted Decision-Making

    Authors: Shuai Ma, Qiaoyi Chen, Xinru Wang, Chengbo Zheng, Zhenhui Peng, Ming Yin, Xiaojuan Ma

    Abstract: In AI-assisted decision-making, humans often passively review AI's suggestion and decide whether to accept or reject it as a whole. In such a paradigm, humans are found to rarely trigger analytical thinking and face difficulties in communicating the nuances of conflicting opinions to the AI when disagreements occur. To tackle this challenge, we propose Human-AI Deliberation, a novel framework to p… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  25. arXiv:2403.16071  [pdf, other

    cs.AI cs.CV cs.MM

    Landmark-Guided Cross-Speaker Lip Reading with Mutual Information Regularization

    Authors: Linzhi Wu, Xingyu Zhang, Yakun Zhang, Changyan Zheng, Tiejun Liu, Liang Xie, Ye Yan, Erwei Yin

    Abstract: Lip reading, the process of interpreting silent speech from visual lip movements, has gained rising attention for its wide range of realistic applications. Deep learning approaches greatly improve current lip reading systems. However, lip reading in cross-speaker scenarios where the speaker identity changes, poses a challenging problem due to inter-speaker variability. A well-trained lip reading s… ▽ More

    Submitted 2 May, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

    Comments: To appear in LREC-COLING 2024

    Journal ref: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

  26. arXiv:2403.15382  [pdf, other

    cs.CV

    DragAPart: Learning a Part-Level Motion Prior for Articulated Objects

    Authors: Ruining Li, Chuanxia Zheng, Christian Rupprecht, Andrea Vedaldi

    Abstract: We introduce DragAPart, a method that, given an image and a set of drags as input, can generate a new image of the same object in a new state, compatible with the action of the drags. Differently from prior works that focused on repositioning objects, DragAPart predicts part-level interactions, such as opening and closing a drawer. We study this problem as a proxy for learning a generalist motion… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: Project page: https://dragapart.github.io/

  27. arXiv:2403.14972  [pdf, other

    cs.AI cs.CL cs.MA cs.MM

    A Picture Is Worth a Graph: Blueprint Debate on Graph for Multimodal Reasoning

    Authors: Changmeng Zheng, Dayong Liang, Wengyu Zhang, Xiao-Yong Wei, Tat-Seng Chua, Qing Li

    Abstract: This paper presents a pilot study aimed at introducing multi-agent debate into multimodal reasoning. The study addresses two key challenges: the trivialization of opinions resulting from excessive summarization and the diversion of focus caused by distractor concepts introduced from images. These challenges stem from the inductive (bottom-up) nature of existing debating schemes. To address the iss… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: Work in progress

  28. arXiv:2403.14627  [pdf, other

    cs.CV

    MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images

    Authors: Yuedong Chen, Haofei Xu, Chuanxia Zheng, Bohan Zhuang, Marc Pollefeys, Andreas Geiger, Tat-Jen Cham, Jianfei Cai

    Abstract: We introduce MVSplat, an efficient model that, given sparse multi-view images as input, predicts clean feed-forward 3D Gaussians. To accurately localize the Gaussian centers, we build a cost volume representation via plane sweeping, where the cross-view feature similarities stored in the cost volume can provide valuable geometry cues to the estimation of depth. We also learn other Gaussian primiti… ▽ More

    Submitted 18 July, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: ECCV2024, Project page: https://donydchen.github.io/mvsplat, Code: https://github.com/donydchen/mvsplat

  29. arXiv:2403.14619  [pdf, other

    cs.CV

    ClusteringSDF: Self-Organized Neural Implicit Surfaces for 3D Decomposition

    Authors: Tianhao Wu, Chuanxia Zheng, Tat-Jen Cham, Qianyi Wu

    Abstract: 3D decomposition/segmentation still remains a challenge as large-scale 3D annotated data is not readily available. Contemporary approaches typically leverage 2D machine-generated segments, integrating them for 3D consistency. While the majority of these methods are based on NeRFs, they face a potential weakness that the instance/semantic embedding features derive from independent MLPs, thus preven… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Project Page: https://sm0kywu.github.io/ClusteringSDF/

  30. Observation of spectral lines in the exceptional GRB 221009A

    Authors: Yan-Qiu Zhang, Shao-Lin Xiong, Ji-Rong Mao, Shuang-Nan Zhang, Wang-Chen Xue, Chao Zheng, Jia-Cong Liu, Zhen Zhang, Xi-Lu Wang, Ming-Yu Ge, Shu-Xu Yi, Li-Ming Song, Zheng-Hua An, Ce Cai, Xin-Qiao Li, Wen-Xi Peng, Wen-Jun Tan, Chen-Wei Wang, Xiang-Yang Wen, Yue Wang, Shuo Xiao, Fan Zhang, Peng Zhang, Shi-Jie Zheng

    Abstract: As the brightest gamma-ray burst ever observed, GRB 221009A provided a precious opportunity to explore spectral line features. In this paper, we performed a comprehensive spectroscopy analysis of GRB 221009A jointly with GECAM-C and Fermi/GBM data to search for emission and absorption lines. For the first time we investigated the line feature throughout this GRB including the most bright part wher… ▽ More

    Submitted 28 May, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: Accepted by SCIENCE CHINA Physics, Mechanics & Astronomy (SCPMA)

    Journal ref: Observation of spectral lines in the exceptional GRB 221009A. Sci. China-Phys. Mech. Astron. 67, 289511 (2024)

  31. arXiv:2403.12327  [pdf, other

    cs.CV cs.LG

    GT-Rain Single Image Deraining Challenge Report

    Authors: Howard Zhang, Yunhao Ba, Ethan Yang, Rishi Upadhyay, Alex Wong, Achuta Kadambi, Yun Guo, Xueyao Xiao, Xiaoxiong Wang, Yi Li, Yi Chang, Luxin Yan, Chaochao Zheng, Luping Wang, Bin Liu, Sunder Ali Khowaja, Jiseok Yoon, Ik-Hyun Lee, Zhao Zhang, Yanyan Wei, Jiahuan Ren, Suiyi Zhao, Huan Zheng

    Abstract: This report reviews the results of the GT-Rain challenge on single image deraining at the UG2+ workshop at CVPR 2023. The aim of this competition is to study the rainy weather phenomenon in real world scenarios, provide a novel real world rainy image dataset, and to spark innovative ideas that will further the development of single image deraining methods on real images. Submissions were trained o… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  32. arXiv:2403.11449  [pdf, other

    cs.LG

    Graph Partial Label Learning with Potential Cause Discovering

    Authors: Hang Gao, Jiaguo Yuan, Jiangmeng Li, Peng Qiao, Fengge Wu, Changwen Zheng, Huaping Liu

    Abstract: Graph Neural Networks (GNNs) have garnered widespread attention for their potential to address the challenges posed by graph representation learning, which face complex graph-structured data across various domains. However, due to the inherent complexity and interconnectedness of graphs, accurately annotating graph data for training GNNs is extremely challenging. To address this issue, we have int… ▽ More

    Submitted 21 May, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

  33. arXiv:2403.11310  [pdf, other

    cs.CV

    A Dual-Augmentor Framework for Domain Generalization in 3D Human Pose Estimation

    Authors: Qucheng Peng, Ce Zheng, Chen Chen

    Abstract: 3D human pose data collected in controlled laboratory settings present challenges for pose estimators that generalize across diverse scenarios. To address this, domain generalization is employed. Current methodologies in domain generalization for 3D human pose estimation typically utilize adversarial training to generate synthetic poses for training. Nonetheless, these approaches exhibit several l… ▽ More

    Submitted 19 March, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  34. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  35. arXiv:2403.02635  [pdf, other

    cs.AI

    PPS-QMIX: Periodically Parameter Sharing for Accelerating Convergence of Multi-Agent Reinforcement Learning

    Authors: Ke Zhang, DanDan Zhu, Qiuhan Xu, Hao Zhou, Ce Zheng

    Abstract: Training for multi-agent reinforcement learning(MARL) is a time-consuming process caused by distribution shift of each agent. One drawback is that strategy of each agent in MARL is independent but actually in cooperation. Thus, a vertical issue in multi-agent reinforcement learning is how to efficiently accelerate training process. To address this problem, current research has leveraged a centrali… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 10 pages, 5 figures

  36. arXiv:2403.02513  [pdf, other

    cs.CL

    Balancing Enhancement, Harmlessness, and General Capabilities: Enhancing Conversational LLMs with Direct RLHF

    Authors: Chen Zheng, Ke Sun, Hang Wu, Chenguang Xi, Xun Zhou

    Abstract: In recent advancements in Conversational Large Language Models (LLMs), a concerning trend has emerged, showing that many new base LLMs experience a knowledge reduction in their foundational capabilities following Supervised Fine-Tuning (SFT). This process often leads to issues such as forgetting or a decrease in the base model's abilities. Moreover, fine-tuned models struggle to align with user pr… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  37. arXiv:2402.19143  [pdf, ps, other

    quant-ph

    Recurrence Theorem for Open Quantum Systems

    Authors: Zhihang Liu, Chao Zheng

    Abstract: Quantum (Poincaré) recurrence theorem are known for closed quantum (classical) systems. Can recurrence happen in open systems? We provide the recurrence theorem for open quantum systems via non-Hermitian (NH) description. We find that PT symmetry and pseudo-Hermitian symmetry protect recurrence for NH open quantum systems and the recurrence fails with the symmetry breaking. Applying our theorem… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  38. arXiv:2402.17384  [pdf, other

    astro-ph.SR astro-ph.EP

    Ultraviolet and Chromospheric activity and Habitability of M stars

    Authors: Xue Li, Song Wang, Henggeng Han, Huiqin Yang, Chuanjie Zheng, Yang Huang, Jifeng Liu

    Abstract: M-type stars are crucial for stellar activity studies since they cover two types of magnetic dynamos and particularly intriguing for habitability studies due to their abundance and long lifespans during the main-sequence stage. In this paper, we used the LAMOST DR9 catalog and the GALEX UV archive data to investigate the chromospheric and UV activities of M-type stars. All the chromospheric and UV… ▽ More

    Submitted 27 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: 27 pages, 32 figures, accepted by ApJ

  39. arXiv:2402.13572  [pdf, other

    cs.LG cs.AI math.NA

    On the Expressive Power of a Variant of the Looped Transformer

    Authors: Yihang Gao, Chuanyang Zheng, Enze Xie, Han Shi, Tianyang Hu, Yu Li, Michael K. Ng, Zhenguo Li, Zhaoqiang Liu

    Abstract: Besides natural language processing, transformers exhibit extraordinary performance in solving broader applications, including scientific computing and computer vision. Previous works try to explain this from the expressive power and capability perspectives that standard transformers are capable of performing some algorithms. To empower transformers with algorithmic capabilities and motivated by t… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  40. arXiv:2401.18018  [pdf, other

    cs.LG cs.AI cs.CL

    On Prompt-Driven Safeguarding for Large Language Models

    Authors: Chujie Zheng, Fan Yin, Hao Zhou, Fandong Meng, Jie Zhou, Kai-Wei Chang, Minlie Huang, Nanyun Peng

    Abstract: Prepending model inputs with safety prompts is a common practice for safeguarding large language models (LLMs) against queries with harmful intents. However, the underlying working mechanisms of safety prompts have not been unraveled yet, restricting the possibility of automatically optimizing them to improve LLM safety. In this work, we investigate how LLMs' behavior (i.e., complying with or refu… ▽ More

    Submitted 3 June, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

    Comments: ICML 2024

  41. arXiv:2401.17268  [pdf, other

    cs.CL cs.AI cs.LG

    Weaver: Foundation Models for Creative Writing

    Authors: Tiannan Wang, Jiamin Chen, Qingrui Jia, Shuai Wang, Ruoyu Fang, Huilin Wang, Zhaowei Gao, Chunzhao Xie, Chuou Xu, Jihong Dai, Yibin Liu, Jialong Wu, Shengwei Ding, Long Li, Zhiwei Huang, Xinle Deng, Teng Yu, Gangan Ma, Han Xiao, Zixin Chen, Danjun Xiang, Yunxia Wang, Yuanyuan Zhu, Yi Xiao, Jing Wang , et al. (21 additional authors not shown)

    Abstract: This work introduces Weaver, our first family of large language models (LLMs) dedicated to content creation. Weaver is pre-trained on a carefully selected corpus that focuses on improving the writing capabilities of large language models. We then fine-tune Weaver for creative and professional writing purposes and align it to the preference of professional writers using a suit of novel methods for… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  42. arXiv:2401.14915  [pdf, other

    cs.HC cs.AI

    Charting the Future of AI in Project-Based Learning: A Co-Design Exploration with Students

    Authors: Chengbo Zheng, Kangyu Yuan, Bingcan Guo, Reza Hadi Mogavi, Zhenhui Peng, Shuai Ma, Xiaojuan Ma

    Abstract: The increasing use of Artificial Intelligence (AI) by students in learning presents new challenges for assessing their learning outcomes in project-based learning (PBL). This paper introduces a co-design study to explore the potential of students' AI usage data as a novel material for PBL assessment. We conducted workshops with 18 college students, encouraging them to speculate an alternative worl… ▽ More

    Submitted 29 January, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: Conditionally accepted by CHI '24

  43. arXiv:2401.14857  [pdf, other

    cs.RO

    LIV-GaussMap: LiDAR-Inertial-Visual Fusion for Real-time 3D Radiance Field Map Rendering

    Authors: Sheng Hong, Junjie He, Xinhu Zheng, Chunran Zheng, Shaojie Shen

    Abstract: We introduce an integrated precise LiDAR, Inertial, and Visual (LIV) multimodal sensor fused mapping system that builds on the differentiable \pre{surface splatting }\now{Gaussians} to improve the mapping fidelity, quality, and structural accuracy. Notably, this is also a novel form of tightly coupled map for LiDAR-visual-inertial sensor fusion. This system leverages the complementary characteri… ▽ More

    Submitted 16 May, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

  44. arXiv:2401.14166  [pdf, other

    cs.CL cs.AI

    BayesPrompt: Prompting Large-Scale Pre-Trained Language Models on Few-shot Inference via Debiased Domain Abstraction

    Authors: Jiangmeng Li, Fei Song, Yifan Jin, Wenwen Qiang, Changwen Zheng, Fuchun Sun, Hui Xiong

    Abstract: As a novel and effective fine-tuning paradigm based on large-scale pre-trained language models (PLMs), prompt-tuning aims to reduce the gap between downstream tasks and pre-training objectives. While prompt-tuning has yielded continuous advancements in various tasks, such an approach still remains a persistent defect: prompt-tuning methods fail to generalize to specific few-shot patterns. From the… ▽ More

    Submitted 20 March, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted by ICLR2024

  45. arXiv:2401.12785  [pdf, other

    quant-ph

    Extended imaginary gauge transformation in a general nonreciprocal lattice

    Authors: Yunyao Qi, Jinghui Pi, Yuquan Wu, Heng Lin, Chao Zheng, Guilu Long

    Abstract: Imaginary gauge transformation (IGT) provides a clear understanding of the non-Hermitian skin effect by transforming the non-Hermitian Hamiltonians with real spectra into Hermitian ones. In this work, we extend this approach to the complex spectrum regime in a general nonreciprocal lattice model. We unveil the validity of IGT hinges on a class of pseudo-Hermitian symmetry. The generalized Brilloui… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 16 pages, 6 figures

  46. arXiv:2401.10973  [pdf, other

    cs.MA cs.LG

    T2MAC: Targeted and Trusted Multi-Agent Communication through Selective Engagement and Evidence-Driven Integration

    Authors: Chuxiong Sun, Zehua Zang, Jiabao Li, Jiangmeng Li, Xiao Xu, Rui Wang, Changwen Zheng

    Abstract: Communication stands as a potent mechanism to harmonize the behaviors of multiple agents. However, existing works primarily concentrate on broadcast communication, which not only lacks practicality, but also leads to information redundancy. This surplus, one-fits-all information could adversely impact the communication efficiency. Furthermore, existing works often resort to basic mechanisms to int… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: AAAI24

  47. arXiv:2401.08621  [pdf, other

    math.GM

    Algebraic structure of the Gaussian-PDMF space and applications on fuzzy equations

    Authors: Chuang Zheng

    Abstract: In this paper, we extend the research presented in [Wang and Zheng, Fuzzy Sets and Systems, p108581, 2023] by establishing the algebraic structure of the Gaussian Probability Density Membership Function (Gaussian-PDMF) space. We consider fixed objective and subjective entities, denoted as $(h,p)$, and provide the explicit form of the membership function. Consequently, every fuzzy number with the m… ▽ More

    Submitted 5 December, 2023; originally announced January 2024.

    Comments: 23 pages, 5 figures

    MSC Class: 03E72

  48. arXiv:2401.07513  [pdf, other

    astro-ph.IM hep-ex nucl-ex physics.ins-det

    Detector performance of the Gamma-ray Transient Monitor onboard DRO-A Satellite

    Authors: Pei-Yi Feng, Zheng-Hua An, Da-Li Zhang, Chen-Wei Wang, Chao Zheng, Sheng Yang, Shao-Lin Xiong, Jia-Cong Liu, Xin-Qiao Li, Ke Gong, Xiao-Jing Liu, Min Gao, Xiang-Yang Wen, Ya-Qing liu, Xiao-Yun Zhao, Fan Zhang, Xi-Lei Sun, Hong Lu

    Abstract: Gamma-ray Transient Monitor (GTM) is an all-sky monitor onboard the Distant Retrograde Orbit-A (DRO-A) satellite with the scientific objective of detecting gamma-ray transients ranging from 20 keV to 1 MeV. GTM is equipped with 5 Gamma-ray Transient Probe (GTP) detector modules, utilizing the NaI(Tl) scintillator coupled with a SiPM array. To reduce the SiPM noise, GTP makes use of a dedicated dua… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 13 pages, 25 figures

  49. arXiv:2401.05062  [pdf, other

    math.DG

    Discrete conformal structures on surfaces with boundary (I) -- Classification

    Authors: Xu Xu, Chao Zheng

    Abstract: In this paper, we introduce the discrete conformal structures on surfaces with boundary in an axiomatic approach pioneered by Glickenstein \cite{Glickenstein}. This ensures that the Poincaré dual of an ideally triangulated surface with boundary has a good geometric structure. Then we classify the discrete conformal structures on surfaces with boundary, which turns out to unify and generalize Guo-L… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    MSC Class: (2020): 52C25; 52C26

  50. arXiv:2401.05056  [pdf, ps, other

    math.DG

    A discrete uniformization theorem for decorated piecewise hyperbolic metrics on surfaces

    Authors: Xu Xu, Chao Zheng

    Abstract: In this paper, we study a natural discretization of the smooth Gaussian curvature on surfaces. A discrete uniformization theorem is established for this discrete Gaussian curvature. We further investigate the prescribing combinatorial curvature problem for a parametrization of this discrete Gaussian curvature, which is called the combinatorial $α$-curvature. To find decorated piecewise hyperbolic… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: text overlap with arXiv:2308.02271

    MSC Class: (2020): 52C26