Skip to main content

Showing 1–50 of 134 results for author: Yao, F

.
  1. arXiv:2407.14544  [pdf, other

    cs.DC

    Fast Iterative Graph Computing with Updated Neighbor States

    Authors: Yijie Zhou, Shufeng Gong, Feng Yao, Hanzhang Chen, Song Yu, Pengxi Liu, Yanfeng Zhang, Ge Yu, Jeffrey Xu Yu

    Abstract: Enhancing the efficiency of iterative computation on graphs has garnered considerable attention in both industry and academia. Nonetheless, the majority of efforts focus on expediting iterative computation by minimizing the running time per iteration step, ignoring the optimization of the number of iteration rounds, which is a crucial aspect of iterative computation. We experimentally verified the… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 14 pages, 13 figures, 2 tables; accepted for publication in ICDE 2024

  2. arXiv:2407.08010  [pdf

    cs.LG cs.NE

    A New Self-organizing Interval Type-2 Fuzzy Neural Network for Multi-Step Time Series Prediction

    Authors: Fulong Yao, Wanqing Zhao, Matthew Forshaw, Yang Song

    Abstract: This paper proposes a new self-organizing interval type-2 fuzzy neural network with multiple outputs (SOIT2FNN-MO) for multi-step time series prediction. Differing from the traditional six-layer IT2FNN, a nine-layer network is developed to improve prediction accuracy, uncertainty handling and model interpretability. First, a new co-antecedent layer and a modified consequent layer are devised to im… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  3. arXiv:2406.13236  [pdf, other

    cs.CL cs.AI

    Data Contamination Can Cross Language Barriers

    Authors: Feng Yao, Yufan Zhuang, Zihao Sun, Sunan Xu, Animesh Kumar, Jingbo Shang

    Abstract: The opacity in developing large language models (LLMs) is raising growing concerns about the potential contamination of public benchmarks in the pre-training data. Existing contamination detection methods are typically based on the text overlap between training and evaluation data, which can be too superficial to reflect deeper forms of contamination. In this paper, we first present a cross-lingua… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 12 pages, 5 figures

  4. arXiv:2406.06028  [pdf, other

    cs.CV

    ReCon1M:A Large-scale Benchmark Dataset for Relation Comprehension in Remote Sensing Imagery

    Authors: Xian Sun, Qiwei Yan, Chubo Deng, Chenglong Liu, Yi Jiang, Zhongyan Hou, Wanxuan Lu, Fanglong Yao, Xiaoyu Liu, Lingxiang Hao, Hongfeng Yu

    Abstract: Scene Graph Generation (SGG) is a high-level visual understanding and reasoning task aimed at extracting entities (such as objects) and their interrelationships from images. Significant progress has been made in the study of SGG in natural images in recent years, but its exploration in the domain of remote sensing images remains very limited. The complex characteristics of remote sensing images ne… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  5. arXiv:2406.04460  [pdf, other

    cs.CL

    Evaluating the Smooth Control of Attribute Intensity in Text Generation with LLMs

    Authors: Shang Zhou, Feng Yao, Chengyu Dong, Zihan Wang, Jingbo Shang

    Abstract: Controlling the attribute intensity of text generation is crucial across scenarios (e.g., writing conciseness, chatting emotion, and explanation clarity). The remarkable capabilities of large language models (LLMs) have revolutionized text generation, prompting us to explore such \emph{smooth control} of LLM generation. Specifically, we propose metrics to assess the range, calibration, and consist… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 Findings

  6. arXiv:2406.01007  [pdf, other

    hep-ex

    Measurement of Electron Antineutrino Oscillation Amplitude and Frequency via Neutron Capture on Hydrogen at Daya Bay

    Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, J. Cheng, Y. -C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng , et al. (177 additional authors not shown)

    Abstract: This Letter reports the first measurement of the oscillation amplitude and frequency of reactor antineutrinos at Daya Bay via neutron capture on hydrogen using 1958 days of data. With over 3.6 million signal candidates, an optimized candidate selection, improved treatment of backgrounds and efficiencies, refined energy calibration, and an energy response model for the capture-on-hydrogen sensitive… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  7. arXiv:2405.11204  [pdf, other

    cs.LG stat.ML

    Learning from Imperfect Human Feedback: a Tale from Corruption-Robust Dueling

    Authors: Yuwei Cheng, Fan Yao, Xuefeng Liu, Haifeng Xu

    Abstract: This paper studies Learning from Imperfect Human Feedback (LIHF), motivated by humans' potential irrationality or imperfect perception of true preference. We revisit the classic dueling bandit problem as a model of learning from comparative human feedback, and enrich it by casting the imperfection in human feedback as agnostic corruption to user utilities. We start by identifying the fundamental l… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

  8. arXiv:2404.18652  [pdf

    math.OC

    Energy Efficiency Optimization of Multi-unit System with Different Devices

    Authors: Fulai Yao

    Abstract: The energy efficiency optimization of the power generation system and the energy efficiency optimization of the energy consumption system are unified into the same optimization problem, and a simple method to achieve energy efficiency optimization without establishing an accurate mathematical model of the system is proposed. For systems with similar energy efficiency, it is proved that the best lo… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  9. arXiv:2404.18319  [pdf, other

    cs.IR

    User Welfare Optimization in Recommender Systems with Competing Content Creators

    Authors: Fan Yao, Yiming Liao, Mingzhe Wu, Chuanhao Li, Yan Zhu, James Yang, Qifan Wang, Haifeng Xu, Hongning Wang

    Abstract: Driven by the new economic opportunities created by the creator economy, an increasing number of content creators rely on and compete for revenue generated from online content recommendation platforms. This burgeoning competition reshapes the dynamics of content distribution and profoundly impacts long-term user welfare on the platform. However, the absence of a comprehensive picture of global use… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  10. arXiv:2404.14372  [pdf, other

    cs.CL cs.AI

    Beyond Scaling: Predicting Patent Approval with Domain-specific Fine-grained Claim Dependency Graph

    Authors: Xiaochen Kev Gao, Feng Yao, Kewen Zhao, Beilei He, Animesh Kumar, Vish Krishnan, Jingbo Shang

    Abstract: Model scaling is becoming the default choice for many language tasks due to the success of large language models (LLMs). However, it can fall short in specific scenarios where simple customized methods excel. In this paper, we delve into the patent approval pre-diction task and unveil that simple domain-specific graph methods outperform enlarging the model, using the intrinsic dependencies within… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 17 Pages, Under Review

  11. arXiv:2404.01687  [pdf, other

    hep-ex

    Search for a sub-eV sterile neutrino using Daya Bay's full dataset

    Authors: F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, Y. C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, X. Y. Ding, Y. Y. Ding , et al. (176 additional authors not shown)

    Abstract: This Letter presents results of a search for the mixing of a sub-eV sterile neutrino with three active neutrinos based on the full data sample of the Daya Bay Reactor Neutrino Experiment, collected during 3158 days of detector operation, which contains $5.55 \times 10^{6}$ reactor \anue candidates identified as inverse beta-decay interactions followed by neutron-capture on gadolinium. The analysis… ▽ More

    Submitted 15 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 7 pages, 4 figures, 1 table

  12. arXiv:2404.00693  [pdf, ps, other

    hep-ph hep-lat nucl-ex nucl-th

    Total Gluon Helicity from Lattice without Effective Theory Matching

    Authors: Zhuoyi Pang, Fei Yao, Jian-Hui Zhang

    Abstract: We propose two approaches for extracting the total gluon helicity contribution to proton spin from lattice QCD, one from local operator matrix elements in a fixed gauge accessible on lattice with feasible renormalization, and the other from gauge-invariant nonlocal gluon correlators. Neither of these approaches requires a matching procedure when converted to the MS scheme. Our proposal resolves a… ▽ More

    Submitted 29 June, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: 14 pages

  13. arXiv:2404.00457  [pdf, other

    cs.CL

    MetaIE: Distilling a Meta Model from LLM for All Kinds of Information Extraction Tasks

    Authors: Letian Peng, Zilong Wang, Feng Yao, Zihan Wang, Jingbo Shang

    Abstract: Information extraction (IE) is a fundamental area in natural language processing where prompting large language models (LLMs), even with in-context examples, cannot defeat small LMs tuned on very small IE datasets. We observe that IE tasks, such as named entity recognition and relation extraction, all focus on extracting important information, which can be formalized as a label-to-span matching. I… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  14. Green's matching: an efficient approach to parameter estimation in complex dynamic systems

    Authors: Jianbin Tan, Guoyu Zhang, Xueqin Wang, Hui Huang, Fang Yao

    Abstract: Parameters of differential equations are essential to characterize intrinsic behaviors of dynamic systems. Numerous methods for estimating parameters in dynamic systems are computationally and/or statistically inadequate, especially for complex systems with general-order differential operators, such as motion dynamics. This article presents Green's matching, a computationally tractable and statist… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 40 pages, 4 figures

    Journal ref: Journal of the Royal Statistical Society: Series B, 2024

  15. arXiv:2403.11400  [pdf, other

    math.ST

    Spatially Randomized Designs Can Enhance Policy Evaluation

    Authors: Ying Yang, Chengchun Shi, Fang Yao, Shouyang Wang, Hongtu Zhu

    Abstract: This article studies the benefits of using spatially randomized experimental designs which partition the experimental area into distinct, non-overlapping units with treatments assigned randomly. Such designs offer improved policy evaluation in online experiments by providing more precise policy value estimators and more effective A/B testing algorithms than traditional global designs, which apply… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  16. arXiv:2403.06239  [pdf, other

    cs.LG cs.AI

    Cooperative Classification and Rationalization for Graph Generalization

    Authors: Linan Yue, Qi Liu, Ye Liu, Weibo Gao, Fangzhou Yao, Wenfeng Li

    Abstract: Graph Neural Networks (GNNs) have achieved impressive results in graph classification tasks, but they struggle to generalize effectively when faced with out-of-distribution (OOD) data. Several approaches have been proposed to address this problem. Among them, one solution is to diversify training distributions in vanilla classification by modifying the data environment, yet accessing the environme… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: Accepted to WWW 2024

  17. arXiv:2403.01013  [pdf

    eess.SY

    A Holistic Power Optimization Approach for Microgrid Control Based on Deep Reinforcement Learning

    Authors: Fulong Yao, Wanqing Zhao, Matthew Forshaw, Yang Song

    Abstract: The global energy landscape is undergoing a transformation towards decarbonization, sustainability, and cost-efficiency. In this transition, microgrid systems integrated with renewable energy sources (RES) and energy storage systems (ESS) have emerged as a crucial component. However, optimizing the operational control of such an integrated energy system lacks a holistic view of multiple environmen… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  18. SFTformer: A Spatial-Frequency-Temporal Correlation-Decoupling Transformer for Radar Echo Extrapolation

    Authors: Liangyu Xu, Wanxuan Lu, Hongfeng Yu, Fanglong Yao, Xian Sun, Kun Fu

    Abstract: Extrapolating future weather radar echoes from past observations is a complex task vital for precipitation nowcasting. The spatial morphology and temporal evolution of radar echoes exhibit a certain degree of correlation, yet they also possess independent characteristics. {Existing methods learn unified spatial and temporal representations in a highly coupled feature space, emphasizing the correla… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 16 pages, 11 figures, TGRS

  19. arXiv:2402.15467  [pdf, other

    cs.GT cs.HC

    Human vs. Generative AI in Content Creation Competition: Symbiosis or Conflict?

    Authors: Fan Yao, Chuanhao Li, Denis Nekipelov, Hongning Wang, Haifeng Xu

    Abstract: The advent of generative AI (GenAI) technology produces transformative impact on the content creation landscape, offering alternative approaches to produce diverse, high-quality content across media, thereby reshaping online ecosystems but also raising concerns about market over-saturation and the potential marginalization of human creativity. Our work introduces a competition model generalized fr… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: 43 pages, 20 figures

  20. arXiv:2402.05383  [pdf, other

    nucl-ex hep-ex

    First measurement of the yield of $^8$He isotopes produced in liquid scintillator by cosmic-ray muons at Daya Bay

    Authors: Daya Bay Collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, Y. C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, X. Y. Ding , et al. (177 additional authors not shown)

    Abstract: Daya Bay presents the first measurement of cosmogenic $^8$He isotope production in liquid scintillator, using an innovative method for identifying cascade decays of $^8$He and its child isotope, $^8$Li. We also measure the production yield of $^9$Li isotopes using well-established methodology. The results, in units of 10$^{-8}μ^{-1}$g$^{-1}$cm$^{2}$, are 0.307$\pm$0.042, 0.341$\pm$0.040, and 0.546… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  21. Full-Body Motion Reconstruction with Sparse Sensing from Graph Perspective

    Authors: Feiyu Yao, Zongkai Wu, Li Yi

    Abstract: Estimating 3D full-body pose from sparse sensor data is a pivotal technique employed for the reconstruction of realistic human motions in Augmented Reality and Virtual Reality. However, translating sparse sensor signals into comprehensive human motion remains a challenge since the sparsely distributed sensors in common VR systems fail to capture the motion of full human body. In this paper, we use… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  22. arXiv:2401.02901  [pdf, other

    hep-ph hep-ex

    Charged-current non-standard neutrino interactions at Daya Bay

    Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, Y. C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, X. Y. Ding , et al. (177 additional authors not shown)

    Abstract: The full data set of the Daya Bay reactor neutrino experiment is used to probe the effect of the charged current non-standard interactions (CC-NSI) on neutrino oscillation experiments. Two different approaches are applied and constraints on the corresponding CC-NSI parameters are obtained with the neutrino flux taken from the Huber-Mueller model with a $5\%$ uncertainty. For the quantum mechanics-… ▽ More

    Submitted 19 March, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: 25 pages, 16 figures, 6 tables; 36 pages, format changed, references added

  23. arXiv:2312.13434  [pdf, other

    cs.AI cs.IR

    Zero-1-to-3: Domain-level Zero-shot Cognitive Diagnosis via One Batch of Early-bird Students towards Three Diagnostic Objectives

    Authors: Weibo Gao, Qi Liu, Hao Wang, Linan Yue, Haoyang Bi, Yin Gu, Fangzhou Yao, Zheng Zhang, Xin Li, Yuanjing He

    Abstract: Cognitive diagnosis seeks to estimate the cognitive states of students by exploring their logged practice quiz data. It plays a pivotal role in personalized learning guidance within intelligent education systems. In this paper, we focus on an important, practical, yet often underexplored task: domain-level zero-shot cognitive diagnosis (DZCD), which arises due to the absence of student practice lo… ▽ More

    Submitted 4 February, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI2024

  24. arXiv:2312.10392  [pdf, other

    math.NA

    Numerical approximation of discontinuous solutions of the semilinear wave equation

    Authors: Jiachuan Cao, Buyang Li, Yanping Lin, Fangyan Yao

    Abstract: A fully discrete low-regularity integrator with high-frequency recovery techniques is constructed to approximate rough and possibly discontinuous solutions of the semilinear wave equation. The proposed method can capture the discontinuities of the solutions correctly without spurious oscillations and can approximate rough and discontinuous solutions with a higher convergence rate than pre-existing… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

  25. arXiv:2311.14977  [pdf

    cs.CV cs.MM

    Incorporating granularity bias as the margin into contrastive loss for video captioning

    Authors: Jiayang Gu, Fengming Yao

    Abstract: Video captioning models easily suffer from long-tail distribution of phrases, which makes captioning models prone to generate vague sentences instead of accurate ones. However, existing debiasing strategies tend to export external knowledge to build dependency trees of words or refine frequency distribution by complex losses and extra input features, which lack interpretability and are hard to tra… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

    Comments: 6 pages, 2 figures

  26. arXiv:2311.02085  [pdf, other

    cs.IR cs.AI

    Preference Elicitation with Soft Attributes in Interactive Recommendation

    Authors: Erdem Biyik, Fan Yao, Yinlam Chow, Alex Haig, Chih-wei Hsu, Mohammad Ghavamzadeh, Craig Boutilier

    Abstract: Preference elicitation plays a central role in interactive recommender systems. Most preference elicitation approaches use either item queries that ask users to select preferred items from a slate, or attribute queries that ask them to express their preferences for item characteristics. Unfortunately, users often wish to describe their preferences using soft attributes for which no ground-truth se… ▽ More

    Submitted 22 October, 2023; originally announced November 2023.

  27. MUSER: A Multi-View Similar Case Retrieval Dataset

    Authors: Qingquan Li, Yiran Hu, Feng Yao, Chaojun Xiao, Zhiyuan Liu, Maosong Sun, Weixing Shen

    Abstract: Similar case retrieval (SCR) is a representative legal AI application that plays a pivotal role in promoting judicial fairness. However, existing SCR datasets only focus on the fact description section when judging the similarity between cases, ignoring other valuable sections (e.g., the court's opinion) that can provide insightful reasoning process behind. Furthermore, the case similarities are t… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted by CIKM 2023 Resource Track

    Journal ref: CIKM 2023

  28. arXiv:2310.12436  [pdf, other

    math.ST

    Learning prediction function of prior measures for statistical inverse problems of partial differential equations

    Authors: Junxiong Jia, Deyu Meng, Zongben Xu, Fang Yao

    Abstract: In this paper, we view the statistical inverse problems of partial differential equations (PDEs) as PDE-constrained regression and focus on learning the prediction function of the prior probability measures. From this perspective, we propose general generalization bounds for learning infinite-dimensionally defined prior measures in the style of the probability approximately correct Bayesian learni… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 57 pages

    MSC Class: 62F15; 65N21

  29. arXiv:2309.14258  [pdf, other

    cs.CL cs.AI

    OmniEvent: A Comprehensive, Fair, and Easy-to-Use Toolkit for Event Understanding

    Authors: Hao Peng, Xiaozhi Wang, Feng Yao, Zimu Wang, Chuzhao Zhu, Kaisheng Zeng, Lei Hou, Juanzi Li

    Abstract: Event understanding aims at understanding the content and relationship of events within texts, which covers multiple complicated information extraction tasks: event detection, event argument extraction, and event relation extraction. To facilitate related research and application, we present an event understanding toolkit OmniEvent, which features three desiderata: (1) Comprehensive. OmniEvent sup… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

  30. arXiv:2309.08173  [pdf, other

    cs.CL

    FedJudge: Federated Legal Large Language Model

    Authors: Linan Yue, Qi Liu, Yichao Du, Weibo Gao, Ye Liu, Fangzhou Yao

    Abstract: Large Language Models (LLMs) have gained prominence in the field of Legal Intelligence, offering potential applications in assisting legal professionals and laymen. However, the centralized training of these Legal LLMs raises data privacy concerns, as legal data is distributed among various institutions containing sensitive individual information. This paper addresses this challenge by exploring t… ▽ More

    Submitted 10 April, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: Accepted to DASFAA 2024

  31. arXiv:2309.00300  [pdf, other

    cs.AI

    Towards the Identifiability and Explainability for Personalized Learner Modeling: An Inductive Paradigm

    Authors: Jiatong Li, Qi Liu, Fei Wang, Jiayu Liu, Zhenya Huang, Fangzhou Yao, Linbo Zhu, Yu Su

    Abstract: Personalized learner modeling using cognitive diagnosis (CD), which aims to model learners' cognitive states by diagnosing learner traits from behavioral data, is a fundamental yet significant task in many web learning services. Existing cognitive diagnosis models (CDMs) follow the proficiency-response paradigm that views learner traits and question parameters as trainable embeddings and learns th… ▽ More

    Submitted 19 February, 2024; v1 submitted 1 September, 2023; originally announced September 2023.

    Comments: Accepted by the ACM Web Conference 2024 (WWW '24)

  32. arXiv:2308.08355  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Multiple antiferromagnetic phases and magnetic anisotropy in exfoliated CrBr$_3$ multilayers

    Authors: Fengrui Yao, Volodymyr Multian, Zhe Wang, Nicolas Ubrig, Jérémie Teyssier, Fan Wu, Enrico Giannini, Marco Gibertini, Ignacio Gutiérrez-Lezama, Alberto F. Morpurgo

    Abstract: In twisted two-dimensional (2D) magnets, the stacking dependence of the magnetic exchange interaction can lead to regions of ferromagnetic and antiferromagnetic interlayer order, separated by non-collinear, skyrmion-like spin textures. Recent experimental searches for these textures have focused on CrI$_3$, known to exhibit either ferromagnetic or antiferromagnetic interlayer order, depending on l… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

  33. arXiv:2308.06207  [pdf, other

    cs.CL

    Thinking Like an Expert:Multimodal Hypergraph-of-Thought (HoT) Reasoning to boost Foundation Modals

    Authors: Fanglong Yao, Changyuan Tian, Jintao Liu, Zequn Zhang, Qing Liu, Li Jin, Shuchao Li, Xiaoyu Li, Xian Sun

    Abstract: Reasoning ability is one of the most crucial capabilities of a foundation model, signifying its capacity to address complex reasoning tasks. Chain-of-Thought (CoT) technique is widely regarded as one of the effective methods for enhancing the reasoning ability of foundation models and has garnered significant attention. However, the reasoning process of CoT is linear, step-by-step, similar to pers… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

  34. arXiv:2306.07893  [pdf, other

    cs.GT

    Rethinking Incentives in Recommender Systems: Are Monotone Rewards Always Beneficial?

    Authors: Fan Yao, Chuanhao Li, Karthik Abinav Sankararaman, Yiming Liao, Yan Zhu, Qifan Wang, Hongning Wang, Haifeng Xu

    Abstract: The past decade has witnessed the flourishing of a new profession as media content creators, who rely on revenue streams from online content recommendation platforms. The reward mechanism employed by these platforms creates a competitive environment among creators which affect their production choices and, consequently, content distribution and system welfare. It is thus crucial to design the plat… ▽ More

    Submitted 9 July, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

  35. arXiv:2306.06918  [pdf, other

    cs.CL cs.AI

    The Devil is in the Details: On the Pitfalls of Event Extraction Evaluation

    Authors: Hao Peng, Xiaozhi Wang, Feng Yao, Kaisheng Zeng, Lei Hou, Juanzi Li, Zhiyuan Liu, Weixing Shen

    Abstract: Event extraction (EE) is a crucial task aiming at extracting events from texts, which includes two subtasks: event detection (ED) and event argument extraction (EAE). In this paper, we check the reliability of EE evaluations and identify three major pitfalls: (1) The data preprocessing discrepancy makes the evaluation results on the same dataset not directly comparable, but the data preprocessing… ▽ More

    Submitted 15 June, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: Accepted at Findings of ACL 2023

  36. arXiv:2305.16236  [pdf

    stat.ME math.ST

    Robust Functional Data Analysis for Discretely Observed Data

    Authors: Lingxuan Shao, Fang Yao

    Abstract: This paper examines robust functional data analysis for discretely observed data, where the underlying process encompasses various distributions, such as heavy tail, skewness, or contaminations. We propose a unified robust concept of functional mean, covariance, and principal component analysis, while existing methods and definitions often differ from one another or only address fully observed fun… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

  37. Dynamic Matrix Recovery

    Authors: Ziyuan Chen, Ying Yang, Fang Yao

    Abstract: Matrix recovery from sparse observations is an extensively studied topic emerging in various applications, such as recommendation system and signal processing, which includes the matrix completion and compressed sensing models as special cases. In this work we propose a general framework for dynamic matrix recovery of low-rank matrices that evolve smoothly over time. We start from the setting that… ▽ More

    Submitted 21 November, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: Journal of the American Statistical Association (2023)

  38. arXiv:2305.08172  [pdf, other

    stat.ME

    Fast Signal Region Detection with Application to Whole Genome Association Studies

    Authors: Wei Zhang, Fan Wang, Fang Yao

    Abstract: Research on the localization of the genetic basis associated with diseases or traits has been widely conducted in the last a few decades. Scan methods have been developed for region-based analysis in whole-genome association studies, helping us better understand how genetics influences human diseases or traits, especially when the aggregated effects of multiple causal variants are present. In this… ▽ More

    Submitted 8 February, 2024; v1 submitted 14 May, 2023; originally announced May 2023.

  39. arXiv:2302.13908  [pdf, ps, other

    math.ST

    Nonparametric regression for repeated measurements with deep neural networks

    Authors: Shunxing Yan, Fang Yao

    Abstract: Analysis of repeated measurements for a sample of subjects has been intensively studied with several important branches developed, including longitudinal/panel/functional data analysis, while nonparametric regression of the mean function serves as a cornerstone that many statistical models are built upon. In this work, we investigate this problem using fully connected deep neural network (DNN) est… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: 20 pages, 0 figures

  40. arXiv:2302.01971  [pdf, other

    cs.GT cs.CY cs.IR cs.LG

    How Bad is Top-$K$ Recommendation under Competing Content Creators?

    Authors: Fan Yao, Chuanhao Li, Denis Nekipelov, Hongning Wang, Haifeng Xu

    Abstract: Content creators compete for exposure on recommendation platforms, and such strategic behavior leads to a dynamic shift over the content distribution. However, how the creators' competition impacts user welfare and how the relevance-driven recommendation influences the dynamics in the long run are still largely unknown. This work provides theoretical insights into these research questions. We mo… ▽ More

    Submitted 2 May, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: Accepted as ICML2023 Oral

  41. arXiv:2301.05952  [pdf, other

    astro-ph.GA

    A New WISE Calibration of Stellar Mass

    Authors: T. H. Jarrett, M. E. Cluver, Edward N. Taylor, Sabine Bellstedt, A. S. G Robotham, H. F. M. Yao

    Abstract: We derive new empirical scaling relations between WISE mid-infrared galaxy photometry and well-determined stellar masses from SED modeling of a suite of optical-infrared photometry provided by the DR4 Catalogue of the GAMA-KiDS-VIKING survey of the southern G23 field. The mid-infrared source extraction and characterization are drawn from the WISE Extended Source Catalogue (WXSC) and the archival A… ▽ More

    Submitted 14 January, 2023; originally announced January 2023.

    Comments: Accepted for publication in the Astrophysical Journal (ApJ)

  42. arXiv:2301.04352  [pdf, ps, other

    cs.CV cs.AI cs.RO

    Graph based Environment Representation for Vision-and-Language Navigation in Continuous Environments

    Authors: Ting Wang, Zongkai Wu, Feiyu Yao, Donglin Wang

    Abstract: Vision-and-Language Navigation in Continuous Environments (VLN-CE) is a navigation task that requires an agent to follow a language instruction in a realistic environment. The understanding of environments is a crucial part of the VLN-CE task, but existing methods are relatively simple and direct in understanding the environment, without delving into the relationship between language instructions… ▽ More

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: 10 pages, 5 figures

  43. arXiv:2212.14415  [pdf, other

    hep-ph hep-lat nucl-th

    Connecting Euclidean to light-cone correlations: From flavor nonsinglet in forward kinematics to flavor singlet in non-forward kinematics

    Authors: Fei Yao, Yao Ji, Jian-Hui Zhang

    Abstract: We present a unified framework for the perturbative factorization connecting Euclidean correlations to light-cone correlations. Starting from nonlocal quark and gluon bilinear correlators, we derive the relevant hard-matching kernel up to the next-to-leading-order, both for the flavor singlet and non-singlet combinations, in non-forward and forward kinematics, and in coordinate and momentum space.… ▽ More

    Submitted 14 November, 2023; v1 submitted 29 December, 2022; originally announced December 2022.

    Comments: 31 pages, 4 figures, typos corrected, additional appendix added

    Report number: TUM-HEP-1446/22

  44. arXiv:2211.14988  [pdf, other

    hep-ex

    Precision measurement of reactor antineutrino oscillation at kilometer-scale baselines by Daya Bay

    Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, Y. Y. Ding, X. Y. Ding , et al. (176 additional authors not shown)

    Abstract: We present a new determination of the smallest neutrino mixing angle $θ_{13}$ and the mass-squared difference $Δ{\rm m}^{2}_{32}$ using a final sample of $5.55 \times 10^{6}$ inverse beta-decay (IBD) candidates with the final-state neutron captured on gadolinium. This sample was selected from the complete data set obtained by the Daya Bay reactor neutrino experiment in 3158 days of operation. Comp… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: 7 pages, 3 figures, 1 table, 10 supplementary files

  45. arXiv:2209.08768  [pdf, other

    math.ST

    Theory of functional principal component analysis for discretely observed data

    Authors: Hang Zhou, Dongyi Wei, Fang Yao

    Abstract: Functional data analysis is an important research field in statistics which treats data as random functions drawn from some infinite-dimensional functional space, and functional principal component analysis (FPCA) based on eigen-decomposition plays a central role for data reduction and representation. After nearly three decades of research, there remains a key problem unsolved, namely, the perturb… ▽ More

    Submitted 1 April, 2024; v1 submitted 19 September, 2022; originally announced September 2022.

  46. arXiv:2209.01236  [pdf, other

    hep-ph hep-lat nucl-th

    Resumming Quark's Longitudinal Momentum Logarithms in LaMET Expansion of Lattice PDFs

    Authors: Yushan Su, Jack Holligan, Xiangdong Ji, Fei Yao, Jian-Hui Zhang, Rui Zhang

    Abstract: In the large-momentum expansion for parton distribution functions (PDFs), the natural physics scale is the longitudinal momentum ($p_z$) of the quarks (or gluons) in a large-momentum hadron. We show how to expose this scale dependence through resumming logarithms of the type $\ln^n p_z/μ$ in the matching coefficient, where $μ$ is a fixed renormalization scale. The result enhances the accuracy of t… ▽ More

    Submitted 29 March, 2023; v1 submitted 2 September, 2022; originally announced September 2022.

    Comments: 17 pages, 9 figures

  47. arXiv:2208.08008  [pdf, other

    hep-lat

    Nucleon Transversity Distribution in the Continuum and Physical Mass Limit from Lattice QCD

    Authors: Fei Yao, Lisa Walter, Jiunn-Wei Chen, Jun Hua, Xiangdong Ji, Luchang Jin, Sebastian Lahrtz, Lingquan Ma, Protick Mohanta, Andreas Schäfer, Hai-Tao Shu, Yushan Su, Peng Sun, Xiaonu Xiong, Yi-Bo Yang, Jian-Hui Zhang

    Abstract: We report a state-of-the-art lattice QCD calculation of the isovector quark transversity distribution of the proton in the continuum and physical mass limit using large-momentum effective theory. The calculation is done at four lattice spacings $a=\{0.098,0.085,0.064,0.049\}$~fm and various pion masses ranging between $220$ and $350$ MeV, with proton momenta up to $2.8$ GeV. The result is non-pert… ▽ More

    Submitted 24 February, 2023; v1 submitted 16 August, 2022; originally announced August 2022.

    Comments: 16 pages, 18 figures, 2 tables

  48. Connecting MeerKAT radio continuum properties to GAMA optical emission-line and WISE mid-infrared activity

    Authors: H. F. M. Yao, M. E. Cluver, T. H. Jarrett, Gyula I. G. Jozsa, M. G. Santos, L. Marchetti, M. J. I. Brown, Y. A. Gordon, S. Brough, A. M. Hopkins, B. W. Holwerda, S. P. Driver, E. M. Sadler

    Abstract: The identification of AGN in large surveys has been hampered by seemingly discordant classifications arising from differing diagnostic methods, usually tracing distinct processes specific to a particular wavelength regime. However, as shown in Yao et al. (2020), the combination of optical emission line measurements and mid-infrared photometry can be used to optimise the discrimination capability b… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

  49. arXiv:2207.14238  [pdf, other

    eess.IV cs.CV

    Re-thinking and Re-labeling LIDC-IDRI for Robust Pulmonary Cancer Prediction

    Authors: Hanxiao Zhang, Xiao Gu, Minghui Zhang, Weihao Yu, Liang Chen, Zhexin Wang, Feng Yao, Yun Gu, Guang-Zhong Yang

    Abstract: The LIDC-IDRI database is the most popular benchmark for lung cancer prediction. However, with subjective assessment from radiologists, nodules in LIDC may have entirely different malignancy annotations from the pathological ground truth, introducing label assignment errors and subsequent supervision bias during training. The LIDC database thus requires more objective labels for learning-based can… ▽ More

    Submitted 28 July, 2022; originally announced July 2022.

  50. arXiv:2207.02175  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Band Gap Opening in Bilayer Graphene-CrCl$_3$/CrBr$_3$/CrI$_3$ van der Waals Interfaces

    Authors: Giulia Tenasini, David Soler-Delgado, Zhe Wang, Fengrui Yao, Dumitru Dumcenco, Enrico Giannini, Kenji Watanabe, Takashi Taniguchi, Christian Moulsdale, Aitor Garcia-Ruiz, Vladimir I. Fal'ko, Ignacio Gutiérrez-Lezama, Alberto F. Morpurgo

    Abstract: We report experimental investigations of transport through bilayer graphene (BLG)/chromium trihalide (CrX$_3$; X=Cl, Br, I) van der Waals interfaces. In all cases, a large charge transfer from BLG to CrX$_3$ takes place (reaching densities in excess of $10^{13}$ cm$^{-2}$), and generates an electric field perpendicular to the interface that opens a band gap in BLG. We determine the gap from the ac… ▽ More

    Submitted 24 August, 2022; v1 submitted 5 July, 2022; originally announced July 2022.

    Journal ref: Nano Lett., 22 (16), 6760-6766 (2022)