Skip to main content

Showing 1–50 of 1,186 results for author: Tan, C

.
  1. arXiv:2408.15903  [pdf, other

    cs.CL

    LLM-Based Multi-Hop Question Answering with Knowledge Graph Integration in Evolving Environments

    Authors: Ruirui Chen, Weifeng Jiang, Chengwei Qin, Ishaan Singh Rawal, Cheston Tan, Dongkyu Choi, Bo Xiong, Bo Ai

    Abstract: The rapid obsolescence of information in Large Language Models (LLMs) has driven the development of various techniques to incorporate new facts. However, existing methods for knowledge editing still face difficulties with multi-hop questions that require accurate fact identification and sequential logical reasoning, particularly among numerous fact updates. To tackle these challenges, this paper i… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

  2. arXiv:2408.14917  [pdf, other

    cs.NE

    PMSN: A Parallel Multi-compartment Spiking Neuron for Multi-scale Temporal Processing

    Authors: Xinyi Chen, Jibin Wu, Chenxiang Ma, Yinsong Yan, Yujie Wu, Kay Chen Tan

    Abstract: Spiking Neural Networks (SNNs) hold great potential to realize brain-inspired, energy-efficient computational systems. However, current SNNs still fall short in terms of multi-scale temporal processing compared to their biological counterparts. This limitation has resulted in poor performance in many pattern recognition tasks with information that varies across different timescales. To address thi… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

  3. arXiv:2408.13987  [pdf, other

    cs.CL cs.AI

    Focused Large Language Models are Stable Many-Shot Learners

    Authors: Peiwen Yuan, Shaoxiong Feng, Yiwei Li, Xinglin Wang, Yueqi Zhang, Chuyi Tan, Boyuan Pan, Heda Wang, Yao Hu, Kan Li

    Abstract: In-Context Learning (ICL) enables large language models (LLMs) to achieve rapid task adaptation by learning from demonstrations. With the increase in available context length of LLMs, recent experiments have shown that the performance of ICL does not necessarily scale well in many-shot (demonstration) settings. We theoretically and experimentally confirm that the reason lies in more demonstrations… ▽ More

    Submitted 25 August, 2024; originally announced August 2024.

    Comments: 15 pages

  4. Contemporaneous X-ray Observations of 30 Bright Radio Bursts from the Prolific Fast Radio Burst Source FRB 20220912A

    Authors: Amanda M. Cook, Paul Scholz, Aaron B. Pearlman, Thomas C. Abbott, Marilyn Cruces, B. M. Gaensler, Fengqiu, Dong, Daniele Michilli, Gwendolyn Eadie, Victoria M. Kaspi, Ingrid Stairs, Chia Min Tan, Mohit Bhardwaj, Tomas Cassanelli, Alice P. Curtin, Adaeze L. Ibik, Mattias Lazda, Kiyoshi W. Masui, Ayush Pandhi, Masoud Rafiei-Ravandi, Mawson W. Sammons, Kaitlyn Shin, Kendrick Smith, David C. Stenning

    Abstract: We present an extensive contemporaneous X-ray and radio campaign performed on the repeating fast radio burst (FRB) source FRB 20220912A for eight weeks immediately following the source's detection by CHIME/FRB. This includes X-ray data from XMM-Newton, NICER, and Swift, and radio detections of FRB 20220912A from CHIME/Pulsar and Effelsberg. We detect no significant X-ray emission at the time of 30… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: 23 pages, 3 figures. ApJ in press (accepted after resubmission July 19th, 2024)

  5. arXiv:2408.11330  [pdf, other

    cs.LG cs.CL

    Design Principle Transfer in Neural Architecture Search via Large Language Models

    Authors: Xun Zhou, Liang Feng, Xingyu Wu, Zhichao Lu, Kay Chen Tan

    Abstract: Transferable neural architecture search (TNAS) has been introduced to design efficient neural architectures for multiple tasks, to enhance the practical applicability of NAS in real-world scenarios. In TNAS, architectural knowledge accumulated in previous search processes is reused to warm up the architecture search for new tasks. However, existing TNAS methods still search in an extensive search… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

  6. arXiv:2408.10316  [pdf, other

    astro-ph.GA astro-ph.CO

    Project Dinos II: Redshift evolution of dark and luminous matter density profiles in strong-lensing elliptical galaxies across $0.1 < z < 0.9$

    Authors: William Sheu, Anowar J. Shajib, Tommaso Treu, Alessandro Sonnenfeld, Simon Birrer, Michele Cappellari, Lindsay J. Oldham, Chin Yi Tan

    Abstract: We present a new measurement of the dark and luminous matter distribution of massive elliptical galaxies, and their evolution with redshift, by combining strong lensing and dynamical observables. Our sample of 58 lens galaxies covers a redshift range of $0.090\leq z_{\rm l}\leq0.884$. By combining new Hubble Space Telescope imaging with previously observed velocity dispersion and line-of-sight mea… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: 28 pages, 20 figures

  7. arXiv:2408.10287  [pdf

    physics.optics cs.AI eess.IV

    Recognizing Beam Profiles from Silicon Photonics Gratings using Transformer Model

    Authors: Yu Dian Lim, Hong Yu Li, Simon Chun Kiat Goh, Xiangyu Wang, Peng Zhao, Chuan Seng Tan

    Abstract: Over the past decade, there has been extensive work in developing integrated silicon photonics (SiPh) gratings for the optical addressing of trapped ion qubits in the ion trap quantum computing community. However, when viewing beam profiles from infrared (IR) cameras, it is often difficult to determine the corresponding heights where the beam profiles are located. In this work, we developed transf… ▽ More

    Submitted 22 August, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

  8. arXiv:2408.09691  [pdf, ps, other

    math.CA

    Regularity of Fourier integrals on product spaces

    Authors: Chaoqiang Tan, Zipeng Wang

    Abstract: We study a family of Fourier integral operators by allowing their symbols to satisfy a multi-parameter differential inequality on R^N. We show that these operators of order -(N-1)/2 are bounded from classical, atom decomposable H^1-Hardy space to L^1(R^N). Consequently, we obtain a sharp L^p-regularity result due to Seeger, Sogge and Stein.

    Submitted 28 August, 2024; v1 submitted 18 August, 2024; originally announced August 2024.

    Comments: We corrected some typos and errors from the previous version

  9. arXiv:2408.09647  [pdf, other

    cs.CV

    C2P-CLIP: Injecting Category Common Prompt in CLIP to Enhance Generalization in Deepfake Detection

    Authors: Chuangchuang Tan, Renshuai Tao, Huan Liu, Guanghua Gu, Baoyuan Wu, Yao Zhao, Yunchao Wei

    Abstract: This work focuses on AIGC detection to develop universal detectors capable of identifying various types of forgery images. Recent studies have found large pre-trained models, such as CLIP, are effective for generalizable deepfake detection along with linear classifiers. However, two critical issues remain unresolved: 1) understanding why CLIP features are effective on deepfake detection through a… ▽ More

    Submitted 18 August, 2024; originally announced August 2024.

    Comments: 10 pages, 5 figures

  10. arXiv:2408.08044  [pdf, other

    cs.CE

    Crystalline Material Discovery in the Era of Artificial Intelligence

    Authors: Zhenzhong Wang, Haowei Hua, Wanyu Lin, Ming Yang, Kay Chen Tan

    Abstract: Crystalline materials, with their symmetrical and periodic structures, possess a diverse array of properties and have been widely used in various fields, ranging from electronic devices to energy applications. To discover crystalline materials, traditional experimental and computational approaches are often time-consuming and expensive. In these years, thanks to the explosive amount of crystalline… ▽ More

    Submitted 23 August, 2024; v1 submitted 15 August, 2024; originally announced August 2024.

  11. arXiv:2408.07176  [pdf, other

    cs.NE

    Surrogate-Assisted Search with Competitive Knowledge Transfer for Expensive Optimization

    Authors: Xiaoming Xue, Yao Hu, Liang Feng, Kai Zhang, Linqi Song, Kay Chen Tan

    Abstract: Expensive optimization problems (EOPs) have attracted increasing research attention over the decades due to their ubiquity in a variety of practical applications. Despite many sophisticated surrogate-assisted evolutionary algorithms (SAEAs) that have been developed for solving such problems, most of them lack the ability to transfer knowledge from previously-solved tasks and always start their sea… ▽ More

    Submitted 20 August, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

    Comments: 22 pages, 14 figures

  12. arXiv:2408.06351  [pdf, other

    eess.SP math.ST

    A Probabilistic Approach for Queue Length Estimation Using License Plate Recognition Data: Considering Overtaking in Multi-lane Scenarios

    Authors: Lyuzhou Luo, Hao Wu, Jiahao Liu, Keshuang Tang, Chaopeng Tan

    Abstract: Multi-section license plate recognition (LPR) data provides input-output information and sampled travel times of the investigated link, serving as an ideal data source for lane-based queue length estimation in recent studies. However, most of these studies assumed the strict FIFO rule or a specific arrival process, thus ignoring the potential impact of overtaking and the variation of traffic flows… ▽ More

    Submitted 24 July, 2024; originally announced August 2024.

    Comments: 30 pages, 20 figures

  13. arXiv:2408.03506  [pdf, ps, other

    cs.CL

    1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality Data

    Authors: Calvin Tan, Jerome Wang

    Abstract: This paper presents a compute-efficient approach to pre-training a Language Model-the "1.5-Pints"-in only 9 days, while outperforming state-of-the-art models as an instruction-following assistant.Based on MT-Bench (a benchmark that emulates human judgments), 1.5-Pints outperforms Apple's OpenELM and Microsoft's Phi.This is achieved by a carefully curated pre-training dataset of 57 billion tokens,… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: Technical Report for 1.5-Pints

  14. arXiv:2408.03211  [pdf, ps, other

    math.FA

    Boundedness of New Type Fourier Integral Operators with Product Structure

    Authors: Chaoqiang Tan, Zipeng Wang

    Abstract: We investigate a class of Fourier integral operators with weakened symbols, which satisfy a multi-parameter differential inequality in $\R^n$. We establish that these operators retain the classical $L^p$ boundedness and the $H^1$ to $L^1$ boundedness. Notably, the Hardy space considered here is the traditional single-parameter Hardy space rather than a product Hardy space.

    Submitted 6 August, 2024; originally announced August 2024.

    MSC Class: Primary 42B20; Secondary 42B30; 42B37; 42B15

  15. arXiv:2408.01735  [pdf, other

    quant-ph cond-mat.mes-hall physics.optics

    Something from Nothing: A Theoretical Framework for Enhancing or Enabling Cooling of a Mechanical Resonator via the anti-Stokes or Stokes Interaction and Zero-Photon Detection

    Authors: Jack Clarke, Evan A. Cryer-Jenkins, Arjun Gupta, Kyle D. Major, Jinglei Zhang, Georg Enzian, Magdalena Szczykulska, Anthony C. Leung, Harsh Rathee, Andreas Ø. Svela, Anthony K. C. Tan, Almut Beige, Klaus Mølmer, Michael R. Vanner

    Abstract: We develop a theoretical framework to describe how zero-photon detection may be utilized to enhance laser cooling via the anti-Stokes interaction and, somewhat surprisingly, enable cooling via the Stokes interaction commonly associated with heating. Our description includes both pulsed and continuous measurements as well as optical detection efficiency and open-system dynamics. For both cases, we… ▽ More

    Submitted 6 August, 2024; v1 submitted 3 August, 2024; originally announced August 2024.

    Comments: 15 pages, 6 figures

  16. arXiv:2408.01734  [pdf, other

    quant-ph cond-mat.mes-hall physics.optics

    Something from Nothing: Enhanced Laser Cooling of a Mechanical Resonator via Zero-Photon Detection

    Authors: Evan A. Cryer-Jenkins, Kyle D. Major, Jack Clarke, Georg Enzian, Magdalena Szczykulska, Jinglei Zhang, Arjun Gupta, Anthony C. Leung, Harsh Rathee, Andreas Ø. Svela, Anthony K. C. Tan, Almut Beige, Klaus Mølmer, Michael R. Vanner

    Abstract: Throughout quantum science and technology, measurement is used as a powerful resource for nonlinear operations and quantum state engineering. In particular, single-photon detection is commonly employed for quantum-information applications and tests of fundamental physics. By contrast, and perhaps counter-intuitively, measurement of the absence of photons also provides useful information, and offer… ▽ More

    Submitted 6 August, 2024; v1 submitted 3 August, 2024; originally announced August 2024.

    Comments: Main: 5 pages, 2 figures. Supplemental: 6 pages, 2 figures

  17. arXiv:2408.01669  [pdf, other

    cs.CV cs.MM

    SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses

    Authors: Chaolei Tan, Zihang Lin, Junfu Pu, Zhongang Qi, Wei-Yi Pei, Zhi Qu, Yexin Wang, Ying Shan, Wei-Shi Zheng, Jian-Fang Hu

    Abstract: Video grounding is a fundamental problem in multimodal content understanding, aiming to localize specific natural language queries in an untrimmed video. However, current video grounding datasets merely focus on simple events and are either limited to shorter videos or brief sentences, which hinders the model from evolving toward stronger multimodal understanding capabilities. To address these lim… ▽ More

    Submitted 18 August, 2024; v1 submitted 3 August, 2024; originally announced August 2024.

    Comments: Accepted to ACM MM 2024. Project page: https://synopground.github.io/

  18. arXiv:2408.01551  [pdf, other

    cs.SD eess.AS

    PiCoGen2: Piano cover generation with transfer learning approach and weakly aligned data

    Authors: Chih-Pin Tan, Hsin Ai, Yi-Hsin Chang, Shuen-Huei Guan, Yi-Hsuan Yang

    Abstract: Piano cover generation aims to create a piano cover from a pop song. Existing approaches mainly employ supervised learning and the training demands strongly-aligned and paired song-to-piano data, which is built by remapping piano notes to song audio. This would, however, result in the loss of piano information and accordingly cause inconsistencies between the original and remapped piano versions.… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: Accepted at the 25th International Society for Music Information Retrieval Conference (ISMIR), 2024

  19. arXiv:2408.00865  [pdf, other

    astro-ph.GA

    A Pride of Satellites in the Constellation Leo? Discovery of the Leo VI Milky Way Satellite Galaxy with DELVE Early Data Release 3

    Authors: C. Y. Tan, W. Cerny, A. Drlica-Wagner, A. B. Pace, M. Geha, A. P. Ji, T. S. Li, M. Adamów, D. Anbajagane, C. R. Bom, J. A. Carballo-Bello, J. L. Carlin, C. Chang, Y. Choi, M. L. M. Collins, A. Doliva-Dolinsky, P. S. Ferguson, R. A. Gruendl, D. J. James, G. Limberg, M. Navabi, D. Martínez-Delgado, C. E. Martínez-Vázquez, G. E. Medina, B. Mutlu-Pakdil , et al. (9 additional authors not shown)

    Abstract: We report the discovery and spectroscopic confirmation of an ultra-faint Milky Way (MW) satellite in the constellation of Leo. This system was discovered as a spatial overdensity of resolved stars observed with Dark Energy Camera (DECam) data from an early version of the third data release of the DECam Local Volume Exploration survey (DELVE EDR3). The low luminosity ($M_V = -3.56_{-0.37}^{+0.47}$;… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: 21 pages, 11 figures, 2 tables; to be submitted to AAS Journals

    Report number: FERMILAB-PUB-24-0358-LDRD-PPD

  20. arXiv:2407.21713  [pdf, other

    cs.LG cs.AI

    Social Learning through Interactions with Other Agents: A Survey

    Authors: Dylan Hillier, Cheston Tan, Jing Jiang

    Abstract: Social learning plays an important role in the development of human intelligence. As children, we imitate our parents' speech patterns until we are able to produce sounds; we learn from them praising us and scolding us; and as adults, we learn by working with others. In this work, we survey the degree to which this paradigm -- social learning -- has been mirrored in machine learning. In particular… ▽ More

    Submitted 3 August, 2024; v1 submitted 31 July, 2024; originally announced July 2024.

    Comments: To be published in IJCAI 2024, available on http://www.ijcai.org

    ACM Class: I.2.7; I.2.0

  21. arXiv:2407.21242  [pdf, other

    stat.AP stat.CO

    Supervised brain node and network construction under voxel-level functional imaging

    Authors: Wanwan Xu, Selena Wang, Chichun Tan, Xilin Shen, Wenjing Luo, Todd Constable, Tianxi Li, Yize Zhao

    Abstract: Recent advancements in understanding the brain's functional organization related to behavior have been pivotal, particularly in the development of predictive models based on brain connectivity. Traditional methods in this domain often involve a two-step process by first constructing a connectivity matrix from predefined brain regions, and then linking these connections to behaviors or clinical out… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

  22. PiCoGen: Generate Piano Covers with a Two-stage Approach

    Authors: Chih-Pin Tan, Shuen-Huei Guan, Yi-Hsuan Yang

    Abstract: Cover song generation stands out as a popular way of music making in the music-creative community. In this study, we introduce Piano Cover Generation (PiCoGen), a two-stage approach for automatic cover song generation that transcribes the melody line and chord progression of a song given its audio recording, and then uses the resulting lead sheet as the condition to generate a piano cover in the s… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: Published at ICMR 2024 (project page: https://tanchihpin0517.github.io/PiCoGen/)

  23. arXiv:2407.17448  [pdf, ps, other

    hep-ph

    Chiral-even twist-3 GPDs for the proton in a spectator diquark model

    Authors: Chentao Tan, Zhun Lu

    Abstract: We investigate the chiral-even twist-3 generalized parton distributions (GPDs) of valence quarks in the proton at nonzero skewness $ξ$, using a spectator model with scalar and axial-vector diquarks. We consider the exponential form factor for the nucleon-quark-diquark vertex and the axial-vector diquark with light-cone transverse polarization. We analyze the dependence of GPDs on the longitudinal… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: 18 pages, 8 figures

  24. arXiv:2407.16148  [pdf, other

    cs.CL

    CHIME: LLM-Assisted Hierarchical Organization of Scientific Studies for Literature Review Support

    Authors: Chao-Chun Hsu, Erin Bransom, Jenna Sparks, Bailey Kuehl, Chenhao Tan, David Wadden, Lucy Lu Wang, Aakanksha Naik

    Abstract: Literature review requires researchers to synthesize a large amount of information and is increasingly challenging as the scientific literature expands. In this work, we investigate the potential of LLMs for producing hierarchical organizations of scientific studies to assist researchers with literature review. We define hierarchical organizations as tree structures where nodes refer to topical ca… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: 2024 ACL Findings

  25. arXiv:2407.15734  [pdf, other

    cs.AI cs.MA

    TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSON

    Authors: John Chong Min Tan, Prince Saroj, Bharat Runwal, Hardik Maheshwari, Brian Lim Yi Sheng, Richard Cottrill, Alankrit Chona, Ambuj Kumar, Mehul Motani

    Abstract: TaskGen is an open-sourced agentic framework which uses an Agent to solve an arbitrary task by breaking them down into subtasks. Each subtask is mapped to an Equipped Function or another Agent to execute. In order to reduce verbosity (and hence token usage), TaskGen uses StrictJSON that ensures JSON output from the Large Language Model (LLM), along with additional features such as type checking an… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: 53 pages

  26. arXiv:2407.12176  [pdf, other

    cs.CY cs.AI cs.CL

    GPT-4V Cannot Generate Radiology Reports Yet

    Authors: Yuyang Jiang, Chacha Chen, Dang Nguyen, Benjamin M. Mervak, Chenhao Tan

    Abstract: GPT-4V's purported strong multimodal abilities raise interests in using it to automate radiology report writing, but there lacks thorough evaluations. In this work, we perform a systematic evaluation of GPT-4V in generating radiology reports on two chest X-ray report datasets: MIMIC-CXR and IU X-Ray. We attempt to directly generate reports using GPT-4V through different prompting strategies and fi… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 24 pages, 3 figures, code: https://github.com/YuyangJ0/GPT-4V-evaluation-radiology-report

  27. arXiv:2407.11845  [pdf, other

    astro-ph.GA astro-ph.SR

    Asymmetric Kinematics in Young Clusters: The λ Ori Cluster

    Authors: Joseph J. Armstrong, Jonathan C. Tan

    Abstract: Context. Most stars form in clusters or associations but only a small number of these groups are expected to remain bound for longer than a few Myr. Once star formation has ended and the molecular gas around young stellar objects has been expelled via feedback processes, most initially bound young clusters lose the majority of their binding mass and begin to disperse into the Galactic field. Aims.… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 20 pages, 17 figures, submitted to A&A

  28. arXiv:2407.10058  [pdf, other

    cs.CL cs.AI

    Learning to Refuse: Towards Mitigating Privacy Risks in LLMs

    Authors: Zhenhua Liu, Tong Zhu, Chuanyuan Tan, Wenliang Chen

    Abstract: Large language models (LLMs) exhibit remarkable capabilities in understanding and generating natural language. However, these models can inadvertently memorize private information, posing significant privacy risks. This study addresses the challenge of enabling LLMs to protect specific individuals' private data without the need for complete retraining. We propose \return, a Real-world pErsonal daT… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

  29. arXiv:2407.09949  [pdf, other

    astro-ph.GA

    The formation of supermassive black holes from Population III.1 seeds. III. Galaxy evolution and black hole growth from semi-analytic modelling

    Authors: Vieri Cammelli, Pierluigi Monaco, Jonathan C. Tan, Jasbir Singh, Fabio Fontanot, Gabriella De Lucia, Michaela Hirschmann, Lizhi Xie

    Abstract: We present an implementation of Pop III.1 seeding of supermassive black holes (SMBHs) in a theoretical model of galaxy formation and evolution to assess the growth the SMBH population and the properties of the host galaxies. The model of Pop III.1 seeding involves SMBH formation at redshifts $z\gtrsim 20$ in dark matter minihalos that are isolated from external radiative feedback, parameterized by… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: Submitted to MNRAS, comments welcome

  30. arXiv:2407.09045  [pdf, other

    cs.IR cs.AI

    Time-Frequency Analysis of Variable-Length WiFi CSI Signals for Person Re-Identification

    Authors: Chen Mao, Chong Tan, Jingqi Hu, Min Zheng

    Abstract: Person re-identification (ReID), as a crucial technology in the field of security, plays an important role in security detection and people counting. Current security and monitoring systems largely rely on visual information, which may infringe on personal privacy and be susceptible to interference from pedestrian appearances and clothing in certain scenarios. Meanwhile, the widespread use of rout… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  31. arXiv:2407.07480  [pdf, other

    astro-ph.HE

    The discovery of a nearby 421~s transient with CHIME/FRB/Pulsar

    Authors: Fengqiu Adam Dong, Tracy Clarke, Alice P. Curtin, Ajay Kumar, Ingrid Stairs, Shami Chatterjee, Amanda M. Cook, Emmanuel Fonseca, B. M. Gaensler, Jason W. T. Hessels, Victoria M. Kaspi, Mattias Lazda, Kiyoshi W. Masui, James W. McKee, Bradley W. Meyers, Aaron B. Pearlman, Scott M. Ransom, Paul Scholz, Kaitlyn Shin, Kendrick M. Smith, Chia Min Tan

    Abstract: Neutron stars and white dwarfs are both dense remnants of post-main-sequence stars. Pulsars, magnetars and strongly magnetised white dwarfs have all been seen to been observed to exhibit coherent, pulsed radio emission in relation to their rotational period. Recently, a new type of radio long period transient (LPT) has been discovered. The bright radio emission of LPTs resembles that of radio puls… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Submitted

  32. arXiv:2407.05410  [pdf, other

    cs.SE cs.DB cs.LG cs.LO

    Synthetic Test Data Generation Using Recurrent Neural Networks: A Position Paper

    Authors: Razieh Behjati, Erik Arisholm, Chao Tan, Margrethe M. Bedregal

    Abstract: Testing in production-like test environments is an essential part of quality assurance processes in many industries. Provisioning of such test environments, for information-intensive services, involves setting up databases that are rich-enough to enable simulating a wide variety of user scenarios. While production data is perhaps the gold-standard here, many organizations, particularly within the… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: This paper was published in the proceedings of RAISE@ICSE in 2019

    Journal ref: Proceedings of the 7th International Workshop on Realizing Artificial Intelligence Synergies in Software Engineering, RAISE@ICSE 2019, (2019), 22-27

  33. arXiv:2407.04069  [pdf, other

    cs.CL cs.AI cs.LG

    A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations

    Authors: Md Tahmid Rahman Laskar, Sawsan Alqahtani, M Saiful Bari, Mizanur Rahman, Mohammad Abdullah Matin Khan, Haidar Khan, Israt Jahan, Amran Bhuiyan, Chee Wei Tan, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty, Jimmy Huang

    Abstract: Large Language Models (LLMs) have recently gained significant attention due to their remarkable capabilities in performing diverse tasks across various domains. However, a thorough evaluation of these models is crucial before deploying them in real-world applications to ensure they produce reliable performance. Despite the well-established importance of evaluating LLMs in the community, the comple… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  34. arXiv:2407.01418  [pdf, other

    cs.RO cs.AI cs.LG

    RoboPack: Learning Tactile-Informed Dynamics Models for Dense Packing

    Authors: Bo Ai, Stephen Tian, Haochen Shi, Yixuan Wang, Cheston Tan, Yunzhu Li, Jiajun Wu

    Abstract: Tactile feedback is critical for understanding the dynamics of both rigid and deformable objects in many manipulation tasks, such as non-prehensile manipulation and dense packing. We introduce an approach that combines visual and tactile sensing for robotic manipulation by learning a neural, tactile-informed dynamics model. Our proposed framework, RoboPack, employs a recurrent graph neural network… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Robotics: Science and Systems (RSS), 2024. Project page: https://robo-pack.github.io/

    ACM Class: I.2.9; I.2.6; I.2.10

  35. arXiv:2407.00050  [pdf, other

    q-bio.BM cs.AI cs.LG

    FoldToken2: Learning compact, invariant and generative protein structure language

    Authors: Zhangyang Gao, Cheng Tan, Stan Z. Li

    Abstract: The equivalent nature of 3D coordinates has posed long term challenges in protein structure representation learning, alignment, and generation. Can we create a compact and invariant language that equivalently represents protein structures? Towards this goal, we propose FoldToken2 to transfer equivariant structures into discrete tokens, while maintaining the recoverability of the original structure… ▽ More

    Submitted 11 June, 2024; originally announced July 2024.

  36. arXiv:2406.16603  [pdf, other

    cond-mat.mtrl-sci

    Bipolarized Weyl semimetals and quantum crystal valley Hall effect in two-dimensional altermagnetic materials

    Authors: Chao-Yang Tan, Ze-Feng Gao, Huan-Cheng Yang, Kai Liu, Peng-Jie Guo, Zhong-Yi Lu

    Abstract: Magnetism and topology are two major areas of condensed matter physics. The combination of magnetism and topology gives rise to more novel physical effects, which have attracted strongly theoretical and experimental attention. Recently, the concept of altermagnetism has been introduced, characterized by a dual nature: real-space antiferromagnetism and reciprocal-space anisotropic spin polarization… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 7 pages, 5 figures

  37. arXiv:2406.15238  [pdf, other

    physics.acc-ph

    Fermilab Booster Beam Emittances from Quadrupole Modes Measured by BPMs

    Authors: C. Y. Tan, M. Balcewicz

    Abstract: The measurement of beam emittances by extracting the quadrupole mode signal from a 4 plate beam position monitor (BPM) was published at least 40 years ago. Unfortunately, in practice, this method suffers from poor signal to noise ratio and requires a lot of tuning to extract out the emittances. In this paper, an improved method where multiple BPMs are used together with better mathematical analysi… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 15th International Particle Accelerator Conference (IPAC'24)

    Report number: FERMILAB-CONF-24-0179-AD

  38. arXiv:2406.14359  [pdf, other

    cs.NE

    Learning to Transfer for Evolutionary Multitasking

    Authors: Sheng-Hao Wu, Yuxiao Huang, Xingyu Wu, Liang Feng, Zhi-Hui Zhan, Kay Chen Tan

    Abstract: Evolutionary multitasking (EMT) is an emerging approach for solving multitask optimization problems (MTOPs) and has garnered considerable research interest. The implicit EMT is a significant research branch that utilizes evolution operators to enable knowledge transfer (KT) between tasks. However, current approaches in implicit EMT face challenges in adaptability, due to the use of a limited numbe… ▽ More

    Submitted 22 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Under review

  39. arXiv:2406.14108  [pdf, other

    math.OC

    Connected Vehicle Data-driven Robust Optimization for Traffic Signal Timing: Modeling Traffic Flow Variability and Errors

    Authors: Chaopeng Tan, Yue Ding, Kaidi Yang, Hong Zhu, Keshuang Tang

    Abstract: Recent advancements in Connected Vehicle (CV) technology have prompted research on leveraging CV data for more effective traffic management. Despite the low penetration rate, such detailed CV data has demonstrated great potential in improving traffic signal performance. However, existing studies share a common shortcoming in that they all ignore traffic flow estimation errors in their modeling pro… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted for podium session of the Conference in Emerging Technologies in Transportation Systems (TRC-30)

  40. arXiv:2406.13434  [pdf, other

    cs.RO

    Tactile Aware Dynamic Obstacle Avoidance in Crowded Environment with Deep Reinforcement Learning

    Authors: Yung Chuen Ng, Qi Wen, Lim, Chun Ye Tan, Zhen Hao Gan, Meng Yee, Chuah

    Abstract: Mobile robots operating in crowded environments require the ability to navigate among humans and surrounding obstacles efficiently while adhering to safety standards and socially compliant mannerisms. This scale of the robot navigation problem may be classified as both a local path planning and trajectory optimization problem. This work presents an array of force sensors that act as a tactile laye… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  41. arXiv:2406.12266  [pdf, other

    cs.CL

    Towards a Client-Centered Assessment of LLM Therapists by Client Simulation

    Authors: Jiashuo Wang, Yang Xiao, Yanran Li, Changhe Song, Chunpu Xu, Chenhao Tan, Wenjie Li

    Abstract: Although there is a growing belief that LLMs can be used as therapists, exploring LLMs' capabilities and inefficacy, particularly from the client's perspective, is limited. This work focuses on a client-centered assessment of LLM therapists with the involvement of simulated clients, a standard approach in clinical medical education. However, there are two challenges when applying the approach to a… ▽ More

    Submitted 20 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  42. arXiv:2406.10840  [pdf, other

    cs.LG cs.AI q-bio.BM

    CBGBench: Fill in the Blank of Protein-Molecule Complex Binding Graph

    Authors: Haitao Lin, Guojiang Zhao, Odin Zhang, Yufei Huang, Lirong Wu, Zicheng Liu, Siyuan Li, Cheng Tan, Zhifeng Gao, Stan Z. Li

    Abstract: Structure-based drug design (SBDD) aims to generate potential drugs that can bind to a target protein and is greatly expedited by the aid of AI techniques in generative models. However, a lack of systematic understanding persists due to the diverse settings, complex implementation, difficult reproducibility, and task singularity. Firstly, the absence of standardization can lead to unfair compariso… ▽ More

    Submitted 22 July, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: 9 pages main context

  43. Massive Dirac Fermions and Strong Shubnikov-de Haas Oscillations in Topological Insulator Sm,Fe:Bi2Se3 Single Crystals

    Authors: Weiyao Zhao, Chi Xuan Trang, Qile Li, Lei Chen, Zengji Yue, Abdulhakim Bake, Cheng Tan, Lan Wang, Mitchell Nancarrow, Mark Edmonds, David Cortie, Xiaolin Wang

    Abstract: Topological insulators (TIs) are emergent materials with unique band structure, which allow the study of quantum effect in solids, as well as contribute to high performance quantum devices. To achieve the better performance of TI, here we present a co-doping strategy using synergistic rare-earth Sm and transition-metal Fe dopants in Bi2Se3 single crystals, which combine the advantages of both tran… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 5 figures

    Journal ref: Physical Review B 104, 085153 (2021)

  44. arXiv:2406.08987  [pdf, other

    cs.NE

    Autonomous Multi-Objective Optimization Using Large Language Model

    Authors: Yuxiao Huang, Shenghao Wu, Wenjie Zhang, Jibin Wu, Liang Feng, Kay Chen Tan

    Abstract: Multi-objective optimization problems (MOPs) are ubiquitous in real-world applications, presenting a complex challenge of balancing multiple conflicting objectives. Traditional evolutionary algorithms (EAs), though effective, often rely on domain-specific expertise and iterative fine-tuning, hindering adaptability to unseen MOPs. In recent years, the advent of Large Language Models (LLMs) has revo… ▽ More

    Submitted 26 July, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: 14 pages, 11 figures, 6 tables

  45. arXiv:2406.05688  [pdf, other

    cs.CL cs.AI cs.LG

    Peer Review as A Multi-Turn and Long-Context Dialogue with Role-Based Interactions

    Authors: Cheng Tan, Dongxin Lyu, Siyuan Li, Zhangyang Gao, Jingxuan Wei, Siqi Ma, Zicheng Liu, Stan Z. Li

    Abstract: Large Language Models (LLMs) have demonstrated wide-ranging applications across various fields and have shown significant potential in the academic peer-review process. However, existing applications are primarily limited to static review generation based on submitted papers, which fail to capture the dynamic and iterative nature of real-world peer reviews. In this paper, we reformulate the peer-r… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Under review

  46. arXiv:2406.03198  [pdf, other

    cs.CL cs.HC cs.LG stat.AP stat.ML

    The Impossibility of Fair LLMs

    Authors: Jacy Anthis, Kristian Lum, Michael Ekstrand, Avi Feller, Alexander D'Amour, Chenhao Tan

    Abstract: The need for fair AI is increasingly clear in the era of general-purpose systems such as ChatGPT, Gemini, and other large language models (LLMs). However, the increasing complexity of human-AI interaction and its social impacts have raised questions of how fairness standards could be applied. Here, we review the technical frameworks that machine learning researchers have used to evaluate fairness,… ▽ More

    Submitted 28 May, 2024; originally announced June 2024.

    Comments: Presented at the 1st Human-Centered Evaluation and Auditing of Language Models (HEAL) workshop at CHI 2024

  47. arXiv:2406.02234  [pdf, other

    cs.LG cs.AI math.DS stat.ML

    On the Limitations of Fractal Dimension as a Measure of Generalization

    Authors: Charlie Tan, Inés García-Redondo, Qiquan Wang, Michael M. Bronstein, Anthea Monod

    Abstract: Bounding and predicting the generalization gap of overparameterized neural networks remains a central open problem in theoretical machine learning. Neural network optimization trajectories have been proposed to possess fractal structure, leading to bounds and generalization measures based on notions of fractal dimension on these trajectories. Prominently, both the Hausdorff dimension and the persi… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 17 pages, 6 figures

  48. arXiv:2406.01627  [pdf, other

    q-bio.GN cs.LG

    GenBench: A Benchmarking Suite for Systematic Evaluation of Genomic Foundation Models

    Authors: Zicheng Liu, Jiahui Li, Siyuan Li, Zelin Zang, Cheng Tan, Yufei Huang, Yajing Bai, Stan Z. Li

    Abstract: The Genomic Foundation Model (GFM) paradigm is expected to facilitate the extraction of generalizable representations from massive genomic data, thereby enabling their application across a spectrum of downstream applications. Despite advancements, a lack of evaluation framework makes it difficult to ensure equitable assessment due to experimental settings, model intricacy, benchmark datasets, and… ▽ More

    Submitted 5 June, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

  49. arXiv:2406.01333  [pdf, other

    cs.CL cs.AI

    Probing Language Models for Pre-training Data Detection

    Authors: Zhenhua Liu, Tong Zhu, Chuanyuan Tan, Haonan Lu, Bing Liu, Wenliang Chen

    Abstract: Large Language Models (LLMs) have shown their impressive capabilities, while also raising concerns about the data contamination problems due to privacy issues and leakage of benchmark datasets in the pre-training phase. Therefore, it is vital to detect the contamination by checking whether an LLM has been pre-trained on the target texts. Recent studies focus on the generated texts and compute perp… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted by ACL-2024 main conference

  50. arXiv:2405.20834  [pdf, other

    cs.CV

    Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits Multimodal Reasoning

    Authors: Cheng Tan, Jingxuan Wei, Linzhuang Sun, Zhangyang Gao, Siyuan Li, Bihui Yu, Ruifeng Guo, Stan Z. Li

    Abstract: Large language models equipped with retrieval-augmented generation (RAG) represent a burgeoning field aimed at enhancing answering capabilities by leveraging external knowledge bases. Although the application of RAG with language-only models has been extensively explored, its adaptation into multimodal vision-language models remains nascent. Going beyond mere answer generation, the primary goal of… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Under review