Showing 1–50 of 148 results for author: Wei, G

Searching in archive cs.
  1. arXiv:2408.12246  [pdf, other]

    cs.CV

    OVA-DETR: Open Vocabulary Aerial Object Detection Using Image-Text Alignment and Fusion

    Authors: Guoting Wei, Xia Yuan, Yu Liu, Zhenhao Shang, Kelu Yao, Chao Li, Qingsen Yan, Chunxia Zhao, Haokui Zhang, Rong Xiao

    Abstract: Aerial object detection has been a hot topic for many years due to its wide application requirements. However, most existing approaches can only handle predefined categories, which limits their applicability to open scenarios in the real world. In this paper, we extend aerial object detection to open scenarios by exploiting the relationship between image and text, and propose OVA-DETR, a high-eff…

    Submitted 22 August, 2024; originally announced August 2024.

  2. arXiv:2408.01246  [pdf, other]

    cs.CR

    MapComp: A Secure View-based Collaborative Analytics Framework for Join-Group-Aggregation

    Authors: Xinyu Peng, Feng Han, Li Peng, Weiran Liu, Zheng Yan, Kai Kang, Xinyuan Zhang, Guoxing Wei, Jianling Sun, Jinfei Liu

    Abstract: This paper introduces MapComp, a novel view-based framework to facilitate join-group-aggregation (JGA) queries for collaborative analytics. Through a specially crafted materialized view for join and a novel design of group-aggregation (GA) protocols, MapComp removes duplicated join workload and expedites subsequent GA, improving the efficiency of JGA query execution. To support continuous data updates…

    Submitted 15 August, 2024; v1 submitted 2 August, 2024; originally announced August 2024.

    Comments: 12 pages

  3. arXiv:2408.00220  [pdf, other]

    math.DG cs.LG

    Persistent de Rham-Hodge Laplacians in the Eulerian representation

    Authors: Zhe Su, Yiying Tong, Guo-Wei Wei

    Abstract: Recently, topological data analysis (TDA) has become a trending topic in data science and engineering. However, the key technique of TDA, i.e., persistent homology, is defined on point cloud data, which restricts its scope. In this work, we propose persistent de Rham-Hodge Laplacian, or persistent Hodge Laplacian (PHL) for abbreviation, for the TDA on manifolds with boundaries, or volumetric data.…

    Submitted 31 July, 2024; originally announced August 2024.

  4. arXiv:2408.00118  [pdf, other]

    cs.CL cs.AI

    Gemma 2: Improving Open Language Models at a Practical Size

    Authors: Gemma Team, Morgane Riviere, Shreya Pathak, Pier Giuseppe Sessa, Cassidy Hardin, Surya Bhupatiraju, Léonard Hussenot, Thomas Mesnard, Bobak Shahriari, Alexandre Ramé, Johan Ferret, Peter Liu, Pouya Tafti, Abe Friesen, Michelle Casbon, Sabela Ramos, Ravin Kumar, Charline Le Lan, Sammy Jerome, Anton Tsitsulin, Nino Vieillard, Piotr Stanczyk, Sertan Girgin, Nikola Momchev, Matt Hoffman, et al. (172 additional authors not shown)

    Abstract: In this work, we introduce Gemma 2, a new addition to the Gemma family of lightweight, state-of-the-art open models, ranging in scale from 2 billion to 27 billion parameters. In this new version, we apply several known technical modifications to the Transformer architecture, such as interleaving local-global attentions (Beltagy et al., 2020a) and group-query attention (Ainslie et al., 2023). We al…

    Submitted 2 August, 2024; v1 submitted 31 July, 2024; originally announced August 2024.

  5. arXiv:2406.11434  [pdf, other]

    cs.DB

    DB-GPT-Hub: Towards Open Benchmarking Text-to-SQL Empowered by Large Language Models

    Authors: Fan Zhou, Siqiao Xue, Danrui Qi, Wenhui Shi, Wang Zhao, Ganglin Wei, Hongyang Zhang, Caigai Jiang, Gangwei Jiang, Zhixuan Chu, Faqiang Chen

    Abstract: Large language models (LLMs) have become the dominant paradigm for the challenging task of text-to-SQL. LLM-empowered text-to-SQL methods are typically categorized into prompting-based and tuning approaches. Compared to prompting-based methods, benchmarking fine-tuned LLMs for text-to-SQL is important yet under-explored, partially attributed to the prohibitively high computational cost. In this paper,…

    Submitted 17 June, 2024; originally announced June 2024.

  6. arXiv:2406.10126  [pdf, other]

    cs.CV

    Training-free Camera Control for Video Generation

    Authors: Chen Hou, Guoqiang Wei, Yan Zeng, Zhibo Chen

    Abstract: We propose a training-free and robust solution to offer camera movement control for off-the-shelf video diffusion models. Unlike previous work, our method does not require any supervised finetuning on camera-annotated datasets or self-supervised training via data augmentation. Instead, it can be plugged and played with most pretrained video diffusion models and generate camera-controllable videos…

    Submitted 14 June, 2024; originally announced June 2024.

  7. arXiv:2406.06479  [pdf, other]

    cs.LG q-bio.QM

    Graph-Based Bidirectional Transformer Decision Threshold Adjustment Algorithm for Class-Imbalanced Molecular Data

    Authors: Nicole Hayes, Ekaterina Merkurjev, Guo-Wei Wei

    Abstract: Data sets with imbalanced class sizes, where one class size is much smaller than that of others, occur exceedingly often in many applications, including those with biological foundations, such as disease diagnosis and drug discovery. Therefore, it is extremely important to be able to identify data elements of classes of various sizes, as a failure to do so can result in heavy costs. Nonetheless, m…

    Submitted 3 September, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  8. arXiv:2406.00492  [pdf, other]

    eess.IV cs.CV cs.LG

    SAM-VMNet: Deep Neural Networks For Coronary Angiography Vessel Segmentation

    Authors: Xueying Zeng, Baixiang Huang, Yu Luo, Guangyu Wei, Songyan He, Yushuang Shao

    Abstract: Coronary artery disease (CAD) is one of the most prevalent diseases in the cardiovascular field and one of the major contributors to death worldwide. Computed Tomography Angiography (CTA) images are regarded as the authoritative standard for the diagnosis of coronary artery disease, and by performing vessel segmentation and stenosis detection on CTA images, physicians are able to diagnose coronary…

    Submitted 1 June, 2024; originally announced June 2024.

  9. arXiv:2405.14858  [pdf, other]

    cs.CV

    Mamba-R: Vision Mamba ALSO Needs Registers

    Authors: Feng Wang, Jiahao Wang, Sucheng Ren, Guoyizhe Wei, Jieru Mei, Wei Shao, Yuyin Zhou, Alan Yuille, Cihang Xie

    Abstract: Similar to Vision Transformers, this paper identifies artifacts also present within the feature maps of Vision Mamba. These artifacts, corresponding to high-norm tokens emerging in low-information background areas of images, appear much more severe in Vision Mamba -- they exist prevalently even with the tiny-sized model and activate extensively across background regions. To mitigate this issue, we…

    Submitted 23 May, 2024; originally announced May 2024.

  10. arXiv:2405.13858  [pdf, other]

    cs.DC cs.AR cs.ET cs.LG

    Carbon Connect: An Ecosystem for Sustainable Computing

    Authors: Benjamin C. Lee, David Brooks, Arthur van Benthem, Udit Gupta, Gage Hills, Vincent Liu, Benjamin Pierce, Christopher Stewart, Emma Strubell, Gu-Yeon Wei, Adam Wierman, Yuan Yao, Minlan Yu

    Abstract: Computing is at a moment of profound opportunity. Emerging applications -- such as capable artificial intelligence, immersive virtual realities, and pervasive sensor systems -- drive unprecedented demand for computing. Despite recent advances toward net zero carbon emissions, the computing industry's gross energy usage continues to rise at an alarming rate, outpacing the growth of new energy instal…

    Submitted 21 August, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

  11. arXiv:2405.11461  [pdf, other]

    cs.IR cs.AI cs.CL

    DocReLM: Mastering Document Retrieval with Language Model

    Authors: Gengchen Wei, Xinle Pang, Tianning Zhang, Yu Sun, Xun Qian, Chen Lin, Han-Sen Zhong, Wanli Ouyang

    Abstract: With over 200 million published academic documents and millions of new documents being written each year, academic researchers face the challenge of searching for information within this vast corpus. However, existing retrieval systems struggle to understand the semantics and domain knowledge present in academic papers. In this work, we demonstrate that by utilizing large language models, a docume…

    Submitted 19 May, 2024; originally announced May 2024.

  12. arXiv:2405.03987  [pdf, other]

    cs.LG physics.chem-ph

    Navigating Chemical Space with Latent Flows

    Authors: Guanghao Wei, Yining Huang, Chenru Duan, Yue Song, Yuanqi Du

    Abstract: Recent progress of deep generative models in the vision and language domain has stimulated significant interest in more structured data generation such as molecules. However, beyond generating new random molecules, efficient exploration and a comprehensive understanding of the vast chemical space are of great importance to molecular science and applications in drug design and materials discovery.…

    Submitted 7 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  13. arXiv:2405.02803  [pdf, other]

    cs.LG cs.DC

    Is Flash Attention Stable?

    Authors: Alicia Golden, Samuel Hsia, Fei Sun, Bilge Acun, Basil Hosmer, Yejin Lee, Zachary DeVito, Jeff Johnson, Gu-Yeon Wei, David Brooks, Carole-Jean Wu

    Abstract: Training large-scale machine learning models poses distinct system challenges, given both the size and complexity of today's workloads. Recently, many organizations training state-of-the-art Generative AI models have reported cases of instability during training, often taking the form of loss spikes. Numeric deviation has emerged as a potential cause of this training instability, although quantify…

    Submitted 4 May, 2024; originally announced May 2024.

  14. arXiv:2404.19534  [pdf, other]

    cs.CV

    MIPI 2024 Challenge on Nighttime Flare Removal: Methods and Results

    Authors: Yuekun Dai, Dafeng Zhang, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Peiqing Yang, Zhezhu Jin, Guanqun Liu, Chen Change Loy, Lize Zhang, Shuai Liu, Chaoyu Feng, Luyang Wang, Shuan Chen, Guangqi Shao, Xiaotao Wang, Lei Lei, Qirui Yang, Qihua Cheng, Zhiqiang Xu, Yihao Liu, Huanjing Yue, Jingyu Yang, et al. (38 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra…

    Submitted 27 May, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Nighttime Flare Removal Challenge Report. Website: https://mipi-challenge.org/MIPI2024/

  15. arXiv:2404.10209  [pdf, other]

    cs.AI cs.LG

    Demonstration of DB-GPT: Next Generation Data Interaction System Empowered by Large Language Models

    Authors: Siqiao Xue, Danrui Qi, Caigao Jiang, Wenhui Shi, Fangyin Cheng, Keting Chen, Hongjun Yang, Zhiping Zhang, Jianshan He, Hongyang Zhang, Ganglin Wei, Wang Zhao, Fan Zhou, Hong Yi, Shaodong Liu, Hongjun Yang, Faqiang Chen

    Abstract: The recent breakthroughs in large language models (LLMs) are positioned to transition many areas of software. The technologies of interacting with data particularly have an important entanglement with LLMs as efficient and intuitive data interactions are paramount. In this paper, we present DB-GPT, a revolutionary and product-ready Python library that integrates LLMs into traditional data interact…

    Submitted 24 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  16. arXiv:2404.08217  [pdf, other]

    cs.PL

    Avoid Arguments and Escape with Your Self: Expressive Subtyping and Decidable Bidirectional Checking for Reachability Types

    Authors: Songlin Jia, Guannan Wei, Siyuan He, Yuyan Bao, Tiark Rompf

    Abstract: Despite Rust's success in systems programming, its "shared XOR mutable" principle significantly restricts how mutable values can be used, precluding many useful functional programming idioms. Reachability types are a recent proposal to address the key limitations of Rust-style approaches by tracking, rather than prohibiting, shared, escaping, and mutable data, even in the presence of higher-orde…

    Submitted 15 July, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

  17. arXiv:2402.13513  [pdf, other]

    cs.AR

    Guac: Energy-Aware and SSA-Based Generation of Coarse-Grained Merged Accelerators from LLVM-IR

    Authors: Iulian Brumar, Rodrigo Rocha, Alex Bernat, Devashree Tripathy, David Brooks, Gu-Yeon Wei

    Abstract: Designing accelerators for resource- and power-constrained applications is a daunting task. High-level Synthesis (HLS) addresses these constraints through resource sharing, an optimization at the HLS binding stage that maps multiple operations to the same functional unit. However, resource sharing is often limited to reusing instructions within a basic block. Instead of searching globally for th…

    Submitted 20 February, 2024; originally announced February 2024.

  18. arXiv:2402.10186  [pdf, other]

    cs.LG physics.chem-ph physics.comp-ph

    Self-consistent Validation for Machine Learning Electronic Structure

    Authors: Gengyuan Hu, Gengchen Wei, Zekun Lou, Philip H. S. Torr, Wanli Ouyang, Han-sen Zhong, Chen Lin

    Abstract: Machine learning has emerged as a significant approach to efficiently tackle electronic structure problems. Despite its potential, there is little guarantee that the model will generalize to unseen data, which hinders its application in real-world scenarios. To address this issue, a technique has been proposed to estimate the accuracy of the predictions. This method integrates machine learning with self-…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 6 pages, 4 figures

  19. arXiv:2402.08871  [pdf, other]

    cs.LG stat.ML

    Position: Topological Deep Learning is the New Frontier for Relational Learning

    Authors: Theodore Papamarkou, Tolga Birdal, Michael Bronstein, Gunnar Carlsson, Justin Curry, Yue Gao, Mustafa Hajij, Roland Kwitt, Pietro Liò, Paolo Di Lorenzo, Vasileios Maroulas, Nina Miolane, Farzana Nasrin, Karthikeyan Natesan Ramamurthy, Bastian Rieck, Simone Scardapane, Michael T. Schaub, Petar Veličković, Bei Wang, Yusu Wang, Guo-Wei Wei, Ghada Zamzmi

    Abstract: Topological deep learning (TDL) is a rapidly evolving field that uses topological features to understand and design deep learning models. This paper posits that TDL is the new frontier for relational learning. TDL may complement graph representation learning and geometric deep learning by incorporating topological concepts, and can thus provide a natural choice for various machine learning setting…

    Submitted 6 August, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  20. arXiv:2402.01566  [pdf, other]

    cs.CV cs.AI

    Boximator: Generating Rich and Controllable Motions for Video Synthesis

    Authors: Jiawei Wang, Yuchen Zhang, Jiaxin Zou, Yan Zeng, Guoqiang Wei, Liping Yuan, Hang Li

    Abstract: Generating rich and controllable motion is a pivotal challenge in video synthesis. We propose Boximator, a new approach for fine-grained motion control. Boximator introduces two constraint types: hard box and soft box. Users select objects in the conditional frame using hard boxes and then use either type of box to roughly or rigorously define the object's position, shape, or motion path in futu…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 16 pages, 9 figures

  21. arXiv:2401.16732  [pdf, other]

    cs.CR

    Flash: A Hybrid Private Inference Protocol for Deep CNNs with High Accuracy and Low Latency on CPU

    Authors: Hyeri Roh, Jinsu Yeo, Yeongil Ko, Gu-Yeon Wei, David Brooks, Woo-Seok Choi

    Abstract: This paper presents Flash, an optimized private inference (PI) hybrid protocol utilizing both homomorphic encryption (HE) and secure two-party computation (2PC), which can reduce the end-to-end PI latency for deep CNN models to less than 1 minute on a CPU. To this end, first, Flash proposes a low-latency convolution algorithm built upon a fast slot rotation operation and a novel data encoding scheme,…

    Submitted 29 January, 2024; originally announced January 2024.

  22. arXiv:2312.17449  [pdf, other]

    cs.DB

    DB-GPT: Empowering Database Interactions with Private Large Language Models

    Authors: Siqiao Xue, Caigao Jiang, Wenhui Shi, Fangyin Cheng, Keting Chen, Hongjun Yang, Zhiping Zhang, Jianshan He, Hongyang Zhang, Ganglin Wei, Wang Zhao, Fan Zhou, Danrui Qi, Hong Yi, Shaodong Liu, Faqiang Chen

    Abstract: The recent breakthroughs in large language models (LLMs) are positioned to transition many areas of software. Database technologies particularly have an important entanglement with LLMs as efficient and intuitive database interactions are paramount. In this paper, we present DB-GPT, a revolutionary and production-ready project that integrates LLMs with traditional database systems to enhance user…

    Submitted 3 January, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

  23. arXiv:2312.15707  [pdf, other]

    cs.CV

    High-Fidelity Diffusion-based Image Editing

    Authors: Chen Hou, Guoqiang Wei, Zhibo Chen

    Abstract: Diffusion models have attained remarkable success in the domains of image generation and editing. It is widely recognized that employing larger inversion and denoising steps in diffusion models leads to improved image reconstruction quality. However, the editing performance of diffusion models tends to be no more satisfactory even with increasing denoising steps. The deficiency in editing could be…

    Submitted 4 January, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI 2024

  24. arXiv:2312.15561  [pdf, other]

    cs.CL cs.AI

    README: Bridging Medical Jargon and Lay Understanding for Patient Education through Data-Centric NLP

    Authors: Zonghai Yao, Nandyala Siddharth Kantu, Guanghao Wei, Hieu Tran, Zhangqi Duan, Sunjae Kwon, Zhichao Yang, README annotation team, Hong Yu

    Abstract: The advancement in healthcare has shifted focus toward patient-centric approaches, particularly in self-care and patient education, facilitated by access to Electronic Health Records (EHR). However, medical jargon in EHRs poses significant challenges in patient comprehension. To address this, we introduce a new task of automatically generating lay definitions, aiming to simplify complex medical te…

    Submitted 16 June, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

  25. arXiv:2312.14385  [pdf, other]

    cs.DC cs.LG cs.MM

    Generative AI Beyond LLMs: System Implications of Multi-Modal Generation

    Authors: Alicia Golden, Samuel Hsia, Fei Sun, Bilge Acun, Basil Hosmer, Yejin Lee, Zachary DeVito, Jeff Johnson, Gu-Yeon Wei, David Brooks, Carole-Jean Wu

    Abstract: As the development of large-scale Generative AI models evolves beyond text (1D) generation to include image (2D) and video (3D) generation, processing spatial and temporal information presents unique challenges to quality, performance, and efficiency. We present the first work towards understanding this new system design space for multi-modal text-to-image (TTI) and text-to-video (TTV) generation m…

    Submitted 5 May, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Published at 2024 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)

  26. arXiv:2312.01272  [pdf, other]

    q-bio.BM cs.LG q-bio.GN

    Multiscale Topology in Interactomic Network: From Transcriptome to Antiaddiction Drug Repurposing

    Authors: Hongyan Du, Guo-Wei Wei, Tingjun Hou

    Abstract: The escalating drug addiction crisis in the United States underscores the urgent need for innovative therapeutic strategies. This study embarked on an innovative and rigorous strategy to unearth potential drug repurposing candidates for opioid and cocaine addiction treatment, bridging the gap between transcriptomic data analysis and drug discovery. We initiated our approach by conducting different…

    Submitted 2 December, 2023; originally announced December 2023.

  27. arXiv:2311.14062  [pdf, other]

    cs.CV

    Hardware Resilience Properties of Text-Guided Image Classifiers

    Authors: Syed Talal Wasim, Kabila Haile Soboka, Abdulrahman Mahmoud, Salman Khan, David Brooks, Gu-Yeon Wei

    Abstract: This paper presents a novel method to enhance the reliability of image classification models during deployment in the face of transient hardware errors. By utilizing enriched text embeddings derived from GPT-3 with question prompts per class and the CLIP pretrained text encoder, we investigate their impact as an initialization for the classification layer. Our approach achieves a remarkable…

    Submitted 5 December, 2023; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: Accepted at NeurIPS 2023

  28. arXiv:2311.10982  [pdf, other]

    cs.CV

    Make Pixels Dance: High-Dynamic Video Generation

    Authors: Yan Zeng, Guoqiang Wei, Jiani Zheng, Jiaxin Zou, Yang Wei, Yuchen Zhang, Hang Li

    Abstract: Creating high-dynamic videos such as motion-rich actions and sophisticated visual effects poses a significant challenge in the field of artificial intelligence. Unfortunately, current state-of-the-art video generation methods, primarily focusing on text-to-video generation, tend to produce video clips with minimal motions despite maintaining high fidelity. We argue that relying solely on text inst…

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: 12 pages

  29. arXiv:2310.20602  [pdf, other]

    cs.RO eess.SY

    Compliant actuators that mimic biological muscle performance with applications in a highly biomimetic robotic arm

    Authors: Haosen Yang, Guowu Wei, Lei Ren, Lingyun Yan

    Abstract: This paper endeavours to bridge the existing gap in muscular actuator design for ligament-skeletal-inspired robots, thereby fostering the evolution of these robotic systems. We introduce two novel compliant actuators, namely the Internal Torsion Spring Compliant Actuator (ICA) and the External Spring Compliant Actuator (ECA), and present a comparative analysis against the previously conceived Magn…

    Submitted 31 October, 2023; originally announced October 2023.

  30. arXiv:2310.18765  [pdf, other]

    cs.LG

    Rethinking Semi-Supervised Imbalanced Node Classification from Bias-Variance Decomposition

    Authors: Divin Yan, Gengchen Wei, Chen Yang, Shengzhong Zhang, Zengfeng Huang

    Abstract: This paper introduces a new approach to address the issue of class imbalance in graph neural networks (GNNs) for learning on graph-structured data. Our approach integrates imbalanced node classification and Bias-Variance Decomposition, establishing a theoretical framework that closely relates data imbalance to model variance. We also leverage a graph augmentation technique to estimate the variance,…

    Submitted 5 February, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023

    Journal ref: Thirty-seventh Conference on Neural Information Processing Systems. (NeurIPS 2023)

  31. arXiv:2310.18299  [pdf, other]

    cs.RO eess.SY

    Enhancing the Performance of a Biomimetic Robotic Elbow-and-Forearm System Through Bionics-Inspired Optimization

    Authors: Haosen Yang, Guowu Wei, Lei Ren

    Abstract: This paper delineates the formulation and verification of an innovative robotic forearm and elbow design, mirroring the intricate biomechanics of human skeletal and ligament systems. Conventional robotic models often undervalue the substantial function of soft tissues, leading to a compromise between compactness, safety, stability, and range of motion. In contrast, this study proposes a holistic r…

    Submitted 27 October, 2023; originally announced October 2023.

  32. arXiv:2310.18283  [pdf, other]

    cs.RO eess.SY

    Development and Characteristics of a Highly Biomimetic Robotic Shoulder Through Bionics-Inspired Optimization

    Authors: Haosen Yang, Guowu Wei, Lei Ren

    Abstract: This paper critically analyzes conventional and biomimetic robotic arms, underscoring the trade-offs between size, motion range, and load capacity in current biomimetic models. By delving into the human shoulder's mechanical intelligence, particularly the glenohumeral joint's intricate features such as its unique ball-and-socket structure and self-locking mechanism, we pinpoint innovations that bo…

    Submitted 27 October, 2023; originally announced October 2023.

  33. arXiv:2310.15744  [pdf, other]

    stat.ML cs.LG math.AT

    Analyzing Single Cell RNA Sequencing with Topological Nonnegative Matrix Factorization

    Authors: Yuta Hozumi, Guo-Wei Wei

    Abstract: Single-cell RNA sequencing (scRNA-seq) is a relatively new technology that has stimulated enormous interest in statistics, data science, and computational biology due to the high dimensionality, complexity, and large scale associated with scRNA-seq data. Nonnegative matrix factorization (NMF) offers a unique approach due to its meta-gene interpretation of resulting low-dimensional components. Howe…

    Submitted 24 October, 2023; originally announced October 2023.

  34. arXiv:2310.14521  [pdf, other]

    q-bio.QM cs.LG math.AT

    K-Nearest-Neighbors Induced Topological PCA for scRNA Sequence Data Analysis

    Authors: Sean Cottrell, Yuta Hozumi, Guo-Wei Wei

    Abstract: Single-cell RNA sequencing (scRNA-seq) is widely used to reveal heterogeneity in cells, which has given us insights into cell-cell communication, cell differentiation, and differential gene expression. However, analyzing scRNA-seq data is a challenge due to sparsity and the large number of genes involved. Therefore, dimensionality reduction and feature selection are important for removing spurious…

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: 28 pages, 11 figures

  35. arXiv:2310.09092  [pdf, other]

    cs.CV

    iPUNet: Iterative Cross Field Guided Point Cloud Upsampling

    Authors: Guangshun Wei, Hao Pan, Shaojie Zhuang, Yuanfeng Zhou, Changjian Li

    Abstract: Point clouds acquired by 3D scanning devices are often sparse, noisy, and non-uniform, causing a loss of geometric features. To facilitate the usability of point clouds in downstream applications, given such input, we present a learning-based point upsampling method, i.e., iPUNet, which generates dense and uniform points at arbitrary ratios and better captures sharp features. To generate feature-a…

    Submitted 13 October, 2023; originally announced October 2023.

  36. ChatGPT for Computational Topology

    Authors: Jian Liu, Li Shen, Guo-Wei Wei

    Abstract: ChatGPT represents a significant milestone in the field of artificial intelligence (AI), finding widespread applications across diverse domains. However, its effectiveness in mathematical contexts has been somewhat constrained by its susceptibility to conceptual errors. Concurrently, topological data analysis (TDA), a relatively new discipline, has garnered substantial interest in recent years. No…

    Submitted 14 November, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Journal ref: Foundations of Data Science, 2024

  37. arXiv:2310.05171  [pdf, other]

    cs.AI cs.CV

    Multi-Ship Tracking by Robust Similarity metric

    Authors: Hongyu Zhao, Gongming Wei, Yang Xiao, Xianglei Xing

    Abstract: Multi-ship tracking (MST), as a core technology, has been proven applicable to situational awareness at sea and the development of navigational systems for autonomous ships. Despite impressive tracking outcomes achieved by multi-object tracking (MOT) algorithms for pedestrian and vehicle datasets, these models and techniques exhibit poor performance when applied to ship datasets. Intersection of…

    Submitted 8 October, 2023; originally announced October 2023.

  38. arXiv:2310.03103  [pdf, other]

    cs.LG

    Learning to Prompt Your Domain for Vision-Language Models

    Authors: Guoyizhe Wei, Feng Wang, Anshul Shah, Rama Chellappa

    Abstract: Prompt learning has recently become a very efficient transfer learning paradigm for Contrastive Language Image Pretraining (CLIP) models. Compared with fine-tuning the entire encoder, prompt learning can obtain highly competitive results by optimizing only a small number of parameters, which presents considerably exciting benefits for federated learning applications that prioritize communication…

    Submitted 29 August, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

  39. arXiv:2310.02784  [pdf, other]

    cs.DC cs.AR cs.LG

    MAD Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems

    Authors: Samuel Hsia, Alicia Golden, Bilge Acun, Newsha Ardalani, Zachary DeVito, Gu-Yeon Wei, David Brooks, Carole-Jean Wu

    Abstract: Training and deploying large-scale machine learning models is time-consuming, requires significant distributed computing infrastructures, and incurs high operational costs. Our analysis, grounded in real-world large model training on datacenter-scale infrastructures, reveals that 14-32% of all GPU hours are spent on communication with no overlapping computation. To minimize this outstanding commun…

    Submitted 10 June, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: ISCA 2024

  40. arXiv:2309.16809  [pdf, other]

    cs.LG

    GraB-sampler: Optimal Permutation-based SGD Data Sampler for PyTorch

    Authors: Guanghao Wei

    Abstract: The online Gradient Balancing (GraB) algorithm, which greedily chooses the example ordering by solving the herding problem using per-sample gradients, is proven to be the theoretically optimal solution and is guaranteed to outperform Random Reshuffling. However, there is currently no efficient implementation of GraB that the community can easily use. This work presents an efficient Python library,…

    Submitted 28 September, 2023; originally announced September 2023.

  41. arXiv:2309.14396  [pdf, other]

    cs.SE cs.LG cs.PL

    Guess & Sketch: Language Model Guided Transpilation

    Authors: Celine Lee, Abdulrahman Mahmoud, Michal Kurek, Simone Campanoni, David Brooks, Stephen Chong, Gu-Yeon Wei, Alexander M. Rush

    Abstract: Maintaining legacy software requires many software and systems engineering hours. Assembly code programs, which demand low-level control over the computer machine state and have no variable names, are particularly difficult for humans to analyze. Existing conventional program translators guarantee correctness, but are hand-engineered for the source and target programming languages in question. Lea…

    Submitted 15 March, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

  42. arXiv:2309.08118  [pdf, ps, other]

    cs.PL

    Graph IRs for Impure Higher-Order Languages (Technical Report)

    Authors: Oliver Bračevac, Guannan Wei, Songlin Jia, Supun Abeysinghe, Yuxuan Jiang, Yuyan Bao, Tiark Rompf

    Abstract: This is a companion report for the OOPSLA 2023 paper of the same title, presenting a detailed end-to-end account of the $λ^*_{\mathsf{G}}$ graph IR, at a level of detail beyond a regular conference paper. Our first concern is adequacy and soundness of $λ^*_{\mathsf{G}}$, which we derive from a direct-style imperative functional language (a variant of Bao et al.'s $λ^*$-calculus with reachability t…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: text overlap with arXiv:2309.05885

  43. arXiv:2309.07270  [pdf, other]

    cs.DC

    GPU Scheduler for De Novo Genome Assembly with Multiple MPI Processes

    Authors: Minhao Li, Siyu Wang, Guanghao Wei

    Abstract: $\textit{De Novo}$ Genome assembly is one of the most important tasks in computational biology. ELBA is the state-of-the-art distributed-memory parallel algorithm for overlap detection and layout simplification steps of $\textit{De Novo}…

    Submitted 10 October, 2023; v1 submitted 13 September, 2023; originally announced September 2023.

  44. arXiv:2309.05885  [pdf, ps, other]

    cs.PL

    Modeling Reachability Types with Logical Relations

    Authors: Yuyan Bao, Guannan Wei, Oliver Bračevac, Tiark Rompf

    Abstract: Reachability types are a recent proposal to bring Rust-style reasoning about memory properties to higher-level languages. While key type soundness results for reachability types have been established using syntactic techniques in prior work, stronger metatheoretic properties have so far been unexplored. This paper presents an alternative semantic model of reachability types using logical relations…

    Submitted 11 September, 2023; originally announced September 2023.

  45. arXiv:2308.15874  [pdf]

    cs.RO

    Sarrus-inspired Deployable Polyhedral Mechanisms

    Authors: Yuanqing Gu, Xiao Zhang, Guowu Wei, Yan Chen

    Abstract: Deployable polyhedral mechanisms (DPMs) have witnessed flourishing growth in recent years because of their potential applications in robotics, space exploration, structural engineering, etc. This paper first presents the construction, mobility, and kinematics of a family of Sarrus-inspired deployable polyhedral mechanisms. By carrying out an expansion operation and implanting Sarrus linkages along th…

    Submitted 30 August, 2023; originally announced August 2023.

  46. arXiv:2308.13934  [pdf, other]

    cs.GR

    Patch-Grid: An Efficient and Feature-Preserving Neural Implicit Surface Representation

    Authors: Guying Lin, Lei Yang, Congyi Zhang, Hao Pan, Yuhan Ping, Guodong Wei, Taku Komura, John Keyser, Wenping Wang

    Abstract: Neural implicit representations are known to be more compact for depicting 3D shapes than traditional discrete representations. However, the neural representations tend to round sharp corners or edges and struggle to represent surfaces with open boundaries. Moreover, they are slow to train. We present a unified neural implicit representation, called Patch-Grid, that fits to complex shapes efficien…

    Submitted 26 August, 2023; originally announced August 2023.

  47. arXiv:2308.06920  [pdf, other]

    cs.AI cs.HC q-bio.BM

    ChatGPT in Drug Discovery: A Case Study on Anti-Cocaine Addiction Drug Development with Chatbots

    Authors: Rui Wang, Hongsong Feng, Guo-Wei Wei

    Abstract: The birth of ChatGPT, a cutting-edge language model-based chatbot developed by OpenAI, ushered in a new era in AI. However, due to potential pitfalls, its role in rigorous scientific research is not clear yet. This paper vividly showcases its innovative application within the field of drug discovery. Focused specifically on developing anti-cocaine addiction drugs, the study employs GPT-4 as a virt…

    Submitted 19 October, 2023; v1 submitted 13 August, 2023; originally announced August 2023.

  48. arXiv:2307.13844  [pdf, other]

    cs.PL

    Polymorphic Reachability Types: Tracking Freshness, Aliasing, and Separation in Higher-Order Generic Programs

    Authors: Guannan Wei, Oliver Bračevac, Songlin Jia, Yuyan Bao, Tiark Rompf

    Abstract: Reachability types are a recent proposal that has shown promise in scaling to higher-order but monomorphic settings, tracking aliasing and separation on top of a substrate inspired by separation logic. The prior $λ^*$ reachability type system qualifies types with sets of reachable variables and guarantees separation if two terms have disjoint qualifiers. However, naive extensions with type polymor…

    Submitted 25 July, 2023; originally announced July 2023.

  49. arXiv:2307.05794  [pdf, other]

    q-bio.BM cs.LG

    Machine Learning Study of the Extended Drug-target Interaction Network informed by Pain Related Voltage-Gated Sodium Channels

    Authors: Long Chen, Jian Jiang, Bozheng Dou, Hongsong Feng, Jie Liu, Yueying Zhu, Bengong Zhang, Tianshou Zhou, Guo-Wei Wei

    Abstract: Pain is a significant global health issue, and the current treatment options for pain management have limitations in terms of effectiveness, side effects, and potential for addiction. There is a pressing need for improved pain treatments and the development of new drugs. Voltage-gated sodium channels, particularly Nav1.3, Nav1.7, Nav1.8, and Nav1.9, play a crucial role in neuronal excitability and…

    Submitted 11 July, 2023; originally announced July 2023.

  50. arXiv:2307.02469  [pdf, other]

    cs.CV cs.CL

    What Matters in Training a GPT4-Style Language Model with Multimodal Inputs?

    Authors: Yan Zeng, Hanbo Zhang, Jiani Zheng, Jiangnan Xia, Guoqiang Wei, Yang Wei, Yuchen Zhang, Tao Kong

    Abstract: Recent advancements in Large Language Models (LLMs) such as GPT4 have displayed exceptional multi-modal capabilities in following open-ended instructions given images. However, the performance of these models heavily relies on design choices such as network structures, training data, and training strategies, and these choices have not been extensively discussed in the literature, making it difficu…

    Submitted 30 July, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: 32 pages