Skip to main content

Showing 1–50 of 82 results for author: Qiu, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.00378  [pdf

    cs.CE

    A deep spatio-temporal attention model of dynamic functional network connectivity shows sensitivity to Alzheimer's in asymptomatic individuals

    Authors: Yuxiang Wei, Anees Abrol, James Lah, Deqiang Qiu, Vince D. Calhoun

    Abstract: Alzheimer's disease (AD) progresses from asymptomatic changes to clinical symptoms, emphasizing the importance of early detection for proper treatment. Functional magnetic resonance imaging (fMRI), particularly dynamic functional network connectivity (dFNC), has emerged as an important biomarker for AD. Nevertheless, studies probing at-risk subjects in the pre-symptomatic stage using dFNC are limi… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: Accepted by EMBC 2024

  2. arXiv:2407.21075  [pdf, other

    cs.AI cs.CL cs.LG

    Apple Intelligence Foundation Language Models

    Authors: Tom Gunter, Zirui Wang, Chong Wang, Ruoming Pang, Andy Narayanan, Aonan Zhang, Bowen Zhang, Chen Chen, Chung-Cheng Chiu, David Qiu, Deepak Gopinath, Dian Ang Yap, Dong Yin, Feng Nan, Floris Weers, Guoli Yin, Haoshuo Huang, Jianyu Wang, Jiarui Lu, John Peebles, Ke Ye, Mark Lee, Nan Du, Qibin Chen, Quentin Keunebroek , et al. (130 additional authors not shown)

    Abstract: We present foundation language models developed to power Apple Intelligence features, including a ~3 billion parameter model designed to run efficiently on devices and a large server-based language model designed for Private Cloud Compute. These models are designed to perform a wide range of tasks efficiently, accurately, and responsibly. This report describes the model architecture, the data used… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

  3. arXiv:2407.00295  [pdf, other

    cs.CV

    A deep neural network framework for dynamic multi-valued mapping estimation and its applications

    Authors: Geng Li, Di Qiu, Lok Ming Lui

    Abstract: This paper addresses the problem of modeling and estimating dynamic multi-valued mappings. While most mathematical models provide a unique solution for a given input, real-world applications often lack deterministic solutions. In such scenarios, estimating dynamic multi-valued mappings is necessary to suggest different reasonable solutions for each input. This paper introduces a deep neural networ… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

  4. arXiv:2406.18115  [pdf, other

    cs.RO cs.AI cs.CV

    Open-vocabulary Mobile Manipulation in Unseen Dynamic Environments with 3D Semantic Maps

    Authors: Dicong Qiu, Wenzong Ma, Zhenfu Pan, Hui Xiong, Junwei Liang

    Abstract: Open-Vocabulary Mobile Manipulation (OVMM) is a crucial capability for autonomous robots, especially when faced with the challenges posed by unknown and dynamic environments. This task requires robots to explore and build a semantic understanding of their surroundings, generate feasible plans to achieve manipulation goals, adapt to environmental changes, and comprehend natural language instruction… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Open-vocabulary, Mobile Manipulation, Dynamic Environments, 3D Semantic Maps, Zero-shot, LLMs, VLMs, 18 pages, 2 figures

  5. arXiv:2405.13467  [pdf, other

    cs.CV

    AdaFedFR: Federated Face Recognition with Adaptive Inter-Class Representation Learning

    Authors: Di Qiu, Xinyang Lin, Kaiye Wang, Xiangxiang Chu, Pengfei Yan

    Abstract: With the growing attention on data privacy and communication security in face recognition applications, federated learning has been introduced to learn a face recognition model with decentralized datasets in a privacy-preserving manner. However, existing works still face challenges such as unsatisfying performance and additional communication costs, limiting their applicability in real-world scena… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  6. arXiv:2405.11315  [pdf, other

    cs.CV

    MediCLIP: Adapting CLIP for Few-shot Medical Image Anomaly Detection

    Authors: Ximiao Zhang, Min Xu, Dehui Qiu, Ruixin Yan, Ning Lang, Xiuzhuang Zhou

    Abstract: In the field of medical decision-making, precise anomaly detection in medical imaging plays a pivotal role in aiding clinicians. However, previous work is reliant on large-scale datasets for training anomaly detection models, which increases the development cost. This paper first focuses on the task of medical image anomaly detection in the few-shot setting, which is critically significant for the… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: 12 pages, 3 figures, 5 tables, early accepted at MICCAI 2024

  7. arXiv:2404.19519  [pdf, ps, other

    cs.LG cs.DB

    Generating Robust Counterfactual Witnesses for Graph Neural Networks

    Authors: Dazhuo Qiu, Mengying Wang, Arijit Khan, Yinghui Wu

    Abstract: This paper introduces a new class of explanation structures, called robust counterfactual witnesses (RCWs), to provide robust, both counterfactual and factual explanations for graph neural networks. Given a graph neural network M, a robust counterfactual witness refers to the fraction of a graph G that are counterfactual and factual explanation of the results of M over G, but also remains so for a… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: This paper has been accepted by ICDE 2024

  8. arXiv:2404.02225  [pdf, other

    cs.CV cs.AI

    CHOSEN: Contrastive Hypothesis Selection for Multi-View Depth Refinement

    Authors: Di Qiu, Yinda Zhang, Thabo Beeler, Vladimir Tankovich, Christian Häne, Sean Fanello, Christoph Rhemann, Sergio Orts Escolano

    Abstract: We propose CHOSEN, a simple yet flexible, robust and effective multi-view depth refinement framework. It can be employed in any existing multi-view stereo pipeline, with straightforward generalization capability for different multi-view capture systems such as camera relative positioning and lenses. Given an initial depth estimation, CHOSEN iteratively re-samples and selects the best hypotheses, a… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  9. arXiv:2404.01296  [pdf, other

    cs.CV

    MagicMirror: Fast and High-Quality Avatar Generation with a Constrained Search Space

    Authors: Armand Comas-Massagué, Di Qiu, Menglei Chai, Marcel Bühler, Amit Raj, Ruiqi Gao, Qiangeng Xu, Mark Matthews, Paulo Gotardo, Octavia Camps, Sergio Orts-Escolano, Thabo Beeler

    Abstract: We introduce a novel framework for 3D human avatar generation and personalization, leveraging text prompts to enhance user engagement and customization. Central to our approach are key innovations aimed at overcoming the challenges in photo-realistic avatar synthesis. Firstly, we utilize a conditional Neural Radiance Fields (NeRF) model, trained on a large-scale unannotated multi-view dataset, to… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  10. arXiv:2404.00667  [pdf, other

    cs.CV

    Weakly-Supervised Cross-Domain Segmentation of Electron Microscopy with Sparse Point Annotation

    Authors: Dafei Qiu, Shan Xiong, Jiajin Yi, Jialin Peng

    Abstract: Accurate segmentation of organelle instances from electron microscopy (EM) images plays an essential role in many neuroscience researches. However, practical scenarios usually suffer from high annotation costs, label scarcity, and large domain diversity. While unsupervised domain adaptation (UDA) that assumes no annotation effort on the target data is promising to alleviate these challenges, its p… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  11. arXiv:2401.08957  [pdf, other

    cs.RO cs.AI

    SWBT: Similarity Weighted Behavior Transformer with the Imperfect Demonstration for Robotic Manipulation

    Authors: Kun Wu, Ning Liu, Zhen Zhao, Di Qiu, Jinming Li, Zhengping Che, Zhiyuan Xu, Qinru Qiu, Jian Tang

    Abstract: Imitation learning (IL), aiming to learn optimal control policies from expert demonstrations, has been an effective method for robot manipulation tasks. However, previous IL methods either only use expensive expert demonstrations and omit imperfect demonstrations or rely on interacting with the environment and learning from online experiences. In the context of robotic manipulation, we aim to conq… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 8 pages, 5 figures

    ACM Class: I.2.9

  12. arXiv:2401.02086  [pdf, other

    cs.LG cs.DB

    View-based Explanations for Graph Neural Networks

    Authors: Tingyang Chen, Dazhuo Qiu, Yinghui Wu, Arijit Khan, Xiangyu Ke, Yunjun Gao

    Abstract: Generating explanations for graph neural networks (GNNs) has been studied to understand their behavior in analytical tasks such as graph classification. Existing approaches aim to understand the overall results of GNNs rather than providing explanations for specific class labels of interest, and may return explanation structures that are hard to access, nor directly queryable.We propose GVEX, a no… ▽ More

    Submitted 7 January, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

    Comments: This paper has been accepted by SIGMOD 2024

  13. arXiv:2312.09463  [pdf, other

    cs.CL

    Partial Rewriting for Multi-Stage ASR

    Authors: Antoine Bruguier, David Qiu, Yanzhang He

    Abstract: For many streaming automatic speech recognition tasks, it is important to provide timely intermediate streaming results, while refining a high quality final result. This can be done using a multi-stage architecture, where a small left-context only model creates streaming results and a larger left- and right-context model produces a final result at the end. While this significantly improves the qua… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  14. arXiv:2312.08553  [pdf, other

    eess.AS cs.SD

    USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models

    Authors: Shaojin Ding, David Qiu, David Rim, Yanzhang He, Oleg Rybakov, Bo Li, Rohit Prabhavalkar, Weiran Wang, Tara N. Sainath, Zhonglin Han, Jian Li, Amir Yazdanbakhsh, Shivani Agrawal

    Abstract: End-to-end automatic speech recognition (ASR) models have seen revolutionary quality gains with the recent development of large-scale universal speech models (USM). However, deploying these massive USMs is extremely expensive due to the enormous memory usage and computational cost. Therefore, model compression is an important research topic to fit USM-based ASR under budget in real-world scenarios… ▽ More

    Submitted 16 January, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: Accepted by ICASSP 2024. Preprint

  15. arXiv:2312.03763  [pdf, other

    cs.CV cs.GR cs.LG

    Gaussian3Diff: 3D Gaussian Diffusion for 3D Full Head Synthesis and Editing

    Authors: Yushi Lan, Feitong Tan, Di Qiu, Qiangeng Xu, Kyle Genova, Zeng Huang, Sean Fanello, Rohit Pandey, Thomas Funkhouser, Chen Change Loy, Yinda Zhang

    Abstract: We present a novel framework for generating photorealistic 3D human head and subsequently manipulating and reposing them with remarkable flexibility. The proposed approach leverages an implicit function representation of 3D human heads, employing 3D Gaussians anchored on a parametric face model. To enhance representational capabilities and encode spatial information, we embed a lightweight tri-pla… ▽ More

    Submitted 19 December, 2023; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: project webpage: https://nirvanalan.github.io/projects/gaussian3diff/

  16. arXiv:2311.04643  [pdf, other

    cs.SE

    Software Architecture Recovery with Information Fusion

    Authors: Yiran Zhang, Zhengzi Xu, Chengwei Liu, Hongxu Chen, Jianwen Sun, Dong Qiu, Yang Liu

    Abstract: Understanding the architecture is vital for effectively maintaining and managing large software systems. However, as software systems evolve over time, their architectures inevitably change. To keep up with the change, architects need to track the implementation-level changes and update the architectural documentation accordingly, which is time-consuming and error-prone. Therefore, many automatic… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  17. arXiv:2311.00353  [pdf, other

    cs.CV

    LatentWarp: Consistent Diffusion Latents for Zero-Shot Video-to-Video Translation

    Authors: Yuxiang Bao, Di Qiu, Guoliang Kang, Baochang Zhang, Bo Jin, Kaiye Wang, Pengfei Yan

    Abstract: Leveraging the generative ability of image diffusion models offers great potential for zero-shot video-to-video translation. The key lies in how to maintain temporal consistency across generated video frames by image diffusion models. Previous methods typically adopt cross-frame attention, \emph{i.e.,} sharing the \textit{key} and \textit{value} tokens across attentions of different frames, to enc… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  18. arXiv:2309.11488  [pdf, other

    cs.DC cs.AR

    An Evaluation and Comparison of GPU Hardware and Solver Libraries for Accelerating the OPM Flow Reservoir Simulator

    Authors: Tong Dong Qiu, Andreas Thune, Markus Blatt, Alf Birger Rustad, Razvan Nane

    Abstract: Realistic reservoir simulation is known to be prohibitively expensive in terms of computation time when increasing the accuracy of the simulation or by enlarging the model grid size. One method to address this issue is to parallelize the computation by dividing the model in several partitions and using multiple CPUs to compute the result using techniques such as MPI and multi-threading. Alternativ… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

  19. arXiv:2307.03870  [pdf, other

    cs.FL eess.SY

    Opacity of Parametric Discrete Event Systems: Models, Decidability, and Algorithms

    Authors: Weilin Deng, Daowen Qiu, Jingkai Yang

    Abstract: Finite automata (FAs) model is a popular tool to characterize discrete event systems (DESs) due to its succinctness. However, for some complex systems, it is difficult to describe the necessary details by means of FAs model. In this paper, we consider a kind of extended finite automata (EFAs) in which each transition carries a predicate over state and event parameters. We also consider a type of s… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: 13 pages, 9 figures

  20. arXiv:2306.07719  [pdf, other

    cs.AI

    Contextual Dictionary Lookup for Knowledge Graph Completion

    Authors: Jining Wang, Delai Qiu, YouMing Liu, Yining Wang, Chuan Chen, Zibin Zheng, Yuren Zhou

    Abstract: Knowledge graph completion (KGC) aims to solve the incompleteness of knowledge graphs (KGs) by predicting missing links from known triples, numbers of knowledge graph embedding (KGE) models have been proposed to perform KGC by learning embeddings. Nevertheless, most existing embedding models map each relation into a unique vector, overlooking the specific fine-grained semantics of them under diffe… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

  21. arXiv:2305.15536  [pdf, other

    eess.AS cs.LG

    RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models

    Authors: David Qiu, David Rim, Shaojin Ding, Oleg Rybakov, Yanzhang He

    Abstract: With the rapid increase in the size of neural networks, model compression has become an important area of research. Quantization is an effective technique at decreasing the model size, memory access, and compute load of large models. Despite recent advances in quantization aware training (QAT) technique, most papers present evaluations that are focused on computer vision tasks, which have differen… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  22. arXiv:2305.06194  [pdf, other

    cs.RO

    Concentric Tube Robot Redundancy Resolution via Velocity/Compliance Manipulability Optimization

    Authors: Jia Shen, Yifan Wang, Milad Azizkhani, Deqiang Qiu, Yue Chen

    Abstract: Concentric Tube Robots (CTR) have the potential to enable effective minimally invasive surgeries. While extensive modeling and control schemes have been proposed in the past decade, limited efforts have been made to improve the trajectory tracking performance from the perspective of manipulability , which can be critical to generate safe motion and feasible actuator commands. In this paper, we pro… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: 8 pages, 5 figures

  23. arXiv:2304.01436  [pdf, other

    cs.CV cs.GR

    Learning Personalized High Quality Volumetric Head Avatars from Monocular RGB Videos

    Authors: Ziqian Bai, Feitong Tan, Zeng Huang, Kripasindhu Sarkar, Danhang Tang, Di Qiu, Abhimitra Meka, Ruofei Du, Mingsong Dou, Sergio Orts-Escolano, Rohit Pandey, Ping Tan, Thabo Beeler, Sean Fanello, Yinda Zhang

    Abstract: We propose a method to learn a high-quality implicit 3D head avatar from a monocular RGB video captured in the wild. The learnt avatar is driven by a parametric face model to achieve user-controlled facial expressions and head poses. Our hybrid pipeline combines the geometry prior and dynamic tracking of a 3DMM with a neural radiance field to achieve fine-grained control and photorealism. To reduc… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: In CVPR2023. Project page: https://augmentedperception.github.io/monoavatar/

  24. arXiv:2212.05719  [pdf, other

    cs.CV

    Tensor Factorization via Transformed Tensor-Tensor Product for Image Alignment

    Authors: Sijia Xia, Duo Qiu, Xiongjun Zhang

    Abstract: In this paper, we study the problem of a batch of linearly correlated image alignment, where the observed images are deformed by some unknown domain transformations, and corrupted by additive Gaussian noise and sparse noise simultaneously. By stacking these images as the frontal slices of a third-order tensor, we propose to utilize the tensor factorization method via transformed tensor-tensor prod… ▽ More

    Submitted 13 December, 2022; v1 submitted 12 December, 2022; originally announced December 2022.

  25. arXiv:2210.13109  [pdf, other

    cs.CV

    WDA-Net: Weakly-Supervised Domain Adaptive Segmentation of Electron Microscopy

    Authors: Dafei Qiu, Jiajin Yi, Jialin Peng

    Abstract: Accurate segmentation of organelle instances, e.g., mitochondria, is essential for electron microscopy analysis. Despite the outstanding performance of fully supervised methods, they highly rely on sufficient per-pixel annotated data and are sensitive to domain shift. Aiming to develop a highly annotation-efficient approach with competitive performance, we focus on weakly-supervised domain adaptat… ▽ More

    Submitted 30 October, 2022; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted by BIBM 2022: International Conference on Bioinformatics & Biomedicine

  26. arXiv:2207.14709  [pdf, other

    eess.IV cs.CV

    Robust Quantitative Susceptibility Mapping via Approximate Message Passing with Parameter Estimation

    Authors: Shuai Huang, James J. Lah, Jason W. Allen, Deqiang Qiu

    Abstract: Purpose: For quantitative susceptibility mapping (QSM), the lack of ground-truth in clinical settings makes it challenging to determine suitable parameters for the dipole inversion. We propose a probabilistic Bayesian approach for QSM with built-in parameter estimation, and incorporate the nonlinear formulation of the dipole inversion to achieve a robust recovery of the susceptibility maps. Theo… ▽ More

    Submitted 30 May, 2023; v1 submitted 29 July, 2022; originally announced July 2022.

    Comments: Keywords: Approximate message passing, Compressive sensing, Outlier modelling, Parameter estimation, Quantitative susceptibility mapping

  27. arXiv:2205.13117  [pdf, other

    cs.CV

    Learn to Cluster Faces via Pairwise Classification

    Authors: Junfu Liu, Di Qiu, Pengfei Yan, Xiaolin Wei

    Abstract: Face clustering plays an essential role in exploiting massive unlabeled face data. Recently, graph-based face clustering methods are getting popular for their satisfying performances. However, they usually suffer from excessive memory consumption especially on large-scale graphs, and rely on empirical thresholds to determine the connectivities between samples in inference, which restricts their ap… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted by ICCV2021

  28. Approximate Message Passing with Parameter Estimation for Heavily Quantized Measurements

    Authors: Shuai Huang, Deqiang Qiu, Trac D. Tran

    Abstract: Designing efficient sparse recovery algorithms that could handle noisy quantized measurements is important in a variety of applications -- from radar to source localization, spectrum sensing and wireless networking. We take advantage of the approximate message passing (AMP) framework to achieve this goal given its high computational efficiency and state-of-the-art performance. In AMP, the signal o… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

    Comments: arXiv admin note: text overlap with arXiv:2007.07679

    Journal ref: IEEE Transactions on Signal Processing, Vol. 70, pp. 2062-2077, Apr. 2022

  29. arXiv:2202.00147  [pdf, ps, other

    quant-ph cs.CR

    Distributed Quantum Vote Based on Quantum Logical Operators, a New Battlefield of the Second Quantum Revolution

    Authors: Xin Sun, Feifei He, Daowen Qiu, Piotr Kulicki, Mirek Sopek, Meiyun Guo

    Abstract: We designed two rules of binary quantum computed vote: Quantum Logical Veto (QLV) and Quantum Logical Nomination (QLN). The conjunction and disjunction from quantum computational logic are used to define QLV and QLN, respectively. Compared to classical vote, quantum computed vote is fairer, more democratic and has stronger expressive power. Since the advantage of quantum computed vote is neither t… ▽ More

    Submitted 31 January, 2022; originally announced February 2022.

    Comments: 9 pages

    MSC Class: 81Pxx

  30. arXiv:2111.14041  [pdf, ps, other

    quant-ph cs.FL

    Learning Quantum Finite Automata with Queries

    Authors: Daowen Qiu

    Abstract: {\it Learning finite automata} (termed as {\it model learning}) has become an important field in machine learning and has been useful realistic applications. Quantum finite automata (QFA) are simple models of quantum computers with finite memory. Due to their simplicity, QFA have well physical realizability, but one-way QFA still have essential advantages over classical finite automata with regard… ▽ More

    Submitted 12 November, 2023; v1 submitted 27 November, 2021; originally announced November 2021.

    Comments: 25pages; comments are welcome

  31. arXiv:2110.03327  [pdf, other

    eess.AS cs.LG

    Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition

    Authors: Qiujia Li, Yu Zhang, David Qiu, Yanzhang He, Liangliang Cao, Philip C. Woodland

    Abstract: As end-to-end automatic speech recognition (ASR) models reach promising performance, various downstream tasks rely on good confidence estimators for these systems. Recent research has shown that model-based confidence estimators have a significant advantage over using the output softmax probabilities. If the input data to the speech recogniser is from mismatched acoustic and linguistic conditions,… ▽ More

    Submitted 2 March, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: Accepted as a conference paper at ICASSP 2022

  32. arXiv:2110.00165  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning

    Authors: Dongseong Hwang, Ananya Misra, Zhouyuan Huo, Nikhil Siddhartha, Shefali Garg, David Qiu, Khe Chai Sim, Trevor Strohman, Françoise Beaufays, Yanzhang He

    Abstract: Self- and semi-supervised learning methods have been actively investigated to reduce labeled training data or enhance the model performance. However, the approach mostly focus on in-domain performance for public datasets. In this study, we utilize the combination of self- and semi-supervised learning methods to solve unseen domain adaptation problem in a large-scale production setting for online A… ▽ More

    Submitted 15 February, 2022; v1 submitted 30 September, 2021; originally announced October 2021.

    Comments: ICASSP 2022 accepted, 5 pages, 2 figures, 5 tables

  33. arXiv:2107.09635  [pdf, ps, other

    cond-mat.stat-mech cond-mat.str-el cs.CE

    Analyzing and predicting non-equilibrium many-body dynamics via dynamic mode decomposition

    Authors: Jia Yin, Yang-hao Chan, Felipe da Jornada, Diana Qiu, Chao Yang, Steven G. Louie

    Abstract: Simulating the dynamics of a nonequilibrium quantum many-body system by computing the two-time Green's function associated with such a system is computationally challenging. However, we are often interested in the time diagonal of such a Green's function or time dependent physical observables that are functions of one time. In this paper, we discuss the possibility of using dynamic model decomposi… ▽ More

    Submitted 26 June, 2021; originally announced July 2021.

    Comments: 22 pages, 17 pages

  34. arXiv:2104.12870  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction

    Authors: David Qiu, Yanzhang He, Qiujia Li, Yu Zhang, Liangliang Cao, Ian McGraw

    Abstract: Confidence scores are very useful for downstream applications of automatic speech recognition (ASR) systems. Recent works have proposed using neural networks to learn word or utterance confidence scores for end-to-end ASR. In those studies, word confidence by itself does not model deletions, and utterance confidence does not take advantage of word-level training signals. This paper proposes to joi… ▽ More

    Submitted 26 April, 2021; originally announced April 2021.

    Comments: Submitted to Interspeech 2021

  35. arXiv:2104.09753  [pdf, other

    quant-ph cs.AI cs.FL eess.SY

    Supervisory Control of Quantum Discrete Event Systems

    Authors: Daowen Qiu

    Abstract: Discrete event systems (DES) have been deeply developed and applied in practice, but state complexity in DES still is an important problem to be better solved with innovative methods. With the development of quantum computing and quantum control, a natural problem is to simulate DES by means of quantum computing models and to establish {\it quantum DES} (QDES). The motivation is twofold: on the on… ▽ More

    Submitted 3 May, 2023; v1 submitted 20 April, 2021; originally announced April 2021.

    Comments: 35 pages, 5 figures; comments are welcome

  36. arXiv:2103.06716  [pdf, other

    eess.AS cs.CL cs.LG

    Learning Word-Level Confidence For Subword End-to-End ASR

    Authors: David Qiu, Qiujia Li, Yanzhang He, Yu Zhang, Bo Li, Liangliang Cao, Rohit Prabhavalkar, Deepti Bhatia, Wei Li, Ke Hu, Tara N. Sainath, Ian McGraw

    Abstract: We study the problem of word-level confidence estimation in subword-based end-to-end (E2E) models for automatic speech recognition (ASR). Although prior works have proposed training auxiliary confidence models for ASR systems, they do not extend naturally to systems that operate on word-pieces (WP) as their vocabulary. In particular, ground truth WP correctness labels are needed for training confi… ▽ More

    Submitted 11 March, 2021; originally announced March 2021.

    Comments: To appear in ICASSP 2021

  37. arXiv:2102.12642  [pdf, other

    cs.CV

    CelebA-Spoof Challenge 2020 on Face Anti-Spoofing: Methods and Results

    Authors: Yuanhan Zhang, Zhenfei Yin, Jing Shao, Ziwei Liu, Shuo Yang, Yuanjun Xiong, Wei Xia, Yan Xu, Man Luo, Jian Liu, Jianshu Li, Zhijun Chen, Mingyu Guo, Hui Li, Junfu Liu, Pengfei Gao, Tianqi Hong, Hao Han, Shijie Liu, Xinhua Chen, Di Qiu, Cheng Zhen, Dashuang Liang, Yufeng Jin, Zhanlong Hao

    Abstract: As facial interaction systems are prevalently deployed, security and reliability of these systems become a critical issue, with substantial research efforts devoted. Among them, face anti-spoofing emerges as an important area, whose objective is to identify whether a presented face is live or spoof. Recently, a large-scale face anti-spoofing dataset, CelebA-Spoof which comprised of 625,537 picture… ▽ More

    Submitted 25 February, 2021; v1 submitted 24 February, 2021; originally announced February 2021.

    Comments: Technical report. Challenge website: https://competitions.codalab.org/competitions/26210

  38. arXiv:2101.01745  [pdf, other

    cs.AR physics.comp-ph

    Hardware Acceleration of HPC Computational Flow Dynamics using HBM-enabled FPGAs

    Authors: Tom Hogervorst, Tong Dong Qiu, Giacomo Marchiori, Alf Birger, Markus Blatt, Razvan Nane

    Abstract: Scientific computing is at the core of many High-Performance Computing applications, including computational flow dynamics. Because of the uttermost importance to simulate increasingly larger computational models, hardware acceleration is receiving increased attention due to its potential to maximize the performance of scientific computing. A Field-Programmable Gate Array is a reconfigurable hardw… ▽ More

    Submitted 5 January, 2021; originally announced January 2021.

    Report number: Article No.: 20, pp 1--35

    Journal ref: ACM Transactions on Reconfigurable Technology and Systems, Volume 15, Issue 2, June 2022

  39. arXiv:2010.11428  [pdf, other

    eess.AS cs.CL cs.LG

    Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition

    Authors: Qiujia Li, David Qiu, Yu Zhang, Bo Li, Yanzhang He, Philip C. Woodland, Liangliang Cao, Trevor Strohman

    Abstract: For various speech-related tasks, confidence scores from a speech recogniser are a useful measure to assess the quality of transcriptions. In traditional hidden Markov model-based automatic speech recognition (ASR) systems, confidence scores can be reliably obtained from word posteriors in decoding lattices. However, for an ASR system with an auto-regressive decoder, such as an attention-based seq… ▽ More

    Submitted 23 October, 2020; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: Submitted to ICASSP 2021

  40. arXiv:2008.05258  [pdf, other

    cs.CV cs.LG eess.IV

    Guided Collaborative Training for Pixel-wise Semi-Supervised Learning

    Authors: Zhanghan Ke, Di Qiu, Kaican Li, Qiong Yan, Rynson W. H. Lau

    Abstract: We investigate the generalization of semi-supervised learning (SSL) to diverse pixel-wise tasks. Although SSL methods have achieved impressive results in image classification, the performances of applying them to pixel-wise tasks are unsatisfactory due to their need for dense outputs. In addition, existing pixel-wise SSL approaches are only suitable for certain tasks as they usually require to use… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

    Comments: 16th European Conference on Computer Vision (ECCV 2020)

  41. arXiv:2008.05157  [pdf, other

    cs.CV cs.GR cs.LG

    Towards Geometry Guided Neural Relighting with Flash Photography

    Authors: Di Qiu, Jin Zeng, Zhanghan Ke, Wenxiu Sun, Chengxi Yang

    Abstract: Previous image based relighting methods require capturing multiple images to acquire high frequency lighting effect under different lighting conditions, which needs nontrivial effort and may be unrealistic in certain practical use scenarios. While such approaches rely entirely on cleverly sampling the color images under different lighting conditions, little has been done to utilize geometric infor… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

  42. arXiv:2008.01806  [pdf, other

    eess.IV cs.CV

    Fast Nonconvex $T_2^*$ Mapping Using ADMM

    Authors: Shuai Huang, James J. Lah, Jason W. Allen, Deqiang Qiu

    Abstract: Magnetic resonance (MR)-$T_2^*$ mapping is widely used to study hemorrhage, calcification and iron deposition in various clinical applications, it provides a direct and precise mapping of desired contrast in the tissue. However, the long acquisition time required by conventional 3D high-resolution $T_2^*$ mapping method causes discomfort to patients and introduces motion artifacts to reconstructed… ▽ More

    Submitted 4 August, 2020; originally announced August 2020.

  43. arXiv:2007.14564  [pdf, other

    cs.IT eess.SP

    Bayesian Massive MIMO Channel Estimation with Parameter Estimation Using Low-Resolution ADCs

    Authors: Shuai Huang, Deqiang Qiu, Trac D. Tran

    Abstract: In order to reduce hardware complexity and power consumption, massive multiple-input multiple-output (MIMO) systems employ low-resolution analog-to-digital converters (ADCs) to acquire quantized measurements $\boldsymbol y$. This poses new challenges to the channel estimation problem, and the sparse prior on the channel coefficient vector $\boldsymbol x$ in the angle domain is often used to compen… ▽ More

    Submitted 11 February, 2021; v1 submitted 28 July, 2020; originally announced July 2020.

  44. arXiv:2007.12942  [pdf, other

    cs.CV cs.LG

    Gradient Regularized Contrastive Learning for Continual Domain Adaptation

    Authors: Peng Su, Shixiang Tang, Peng Gao, Di Qiu, Ni Zhao, Xiaogang Wang

    Abstract: Human beings can quickly adapt to environmental changes by leveraging learning experience. However, the poor ability of adapting to dynamic environments remains a major challenge for AI models. To better understand this issue, we study the problem of continual domain adaptation, where the model is presented with a labeled source domain and a sequence of unlabeled target domains. There are two majo… ▽ More

    Submitted 25 July, 2020; originally announced July 2020.

  45. arXiv:2007.12858  [pdf, other

    cs.LG cs.CV stat.ML

    Modal Uncertainty Estimation via Discrete Latent Representation

    Authors: Di Qiu, Lok Ming Lui

    Abstract: Many important problems in the real world don't have unique solutions. It is thus important for machine learning models to be capable of proposing different plausible solutions with meaningful probability measures. In this work we introduce such a deep learning framework that learns the one-to-many mappings between the inputs and outputs, together with faithful uncertainty measures. We call our fr… ▽ More

    Submitted 25 July, 2020; originally announced July 2020.

  46. Partial Boolean functions with exact quantum 1-query complexity

    Authors: Guoliang Xu, Daowen Qiu

    Abstract: We provide two sufficient and necessary conditions to characterize any $n$-bit partial Boolean function with exact quantum 1-query complexity. Using the first characterization, we present all $n$-bit partial Boolean functions that depend on $n$ bits and have exact quantum 1-query complexity. Due to the second characterization, we construct a function $F$ that maps any $n$-bit partial Boolean funct… ▽ More

    Submitted 4 September, 2020; v1 submitted 21 July, 2020; originally announced July 2020.

    Comments: 11pages; comments are welcome

  47. Security Improvements of Several Basic Quantum Private Query Protocols with O(log N) Communication Complexity

    Authors: Fang Yu, Daowen Qiu, Xiaoming Wang, Qin Li, Lvzhou Li, Jozef Gruska

    Abstract: New quantum private database (with N elements) query protocols are presented and analyzed. Protocols preserve O(logN) communication complexity of known protocols for the same task, but achieve several significant improvements in security, especially concerning user privacy. For example, the randomized form of our protocol has a cheat-sensitive property - it allows the user to detect a dishonest da… ▽ More

    Submitted 27 May, 2020; originally announced May 2020.

    Journal ref: Theoretical Computer Science 807 (2020): 330-340

  48. arXiv:2004.01907  [pdf, other

    cs.CL cs.IR cs.LG

    Knowledge Guided Metric Learning for Few-Shot Text Classification

    Authors: Dianbo Sui, Yubo Chen, Binjie Mao, Delai Qiu, Kang Liu, Jun Zhao

    Abstract: The training of deep-learning-based text classification models relies heavily on a huge amount of annotation data, which is difficult to obtain. When the labeled data is scarce, models tend to struggle to achieve satisfactory performance. However, human beings can distinguish new categories very efficiently with few examples. This is mainly due to the fact that human beings can leverage knowledge… ▽ More

    Submitted 4 April, 2020; originally announced April 2020.

  49. arXiv:2003.07071  [pdf, other

    cs.CV

    Adapting Object Detectors with Conditional Domain Normalization

    Authors: Peng Su, Kun Wang, Xingyu Zeng, Shixiang Tang, Dapeng Chen, Di Qiu, Xiaogang Wang

    Abstract: Real-world object detectors are often challenged by the domain gaps between different datasets. In this work, we present the Conditional Domain Normalization (CDN) to bridge the domain gap. CDN is designed to encode different domain inputs into a shared latent space, where the features from different domains carry the same domain attribute. To achieve this, we first disentangle the domain-specific… ▽ More

    Submitted 22 July, 2020; v1 submitted 16 March, 2020; originally announced March 2020.

    Comments: Accepted at ECCV 2020

  50. arXiv:2003.01357  [pdf, other

    cs.CG cs.CV q-bio.QM

    Shape analysis via inconsistent surface registration

    Authors: Gary P. T. Choi, Di Qiu, Lok Ming Lui

    Abstract: In this work, we develop a framework for shape analysis using inconsistent surface mapping. Traditional landmark-based geometric morphometrics methods suffer from the limited degrees of freedom, while most of the more advanced non-rigid surface mapping methods rely on a strong assumption of the global consistency of two surfaces. From a practical point of view, given two anatomical surfaces with p… ▽ More

    Submitted 3 March, 2020; originally announced March 2020.

    Journal ref: Proceedings of the Royal Society A, 476(2242), 20200147 (2020)