Showing 1–50 of 126 results for author: Zhao, M

Searching in archive eess.
  1. arXiv:2408.02943  [pdf, other]

    eess.SP

    Recent Advances in Data-driven Intelligent Control for Wireless Communication: A Comprehensive Survey

    Authors: Wei Huo, Huiwen Yang, Nachuan Yang, Zhaohua Yang, Jiuzhou Zhang, Fuhai Nan, Xingzhou Chen, Yifan Mao, Suyang Hu, Pengyu Wang, Xuanyu Zheng, Mingming Zhao, Ling Shi

    Abstract: The advent of next-generation wireless communication systems heralds an era characterized by high data rates, low latency, massive connectivity, and superior energy efficiency. These systems necessitate innovative and adaptive strategies for resource allocation and device behavior control in wireless networks. Traditional optimization-based methods have been found inadequate in meeting the complex…

    Submitted 6 August, 2024; originally announced August 2024.

  2. arXiv:2407.21394  [pdf, other]

    eess.IV cs.CV

    Force Sensing Guided Artery-Vein Segmentation via Sequential Ultrasound Images

    Authors: Yimeng Geng, Gaofeng Meng, Mingcong Chen, Guanglin Cao, Mingyang Zhao, Jianbo Zhao, Hongbin Liu

    Abstract: Accurate identification of arteries and veins in ultrasound images is crucial for vascular examinations and interventions in robotics-assisted surgeries. However, current methods for ultrasound vessel segmentation face challenges in distinguishing between arteries and veins due to their morphological similarities. To address this challenge, this study introduces a novel force sensing guided segmen…

    Submitted 31 July, 2024; originally announced July 2024.

  3. arXiv:2407.05619  [pdf, other]

    cs.RO eess.SY

    AIRA: A Low-cost IR-based Approach Towards Autonomous Precision Drone Landing and NLOS Indoor Navigation

    Authors: Yanchen Liu, Minghui Zhao, Kaiyuan Hou, Junxi Xia, Charlie Carver, Stephen Xia, Xia Zhou, Xiaofan Jiang

    Abstract: Automatic drone landing is an important step for achieving fully autonomous drones. Although there are many works that leverage GPS, video, wireless signals, and active acoustic sensing to perform precise landing, autonomous drone landing remains an unsolved challenge for palm-sized microdrones that may not be able to support the high computational requirements of vision, wireless, or active audio…

    Submitted 8 July, 2024; originally announced July 2024.

  4. arXiv:2406.17672  [pdf, other]

    cs.SD eess.AS

    SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond

    Authors: Marco Comunità, Zhi Zhong, Akira Takahashi, Shiqi Yang, Mengjie Zhao, Koichi Saito, Yukara Ikemiya, Takashi Shibuya, Shusuke Takahashi, Yuki Mitsufuji

    Abstract: Recent advances in generative models that iteratively synthesize audio clips sparked great success to text-to-audio synthesis (TTA), but with the cost of slow synthesis speed and heavy computation. Although there have been attempts to accelerate the iterative procedure, high-quality TTA systems remain inefficient due to hundreds of iterations required in the inference phase and large amount of mod…

    Submitted 26 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

    Comments: 6 pages, 8 figures, 8 tables. Audio samples: https://zzaudio.github.io/SpecMaskGIT/index.html

  5. arXiv:2406.08305  [pdf, other]

    cs.NI eess.SP

    Large Language Model(LLM) assisted End-to-End Network Health Management based on Multi-Scale Semanticization

    Authors: Fengxiao Tang, Xiaonan Wang, Xun Yuan, Linfeng Luo, Ming Zhao, Nei Kato

    Abstract: Network device and system health management is the foundation of modern network operations and maintenance. Traditional health management methods, relying on expert identification or simple rule-based algorithms, struggle to cope with the dynamic heterogeneous networks (DHNs) environment. Moreover, current state-of-the-art distributed anomaly detection methods, which utilize specific machine learn…

    Submitted 12 June, 2024; originally announced June 2024.

  6. arXiv:2405.19516  [pdf, other]

    eess.SP cs.CV cs.LG cs.RO

    Enabling Visual Recognition at Radio Frequency

    Authors: Haowen Lai, Gaoxiang Luo, Yifei Liu, Mingmin Zhao

    Abstract: This paper introduces PanoRadar, a novel RF imaging system that brings RF resolution close to that of LiDAR, while providing resilience against conditions challenging for optical signals. Our LiDAR-comparable 3D imaging results enable, for the first time, a variety of visual recognition tasks at radio frequency, including surface normal estimation, semantic segmentation, and object detection. Pano…

    Submitted 29 May, 2024; originally announced May 2024.

  7. arXiv:2405.16791  [pdf, ps, other]

    cs.IT eess.SP

    Joint Node Selection and Resource Allocation Optimization for Cooperative Sensing with a Shared Wireless Backhaul

    Authors: Mingxin Chen, Ming-Min Zhao, An Liu, Min Li, Qingjiang Shi

    Abstract: In this paper, we consider a cooperative sensing framework in the context of future multi-functional network with both communication and sensing ability, where one base station (BS) serves as a sensing transmitter and several nearby BSs serve as sensing receivers. Each receiver receives the sensing signal reflected by the target and communicates with the fusion center (FC) through a wireless multi…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 13 pages, 10 figures

  8. arXiv:2405.14598  [pdf, other]

    cs.CV cs.LG cs.MM cs.SD eess.AS

    Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation

    Authors: Shiqi Yang, Zhi Zhong, Mengjie Zhao, Shusuke Takahashi, Masato Ishii, Takashi Shibuya, Yuki Mitsufuji

    Abstract: In recent years, with the realistic generation results and a wide range of personalized applications, diffusion-based generative models gain huge attention in both visual and audio generation areas. Compared to the considerable advancements of text2image or text2audio generation, research in audio2visual or visual2audio generation has been relatively slow. The recent audio-visual generation method…

    Submitted 24 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 10 pages

  9. Design, Control, and Motion-Planning for a Root-Perching Rotor-Distributed Manipulator

    Authors: Takuzumi Nishio, Moju Zhao, Kei Okada, Masayuki Inaba

    Abstract: Manipulation performance improvement is crucial for aerial robots. For aerial manipulators, the baselink position and attitude errors directly affect the precision at the end effector. To address this stability problem, fixed-body approaches such as perching on the environment using the rotor suction force are useful. Additionally, conventional arm-equipped multirotors, called rotor-concentrated m…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: IEEE Transactions on Robotics (2023)

  10. arXiv:2405.04027  [pdf, other]

    eess.SP

    Joint Visibility Region Detection and Channel Estimation for XL-MIMO Systems via Alternating MAP

    Authors: Wenkang Xu, An Liu, Min-jian Zhao

    Abstract: We investigate a joint visibility region (VR) detection and channel estimation problem in extremely large-scale multiple-input-multiple-output (XL-MIMO) systems, where near-field propagation and spatial non-stationary effects exist. In this case, each scatterer can only see a subset of antennas, i.e., it has a certain VR over the antennas. Because of the spatial correlation among adjacent sub-arra…

    Submitted 21 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

    Comments: 13 pages, 14 figures, submitted to IEEE TSP

  11. arXiv:2405.01242  [pdf, other]

    cs.SD cs.AI cs.LG eess.AS

    TRAMBA: A Hybrid Transformer and Mamba Architecture for Practical Audio and Bone Conduction Speech Super Resolution and Enhancement on Mobile and Wearable Platforms

    Authors: Yueyuan Sui, Minghui Zhao, Junxi Xia, Xiaofan Jiang, Stephen Xia

    Abstract: We propose TRAMBA, a hybrid transformer and Mamba architecture for acoustic and bone conduction speech enhancement, suitable for mobile and wearable platforms. Bone conduction speech enhancement has been impractical to adopt in mobile and wearable platforms for several reasons: (i) data collection is labor-intensive, resulting in scarcity; (ii) there exists a performance gap between state of-art m…

    Submitted 29 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  12. arXiv:2403.10873  [pdf, other]

    cs.IT eess.SP

    CSI Transfer From Sub-6G to mmWave: Reduced-Overhead Multi-User Hybrid Beamforming

    Authors: Weicao Deng, Min Li, Ming-Min Zhao, Min-Jian Zhao, Osvaldo Simeone

    Abstract: Hybrid beamforming is vital in modern wireless systems, especially for massive MIMO and millimeter-wave deployments, offering efficient directional transmission with reduced hardware complexity. However, effective beamforming in multi-user scenarios relies heavily on accurate channel state information, the acquisition of which often incurs excessive pilot overhead, degrading system performance. To…

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 13 pages, 12 figures, submitted

  13. arXiv:2402.03042  [pdf, other]

    eess.SP

    Semi-Passive Intelligent Reflecting Surface Enabled Sensing Systems

    Authors: Qiaoyan Peng, Qingqing Wu, Wen Chen, Shaodan Ma, Ming-Min Zhao, Octavia A. Dobre

    Abstract: Intelligent reflecting surface (IRS) has garnered growing interest and attention due to its potential for facilitating and supporting wireless communications and sensing. This paper studies a semi-passive IRS-enabled sensing system, where an IRS consists of both passive reflecting elements and active sensors. Our goal is to minimize the Cramér-Rao bound (CRB) for parameter estimation under both po…

    Submitted 5 February, 2024; originally announced February 2024.

  14. arXiv:2311.12745  [pdf, other]

    cs.NI eess.SP

    Learn to Augment Network Simulators Towards Digital Network Twins

    Authors: Yuru Zhang, Ming Zhao, Qiang Liu

    Abstract: Digital network twin (DNT) is a promising paradigm to replicate real-world cellular networks toward continual assessment, proactive management, and what-if analysis. Existing discussions have been focusing on using only deep learning techniques to build DNTs, which raises widespread concerns regarding their generalization, explainability, and transparency. In this paper, we explore an alternative…

    Submitted 21 November, 2023; originally announced November 2023.

  15. arXiv:2311.08201  [pdf, other]

    eess.SP

    Joint Location Sensing and Channel Estimation for IRS-Aided mmWave ISAC Systems

    Authors: Zijian Chen, Ming-Min Zhao, Min Li, Fan Xu, Qingqing Wu, Min-Jian Zhao

    Abstract: In this paper, we investigate a self-sensing intelligent reflecting surface (IRS) aided millimeter wave (mmWave) integrated sensing and communication (ISAC) system. Unlike the conventional purely passive IRS, the self-sensing IRS can effectively reduce the path loss of sensing-related links, thus rendering it advantageous in ISAC systems. Aiming to jointly sense the target/scatterer/user positions…

    Submitted 14 November, 2023; originally announced November 2023.

  16. arXiv:2311.08188  [pdf, ps, other]

    cs.IT eess.SP

    Fast List Decoding of High-Rate Polar Codes

    Authors: Yang Lu, Ming-Min Zhao, Ming Lei, Min-Jian Zhao

    Abstract: Due to the ability to provide superior error-correction performance, the successive cancellation list (SCL) algorithm is widely regarded as one of the most promising decoding algorithms for polar codes with short-to-moderate code lengths. However, the application of SCL decoding in low-latency communication scenarios is limited due to its sequential nature. To reduce the decoding latency, developi…

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 13 pages, 8 figures

  17. arXiv:2310.13267  [pdf, other]

    cs.CL cs.CV cs.LG cs.SD eess.AS

    On the Language Encoder of Contrastive Cross-modal Models

    Authors: Mengjie Zhao, Junya Ono, Zhi Zhong, Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Wei-Hsiang Liao, Takashi Shibuya, Hiromi Wakaki, Yuki Mitsufuji

    Abstract: Contrastive cross-modal models such as CLIP and CLAP aid various vision-language (VL) and audio-language (AL) tasks. However, there has been limited investigation of and improvement in their language encoder, which is the central component of encoding natural language descriptions of image/audio into vector representations. We extensively evaluate how unsupervised and supervised sentence embedding…

    Submitted 20 October, 2023; originally announced October 2023.

  18. arXiv:2310.05382  [pdf, other]

    eess.SP

    A Stochastic Particle Variational Bayesian Inference Inspired Deep-Unfolding Network for Non-Convex Parameter Estimation

    Authors: Zhixiang Hu, An Liu, Minjian Zhao

    Abstract: Future wireless networks are envisioned to provide ubiquitous sensing services, which also gives rise to a substantial demand for high-dimensional non-convex parameter estimation, i.e., the associated likelihood function is non-convex and contains numerous local optima. Variational Bayesian inference (VBI) provides a powerful tool for modeling complex estimation problems and reasoning with prior i…

    Submitted 8 October, 2023; originally announced October 2023.

  19. arXiv:2309.04508  [pdf, other]

    cs.LG cs.AI eess.SP

    Spatial-Temporal Graph Attention Fuser for Calibration in IoT Air Pollution Monitoring Systems

    Authors: Keivan Faghih Niresi, Mengjie Zhao, Hugo Bissig, Henri Baumann, Olga Fink

    Abstract: The use of Internet of Things (IoT) sensors for air pollution monitoring has significantly increased, resulting in the deployment of low-cost sensors. Despite this advancement, accurately calibrating these sensors in uncontrolled environmental conditions remains a challenge. To address this, we propose a novel approach that leverages graph neural networks, specifically the graph attention network…

    Submitted 8 September, 2023; originally announced September 2023.

  20. arXiv:2309.03114  [pdf, ps, other]

    eess.SP

    NUV-DoA: NUV Prior-based Bayesian Sparse Reconstruction with Spatial Filtering for Super-Resolution DoA Estimation

    Authors: Mengyuan Zhao, Guy Revach, Tirza Routtenberg, Nir Shlezinger

    Abstract: Achieving high-resolution Direction of Arrival (DoA) recovery typically requires high Signal to Noise Ratio (SNR) and a sufficiently large number of snapshots. This paper presents NUV-DoA algorithm, that augments Bayesian sparse reconstruction with spatial filtering for super-resolution DoA estimation. By modeling each direction on the azimuth's grid with the sparsity-promoting normal with unknown…

    Submitted 25 December, 2023; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: This paper has 5 pages including reference, 11 figures. This paper has been accepted to ICASSP 2024 - 2024 International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

  21. arXiv:2308.13996  [pdf]

    cs.LG eess.SY

    Improve in-situ life prediction and classification performance by capturing both the present state and evolution rate of battery aging

    Authors: Mingyuan Zhao, Yongzhi Zhang

    Abstract: This study develops a methodology by capturing both the battery aging state and degradation rate for improved life prediction performance. The aging state is indicated by six physical features of an equivalent circuit model that are extracted from the voltage relaxation data. And the degradation rate is captured by two features extracted from the differences between the voltage relaxation curves w…

    Submitted 26 August, 2023; originally announced August 2023.

  22. arXiv:2308.09349  [pdf, other]

    eess.SP

    Intelligent Reflecting Surface Aided Multi-Tier Hybrid Computing

    Authors: Yapeng Zhao, Qingqing Wu, Guangji Chen, Wen Chen, Ruiqi Liu, Ming-Min Zhao, Yuan Wu, Shaodan Ma

    Abstract: The digital twin edge network (DITEN) aims to integrate mobile edge computing (MEC) and digital twin (DT) to provide real-time system configuration and flexible resource allocation for the sixth-generation network. This paper investigates an intelligent reflecting surface (IRS)-aided multi-tier hybrid computing system that can achieve mutual benefits for DT and MEC in the DITEN. For the first time…

    Submitted 25 October, 2023; v1 submitted 18 August, 2023; originally announced August 2023.

  23. arXiv:2307.09149  [pdf, other]

    eess.SP

    Successive Linear Approximation VBI for Joint Sparse Signal Recovery and Dynamic Grid Parameters Estimation

    Authors: Wenkang Xu, An Liu, Bingpeng Zhou, Minjian Zhao

    Abstract: For many practical applications in wireless communications, we need to recover a structured sparse signal from a linear observation model with dynamic grid parameters in the sensing matrix. Conventional expectation maximization (EM)-based compressed sensing (CS) methods, such as turbo compressed sensing (Turbo-CS) and turbo variational Bayesian inference (Turbo-VBI), have double-loop iterations, w…

    Submitted 12 November, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: 14 pages, 15 figures, submitted to IEEE Transactions on Wireless Communications

  24. Theoretical Analysis of Binary Masks in Snapshot Compressive Imaging Systems

    Authors: Mengyu Zhao, Shirin Jalali

    Abstract: Snapshot compressive imaging (SCI) systems have gained significant attention in recent years. While previous theoretical studies have primarily focused on the performance analysis of Gaussian masks, practical SCI systems often employ binary-valued masks. Furthermore, recent research has demonstrated that optimized binary masks can significantly enhance system performance. In this paper, we present…

    Submitted 15 July, 2023; originally announced July 2023.

  25. arXiv:2307.01486  [pdf, other]

    eess.IV cs.CV

    H-DenseFormer: An Efficient Hybrid Densely Connected Transformer for Multimodal Tumor Segmentation

    Authors: Jun Shi, Hongyu Kan, Shulan Ruan, Ziqi Zhu, Minfan Zhao, Liang Qiao, Zhaohui Wang, Hong An, Xudong Xue

    Abstract: Recently, deep learning methods have been widely used for tumor segmentation of multimodal medical images with promising results. However, most existing methods are limited by insufficient representational ability, specific modality number and high computational complexity. In this paper, we propose a hybrid densely connected network for tumor segmentation, named H-DenseFormer, which combines the…

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: 11 pages, 2 figures. This paper has been accepted by Medical Image Computing and Computer-Assisted Intervention(MICCAI) 2023

  26. arXiv:2307.00269  [pdf, other]

    cs.CV eess.IV

    AE-RED: A Hyperspectral Unmixing Framework Powered by Deep Autoencoder and Regularization by Denoising

    Authors: Min Zhao, Jie Chen, Nicolas Dobigeon

    Abstract: Spectral unmixing has been extensively studied with a variety of methods and used in many applications. Recently, data-driven techniques with deep learning methods have obtained great attention to spectral unmixing for its superior learning ability to automatically learn the structure information. In particular, autoencoder based architectures are elaborately designed to solve blind unmixing and m…

    Submitted 1 July, 2023; originally announced July 2023.

  27. arXiv:2306.17197  [pdf, other]

    eess.IV cs.LG

    Guided Deep Generative Model-based Spatial Regularization for Multiband Imaging Inverse Problems

    Authors: Min Zhao, Nicolas Dobigeon, Jie Chen

    Abstract: When adopting a model-based formulation, solving inverse problems encountered in multiband imaging requires to define spatial and spectral regularizations. In most of the works of the literature, spectral information is extracted from the observations directly to derive data-driven spectral priors. Conversely, the choice of the spatial regularization often boils down to the use of conventional pen…

    Submitted 28 June, 2023; originally announced June 2023.

  28. arXiv:2306.05775  [pdf, other]

    cs.LG eess.SP

    Weight Freezing: A Regularization Approach for Fully Connected Layers with an Application in EEG Classification

    Authors: Zhengqing Miao, Meirong Zhao

    Abstract: In the realm of EEG decoding, enhancing the performance of artificial neural networks (ANNs) carries significant potential. This study introduces a novel approach, termed "weight freezing", that is anchored on the principles of ANN regularization and neuroscience prior knowledge. The concept of weight freezing revolves around the idea of reducing certain neurons' influence on the decision-making p…

    Submitted 11 June, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: 16 pages, 5 figures

  29. arXiv:2306.05704  [pdf, other]

    cs.CV cs.MM eess.IV

    Exploring Effective Mask Sampling Modeling for Neural Image Compression

    Authors: Lin Liu, Mingming Zhao, Shanxin Yuan, Wenlong Lyu, Wengang Zhou, Houqiang Li, Yanfeng Wang, Qi Tian

    Abstract: Image compression aims to reduce the information redundancy in images. Most existing neural image compression methods rely on side information from hyperprior or context models to eliminate spatial redundancy, but rarely address the channel redundancy. Inspired by the mask sampling modeling in recent self-supervised learning methods for natural language processing and high-level vision, we propose…

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: 10 pages

  30. arXiv:2304.14508  [pdf]

    eess.IV cs.CV cs.LG

    3D Brainformer: 3D Fusion Transformer for Brain Tumor Segmentation

    Authors: Rui Nian, Guoyao Zhang, Yao Sui, Yuqi Qian, Qiuying Li, Mingzhang Zhao, Jianhui Li, Ali Gholipour, Simon K. Warfield

    Abstract: Magnetic resonance imaging (MRI) is critically important for brain mapping in both scientific research and clinical studies. Precise segmentation of brain tumors facilitates clinical diagnosis, evaluations, and surgical planning. Deep learning has recently emerged to improve brain tumor segmentation and achieved impressive results. Convolutional architectures are widely used to implement those neu…

    Submitted 27 April, 2023; originally announced April 2023.

    Comments: 10 pages, 4 figures

    MSC Class: 68T07 ACM Class: I.4.6; I.5.1

  31. arXiv:2304.01461  [pdf, other]

    cs.LG cs.AI eess.SP

    Time-space-frequency feature Fusion for 3-channel motor imagery classification

    Authors: Zhengqing Miao, Meirong Zhao

    Abstract: Low-channel EEG devices are crucial for portable and entertainment applications. However, the low spatial resolution of EEG presents challenges in decoding low-channel motor imagery. This study introduces TSFF-Net, a novel network architecture that integrates time-space-frequency features, effectively compensating for the limitations of single-mode feature extraction networks based on time-series…

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 15 pages, 4 Figures

  32. arXiv:2303.16407  [pdf, other]

    cs.LG cs.AI eess.SP q-bio.NC

    LMDA-Net:A lightweight multi-dimensional attention network for general EEG-based brain-computer interface paradigms and interpretability

    Authors: Zhengqing Miao, Xin Zhang, Meirong Zhao, Dong Ming

    Abstract: EEG-based recognition of activities and states involves the use of prior neuroscience knowledge to generate quantitative EEG features, which may limit BCI performance. Although neural network-based methods can effectively extract features, they often encounter issues such as poor generalization across datasets, high predicting volatility, and low model interpretability. Hence, we propose a novel l…

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: 20 pages, 7 Figures

  33. arXiv:2302.12368  [pdf, other]

    eess.SY

    Power System Recovery Coordinated with (Non-)Black-Start Generators

    Authors: Meng Zhao, Patrick R. Maloney, Xinda Ke, Juan Carlos Bedoya Ceballos, Xiaoyuan Fan, Marcelo A. Elizondo

    Abstract: Power restoration is an urgent task after a black-out, and recovery efficiency is critical when quantifying system resilience. Multiple elements should be considered to restore the power system quickly and safely. This paper proposes a recovery model to solve a direct-current optimal power flow (DCOPF) based on mixed-integer linear programming (MILP). Since most of the generators cannot start inde…

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: 5 pages, 6 figures

  34. arXiv:2302.02587  [pdf, other]

    eess.SP

    Joint Scattering Environment Sensing and Channel Estimation Based on Non-stationary Markov Random Field

    Authors: Wenkang Xu, Yongbo Xiao, An Liu, Ming Lei, Minjian Zhao

    Abstract: This paper considers an integrated sensing and communication system, where some radar targets also serve as communication scatterers. A location domain channel modeling method is proposed based on the position of targets and scatterers in the scattering environment, and the resulting radar and communication channels exhibit a two-dimensional (2-D) joint burst sparsity. We propose a joint scatterin…

    Submitted 18 July, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: 15 pages, 13 figures, submitted to IEEE Transactions on Wireless Communications

  35. arXiv:2302.01619  [pdf, other]

    eess.SP

    Joint Scattering Environment Sensing and Channel Estimation for Integrated Sensing and Communication

    Authors: Wenkang Xu, Yongbo Xiao, An Liu, Minjian Zhao

    Abstract: This paper considers an integrated sensing and communication system, where some radar targets also serve as communication scatterers. A location domain channel modeling method is proposed based on the position of targets and scatterers in the scattering environment, and the resulting radar and communication channels exhibit a partially common sparsity. By exploiting this, we propose a joint scatte…

    Submitted 3 February, 2023; originally announced February 2023.

  36. arXiv:2302.00953  [pdf]

    eess.IV cs.CV cs.LG

    Deep-Learning Tool for Early Identifying Non-Traumatic Intracranial Hemorrhage Etiology based on CT Scan

    Authors: Meng Zhao, Yifan Hu, Ruixuan Jiang, Yuanli Zhao, Dong Zhang, Yan Zhang, Rong Wang, Yong Cao, Qian Zhang, Yonggang Ma, Jiaxi Li, Shaochen Yu, Wenjie Li, Ran Zhang, Yefeng Zheng, Shuo Wang, Jizong Zhao

    Abstract: Background: To develop an artificial intelligence system that can accurately identify acute non-traumatic intracranial hemorrhage (ICH) etiology based on non-contrast CT (NCCT) scans and investigate whether clinicians can benefit from it in a diagnostic setting. Materials and Methods: The deep learning model was developed with 1868 eligible NCCT scans with non-traumatic ICH collected between Janua…

    Submitted 2 February, 2023; originally announced February 2023.

  37. arXiv:2301.13507  [pdf, ps, other]

    cs.IR cs.LG cs.SD eess.AS

    An Analysis of Classification Approaches for Hit Song Prediction using Engineered Metadata Features with Lyrics and Audio Features

    Authors: Mengyisong Zhao, Morgan Harvey, David Cameron, Frank Hopfgartner, Valerie J. Gillet

    Abstract: Hit song prediction, one of the emerging fields in music information retrieval (MIR), remains a considerable challenge. Being able to understand what makes a given song a hit is clearly beneficial to the whole music industry. Previous approaches to hit song prediction have focused on using audio features of a record. This study aims to improve the prediction result of the top 10 hits among Billboa…

    Submitted 31 January, 2023; originally announced January 2023.

  38. arXiv:2211.16666  [pdf, ps, other]

    cs.IT eess.SP

    Secrecy Rate Maximization of RIS-assisted SWIPT Systems: A Two-Timescale Beamforming Design Approach

    Authors: Ming-Min Zhao, Kaidi Xu, Yunlong Cai, Yong Niu, Lajos Hanzo

    Abstract: Reconfigurable intelligent surfaces (RISs) achieve high passive beamforming gains for signal enhancement or interference nulling by dynamically adjusting their reflection coefficients. Their employment is particularly appealing for improving both the wireless security and the efficiency of radio frequency (RF)-based wireless power transfer. Motivated by this, we conceive and investigate a RIS-assi…

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: 16 pages, 12 figures, accepted for publication in IEEE Transactions on Wireless Communications

  39. arXiv:2211.12082  [pdf, other]

    cs.CV cs.LG eess.IV

    Brain MRI-to-PET Synthesis using 3D Convolutional Attention Networks

    Authors: Ramy Hussein, David Shin, Moss Zhao, Jia Guo, Guido Davidzon, Michael Moseley, Greg Zaharchuk

    Abstract: Accurate quantification of cerebral blood flow (CBF) is essential for the diagnosis and assessment of a wide range of neurological diseases. Positron emission tomography (PET) with radiolabeled water (15O-water) is considered the gold-standard for the measurement of CBF in humans. PET imaging, however, is not widely available because of its prohibitive costs and use of short-lived radiopharmaceuti…

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: 19 pages, 14 figures

  40. arXiv:2210.16197  [pdf]

    eess.SP

    Dimensionality Reduced Antenna Array for Beamforming/steering

    Authors: Shiyi Xia, Mingyang Zhao, Qian Ma, Xunnan Zhang, Ling Yang, Yazhi Pi, Hyunchul Chung, Ad Reniers, A. M. J. Koonen, Zizheng Cao

    Abstract: Beamforming makes possible a focused communication method. It is extensively employed in many disciplines involving electromagnetic waves, including arrayed ultrasonic, optical, and high-speed wireless communication. Conventional beam steering often requires the addition of separate active amplitude phase control units after each radiating element. The high power consumption and complexity of larg…

    Submitted 28 October, 2022; originally announced October 2022.

  41. arXiv:2210.14725  [pdf, other]

    cs.CL cs.SD eess.AS

    Linguistic-Enhanced Transformer with CTC Embedding for Speech Recognition

    Authors: Xulong Zhang, Jianzong Wang, Ning Cheng, Mengyuan Zhao, Zhiyong Zhang, Jing Xiao

    Abstract: The recent emergence of joint CTC-Attention model shows significant improvement in automatic speech recognition (ASR). The improvement largely lies in the modeling of linguistic information by decoder. The decoder joint-optimized with an acoustic encoder renders the language model from ground-truth sequences in an auto-regressive manner during training. However, the training corpus of the decoder…

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: Accepted by ECAISS2022, The Fourth International Workshop on Edge Computing and Artificial Intelligence based Sensor-Cloud System

  42. arXiv:2210.08483  [pdf, ps, other]

    eess.SY math.OC

    Direct Computing on Control Capability for Linear Continuous-time Systems Based on Hurwitz Matrix

    Authors: Mingwang Zhao

    Abstract: In this paper, based on the controllable canonical form and the Hurwitz matrix of the Hurwitz stability criterion, an analytical volume computing method for the smooth controllability zonotope for the linear continuous-time (LCT) systems, without the help of the eigenvalue computing of the systems, is presented. And then, the computing method is generalized to the volume computing of the controllabil…

    Submitted 16 October, 2022; originally announced October 2022.

    Comments: 16 pages

  43. arXiv:2210.08480  [pdf, ps, other]

    eess.SY math.OC

    Analytical Volume Analysis for the Finite-time Controllable Region of the Linear Discrete-time Systems

    Authors: Mingwang Zhao

    Abstract: In this paper, the works on the analytical volume analysis for the controllable regions of the linear discrete-time (LDT) systems in papers \cite{zhaomw202001} and \cite{zhaomw202004} are discussed further, and a new theorem on the analytical computing for the finite-time controllability zonotope (controllable region) of LDT systems is proven. Then, three analytical factors describing the con…

    Submitted 16 October, 2022; originally announced October 2022.

    Comments: 27 pages

  44. arXiv:2210.06111  [pdf, ps, other]

    cs.SD cs.AI eess.AS eess.SP

    THUEE system description for NIST 2020 SRE CTS challenge

    Authors: Yu Zheng, Jinghan Peng, Miao Zhao, Yufeng Ma, Min Liu, Xinyue Ma, Tianyu Liang, Tianlong Kong, Liang He, Minqiang Xu

    Abstract: This paper presents the system description of the THUEE team for the NIST 2020 Speaker Recognition Evaluation (SRE) conversational telephone speech (CTS) challenge. The subsystems including ResNet74, ResNet152, and RepVGG-B2 are developed as speaker embedding extractors in this evaluation. We used combined AM-Softmax and AAM-Softmax based loss functions, namely CM-Softmax. We adopted a two-staged…

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: 3 pages, 1 table; System description of NIST 2020 SRE CTS challenge

  45. arXiv:2208.12133  [pdf, other]

    cs.HC cs.AI cs.MM cs.SD eess.AS

    The ReprGesture entry to the GENEA Challenge 2022

    Authors: Sicheng Yang, Zhiyong Wu, Minglei Li, Mengchen Zhao, Jiuxin Lin, Liyang Chen, Weihong Bao

    Abstract: This paper describes the ReprGesture entry to the Generation and Evaluation of Non-verbal Behaviour for Embodied Agents (GENEA) challenge 2022. The GENEA challenge provides the processed datasets and performs crowdsourced evaluations to compare the performance of different gesture generation systems. In this paper, we explore an automatic gesture generation system based on multimodal representatio…

    Submitted 25 August, 2022; originally announced August 2022.

    Comments: 8 pages, 4 figures, ICMI 2022

  46. Automatic reorientation by deep learning to generate short axis SPECT myocardial perfusion images

    Authors: Fubao Zhu, Guojie Wang, Chen Zhao, Saurabh Malhotra, Min Zhao, Zhuo He, Jianzhou Shi, Zhixin Jiang, Weihua Zhou

    Abstract: Single photon emission computed tomography (SPECT) myocardial perfusion images (MPI) can be displayed both in traditional short-axis (SA) cardiac planes and polar maps for interpretation and quantification. It is essential to reorient the reconstructed transaxial SPECT MPI into standard SA slices. This study aims to develop a deep-learning-based approach for automatic reorientation of MPI. Met…

    Submitted 7 August, 2022; originally announced August 2022.

    Comments: 27 pages, 7 figures

  47. arXiv:2207.10427  [pdf, other]

    eess.SP

    A Two-stage Multiband WiFi Sensing Scheme via Stochastic Particle-Based Variational Bayesian Inference

    Authors: Zhixiang Hu, An Liu, Yubo Wan, Tony Xiao Han, Minjian Zhao

    Abstract: Multiband fusion enhances WiFi sensing by jointly utilizing signals from multiple non-contiguous frequency bands. However, in the multi-band WiFi sensing signal model, there are many local optima in the associated likelihood function due to the existence of high-frequency components and phase distortion factors, posing challenges for high-accuracy parameter estimation. To address this, we propose…

    Submitted 9 October, 2023; v1 submitted 21 July, 2022; originally announced July 2022.

  48. arXiv:2207.08123  [pdf, ps, other]

    eess.SP

    Latency Minimization for mmWave D2D Mobile Edge Computing Systems: Joint Task Allocation and Hybrid Beamforming Design

    Authors: Yanzhen Liu, Yunlong Cai, An Liu, Minjian Zhao, Lajos Hanzo

    Abstract: Mobile edge computing (MEC) and millimeter wave (mmWave) communications are capable of significantly reducing the network's delay and enhancing its capacity. In this paper we investigate a mmWave and device-to-device (D2D) assisted MEC system, in which user A carries out some computational tasks and shares the results with user B with the aid of a base station (BS). We propose a novel two-timescal…

    Submitted 17 July, 2022; originally announced July 2022.

  49. Integration of Physics-Based and Data-Driven Models for Hyperspectral Image Unmixing

    Authors: Jie Chen, Min Zhao, Xiuheng Wang, Cédric Richard, Susanto Rahardja

    Abstract: Spectral unmixing is one of the most important quantitative analysis tasks in hyperspectral data processing. Conventional physics-based models are characterized by clear interpretation. However, they may not be suitable for analyzing scenes with unknown complex physical characteristics. Data-driven methods have developed rapidly in recent years, in particular deep learning methods because they poss…

    Submitted 27 August, 2022; v1 submitted 11 June, 2022; originally announced June 2022.

    Comments: IEEE Signal Process. Mag., to be published. Manuscript submitted March 14, 2022; revised June 25, 2022 and July 27, 2022; accepted August 27, 2022

  50. arXiv:2204.12115  [pdf, ps, other]

    cs.IT eess.SP

    Fast Successive-Cancellation Decoding of Polar Codes with Sequence Nodes

    Authors: Yang Lu, Ming-Min Zhao, Ming Lei, Min-Jian Zhao

    Abstract: Due to the sequential nature of the successive-cancellation (SC) algorithm, the decoding of polar codes suffers from significant decoding latencies. Fast SC decoding is able to speed up the SC decoding process, by implementing parallel decoders at the intermediate levels of the SC decoding tree for some special nodes with specific information and frozen bit patterns. To further improve the paralle…

    Submitted 18 November, 2022; v1 submitted 26 April, 2022; originally announced April 2022.

    Comments: 30 pages, 6 figures, submitted for possible journal publication