Skip to main content

Showing 1–50 of 215 results for author: Yu, H

Searching in archive eess. Search in all archives.
.
  1. arXiv:2408.06288  [pdf, ps, other

    cs.IT eess.SP

    RIS-Aided Free-Space Optics Communications in A2G Networks over Inverted Gamma-Gamma Turbulent Channels

    Authors: Md. Abdur Rakib, Md. Ibrahim, A. S. M. Badrudduza, Imran Shafique Ansari, Md. Shahid Uz Zaman, Heejung Yu

    Abstract: With the advent of sixth-generation networks, reconfigurable intelligent surfaces (RISs) have revolutionized wireless communications through dynamic electromagnetic wave manipulation, thereby facilitating the adaptability and unparalleled control of real-time performance evaluations. This study proposed a framework to analyze the performance of RIS-assisted free-space optics (FSO) communication ov… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

  2. arXiv:2408.05776  [pdf

    cs.NI eess.SP

    Convergence of Symbiotic Communications and Blockchain for Sustainable and Trustworthy 6G Wireless Networks

    Authors: Haoxiang Luo, Gang Sun, Cheng Chi, Hongfang Yu, Mohsen Guizani

    Abstract: Symbiotic communication (SC) is known as a new wireless communication paradigm, similar to the natural ecosystem population, and can enable multiple communication systems to cooperate and mutualize through service exchange and resource sharing. As a result, SC is seen as an important potential technology for future sixth-generation (6G) communications, solving the problem of lack of spectrum resou… ▽ More

    Submitted 11 August, 2024; originally announced August 2024.

  3. arXiv:2408.00365  [pdf, other

    cs.AI cs.CV eess.IV

    Multimodal Fusion and Coherence Modeling for Video Topic Segmentation

    Authors: Hai Yu, Chong Deng, Qinglin Zhang, Jiaqing Liu, Qian Chen, Wen Wang

    Abstract: The video topic segmentation (VTS) task segments videos into intelligible, non-overlapping topics, facilitating efficient comprehension of video content and quick access to specific content. VTS is also critical to various downstream video understanding tasks. Traditional VTS methods using shallow features or unsupervised approaches struggle to accurately discern the nuances of topical transitions… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

  4. arXiv:2407.20427  [pdf, other

    cs.CV eess.IV

    Mean Opinion Score as a New Metric for User-Evaluation of XAI Methods

    Authors: Hyeon Yu, Jenny Benois-Pineau, Romain Bourqui, Romain Giot, Alexey Zhukov

    Abstract: This paper investigates the use of Mean Opinion Score (MOS), a common image quality metric, as a user-centric evaluation metric for XAI post-hoc explainers. To measure the MOS, a user experiment is proposed, which has been conducted with explanation maps of intentionally distorted images. Three methods from the family of feature attribution methods - Gradient-weighted Class Activation Mapping (Gra… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: Supported by organization Laboratoire Bordelais de Recherche en Informatique, 15 pages, 4 figures, 3 tables

    ACM Class: I.4.7

  5. arXiv:2407.18766  [pdf, other

    cs.IT eess.SP

    Secrecy Performance Analysis of Integrated RF-UWOC IoT Networks Enabled by UAV and Underwater-RIS

    Authors: Abrar Bin Sarawar, A. S. M. Badrudduza, Md. Ibrahim, Imran Shafique Ansari, Heejung Yu

    Abstract: In the sixth-generation (6G) Internet of Things (IoT) networks, the use of UAV-mounted base stations and reconfigurable intelligent surfaces (RIS) has been considered to enhance coverage, flexibility, and security in non-terrestrial networks (NTNs). In addition to aerial networks enabled by NTN technologies, the integration of underwater networks with 6G IoT can be considered one of the most innov… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

  6. arXiv:2407.18323  [pdf, other

    cs.NI eess.SP

    Active Reconfigurable Intelligent Surface-Aided Terahertz Wireless Communications

    Authors: Waqas Khalid, Heejung Yu, Yazdan Ahmad Qadri

    Abstract: Terahertz (THz) communication is expected to be a key technology for future sixth-generation (6G) wireless networks. Furthermore, reconfigurable intelligent surfaces (RIS) have been proposed to modify the wireless propagation environment and enhance system performance. Given the sensitivity to blockages and limited coverage range, RIS is particularly promising for THz communications. Active RIS ca… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Submitted in KICS Summer Conference 2024, (19 June 2024 - 22 June 2024), Jeju, Korea

  7. arXiv:2407.16634  [pdf, other

    eess.IV cs.AI cs.CV cs.HC

    Knowledge-driven AI-generated data for accurate and interpretable breast ultrasound diagnoses

    Authors: Haojun Yu, Youcheng Li, Nan Zhang, Zihan Niu, Xuantong Gong, Yanwen Luo, Quanlin Wu, Wangyan Qin, Mengyuan Zhou, Jie Han, Jia Tao, Ziwei Zhao, Di Dai, Di He, Dong Wang, Binghui Tang, Ling Huo, Qingli Zhu, Yong Wang, Liwei Wang

    Abstract: Data-driven deep learning models have shown great capabilities to assist radiologists in breast ultrasound (US) diagnoses. However, their effectiveness is limited by the long-tail distribution of training data, which leads to inaccuracies in rare cases. In this study, we address a long-standing challenge of improving the diagnostic model performance on rare cases using long-tailed data. Specifical… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  8. arXiv:2407.08481  [pdf, other

    eess.IV cs.CV

    SliceMamba with Neural Architecture Search for Medical Image Segmentation

    Authors: Chao Fan, Hongyuan Yu, Yan Huang, Liang Wang, Zhenghan Yang, Xibin Jia

    Abstract: Despite the progress made in Mamba-based medical image segmentation models, existing methods utilizing unidirectional or multi-directional feature scanning mechanisms struggle to effectively capture dependencies between neighboring positions, limiting the discriminant representation learning of local features. These local features are crucial for medical image segmentation as they provide critical… ▽ More

    Submitted 19 August, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  9. arXiv:2407.07347  [pdf, other

    cs.CV eess.IV

    MNeRV: A Multilayer Neural Representation for Videos

    Authors: Qingling Chang, Haohui Yu, Shuxuan Fu, Zhiqiang Zeng, Chuangquan Chen

    Abstract: As a novel video representation method, Neural Representations for Videos (NeRV) has shown great potential in the fields of video compression, video restoration, and video interpolation. In the process of representing videos using NeRV, each frame corresponds to an embedding, which is then reconstructed into a video frame sequence after passing through a small number of decoding layers (E-NeRV, HN… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 14 pages, 12 figures, 8 table

  10. arXiv:2407.06772  [pdf, other

    cs.IT eess.SP

    Revealing the evanescent components in Kronecker-product based codebooks: insights and implications

    Authors: Jun Yang, Yijian Chen, Yunqi Sun, Yuan Si, Hongkang Yu, Shujuan Zhang, Zhaohua Lu

    Abstract: The orthogonal bases of discrete Fourier transform (DFT) has been recognized as the standard spatial-domain bases for Type I, Type II and enhanced Type II codewords by the 3rd Generation Partnership Project (3GPP). For uniform planar arrays, these spatial-domain bases are derived as the Kronecker product of one-dimensional DFT bases. Theoretically, each spatial basis corresponds to a beam directed… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 11 pages, 9 figures

  11. arXiv:2407.05643  [pdf, other

    cs.IT eess.SP

    Spatial Non-Stationary Dual-Wideband Channel Estimation for XL-MIMO Systems

    Authors: Anzheng Tang, Jun-Bo Wang, Yijin Pan, Tuo Wu, Chuanwen Chang, Yijian Chen, Hongkang Yu, Maged Elkashlan

    Abstract: In this paper, we investigate the channel estimation problem for extremely large-scale multi-input and multi-output (XL-MIMO) systems, considering the spherical wavefront effect, spatially non-stationary (SnS) property, and dual-wideband effects. To accurately characterize the XL-MIMO channel, we first derive a novel spatial-and-frequency-domain channel model for XL-MIMO systems and carefully exam… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: This paper has been submitted to IEEE journal for possible publication

  12. arXiv:2407.05571  [pdf, other

    cs.NI eess.SP

    Cost-Efficient Computation Offloading in SAGIN: A Deep Reinforcement Learning and Perception-Aided Approach

    Authors: Yulan Gao, Ziqiang Ye, Han Yu

    Abstract: The Space-Air-Ground Integrated Network (SAGIN), crucial to the advancement of sixth-generation (6G) technology, plays a key role in ensuring universal connectivity, particularly by addressing the communication needs of remote areas lacking cellular network infrastructure. This paper delves into the role of unmanned aerial vehicles (UAVs) within SAGIN, where they act as a control layer owing to th… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  13. arXiv:2407.02052  [pdf, other

    eess.AS cs.SD

    The USTC-NERCSLIP Systems for The ICMC-ASR Challenge

    Authors: Minghui Wu, Luzhen Xu, Jie Zhang, Haitao Tang, Yanyan Yue, Ruizhi Liao, Jintao Zhao, Zhengzhe Zhang, Yichi Wang, Haoyin Yan, Hongliang Yu, Tongle Ma, Jiachen Liu, Chongliang Wu, Yongchao Li, Yanyong Zhang, Xin Fang, Yue Zhang

    Abstract: This report describes the submitted system to the In-Car Multi-Channel Automatic Speech Recognition (ICMC-ASR) challenge, which considers the ASR task with multi-speaker overlapping and Mandarin accent dynamics in the ICMC case. We implement the front-end speaker diarization using the self-supervised learning representation based multi-speaker embedding and beamforming using the speaker position,… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted at ICASSP 2024

  14. arXiv:2407.01336  [pdf, other

    cs.IT eess.SP

    Compressed Sensing Inspired User Acquisition for Downlink Integrated Sensing and Communication Transmissions

    Authors: Yi Song, Fernando Pedraza, Shuangyang Li, Siyao Li, Han Yu, Giuseppe Caire

    Abstract: This paper investigates radar-assisted user acquisition for downlink multi-user multiple-input multiple-output (MIMO) transmission using Orthogonal Frequency Division Multiplexing (OFDM) signals. Specifically, we formulate a concise mathematical model for the user acquisition problem, where each user is characterized by its delay and beamspace response. Therefore, we propose a two-stage method for… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  15. arXiv:2406.11508  [pdf, other

    eess.SY

    Leveraging Cooperative Connected Automated Vehicles for Mixed Traffic Safety

    Authors: Chenguang Zhao, Tamas G. Molnar, Huan Yu

    Abstract: The introduction of connected and automated vehicles (CAV) is believed to reduce congestion, enhance safety, and improve traffic efficiency. Numerous research studies have focused on controlling pure CAV platoons in fully connected automated traffic, as well as single or multiple CAVs in mixed traffic with human-driven vehicles (HVs). CAV cruising control designs have been proposed to stabilize th… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  16. arXiv:2406.07103  [pdf, other

    eess.AS cs.AI

    MR-RawNet: Speaker verification system with multiple temporal resolutions for variable duration utterances using raw waveforms

    Authors: Seung-bin Kim, Chan-yeong Lim, Jungwoo Heo, Ju-ho Kim, Hyun-seo Shin, Kyo-Won Koo, Ha-Jin Yu

    Abstract: In speaker verification systems, the utilization of short utterances presents a persistent challenge, leading to performance degradation primarily due to insufficient phonetic information to characterize the speakers. To overcome this obstacle, we propose a novel structure, MR-RawNet, designed to enhance the robustness of speaker verification systems against variable duration utterances using raw… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 5 pages, accepted by Interspeech 2024

  17. arXiv:2406.01235  [pdf, other

    eess.IV

    Boosting Spatial-Spectral Masked Auto-Encoder Through Mining Redundant Spectra for HSI-SAR/LiDAR Classification

    Authors: Junyan Lin, Xuepeng Jin, Feng Gao, Junyu Dong, Hui Yu

    Abstract: Although recent masked image modeling (MIM)-based HSI-LiDAR/SAR classification methods have gradually recognized the importance of the spectral information, they have not adequately addressed the redundancy among different spectra, resulting in information leakage during the pretraining stage. This issue directly impairs the representation ability of the model. To tackle the problem, we propose a… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted by IGARSS 2024

  18. arXiv:2406.00399  [pdf, other

    eess.SP

    Patterned Beam Training: A Novel Low-Complexity and Low-Overhead Scheme for ELAA

    Authors: Hongkang Yu, Yuan Si, Shujuan Zhang, Yijian Chen

    Abstract: Extremely large antenna arrays (ELAAs) can provide higher spectral efficiency. However, the use of narrower beams for data transmission significantly increases the overhead associated with beam training. In this letter, we propose a novel patterned beam training (PBT) scheme characterized by its low overhead and complexity. This scheme requires only a single linear operation by both the base stati… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  19. arXiv:2405.19366  [pdf, other

    eess.SP cs.AI

    ECG Semantic Integrator (ESI): A Foundation ECG Model Pretrained with LLM-Enhanced Cardiological Text

    Authors: Han Yu, Peikun Guo, Akane Sano

    Abstract: The utilization of deep learning on electrocardiogram (ECG) analysis has brought the advanced accuracy and efficiency of cardiac healthcare diagnostics. By leveraging the capabilities of deep learning in semantic understanding, especially in feature extraction and representation learning, this study introduces a new multimodal contrastive pretaining framework that aims to improve the quality and r… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  20. arXiv:2405.19079  [pdf, other

    eess.IV cs.CV

    On the Influence of Smoothness Constraints in Computed Tomography Motion Compensation

    Authors: Mareike Thies, Fabian Wagner, Noah Maul, Siyuan Mei, Mingxuan Gu, Laura Pfaff, Nastassia Vysotskaya, Haijun Yu, Andreas Maier

    Abstract: Computed tomography (CT) relies on precise patient immobilization during image acquisition. Nevertheless, motion artifacts in the reconstructed images can persist. Motion compensation methods aim to correct such artifacts post-acquisition, often incorporating temporal smoothness constraints on the estimated motion patterns. This study analyzes the influence of a spline-based motion model within an… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  21. arXiv:2405.14770  [pdf, other

    eess.IV

    Physics-informed Score-based Diffusion Model for Limited-angle Reconstruction of Cardiac Computed Tomography

    Authors: Shuo Han, Yongshun Xu, Dayang Wang, Bahareh Morovati, Li Zhou, Jonathan S. Maltz, Ge Wang, Hengyong Yu

    Abstract: Cardiac computed tomography (CT) has emerged as a major imaging modality for the diagnosis and monitoring of cardiovascular diseases. High temporal resolution is essential to ensure diagnostic accuracy. Limited-angle data acquisition can reduce scan time and improve temporal resolution, but typically leads to severe image degradation and motivates for improved reconstruction techniques. In this pa… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 12 pages

  22. arXiv:2405.13339  [pdf, other

    eess.SP

    Floor-Plan-aided Indoor Localization: Zero-Shot Learning Framework, Data Sets, and Prototype

    Authors: Haiyao Yu, Changyang She, Yunkai Hu, Geng Wang, Rui Wang, Branka Vucetic, Yonghui Li

    Abstract: Machine learning has been considered a promising approach for indoor localization. Nevertheless, the sample efficiency, scalability, and generalization ability remain open issues of implementing learning-based algorithms in practical systems. In this paper, we establish a zero-shot learning framework that does not need real-world measurements in a new communication environment. Specifically, a gra… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  23. arXiv:2405.04867  [pdf, other

    eess.IV cs.CV

    MIPI 2024 Challenge on Demosaic for HybridEVS Camera: Methods and Results

    Authors: Yaqi Wu, Zhihao Fan, Xiaofeng Chu, Jimmy S. Ren, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangcheng Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Senyan Xu, Zhijing Sun, Jiaying Zhu, Yurui Zhu, Xueyang Fu, Zheng-Jun Zha, Jun Cao, Cheng Li, Shu Chen, Liang Ma, Shiyang Zhou, Haijin Zeng, Kai Feng , et al. (24 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: MIPI@CVPR2024. Website: https://mipi-challenge.org/MIPI2024/

  24. arXiv:2405.04629  [pdf, other

    eess.IV cs.AI physics.med-ph

    ResNCT: A Deep Learning Model for the Synthesis of Nephrographic Phase Images in CT Urography

    Authors: Syed Jamal Safdar Gardezi, Lucas Aronson, Peter Wawrzyn, Hongkun Yu, E. Jason Abel, Daniel D. Shapiro, Meghan G. Lubner, Joshua Warner, Giuseppe Toia, Lu Mao, Pallavi Tiwari, Andrew L. Wentland

    Abstract: Purpose: To develop and evaluate a transformer-based deep learning model for the synthesis of nephrographic phase images in CT urography (CTU) examinations from the unenhanced and urographic phases. Materials and Methods: This retrospective study was approved by the local Institutional Review Board. A dataset of 119 patients (mean $\pm$ SD age, 65 $\pm$ 12 years; 75/44 males/females) with three-… ▽ More

    Submitted 28 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

    Comments: 10 pages, 5 Figures,2 Tables

    MSC Class: eess.IV ACM Class: J.3

  25. arXiv:2404.18046  [pdf, ps, other

    eess.SP

    Fast Beam Training and Performance Analysis for Extremely Large Aperture Array

    Authors: Yuan Si, Hongkang Yu, Yijian Chen

    Abstract: Extremely large aperture array (ELAA) can significantly enhance beamforming gain and spectral efficiency. Unfortunately, the use of narrower beams for data transmission results in a substantial increase in the cost of beam training. In this paper, we study a high-efficiency and low-overhead scheme named hash beam training. Specifically, two improved hash codebook design methods, random and fixed,… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

  26. arXiv:2404.17490  [pdf, other

    eess.AS cs.SD eess.SP

    The CARFAC v2 Cochlear Model in Matlab, NumPy, and JAX

    Authors: Richard F. Lyon, Rob Schonberger, Malcolm Slaney, Mihajlo Velimirović, Honglin Yu

    Abstract: The open-source CARFAC (Cascade of Asymmetric Resonators with Fast-Acting Compression) cochlear model is upgraded to version 2, with improvements to the Matlab implementation, and with new Python/NumPy and JAX implementations -- but C++ version changes are still pending. One change addresses the DC (direct current, or zero frequency) quadratic distortion anomaly previously reported; another reduce… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  27. arXiv:2404.16484  [pdf, other

    cs.CV eess.IV

    Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey

    Authors: Marcos V. Conde, Zhijun Lei, Wen Li, Cosmin Stejerean, Ioannis Katsavounidis, Radu Timofte, Kihwan Yoon, Ganzorig Gankhuyag, Jiangtao Lv, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Zhiyuan Li, Hao Wei, Chenyang Ge, Dongyang Zhang, Tianle Liu, Huaian Chen, Yi Jin, Menghan Zhou, Yiqiang Yan, Si Gao, Biao Wu, Shaoli Liu , et al. (50 additional authors not shown)

    Abstract: This paper introduces a novel benchmark as part of the AIS 2024 Real-Time Image Super-Resolution (RTSR) Challenge, which aims to upscale compressed images from 540p to 4K resolution (4x factor) in real-time on commercial GPUs. For this, we use a diverse test set containing a variety of 4K images ranging from digital art to gaming and photography. The images are compressed using the modern AVIF cod… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: CVPR 2024, AI for Streaming (AIS) Workshop

  28. arXiv:2404.16223  [pdf, other

    cs.CV eess.IV

    Deep RAW Image Super-Resolution. A NTIRE 2024 Challenge Survey

    Authors: Marcos V. Conde, Florin-Alexandru Vasluianu, Radu Timofte, Jianxing Zhang, Jia Li, Fan Wang, Xiaopeng Li, Zikun Liu, Hyunhee Park, Sejun Song, Changho Kim, Zhijuan Huang, Hongyuan Yu, Cheng Wan, Wending Xiang, Jiamin Lin, Hang Zhong, Qiaosong Zhang, Yue Sun, Xuanwu Yin, Kunlong Zuo, Senyan Xu, Siyuan Jiang, Zhijing Sun, Jiaying Zhu , et al. (10 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 RAW Image Super-Resolution Challenge, highlighting the proposed solutions and results. New methods for RAW Super-Resolution could be essential in modern Image Signal Processing (ISP) pipelines, however, this problem is not as explored as in the RGB domain. Th goal of this challenge is to upscale RAW Bayer images by 2x, considering unknown degradations such as nois… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 - NTIRE Workshop

  29. arXiv:2404.10343  [pdf, other

    cs.CV eess.IV

    The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

    Authors: Bin Ren, Yawei Li, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang , et al. (109 additional authors not shown)

    Abstract: This paper provides a comprehensive review of the NTIRE 2024 challenge, focusing on efficient single-image super-resolution (ESR) solutions and their outcomes. The task of this challenge is to super-resolve an input image with a magnification factor of x4 based on pairs of low and corresponding high-resolution images. The primary objective is to develop networks that optimize various aspects such… ▽ More

    Submitted 25 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: The report paper of NTIRE2024 Efficient Super-resolution, accepted by CVPRW2024

  30. arXiv:2403.19785  [pdf, other

    cs.IT eess.SP

    Integrated Communication, Localization, and Sensing in 6G D-MIMO Networks

    Authors: Hao Guo, Henk Wymeersch, Behrooz Makki, Hui Chen, Yibo Wu, Giuseppe Durisi, Musa Furkan Keskin, Mohammad H. Moghaddam, Charitha Madapatha, Han Yu, Peter Hammarberg, Hyowon Kim, Tommy Svensson

    Abstract: Future generations of mobile networks call for concurrent sensing and communication functionalities in the same hardware and/or spectrum. Compared to communication, sensing services often suffer from limited coverage, due to the high path loss of the reflected signal and the increased infrastructure requirements. To provide a more uniform quality of service, distributed multiple input multiple out… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  31. arXiv:2403.16473  [pdf, other

    cs.CR eess.IV

    Plaintext-Free Deep Learning for Privacy-Preserving Medical Image Analysis via Frequency Information Embedding

    Authors: Mengyu Sun, Ziyuan Yang, Maosong Ran, Zhiwen Wang, Hui Yu, Yi Zhang

    Abstract: In the fast-evolving field of medical image analysis, Deep Learning (DL)-based methods have achieved tremendous success. However, these methods require plaintext data for training and inference stages, raising privacy concerns, especially in the sensitive area of medical data. To tackle these concerns, this paper proposes a novel framework that uses surrogate images for analysis, eliminating the n… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  32. arXiv:2403.14194  [pdf, other

    eess.SY math.AP

    Event-triggered Boundary Control of Mixed-autonomy Traffic

    Authors: Yihuai Zhang, Huan Yu

    Abstract: Control problems of mixed-autonomy traffic system consisting of both Human-driven Vehicles (HV) and Autonomous Vehicles (AV) have gained increasing attention. This paper is focused on suppressing traffic oscillations of the mixed-autonomy traffic system using boundary control design. The mixed traffic dynamics are described by a 4 x 4 hyperbolic partial differential equations (PDE) which governs p… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  33. arXiv:2403.10797  [pdf

    math.OC eess.SY

    Frequency-Reactive Power Optimization Strategy of Grid-forming Offshore Wind Farm Using DRU-HVDC Transmission

    Authors: Zhekai Li, Kun Han, Xu Cai, Renxin Yang, Haotian Yu, Kepeng Xia, Lulu Liu

    Abstract: The diode rectifier unit-based high voltage direct current (DRU-HVDC) transmission with grid-forming (GFM) wind turbine is becoming a promising scheme for offshore wind farm(OWF) integration due to its high reliability and low cost. In this scheme, the AC network of the OWF and the DRU has completely different synchronization mechanisms and power flow characteristics from the traditional power sys… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 6 pages, 7 figures, to be published in the 7th IEEE Conference on Energy Internet and Energy System Integration (EI2 2023)

  34. arXiv:2403.09153  [pdf, other

    eess.SY

    Fairness-Aware Multi-Server Federated Learning Task Delegation over Wireless Networks

    Authors: Yulan Gao, Chao Ren, Han Yu

    Abstract: In the rapidly advancing field of federated learning (FL), ensuring efficient FL task delegation while incentivising FL client participation poses significant challenges, especially in wireless networks where FL participants' coverage is limited. Existing Contract Theory-based methods are designed under the assumption that there is only one FL server in the system (i.e., the monopoly market assump… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  35. A Novel Implicit Neural Representation for Volume Data

    Authors: Armin Sheibanifard, Hongchuan Yu

    Abstract: The storage of medical images is one of the challenges in the medical imaging field. There are variable works that use implicit neural representation (INR) to compress volumetric medical images. However, there is room to improve the compression rate for volumetric medical images. Most of the INR techniques need a huge amount of GPU memory and a long training time for high-quality medical volume re… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Journal ref: Appl. Sci. 2023, 13, 3242

  36. arXiv:2403.03642  [pdf, other

    eess.IV cs.CV cs.LG

    Generative Active Learning with Variational Autoencoder for Radiology Data Generation in Veterinary Medicine

    Authors: In-Gyu Lee, Jun-Young Oh, Hee-Jung Yu, Jae-Hwan Kim, Ki-Dong Eom, Ji-Hoon Jeong

    Abstract: Recently, with increasing interest in pet healthcare, the demand for computer-aided diagnosis (CAD) systems in veterinary medicine has increased. The development of veterinary CAD has stagnated due to a lack of sufficient radiology data. To overcome the challenge, we propose a generative active learning framework based on a variational autoencoder. This approach aims to alleviate the scarcity of r… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  37. arXiv:2403.02633  [pdf, other

    cs.IT eess.SP

    Spatially Non-Stationary XL-MIMO Channel Estimation: A Three-Layer Generalized Approximate Message Passing Method

    Authors: Anzheng Tang, Jun-Bo Wang, Yijin Pan, Wence Zhang, Xiaodan Zhang, Yijian Chen, Hongkang Yu, Rodrigo C. de Lamare

    Abstract: In this paper, channel estimation problem for extremely large-scale multi-input multi-output (XL-MIMO) systems is investigated with the considerations of the spherical wavefront effect and the spatially non-stationary (SnS) property. Due to the diversities of SnS characteristics among different propagation paths, the concurrent channel estimation of multiple paths becomes intractable. To address t… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: This manuscript has been submitted to the IEEE journal for possible pubilcation

  38. arXiv:2403.00665  [pdf, other

    cs.IT eess.SP

    Complex-Valued Neural Network based Federated Learning for Multi-user Indoor Positioning Performance Optimization

    Authors: Hanzhi Yu, Yuchen Liu, Mingzhe Chen

    Abstract: In this article, the use of channel state information (CSI) for indoor positioning is studied. In the considered model, a server equipped with several antennas sends pilot signals to users, while each user uses the received pilot signals to estimate channel states for user positioning. To this end, we formulate the positioning problem as an optimization problem aiming to minimize the gap between t… ▽ More

    Submitted 19 March, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: 13 pages, 10 figures

  39. arXiv:2402.09433  [pdf, other

    eess.SP cs.AI cs.LG eess.SY

    Electrical Behavior Association Mining for Household ShortTerm Energy Consumption Forecasting

    Authors: Heyang Yu, Yuxi Sun, Yintao Liu, Guangchao Geng, Quanyuan Jiang

    Abstract: Accurate household short-term energy consumption forecasting (STECF) is crucial for home energy management, but it is technically challenging, due to highly random behaviors of individual residential users. To improve the accuracy of STECF on a day-ahead scale, this paper proposes an novel STECF methodology that leverages association mining in electrical behaviors. First, a probabilistic associati… ▽ More

    Submitted 25 January, 2024; originally announced February 2024.

    Comments: 3 figures and 4 tables; This manuscript is submitted for possible publication

  40. arXiv:2401.09283  [pdf, other

    eess.IV cs.CV

    A gradient-based approach to fast and accurate head motion compensation in cone-beam CT

    Authors: Mareike Thies, Fabian Wagner, Noah Maul, Haijun Yu, Manuela Meier, Linda-Sophie Schneider, Mingxuan Gu, Siyuan Mei, Lukas Folle, Andreas Maier

    Abstract: Cone-beam computed tomography (CBCT) systems, with their portability, present a promising avenue for direct point-of-care medical imaging, particularly in critical scenarios such as acute stroke assessment. However, the integration of CBCT into clinical workflows faces challenges, primarily linked to long scan duration resulting in patient motion during scanning and leading to image quality degrad… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  41. arXiv:2401.07120  [pdf, other

    cs.NI eess.SP quant-ph

    Generative AI-enabled Quantum Computing Networks and Intelligent Resource Allocation

    Authors: Minrui Xu, Dusit Niyato, Jiawen Kang, Zehui Xiong, Yuan Cao, Yulan Gao, Chao Ren, Han Yu

    Abstract: Quantum computing networks enable scalable collaboration and secure information exchange among multiple classical and quantum computing nodes while executing large-scale generative AI computation tasks and advanced quantum algorithms. Quantum computing networks overcome limitations such as the number of qubits and coherence time of entangled pairs and offer advantages for generative AI infrastruct… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

  42. arXiv:2401.03912  [pdf, other

    eess.IV cs.CV cs.LG

    Attention-Guided Erasing: A Novel Augmentation Method for Enhancing Downstream Breast Density Classification

    Authors: Adarsh Bhandary Panambur, Hui Yu, Sheethal Bhat, Prathmesh Madhu, Siming Bayer, Andreas Maier

    Abstract: The assessment of breast density is crucial in the context of breast cancer screening, especially in populations with a higher percentage of dense breast tissues. This study introduces a novel data augmentation technique termed Attention-Guided Erasing (AGE), devised to enhance the downstream classification of four distinct breast density categories in mammography following the BI-RADS recommendat… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  43. arXiv:2312.16636  [pdf, other

    math.OC eess.SY math.AP

    Robust Boundary Stabilization of Stochastic Hyperbolic PDEs

    Authors: Yihuai Zhang, Jean Auriol, Huan Yu

    Abstract: This paper proposes a backstepping boundary control design for robust stabilization of linear first-order coupled hyperbolic partial differential equations (PDEs) with Markov-jumping parameters. The PDE system consists of 4 X 4 coupled hyperbolic PDEs whose first three characteristic speeds are positive and the last one is negative. We first design a full-state feedback boundary control law for a… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2310.15547

  44. arXiv:2312.10374  [pdf, other

    cs.LG eess.SY

    Neural Operators for Boundary Stabilization of Stop-and-go Traffic

    Authors: Yihuai Zhang, Ruiguo Zhong, Huan Yu

    Abstract: This paper introduces a novel approach to PDE boundary control design using neural operators to alleviate stop-and-go instabilities in congested traffic flow. Our framework leverages neural operators to design control strategies for traffic flow systems. The traffic dynamics are described by the Aw-Rascle-Zhang (ARZ) model, which comprises a set of second-order coupled hyperbolic partial different… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

  45. arXiv:2312.10112  [pdf, other

    cs.CV cs.LG eess.IV

    NM-FlowGAN: Modeling sRGB Noise with a Hybrid Approach based on Normalizing Flows and Generative Adversarial Networks

    Authors: Young Joo Han, Ha-Jin Yu

    Abstract: Modeling and synthesizing real sRGB noise is crucial for various low-level vision tasks, such as building datasets for training image denoising systems. The distribution of real sRGB noise is highly complex and affected by a multitude of factors, making its accurate modeling extremely challenging. Therefore, recent studies have proposed methods that employ data-driven generative models, such as ge… ▽ More

    Submitted 14 March, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: 25 pages, 11 figures, 7 tables

    MSC Class: 68T45 ACM Class: I.4.4

  46. arXiv:2312.02770  [pdf, other

    cs.LG eess.SY

    Learning "Look-Ahead" Nonlocal Traffic Dynamics in a Ring Road

    Authors: Chenguang Zhao, Huan Yu

    Abstract: The macroscopic traffic flow model is widely used for traffic control and management. To incorporate drivers' anticipative behaviors and to remove impractical speed discontinuity inherent in the classic Lighthill-Whitham-Richards (LWR) traffic model, nonlocal partial differential equation (PDE) models with ``look-ahead" dynamics have been proposed, which assume that the speed is a function of weig… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  47. arXiv:2311.17251  [pdf, other

    eess.IV cs.CV

    SubZero: Subspace Zero-Shot MRI Reconstruction

    Authors: Heng Yu, Yamin Arefeen, Berkin Bilgic

    Abstract: Recently introduced zero-shot self-supervised learning (ZS-SSL) has shown potential in accelerated MRI in a scan-specific scenario, which enabled high-quality reconstructions without access to a large training dataset. ZS-SSL has been further combined with the subspace model to accelerate 2D T2-shuffling acquisitions. In this work, we propose a parallel network framework and introduce an attention… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: ISMRM 2023 Power Pitch

  48. arXiv:2311.15551  [pdf, other

    cs.CV cs.AI cs.CR cs.LG eess.IV

    Instruct2Attack: Language-Guided Semantic Adversarial Attacks

    Authors: Jiang Liu, Chen Wei, Yuxiang Guo, Heng Yu, Alan Yuille, Soheil Feizi, Chun Pong Lau, Rama Chellappa

    Abstract: We propose Instruct2Attack (I2A), a language-guided semantic attack that generates semantically meaningful perturbations according to free-form language instructions. We make use of state-of-the-art latent diffusion models, where we adversarially guide the reverse diffusion process to search for an adversarial latent code conditioned on the input image and text instruction. Compared to existing no… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: under submission, code coming soon

  49. arXiv:2311.15420  [pdf

    eess.SY cs.CV

    Data-Driven Modelling for Harmonic Current Emission in Low-Voltage Grid Using MCReSANet with Interpretability Analysis

    Authors: Jieyu Yao, Hao Yu, Paul Judge, Jiabin Jia, Sasa Djokic, Verner Püvi, Matti Lehtonen, Jan Meyer

    Abstract: Even though the use of power electronics PE loads offers enhanced electrical energy conversion efficiency and control, they remain the primary sources of harmonics in grids. When diverse loads are connected in the distribution system, their interactions complicate establishing analytical models for the relationship between harmonic voltages and currents. To solve this, our paper presents a data-dr… ▽ More

    Submitted 19 January, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

  50. arXiv:2311.12770  [pdf, other

    eess.IV cs.CV

    Swift Parameter-free Attention Network for Efficient Super-Resolution

    Authors: Cheng Wan, Hongyuan Yu, Zhiqi Li, Yihang Chen, Yajun Zou, Yuqing Liu, Xuanwu Yin, Kunlong Zuo

    Abstract: Single Image Super-Resolution (SISR) is a crucial task in low-level computer vision, aiming to reconstruct high-resolution images from low-resolution counterparts. Conventional attention mechanisms have significantly improved SISR performance but often result in complex network structures and large number of parameters, leading to slow inference speed and large model size. To address this issue, w… ▽ More

    Submitted 12 May, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: NTIRE2024 ESR winner