GitHub - DocF_multispectral-object-detection_ Multispectral Object Detection with Yolov5 and Transformer
AGPL-3.0 license
1 of 7 10/28/24, 11:18
GitHub - DocF/multispectral-object-detection: Multisp... https://github.com/DocF/multispectral-object-detection
Multispectral-Object-Detection
Intro
Official Code for Cross-Modality Fusion Transformer for Multispectral Object Detection.
Abstract
Multispectral image pairs can provide combined information, making object detection
applications more reliable and robust in the open world. To fully exploit the different
modalities, we present a simple yet effective cross-modality feature fusion approach,
named Cross-Modality Fusion Transformer (CFT), in this paper. Unlike prior CNN-based
works, our network is guided by the Transformer scheme: it learns long-range dependencies
and integrates global contextual information in the feature extraction stage. More
importantly, by leveraging the self-attention of the Transformer, the network can naturally
carry out simultaneous intra-modality and inter-modality fusion and robustly capture the
latent interactions between the RGB and thermal domains, thereby significantly improving
the performance of multispectral object detection. Extensive experiments and ablation
studies on multiple datasets demonstrate that our approach is effective and achieves
state-of-the-art detection performance.
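To make the idea of simultaneous intra-modality and inter-modality fusion concrete, here is a minimal pure-Python sketch. It is not the repository's implementation. It assumes the RGB and thermal tokens are concatenated into one sequence before a single self-attention step, and it uses identity query/key/value projections instead of learned weights. The names `self_attention` and `fuse` and the toy token values are hypothetical.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Single-head scaled dot-product self-attention.

    Assumption: identity Q/K/V projections (no learned weights),
    so the mechanics stay visible.
    """
    d = len(tokens[0])
    scale = math.sqrt(d)
    weights = []
    for q in tokens:
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in tokens]
        weights.append(softmax(scores))
    outputs = [
        [sum(w * v[j] for w, v in zip(row, tokens)) for j in range(d)]
        for row in weights
    ]
    return outputs, weights


def fuse(rgb_tokens, thermal_tokens):
    """Concatenate both modalities, attend once, then split again."""
    tokens = rgb_tokens + thermal_tokens
    fused, weights = self_attention(tokens)
    n = len(rgb_tokens)
    return fused[:n], fused[n:], weights


# Toy example: two tokens per modality, feature dimension 2.
rgb = [[1.0, 0.0], [0.0, 1.0]]
thermal = [[1.0, 1.0], [0.5, 0.0]]
rgb_out, thermal_out, attn = fuse(rgb, thermal)
```

With n tokens per modality, `attn` is a 2n x 2n matrix. Its two diagonal n x n blocks hold the RGB-to-RGB and thermal-to-thermal weights (intra-modality), and its off-diagonal blocks hold the RGB-to-thermal and thermal-to-RGB weights (inter-modality), so one attention step covers both kinds of fusion.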
Demo
Night Scene
Day Scene
Overview
Citation
If you use this repo for your research, please cite our paper:
@article{qingyun2022cross,
title={Cross-modality attentive feature fusion for object
detection in multispectral remote sensing imagery},
author={Qingyun, Fang and Zhaokui, Wang},
journal={Pattern Recognition},
volume={130},
pages={108786},
year={2022},
publisher={Elsevier}
}
@article{fang2021cross,
title={Cross-Modality Fusion Transformer for Multispectral Object
Detection},
author={Fang Qingyun and Han Dapeng and Wang Zhaokui},
journal={arXiv preprint arXiv:2111.00273},
year={2021}
}
Installation
Python>=3.6.0 is required, with all dependencies in requirements.txt installed, including
PyTorch>=1.7 (the same as yolov5: https://github.com/ultralytics/yolov5).
Install requirements
$ cd multispectral-object-detection
$ pip install -r requirements.txt
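After installation, a quick check can confirm that the environment meets the stated minimums. The script below is only a sketch: the helpers `version_tuple` and `meets_minimum` are hypothetical and not part of this repository, and the comparison looks only at the numeric parts of the version string.

```python
import re
import sys


def version_tuple(version):
    """Numeric components of a version string, e.g. '1.7.1+cu110' -> (1, 7, 1)."""
    return tuple(int(p) for p in re.findall(r"\d+", version.split("+")[0]))


def meets_minimum(version, minimum):
    """True if `version` is at least `minimum` (numeric comparison only)."""
    return version_tuple(version) >= version_tuple(minimum)


print("Python >= 3.6.0:", sys.version_info >= (3, 6, 0))

try:
    import torch  # only available once requirements.txt is installed
    print("PyTorch >= 1.7:", meets_minimum(torch.__version__, "1.7"))
except ImportError:
    print("PyTorch is not installed")
```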
Dataset
- FLIR: [Google Drive] [Baidu Drive] (extraction code: qwer)
- LLVIP: [download]
- VEDAI: [download]
Refer to: https://github.com/ultralytics/yolov5/wiki/Train-Custom-Data
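The YOLOv5 guide referenced above expects one label line per object in the form `class x_center y_center width height`, with coordinates normalized to the range 0 to 1. As a hedged illustration of that convention (not a script shipped with this repository), the sketch below converts a pixel-space box to such a line. The function names `xyxy_to_yolo` and `yolo_label_line` and the example image size are hypothetical.

```python
def xyxy_to_yolo(box, img_w, img_h):
    """Convert (x_min, y_min, x_max, y_max) in pixels to normalized
    (x_center, y_center, width, height)."""
    x_min, y_min, x_max, y_max = box
    x_c = (x_min + x_max) / 2.0 / img_w
    y_c = (y_min + y_max) / 2.0 / img_h
    w = (x_max - x_min) / float(img_w)
    h = (y_max - y_min) / float(img_h)
    return x_c, y_c, w, h


def yolo_label_line(class_id, box, img_w, img_h):
    """Format one object as a YOLOv5-style label line."""
    x_c, y_c, w, h = xyxy_to_yolo(box, img_w, img_h)
    return f"{class_id} {x_c:.6f} {y_c:.6f} {w:.6f} {h:.6f}"


# Hypothetical example: a centered box covering half of a 640x480 image.
line = yolo_label_line(0, (160, 120, 480, 360), 640, 480)
print(line)
```

How the RGB and thermal label files of an image pair are organized on disk is not stated here, so any directory layout should be taken from the repository's data configuration rather than from this sketch.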
Run
CFT weights
Results
LLVIP
Model             Log Average Miss Rate
YOLOv3-RGB        37.70%
YOLOv3-IR         17.73%
YOLOv5-RGB        22.59%
YOLOv5-IR         10.66%
Baseline(Ours)    6.91%
CFT(Ours)         5.40%
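Lower values are better for this metric. The exact evaluation protocol is not described here, so the following is only a sketch of the convention commonly used in pedestrian-detection benchmarks: the miss rate is sampled at nine false-positives-per-image (FPPI) reference points evenly spaced in log space between 10^-2 and 10^0, and the geometric mean of those samples is reported. The function name `log_average_miss_rate` and the toy curve are hypothetical, and the numbers are not taken from the table above.

```python
import math


def log_average_miss_rate(curve, num_points=9):
    """Geometric mean of miss rates sampled at log-spaced FPPI references.

    `curve` is a list of (fppi, miss_rate) pairs. For each reference point
    the miss rate of the last curve point with fppi <= reference is used;
    if no such point exists, the miss rate is taken as 1.0.
    """
    refs = [10 ** (-2.0 + 2.0 * i / (num_points - 1)) for i in range(num_points)]
    points = sorted(curve)
    log_sum = 0.0
    for ref in refs:
        mr = 1.0
        for fppi, miss in points:
            if fppi <= ref:
                mr = miss
            else:
                break
        # Clamp to avoid log(0) for a perfect detector.
        log_sum += math.log(max(mr, 1e-10))
    return math.exp(log_sum / num_points)


# Toy miss-rate curve (made-up values for illustration only).
toy_curve = [(0.005, 0.30), (0.05, 0.12), (0.5, 0.06)]
lamr = log_average_miss_rate(toy_curve)
print(f"log-average miss rate: {lamr:.2%}")
```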
References
https://github.com/ultralytics/yolov5