Two-Stream Gated Fusion ConvNets for Action Recognition

Jiagang Zhu; Wei Zou; Zheng Zhu

doi:10.1109/ICPR.2018.8545639

2018 24th International Conference on Pattern Recognition (ICPR)

Two-Stream Gated Fusion ConvNets for Action Recognition

Year: 2018, Pages: 597-602

DOI Bookmark: 10.1109/ICPR.2018.8545639

Authors

Jiagang Zhu, Institute of Automation, Chinese Academy of Sciences, Beijing, China
Wei Zou, Institute of Automation, Chinese Academy of Sciences, Beijing, China
Zheng Zhu, Institute of Automation, Chinese Academy of Sciences, Beijing, China

Abstract

The two-stream ConvNets in action recognition always fuse the two streams' predictions by the weighted averaging scheme. This fusion way with fixed weights lacks of pertinence to different action videos and always needs trial and error on the validation set. In order to enhance the adaptability of two-stream ConvNets, an end-to-end trainable gated fusion method, namely gating ConvNet, is proposed in this paper based on the MoE (Mixture of Experts) theory. The gating ConvNet takes the combination of convolutional layers of the spatial and temporal nets as input and outputs two fusion weights. To reduce the over-fitting of gating ConvNet caused by the redundancy of parameters, a new multi-task learning method is designed, which jointly learns the gating fusion weights for the two streams and learns the gating ConvNet for action classification. With the proposed gated fusion method and multi-task learning approach, competitive performance is achieved on the video action dataset UCF101.

Like what you’re reading?

Already a member?Sign In

Member Price

$11

Non-Member Price

$21

Add to Cart Sign In

Get this article FREE with a new membership!

Semi-Coupled Two-Stream Fusion ConvNets for Action Recognition at Extremely Low Resolutions
2017 IEEE Winter Conference on Applications of Computer Vision (WACV)
Convolutional Two-Stream Network Fusion for Video Action Recognition
2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Weighted Multi-Region Convolutional Neural Network for Action Recognition With Low-Latency Online Prediction
2018 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)
Region and Temporal Dependency Fusion for Multi-label Action Unit Detection
2018 24th International Conference on Pattern Recognition (ICPR)
Leaky Gated Cross-Attention for Weakly Supervised Multi-Modal Temporal Action Localization
2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)
Spatial-temporal Concept based Explanation of 3D ConvNets
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Controllable Video Captioning With POS Sequence Guidance Based on Gated Fusion Network
2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Bayesian 3D ConvNets for Action Recognition from Few Examples
2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)
Pose Guided Gated Fusion for Person Re-identification
2020 IEEE Winter Conference on Applications of Computer Vision (WACV)
NEF-GGCN: Node-Edge Fusion Gated Graph Convolutional Networks For Skeleton-based Medical Action Recognition
2024 IEEE 9th International Conference on Data Science in Cyberspace (DSC)

Two-Stream Gated Fusion ConvNets for Action Recognition

Authors

Abstract

Related Articles