PR Product: A Substitute for Inner Product in Neural Networks

Wang, Zhennan; Zou, Wenbin; Xu, Chen

Computer Science > Computer Vision and Pattern Recognition

arXiv:1904.13148 (cs)

[Submitted on 30 Apr 2019 (v1), last revised 16 Aug 2019 (this version, v2)]

Title:PR Product: A Substitute for Inner Product in Neural Networks

Authors:Zhennan Wang, Wenbin Zou, Chen Xu

View PDF

Abstract:In this paper, we analyze the inner product of weight vector w and data vector x in neural networks from the perspective of vector orthogonal decomposition and prove that the direction gradient of w decreases with the angle between them close to 0 or {\pi}. We propose the Projection and Rejection Product (PR Product) to make the direction gradient of w independent of the angle and consistently larger than the one in standard inner product while keeping the forward propagation identical. As a reliable substitute for standard inner product, the PR Product can be applied into many existing deep learning modules, so we develop the PR Product version of fully connected layer, convolutional layer and LSTM layer. In static image classification, the experiments on CIFAR10 and CIFAR100 datasets demonstrate that the PR Product can robustly enhance the ability of various state-of-the-art classification networks. On the task of image captioning, even without any bells and whistles, our PR Product version of captioning model can compete or outperform the state-of-the-art models on MS COCO dataset. Code has been made available at:this https URL.

Comments:	ICCV2019 oral
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1904.13148 [cs.CV]
	(or arXiv:1904.13148v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1904.13148
Journal reference:	Proceedings of the IEEE International Conference on Computer Vision. 2019: 6013-6022

Submission history

From: Zhennan Wang [view email]
[v1] Tue, 30 Apr 2019 10:43:38 UTC (168 KB)
[v2] Fri, 16 Aug 2019 11:37:06 UTC (1,819 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:PR Product: A Substitute for Inner Product in Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:PR Product: A Substitute for Inner Product in Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators