Difference Target Propagation

Lee, Dong-Hyun; Zhang, Saizheng; Fischer, Asja; Bengio, Yoshua

arXiv:1412.7525 (cs)

[Submitted on 23 Dec 2014 (v1), last revised 25 Nov 2015 (this version, v5)]

Abstract: Back-propagation has been the workhorse of recent successes of deep learning, but it relies on infinitesimal effects (partial derivatives) to perform credit assignment. This could become a serious issue as one considers deeper and more non-linear functions; consider, for example, the extreme case of non-linearity where the relation between parameters and cost is actually discrete. Inspired by the biological implausibility of back-propagation, a few approaches have been proposed in the past that could play a similar credit assignment role. In this spirit, we explore a novel approach to credit assignment in deep networks that we call target propagation. The main idea is to compute targets, rather than gradients, at each layer. Like gradients, they are propagated backwards. In a way that is related to but different from previously proposed proxies for back-propagation, which rely on a backwards network with symmetric weights, target propagation relies on auto-encoders at each layer. Unlike back-propagation, it can be applied even when units exchange stochastic bits rather than real numbers. We show that a linear correction for the imperfection of the auto-encoders, called difference target propagation, is very effective in making target propagation actually work, leading to results comparable to back-propagation for deep networks with discrete and continuous units and for denoising auto-encoders, and achieving state-of-the-art results for stochastic networks.
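
The abstract does not spell out the linear correction. A minimal sketch follows, assuming the commonly stated form in which the target for the layer below is h_prev + g(h_target) - g(h), where g is the layer's learned approximate inverse. The elementwise tanh layers, the scalar weights w and v, and all function names are hypothetical choices made only for illustration; they are not the authors' implementation.

```python
import math

def f(h_prev, w):
    """Forward mapping of one layer (toy choice: elementwise tanh(w * x))."""
    return [math.tanh(w * x) for x in h_prev]

def g(h, v):
    """Approximate inverse (decoder) of the layer; deliberately imperfect here."""
    return [math.tanh(v * x) for x in h]

def plain_target(h_target, v):
    """Uncorrected target propagation: push the upper target through g."""
    return g(h_target, v)

def difference_target(h_prev, h, h_target, v):
    """Corrected target (assumed form): h_prev + g(h_target) - g(h)."""
    return [p + a - b for p, a, b in zip(h_prev, g(h_target, v), g(h, v))]

# Toy activations and weights (assumed values; v is not a true inverse of w).
h_prev = [0.5, -1.0, 2.0]
w, v = 0.8, 0.3
h = f(h_prev, w)

# Case 1: the upper layer's target equals its current activation.
plain_same = plain_target(h, v)
diff_same = difference_target(h_prev, h, h, v)
plain_error = max(abs(a - b) for a, b in zip(plain_same, h_prev))
diff_error = max(abs(a - b) for a, b in zip(diff_same, h_prev))

# Case 2: the upper layer's target is nudged slightly downwards.
h_target = [x - 0.1 for x in h]
moved = difference_target(h_prev, h, h_target, v)
```

In this sketch, when the upper target equals the current activation, the corrected target reproduces h_prev up to rounding even though g is a poor inverse, whereas the uncorrected target does not. That is one way to read the abstract's claim that the correction compensates for imperfect auto-encoders.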

Comments:	13 pages, 8 figures, Accepted in ECML/PKDD 2015
Subjects:	Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1412.7525 [cs.LG]
	(or arXiv:1412.7525v5 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1412.7525

Submission history

From: Dong-Hyun Lee
[v1] Tue, 23 Dec 2014 20:57:59 UTC (327 KB)
[v2] Tue, 3 Mar 2015 16:54:57 UTC (313 KB)
[v3] Sat, 18 Apr 2015 01:01:54 UTC (418 KB)
[v4] Sat, 14 Nov 2015 07:05:40 UTC (418 KB)
[v5] Wed, 25 Nov 2015 02:30:41 UTC (413 KB)
