Skip to main content

Showing 1–8 of 8 results for author: Kokiopoulou, E

Searching in archive stat. Search in all archives.
.
  1. arXiv:2301.12860  [pdf, other

    cs.LG stat.ML

    Massively Scaling Heteroscedastic Classifiers

    Authors: Mark Collier, Rodolphe Jenatton, Basil Mustafa, Neil Houlsby, Jesse Berent, Effrosyni Kokiopoulou

    Abstract: Heteroscedastic classifiers, which learn a multivariate Gaussian distribution over prediction logits, have been shown to perform well on image classification problems with hundreds to thousands of classes. However, compared to standard classifiers, they introduce extra parameters that scale linearly with the number of classes. This makes them infeasible to apply to larger-scale problems. In additi… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: Accepted to ICLR 2023

  2. arXiv:2110.02609  [pdf, other

    stat.ML cs.LG

    Deep Classifiers with Label Noise Modeling and Distance Awareness

    Authors: Vincent Fortuin, Mark Collier, Florian Wenzel, James Allingham, Jeremiah Liu, Dustin Tran, Balaji Lakshminarayanan, Jesse Berent, Rodolphe Jenatton, Effrosyni Kokiopoulou

    Abstract: Uncertainty estimation in deep learning has recently emerged as a crucial area of interest to advance reliability and robustness in safety-critical applications. While there have been many proposed methods that either focus on distance-aware model uncertainties for out-of-distribution detection or on input-dependent label uncertainties for in-distribution calibration, both of these types of uncert… ▽ More

    Submitted 8 August, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: Published in TMLR

  3. arXiv:2105.10305  [pdf, other

    cs.LG cs.CV stat.ML

    Correlated Input-Dependent Label Noise in Large-Scale Image Classification

    Authors: Mark Collier, Basil Mustafa, Efi Kokiopoulou, Rodolphe Jenatton, Jesse Berent

    Abstract: Large scale image classification datasets often contain noisy labels. We take a principled probabilistic approach to modelling input-dependent, also known as heteroscedastic, label noise in these datasets. We place a multivariate Normal distributed latent variable on the final hidden layer of a neural network classifier. The covariance matrix of this latent variable, models the aleatoric uncertain… ▽ More

    Submitted 19 May, 2021; originally announced May 2021.

    Comments: Accepted as Oral at CVPR 2021

  4. arXiv:2009.04381  [pdf, other

    cs.LG stat.ML

    Routing Networks with Co-training for Continual Learning

    Authors: Mark Collier, Efi Kokiopoulou, Andrea Gesmundo, Jesse Berent

    Abstract: The core challenge with continual learning is catastrophic forgetting, the phenomenon that when neural networks are trained on a sequence of tasks they rapidly forget previously learned tasks. It has been observed that catastrophic forgetting is most severe when tasks are dissimilar to each other. We propose the use of sparse routing networks for continual learning. For each input, these network a… ▽ More

    Submitted 9 September, 2020; originally announced September 2020.

    Comments: Presented at ICML Workshop on Continual Learning 2020

  5. arXiv:2003.06778  [pdf, other

    cs.LG stat.ML

    A Simple Probabilistic Method for Deep Classification under Input-Dependent Label Noise

    Authors: Mark Collier, Basil Mustafa, Efi Kokiopoulou, Rodolphe Jenatton, Jesse Berent

    Abstract: Datasets with noisy labels are a common occurrence in practical applications of classification methods. We propose a simple probabilistic method for training deep classifiers under input-dependent (heteroscedastic) label noise. We assume an underlying heteroscedastic generative process for noisy labels. To make gradient based training feasible we use a temperature parameterized softmax as a smooth… ▽ More

    Submitted 12 November, 2020; v1 submitted 15 March, 2020; originally announced March 2020.

  6. arXiv:1911.11481  [pdf, other

    cs.LG stat.ML

    Ranking architectures using meta-learning

    Authors: Alina Dubatovka, Efi Kokiopoulou, Luciano Sbaiz, Andrea Gesmundo, Gabor Bartok, Jesse Berent

    Abstract: Neural architecture search has recently attracted lots of research efforts as it promises to automate the manual design of neural networks. However, it requires a large amount of computing resources and in order to alleviate this, a performance prediction network has been recently proposed that enables efficient architecture search by forecasting the performance of candidate architectures, instead… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: NeurIPS 2019 Meta-Learning workshop

  7. arXiv:1910.04915  [pdf, other

    cs.LG stat.ML

    Flexible Multi-task Networks by Learning Parameter Allocation

    Authors: Krzysztof Maziarz, Efi Kokiopoulou, Andrea Gesmundo, Luciano Sbaiz, Gabor Bartok, Jesse Berent

    Abstract: This paper proposes a novel learning method for multi-task applications. Multi-task neural networks can learn to transfer knowledge across different tasks by using parameter sharing. However, sharing parameters between unrelated tasks can hurt performance. To address this issue, we propose a framework to learn fine-grained patterns of parameter sharing. Assuming that the network is composed of sev… ▽ More

    Submitted 18 July, 2020; v1 submitted 10 October, 2019; originally announced October 2019.

  8. arXiv:1902.05781  [pdf, other

    cs.LG stat.ML

    Fast Task-Aware Architecture Inference

    Authors: Efi Kokiopoulou, Anja Hauth, Luciano Sbaiz, Andrea Gesmundo, Gabor Bartok, Jesse Berent

    Abstract: Neural architecture search has been shown to hold great promise towards the automation of deep learning. However in spite of its potential, neural architecture search remains quite costly. To this point, we propose a novel gradient-based framework for efficient architecture search by sharing information across several tasks. We start by training many model architectures on several related (trainin… ▽ More

    Submitted 15 February, 2019; originally announced February 2019.