Showing 1–31 of 31 results for author: Schiele, B

Searching in archive stat.
  1. arXiv:2309.06166  [pdf, other]

    cs.LG cs.CV stat.ML

    Certified Robust Models with Slack Control and Large Lipschitz Constants

    Authors: Max Losch, David Stutz, Bernt Schiele, Mario Fritz

    Abstract: Despite recent success, state-of-the-art learning-based models remain highly vulnerable to input changes such as adversarial examples. In order to obtain certifiable robustness against such perturbations, recent work considers Lipschitz-based regularizers or constraints while at the same time increasing prediction margin. Unfortunately, this comes at the cost of significantly decreased accuracy. I…

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: To be published at GCPR 2023

  2. arXiv:2301.08669  [pdf, other]

    cs.CV stat.ML

    Holistically Explainable Vision Transformers

    Authors: Moritz Böhle, Mario Fritz, Bernt Schiele

    Abstract: Transformers increasingly dominate the machine learning landscape across many tasks and domains, which increases the importance for understanding their outputs. While their attention modules provide partial insight into their inner workings, the attention scores have been shown to be insufficient for explaining the models as a whole. To address this, we propose B-cos transformers, which inherently…

    Submitted 20 January, 2023; originally announced January 2023.

  3. arXiv:2207.09239  [pdf, other]

    cs.LG stat.ML

    Assaying Out-Of-Distribution Generalization in Transfer Learning

    Authors: Florian Wenzel, Andrea Dittadi, Peter Vincent Gehler, Carl-Johann Simon-Gabriel, Max Horn, Dominik Zietlow, David Kernert, Chris Russell, Thomas Brox, Bernt Schiele, Bernhard Schölkopf, Francesco Locatello

    Abstract: Since out-of-distribution generalization is a generally ill-posed problem, various proxy targets (e.g., calibration, adversarial robustness, algorithmic corruptions, invariance across shifts) were studied across different research programs resulting in different recommendations. While sharing the same aspirational goal, these approaches have never been tested under the same experimental conditions…

    Submitted 21 October, 2022; v1 submitted 19 July, 2022; originally announced July 2022.

  4. arXiv:2205.10268  [pdf, other]

    cs.CV stat.ML

    B-cos Networks: Alignment is All We Need for Interpretability

    Authors: Moritz Böhle, Mario Fritz, Bernt Schiele

    Abstract: We present a new direction for increasing the interpretability of deep neural networks (DNNs) by promoting weight-input alignment during training. For this, we propose to replace the linear transforms in DNNs by our B-cos transform. As we show, a sequence (network) of such transforms induces a single linear transform that faithfully summarises the full model computations. Moreover, the B-cos trans…

    Submitted 20 May, 2022; originally announced May 2022.

  5. arXiv:2204.12393  [pdf, other]

    cs.LG cs.CR cs.CV stat.ML

    On Fragile Features and Batch Normalization in Adversarial Training

    Authors: Nils Philipp Walter, David Stutz, Bernt Schiele

    Abstract: Modern deep learning architectures utilize batch normalization (BN) to stabilize training and improve accuracy. It has been shown that the BN layers alone are surprisingly expressive. In the context of robustness against adversarial examples, however, BN is argued to increase vulnerability. That is, BN helps to learn fragile features. Nevertheless, BN is still used in adversarial training, which is…

    Submitted 26 April, 2022; originally announced April 2022.

  6. arXiv:2109.13004  [pdf, other]

    stat.ML cs.CV cs.LG

    Optimising for Interpretability: Convolutional Dynamic Alignment Networks

    Authors: Moritz Böhle, Mario Fritz, Bernt Schiele

    Abstract: We introduce a new family of neural network models called Convolutional Dynamic Alignment Networks (CoDA Nets), which are performant classifiers with a high degree of inherent interpretability. Their core building blocks are Dynamic Alignment Units (DAUs), which are optimised to transform their inputs with dynamically computed weight vectors that align with task-relevant patterns. As a result, CoD…

    Submitted 15 January, 2024; v1 submitted 27 September, 2021; originally announced September 2021.

    Comments: Extension of "Convolutional Dynamic Alignment Networks for Interpretable Classifications" (Böhle et al., CVPR 2021). arXiv admin note: substantial text overlap with arXiv:2104.00032

    Journal ref: Published in IEEE Transactions on Pattern Analysis and Machine Intelligence (Volume 45, Issue: 6, 01 June 2023, Page(s): 7625 - 7638)

  7. arXiv:2104.04448  [pdf, other]

    cs.LG cs.CV stat.ML

    Relating Adversarially Robust Generalization to Flat Minima

    Authors: David Stutz, Matthias Hein, Bernt Schiele

    Abstract: Adversarial training (AT) has become the de-facto standard to obtain models robust against adversarial examples. However, AT exhibits severe robust overfitting: cross-entropy loss on adversarial examples, so-called robust loss, decreases continuously on training examples, while eventually increasing on test examples. In practice, this leads to poor robust generalization, i.e., adversarial robustne…

    Submitted 6 October, 2021; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: ICCV'21

  8. Adaptive Aggregation Networks for Class-Incremental Learning

    Authors: Yaoyao Liu, Bernt Schiele, Qianru Sun

    Abstract: Class-Incremental Learning (CIL) aims to learn a classification model with the number of classes increasing phase-by-phase. An inherent problem in CIL is the stability-plasticity dilemma between the learning of old and new classes, i.e., high-plasticity models easily forget old classes, but high-stability models are weak to learn new classes. We alleviate this issue by proposing a novel network ar…

    Submitted 29 March, 2021; v1 submitted 10 October, 2020; originally announced October 2020.

    Comments: Accepted to CVPR 2021. Code: https://github.com/yaoyao-liu/class-incremental-learning

  9. arXiv:2006.13977  [pdf, other]

    cs.LG cs.AR cs.CR cs.CV stat.ML

    Bit Error Robustness for Energy-Efficient DNN Accelerators

    Authors: David Stutz, Nandhini Chandramoorthy, Matthias Hein, Bernt Schiele

    Abstract: Deep neural network (DNN) accelerators received considerable attention in past years due to saved energy compared to mainstream hardware. Low-voltage operation of DNN accelerators allows energy consumption to be reduced significantly further but causes bit-level failures in the memory storing the quantized DNN weights. In this paper, we show that a combination of robust fixed-point quantization,…

    Submitted 9 April, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

  10. arXiv:2005.02313  [pdf, other]

    cs.CV cs.CR cs.LG stat.ML

    Adversarial Training against Location-Optimized Adversarial Patches

    Authors: Sukrut Rao, David Stutz, Bernt Schiele

    Abstract: Deep neural networks have been shown to be susceptible to adversarial examples -- small, imperceptible changes constructed to cause mis-classification in otherwise highly accurate image classifiers. As a practical alternative, recent work proposed so-called adversarial patches: clearly visible, but adversarially crafted rectangular patches in images. These patches can easily be printed and applied…

    Submitted 14 December, 2020; v1 submitted 5 May, 2020; originally announced May 2020.

    Comments: 20 pages, 6 tables, 4 figures, 2 algorithms, European Conference on Computer Vision Workshops 2020

    Journal ref: Bartoli, A., Fusiello, A. (eds) Computer Vision - ECCV 2020 Workshops. ECCV 2020. Lecture Notes in Computer Science, vol 12539. Springer, Cham

  11. arXiv:2004.03891  [pdf, other]

    cs.LG cs.CV stat.ML

    Normalizing Flows with Multi-Scale Autoregressive Priors

    Authors: Shweta Mahajan, Apratim Bhattacharyya, Mario Fritz, Bernt Schiele, Stefan Roth

    Abstract: Flow-based generative models are an important class of exact inference models that admit efficient inference and sampling for image synthesis. Owing to the efficiency constraints on the design of the flow layers, e.g. split coupling flow layers in which approximately half the pixels do not undergo further transformations, they have limited expressiveness for modeling long-range data dependencies c…

    Submitted 8 April, 2020; originally announced April 2020.

    Comments: To appear in CVPR 2020

  12. Mnemonics Training: Multi-Class Incremental Learning without Forgetting

    Authors: Yaoyao Liu, Yuting Su, An-An Liu, Bernt Schiele, Qianru Sun

    Abstract: Multi-Class Incremental Learning (MCIL) aims to learn new concepts by incrementally updating a model trained on previous concepts. However, there is an inherent trade-off to effectively learning new concepts without catastrophic forgetting of previous ones. To alleviate this issue, it has been proposed to keep around a few examples of the previous concepts but the effectiveness of this approach he…

    Submitted 4 April, 2021; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: Experiment results updated (different from the conference version). Code is available at https://github.com/yaoyao-liu/mnemonics-training

  13. arXiv:1910.06259  [pdf, other]

    cs.LG cs.CR cs.CV stat.ML

    Confidence-Calibrated Adversarial Training: Generalizing to Unseen Attacks

    Authors: David Stutz, Matthias Hein, Bernt Schiele

    Abstract: Adversarial training yields robust models against a specific threat model, e.g., $L_\infty$ adversarial examples. Typically robustness does not generalize to previously unseen threat models, e.g., other $L_p$ norms, or larger perturbations. Our confidence-calibrated adversarial training (CCAT) tackles this problem by biasing the model towards low confidence predictions on adversarial examples. By…

    Submitted 30 June, 2020; v1 submitted 14 October, 2019; originally announced October 2019.

  14. arXiv:1910.03648  [pdf, ps, other]

    cs.CV cs.LG stat.ML

    Meta-Transfer Learning through Hard Tasks

    Authors: Qianru Sun, Yaoyao Liu, Zhaozheng Chen, Tat-Seng Chua, Bernt Schiele

    Abstract: Meta-learning has been proposed as a framework to address the challenging few-shot learning setting. The key idea is to leverage a large number of similar few-shot tasks in order to learn how to adapt a base-learner to a new task for which only a few labeled samples are available. As deep neural networks (DNNs) tend to overfit using a few samples only, typical meta-learning models use shallow neur…

    Submitted 7 October, 2019; originally announced October 2019.

    Comments: An extended version of a paper published in CVPR2019. Under review. arXiv admin note: substantial text overlap with arXiv:1812.02391

  15. arXiv:1909.12598  [pdf, other]

    cs.LG stat.ML

    "Best-of-Many-Samples" Distribution Matching

    Authors: Apratim Bhattacharyya, Mario Fritz, Bernt Schiele

    Abstract: Generative Adversarial Networks (GANs) can achieve state-of-the-art sample quality in generative modelling tasks but suffer from the mode collapse problem. Variational Autoencoders (VAE) on the other hand explicitly maximize a reconstruction-based data log-likelihood forcing it to cover all modes, but suffer from poorer sample quality. Recent works have proposed hybrid VAE-GAN frameworks which int…

    Submitted 27 September, 2019; originally announced September 2019.

  16. arXiv:1908.09008  [pdf, other]

    cs.CV cs.LG stat.ML

    Conditional Flow Variational Autoencoders for Structured Sequence Prediction

    Authors: Apratim Bhattacharyya, Michael Hanselmann, Mario Fritz, Bernt Schiele, Christoph-Nikolas Straehle

    Abstract: Prediction of future states of the environment and interacting agents is a key competence required for autonomous agents to operate successfully in the real world. Prior work for structured sequence prediction based on latent variable models imposes a uni-modal standard Gaussian prior on the latent variables. This induces a strong model bias which makes it challenging to fully capture the multi-mo…

    Submitted 18 August, 2020; v1 submitted 24 August, 2019; originally announced August 2019.

    Comments: To appear at Bayesian Deep Learning and Machine Learning for Autonomous Driving @NeurIPS 2019

  17. arXiv:1906.10908  [pdf, other]

    cs.LG cs.CR cs.CV stat.ML

    Prediction Poisoning: Towards Defenses Against DNN Model Stealing Attacks

    Authors: Tribhuvanesh Orekondy, Bernt Schiele, Mario Fritz

    Abstract: High-performance Deep Neural Networks (DNNs) are increasingly deployed in many real-world applications e.g., cloud prediction APIs. Recent advances in model functionality stealing attacks via black-box access (i.e., inputs in, predictions out) threaten the business model of such applications, which require a lot of time, money, and effort to develop. Existing defenses take a passive role against s…

    Submitted 3 March, 2020; v1 submitted 26 June, 2019; originally announced June 2019.

    Comments: ICLR 2020, Project page: https://resources.mpi-inf.mpg.de/d2/orekondy/predpoison/

  18. arXiv:1906.00562  [pdf, ps, other]

    cs.CV cs.LG stat.ML

    Learning to Self-Train for Semi-Supervised Few-Shot Classification

    Authors: Xinzhe Li, Qianru Sun, Yaoyao Liu, Shibao Zheng, Qin Zhou, Tat-Seng Chua, Bernt Schiele

    Abstract: Few-shot classification (FSC) is challenging due to the scarcity of labeled training data (e.g. only one labeled data point per class). Meta-learning has been shown to achieve promising results by learning to initialize a classification model for FSC. In this paper we propose a novel semi-supervised meta-learning method called learning to self-train (LST) that leverages unlabeled data and specifically…

    Submitted 29 September, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  19. An Ensemble of Epoch-wise Empirical Bayes for Few-shot Learning

    Authors: Yaoyao Liu, Bernt Schiele, Qianru Sun

    Abstract: Few-shot learning aims to train efficient predictive models with a few examples. The lack of training data leads to poor models that perform high-variance or low-confidence predictions. In this paper, we propose to meta-learn the ensemble of epoch-wise empirical Bayes models (E3BM) to achieve robust predictions. "Epoch-wise" means that each training epoch has a Bayes model whose parameters are spe…

    Submitted 17 July, 2020; v1 submitted 17 April, 2019; originally announced April 2019.

  20. arXiv:1812.06707  [pdf, other]

    cs.CV cs.AI stat.ML

    Not Using the Car to See the Sidewalk: Quantifying and Controlling the Effects of Context in Classification and Segmentation

    Authors: Rakshith Shetty, Bernt Schiele, Mario Fritz

    Abstract: Importance of visual context in scene understanding tasks is well recognized in the computer vision community. However, to what extent the computer vision models for image classification and semantic segmentation are dependent on the context to make their predictions is unclear. A model overly relying on context will fail when encountering objects in context distributions different from training d…

    Submitted 17 December, 2018; originally announced December 2018.

    Comments: 14 pages (12 figures)

  21. arXiv:1812.00740  [pdf, other]

    cs.CV cs.CR cs.LG stat.ML

    Disentangling Adversarial Robustness and Generalization

    Authors: David Stutz, Matthias Hein, Bernt Schiele

    Abstract: Obtaining deep networks that are robust against adversarial examples and generalize well is an open problem. A recent hypothesis even states that both robust and accurate models are impossible, i.e., adversarial robustness and generalization are conflicting goals. In an effort to clarify the relationship between robustness and generalization, we assume an underlying, low-dimensional data manifold…

    Submitted 10 April, 2019; v1 submitted 3 December, 2018; originally announced December 2018.

    Comments: Conference on Computer Vision and Pattern Recognition 2019

  22. arXiv:1806.07772  [pdf, other]

    cs.LG cs.CV stat.ML

    Accurate and Diverse Sampling of Sequences based on a "Best of Many" Sample Objective

    Authors: Apratim Bhattacharyya, Bernt Schiele, Mario Fritz

    Abstract: For autonomous agents to successfully operate in the real world, anticipation of future events and states of their environment is a key competence. This problem has been formalized as a sequence extrapolation problem, where a number of observations are used to predict the sequence into the future. Real-world scenarios demand a model of uncertainty of such predictions, as predictions become increas…

    Submitted 15 October, 2018; v1 submitted 20 June, 2018; originally announced June 2018.

    Comments: Added additional references and baselines. (Appeared in CVPR 2018)

  23. arXiv:1806.01911  [pdf, other]

    cs.CV cs.AI stat.ML

    Adversarial Scene Editing: Automatic Object Removal from Weak Supervision

    Authors: Rakshith Shetty, Mario Fritz, Bernt Schiele

    Abstract: While great progress has been made recently in automatic image manipulation, it has been limited to object centric images like faces or structured scene datasets. In this work, we take a step towards general scene-level image editing by developing an automatic interaction-free object removal model. Our model learns to find and remove objects from general scene images using image-level labels and u…

    Submitted 5 June, 2018; originally announced June 2018.

  24. arXiv:1805.05838  [pdf, other]

    cs.CR cs.AI cs.CV cs.LG stat.ML

    Gradient-Leaks: Understanding and Controlling Deanonymization in Federated Learning

    Authors: Tribhuvanesh Orekondy, Seong Joon Oh, Yang Zhang, Bernt Schiele, Mario Fritz

    Abstract: Federated Learning (FL) systems are gaining popularity as a solution to training Machine Learning (ML) models from large-scale user data collected on personal devices (e.g., smartphones) without their raw data leaving the device. At the core of FL is a network of anonymous user devices sharing training information (model parameter updates) computed locally on personal data. However, the type and d…

    Submitted 13 September, 2020; v1 submitted 15 May, 2018; originally announced May 2018.

  25. arXiv:1711.01921  [pdf, other]

    cs.CR cs.CL cs.CY cs.SI stat.ML

    $A^{4}NT$: Author Attribute Anonymity by Adversarial Training of Neural Machine Translation

    Authors: Rakshith Shetty, Bernt Schiele, Mario Fritz

    Abstract: Text-based analysis methods can reveal privacy-relevant author attributes such as gender, age and identity of the text's author. Such methods can compromise the privacy of an anonymous author even when the author tries to remove privacy sensitive content. In this paper, we propose an automatic method, called Adversarial Author Attribute Anonymity Neural Translation ($A^4NT$), to combat such t…

    Submitted 19 February, 2018; v1 submitted 6 November, 2017; originally announced November 2017.

    Comments: 16 pages, 10 figures and 8 tables

  26. arXiv:1711.01768  [pdf, other]

    stat.ML cs.CR cs.CV cs.LG

    Towards Reverse-Engineering Black-Box Neural Networks

    Authors: Seong Joon Oh, Max Augustin, Bernt Schiele, Mario Fritz

    Abstract: Many deployed learned models are black boxes: given input, returns output. Internal information about the model, such as the architecture, optimisation procedure, or training data, is not disclosed explicitly as it might contain proprietary information or make the system more vulnerable. This work shows that such attributes of neural networks can be exposed from a sequence of queries. This has mul…

    Submitted 14 February, 2018; v1 submitted 6 November, 2017; originally announced November 2017.

    Comments: 20 pages, 12 figures, to appear at ICLR'18. Code: https://goo.gl/MbYfsv

  27. arXiv:1612.03663  [pdf, other]

    cs.CV cs.LG stat.ML

    Analysis and Optimization of Loss Functions for Multiclass, Top-k, and Multilabel Classification

    Authors: Maksim Lapin, Matthias Hein, Bernt Schiele

    Abstract: Top-k error is currently a popular performance measure on large scale image classification benchmarks such as ImageNet and Places. Despite its wide acceptance, our understanding of this metric is limited as most of the previous research is focused on its special case, the top-1 error. In this work, we explore two directions that shed more light on the top-k error. First, we provide an in-depth ana…

    Submitted 12 December, 2016; originally announced December 2016.

  28. arXiv:1512.00486  [pdf, other]

    stat.ML cs.CV cs.LG

    Loss Functions for Top-k Error: Analysis and Insights

    Authors: Maksim Lapin, Matthias Hein, Bernt Schiele

    Abstract: In order to push the performance on realistic computer vision tasks, the number of classes in modern benchmark datasets has significantly increased in recent years. This increase in the number of classes comes along with increased ambiguity between the class labels, raising the question if top-1 error is the right performance measure. In this paper, we provide an extensive comparison and evaluatio…

    Submitted 13 April, 2016; v1 submitted 1 December, 2015; originally announced December 2015.

    Comments: In Computer Vision and Pattern Recognition (CVPR), 2016

  29. arXiv:1511.06683  [pdf, other]

    stat.ML cs.CV cs.LG

    Top-k Multiclass SVM

    Authors: Maksim Lapin, Matthias Hein, Bernt Schiele

    Abstract: Class ambiguity is typical in image classification problems with a large number of classes. When classes are difficult to discriminate, it makes sense to allow k guesses and evaluate classifiers based on the top-k error instead of the standard zero-one loss. We propose top-k multiclass SVM as a direct method to optimize for top-k performance. Our generalization of the well-known multiclass SVM is…

    Submitted 20 November, 2015; originally announced November 2015.

    Comments: NIPS 2015

  30. arXiv:1511.05706  [pdf, other]

    stat.ML cs.LG

    Efficient Output Kernel Learning for Multiple Tasks

    Authors: Pratik Jawanpuria, Maksim Lapin, Matthias Hein, Bernt Schiele

    Abstract: The paradigm of multi-task learning is that one can achieve better generalization by learning tasks jointly and thus exploiting the similarity between the tasks rather than learning them independently of each other. While previously the relationship between tasks had to be user-defined in the form of an output kernel, recent approaches jointly learn the tasks and the output kernel. As the output k…

    Submitted 18 November, 2015; originally announced November 2015.

  31. Learning Using Privileged Information: SVM+ and Weighted SVM

    Authors: Maksim Lapin, Matthias Hein, Bernt Schiele

    Abstract: Prior knowledge can be used to improve predictive performance of learning algorithms or reduce the amount of data required for training. The same goal is pursued within the learning using privileged information paradigm which was recently introduced by Vapnik et al. and is aimed at utilizing additional information available only at training time -- a framework implemented by SVM+. We relate the pr…

    Submitted 2 March, 2014; v1 submitted 13 June, 2013; originally announced June 2013.

    Comments: 18 pages, 8 figures; integrated reviewer comments, improved typesetting

    Journal ref: Neural Networks 53C (2014), pp. 95-108