• search hit 80 of 195
Back to Result List

Generalisation in humans and deep neural networks

  • We compare the robustness of humans and current convolutional deep neural networks (DNNs) on object recognition under twelve different types of image degradations. First, using three well known DNNs (ResNet-152, VGG-19, GoogLeNet) we find the human visual system to be more robust to nearly all of the tested image manipulations, and we observe progressively diverging classification error-patterns between humans and DNNs when the signal gets weaker. Secondly, we show that DNNs trained directly on distorted images consistently surpass human performance on the exact distortion types they were trained on, yet they display extremely poor generalisation abilities when tested on other distortion types. For example, training on salt-and-pepper noise does not imply robustness on uniform white noise and vice versa. Thus, changes in the noise distribution between training and testing constitutes a crucial challenge to deep learning vision systems that can be systematically addressed in a lifelong machine learning approach. Our new datasetWe compare the robustness of humans and current convolutional deep neural networks (DNNs) on object recognition under twelve different types of image degradations. First, using three well known DNNs (ResNet-152, VGG-19, GoogLeNet) we find the human visual system to be more robust to nearly all of the tested image manipulations, and we observe progressively diverging classification error-patterns between humans and DNNs when the signal gets weaker. Secondly, we show that DNNs trained directly on distorted images consistently surpass human performance on the exact distortion types they were trained on, yet they display extremely poor generalisation abilities when tested on other distortion types. For example, training on salt-and-pepper noise does not imply robustness on uniform white noise and vice versa. Thus, changes in the noise distribution between training and testing constitutes a crucial challenge to deep learning vision systems that can be systematically addressed in a lifelong machine learning approach. Our new dataset consisting of 83K carefully measured human psychophysical trials provide a useful reference for lifelong robustness against image degradations set by the human visual system.show moreshow less

Export metadata

Additional Services

Search Google Scholar Statistics
Metadaten
Author details:Robert Geirhos, Carlos R. Medina Temme, Jonas Rauber, Heiko Herbert SchüttORCiDGND, Matthias Bethge, Felix A. WichmannORCiD
ISSN:1049-5258
Title of parent work (English):Proceedings of the 32nd International Conference on Neural Information Processing Systems
Publisher:Curran Associates Inc.
Place of publishing:Red Hook
Publication type:Other
Language:English
Date of first publication:2018/12/03
Publication year:2018
Release date:2022/02/24
Volume:31
Number of pages:13
First page:7549
Last Page:7561
Funding institution:German Federal Ministry of Education and Research (BMBF) through the Bernstein Computational Neuroscience Program TubingenFederal Ministry of Education & Research (BMBF) [FKZ: 01GQ1002]; German Research Foundation (DFG)German Research Foundation (DFG) [Sachbeihilfe Wi 2103/4-1, SFB 1233]; International Max Planck Research School for Intelligent Systems (IMPRS-IS); Bosch Forschungsstiftung (Stifterverband) [T113/30057/17]; Centre for Integrative Neuroscience Tubingen [EXC 307]; Intelligence Advanced Research Projects Activity (IARPA) via Department of Interior/Interior Business Center (DoI/IBC) [D16PC00003]
Organizational units:Humanwissenschaftliche Fakultät / Strukturbereich Kognitionswissenschaften / Department Psychologie
DDC classification:1 Philosophie und Psychologie / 15 Psychologie / 150 Psychologie
Accept ✔
This website uses technically necessary session cookies. By continuing to use the website, you agree to this. You can find our privacy policy here.