Generalisation in humans and deep neural networks

Geirhos, Robert; Temme, Carlos R. Medina; Rauber, Jonas; Schütt, Heiko Herbert; Bethge, Matthias; Wichmann, Felix A.

Treffer 11 von 13

Generalisation in humans and deep neural networks

Robert Geirhos, Carlos R. Medina Temme, Jonas Rauber, Heiko Herbert Schütt, Matthias Bethge, Felix A. Wichmann

We compare the robustness of humans and current convolutional deep neural networks (DNNs) on object recognition under twelve different types of image degradations. First, using three well known DNNs (ResNet-152, VGG-19, GoogLeNet) we find the human visual system to be more robust to nearly all of the tested image manipulations, and we observe progressively diverging classification error-patterns between humans and DNNs when the signal gets weaker. Secondly, we show that DNNs trained directly on distorted images consistently surpass human performance on the exact distortion types they were trained on, yet they display extremely poor generalisation abilities when tested on other distortion types. For example, training on salt-and-pepper noise does not imply robustness on uniform white noise and vice versa. Thus, changes in the noise distribution between training and testing constitutes a crucial challenge to deep learning vision systems that can be systematically addressed in a lifelong machine learning approach. Our new datasetWe compare the robustness of humans and current convolutional deep neural networks (DNNs) on object recognition under twelve different types of image degradations. First, using three well known DNNs (ResNet-152, VGG-19, GoogLeNet) we find the human visual system to be more robust to nearly all of the tested image manipulations, and we observe progressively diverging classification error-patterns between humans and DNNs when the signal gets weaker. Secondly, we show that DNNs trained directly on distorted images consistently surpass human performance on the exact distortion types they were trained on, yet they display extremely poor generalisation abilities when tested on other distortion types. For example, training on salt-and-pepper noise does not imply robustness on uniform white noise and vice versa. Thus, changes in the noise distribution between training and testing constitutes a crucial challenge to deep learning vision systems that can be systematically addressed in a lifelong machine learning approach. Our new dataset consisting of 83K carefully measured human psychophysical trials provide a useful reference for lifelong robustness against image degradations set by the human visual system.…

Metadaten
Verfasserangaben:	Robert Geirhos, Carlos R. Medina Temme, Jonas Rauber, Heiko Herbert Schütt ORCiD GND, Matthias Bethge, Felix A. Wichmann ORCiD
ISSN:	1049-5258
Titel des übergeordneten Werks (Englisch):	Proceedings of the 32nd International Conference on Neural Information Processing Systems
Verlag:	Curran Associates Inc.
Verlagsort:	Red Hook
Publikationstyp:	Sonstiges
Sprache:	Englisch
Datum der Erstveröffentlichung:	03.12.2018
Erscheinungsjahr:	2018
Datum der Freischaltung:	24.02.2022
Band:	31
Seitenanzahl:	13
Erste Seite:	7549
Letzte Seite:	7561
Fördernde Institution:	German Federal Ministry of Education and Research (BMBF) through the Bernstein Computational Neuroscience Program TubingenFederal Ministry of Education & Research (BMBF) [FKZ: 01GQ1002]; German Research Foundation (DFG)German Research Foundation (DFG) [Sachbeihilfe Wi 2103/4-1, SFB 1233]; International Max Planck Research School for Intelligent Systems (IMPRS-IS); Bosch Forschungsstiftung (Stifterverband) [T113/30057/17]; Centre for Integrative Neuroscience Tubingen [EXC 307]; Intelligence Advanced Research Projects Activity (IARPA) via Department of Interior/Interior Business Center (DoI/IBC) [D16PC00003]
Organisationseinheiten:	Humanwissenschaftliche Fakultät / Strukturbereich Kognitionswissenschaften / Department Psychologie
DDC-Klassifikation:	1 Philosophie und Psychologie / 15 Psychologie / 150 Psychologie

Generalisation in humans and deep neural networks

Metadaten exportieren

Weitere Dienste