• Treffer 11 von 13
Zurück zur Trefferliste

Generalisation in humans and deep neural networks

  • We compare the robustness of humans and current convolutional deep neural networks (DNNs) on object recognition under twelve different types of image degradations. First, using three well known DNNs (ResNet-152, VGG-19, GoogLeNet) we find the human visual system to be more robust to nearly all of the tested image manipulations, and we observe progressively diverging classification error-patterns between humans and DNNs when the signal gets weaker. Secondly, we show that DNNs trained directly on distorted images consistently surpass human performance on the exact distortion types they were trained on, yet they display extremely poor generalisation abilities when tested on other distortion types. For example, training on salt-and-pepper noise does not imply robustness on uniform white noise and vice versa. Thus, changes in the noise distribution between training and testing constitutes a crucial challenge to deep learning vision systems that can be systematically addressed in a lifelong machine learning approach. Our new datasetWe compare the robustness of humans and current convolutional deep neural networks (DNNs) on object recognition under twelve different types of image degradations. First, using three well known DNNs (ResNet-152, VGG-19, GoogLeNet) we find the human visual system to be more robust to nearly all of the tested image manipulations, and we observe progressively diverging classification error-patterns between humans and DNNs when the signal gets weaker. Secondly, we show that DNNs trained directly on distorted images consistently surpass human performance on the exact distortion types they were trained on, yet they display extremely poor generalisation abilities when tested on other distortion types. For example, training on salt-and-pepper noise does not imply robustness on uniform white noise and vice versa. Thus, changes in the noise distribution between training and testing constitutes a crucial challenge to deep learning vision systems that can be systematically addressed in a lifelong machine learning approach. Our new dataset consisting of 83K carefully measured human psychophysical trials provide a useful reference for lifelong robustness against image degradations set by the human visual system.zeige mehrzeige weniger

Metadaten exportieren

Weitere Dienste

Suche bei Google Scholar Statistik - Anzahl der Zugriffe auf das Dokument
Metadaten
Verfasserangaben:Robert Geirhos, Carlos R. Medina Temme, Jonas Rauber, Heiko Herbert SchüttORCiDGND, Matthias Bethge, Felix A. WichmannORCiD
ISSN:1049-5258
Titel des übergeordneten Werks (Englisch):Proceedings of the 32nd International Conference on Neural Information Processing Systems
Verlag:Curran Associates Inc.
Verlagsort:Red Hook
Publikationstyp:Sonstiges
Sprache:Englisch
Datum der Erstveröffentlichung:03.12.2018
Erscheinungsjahr:2018
Datum der Freischaltung:24.02.2022
Band:31
Seitenanzahl:13
Erste Seite:7549
Letzte Seite:7561
Fördernde Institution:German Federal Ministry of Education and Research (BMBF) through the Bernstein Computational Neuroscience Program TubingenFederal Ministry of Education & Research (BMBF) [FKZ: 01GQ1002]; German Research Foundation (DFG)German Research Foundation (DFG) [Sachbeihilfe Wi 2103/4-1, SFB 1233]; International Max Planck Research School for Intelligent Systems (IMPRS-IS); Bosch Forschungsstiftung (Stifterverband) [T113/30057/17]; Centre for Integrative Neuroscience Tubingen [EXC 307]; Intelligence Advanced Research Projects Activity (IARPA) via Department of Interior/Interior Business Center (DoI/IBC) [D16PC00003]
Organisationseinheiten:Humanwissenschaftliche Fakultät / Strukturbereich Kognitionswissenschaften / Department Psychologie
DDC-Klassifikation:1 Philosophie und Psychologie / 15 Psychologie / 150 Psychologie
Verstanden ✔
Diese Webseite verwendet technisch erforderliche Session-Cookies. Durch die weitere Nutzung der Webseite stimmen Sie diesem zu. Unsere Datenschutzerklärung finden Sie hier.