Open access
Autor(in)
Datum
2020Typ
- Master Thesis
ETH Bibliographie
yes
Altmetrics
Abstract
Over the last decades, complex deep neural networks have revolutionized Artificial Intelligence (AI) research. These models can now achieve impressive performances on various complex tasks like recognition, detection and image semantic segmentation, achieving accuracy close to, or even better, than human perception. However, these neural networks require to be both deep and complex and this complexity constitutes a danger for the safety verification (certification) and interpretability of a neural network model.
This project explores the certification properties of complex neural networks by taking them into ”shallow waters”. First, a detailed investigation of efficient model distillation techniques is conducted. Then, using the shallow models trained with these distillation methods, several of their properties are further explored, among them adversarial robustness and their performance under parameter reduction procedures. Finally, by combining network’s convex relaxation with model compression, the certification area of shallow student models (derived from either normally or robustly trained teacher networks) is researched. Through all of these experimental results, it is empirically demonstrated and proved that model distillation leads to shallow models with larger certification areas than their equivalent complex teacher networks. Therefore, based on this thesis evidence, shallow distillated networks constitute a possible solution to the safety and interpretability issues that modern complex Artificial Intelligence (AI) models face. Mehr anzeigen
Persistenter Link
https://doi.org/10.3929/ethz-b-000531617Publikationsstatus
publishedBeteiligte
Referent: Hofmann, Thomas
Referent: Roth, Kevin
Referent: Kilcher, Yannic
Referent: Haber, David
Verlag
ETH ZurichThema
Machine learning; Artificial intelligence; Model DistillationOrganisationseinheit
09462 - Hofmann, Thomas / Hofmann, Thomas
ETH Bibliographie
yes
Altmetrics