DNNs are widely used for complex tasks like image and signal processing, and they are in increasing demand for implementation on Internet of Things (IoT) devices. For these devices, optimizing DNN models is a necessary task. Generally, standard optimization approaches require specialists to manually fine-tune hyper-parameters to find a good trade-off between efficiency and accuracy. In this paper, we propose OptDNN, a software that employs innovative and automatic approaches to determine optimal hyper-parameters for pruning, clustering, and quantization. The models optimized by OptDNN have a smaller memory footprint, faster inference time, and a similar accuracy to the original models.
Dettaglio pubblicazione
2024, SOFTWARE IMPACTS, Pages - (volume: 20)
OptDNN: Automatic deep neural networks optimizer for edge computing (01a Articolo in rivista)
Giovannesi Luca, Mattia Gabriele Proietti, Beraldi Roberto
Gruppo di ricerca: Computer Vision, Computer Graphics, Deep Learning
keywords