Towards Efficient Inference and Improved Training Efficiency of Deep Neural Networks

Towards Efficient Inference and Improved Training Efficiency of Deep Neural Networks
Author :
Publisher :
Total Pages : 0
Release :
ISBN-10 : OCLC:1381018242
ISBN-13 :
Rating : 4/5 ( Downloads)

Book Synopsis Towards Efficient Inference and Improved Training Efficiency of Deep Neural Networks by : Ravi Shanker Raju (Ph.D.)

Download or read book Towards Efficient Inference and Improved Training Efficiency of Deep Neural Networks written by Ravi Shanker Raju (Ph.D.) and published by . This book was released on 2022 with total page 0 pages. Available in PDF, EPUB and Kindle. Book excerpt: In recent years, deep neural networks have surpassed human performance on image classification tasks and and speech recognition. While current models can reach state of the art performance on stand-alone benchmarks, deploying them on embedded systems that have real-time latency deadlines either cause them to fail these requirements or severely get degraded in performance to meet the stated specifications. This requires intelligent design of the network architecture in order to minimize the accuracy degradation while deployed on the edge. Similarly, deep learning often has a long turn-around time due to the volume of the experiments on different hyperparameters and consumes time and resources. This motivates a need for developing training strategies that allow researchers who do not have access to large computational resources to train large models without waiting for exorbitant training cycles to be completed. This dissertation addresses these concerns through data dependent pruning of deep learning computation. First, regarding inference, we propose an integration of two different conditional execution strategies we call FBS-pruned CondConv by noticing that if we use input-specific filters instead of standard convolutional filters, we can aggressively prune at higher rates and mitigate accuracy degradation for significant computation savings. Then, regarding long training times, we introduce our dynamic data pruning framework which takes ideas from active learning and reinforcement learning to dynamically select subsets of data to train the model. Finally, as opposed to pruning data and in the same spirit of reducing training time, we investigate the vision transformer and introduce a unique training method called PatchDrop (originally designed for robustness to occlusions on transformers [1]), which uses the self-supervised DINO [2] model to identify the salient patches in an image and train on the salient subsets of an image. These strategies/training methods take a step in a direction to make models more accessible to deploy on edge devices in an efficient inference context and reduces the barrier for the independent researcher to train deep learning models which would require immense computational resources, pushing towards the democratization of machine learning.

Towards Efficient Inference and Improved Training Efficiency of Deep Neural Networks Related Books

Towards Efficient Inference and Improved Training Efficiency of Deep Neural Networks
Language: en
Pages: 0
Authors: Ravi Shanker Raju (Ph.D.)
Categories:
Type: BOOK - Published: 2022 - Publisher:

GET EBOOK

In recent years, deep neural networks have surpassed human performance on image classification tasks and and speech recognition. While current models can reach
Towards Efficient Inference and Improved Training Efficiency of Deep Neural Networks
Language: en
Pages: 0
Authors: Ravi Shanker Raju (Ph.D.)
Categories:
Type: BOOK - Published: 2022 - Publisher:

GET EBOOK

In recent years, deep neural networks have surpassed human performance on image classification tasks and and speech recognition. While current models can reach
Efficient Processing of Deep Neural Networks
Language: en
Pages: 254
Authors: Vivienne Sze
Categories: Technology & Engineering
Type: BOOK - Published: 2022-05-31 - Publisher: Springer Nature

GET EBOOK

This book provides a structured treatment of the key principles and techniques for enabling efficient processing of deep neural networks (DNNs). DNNs are curren
Learning in Energy-Efficient Neuromorphic Computing: Algorithm and Architecture Co-Design
Language: en
Pages: 389
Authors: Nan Zheng
Categories: Computers
Type: BOOK - Published: 2019-10-18 - Publisher: John Wiley & Sons

GET EBOOK

Explains current co-design and co-optimization methodologies for building hardware neural networks and algorithms for machine learning applications This book fo
Low-Power Computer Vision
Language: en
Pages: 395
Authors: George K. Thiruvathukal
Categories: Computers
Type: BOOK - Published: 2022-02-22 - Publisher: CRC Press

GET EBOOK

Energy efficiency is critical for running computer vision on battery-powered systems, such as mobile phones or UAVs (unmanned aerial vehicles, or drones). This