Exploring Accumulated Gradient-Based Quantization and Compression for Deep Neural Networks
The growing complexity of neural networks makes their deployment on resource-constrained embedded or mobile devices challenging. With millions of weights and biases, modern deep neural networks impose heavy memory, power, and computation requirements. In this thesis,...
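The truncated abstract names quantization but does not describe the thesis's accumulated-gradient method, so the following is background only: a minimal sketch of generic min-max uniform weight quantization in NumPy. The function name `quantize_uniform`, the bit width, and the scaling scheme are illustrative assumptions, not the author's technique.

```python
import numpy as np

def quantize_uniform(weights: np.ndarray, num_bits: int = 8) -> np.ndarray:
    """Round a float weight tensor to num_bits levels, then dequantize.

    Generic min-max uniform quantization, shown only to illustrate the
    kind of compression the abstract refers to.
    """
    qmin, qmax = 0, 2 ** num_bits - 1
    w_min, w_max = weights.min(), weights.max()
    scale = (w_max - w_min) / (qmax - qmin) or 1.0  # guard a constant tensor
    zero_point = qmin - w_min / scale
    q = np.clip(np.round(weights / scale + zero_point), qmin, qmax)
    return ((q - zero_point) * scale).astype(weights.dtype)

w = np.random.randn(4, 4).astype(np.float32)
w_q = quantize_uniform(w, num_bits=4)
print(np.abs(w - w_q).max())  # worst-case error is about scale / 2
```

Storing the integer codes plus the scale and zero point, rather than the float weights, is what yields the memory savings on constrained devices.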
| Main Author: | Gaopande, Meghana Laxmidhar |
|---|---|
| Other Authors: | Electrical and Computer Engineering |
| Format: | Others |
| Published: | Virginia Tech, 2020 |
| Online Access: | http://hdl.handle.net/10919/98617 |
Similar Items
- Learning Sparse Low-Precision Neural Networks With Learnable Regularization
  by: Yoojin Choi, et al.
  Published: (2020-01-01)
- Zero-Centered Fixed-Point Quantization With Iterative Retraining for Deep Convolutional Neural Network-Based Object Detectors
  by: Sungrae Kim, et al.
  Published: (2021-01-01)
- A Deep Learning Framework of Quantized Compressed Sensing for Wireless Neural Recording
  by: Biao Sun, et al.
  Published: (2016-01-01)
- Ps and Qs: Quantization-Aware Pruning for Efficient Low Latency Neural Network Inference
  by: Benjamin Hawks, et al.
  Published: (2021-07-01)
- Deep Learning Models Compression for Agricultural Plants
  by: Arnauld Nzegha Fountsop, et al.
  Published: (2020-09-01)