A 58.6mW Real-Time Programmable Object Detector with Multi-Scale Multi-Object Support Using Deformable Parts Model on 1920x1080 Video at 30fps

This paper presents a programmable, energy-efficient and real-time object detection accelerator using deformable parts models (DPM), with 2× higher accuracy than traditional rigid body models. With 8 deformable parts detection, three methods are used to address the high computational complexity: cla...

Full description

Bibliographic Details
Main Authors:	Suleiman, Amr AbdulZahir (Contributor), Zhang, Zhengdong (Contributor), Sze, Vivienne (Contributor)
Other Authors:	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Contributor)
Format:	Article
Language:	English
Published:	Institute of Electrical and Electronics Engineers (IEEE), 2017-04-28T17:22:06Z.
Subjects:	Article
Online Access:	Get fulltext


LEADER	02034 am a22002413u 4500
001	108495
042			\|a dc
100	1	0	\|a Suleiman, Amr AbdulZahir \|e author
100	1	0	\|a Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science \|e contributor
100	1	0	\|a Sze, Vivienne \|e contributor
100	1	0	\|a Suleiman, Amr AbdulZahir \|e contributor
100	1	0	\|a Zhang, Zhengdong \|e contributor
100	1	0	\|a Sze, Vivienne \|e contributor
700	1	0	\|a Zhang, Zhengdong \|e author
700	1	0	\|a Sze, Vivienne \|e author
245	0	0	\|a A 58.6mW Real-Time Programmable Object Detector with Multi-Scale Multi-Object Support Using Deformable Parts Model on 1920x1080 Video at 30fps
260			\|b Institute of Electrical and Electronics Engineers (IEEE), \|c 2017-04-28T17:22:06Z.
856			\|z Get fulltext \|u http://hdl.handle.net/1721.1/108495
520			\|a This paper presents a programmable, energy-efficient and real-time object detection accelerator using deformable parts models (DPM), with 2× higher accuracy than traditional rigid body models. With 8 deformable parts detection, three methods are used to address the high computational complexity: classification pruning for 33× fewer parts classification, vector quantization for 15× memory size reduction, and feature basis projection for 2× reduction of the cost of each classification. The chip is implemented in 65nm CMOS technology, and can process HD (1920×1080) images at 30fps without any off-chip storage while consuming only 58.6mW (0.94nJ/pixel, 1168 GOPS/W). The chip has two classification engines to simultaneously detect two different classes of objects. With a tested high throughput of 60fps, the classification engines can be time multiplexed to detect even more than two object classes. It is energy scalable by changing the pruning factor or disabling the parts classification.
520			\|a United States. Defense Advanced Research Projects Agency
546			\|a en_US
655	7		\|a Article
773			\|t 2016 IEEE Symposium on VLSI Circuits (VLSI-Circuits)

A 58.6mW Real-Time Programmable Object Detector with Multi-Scale Multi-Object Support Using Deformable Parts Model on 1920x1080 Video at 30fps

Similar Items