Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA

Neuromorphic vision sensors detect changes in luminosity taking inspiration from mammalian retina and providing a stream of events with high temporal resolution, also known as Dynamic Vision Sensors (DVS). This continuous stream of events can be used to extract spatio-temporal patterns from a scene....

Full description

Bibliographic Details
Main Authors:	Ricardo Tapiador-Morales, Jean-Matthieu Maro, Angel Jimenez-Fernandez, Gabriel Jimenez-Moreno, Ryad Benosman, Alejandro Linares-Barranco
Format:	Article
Language:	English
Published:	MDPI AG 2020-06-01
Series:	Sensors
Subjects:	dynamic vision sensors event-based synchronous digital VLSI HDL FPGA pattern recognition
Online Access:	https://www.mdpi.com/1424-8220/20/12/3404

id	doaj-5d0bc19eb6904f4b8b668519e6271bbc
record_format	Article
spelling	doaj-5d0bc19eb6904f4b8b668519e6271bbc2020-11-25T03:59:23ZengMDPI AGSensors1424-82202020-06-01203404340410.3390/s20123404Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGARicardo Tapiador-Morales0Jean-Matthieu Maro1Angel Jimenez-Fernandez2Gabriel Jimenez-Moreno3Ryad Benosman4Alejandro Linares-Barranco5Robotics and Technology of Computers Lab (ETSII-EPS), University of Seville, 41089 Sevilla, SpainNeuromorphic Vision and Natural Computation, Sorbonne Université, 75006 Paris, FranceRobotics and Technology of Computers Lab (ETSII-EPS), University of Seville, 41089 Sevilla, SpainRobotics and Technology of Computers Lab (ETSII-EPS), University of Seville, 41089 Sevilla, SpainNeuromorphic Vision and Natural Computation, Sorbonne Université, 75006 Paris, FranceRobotics and Technology of Computers Lab (ETSII-EPS), University of Seville, 41089 Sevilla, SpainNeuromorphic vision sensors detect changes in luminosity taking inspiration from mammalian retina and providing a stream of events with high temporal resolution, also known as Dynamic Vision Sensors (DVS). This continuous stream of events can be used to extract spatio-temporal patterns from a scene. A time-surface represents a spatio-temporal context for a given spatial radius around an incoming event from a sensor at a specific time history. Time-surfaces can be organized in a hierarchical way to extract features from input events using the Hierarchy Of Time-Surfaces algorithm, hereinafter HOTS. HOTS can be organized in consecutive layers to extract combination of features in a similar way as some deep-learning algorithms do. This work introduces a novel FPGA architecture for accelerating HOTS network. This architecture is mainly based on block-RAM memory and the non-restoring square root algorithm, requiring basic components and enabling it for low-power low-latency embedded applications. The presented architecture has been tested on a Zynq 7100 platform at 100 MHz. The results show that the latencies are in the range of 1 <inline-formula> <math display="inline"> <semantics> <mi mathvariant="sans-serif">μ</mi> </semantics> </math> </inline-formula>s to 6.7 <inline-formula> <math display="inline"> <semantics> <mi mathvariant="sans-serif">μ</mi> </semantics> </math> </inline-formula>s, requiring a maximum dynamic power consumption of 77 mW. This system was tested with a gesture recognition dataset, obtaining an accuracy loss for 16-bit precision of only 1.2% with respect to the original software HOTS.https://www.mdpi.com/1424-8220/20/12/3404dynamic vision sensorsevent-basedsynchronous digital VLSIHDLFPGApattern recognition
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Ricardo Tapiador-Morales Jean-Matthieu Maro Angel Jimenez-Fernandez Gabriel Jimenez-Moreno Ryad Benosman Alejandro Linares-Barranco
spellingShingle	Ricardo Tapiador-Morales Jean-Matthieu Maro Angel Jimenez-Fernandez Gabriel Jimenez-Moreno Ryad Benosman Alejandro Linares-Barranco Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA Sensors dynamic vision sensors event-based synchronous digital VLSI HDL FPGA pattern recognition
author_facet	Ricardo Tapiador-Morales Jean-Matthieu Maro Angel Jimenez-Fernandez Gabriel Jimenez-Moreno Ryad Benosman Alejandro Linares-Barranco
author_sort	Ricardo Tapiador-Morales
title	Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA
title_short	Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA
title_full	Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA
title_fullStr	Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA
title_full_unstemmed	Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA
title_sort	event-based gesture recognition through a hierarchy of time-surfaces for fpga
publisher	MDPI AG
series	Sensors
issn	1424-8220
publishDate	2020-06-01
description	Neuromorphic vision sensors detect changes in luminosity taking inspiration from mammalian retina and providing a stream of events with high temporal resolution, also known as Dynamic Vision Sensors (DVS). This continuous stream of events can be used to extract spatio-temporal patterns from a scene. A time-surface represents a spatio-temporal context for a given spatial radius around an incoming event from a sensor at a specific time history. Time-surfaces can be organized in a hierarchical way to extract features from input events using the Hierarchy Of Time-Surfaces algorithm, hereinafter HOTS. HOTS can be organized in consecutive layers to extract combination of features in a similar way as some deep-learning algorithms do. This work introduces a novel FPGA architecture for accelerating HOTS network. This architecture is mainly based on block-RAM memory and the non-restoring square root algorithm, requiring basic components and enabling it for low-power low-latency embedded applications. The presented architecture has been tested on a Zynq 7100 platform at 100 MHz. The results show that the latencies are in the range of 1 <inline-formula> <math display="inline"> <semantics> <mi mathvariant="sans-serif">μ</mi> </semantics> </math> </inline-formula>s to 6.7 <inline-formula> <math display="inline"> <semantics> <mi mathvariant="sans-serif">μ</mi> </semantics> </math> </inline-formula>s, requiring a maximum dynamic power consumption of 77 mW. This system was tested with a gesture recognition dataset, obtaining an accuracy loss for 16-bit precision of only 1.2% with respect to the original software HOTS.
topic	dynamic vision sensors event-based synchronous digital VLSI HDL FPGA pattern recognition
url	https://www.mdpi.com/1424-8220/20/12/3404
work_keys_str_mv	AT ricardotapiadormorales eventbasedgesturerecognitionthroughahierarchyoftimesurfacesforfpga AT jeanmatthieumaro eventbasedgesturerecognitionthroughahierarchyoftimesurfacesforfpga AT angeljimenezfernandez eventbasedgesturerecognitionthroughahierarchyoftimesurfacesforfpga AT gabrieljimenezmoreno eventbasedgesturerecognitionthroughahierarchyoftimesurfacesforfpga AT ryadbenosman eventbasedgesturerecognitionthroughahierarchyoftimesurfacesforfpga AT alejandrolinaresbarranco eventbasedgesturerecognitionthroughahierarchyoftimesurfacesforfpga
_version_	1724454374278692864

Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA

Similar Items