Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA

Neuromorphic vision sensors detect changes in luminosity taking inspiration from mammalian retina and providing a stream of events with high temporal resolution, also known as Dynamic Vision Sensors (DVS). This continuous stream of events can be used to extract spatio-temporal patterns from a scene....

Full description

Bibliographic Details
Main Authors: Ricardo Tapiador-Morales, Jean-Matthieu Maro, Angel Jimenez-Fernandez, Gabriel Jimenez-Moreno, Ryad Benosman, Alejandro Linares-Barranco
Format: Article
Language:English
Published: MDPI AG 2020-06-01
Series:Sensors
Subjects:
HDL
Online Access:https://www.mdpi.com/1424-8220/20/12/3404
id doaj-5d0bc19eb6904f4b8b668519e6271bbc
record_format Article
spelling doaj-5d0bc19eb6904f4b8b668519e6271bbc2020-11-25T03:59:23ZengMDPI AGSensors1424-82202020-06-01203404340410.3390/s20123404Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGARicardo Tapiador-Morales0Jean-Matthieu Maro1Angel Jimenez-Fernandez2Gabriel Jimenez-Moreno3Ryad Benosman4Alejandro Linares-Barranco5Robotics and Technology of Computers Lab (ETSII-EPS), University of Seville, 41089 Sevilla, SpainNeuromorphic Vision and Natural Computation, Sorbonne Université, 75006 Paris, FranceRobotics and Technology of Computers Lab (ETSII-EPS), University of Seville, 41089 Sevilla, SpainRobotics and Technology of Computers Lab (ETSII-EPS), University of Seville, 41089 Sevilla, SpainNeuromorphic Vision and Natural Computation, Sorbonne Université, 75006 Paris, FranceRobotics and Technology of Computers Lab (ETSII-EPS), University of Seville, 41089 Sevilla, SpainNeuromorphic vision sensors detect changes in luminosity taking inspiration from mammalian retina and providing a stream of events with high temporal resolution, also known as Dynamic Vision Sensors (DVS). This continuous stream of events can be used to extract spatio-temporal patterns from a scene. A time-surface represents a spatio-temporal context for a given spatial radius around an incoming event from a sensor at a specific time history. Time-surfaces can be organized in a hierarchical way to extract features from input events using the Hierarchy Of Time-Surfaces algorithm, hereinafter HOTS. HOTS can be organized in consecutive layers to extract combination of features in a similar way as some deep-learning algorithms do. This work introduces a novel FPGA architecture for accelerating HOTS network. This architecture is mainly based on block-RAM memory and the non-restoring square root algorithm, requiring basic components and enabling it for low-power low-latency embedded applications. The presented architecture has been tested on a Zynq 7100 platform at 100 MHz. The results show that the latencies are in the range of 1 <inline-formula> <math display="inline"> <semantics> <mi mathvariant="sans-serif">μ</mi> </semantics> </math> </inline-formula>s to 6.7 <inline-formula> <math display="inline"> <semantics> <mi mathvariant="sans-serif">μ</mi> </semantics> </math> </inline-formula>s, requiring a maximum dynamic power consumption of 77 mW. This system was tested with a gesture recognition dataset, obtaining an accuracy loss for 16-bit precision of only 1.2% with respect to the original software HOTS.https://www.mdpi.com/1424-8220/20/12/3404dynamic vision sensorsevent-basedsynchronous digital VLSIHDLFPGApattern recognition
collection DOAJ
language English
format Article
sources DOAJ
author Ricardo Tapiador-Morales
Jean-Matthieu Maro
Angel Jimenez-Fernandez
Gabriel Jimenez-Moreno
Ryad Benosman
Alejandro Linares-Barranco
spellingShingle Ricardo Tapiador-Morales
Jean-Matthieu Maro
Angel Jimenez-Fernandez
Gabriel Jimenez-Moreno
Ryad Benosman
Alejandro Linares-Barranco
Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA
Sensors
dynamic vision sensors
event-based
synchronous digital VLSI
HDL
FPGA
pattern recognition
author_facet Ricardo Tapiador-Morales
Jean-Matthieu Maro
Angel Jimenez-Fernandez
Gabriel Jimenez-Moreno
Ryad Benosman
Alejandro Linares-Barranco
author_sort Ricardo Tapiador-Morales
title Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA
title_short Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA
title_full Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA
title_fullStr Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA
title_full_unstemmed Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA
title_sort event-based gesture recognition through a hierarchy of time-surfaces for fpga
publisher MDPI AG
series Sensors
issn 1424-8220
publishDate 2020-06-01
description Neuromorphic vision sensors detect changes in luminosity taking inspiration from mammalian retina and providing a stream of events with high temporal resolution, also known as Dynamic Vision Sensors (DVS). This continuous stream of events can be used to extract spatio-temporal patterns from a scene. A time-surface represents a spatio-temporal context for a given spatial radius around an incoming event from a sensor at a specific time history. Time-surfaces can be organized in a hierarchical way to extract features from input events using the Hierarchy Of Time-Surfaces algorithm, hereinafter HOTS. HOTS can be organized in consecutive layers to extract combination of features in a similar way as some deep-learning algorithms do. This work introduces a novel FPGA architecture for accelerating HOTS network. This architecture is mainly based on block-RAM memory and the non-restoring square root algorithm, requiring basic components and enabling it for low-power low-latency embedded applications. The presented architecture has been tested on a Zynq 7100 platform at 100 MHz. The results show that the latencies are in the range of 1 <inline-formula> <math display="inline"> <semantics> <mi mathvariant="sans-serif">μ</mi> </semantics> </math> </inline-formula>s to 6.7 <inline-formula> <math display="inline"> <semantics> <mi mathvariant="sans-serif">μ</mi> </semantics> </math> </inline-formula>s, requiring a maximum dynamic power consumption of 77 mW. This system was tested with a gesture recognition dataset, obtaining an accuracy loss for 16-bit precision of only 1.2% with respect to the original software HOTS.
topic dynamic vision sensors
event-based
synchronous digital VLSI
HDL
FPGA
pattern recognition
url https://www.mdpi.com/1424-8220/20/12/3404
work_keys_str_mv AT ricardotapiadormorales eventbasedgesturerecognitionthroughahierarchyoftimesurfacesforfpga
AT jeanmatthieumaro eventbasedgesturerecognitionthroughahierarchyoftimesurfacesforfpga
AT angeljimenezfernandez eventbasedgesturerecognitionthroughahierarchyoftimesurfacesforfpga
AT gabrieljimenezmoreno eventbasedgesturerecognitionthroughahierarchyoftimesurfacesforfpga
AT ryadbenosman eventbasedgesturerecognitionthroughahierarchyoftimesurfacesforfpga
AT alejandrolinaresbarranco eventbasedgesturerecognitionthroughahierarchyoftimesurfacesforfpga
_version_ 1724454374278692864