Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA
Neuromorphic vision sensors detect changes in luminosity taking inspiration from mammalian retina and providing a stream of events with high temporal resolution, also known as Dynamic Vision Sensors (DVS). This continuous stream of events can be used to extract spatio-temporal patterns from a scene....
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-06-01
|
Series: | Sensors |
Subjects: | |
Online Access: | https://www.mdpi.com/1424-8220/20/12/3404 |
id |
doaj-5d0bc19eb6904f4b8b668519e6271bbc |
---|---|
record_format |
Article |
spelling |
doaj-5d0bc19eb6904f4b8b668519e6271bbc2020-11-25T03:59:23ZengMDPI AGSensors1424-82202020-06-01203404340410.3390/s20123404Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGARicardo Tapiador-Morales0Jean-Matthieu Maro1Angel Jimenez-Fernandez2Gabriel Jimenez-Moreno3Ryad Benosman4Alejandro Linares-Barranco5Robotics and Technology of Computers Lab (ETSII-EPS), University of Seville, 41089 Sevilla, SpainNeuromorphic Vision and Natural Computation, Sorbonne Université, 75006 Paris, FranceRobotics and Technology of Computers Lab (ETSII-EPS), University of Seville, 41089 Sevilla, SpainRobotics and Technology of Computers Lab (ETSII-EPS), University of Seville, 41089 Sevilla, SpainNeuromorphic Vision and Natural Computation, Sorbonne Université, 75006 Paris, FranceRobotics and Technology of Computers Lab (ETSII-EPS), University of Seville, 41089 Sevilla, SpainNeuromorphic vision sensors detect changes in luminosity taking inspiration from mammalian retina and providing a stream of events with high temporal resolution, also known as Dynamic Vision Sensors (DVS). This continuous stream of events can be used to extract spatio-temporal patterns from a scene. A time-surface represents a spatio-temporal context for a given spatial radius around an incoming event from a sensor at a specific time history. Time-surfaces can be organized in a hierarchical way to extract features from input events using the Hierarchy Of Time-Surfaces algorithm, hereinafter HOTS. HOTS can be organized in consecutive layers to extract combination of features in a similar way as some deep-learning algorithms do. This work introduces a novel FPGA architecture for accelerating HOTS network. This architecture is mainly based on block-RAM memory and the non-restoring square root algorithm, requiring basic components and enabling it for low-power low-latency embedded applications. The presented architecture has been tested on a Zynq 7100 platform at 100 MHz. The results show that the latencies are in the range of 1 <inline-formula> <math display="inline"> <semantics> <mi mathvariant="sans-serif">μ</mi> </semantics> </math> </inline-formula>s to 6.7 <inline-formula> <math display="inline"> <semantics> <mi mathvariant="sans-serif">μ</mi> </semantics> </math> </inline-formula>s, requiring a maximum dynamic power consumption of 77 mW. This system was tested with a gesture recognition dataset, obtaining an accuracy loss for 16-bit precision of only 1.2% with respect to the original software HOTS.https://www.mdpi.com/1424-8220/20/12/3404dynamic vision sensorsevent-basedsynchronous digital VLSIHDLFPGApattern recognition |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ricardo Tapiador-Morales Jean-Matthieu Maro Angel Jimenez-Fernandez Gabriel Jimenez-Moreno Ryad Benosman Alejandro Linares-Barranco |
spellingShingle |
Ricardo Tapiador-Morales Jean-Matthieu Maro Angel Jimenez-Fernandez Gabriel Jimenez-Moreno Ryad Benosman Alejandro Linares-Barranco Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA Sensors dynamic vision sensors event-based synchronous digital VLSI HDL FPGA pattern recognition |
author_facet |
Ricardo Tapiador-Morales Jean-Matthieu Maro Angel Jimenez-Fernandez Gabriel Jimenez-Moreno Ryad Benosman Alejandro Linares-Barranco |
author_sort |
Ricardo Tapiador-Morales |
title |
Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA |
title_short |
Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA |
title_full |
Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA |
title_fullStr |
Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA |
title_full_unstemmed |
Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA |
title_sort |
event-based gesture recognition through a hierarchy of time-surfaces for fpga |
publisher |
MDPI AG |
series |
Sensors |
issn |
1424-8220 |
publishDate |
2020-06-01 |
description |
Neuromorphic vision sensors detect changes in luminosity taking inspiration from mammalian retina and providing a stream of events with high temporal resolution, also known as Dynamic Vision Sensors (DVS). This continuous stream of events can be used to extract spatio-temporal patterns from a scene. A time-surface represents a spatio-temporal context for a given spatial radius around an incoming event from a sensor at a specific time history. Time-surfaces can be organized in a hierarchical way to extract features from input events using the Hierarchy Of Time-Surfaces algorithm, hereinafter HOTS. HOTS can be organized in consecutive layers to extract combination of features in a similar way as some deep-learning algorithms do. This work introduces a novel FPGA architecture for accelerating HOTS network. This architecture is mainly based on block-RAM memory and the non-restoring square root algorithm, requiring basic components and enabling it for low-power low-latency embedded applications. The presented architecture has been tested on a Zynq 7100 platform at 100 MHz. The results show that the latencies are in the range of 1 <inline-formula> <math display="inline"> <semantics> <mi mathvariant="sans-serif">μ</mi> </semantics> </math> </inline-formula>s to 6.7 <inline-formula> <math display="inline"> <semantics> <mi mathvariant="sans-serif">μ</mi> </semantics> </math> </inline-formula>s, requiring a maximum dynamic power consumption of 77 mW. This system was tested with a gesture recognition dataset, obtaining an accuracy loss for 16-bit precision of only 1.2% with respect to the original software HOTS. |
topic |
dynamic vision sensors event-based synchronous digital VLSI HDL FPGA pattern recognition |
url |
https://www.mdpi.com/1424-8220/20/12/3404 |
work_keys_str_mv |
AT ricardotapiadormorales eventbasedgesturerecognitionthroughahierarchyoftimesurfacesforfpga AT jeanmatthieumaro eventbasedgesturerecognitionthroughahierarchyoftimesurfacesforfpga AT angeljimenezfernandez eventbasedgesturerecognitionthroughahierarchyoftimesurfacesforfpga AT gabrieljimenezmoreno eventbasedgesturerecognitionthroughahierarchyoftimesurfacesforfpga AT ryadbenosman eventbasedgesturerecognitionthroughahierarchyoftimesurfacesforfpga AT alejandrolinaresbarranco eventbasedgesturerecognitionthroughahierarchyoftimesurfacesforfpga |
_version_ |
1724454374278692864 |