Recognising Actions for Instructional Training using Pose Information: A Comparative Evaluation.
Citation:
Bruton, S., Lacey, G., "Recognising Actions for Instructional Training using Pose Information: A Comparative Evaluation", 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, 25-27 Feb 2019, Prague, Czech Republic
Abstract:
Humans perform many complex tasks involving the manipulation of multiple objects. Recognition of the constituent actions of these tasks can be used to drive instructional training systems. The identities and poses of the objects used during such tasks are salient for the purposes of recognition. In this work, 3D object detection and registration techniques are used to identify and track objects involved in the everyday task of preparing a cup of tea. The pose information serves as input to an action classification system that uses Long Short-Term Memory (LSTM) recurrent neural networks as part of a deep architecture. An advantage of this approach is that it can represent the complex dynamics of object and human poses at hierarchical levels without the need to design specific spatio-temporal features. By using such compact features, we demonstrate the feasibility of applying the hyperparameter optimisation technique of Tree-Parzen Estimators to identify optimal hyperparameters as well as network architectures. A recognition accuracy of 83% shows that this approach is viable for similar pervasive computing applications where prior scene knowledge exists.
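To illustrate the kind of model the abstract describes, here is a minimal numpy sketch of a single LSTM cell consuming a sequence of per-frame pose vectors and summarising it in a hidden state; all dimensions, weights, and names are illustrative assumptions, not the paper's actual architecture (a real system would stack cells and add a softmax classifier over the action labels).

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h, c, W, U, b):
    """One LSTM step: x is the pose feature vector for one frame;
    h and c are the hidden and cell states carried across frames."""
    z = W @ x + U @ h + b            # stacked gate pre-activations
    H = h.shape[0]
    i = sigmoid(z[0:H])              # input gate
    f = sigmoid(z[H:2 * H])          # forget gate
    o = sigmoid(z[2 * H:3 * H])      # output gate
    g = np.tanh(z[3 * H:4 * H])      # candidate cell update
    c_new = f * c + i * g
    h_new = o * np.tanh(c_new)
    return h_new, c_new

# Illustrative dimensions: a 6-D object pose (3-D position + orientation)
# per frame, hidden size 8, a 20-frame sequence of random "poses".
rng = np.random.default_rng(0)
D, H = 6, 8
W = rng.standard_normal((4 * H, D)) * 0.1
U = rng.standard_normal((4 * H, H)) * 0.1
b = np.zeros(4 * H)

h = np.zeros(H)
c = np.zeros(H)
for frame in rng.standard_normal((20, D)):
    h, c = lstm_step(frame, h, c, W, U, b)
# The final hidden state h is a fixed-size summary of the pose sequence,
# which a classification layer would map to action labels.
```

In a hyperparameter search with Tree-Parzen Estimators, quantities such as the hidden size, the number of stacked cells, and the learning rate would be the variables being optimised.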
Sponsor / Grant Number:
IRCSET (OK)
Author's Homepage:
http://people.tcd.ie/gjlacey
Author: Lacey, Gerard
Other Titles:
14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications
Type of material:
Conference Paper
Availability:
Full text available
Keywords:
Action Recognition, Deep Learning, Pose Estimation