Fusion of Range and Intensity Information for View Invariant Gesture Recognition

Michael Boelstoft Holte, Thomas B. Moeslund, Preben Fihl

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

31 Citations (Scopus)

Abstract

This paper presents a system for view invariant gesture recognition. The approach is based on 3D data from a CSEM SwissRanger SR-2 camera. This camera produces both a depth map and an intensity image of a scene. Since the two information types are aligned, we can use the intensity image to define a region of interest for the relevant 3D data. This data fusion improves the quality of the range data and hence results in better recognition. The gesture recognition is based on finding motion primitives in the 3D data. The primitives are represented compactly and view invariantly using harmonic shape context. A probabilistic Edit Distance classifier is applied to identify which gesture best describes a string of primitives. The approach is trained on data from one viewpoint and tested on data from a different viewpoint. The recognition rate is 92.9%, which is similar to the recognition rate when training and testing on gestures from the same viewpoint; hence the approach is indeed view invariant.
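The classification step above matches an observed string of motion primitives against stored gesture templates by edit distance. The paper uses a probabilistic Edit Distance classifier; the sketch below illustrates the underlying idea with the plain (non-probabilistic) Levenshtein distance, where letters stand for hypothetical motion primitives and the gesture vocabulary is invented for illustration.

```python
def edit_distance(a: str, b: str) -> int:
    """Classic Levenshtein distance between two primitive strings."""
    m, n = len(a), len(b)
    prev = list(range(n + 1))  # distances between a[:0] and prefixes of b
    for i in range(1, m + 1):
        curr = [i] + [0] * n
        for j in range(1, n + 1):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            curr[j] = min(prev[j] + 1,         # deletion
                          curr[j - 1] + 1,     # insertion
                          prev[j - 1] + cost)  # substitution
        prev = curr
    return prev[n]

def classify(observed: str, gesture_models: dict) -> str:
    """Return the gesture whose primitive string is closest to the observation."""
    return min(gesture_models, key=lambda g: edit_distance(observed, gesture_models[g]))

# Hypothetical gesture vocabulary: each letter denotes one motion primitive.
models = {"wave": "ABABAB", "point": "CCD", "raise": "EFG"}
print(classify("ABBBAB", models))  # one substitution away from "wave"
```

A probabilistic variant, as used in the paper, would weight the insertion, deletion, and substitution costs by how likely each primitive confusion is, rather than charging a flat cost of 1.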
Original language: English
Title of host publication: Computer Vision and Pattern Recognition Workshops, 2008
Publisher: Electrical Engineering/Electronics, Computer, Communications and Information Technology Association
Publication date: 2008
Pages: 1-7
ISBN (Print): 9781424423392
Publication status: Published - 2008
Event: Computer Vision and Pattern Recognition Workshops, 2008. CVPR Workshops 2008. IEEE Computer Society Conference on - Anchorage, Alaska, United States
Duration: 23 Jun 2008 - 28 Jun 2008

Conference

Conference: Computer Vision and Pattern Recognition Workshops, 2008. CVPR Workshops 2008. IEEE Computer Society Conference on
Country/Territory: United States
City: Anchorage, Alaska
Period: 23/06/2008 - 28/06/2008
