Demographic Recommendation by means of Group Profile Elicitation Using Speaker Age and Gender Recognition

Sven Ewan Shepstone; Zheng-Hua Tan; Søren Holdt Jensen

Department of Electronic Systems

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

4 Citations (Scopus)

Abstract

In this paper we show a new method of using automatic age and gender recognition to recommend a sequence of multimedia items to a home TV audience comprising multiple viewers. Instead of relying on explicitly provided demographic data for each user, we define an audio-based demographic group profile that captures the age and gender for all members of the audience. A 7-class age and gender classifier employing a fusion of acoustic and prosodic features determines the probability of each speaker belonging to each class. The information for all speakers is then combined to form the group profile, which itself is the input to a recommender system. The recommender system finds the content items whose demographics best match the group profile. We tested the effectiveness of the system for several typical home audience configurations. In a survey, users were given a configuration and asked to rate a set of advertisements on how well each advertisement matched the configuration. Unbeknown to the subjects, half of the adverts were recommended using the derived audio demographics and the other half were randomly chosen. The recommended adverts received a significantly higher median rating of 7.75, as opposed to 4.25 for the randomly selected adverts.
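
The pipeline in the abstract (per-speaker class probabilities, a combined group profile, then matching against item demographics) can be illustrated with a minimal sketch. The abstract does not say how speakers are combined or how the match is scored, so everything specific here is an assumption made for illustration: the seven class labels, averaging as the combination rule, cosine similarity as the matching score, and all names and numbers.

```python
from math import sqrt

# Hypothetical labels: the abstract only states that there are 7 classes.
CLASSES = ["child", "young_f", "young_m", "adult_f", "adult_m", "senior_f", "senior_m"]


def group_profile(speaker_posteriors):
    """Combine per-speaker class probabilities into one group profile.

    Assumption: a plain average over speakers.
    """
    n = len(speaker_posteriors)
    return [sum(p[i] for p in speaker_posteriors) / n for i in range(len(CLASSES))]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (assumed matching score)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def recommend(profile, items, k=2):
    """Return the names of the k items whose demographics best match the profile."""
    ranked = sorted(items, key=lambda name: cosine(profile, items[name]), reverse=True)
    return ranked[:k]


# Made-up classifier outputs for a two-person audience.
speakers = [
    [0.05, 0.05, 0.05, 0.70, 0.05, 0.05, 0.05],  # most likely adult_f
    [0.05, 0.05, 0.05, 0.05, 0.70, 0.05, 0.05],  # most likely adult_m
]

# Made-up target demographics for three adverts, over the same 7 classes.
items = {
    "toy_ad": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0],
    "car_ad": [0.0, 0.0, 0.0, 0.5, 0.5, 0.0, 0.0],
    "retirement_ad": [0.0, 0.0, 0.0, 0.0, 0.0, 0.5, 0.5],
}

profile = group_profile(speakers)
top = recommend(profile, items, k=1)
print(top)
```

With these made-up inputs the advert aimed at adult viewers ranks first, since the averaged profile puts most of its mass on the two adult classes.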

Original language	English
Title of host publication	14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013) : Speech in Life Sciences and Human Societies
Editors	F. Bimbot, C. Cerisara, G. Gravier, L. Lamel, F. Pellegrino, P. Perrier
Number of pages	5
Volume	1
Publisher	Curran Associates, Inc
Publication date	2013
Pages	2827-2831
ISBN (Print)	978-1-62993-443-3
Publication status	Published - 2013
Event	Interspeech 2013 - Lyon, France Duration: 25 Aug 2013 → 29 Aug 2013 http://www.interspeech2013.org/

Conference

Conference	Interspeech 2013
Country/Territory	France
City	Lyon
Period	25/08/2013 → 29/08/2013
Internet address	http://www.interspeech2013.org/

Series	Proceedings of the International Conference on Spoken Language Processing
ISSN	2308-457X

Cite this

Shepstone, S. E., Tan, Z-H., & Jensen, S. H. (2013). Demographic Recommendation by means of Group Profile Elicitation Using Speaker Age and Gender Recognition. In F. Bimbot, C. Cerisara, G. Gravier, L. Lamel, F. Pellegrino, & P. Perrier (Eds.), 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013): Speech in Life Sciences and Human Societies (Vol. 1, pp. 2827-2831). Curran Associates, Inc. http://www.interspeech2013.org/

Shepstone, Sven Ewan ; Tan, Zheng-Hua ; Jensen, Søren Holdt. / Demographic Recommendation by means of Group Profile Elicitation Using Speaker Age and Gender Recognition. 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013): Speech in Life Sciences and Human Societies. editor / F. Bimbot ; C. Cerisara ; G. Gravier ; L. Lamel ; F. Pellegrino ; P. Perrier. Vol. 1 Curran Associates, Inc, 2013. pp. 2827-2831 (Proceedings of the International Conference on Spoken Language Processing).

@inproceedings{021b4b28332c4180ae618278b6963365,

title = "Demographic Recommendation by means of Group Profile Elicitation Using Speaker Age and Gender Recognition",

abstract = "In this paper we show a new method of using automatic age and gender recognition to recommend a sequence of multimedia items to a home TV audience comprising multiple viewers. Instead of relying on explicitly provided demographic data for each user, we define an audio-based demographic group profile that captures the age and gender for all members of the audience. A 7-class age and gender classifier employing a fusion of acoustic and prosodic features determines the probability of each speaker belonging to each class. The information for all speakers is then combined to form the group profile, which itself is the input to a recommender system. The recommender system finds the content items whose demographics best match the group profile. We tested the effectiveness of the system for several typical home audience configurations. In a survey, users were given a configuration and asked to rate a set of advertisements on how well each advertisement matched the configuration. Unbeknown to the subjects, half of the adverts were recommended using the derived audio demographics and the other half were randomly chosen. The recommended adverts received a significantly higher median rating of 7.75, as opposed to 4.25 for the randomly selected adverts.",

author = "Shepstone, {Sven Ewan} and Tan, Zheng-Hua and Jensen, {S{\o}ren Holdt}",

year = "2013",

language = "English",

isbn = "978-1-62993-443-3",

volume = "1",

series = "Proceedings of the International Conference on Spoken Language Processing",

publisher = "Curran Associates, Inc",

pages = "2827--2831",

editor = "F. Bimbot and C. Cerisara and G. Gravier and L. Lamel and F. Pellegrino and P. Perrier",

booktitle = "14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013)",

note = "Interspeech 2013 ; Conference date: 25-08-2013 Through 29-08-2013",

url = "http://www.interspeech2013.org/",

}

Shepstone, SE, Tan, Z-H & Jensen, SH 2013, Demographic Recommendation by means of Group Profile Elicitation Using Speaker Age and Gender Recognition. in F Bimbot, C Cerisara, G Gravier, L Lamel, F Pellegrino & P Perrier (eds), 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013): Speech in Life Sciences and Human Societies. vol. 1, Curran Associates, Inc, Proceedings of the International Conference on Spoken Language Processing, pp. 2827-2831, Interspeech 2013, Lyon, France, 25/08/2013. <http://www.interspeech2013.org/>

Demographic Recommendation by means of Group Profile Elicitation Using Speaker Age and Gender Recognition. / Shepstone, Sven Ewan; Tan, Zheng-Hua; Jensen, Søren Holdt.
14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013): Speech in Life Sciences and Human Societies. ed. / F. Bimbot; C. Cerisara; G. Gravier; L. Lamel; F. Pellegrino; P. Perrier. Vol. 1 Curran Associates, Inc, 2013. p. 2827-2831 (Proceedings of the International Conference on Spoken Language Processing).

TY - GEN

T1 - Demographic Recommendation by means of Group Profile Elicitation Using Speaker Age and Gender Recognition

AU - Shepstone, Sven Ewan

AU - Tan, Zheng-Hua

AU - Jensen, Søren Holdt

PY - 2013

Y1 - 2013

N2 - In this paper we show a new method of using automatic age and gender recognition to recommend a sequence of multimedia items to a home TV audience comprising multiple viewers. Instead of relying on explicitly provided demographic data for each user, we define an audio-based demographic group profile that captures the age and gender for all members of the audience. A 7-class age and gender classifier employing a fusion of acoustic and prosodic features determines the probability of each speaker belonging to each class. The information for all speakers is then combined to form the group profile, which itself is the input to a recommender system. The recommender system finds the content items whose demographics best match the group profile. We tested the effectiveness of the system for several typical home audience configurations. In a survey, users were given a configuration and asked to rate a set of advertisements on how well each advertisement matched the configuration. Unbeknown to the subjects, half of the adverts were recommended using the derived audio demographics and the other half were randomly chosen. The recommended adverts received a significantly higher median rating of 7.75, as opposed to 4.25 for the randomly selected adverts.

AB - In this paper we show a new method of using automatic age and gender recognition to recommend a sequence of multimedia items to a home TV audience comprising multiple viewers. Instead of relying on explicitly provided demographic data for each user, we define an audio-based demographic group profile that captures the age and gender for all members of the audience. A 7-class age and gender classifier employing a fusion of acoustic and prosodic features determines the probability of each speaker belonging to each class. The information for all speakers is then combined to form the group profile, which itself is the input to a recommender system. The recommender system finds the content items whose demographics best match the group profile. We tested the effectiveness of the system for several typical home audience configurations. In a survey, users were given a configuration and asked to rate a set of advertisements on how well each advertisement matched the configuration. Unbeknown to the subjects, half of the adverts were recommended using the derived audio demographics and the other half were randomly chosen. The recommended adverts received a significantly higher median rating of 7.75, as opposed to 4.25 for the randomly selected adverts.

M3 - Article in proceeding

SN - 978-1-62993-443-3

VL - 1

T3 - Proceedings of the International Conference on Spoken Language Processing

SP - 2827

EP - 2831

BT - 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013)

A2 - Bimbot, F.

A2 - Cerisara, C.

A2 - Gravier, G.

A2 - Lamel, L.

A2 - Pellegrino, F.

A2 - Perrier, P.

PB - Curran Associates, Inc

T2 - Interspeech 2013

Y2 - 25 August 2013 through 29 August 2013

ER -

Shepstone SE, Tan Z-H, Jensen SH. Demographic Recommendation by means of Group Profile Elicitation Using Speaker Age and Gender Recognition. In Bimbot F, Cerisara C, Gravier G, Lamel L, Pellegrino F, Perrier P, editors, 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013): Speech in Life Sciences and Human Societies. Vol. 1. Curran Associates, Inc. 2013. p. 2827-2831. (Proceedings of the International Conference on Spoken Language Processing).
