Audio-based Age and Gender Identification to Enhance the Recommendation of TV Content

Sven Ewan Shepstone, Zheng-Hua Tan, Søren Holdt Jensen

Research output: Contribution to journalJournal articleResearchpeer-review

9 Citations (Scopus)

Abstract

Recommending TV content to groups of viewers is best carried out when relevant information such as the demographics of the group is available. However, it can be difficult and time consuming to extract information for every user in the group. This paper shows how an audio analysis of the age and gender of a group of users watching the TV can be used for recommending a sequence of N short TV content items for the group. First, a state of the art audio-based classifier determines the age and gender of each user in an M-user group and creates a group profile. A genetic recommender algorithm then selects for each user in the profile, a single personalized multimedia item for viewing. When the number of items to be presented is different to the number of viewers in the group, i.e. M ≠ N, a novel adaptation algorithm is proposed that first converts the M-user group profile to an N-slot content profile, thus ensuring that items are proportionally allocated to users with respect to their demographic categorization. The proposed system is compared to an ideal system where the group demographics are provided explicitly. Results using real speaker utterances show that, in spite of the inaccuracies of state-of-the-art age-and-gender detection systems, the proposed system has a significant ability to predict an item with a matching age and gender category. User studies were conducted where subjects were asked to rate a sequence of advertisements, where half of the advertisements were randomly selected, and the other half were selected using the audio-derived demographics. The recommended advertisements received a significant higher median rating of 7.75, as opposed to 4.25 for the randomly selected advertisements 1.
Original languageEnglish
JournalI E E E Transactions on Consumer Electronics
Volume59
Issue number3
Pages (from-to)721 - 729
ISSN0098-3063
DOIs
Publication statusPublished - Aug 2013

Fingerprint

Classifiers

Keywords

    Cite this

    @article{b3d915125a9a4a379c3291c2eee0fc62,
    title = "Audio-based Age and Gender Identification to Enhance the Recommendation of TV Content",
    abstract = "Recommending TV content to groups of viewers is best carried out when relevant information such as the demographics of the group is available. However, it can be difficult and time consuming to extract information for every user in the group. This paper shows how an audio analysis of the age and gender of a group of users watching the TV can be used for recommending a sequence of N short TV content items for the group. First, a state of the art audio-based classifier determines the age and gender of each user in an M-user group and creates a group profile. A genetic recommender algorithm then selects for each user in the profile, a single personalized multimedia item for viewing. When the number of items to be presented is different to the number of viewers in the group, i.e. M ≠ N, a novel adaptation algorithm is proposed that first converts the M-user group profile to an N-slot content profile, thus ensuring that items are proportionally allocated to users with respect to their demographic categorization. The proposed system is compared to an ideal system where the group demographics are provided explicitly. Results using real speaker utterances show that, in spite of the inaccuracies of state-of-the-art age-and-gender detection systems, the proposed system has a significant ability to predict an item with a matching age and gender category. User studies were conducted where subjects were asked to rate a sequence of advertisements, where half of the advertisements were randomly selected, and the other half were selected using the audio-derived demographics. The recommended advertisements received a significant higher median rating of 7.75, as opposed to 4.25 for the randomly selected advertisements 1.",
    keywords = "demographic filtering, Genetic algorithms, age identification, gender identification, proportional recommendation, advertisement",
    author = "Shepstone, {Sven Ewan} and Zheng-Hua Tan and Jensen, {S{\o}ren Holdt}",
    year = "2013",
    month = "8",
    doi = "10.1109/TCE.2013.6626261",
    language = "English",
    volume = "59",
    pages = "721 -- 729",
    journal = "I E E E Transactions on Consumer Electronics",
    issn = "0098-3063",
    publisher = "IEEE",
    number = "3",

    }

    Audio-based Age and Gender Identification to Enhance the Recommendation of TV Content. / Shepstone, Sven Ewan; Tan, Zheng-Hua; Jensen, Søren Holdt.

    In: I E E E Transactions on Consumer Electronics, Vol. 59, No. 3, 08.2013, p. 721 - 729 .

    Research output: Contribution to journalJournal articleResearchpeer-review

    TY - JOUR

    T1 - Audio-based Age and Gender Identification to Enhance the Recommendation of TV Content

    AU - Shepstone, Sven Ewan

    AU - Tan, Zheng-Hua

    AU - Jensen, Søren Holdt

    PY - 2013/8

    Y1 - 2013/8

    N2 - Recommending TV content to groups of viewers is best carried out when relevant information such as the demographics of the group is available. However, it can be difficult and time consuming to extract information for every user in the group. This paper shows how an audio analysis of the age and gender of a group of users watching the TV can be used for recommending a sequence of N short TV content items for the group. First, a state of the art audio-based classifier determines the age and gender of each user in an M-user group and creates a group profile. A genetic recommender algorithm then selects for each user in the profile, a single personalized multimedia item for viewing. When the number of items to be presented is different to the number of viewers in the group, i.e. M ≠ N, a novel adaptation algorithm is proposed that first converts the M-user group profile to an N-slot content profile, thus ensuring that items are proportionally allocated to users with respect to their demographic categorization. The proposed system is compared to an ideal system where the group demographics are provided explicitly. Results using real speaker utterances show that, in spite of the inaccuracies of state-of-the-art age-and-gender detection systems, the proposed system has a significant ability to predict an item with a matching age and gender category. User studies were conducted where subjects were asked to rate a sequence of advertisements, where half of the advertisements were randomly selected, and the other half were selected using the audio-derived demographics. The recommended advertisements received a significant higher median rating of 7.75, as opposed to 4.25 for the randomly selected advertisements 1.

    AB - Recommending TV content to groups of viewers is best carried out when relevant information such as the demographics of the group is available. However, it can be difficult and time consuming to extract information for every user in the group. This paper shows how an audio analysis of the age and gender of a group of users watching the TV can be used for recommending a sequence of N short TV content items for the group. First, a state of the art audio-based classifier determines the age and gender of each user in an M-user group and creates a group profile. A genetic recommender algorithm then selects for each user in the profile, a single personalized multimedia item for viewing. When the number of items to be presented is different to the number of viewers in the group, i.e. M ≠ N, a novel adaptation algorithm is proposed that first converts the M-user group profile to an N-slot content profile, thus ensuring that items are proportionally allocated to users with respect to their demographic categorization. The proposed system is compared to an ideal system where the group demographics are provided explicitly. Results using real speaker utterances show that, in spite of the inaccuracies of state-of-the-art age-and-gender detection systems, the proposed system has a significant ability to predict an item with a matching age and gender category. User studies were conducted where subjects were asked to rate a sequence of advertisements, where half of the advertisements were randomly selected, and the other half were selected using the audio-derived demographics. The recommended advertisements received a significant higher median rating of 7.75, as opposed to 4.25 for the randomly selected advertisements 1.

    KW - demographic filtering

    KW - Genetic algorithms

    KW - age identification

    KW - gender identification

    KW - proportional recommendation

    KW - advertisement

    U2 - 10.1109/TCE.2013.6626261

    DO - 10.1109/TCE.2013.6626261

    M3 - Journal article

    VL - 59

    SP - 721

    EP - 729

    JO - I E E E Transactions on Consumer Electronics

    JF - I E E E Transactions on Consumer Electronics

    SN - 0098-3063

    IS - 3

    ER -