Using Closed-Set Speaker Identification Score Confidence to Enhance Audio-Based Collaborative Filtering for Multiple Users

Sven Ewan Shepstone; Zheng-Hua Tan; Miklas Strøm Kristoffersen

doi:10.1109/TCE.2018.2811250

Using Closed-Set Speaker Identification Score Confidence to Enhance Audio-Based Collaborative Filtering for Multiple Users

Sven Ewan Shepstone, Zheng-Hua Tan, Miklas Strøm Kristoffersen

Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › peer review

3 Citationer (Scopus)

364 Downloads (Pure)

Abstract

In this paper, we utilize a closed-set speaker-identification approach to convey the ratings needed for collaborative filtering-based recommendation. Instead of explicitly providing a rating for a given program, users use a speech interface to dictate the desired rating after watching a movie. Due to the inaccuracies that may be imposed by a state-of-the-art speaker identification system, it is possible to mistake a user for another user in the household, especially when the users exhibit similar or identical age and gender demographics. This leads to the undesirable effect of injecting unwanted ratings into the collaborative rating matrix, and when the users have different tastes, can result in the recommendation of undesirable items. We therefore propose a simple confidence-based heuristic that utilizes the log-likelihood scores from the speaker identification front-end. The algorithm limits the degree to which unwanted ratings negatively affect the integrity of the ratings information. Using real-speaker utterances over a range of age and gender demographics, we compare our approach against upper and lower-bound (nonspeaker-identification-based) baseline systems. Results show that by taking the confidence into account of users that we were able to improve upon the lower-bound that unconditionally accepts ratings by a relative 6.9%.

Originalsprog	Engelsk
Tidsskrift	IEEE Transactions on Consumer Electronics
Vol/bind	64
Udgave nummer	1
Sider (fra-til)	11-18
Antal sider	8
ISSN	0098-3063
DOI	https://doi.org/10.1109/TCE.2018.2811250
Status	Udgivet - 2018

Adgang til dokumentet

10.1109/TCE.2018.2811250

Shepstone_TCE_2018Accepteret manuskript, 586 KB

AUB Link

Søg efter materialet i Aalborg Universitetsbiblioteks søgemaskine

Andre filer og links

http://www.scopus.com/inward/record.url?scp=85043367306&partnerID=8YFLogxK

Citationsformater

@article{793bbba249de463ca66751a5a150a493,

title = "Using Closed-Set Speaker Identification Score Confidence to Enhance Audio-Based Collaborative Filtering for Multiple Users",

abstract = "In this paper, we utilize a closed-set speaker-identification approach to convey the ratings needed for collaborative filtering-based recommendation. Instead of explicitly providing a rating for a given program, users use a speech interface to dictate the desired rating after watching a movie. Due to the inaccuracies that may be imposed by a state-of-the-art speaker identification system, it is possible to mistake a user for another user in the household, especially when the users exhibit similar or identical age and gender demographics. This leads to the undesirable effect of injecting unwanted ratings into the collaborative rating matrix, and when the users have different tastes, can result in the recommendation of undesirable items. We therefore propose a simple confidence-based heuristic that utilizes the log-likelihood scores from the speaker identification front-end. The algorithm limits the degree to which unwanted ratings negatively affect the integrity of the ratings information. Using real-speaker utterances over a range of age and gender demographics, we compare our approach against upper and lower-bound (nonspeaker-identification-based) baseline systems. Results show that by taking the confidence into account of users that we were able to improve upon the lower-bound that unconditionally accepts ratings by a relative 6.9%.",

keywords = "Collaborative filtering (CF), confidence",

author = "Shepstone, {Sven Ewan} and Zheng-Hua Tan and Kristoffersen, {Miklas Str{\o}m}",

year = "2018",

doi = "10.1109/TCE.2018.2811250",

language = "English",

volume = "64",

pages = "11--18",

journal = "IEEE Transactions on Consumer Electronics",

issn = "0098-3063",

publisher = "IEEE",

number = "1",

}

TY - JOUR

T1 - Using Closed-Set Speaker Identification Score Confidence to Enhance Audio-Based Collaborative Filtering for Multiple Users

AU - Shepstone, Sven Ewan

AU - Tan, Zheng-Hua

AU - Kristoffersen, Miklas Strøm

PY - 2018

Y1 - 2018

N2 - In this paper, we utilize a closed-set speaker-identification approach to convey the ratings needed for collaborative filtering-based recommendation. Instead of explicitly providing a rating for a given program, users use a speech interface to dictate the desired rating after watching a movie. Due to the inaccuracies that may be imposed by a state-of-the-art speaker identification system, it is possible to mistake a user for another user in the household, especially when the users exhibit similar or identical age and gender demographics. This leads to the undesirable effect of injecting unwanted ratings into the collaborative rating matrix, and when the users have different tastes, can result in the recommendation of undesirable items. We therefore propose a simple confidence-based heuristic that utilizes the log-likelihood scores from the speaker identification front-end. The algorithm limits the degree to which unwanted ratings negatively affect the integrity of the ratings information. Using real-speaker utterances over a range of age and gender demographics, we compare our approach against upper and lower-bound (nonspeaker-identification-based) baseline systems. Results show that by taking the confidence into account of users that we were able to improve upon the lower-bound that unconditionally accepts ratings by a relative 6.9%.

AB - In this paper, we utilize a closed-set speaker-identification approach to convey the ratings needed for collaborative filtering-based recommendation. Instead of explicitly providing a rating for a given program, users use a speech interface to dictate the desired rating after watching a movie. Due to the inaccuracies that may be imposed by a state-of-the-art speaker identification system, it is possible to mistake a user for another user in the household, especially when the users exhibit similar or identical age and gender demographics. This leads to the undesirable effect of injecting unwanted ratings into the collaborative rating matrix, and when the users have different tastes, can result in the recommendation of undesirable items. We therefore propose a simple confidence-based heuristic that utilizes the log-likelihood scores from the speaker identification front-end. The algorithm limits the degree to which unwanted ratings negatively affect the integrity of the ratings information. Using real-speaker utterances over a range of age and gender demographics, we compare our approach against upper and lower-bound (nonspeaker-identification-based) baseline systems. Results show that by taking the confidence into account of users that we were able to improve upon the lower-bound that unconditionally accepts ratings by a relative 6.9%.

KW - Collaborative filtering (CF)

KW - confidence

UR - http://www.scopus.com/inward/record.url?scp=85043367306&partnerID=8YFLogxK

U2 - 10.1109/TCE.2018.2811250

DO - 10.1109/TCE.2018.2811250

M3 - Journal article

SN - 0098-3063

VL - 64

SP - 11

EP - 18

JO - IEEE Transactions on Consumer Electronics

JF - IEEE Transactions on Consumer Electronics

IS - 1

ER -

Using Closed-Set Speaker Identification Score Confidence to Enhance Audio-Based Collaborative Filtering for Multiple Users

Abstract

Adgang til dokumentet

AUB Link

Andre filer og links

Fingeraftryk

Citationsformater