TY - JOUR
T1 - Using Closed-Set Speaker Identification Score Confidence to Enhance Audio-Based Collaborative Filtering for Multiple Users
AU - Shepstone, Sven Ewan
AU - Tan, Zheng-Hua
AU - Kristoffersen, Miklas Strøm
PY - 2018
Y1 - 2018
N2 - In this paper, we utilize a closed-set speaker-identification approach to convey the ratings needed for collaborative filtering-based recommendation. Instead of explicitly providing a rating for a given program, users use a speech interface to dictate the desired rating after watching a movie. Due to the inaccuracies that may be imposed by a state-of-the-art speaker identification system, it is possible to mistake a user for another user in the household, especially when the users exhibit similar or identical age and gender demographics. This leads to the undesirable effect of injecting unwanted ratings into the collaborative rating matrix, and when the users have different tastes, can result in the recommendation of undesirable items. We therefore propose a simple confidence-based heuristic that utilizes the log-likelihood scores from the speaker identification front-end. The algorithm limits the degree to which unwanted ratings negatively affect the integrity of the ratings information. Using real-speaker utterances over a range of age and gender demographics, we compare our approach against upper and lower-bound (nonspeaker-identification-based) baseline systems. Results show that by taking the confidence into account of users that we were able to improve upon the lower-bound that unconditionally accepts ratings by a relative 6.9%.
AB - In this paper, we utilize a closed-set speaker-identification approach to convey the ratings needed for collaborative filtering-based recommendation. Instead of explicitly providing a rating for a given program, users use a speech interface to dictate the desired rating after watching a movie. Due to the inaccuracies that may be imposed by a state-of-the-art speaker identification system, it is possible to mistake a user for another user in the household, especially when the users exhibit similar or identical age and gender demographics. This leads to the undesirable effect of injecting unwanted ratings into the collaborative rating matrix, and when the users have different tastes, can result in the recommendation of undesirable items. We therefore propose a simple confidence-based heuristic that utilizes the log-likelihood scores from the speaker identification front-end. The algorithm limits the degree to which unwanted ratings negatively affect the integrity of the ratings information. Using real-speaker utterances over a range of age and gender demographics, we compare our approach against upper and lower-bound (nonspeaker-identification-based) baseline systems. Results show that by taking the confidence into account of users that we were able to improve upon the lower-bound that unconditionally accepts ratings by a relative 6.9%.
KW - Collaborative filtering (CF)
KW - confidence
UR - http://www.scopus.com/inward/record.url?scp=85043367306&partnerID=8YFLogxK
U2 - 10.1109/TCE.2018.2811250
DO - 10.1109/TCE.2018.2811250
M3 - Journal article
SN - 0098-3063
VL - 64
SP - 11
EP - 18
JO - IEEE Transactions on Consumer Electronics
JF - IEEE Transactions on Consumer Electronics
IS - 1
ER -