Outsourced similarity search on metric data assets

Man Lung Yiu, Ira Assent, Christian Søndergaard Jensen, Panos Kalnis

Research output: Contribution to journalJournal articleResearchpeer-review

54 Citations (Scopus)

Abstract

This paper considers a cloud computing setting in which similarity querying of metric data is outsourced to a service provider. The data is to be revealed only to trusted users, not to the service provider or anyone else. Users query the server for the most similar data objects to a query example. Outsourcing offers the data owner scalability and a low initial investment. The need for privacy may be due to the data being sensitive (e.g., in medicine), valuable (e.g., in astronomy), or otherwise confidential. Given this setting, the paper presents techniques that transform the data prior to supplying it to the service provider for similarity queries on the transformed data. Our techniques provide interesting trade-offs between query cost and accuracy. They are then further extended to offer an intuitive privacy guarantee. Empirical studies with real data demonstrate that the techniques are capable of offering privacy while enabling efficient and accurate processing of similarity queries.
Original languageEnglish
JournalI E E E Transactions on Knowledge & Data Engineering
Volume24
Issue number2
Pages (from-to)338-352
Number of pages15
ISSN1041-4347
DOIs
Publication statusPublished - 2012

Cite this

Yiu, Man Lung ; Assent, Ira ; Jensen, Christian Søndergaard ; Kalnis, Panos. / Outsourced similarity search on metric data assets. In: I E E E Transactions on Knowledge & Data Engineering. 2012 ; Vol. 24, No. 2. pp. 338-352.
@article{6fb035a3b8034b00bb0cb0af380149a6,
title = "Outsourced similarity search on metric data assets",
abstract = "This paper considers a cloud computing setting in which similarity querying of metric data is outsourced to a service provider. The data is to be revealed only to trusted users, not to the service provider or anyone else. Users query the server for the most similar data objects to a query example. Outsourcing offers the data owner scalability and a low initial investment. The need for privacy may be due to the data being sensitive (e.g., in medicine), valuable (e.g., in astronomy), or otherwise confidential. Given this setting, the paper presents techniques that transform the data prior to supplying it to the service provider for similarity queries on the transformed data. Our techniques provide interesting trade-offs between query cost and accuracy. They are then further extended to offer an intuitive privacy guarantee. Empirical studies with real data demonstrate that the techniques are capable of offering privacy while enabling efficient and accurate processing of similarity queries.",
author = "Yiu, {Man Lung} and Ira Assent and Jensen, {Christian S{\o}ndergaard} and Panos Kalnis",
year = "2012",
doi = "10.1109/TKDE.2010.222",
language = "English",
volume = "24",
pages = "338--352",
journal = "I E E E Transactions on Knowledge & Data Engineering",
issn = "1041-4347",
publisher = "IEEE",
number = "2",

}

Outsourced similarity search on metric data assets. / Yiu, Man Lung; Assent, Ira; Jensen, Christian Søndergaard; Kalnis, Panos.

In: I E E E Transactions on Knowledge & Data Engineering, Vol. 24, No. 2, 2012, p. 338-352.

Research output: Contribution to journalJournal articleResearchpeer-review

TY - JOUR

T1 - Outsourced similarity search on metric data assets

AU - Yiu, Man Lung

AU - Assent, Ira

AU - Jensen, Christian Søndergaard

AU - Kalnis, Panos

PY - 2012

Y1 - 2012

N2 - This paper considers a cloud computing setting in which similarity querying of metric data is outsourced to a service provider. The data is to be revealed only to trusted users, not to the service provider or anyone else. Users query the server for the most similar data objects to a query example. Outsourcing offers the data owner scalability and a low initial investment. The need for privacy may be due to the data being sensitive (e.g., in medicine), valuable (e.g., in astronomy), or otherwise confidential. Given this setting, the paper presents techniques that transform the data prior to supplying it to the service provider for similarity queries on the transformed data. Our techniques provide interesting trade-offs between query cost and accuracy. They are then further extended to offer an intuitive privacy guarantee. Empirical studies with real data demonstrate that the techniques are capable of offering privacy while enabling efficient and accurate processing of similarity queries.

AB - This paper considers a cloud computing setting in which similarity querying of metric data is outsourced to a service provider. The data is to be revealed only to trusted users, not to the service provider or anyone else. Users query the server for the most similar data objects to a query example. Outsourcing offers the data owner scalability and a low initial investment. The need for privacy may be due to the data being sensitive (e.g., in medicine), valuable (e.g., in astronomy), or otherwise confidential. Given this setting, the paper presents techniques that transform the data prior to supplying it to the service provider for similarity queries on the transformed data. Our techniques provide interesting trade-offs between query cost and accuracy. They are then further extended to offer an intuitive privacy guarantee. Empirical studies with real data demonstrate that the techniques are capable of offering privacy while enabling efficient and accurate processing of similarity queries.

U2 - 10.1109/TKDE.2010.222

DO - 10.1109/TKDE.2010.222

M3 - Journal article

VL - 24

SP - 338

EP - 352

JO - I E E E Transactions on Knowledge & Data Engineering

JF - I E E E Transactions on Knowledge & Data Engineering

SN - 1041-4347

IS - 2

ER -