Outsourced similarity search on metric data assets

Man Lung Yiu, Ira Assent, Christian Søndergaard Jensen, Panos Kalnis

Research output: Contribution to journalArticlepeer-review

65 Scopus citations

Abstract

This paper considers a cloud computing setting in which similarity querying of metric data is outsourced to a service provider. The data is to be revealed only to trusted users, not to the service provider or anyone else. Users query the server for the most similar data objects to a query example. Outsourcing offers the data owner scalability and a low-initial investment. The need for privacy may be due to the data being sensitive (e.g., in medicine), valuable (e.g., in astronomy), or otherwise confidential. Given this setting, the paper presents techniques that transform the data prior to supplying it to the service provider for similarity queries on the transformed data. Our techniques provide interesting trade-offs between query cost and accuracy. They are then further extended to offer an intuitive privacy guarantee. Empirical studies with real data demonstrate that the techniques are capable of offering privacy while enabling efficient and accurate processing of similarity queries.
Original languageEnglish (US)
Pages (from-to)338-352
Number of pages15
JournalIEEE Transactions on Knowledge and Data Engineering
Volume24
Issue number2
DOIs
StatePublished - Feb 2012

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Information Systems
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Outsourced similarity search on metric data assets'. Together they form a unique fingerprint.

Cite this