Abstract
A variety of peer-to-peer (P2P) systems for sharing digital information are currently available and most of them perform searching by exact key matching. In this paper we focus on similarity searching and describe FuzzyPeer, a generic broadcast-based P2P system which supports a wide range of fuzzy queries. As a case study we present an image retrieval application implemented on top of FuzzyPeer. Users provide sample images whose sets of features are propagated through the peers. The answer consists of the top-k most similar images within the query horizon. In our system the participation of peers is ad hoc and dynamic, their functionality is symmetric and there is no centralized index. In order to avoid flooding the network with messages, we develop a technique that takes advantage of the fuzzy nature of the queries. Specifically, some queries are "frozen" inside the network, and are satisfied by the streaming results of similar queries that are already running. We describe several optimization techniques for single and multiple-attribute queries, and study their tradeoffs. We evaluate the performance of our algorithms by a prototype implementation on our P2P platform and a simulated large-scale network. Our results suggest that by reusing the existing streams, the scalability of the system improves both in terms of number of nodes and query throughput.
Original language | English (US) |
---|---|
Pages (from-to) | 57-72 |
Number of pages | 16 |
Journal | Information Systems |
Volume | 31 |
Issue number | 1 |
DOIs | |
State | Published - Mar 2006 |
Externally published | Yes |
Keywords
- Gnutella
- Peer-to-peer
- Similarity searching
ASJC Scopus subject areas
- Software
- Information Systems
- Hardware and Architecture