PIRS: Peer-to-Peer Information Retrieval System

News

Our P2P IR research tool IR-Wire and sample data sets are available to download!

Description

PIRS is a peer-to-peer (P2P) information retrieval (IR) system. Its goal is to give P2P systems IR-style search capabilities. Current P2P search relies on substring matching of keywords, which leaves much to be desired.
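
To make the limitation concrete, here is a minimal sketch of substring-based keyword matching. It assumes a query matches a file when every query keyword appears as a substring of the file's name. The names `substring_match` and `search` and the sample file names are hypothetical; they are not taken from PIRS or from any particular client.

```python
def substring_match(query: str, filename: str) -> bool:
    """Return True if every query keyword is a substring of the file name."""
    name = filename.lower()
    return all(term in name for term in query.lower().split())


# Hypothetical shared files, used only for illustration.
FILES = [
    "beatles - let it be.mp3",
    "the_beatles_let_it_be_live.mp3",
    "let it be (cover).mp3",
]


def search(query: str) -> list:
    # A file either matches or it does not; the result list is unranked
    # and simply preserves the order in which files were examined.
    return [f for f in FILES if substring_match(query, f)]


print(search("beatles let"))        # the two files whose names contain both keywords
print(search("beetles let it be"))  # a misspelled keyword matches nothing
```

Under these assumptions, a single misspelled keyword returns no results at all, and files that do match come back in no particular order of relevance.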

PIRS improves upon current commercial P2P systems (e.g., Gnutella and Kazaa) by adding three components:

  1. Metadata collector
  2. Metadata distributor
  3. Result ranker

Because PIRS functionality is independent of lower-level network management and query routing, it can be easily incorporated into today's commercial P2P file sharing systems.
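
The separation between the three components and the underlying network can be sketched roughly as follows. This is an illustration only, not the PIRS implementation: the page does not define what each component does, so the roles given in the docstrings are assumptions, as are the class names, the `Transport` type, and the term-overlap scoring.

```python
from collections import Counter
from typing import Callable, Iterable

# Hypothetical stand-in for the underlying P2P layer: given a query string,
# it returns the descriptors of matching files found by whatever routing
# scheme the network uses.
Transport = Callable[[str], Iterable[str]]


class MetadataCollector:
    """Assumed role: gather the descriptor terms observed for files."""

    def __init__(self) -> None:
        self.term_counts: Counter = Counter()

    def collect(self, descriptor: str) -> None:
        self.term_counts.update(descriptor.lower().split())


class MetadataDistributor:
    """Assumed role: choose which collected terms to share with other peers."""

    def top_terms(self, collector: MetadataCollector, k: int) -> list:
        return [term for term, _ in collector.term_counts.most_common(k)]


class ResultRanker:
    """Assumed role: order results by the number of query terms they contain."""

    def rank(self, query: str, descriptors: Iterable[str]) -> list:
        terms = set(query.lower().split())
        scored = [(len(terms & set(d.lower().split())), d) for d in descriptors]
        # Highest score first; the sort is stable, so ties keep arrival order.
        return [d for _, d in sorted(scored, key=lambda pair: -pair[0])]


def search(query: str, transport: Transport, ranker: ResultRanker) -> list:
    # The transport is opaque here: any routing scheme can sit behind it.
    return ranker.rank(query, transport(query))
```

In this sketch the search layer only ever calls `transport(query)`, so swapping one network for another would mean supplying a different callable while the collector, distributor, and ranker stay unchanged.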

Publications*

  1. D. Jia, W. G. Yee, O. Frieder, "Spam Characterization and Detection in Peer-to-Peer File-Sharing Systems", In Proc. 2008 ACM Conf. on Inf. and Knowl. Mgt. (CIKM08), 2008. [PDF]
  2. D. Jia, "Cost-Effective Spam Detection in P2P File-Sharing Systems", In Proc. 2008 ACM CIKM Workshops, 2008, presented in the Workshop on Large Scale Distributed Systems for Information Retrieval, 2008 (LSDS-IR08). [PDF]
  3. L. T. Nguyen, W. G. Yee, O. Frieder, "Adaptive Distributed Indexing for Structured Peer-to-Peer Networks", In Proc. 2008 ACM Conf. on Inf. and Knowl. Mgt. (CIKM08), 2008. [PDF]
  4. L. T. Nguyen, W. G. Yee, O. Frieder, "Query Workload Driven Summarization for P2P Query Routing", In Proc. 2008 IEEE Conf. on Peer to Peer Comp. (P2P08), 2008. [PDF]
  5. Efficient Query Routing by Improved Peer Description in P2P Networks. W. Yee, L. T. Nguyen, D. Jia, O. Frieder. In Proc. ACM/ICST Infoscale, 2008. [Version with some edits for clarity: PDF / Original version: PDF] [Slides: PPT] (Best Paper Award)
  6. Distributed, Automatic File Description Tuning in Peer-to-Peer File-Sharing Systems. D. Jia, W. G. Yee, L. T. Nguyen, O. Frieder. In Proc. IEEE P2P, 2007. [PDF]
  7. Masked Queries for Search Accuracy in Peer-to-Peer File-Sharing Systems. Wai Gen Yee, Linh Thai Nguyen, and Ophir Frieder. Proc. IEEE IPDPS, 2007. [PDF] [Slides: PPT]
  8. Novel Applications of Information Retrieval Techniques to Peer-to-Peer File-Sharing Systems. Wai Gen Yee, Linh Thai Nguyen, and Ophir Frieder. Proc. Wkshp. P2PIR, 2006. [PDF]
  9. Improved Result Ranking in P2P File-Sharing Systems by Probing for Meta-data. W. G. Yee, L. T. Nguyen, and O. Frieder. In Proc. IEEE NCA, 2006. [PDF]
  10. IR-Wire: A Research Tool for P2P Information Retrieval. S. Sharma, L. T. Nguyen, D. Jia. In Proc. ACM Wkshp. Open Source Inf. Retr., 2006. [PDF]
  11. Automatic Tuning of File Descriptors in P2P File-Sharing Systems. D. Jia, W. G. Yee, and O. Frieder. In Proc. Wkshp on Web and Databases (WebDB), 2006. [PDF]
  12. Conjunction Dysfunction: The Weakness of Conjunctive Queries in Peer-to-Peer File-sharing Systems. W. G. Yee, L. T. Nguyen, O. Frieder. In Proc. IEEE P2P, 2006. [PDF] [Slides: PPT]
  13. Search in Peer-to-Peer File-Sharing System: Like Metasearch Engines, But Not Really, Wai Gen Yee, Dongmei Jia, Linh Thai Nguyen, Workshop on Open Source Web Information Retrieval, Compiègne, France, September 2005. Michel Beigbeder, Wai Gen Yee (Eds.), ISBN: 2-913923-19-4, pp. 35-38. [PDF]
  14. Finding Rare Data Objects in P2P File-Sharing Systems, Wai Gen Yee, Dongmei Jia, and Ophir Frieder, IEEE P2P, 2005 [PDF]
  15. On Search in Peer to Peer File Sharing Systems, Wai Gen Yee and Ophir Frieder, ACM SAC, 2005 [PDF]
  16. The Design of PIRS, a Peer-to-peer Information Retrieval System, Wai Gen Yee and Ophir Frieder, DBISP2P 2004 Workshop [PDF]

*For more related publications, please see my CV.

Courses

CS 595 - Design and Analysis of Distributed System Infrastructures, Spring, 2004

Resources

Data used in our experiments can be found in our data directory.

P2P Research Group Page

If you have a particular interest in this project (e.g., the data or simulator), please contact us. We are happy to help. All we ask is that you cite our work.