Group Meeting Minutes
Date: Aug. 11th, 2005
Place: SB232C
Attendees: Wai Gen, Linh, Dongmei
1/ Dongmei's presentation on
the paper "Text-Based Content Search and
Retrieval in ad hoc P2P Communities":
- The paper addresses the main
issues of p2p information retrieval systems: peer selection, document
selection, and result merging (similar to issues of a metasearch engine).
- The authors consider the
content of each peer as one super-document (a document that contains all
terms that appear in any of the peer's documents) and use a Bloom filter to
represent the content of the peer.
- The authors propose using a
gossiping algorithm to distribute the Bloom filter of a peer to all other
peers in the p2p network. By doing this, every peer in the network has global
information (which they call the global index) about the content of the others.
- The authors suggest using
IPF (Inverse Peer Frequency), which is defined analogously to IDF (Inverse
Document Frequency), to evaluate the similarity between a peer and a given
query. Based on the similarity values, a querying peer can rank all other
peers and decide which peers to forward the query to.
- For the document selection
issue, the authors suggest that the querying peer contact the top-ranked
peers sequentially and maintain a list of the top-k results. The
search process stops once p consecutive peers fail to
contribute to the top-k list.
- For result merging, the
authors use IPF instead of IDF to rank the results. The formula for
the rank score is similar to that of the TF/IDF technique.
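The peer selection and stopping steps summarized above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the IPF form log(1 + N / N_t) is assumed here by analogy with IDF, and the names BloomFilter, summarize_peer, ipf, rank_peers, search, and the fetch_results callback are hypothetical. Gossiping is not modeled; the global index is simply a dictionary of peer name to Bloom filter, and fetch_results stands in for whatever a contacted peer returns as (score, document id) pairs.

```python
import hashlib
import math


class BloomFilter:
    """Fixed-size Bloom filter summarizing the term set of one peer."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = 0  # a Python int used as a bit array

    def _positions(self, term):
        # Derive num_hashes bit positions by salting a single hash function.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{term}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, term):
        for pos in self._positions(term):
            self.bits |= 1 << pos

    def __contains__(self, term):
        # May report a false positive, never a false negative.
        return all((self.bits >> pos) & 1 for pos in self._positions(term))


def summarize_peer(documents):
    """Treat all documents of a peer as one super-document."""
    bf = BloomFilter()
    for doc in documents:
        for term in doc.lower().split():
            bf.add(term)
    return bf


def ipf(term, global_index):
    """Assumed form: log(1 + N / N_t), N_t = peers whose filter holds term."""
    n_t = sum(1 for bf in global_index.values() if term in bf)
    return math.log(1 + len(global_index) / n_t) if n_t else 0.0


def rank_peers(query_terms, global_index):
    """Score each peer by the summed IPF of the query terms in its filter."""
    weights = {t: ipf(t, global_index) for t in query_terms}
    scores = {
        peer: sum(w for t, w in weights.items() if t in bf)
        for peer, bf in global_index.items()
    }
    ordered = sorted(scores, key=scores.get, reverse=True)
    return [peer for peer in ordered if scores[peer] > 0]


def search(ranked_peers, fetch_results, k=3, p=2):
    """Contact peers in rank order; stop after p peers in a row add nothing."""
    top_k = []  # list of (score, doc_id), best first
    misses = 0
    for peer in ranked_peers:
        merged = sorted(set(top_k) | set(fetch_results(peer)), reverse=True)[:k]
        if merged == top_k:
            misses += 1
            if misses >= p:
                break
        else:
            top_k, misses = merged, 0
    return top_k
```

In this sketch a peer only "contributes" if its results change the current top-k list, which is one plausible reading of the stopping rule. The scores returned by fetch_results would come from the TF/IPF-style ranking mentioned in the last bullet, which is left out here.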
2/ Assignments:
Linh:
- Update simulator's source
code.
- Send experiment results and
the query relaxation paper to Wai Gen.
- Prepare for the qualifier.
- Write the summary report
(due on Aug 25).
Dongmei:
- Work on the prototype.
- Prepare for the qualifier.
- Write the summary report
(due on Aug 25).
3/ Paper for next meeting:
Querying Peer-to-Peer
Networks Using P-Trees. Adina Crainiceanu et al., CS Dept., Cornell University. In Proc. of the 7th Intl. Workshop on the Web and Databases (WebDB
2004), June 17-18, 2004, Paris, France.
http://www.cs.cornell.edu/database/Pepper/PTree.pdf