Information Retrieval Lab Projects

Complex Document Image Processing (CDIP)
A data set of document text and metadata for research that requires a large body of queryable information.

Detecting Misuse for Information Retrieval Systems
According to studies by the Computer Security Institute and the Federal Bureau of Investigation, insider abuse ranks second among threats, after viruses (i.e., malicious code).

IIT Intranet Mediator
A data mediator that addresses query dispatching and result integration across the varied data repositories of an intranet environment.

Parallel Clustering and Classification
A scalable parallel approach to clustering and classification of a large document corpus.

Search Study
A thorough, ongoing study that examines and compares today's leading search engines.

SQLGenerator
SQLGenerator is a scalable XML retrieval engine, developed in collaboration with Bitsystems, Inc., that fully implements the XML-QL query language by translating it to SQL.