We have implemented an inverted index as a part of a mass collaboration system. It provides the facility to search for documents that satisfy a given query. It also supports incremental updates whereby documents can be added without re-indexing. The index can be queried even when updates are being done to it. Further, querying can be done in two modes. A normal mode that can be used when an immediate response is required and a batched mode that can provide better throughput at the cost of increased response time for some requests. The batched mode may be useful in an alert system where some of the queries are can be scheduled. We have implemented generators to generate large data sets that we use as benchmarks. We have tested our inverted index with data sets of the order of gigabytes to ensure scalability
To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.