In Information Retrieval, a thesaurus that lists each term together with its similar
terms can be used to search for documents within a collection. As the volume of
information grows, a thesaurus is expected to play a larger role in finding information, so that
more relevant documents can be retrieved. The purpose of this research is to find a method for
automatic thesaurus generation. This process requires a term dictionary for a specific field and a
group of documents, over which the relationships among the existing terms are calculated.
The thesaurus is generated by calculating the paired-occurrence value between
terms found in a collection of documents. The experiment uses a group of
respondents to define the terms that are relevant to a specific term. The results show that the
system can generate the thesaurus with an average recall of 59.62% and an average
precision of 66.78%.
Keywords: Information Retrieval, automatic thesaurus generation, similarity, recall,
Indonesian thesaurus
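
The approach summarized above can be illustrated with a minimal sketch. It is not the authors' implementation: the paired-occurrence value is assumed here to be the number of documents in which two dictionary terms appear together, the `min_count` threshold is a hypothetical parameter, and the documents, dictionary, and respondent judgments are invented toy data.

```python
from collections import defaultdict
from itertools import combinations


def paired_occurrence(documents, dictionary):
    """Count, for each pair of dictionary terms, the documents containing both.

    Assumption: the paired-occurrence value is a plain document-level count.
    """
    counts = defaultdict(int)
    for text in documents:
        # Keep only the dictionary terms that occur in this document.
        present = sorted(set(text.lower().split()) & dictionary)
        for a, b in combinations(present, 2):
            counts[(a, b)] += 1
    return counts


def build_thesaurus(counts, min_count=2):
    """Relate two terms when they co-occur in at least min_count documents."""
    thesaurus = defaultdict(set)
    for (a, b), n in counts.items():
        if n >= min_count:
            thesaurus[a].add(b)
            thesaurus[b].add(a)
    return thesaurus


def precision_recall(suggested, relevant):
    """Compare the terms suggested by the system with respondent judgments."""
    hits = len(suggested & relevant)
    precision = hits / len(suggested) if suggested else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Toy data, invented for illustration only.
documents = [
    "bank loan interest",
    "bank interest rate",
    "loan credit bank",
    "river bank water",
]
dictionary = {"bank", "loan", "interest", "credit", "rate"}

counts = paired_occurrence(documents, dictionary)
thesaurus = build_thesaurus(counts, min_count=2)

# Hypothetical respondent judgment for the term "bank".
relevant_for_bank = {"loan", "credit", "interest"}
precision, recall = precision_recall(thesaurus["bank"], relevant_for_bank)
print(sorted(thesaurus["bank"]), precision, recall)
```

In this toy run the system suggests "interest" and "loan" for "bank", both of which the hypothetical respondents marked as relevant, while "credit" is missed because it co-occurs with "bank" in only one document. Averaging such per-term precision and recall values over all evaluated terms would give figures of the kind reported above.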