Full-text search engines are important tools for information retrieval. Term
proximity is an important factor in relevance score measurement. In a proximity
full-text search, we assume that a relevant document contains query terms near
each other, especially if the query terms are frequently occurring words. A
methodology for high-performance full-text query execution is discussed. We
build additional indexes to achieve better efficiency. For a word that occurs
in the text, we include in the indexes some information about nearby words.
What types of additional indexes do we use? How do we use them? These questions
are discussed in this work. We present the results of experiments showing that
the average time of search query execution is 44-45 times less than that
required when using ordinary inverted indexes.
This is a pre-print of a contribution "Veretennikov A.B. Proximity Full-Text
Search with a Response Time Guarantee by Means of Additional Indexes" published
in "Arai K., Kapoor S., Bhatia R. (eds) Intelligent Systems and Applications.
IntelliSys 2018. Advances in Intelligent Systems and Computing, vol 868"
published by Springer, Cham. The final authenticated version is available
online at: https://doi.org/10.1007/978-3-030-01054-6_66. The work was supported
by Act 211 Government of the Russian Federation, contract no 02.A03.21.0006.Comment: Alexander B. Veretennikov. Chair of Calculation Mathematics and
Computer Science, INSM. Ural Federal Universit