Skip to main content
Article thumbnail
Location of Repository

Smart Search Engine For Information Retrieval

By LEILEI SUN

Abstract

This project addresses the main research problem in information retrieval and semantic search. It proposes the smart search theory as new theory based on hypothesis that semantic meanings of a document can be described by a set of\ud keywords. With two experiments designed and carried out in this project, the experiment result demonstrates positive evidence that meet the smart search theory.\ud \ud In the theory proposed in this project, the smart search aims to determine a set of keywords for any web documents, by which the semantic meanings of the documents can be uniquely identified. Meanwhile, the size of the set of keywords is supposed to be small enough which can be easily managed. This is the fundamental assumption for creating the smart semantic search engine. In this project, the rationale of the assumption and the theory based on it will be discussed, as well as the processes of how the theory can be applied to the keyword allocation and the data model to be\ud generated. Then the design of the smart search engine will be proposed, in order to create a solution to the efficiency problem while searching among huge amount of increasing information published on the web.\ud \ud To achieve high efficiency in web searching, statistical method is proved to be an effective way and it can be interpreted from the semantic level. Based on the frequency of joint keywords, the keyword list can be generated and linked to each other to form a meaning structure. A data model is built when a proper keyword list is achieved and the model is applied to the design of the smart search engine

Topics: Search Engine, Google, Information Theory, Information Retrieval, Semantic Web
Year: 2009
OAI identifier: oai:etheses.dur.ac.uk:72
Provided by: Durham e-Theses

Suggested articles

Citations

  1. (2006). A Semantic Search Engine for the Storage Resource Broker. Available at: http://dart.edu.au/publications/global-grid-forum-2006-semantic-workshop.pdf
  2. (2008). A Semantic Web Primer. 2 nd edition. doi
  3. (2008). A Semantic Web Primer. 2nd edition. doi
  4. (2007). Data Mining and its Scope. Available at: http://www.rimtengg.com/coit2007/proceedings/pdfs/109.pdf
  5. (1995). Dynamic Inverted Indexes for a Distributed Full-Text Retrieval System,
  6. (2008). Improving the Search Efficiency of Unstructured P2P Networks by Differentiated Search Algorithm. Available at: http://csconf.org/Volume2/page426.pdf
  7. (2005). Improving Web Search Efficiency via a Locality Based Static Pruning Method. Available at: http://www2005.org/cdrom/docs/p235.pdf
  8. (2006). Information retrieval systems: A perspective on human computer interaction’,
  9. (1968). Information Retrieval Systems: Characteristics, Testing and Evaluation. doi
  10. (2004). Information retrieval: Algorithms and heuristics. doi
  11. (1979). Information Retrieval. 2nd edition. doi
  12. (1996). Knowledge Discovery and Data Mining: Towards a Unifying Framework’,
  13. (1999). Modern Information Retrieval. doi
  14. (2008). Page 70 Semantic Web
  15. (2006). Probabilistic information retrieval approach for ranking of database query results’, doi
  16. (2006). Search Engine: An Effective tool for exploring the Internet,
  17. (2008). Search evaluation at Google. Available
  18. (2007). Semantic Web for the Working Ontologist: Effective Modelling doi
  19. (2006). Semantic Web technologies. Trends and research in ontology-based systems: doi
  20. (2006). Terrier: A high performance and scalable information retrieval platform. Available at: http://ir.dcs.gla.ac.uk/terrier/publications/ounis06terrier-osir.pdf
  21. (1998). The anatomy of large-scale hyper textual web search engine, doi
  22. (2000). The great libraries: From antiquity to the Renaissance, 3000 B.C. to A.D.1600. doi
  23. (2004). The Lucene Search Engine: Adding search to your applications. doi
  24. (2000). The Lucene search engine: Powerful, flexible and free. Available at:

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.