11 research outputs found

    A Review on Identification of Contextual Similar Sentences

    Get PDF
    The task of identifying contextual similar sentences plays a crucial role in various natural language processing applications such as information retrieval, paraphrase detection, and question answering systems. This paper presents a comprehensive review of the methodologies, techniques, and advancements in the identification of contextual similar sentences. Beginning with an overview of the importance and challenges associated with this task, the paper delves into the various approaches employed, including traditional similarity metrics, deep learning architectures, and transformer-based models. Furthermore, the review explores different datasets and evaluation metrics used to assess the performance of these methods. Additionally, the paper discusses recent trends, emerging research directions, and potential applications in the field. By synthesizing existing literature, this review aims to provide researchers and practitioners with insights into the state-of-the-art techniques and future avenues for advancing the identification of contextual similar sentences

    Clustering Data Text Based on Semantic

    Get PDF
    Clustering is one of the most important data mining techniques which categorize a large number of unordered text documents into meaningful and coherent clusters. Most of text clustering algorithms do not consider the semantic relationships between words and do not have the ability to recognize and use the semantic concepts.In this paper, a new algorithm has been presented to cluster texts based on meanings of the words. First, a new method has been presented to find semantic relationship between words based on Wordnet ontology then, text data is clustered using the proposed method and hierarchical clustering algorithm. Documents are preprocessed, converted to vector space model, and then are clustered using the proposed algorithm semantically. The experimental results show that the quality and accuracy of the proposed algorithm are more reliable than the existing hierarchical clustering algorithms

    Model of a semantic web meta-browser based on a general knowledge taxonomy, a general domain ontology, specific ontologies and user profile

    Get PDF
    Web search has become one of the most important fields of research around the world. They are many reasonsincluding: the fast-growing nature of information sources; the search necessity for information closer to specificuser requirements; the need to reduce search time; and the desire to take into account the semantics of terms usedwhen doing search queries. This paper shows a semantic meta-web search model called XGhobi which uses indexedresources by Google, Yahoo! and Bing. The XGhobi engine combines a general taxonomy of knowledge, a generaldomain ontology –WordNet-, a set of specific domain ontologies, and user profile management to improve therelevance of recovered documents in both English and Spanish. A detailed description of the meta-web searchengine’s components, some user interfaces and its results and its assessments are shown. The assessment covers theobtained precision on tests done by users.La bĂșsqueda web en los Ășltimos años se ha convertido en una de las ĂĄreas de investigaciĂłn mĂĄs importantes delmundo, debido entre otras cosas: al crecimiento acelerado de las fuentes de informaciĂłn, a la necesidad de contar coninformaciĂłn mĂĄs relevante a los requerimientos especĂ­ficos de cada usuario, a la exploraciĂłn de menores tiempos debĂșsqueda y a la falta de usar la semĂĄntica de los tĂ©rminos implicados en las consultas. En este artĂ­culo se presentael modelo de un meta-buscador (usa los recursos indexados por Google, Yahoo! y Bing) web semĂĄntico llamadoXGhobi, que incorpora una taxonomĂ­a general de conocimiento, una ontologĂ­a de dominio general (WordNet), unconjunto de ontologĂ­as de dominio especĂ­fico y el perfil de los usuarios para mejorar la relevancia de los documentosrecuperados tanto en inglĂ©s como en español. Se describe en detalle los componentes del meta-buscador, algunasinterfaces de usuario y los resultados de su evaluaciĂłn. La evaluaciĂłn del sistema muestra la precisiĂłn obtenida enpruebas realizadas con usuarios

    Modelo de un meta-buscador web semĂĄntico basado en una taxonomĂ­a general de conocimiento, una ontologĂ­a de dominio general, ontologĂ­as especĂ­ficas y perfil de usuario

    Get PDF
    La bĂșsqueda web en los Ășltimos años se ha convertido en una de las ĂĄreas de investigaciĂłn mĂĄs importantes delmundo, debido entre otras cosas: al crecimiento acelerado de las fuentes de informaciĂłn, a la necesidad de contar coninformaciĂłn mĂĄs relevante a los requerimientos especĂ­ficos de cada usuario, a la exploraciĂłn de menores tiempos debĂșsqueda y a la falta de usar la semĂĄntica de los tĂ©rminos implicados en las consultas. En este artĂ­culo se presentael modelo de un meta-buscador (usa los recursos indexados por Google, Yahoo! y Bing) web semĂĄntico llamadoXGhobi, que incorpora una taxonomĂ­a general de conocimiento, una ontologĂ­a de dominio general (WordNet), unconjunto de ontologĂ­as de dominio especĂ­fico y el perfil de los usuarios para mejorar la relevancia de los documentosrecuperados tanto en inglĂ©s como en español. Se describe en detalle los componentes del meta-buscador, algunasinterfaces de usuario y los resultados de su evaluaciĂłn. La evaluaciĂłn del sistema muestra la precisiĂłn obtenida enpruebas realizadas con usuarios.Web search has become one of the most important fields of research around the world. They are many reasonsincluding: the fast-growing nature of information sources; the search necessity for information closer to specificuser requirements; the need to reduce search time; and the desire to take into account the semantics of terms usedwhen doing search queries. This paper shows a semantic meta-web search model called XGhobi which uses indexedresources by Google, Yahoo! and Bing. The XGhobi engine combines a general taxonomy of knowledge, a generaldomain ontology –WordNet-, a set of specific domain ontologies, and user profile management to improve therelevance of recovered documents in both English and Spanish. A detailed description of the meta-web searchengine’s components, some user interfaces and its results and its assessments are shown. The assessment covers theobtained precision on tests done by users

    Procedimento para a construção de índices semùnticos baseados em ontologias de domínio específico

    Get PDF
    The current on-line search systems are still far from providing users with contextualized and accurate answers because users have to make additional efforts to filter and evaluate information supplied to them. One of the ways to improve the results is to create semantic indexes that incorporate knowledge and intelligent processing of resources. When it comes to the implementation of semantic indexes, however, there is a wide range of research studies with their own procedures and lengthy conceptualization, implementation, and refinement processes. Thus, it becomes of the utmost importance to define an instrument that allows creating these kinds of structures in a more structured and efficient manner. This work proposes a procedure that makes it possible to create semantic indexes based on domain-specific ontologies. The methodology entailed creating a state of the art of the various existing proposals and drawing a general procedure that incorporates the best practice for creating semantic indexes. Then, a semantic index was created of the domain of plants and their components. The results demonstrate that the defined process is a good instrument that guides implementation of these kinds of structures with a high degree of customization. Nevertheless, it also shows that the process depends on other variables in building and processing the index, so the design needs to be re-examined until the desirable results are obtained.Los sistemas de bĂșsqueda web actual, aĂșn estĂĄn lejos de ofrecer respuestas completamente contextualizadas y precisas a los usuarios, ya que Ă©stos deben hacer esfuerzos adicionales de filtrado y evaluaciĂłn de la informaciĂłn proporcionada. Una forma de mejorar los resultados, es mediante la creaciĂłn de Ă­ndices semĂĄnticos, los cuales incorporan conocimiento y procesamiento inteligente de los recursos. Sin embargo, al momento de implementar los Ă­ndices semĂĄnticos, existen variadas investigaciones con procedimientos propios y con procesos largos de conceptualizaciĂłn, implementaciĂłn y afinaciĂłn. Es asĂ­, como se vuelve importante definir una herramienta que permita crear este tipo de estructuras de una manera mĂĄs estructurada y eficiente. El presente trabajo propone un procedimiento que permite crear Ă­ndices semĂĄnticos a partir de ontologĂ­as de dominio especĂ­fico. La metodologĂ­a utilizada fue la creaciĂłn de un estado del arte de las diferentes propuestas existentes y posteriormente la abstracciĂłn de un procedimiento general que incorpore las mejores prĂĄcticas de creaciĂłn de Ă­ndices semĂĄnticos. Posteriormente, se creĂł un Ă­ndice semĂĄntico el dominio de las plantas y sus componentes. Los resultados permiten establecer que el proceso definido es una buena herramienta para guiar la implementaciĂłn de este tipo de estructuras con un alto grado de personalizaciĂłn. Sin embargo, tambiĂ©n evidenciĂł que el proceso depende otras variables al momento de construir y trabajar con el Ă­ndice y por lo tanto se debe reevaluar el diseño hasta obtener los resultados deseados.Os atuais sistemas de busca na web, estĂŁo ainda longe de fornecer respostas plenamente contextualizadas e precisas aos usuĂĄrios, uma vez que eles devem fazer esforços extras de filtragem e avaliação das informaçÔes fornecidas. Uma forma de melhorar os resultados Ă© atravĂ©s da criação de Ă­ndices semĂąnticos, que incorporam conhecimento e processamento inteligente dos recursos. No entanto, no momento de implementar os Ă­ndices semĂąnticos, existem variadas investigaçÔes com procedimentos prĂłprios e com longos processos de conceituação, implementação e ajuste. É assim que se torna importante definir uma ferramenta que permita criar este tipo de estruturas de uma maneira mais estruturada e eficiente. Este artigo propĂ”e um procedimento que permite criar Ă­ndices semĂąnticos a partir de ontologias de domĂ­nio especĂ­fico. A metodologia usada foi a criação de um estado de arte das diferentes propostas existentes e posteriormente a abstração de um procedimento geral que incorpore as melhores prĂĄticas de criação de Ă­ndices semĂąnticos. Posteriormente, foi criado um Ă­ndice semĂąntico de masterização das plantas e seus componentes. Os resultados permitem estabelecer que o processo definido Ă© uma boa ferramenta para orientar a implementação deste tipo de estruturas com um alto grau de personalização. No entanto, tambĂ©m revelou que o processo depende de outras variĂĄveis no momento de construir e trabalhar com o Ă­ndice e, portanto, o projeto deve ser reavaliado atĂ© obter os resultados desejado

    Nanoscale simulation of crystal defects with application to mantle minerals

    Get PDF
    Minor and trace elements can influence the chemical and physical properties of the Earth's mantle, whether by forming separate minerals that can host trace elements at far higher concentrations than major mantle minerals, or by influencing physical properties of major minerals, for instance by enhancing diffusion or pinning topological defects. The present study uses atomistic computational modeling to investigate the properties of calcium phosphate minerals at mantle pressures, and to determine if cation vacancies can lubricate dislocation glide in mantle minerals. With their large, high-coordination cation sites, calcium phosphate minerals can potentially dissolve large quantities of incompatible elements, and may be an important mantle reservoirs for these elements. Quantum mechanical calculations are used to determine the variation of structure and properties of apatite, a common low-pressure phosphate, with the identity of its channel anion. The crystal structure and compressibility are both found to vary substantially with channel anion identity. Tuite, which forms by the breakdown of apatite at high pressure, may extend the phosphorous cycle beyond the stability field of apatite and into the lower mantle. DFT calculations are used to show that the pressure at which apatite decomposes to form tuite is sensitive to the identity of the channel anion. The calculated bulk moduli of the calcium-hosting sites in tuite are considerably lower than those of the M site in CaSiO3 perovskite (cpv). Atomistic calculations show that strontium and barium impurities partition strongly from cpv into tuite. The second part of this thesis considers the role that dislocations play in hosting vacancy-related defects, and influence that these segregated defects may have on the rheology of mantle minerals. Interatomic potentials are used to calculate energies for the segregation of bare and protonated cation defects to dislocation cores in MgO and forsterite. Vacancies segregate strongly to [100](010) and [001](010) dislocations in forsterite. [100](010) edge dislocations serve as particularly suitable hosts for vacancies on the M1 sub-lattice, while vacant M2 sites, which are energetically unfavorable in the unstrained bulk lattice are more common around [001](010) edge dislocations. Similarly, both {□Mg}″ and {2HMg}X defects segregate strongly to all of the major dislocation slip systems in MgO. For both minerals, cation vacancies will be far higher at dislocation core sites than in the unstrained lattice. Peierls-Nabarro calculations, parameterized using generalized stacking fault energies calculated using plane-wave density functional theory, show that protonated vacancies increase dislocation core widths and reduce the Peierls stresses in MgO. Bare Mg vacancies, especially those on the M2 sub-lattice, similarly reduce g-line energies and Peierls stresses for the dislocations in olivine. The magnitude of this decrease depends strongly on dislocation and the type of the lattice site (i.e. M1 or M2). Concentrations of vacancy related defects are likely to be elevated particularly in mantle wedge, due to the combination of oxidizing conditions and the availability of liquid water from dehydrating phases in the subducting slab, and changes in the olivine deformation fabric in these regions match those predicted in this study
    corecore