9 research outputs found

    Tracing Linguistic Relations in Winning and Losing Sides of Explicit Opposing Groups

    Full text link
    Linguistic relations in oral conversations present how opinions are constructed and developed in a restricted time. The relations bond ideas, arguments, thoughts, and feelings, re-shape them during a speech, and finally build knowledge out of all information provided in the conversation. Speakers share a common interest to discuss. It is expected that each speaker's reply includes duplicated forms of words from previous speakers. However, linguistic adaptation is observed and evolves in a more complex path than just transferring slightly modified versions of common concepts. A conversation aiming a benefit at the end shows an emergent cooperation inducing the adaptation. Not only cooperation, but also competition drives the adaptation or an opposite scenario and one can capture the dynamic process by tracking how the concepts are linguistically linked. To uncover salient complex dynamic events in verbal communications, we attempt to discover self-organized linguistic relations hidden in a conversation with explicitly stated winners and losers. We examine open access data of the United States Supreme Court. Our understanding is crucial in big data research to guide how transition states in opinion mining and decision-making should be modeled and how this required knowledge to guide the model should be pinpointed, by filtering large amount of data.Comment: Full paper, Proceedings of FLAIRS-2017 (30th Florida Artificial Intelligence Research Society), Special Track, Artificial Intelligence for Big Social Data Analysi

    БСмантичСский Π°Π½Π°Π»ΠΈΠ· ΠΈ поиск тСкстов Π½Π° СстСствСнном языкС для Π˜Π½Ρ‚Π΅Ρ€Π½Π΅Ρ‚-ΠΏΠΎΡ€Ρ‚Π°Π»Π°

    Get PDF
    The article is devoted to solving the set of problems related to natural language texts semantic analysis. The following problems are addressed: automation of generating metadata files describing the semantic representation of a web page; semantic network construction for a given set of texts; semantic search execution for a given set of texts using metadata files; and semantic network export to RDF format. The algorithms for knowledge extraction from text, semantic network construction and query execution on a given semantic network are described. The lexico-syntactic patterns method was used as a basis to approach these problems. A specification for describing lexico-syntactic patterns has been developed and a pattern interpreter based on the morphological dictionary of the Ukrainian language has been created as a part of the software implementation of the method. Experimental studies have been carried out for the Β«classification of living organismsΒ» subject environment set of patterns. Modified Boyer–Moore–Horspool algorithm was used to address the problem of interpreting.Бтаття присвячСна Ρ€ΠΎΠ·Π²β€™ΡΠ·Π°Π½Π½ΡŽ комплСксу Π·Π°Π΄Π°Ρ‡ Π· сСмантичного Π°Π½Π°Π»Ρ–Π·Ρƒ тСкстів ΠΏΡ€ΠΈΡ€ΠΎΠ΄Π½ΠΎΡŽ мовою. Розглянуті Ρ‚Π°ΠΊΡ– Π·Π°Π΄Π°Ρ‡Ρ–: автоматизація процСсу Π³Π΅Π½Π΅Ρ€Π°Ρ†Ρ–Ρ— Ρ„Π°ΠΉΠ»Ρ–Π² ΠΌΠ΅Ρ‚Π°Π΄Π°Π½ΠΈΡ…, Ρ‰ΠΎ ΠΎΠΏΠΈΡΡƒΡŽΡ‚ΡŒ сСмантичнС прСдставлСння Π²Π΅Π±-сторінки; ΠΏΠΎΠ±ΡƒΠ΄ΠΎΠ²Π° сСмантичної ΠΌΠ΅Ρ€Π΅ΠΆΡ– ΠΏΠΎ Π·Π°Π΄Π°Π½Ρ–ΠΉ ΠΌΠ½ΠΎΠΆΠΈΠ½Ρ– тСкстів; виконання сСмантичного ΠΏΠΎΡˆΡƒΠΊΡƒ ΠΏΠΎ Π·Π°Π΄Π°Π½Ρ–ΠΉ ΠΌΠ½ΠΎΠΆΠΈΠ½Ρ– тСкстів Π· використанням Ρ„Π°ΠΉΠ»Ρ–Π² ΠΌΠ΅Ρ‚Π°Π΄Π°Π½ΠΈΡ…; Скспорт сСмантичної ΠΌΠ΅Ρ€Π΅ΠΆΡ– Π² Ρ„ΠΎΡ€ΠΌΠ°Ρ‚ RDF. Для розв’язання поставлСних Π·Π°Π΄Π°Ρ‡ описані Π°Π»Π³ΠΎΡ€ΠΈΡ‚ΠΌΠΈ відокрСмлСння знань Ρ–Π· тСкстів, прСдставлСння Ρ—Ρ… Ρƒ вигляді сСмантичної ΠΌΠ΅Ρ€Π΅ΠΆΡ– Ρ– Π²ΠΈΠΊΠΎΠ½Π°Π½Π½Ρ– Π·Π°ΠΏΠΈΡ‚Ρ–Π² Π΄ΠΎ ΠΏΠΎΠ±ΡƒΠ΄ΠΎΠ²Π°Π½ΠΎΡ— ΠΌΠ΅Ρ€Π΅ΠΆΡ–. Основним ΠΏΡ–Π΄Ρ…ΠΎΠ΄ΠΎΠΌ Π΄ΠΎ розв’язання Ρ†ΠΈΡ… Π·Π°Π΄Π°Ρ‡ слугував ΠΌΠ΅Ρ‚ΠΎΠ΄ лСксико-синтаксичних ΡˆΠ°Π±Π»ΠΎΠ½Ρ–Π².Для ΠΏΡ€ΠΎΠ³Ρ€Π°ΠΌΠ½ΠΎΡ— Ρ€Π΅Π°Π»Ρ–Π·Π°Ρ†Ρ–Ρ— ΠΌΠ΅Ρ‚ΠΎΠ΄Ρƒ Ρ€ΠΎΠ·Ρ€ΠΎΠ±Π»Π΅Π½ΠΎ ΡΠΏΠ΅Ρ†ΠΈΡ„Ρ–ΠΊΠ°Ρ†Ρ–ΡŽ опису лСксико-синтаксичних ΡˆΠ°Π±Π»ΠΎΠ½Ρ–Π², створСно Ρ–Π½Ρ‚Π΅Ρ€ΠΏΡ€Π΅Ρ‚Π°Ρ‚ΠΎΡ€ ΡˆΠ°Π±Π»ΠΎΠ½Ρ–Π² Π½Π° основі ΠΌΠΎΡ€Ρ„ΠΎΠ»ΠΎΠ³Ρ–Ρ‡Π½ΠΎΠ³ΠΎ словнику ΡƒΠΊΡ€Π°Ρ—Π½ΡΡŒΠΊΠΎΡ— ΠΌΠΎΠ²ΠΈ. Π•ΠΊΡΠΏΠ΅Ρ€ΠΈΠΌΠ΅Π½Ρ‚Π°Π»ΡŒΠ½Ρ– дослідТСння ΠΏΡ€ΠΎΠ²Π΅Π΄Π΅Π½Ρ– для Π½Π°Π±ΠΎΡ€ ΡˆΠ°Π±Π»ΠΎΠ½Ρ–Π² ΠΏΡ€Π΅Π΄ΠΌΠ΅Ρ‚Π½ΠΎΠ³ΠΎ сСрСдовища «класифікація ΠΆΠΈΠ²ΠΈΡ… ΠΎΡ€Π³Π°Π½Ρ–Π·ΠΌΡ–Π²Β». Для розв’язання Π·Π°Π΄Π°Ρ‡Ρ– Ρ–Π½Ρ‚Π΅Ρ€ΠΏΡ€Π΅Ρ‚Π°Ρ†Ρ–Ρ— лСксико-синтаксичних ΡˆΠ°Π±Π»ΠΎΠ½Ρ–Π² використовувався ΠΌΠΎΠ΄ΠΈΡ„Ρ–ΠΊΠΎΠ²Π°Π½ΠΈΠΉ Π°Π»Π³ΠΎΡ€ΠΈΡ‚ΠΌ Π‘ΠΎΠΉΠ΅Ρ€Π°β€“ΠœΡƒΡ€Π°β€“Π₯орпускула.Π‘Ρ‚Π°Ρ‚ΡŒΡ посвящСна Ρ€Π΅ΡˆΠ΅Π½ΠΈΡŽ комплСкса Π·Π°Π΄Π°Ρ‡ сСмантичСского Π°Π½Π°Π»ΠΈΠ·Π° тСкстов Π½Π° СстСствСнном языкС. РассмотрСны ΡΠ»Π΅Π΄ΡƒΡŽΡ‰ΠΈΠ΅ Π·Π°Π΄Π°Ρ‡ΠΈ: автоматизация процСсса Π³Π΅Π½Π΅Ρ€Π°Ρ†ΠΈΠΈ Ρ„Π°ΠΉΠ»ΠΎΠ² ΠΌΠ΅Ρ‚Π°Π΄Π°Π½Π½Ρ‹Ρ…, ΠΎΠΏΠΈΡΡ‹Π²Π°ΡŽΡ‰ΠΈΡ… сСмантичСскоС прСдставлСниС Π²Π΅Π±-страницы; построСниС сСмантичСской сСти ΠΏΠΎ Π·Π°Π΄Π°Π½Π½ΠΎΠΌΡƒ мноТСству тСкстов; выполнСния сСмантичСского поиска ΠΏΠΎ Π·Π°Π΄Π°Π½Π½ΠΎΠΌΡƒ мноТСству тСкстов с использованиСм Ρ„Π°ΠΉΠ»ΠΎΠ² ΠΌΠ΅Ρ‚Π°Π΄Π°Π½Π½Ρ‹Ρ…; экспорт сСмантичСской сСти Π² Ρ„ΠΎΡ€ΠΌΠ°Ρ‚ RDF. Для Ρ€Π΅ΡˆΠ΅Π½ΠΈΡ поставлСнных Π·Π°Π΄Π°Ρ‡ описаны Π°Π»Π³ΠΎΡ€ΠΈΡ‚ΠΌΡ‹ выдСлСния Π·Π½Π°Π½ΠΈΠΉ ΠΈΠ· тСкстов, прСдставлСниС ΠΈΡ… Π² Π²ΠΈΠ΄Π΅ сСмантичСской сСти ΠΈ Π²Ρ‹ΠΏΠΎΠ»Π½Π΅Π½ΠΈΠΈ запросов ΠΊ построСнной сСти. ΠžΡΠ½ΠΎΠ²Π½Ρ‹ΠΌ ΠΏΠΎΠ΄Ρ…ΠΎΠ΄ΠΎΠΌ ΠΊ Ρ€Π΅ΡˆΠ΅Π½ΠΈΡŽ этих Π·Π°Π΄Π°Ρ‡ слуТил ΠΌΠ΅Ρ‚ΠΎΠ΄ лСксико-синтаксичСских шаблонов. Для ΠΏΡ€ΠΎΠ³Ρ€Π°ΠΌΠΌΠ½ΠΎΠΉ Ρ€Π΅Π°Π»ΠΈΠ·Π°Ρ†ΠΈΠΈ ΠΌΠ΅Ρ‚ΠΎΠ΄Π° Ρ€Π°Π·Ρ€Π°Π±ΠΎΡ‚Π°Π½Ρ‹ спСцификации описания лСксико-синтаксичСских шаблонов, создан ΠΈΠ½Ρ‚Π΅Ρ€ΠΏΡ€Π΅Ρ‚Π°Ρ‚ΠΎΡ€ шаблонов Π½Π° основС морфологичСского словарС украинского языка. Π­ΠΊΡΠΏΠ΅Ρ€ΠΈΠΌΠ΅Π½Ρ‚Π°Π»ΡŒΠ½Ρ‹Π΅ исслСдования ΠΏΡ€ΠΎΠ²Π΅Π΄Π΅Π½Ρ‹ для Π½Π°Π±ΠΎΡ€ шаблонов ΠΏΡ€Π΅Π΄ΠΌΠ΅Ρ‚Π½ΠΎΠΉ срСды «классификация ΠΆΠΈΠ²Ρ‹Ρ… ΠΎΡ€Π³Π°Π½ΠΈΠ·ΠΌΠΎΠ²Β». Для Ρ€Π΅ΡˆΠ΅Π½ΠΈΡ Π·Π°Π΄Π°Ρ‡ΠΈ ΠΈΠ½Ρ‚Π΅Ρ€ΠΏΡ€Π΅Ρ‚Π°Ρ†ΠΈΠΈ лСксико-синтаксичСских шаблонов использовался ΠΌΠΎΠ΄ΠΈΡ„ΠΈΡ†ΠΈΡ€ΠΎΠ²Π°Π½Π½Ρ‹ΠΉ Π°Π»Π³ΠΎΡ€ΠΈΡ‚ΠΌ Π‘ΠΎΠΉΠ΅Ρ€Π°-ΠœΡƒΡ€Π°-Π₯орпускул

    Building WordNet for Afaan Oromoo

    Get PDF
    WordNet is a lexical database which has many relations to disambiguate the sense of words for natural languages. From the WordNet relations synonyms and hyponym has major role for natural language processing and artificial intelligence applications. In this paper, word embedding (Word2Vec) and lexico-syntactic pattern (LSP) are developed to extract automatically synonyms and hyponyms respectively. For this study, the word embedding is evaluated on two specialized domain algorithms such as a continuous bag of words and Skip Gram algorithms and show superior results. Applying word embedding (Word2Vec) algorithms for Afaan Oromo texts has been registered 80.09% and 85.04% for the continuous bag of words and Skip Gram respectively. According to the result achieved in this study, the skip-gram algorithm does a better job for frequent pairs of words than a continuous bag of words. But, a continuous bag of words algorithm is faster while skip-gram is slower. A lexical syntactic pattern with the combination of Word2Vec and without Word2Vec is also evaluated using information retrieval evaluation metrics such as precision, recall and F-measure to extract hyponym relation from Afaan Oromoo texts. The precision, recall and F-measure have been registered by lexical syntactic patterns without the combination of Word2Vec is 66.73%, 72%, and 69.26% respectively and with the combination of Word2Vec 81.14%, 80.8%, and 81.1% have been registered for precision, recall and F-measure respectively. There are factors that could affect the accuracy of results: 1) the style of writer of Afaan Oromoo i.e. they write a noun phrase with many adjective to express the noun for the reader; and, 2) it is possible that some instances of the LSP are missed due to misspellings and other typographical errors. Keywords: Afaan Oromoo WordNet, Word embedding, Lexico syntactic patterns, Extraction of WordNet relations. DOI: 10.7176/CEIS/11-3-01 Publication date:May 31st 202

    Ekstraksi Relasi Meronymy dengan Lexico-Syntactic Patterns

    Get PDF
    Ontologi terdiri atas konsep dan relasi yang masing-masing dapat diekstrak dengan berbagai macam metode. Salah satu metode yang dapat digunakan untuk ekstraksi relasi adalah metode berdasarkan Lexico-Syntactic Patterns. Secara sederhana, ekstraksi relasi dilakukan dengan mendapatkan sebuah pola yang menunjukkan sebuah relasi. Kemudian dilakukan percobaan untuk menguji apakah pola yang didapatkan mampu memprediksi relasi dengan tepat. Pada penelitian ini dilakukan percobaan untuk menguji pola relasi meronymy yang didapatkan dari dataset penelitian terdahulu. Evaluasi dilakukan dengan menggunakan nilai recall dan precision. Dari penelitian ini, ditemukan bahwa banyaknya (keragaman) variasi dalam sekumpulan pola yang menunjukkan suatu relasi dapat mempengaruhi kemampuan kumpulan pola tersebut untuk memprediksi relasi dengan tepat. Semakin banyak variasi pola dalam satu relasi, maka ketepatan prediksi cenderung menurun

    Π’Π΅ΠΊΡ‚ΠΎΡ€Π½ΠΎΠ΅ прСдставлСниС слов с сСмантичСскими ΠΎΡ‚Π½ΠΎΡˆΠ΅Π½ΠΈΡΠΌΠΈ: ΡΠΊΡΠΏΠ΅Ρ€ΠΈΠΌΠ΅Π½Ρ‚Π°Π»ΡŒΠ½Ρ‹Π΅ наблюдСния

    Get PDF
    The ability to identify semantic relations between words has made a word2vec model widely used in NLP tasks. The idea of word2vec is based on a simple rule that a higher similarity can be reached if two words have a similar context. Each word can be represented as a vector, so the closest coordinates of vectors can be interpreted as similar words. It allows to establish semantic relations (synonymy, relations of hypernymy and hyponymy and other semantic relations) by applying an automatic extraction. The extraction of semantic relations by hand is considered as a time-consuming and biased task, requiring a large amount of time and some help of experts. Unfortunately, the word2vec model provides an associative list of words which does not consist of relative words only. In this paper, we show some additional criteria that may be applicable to solve this problem. Observations and experiments with well-known characteristics, such as word frequency, a position in an associative list, might be useful for improving results for the task of extraction of semantic relations for the Russian language by using word embedding. In the experiments, the word2vec model trained on the Flibusta and pairs from Wiktionary are used as examples with semantic relationships. Semantically related words are applicable to thesauri, ontologies and intelligent systems for natural language processing.Π’ΠΎΠ·ΠΌΠΎΠΆΠ½ΠΎΡΡ‚ΡŒ ΠΈΠ΄Π΅Π½Ρ‚ΠΈΡ„ΠΈΠΊΠ°Ρ†ΠΈΠΈ сСмантичСской близости ΠΌΠ΅ΠΆΠ΄Ρƒ словами сдСлала модСль word2vec ΡˆΠΈΡ€ΠΎΠΊΠΎ ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΡƒΠ΅ΠΌΠΎΠΉ Π² NLP-Π·Π°Π΄Π°Ρ‡Π°Ρ…. ИдСя word2vec основана Π½Π° контСкстной близости слов. КаТдоС слово ΠΌΠΎΠΆΠ΅Ρ‚ Π±Ρ‹Ρ‚ΡŒ прСдставлСно Π² Π²ΠΈΠ΄Π΅ Π²Π΅ΠΊΡ‚ΠΎΡ€Π°, Π±Π»ΠΈΠ·ΠΊΠΈΠ΅ ΠΊΠΎΠΎΡ€Π΄ΠΈΠ½Π°Ρ‚Ρ‹ Π²Π΅ΠΊΡ‚ΠΎΡ€ΠΎΠ² ΠΌΠΎΠ³ΡƒΡ‚ Π±Ρ‹Ρ‚ΡŒ ΠΈΠ½Ρ‚Π΅Ρ€ΠΏΡ€Π΅Ρ‚ΠΈΡ€ΠΎΠ²Π°Π½Ρ‹ ΠΊΠ°ΠΊ Π±Π»ΠΈΠ·ΠΊΠΈΠ΅ ΠΏΠΎ смыслу слова. Π’Π°ΠΊΠΈΠΌ ΠΎΠ±Ρ€Π°Π·ΠΎΠΌ, ΠΈΠ·Π²Π»Π΅Ρ‡Π΅Π½ΠΈΠ΅ сСмантичСских ΠΎΡ‚Π½ΠΎΡˆΠ΅Π½ΠΈΠΉ (ΠΎΡ‚Π½ΠΎΡˆΠ΅Π½ΠΈΠ΅ синонимии, Ρ€ΠΎΠ΄ΠΎ-Π²ΠΈΠ΄ΠΎΠ²Ρ‹Π΅ ΠΎΡ‚Π½ΠΎΡˆΠ΅Π½ΠΈΡ ΠΈ Π΄Ρ€ΡƒΠ³ΠΈΠ΅) ΠΌΠΎΠΆΠ΅Ρ‚ Π±Ρ‹Ρ‚ΡŒ Π°Π²Ρ‚ΠΎΠΌΠ°Ρ‚ΠΈΠ·ΠΈΡ€ΠΎΠ²Π°Π½ΠΎ. УстановлСниС сСмантичСских ΠΎΡ‚Π½ΠΎΡˆΠ΅Π½ΠΈΠΉ Π²Ρ€ΡƒΡ‡Π½ΡƒΡŽ считаСтся Ρ‚Ρ€ΡƒΠ΄ΠΎΠ΅ΠΌΠΊΠΎΠΉ ΠΈ Π½Π΅ΠΎΠ±ΡŠΠ΅ΠΊΡ‚ΠΈΠ²Π½ΠΎΠΉ Π·Π°Π΄Π°Ρ‡Π΅ΠΉ, Ρ‚Ρ€Π΅Π±ΡƒΡŽΡ‰Π΅ΠΉ большого количСства Π²Ρ€Π΅ΠΌΠ΅Π½ΠΈ ΠΈ привлСчСния экспСртов. Но срСди ассоциативных слов, сформированных с использованиСм ΠΌΠΎΠ΄Π΅Π»ΠΈ word2vec, Π²ΡΡ‚Ρ€Π΅Ρ‡Π°ΡŽΡ‚ΡΡ слова, Π½Π΅ ΠΏΡ€Π΅Π΄ΡΡ‚Π°Π²Π»ΡΡŽΡ‰ΠΈΠ΅ Π½ΠΈΠΊΠ°ΠΊΠΈΡ… ΠΎΡ‚Π½ΠΎΡˆΠ΅Π½ΠΈΠΉ с Π³Π»Π°Π²Π½Ρ‹ΠΌ словом, для ΠΊΠΎΡ‚ΠΎΡ€ΠΎΠ³ΠΎ Π±Ρ‹Π» прСдставлСн ассоциативный ряд. Π’ Ρ€Π°Π±ΠΎΡ‚Π΅ Ρ€Π°ΡΡΠΌΠ°Ρ‚Ρ€ΠΈΠ²Π°ΡŽΡ‚ΡΡ Π΄ΠΎΠΏΠΎΠ»Π½ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ ΠΊΡ€ΠΈΡ‚Π΅Ρ€ΠΈΠΈ, ΠΊΠΎΡ‚ΠΎΡ€Ρ‹Π΅ ΠΌΠΎΠ³ΡƒΡ‚ Π±Ρ‹Ρ‚ΡŒ ΠΏΡ€ΠΈΠΌΠ΅Π½ΠΈΠΌΡ‹ для Ρ€Π΅ΡˆΠ΅Π½ΠΈΡ Π΄Π°Π½Π½ΠΎΠΉ ΠΏΡ€ΠΎΠ±Π»Π΅ΠΌΡ‹. НаблюдСния ΠΈ ΠΏΡ€ΠΎΠ²Π΅Π΄Π΅Π½Π½Ρ‹Π΅ экспСримСнты с общСизвСстными характСристиками, Ρ‚Π°ΠΊΠΈΠΌΠΈ ΠΊΠ°ΠΊ частота слов, позиция Π² ассоциативном ряду, ΠΌΠΎΠ³ΡƒΡ‚ Π±Ρ‹Ρ‚ΡŒ ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΠΎΠ²Π°Π½Ρ‹ для ΡƒΠ»ΡƒΡ‡ΡˆΠ΅Π½ΠΈΡ Ρ€Π΅Π·ΡƒΠ»ΡŒΡ‚Π°Ρ‚ΠΎΠ² ΠΏΡ€ΠΈ Ρ€Π°Π±ΠΎΡ‚Π΅ с Π²Π΅ΠΊΡ‚ΠΎΡ€Π½Ρ‹ΠΌ прСдставлСниСм слов Π² части опрСдСлСния сСмантичСских ΠΎΡ‚Π½ΠΎΡˆΠ΅Π½ΠΈΠΉ для русского языка. Π’ экспСримСнтах ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΡƒΠ΅Ρ‚ΡΡ обучСнная Π½Π° корпусах Ѐлибусты модСль word2vec ΠΈ Ρ€Π°Π·ΠΌΠ΅Ρ‡Π΅Π½Π½Ρ‹Π΅ Π΄Π°Π½Π½Ρ‹Π΅ Викисловаря Π² качСствС ΠΎΠ±Ρ€Π°Π·Ρ†ΠΎΠ²Ρ‹Ρ… ΠΏΡ€ΠΈΠΌΠ΅Ρ€ΠΎΠ², Π² ΠΊΠΎΡ‚ΠΎΡ€Ρ‹Ρ… ΠΎΡ‚Ρ€Π°ΠΆΠ΅Π½Ρ‹ сСмантичСскиС ΠΎΡ‚Π½ΠΎΡˆΠ΅Π½ΠΈΡ. БСмантичСски связанныС слова (ΠΈΠ»ΠΈ Ρ‚Π΅Ρ€ΠΌΠΈΠ½Ρ‹) нашли своС ΠΏΡ€ΠΈΠΌΠ΅Π½Π΅Π½ΠΈΠ΅ Π² тСзаурусах, онтологиях, ΠΈΠ½Ρ‚Π΅Π»Π»Π΅ΠΊΡ‚ΡƒΠ°Π»ΡŒΠ½Ρ‹Ρ… систСмах для ΠΎΠ±Ρ€Π°Π±ΠΎΡ‚ΠΊΠΈ СстСствСнного языка

    ASPER: Attention-based Approach to Extract Syntactic Patterns denoting Semantic Relations in Sentential Context

    Get PDF
    Semantic relationships, such as hyponym-hypernym, cause-effect, meronym-holonym etc., between a pair of entities in a sentence are usually reflected through syntactic patterns. Automatic extraction of such patterns benefits several downstream tasks, including, entity extraction, ontology building, and question answering. Unfortunately, automatic extraction of such patterns has not yet received much attention from NLP and information retrieval researchers. In this work, we propose an attentionbased supervised deep learning model, ASPER, which extracts syntactic patterns between entities exhibiting a given semantic relation in the sentential context. We validate the performance of ASPER on three distinct semantic relationsβ€”hyponym-hypernym, cause-effect, and meronym-holonym on six datasets. Experimental results show that for all these semantic relations, ASPER can automatically identify a collection of syntactic patterns reflecting the existence of such a relation between a pair of entities in a sentence. In comparison to the existing methodologies of syntactic pattern extraction, ASPER’s performance is substantially superior

    Design of an E-learning system using semantic information and cloud computing technologies

    Get PDF
    Humanity is currently suffering from many difficult problems that threaten the life and survival of the human race. It is very easy for all mankind to be affected, directly or indirectly, by these problems. Education is a key solution for most of them. In our thesis we tried to make use of current technologies to enhance and ease the learning process. We have designed an e-learning system based on semantic information and cloud computing, in addition to many other technologies that contribute to improving the educational process and raising the level of students. The design was built after much research on useful technology, its types, and examples of actual systems that were previously discussed by other researchers. In addition to the proposed design, an algorithm was implemented to identify topics found in large textual educational resources. It was tested and proved to be efficient against other methods. The algorithm has the ability of extracting the main topics from textual learning resources, linking related resources and generating interactive dynamic knowledge graphs. This algorithm accurately and efficiently accomplishes those tasks even for bigger books. We used Wikipedia Miner, TextRank, and Gensim within our algorithm. Our algorithmβ€˜s accuracy was evaluated against Gensim, largely improving its accuracy. Augmenting the system design with the implemented algorithm will produce many useful services for improving the learning process such as: identifying main topics of big textual learning resources automatically and connecting them to other well defined concepts from Wikipedia, enriching current learning resources with semantic information from external sources, providing student with browsable dynamic interactive knowledge graphs, and making use of learning groups to encourage students to share their learning experiences and feedback with other learners.Programa de Doctorado en IngenierΓ­a TelemΓ‘tica por la Universidad Carlos III de MadridPresidente: Luis SΓ‘nchez FernΓ‘ndez.- Secretario: Luis de la Fuente ValentΓ­n.- Vocal: Norberto FernΓ‘ndez GarcΓ­
    corecore