
    B-BabelNet: Business-Specific Lexical Database for Improving Semantic Analysis of Business Process Models

    Similarity calculation between business process models plays an important role in managing a repository of business process models. One of its uses is to facilitate searching for models in the repository. Business process similarity is closely related to semantic string similarity, which is usually computed with the help of a lexical database such as WordNet to find the semantic meaning of a word. However, activity names in business processes use terms that are specific to the business field, and most of these terms are not available in WordNet, which degrades the quality of semantic analysis of business process models. Therefore, this study aims to improve the semantic analysis of business process models. We present a new lexical database called B-BabelNet. B-BabelNet is built using the same method as BabelNet: we map Wikipedia pages to the WordNet database, but focus only on words related to the business domain. To further enrich the business vocabulary, we also use terms from a business-specific online dictionary (businessdictionary.com). We use this database for word sense disambiguation of the terms in business process model activities. The results show that the database increases the accuracy of word sense disambiguation, especially for terms related to the business and industrial domains.
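
    As a rough illustration of the kind of dictionary-based disambiguation described above, the sketch below picks the sense whose gloss overlaps most with the surrounding activity text (a simplified Lesk heuristic). The sense inventory and the `BUSINESS_SENSES` structure are made up for the example; they are not the actual B-BabelNet data model or API.

```python
# Minimal sketch of gloss-overlap (Lesk-style) word sense disambiguation against a
# small in-memory lexical database. The sense inventory below is hypothetical; the
# paper's B-BabelNet merges WordNet, Wikipedia, and businessdictionary.com entries.

def tokenize(text):
    return set(text.lower().split())

# Hypothetical sense inventory: word -> list of (sense_id, gloss)
BUSINESS_SENSES = {
    "order": [
        ("order.commerce", "a request by a customer to purchase goods or services"),
        ("order.sequence", "an arrangement of items in a particular succession"),
    ],
}

def disambiguate(word, context):
    """Pick the sense whose gloss shares the most tokens with the context."""
    context_tokens = tokenize(context)
    best_sense, best_overlap = None, -1
    for sense_id, gloss in BUSINESS_SENSES.get(word, []):
        overlap = len(tokenize(gloss) & context_tokens)
        if overlap > best_overlap:
            best_sense, best_overlap = sense_id, overlap
    return best_sense

# Example: an activity name from a business process model.
print(disambiguate("order", "check customer purchase order before shipping goods"))
# -> order.commerce
```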

    Knowledge-Based Method for Word Sense Disambiguation by Using Hindi WordNet

    The purpose of word sense disambiguation (WSD) is to determine, with the help of a computer, the proper meaning of a word (lexeme) in a given context, using the relationships between lexical items. This is done using natural language processing (NLP) techniques, which involve queries from machine translation (MT), NLP-specific documents, or output text. MT automatically translates text from one natural language into another. Application areas for WSD include information retrieval (IR), lexicography, MT, text processing, and speech processing. In this article, we investigate Hindi WSD using a knowledge-based technique, which incorporates word knowledge from external knowledge resources to resolve the ambiguity of words. In this experiment, we developed a WSD tool that takes a knowledge-based approach with the Hindi WordNet. The tool uses the knowledge-based Lesk algorithm for Hindi WSD. Our proposed system achieves an accuracy of about 71.4%.
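
    For readers unfamiliar with the Lesk algorithm, the snippet below shows a comparable knowledge-based disambiguation step using NLTK's simplified Lesk over the English WordNet as a stand-in; the tool described above works with the Hindi WordNet instead, and the example sentence is purely illustrative (the NLTK WordNet data must be downloaded first).

```python
# Illustrative use of a knowledge-based Lesk disambiguator. The paper works with
# Hindi WordNet; as a stand-in, this sketch uses NLTK's English WordNet and its
# built-in simplified Lesk (requires: pip install nltk).

import nltk
from nltk.wsd import lesk

nltk.download("wordnet", quiet=True)
nltk.download("omw-1.4", quiet=True)

context = "I went to the bank to deposit my money".split()
sense = lesk(context, "bank", pos="n")   # pick the synset whose gloss best overlaps the context
print(sense, "-", sense.definition() if sense else "no sense found")
```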

    A new graph based text segmentation using Wikipedia for automatic text summarization

    Two models have been developed for simulating CO₂ emissions from wheat farms: (1) an artificial neural network (ANN) model; and (2) a multiple linear regression (MLR) model. Data were collected from 40 wheat farms in the Canterbury region of New Zealand. Investigation of more than 140 factors enabled the selection of eight factors to be employed as the independent variables for the final ANN model. The results showed that the final ANN can forecast CO₂ emissions from wheat production areas under different farm conditions (proportion of wheat-cultivated land on the farm, number of irrigation applications and number of cows), machinery conditions (tractor power index (hp/ha) and age of fertilizer spreader) and N, P and insecticide inputs, with an accuracy of ±11% (±113 kg CO₂/ha). The total CO₂ emissions from farm inputs were estimated at 1032 kg CO₂/ha for wheat production. On average, fertilizer use (52%) and fuel use (around 20%) account for the highest CO₂ emissions in wheat cultivation. The results confirmed that the ANN model forecasts CO₂ emissions much better than the MLR model.
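
    A minimal sketch of the ANN-versus-MLR comparison is given below using scikit-learn; the eight predictors and the target values are synthetic placeholders, and `MLPRegressor` merely stands in for whatever network architecture the authors actually trained on the 40-farm dataset.

```python
# Minimal sketch of comparing an ANN with multiple linear regression for CO2
# prediction. The eight input features mirror the count of factors named in the
# abstract, but the data here is synthetic; the study used farm measurements.

import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPRegressor
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
X = rng.uniform(0, 1, size=(200, 8))                               # 8 farm-level predictors (synthetic)
y = 1000 + 300 * X[:, 0] + 150 * X[:, 3] + rng.normal(0, 50, 200)  # kg CO2/ha (synthetic)

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
scaler = StandardScaler().fit(X_train)

ann = MLPRegressor(hidden_layer_sizes=(16,), max_iter=5000, random_state=0)
ann.fit(scaler.transform(X_train), y_train)

mlr = LinearRegression().fit(X_train, y_train)

print("ANN R^2:", ann.score(scaler.transform(X_test), y_test))
print("MLR R^2:", mlr.score(X_test, y_test))
```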

    Semantic Annotation of Unstructured Documents Using Concepts Similarity


    Automated Text Summarization Based on Lexical Chains and Graphs Using WordNet and Wikipedia Knowledge Base

    The technology of automatic document summarization is maturing and may provide a solution to the information overload problem. Nowadays, document summarization plays an important role in information retrieval. With a large volume of documents, presenting the user with a summary of each document greatly facilitates the task of finding the desired documents. Document summarization is the process of automatically creating a compressed version of a given document that provides useful information to users; multi-document summarization produces a summary delivering the majority of the information content from a set of documents about an explicit or implicit main topic. The lexical cohesion structure of a text can be exploited to determine the importance of a sentence or phrase, and lexical chains are useful tools for analyzing this structure. In this paper we consider the effect of using lexical cohesion features in summarization and present an algorithm based on a knowledge base. Our algorithm first finds the correct sense of each word, then constructs the lexical chains, removes chains that score lower than the others, roughly detects topics from the lexical chains, segments the text with respect to the topics, and selects the most important sentences. Experimental results on the open benchmark datasets DUC01 and DUC02 show that our proposed approach can improve performance compared to state-of-the-art summarization approaches.
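
    The sketch below walks through the chain-then-select pipeline in miniature: build chains, keep the strong ones, and score sentences by their strong-chain members. To stay self-contained it treats identical words as "related", whereas the approach described above links senses through WordNet and Wikipedia after disambiguation, so this is only a structural analogue.

```python
# Minimal sketch of the chain-then-select summarization pipeline. Real lexical
# chains link words via WordNet/Wikipedia relations after word sense
# disambiguation; here, words are "related" simply when they are identical.

from collections import defaultdict

def build_chains(sentences):
    """Group occurrences of each (lowercased) content-like token into a chain."""
    chains = defaultdict(list)                      # word -> [sentence indices]
    for i, sent in enumerate(sentences):
        for word in sent.lower().split():
            word = word.strip(".,")
            if len(word) > 3:                       # crude content-word filter
                chains[word].append(i)
    return chains

def summarize(sentences, n=1):
    chains = build_chains(sentences)
    # Chain score: its length; keep only "strong" chains (at or above the mean score).
    mean_len = sum(len(c) for c in chains.values()) / len(chains)
    strong = {w for w, c in chains.items() if len(c) >= mean_len}
    # Sentence score: how many strong-chain members it contains.
    scored = [(sum(1 for w in s.lower().split() if w.strip(".,") in strong), i, s)
              for i, s in enumerate(sentences)]
    top = sorted(scored, reverse=True)[:n]
    return [s for _, _, s in sorted(top, key=lambda t: t[1])]   # restore original order

doc = ["The bank approved the loan.",
       "A loan from this bank requires collateral.",
       "It rained all afternoon."]
print(summarize(doc, n=1))
```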

    Semantic Sort: A Supervised Approach to Personalized Semantic Relatedness

    We propose and study a novel supervised approach to learning statistical semantic relatedness models from subjectively annotated training examples. The proposed semantic model consists of parameterized co-occurrence statistics associated with textual units of a large background knowledge corpus. We present an efficient algorithm for learning such semantic models from a training sample of relatedness preferences. Our method is corpus independent and can essentially rely on any sufficiently large (unstructured) collection of coherent texts. Moreover, the approach facilitates the fitting of semantic models for specific users or groups of users. We present the results of an extensive range of experiments, from small to large scale, indicating that the proposed method is effective and competitive with the state of the art. Comment: 37 pages, 8 figures. A short version of this paper was already published at ECML/PKDD 201
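
    To make the idea of learning from relatedness preferences concrete, the toy sketch below fits one weight per context word with perceptron-style updates so that preferred pairs score higher; the co-occurrence counts, feature definition, and update rule are illustrative assumptions, not the paper's actual model or training algorithm.

```python
# Toy sketch of learning a relatedness model from preference examples: each
# training item says that pair (a, b) should be more related than pair (c, d).
# Features are tiny co-occurrence counts; the paper learns parameterized
# co-occurrence statistics over a large corpus, so this is only an analogue.

import numpy as np

def features(word1, word2, cooc, vocab):
    """One feature per context word: min co-occurrence of the pair with that context."""
    return np.array([min(cooc[word1].get(c, 0), cooc[word2].get(c, 0)) for c in vocab],
                    dtype=float)

def train(preferences, cooc, vocab, epochs=10, lr=0.1):
    w = np.zeros(len(vocab))
    for _ in range(epochs):
        for (a, b), (c, d) in preferences:            # (a, b) judged more related than (c, d)
            diff = features(a, b, cooc, vocab) - features(c, d, cooc, vocab)
            if w @ diff <= 0:                         # violated preference -> perceptron update
                w += lr * diff
    return w

# Toy co-occurrence counts (word -> {context word: count}) and one annotated preference.
cooc = {"cat": {"pet": 3, "fur": 2}, "dog": {"pet": 4, "fur": 1}, "car": {"road": 5}}
vocab = ["pet", "fur", "road"]
prefs = [(("cat", "dog"), ("cat", "car"))]

w = train(prefs, cooc, vocab)
score = lambda x, y: w @ features(x, y, cooc, vocab)
print(score("cat", "dog") > score("cat", "car"))      # True after training
```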