Search CORE

5,322 research outputs found

Proceedings of the International Workshop on Text Mining Research, Practice and Opportunities

Author
Publication venue
Publication date: 24/09/2005
Field of study

The University of Manchester - Institutional Repository

Design of a Controlled Language for Critical Infrastructures Protection

Author: CANTARELLA SIMONA
FERIGATO Carlo
OWUSU EVANS BOATENG
Publication venue: European Language Resources Association
Publication date: 28/03/2012
Field of study

We describe a project for the construction of controlled language for critical infrastructures protection (CIP). This project originates from the need to coordinate and categorize the communications on CIP at the European level. These communications can be physically represented by official documents, reports on incidents, informal communications and plain e-mail. We explore the application of traditional library science tools for the construction of controlled languages in order to achieve our goal. Our starting point is an analogous work done during the sixties in the field of nuclear science known as the Euratom Thesaurus.JRC.G.6-Security technology assessmen

JRC Publications Repository

Extraction of ontology and semantic web information from online business reports

Author: Simmons Lakisha L.
Publication venue: eGrove
Publication date: 01/01/2011
Field of study

CAINES, Content Analysis and INformation Extraction System, employs an information extraction (IE) methodology to extract unstructured text from the Web. It can create an ontology and a Semantic Web. This research is different from traditional IE systems in that CAINES examines the syntactic and semantic relationships within unstructured text of online business reports. Using CAINES provides more relevant results than manual searching or standard keyword searching. Over most extraction systems, CAINES extensively uses information extraction from natural language, Key Words in Context (KWIC), and semantic analysis. A total of 21 online business reports, averaging about 100 pages long, were used in this study. Based on financial expert opinions, extraction rules were created to extract information, an ontology, and a Semantic Web of data from financial reports. Using CAINES, one can extract information about global and domestic market conditions, market condition impacts, and information about the business outlook. A Semantic Web was created from Merrill Lynch reports, 107,533 rows of data, and displays information regarding mergers, acquisitions, and business segment news between 2007 and 2009. User testing of CAINES resulted in recall of 85.91%, precision of 87.16%, and an F-measure of 86.46%. Speed with CAINES was also greater than manually extracting information. Users agree that CAINES quickly and easily extracts unstructured information from financial reports on the EDGAR database

eGrove (Univ. of Mississippi)

Text mining with exploitation of user\u27s background knowledge : discovering novel association rules from text

Author: Chen Xin
Publication venue: Digital Commons @ NJIT
Publication date: 31/01/2006
Field of study

The goal of text mining is to find interesting and non-trivial patterns or knowledge from unstructured documents. Both objective and subjective measures have been proposed in the literature to evaluate the interestingness of discovered patterns. However, objective measures alone are insufficient because such measures do not consider knowledge and interests of the users. Subjective measures require explicit input of user expectations which is difficult or even impossible to obtain in text mining environments. This study proposes a user-oriented text-mining framework and applies it to the problem of discovering novel association rules from documents. The developed system, uMining, consists of two major components: a background knowledge developer and a novel association rules miner. The background knowledge developer learns a user\u27s background knowledge by extracting keywords from documents already known to the user (background documents) and developing a concept hierarchy to organize popular keywords. The novel association rule miner discovers association rules among noun phrases extracted from relevant documents (target documents) and compares the rules with the background knowledge to predict the rule novelty to the particular user (useroriented novelty). The user-oriented novelty measure is defined as the semantic distance between the antecedent and the consequent of a rule in the background knowledge. It consists of two components: occurrence distance and connection distance. The former considers the co-occurrences of two keywords in the background documents: the more the shorter the distance. The latter considers the common connections of with others in the concept hierarchy. It is defined as the length of the connecting the two keywords in the concept hierarchy: the longer the path, distance. The user-oriented novelty measure is evaluated from two perspectives: novelty prediction accuracy and usefulness indication power. The results show that the useroriented novelty measure outperforms the WordNet novelty measure and the compared objective measures in term of predicting novel rules and identifying useful rules

Digital Commons @ New Jersey Institute of Technology (NJIT)

Text Mining for Drug Discovery

Author: Piliouras Dimitrios
Publication venue
Publication date: 01/05/2014
Field of study

The University of Manchester - Institutional Repository

Ontology-based employer demand management

Author: Ashraf
Berners-Lee
Biesalski
Bizer
Chamber of Minerals and Energy WA
Chang
García-Sánchez
Green
Gruber
Guarino
Gómez-Pérez
Hepp
Kumaran
Maedche
Neches
O'Callaghan
Stanimirović
Storey
Studer
Terziev
Trichet
Wibisono
Wilcock
Wongthongtham
Publication venue: 'Wiley'
Publication date: 01/01/2015
Field of study

Skills shortages globally pose a real and urgent need for proper investigation and workforce development planning into the future. Analysing workforce development and employer demand needs through electronic job market allows much deeper and wider research into skill shortages. Current methods do not provide the level of depth required to address such important economic implications. In this paper, we present a system aiming to gather and analyse current employer demand information from online job advertisements. It identifies current employer demand needs analysed from electronic job market

Crossref

espace@Curtin

The role of Intellectual Capital Reporting (ICR) in organisational transformation: A discursive practice perspective

Author: Garcia-Lorenzo Lucia
Kourti Isidora
Yu Ai
Publication venue: 'Elsevier BV'
Publication date: 01/06/2017
Field of study

Intellectual Capital Reporting (ICR) has garnered increasing attention as a new accounting technology that can engender significant organisational changes. However, when ICR was first recognised as a management fashion, the intended change it heralded in stable environments was criticised for having limited impact on the state of practice. Conceiving ICR through a lens predicated on the notion of discursive practice, we argue that ICR can enable substantive change in emergent conditions. We empirically demonstrate this process by following the implementation of ICR in one organisation through interviews, documents and observations over 30 months. The qualitative analysis of the data corpus shows how situated change, subtle but no less significant, can take place in the name of intellectual capital as actors appropriate ICR into their everyday work practices while improvising variations to accommodate different logics of action. The paper opens up a new avenue to examine the specific roles of ICR in relation to the types of change enacted. It thus demonstrates when and how ICR may transcend a mere management fashion and the intended change it sets in motion through altering organisational actors’ ways of thinking and doing within the confines of their organisation

Goldsmiths Research Online

Southampton (e-Prints Soton)

LSE Research Online

Open Research Online (The Open University)

Characterization of near death experiences using text mining analyses: A preliminary study

Author: Antonopoulos Georgios
Cassol Helena
Charland-Verville Vanessa
Chronik Blaine Alexander
de Paula Demetrius Ribeiro
Laureys Steven
Martial Charlotte
Soddu Andrea
Publication venue: Scholarship@Western
Publication date: 01/01/2020
Field of study

The notion that death represents a passing to an afterlife, where we are reunited with loved ones and live eternally in a utopian paradise, is common in the reports of people who have encountered a “Near-Death Experience” (NDE). NDEs are thoroughly portrayed by the media but empirical studies are rather recent. The definition of the phenomenon as well as the identification of NDE experiencers is still a matter of debate. To date, NDEs’ identification and description in studies have mostly derived from answered items in questionnaires. However, questionnaires’ content could be restricting and subject to personal interpretation. We believe that in addition to their use, user-independent statistical text examination of freely expressed NDEs narratives is of prior importance to help capture the phenomenology of such a subjective and complex phenomenon. Towards that aim, we included 158 participants with a firsthand retrospective narrative of their self-reported NDE that we analyzed using an automated text-mining method. The output revealed the top words expressed by experiencers. In a second step, a hierarchical clustering analysis was conducted to visualize the relationships between these words. It revealed three main clusters of features: visual perceptions, emotions and spatial components. We believe the user-independent and data-driven text mining approach used in this study is promising by contributing to the building a rigorous description and definition of NDEs

Scholarship@Western

Directory of Open Access Journals

Open Repository and Bibliography - Liège