Search CORE

1,573 research outputs found

Evolving fuzzy grammar for crime texts categorization

Author: Martin Trevor
Mohd Sharef Nurfadhlina
Publication venue: 'Elsevier BV'
Publication date: 01/03/2015
Field of study

Text mining refers to the activity of identifying useful information from natural language text. This is one of the criteria practiced in automated text categorization. Machine learning (ML) based methods are the popular solution for this problem. However, the developed models typically provide low expressivity and lacking in human-understandable representation. In spite of being highly efficient, the ML based methods are established in train–test setting, and when the existing model is found insufficient, the whole processes need to be reinvented which implies train–test–retrain and is typically time consuming. Furthermore, retraining the model is not usually practical and feasible option whenever there is continuous change. This paper introduces the evolving fuzzy grammar (EFG) method for crime texts categorization. In this method, the learning model is built based on a set of selected text fragments which are then transformed into their underlying structure called fuzzy grammars. The fuzzy notion is used because the grammar matching, parsing and derivation involve uncertainty. Fuzzy union operator is also used to combine and transform individual text fragment grammars into more general representations of the learned text fragments. The set of learned fuzzy grammars is influenced by the evolution in the seen pattern; the learned model is slightly changed (incrementally) as adaptation, which does not require the conventional redevelopment. The performance of EFG in crime texts categorization is evaluated against expert-tagged real incidents summaries and compared against C4.5, support vector machines, naïve Bayes, boosting, and k-nearest neighbour methods. Results show that the EFG algorithm produces results that are close in performance with the other ML methods while being highly interpretable, easily integrated into a more comprehensive grammar system and with lower model retraining adaptability time

Crossref

Universiti Putra Malaysia Institutional Repository

Explore Bristol Research

Bibliometric Survey on Incremental Learning in Text Classification Algorithms for False Information Detection

Author: Barve Yashoda Narayanprasad, Mrs.
Mulay Preeti, Dr.
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 23/11/2020
Field of study

The false information or misinformation over the web has severe effects on people, business and society as a whole. Therefore, detection of misinformation has become a topic of research among many researchers. Detecting misinformation of textual articles is directly connected to text classification problem. With the massive and dynamic generation of unstructured textual documents over the web, incremental learning in text classification has gained more popularity. This survey explores recent advancements in incremental learning in text classification and review the research publications of the area from Scopus, Web of Science, Google Scholar, and IEEE databases and perform quantitative analysis by using methods such as publication statistics, collaboration degree, research network analysis, and citation analysis. The contribution of this study in incremental learning in text classification provides researchers insights on the latest status of the research through literature survey, and helps the researchers to know the various applications and the techniques used recently in the field

DigitalCommons@University of Nebraska

Exploring Text Mining and Analytics for Applications in Public Security: An in-depth dive into a systematic literature review

Author: Carvalho Victor Diogho Heuer de
Costa Ana Paula Cabral Seixas
Publication venue: SciELO Preprints
Publication date: 19/01/2023
Field of study

Text mining and related analytics emerge as a technological approach to support human activities in extracting useful knowledge through texts in several formats. From a managerial point of view, it can help organizations in planning and decision-making processes, providing information that was not previously evident through textual materials produced internally or even externally. In this context, within the public/governmental scope, public security agencies are great beneficiaries of the tools associated with text mining, in several aspects, from applications in the criminal area to the collection of people's opinions and sentiments about the actions taken to promote their welfare. This article reports details of a systematic literature review focused on identifying the main areas of text mining application in public security, the most recurrent technological tools, and future research directions. The searches covered four major article bases (Scopus, Web of Science, IEEE Xplore, and ACM Digital Library), selecting 194 materials published between 2014 and the first half of 2021, among journals, conferences, and book chapters. There were several findings concerning the targets of the literature review, as presented in the results of this article

SciELO Preprints

L’INTELLECT INCARNÉ: Sur les interprétations computationnelles, évolutives et philosophiques de la connaissance

Author: KLAUS MAINZER
Publication venue: 'Croatian Philosophical Society (Hrvatsko filozofsko drustvo)'
Publication date: 01/01/2005
Field of study

Modern cognitive science cannot be understood without recent developments in computer science, artificial intelligence (AI), robotics, neuroscience, biology, linguistics, and psychology. Classic analytic philosophy as well as traditional AI assumed that all kinds of knowledge must eplicitly be represented by formal or programming languages. This assumption is in contradiction to recent insights into the biology of evolution and developmental psychology of the human organism. Most of our knowledge is implicit and unconscious. It is not formally represented, but embodied knowledge which is learnt by doing and understood by bodily interacting with ecological niches and social environments. That is true not only for low-level skills, but even for high-level domains of categorization, language, and abstract thinking. Embodied cognitive science, AI, and robotics try to build the embodied mind in an artificial evolution. From a philosophical point of view, it is amazing that the new ideas of embodied mind and robotics have deep roots in 20th-century philosophy.Die moderne Kognitionswissenschaft kann nicht verstanden werden ohne Einbeziehung der neuesten Errungenschaften aus der Computerwissenschaft, künstlichen Intelligenz (AI), Robotik, Neurowissenschaft, Biologie, Linguistik und Psychologie. Die klassische analytische Philosophie, wie auch die traditionelle AI, setzten voraus, dass alle Arten des Wissens explizit durch formale oder Programmsprachen dargestellt werden müssen. Diese Annahme steht im Widerspruch zu den rezenten Einsichten in die Evolutionsbiologie und Entwicklungspsychologie des menschlichen Organismus. Der größte Teil unseres Wissens ist implizit und unbewusst. Es ist kein formal repräsentiertes, sondern ein verkörpertes Wissen, das durch Handeln gelernt und durch körperliche Interaktion mit ökologischen Nischen und gesellschaftlichen Umgebungen verstanden wird. Dies gilt nicht nur für niedere Fertigkeiten, sondern auch für höher gestellte Domänen: Kategorisierung, Sprache und abstraktes Denken. Die verkörperte Erkenntniswissenschaft, AI und Robotik versuchen, den verkörperten Geist in einer artifiziellen Evolution zu bilden. Vom philosophischen Standpunkt gesehen ist es erstaunlich, wie tief die neuen Ideen des verkörperten Geistes und der Robotik in der Philosophie des 20. Jahrhunderts verankert sind.La science cognitive moderne ne peut être comprise sans les progrès récents en informatique, intelligence artificielle, robotique, neuroscience, biologie, linguistique et psychologie. La philosophie analytique classique et l’intelligence artificielle traditionnelle présumaient que toutes les sortes de savoir devaient être représentées explicitement par des langages formels ou programmatiques. Cette thèse est en contradiction avec les découvertes récentes en biologie de l’évolution et en psychologie évolutive de l’organisme humain. La majeure partie de notre savoir est implicite et inconsciente. Elle n’est pas représentée formellement, mais constitue un savoir incarné, qui s’acquiert par l’action et se comprend en interaction corporelle avec nos niches écologiques et nos environnements sociaux. Cela n’est pas seulement vrai pour nos aptitudes élémentaires, mais aussi pour nos facultés supérieures de catégorisation, de langage et de pensée abstraite. Science cognitive incarnée, l’intelligence artificielle, ainsi que la robotique, tentent de construire un intellect incarné en évolution artificielle. Du point de vue philosophique, il est admirable de voir à quel point les nouvelles idées d’intellect incarné et de robotique sont ancrées dans la philosophie du XXe siècle

HRČAK - Portal of Croatian Scientific and Professional Journals

Hrčak - Portal of scientific journals of Croatia

Proceedings of the International Workshop on Text Mining Research, Practice and Opportunities

Author
Publication venue
Publication date: 24/09/2005
Field of study

The University of Manchester - Institutional Repository

Text Classification: A Review, Empirical, and Experimental Evaluation

Author: Taha Aya
Taha Kamal
Yeun Chan
Yoo Paul D.
Publication venue
Publication date: 11/01/2024
Field of study

The explosive and widespread growth of data necessitates the use of text classification to extract crucial information from vast amounts of data. Consequently, there has been a surge of research in both classical and deep learning text classification methods. Despite the numerous methods proposed in the literature, there is still a pressing need for a comprehensive and up-to-date survey. Existing survey papers categorize algorithms for text classification into broad classes, which can lead to the misclassification of unrelated algorithms and incorrect assessments of their qualities and behaviors using the same metrics. To address these limitations, our paper introduces a novel methodological taxonomy that classifies algorithms hierarchically into fine-grained classes and specific techniques. The taxonomy includes methodology categories, methodology techniques, and methodology sub-techniques. Our study is the first survey to utilize this methodological taxonomy for classifying algorithms for text classification. Furthermore, our study also conducts empirical evaluation and experimental comparisons and rankings of different algorithms that employ the same specific sub-technique, different sub-techniques within the same technique, different techniques within the same category, and categorie

arXiv.org e-Print Archive

Development of Adjectival Use and Meaning Structures in Swedish Students' Written production

Author: Löhndorf Simone
Publication venue: 'Lund University Library'
Publication date: 17/11/2021
Field of study

This thesis is about the development of adjective use and meaning structures examined from a cognitive linguistic perspective. Adjectives modify nominal meanings and it is in context, in the interaction with the noun that the adjective meaning and configuration is determined. Nearly 13,000 adjective-noun combinations from texts written by Swedish students in grades 3, 5, 9, and 11/12 were analysed according to the LOC model (Ontologies and Construals in Lexical Semantics, Paradis, 2005) with regard to domains, noun ontology, adjective gradability, adjective position, and adjective function. Furthermore, the use of figurative language was studied. The results show a development from adjectives predominantly modifying concrete nouns to increasingly abstract meanings from a broad range of adjective and noun domains. The younger students use adjectives predominantly in the predicative position but there is a gradual shift towards attributive use, and attributive uses are the most common in the highest grade. Adjectives are primarily used in a descriptive function, but in the highest grade approximately one third of all adjectives are used in a classifying function. Scalar adjective construal is the most common in all grades, but the proportion of scalar uses decreases in favour of an increase in non-gradable uses. Figurative language is rare in all grades, but there is an increase in metaphorical language over the school years

Lund University Publications

Information Technology and Lawyers. Advanced Technology in the Legal Domain, from Challenges to Daily Routine

Author
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2006
Field of study

VU Research Portal

Design of a Controlled Language for Critical Infrastructures Protection

Author: CANTARELLA SIMONA
FERIGATO Carlo
OWUSU EVANS BOATENG
Publication venue: European Language Resources Association
Publication date: 28/03/2012
Field of study

We describe a project for the construction of controlled language for critical infrastructures protection (CIP). This project originates from the need to coordinate and categorize the communications on CIP at the European level. These communications can be physically represented by official documents, reports on incidents, informal communications and plain e-mail. We explore the application of traditional library science tools for the construction of controlled languages in order to achieve our goal. Our starting point is an analogous work done during the sixties in the field of nuclear science known as the Euratom Thesaurus.JRC.G.6-Security technology assessmen

JRC Publications Repository

Eger Journal of English Studies (Vol. 15.)

Author: Antal Éva
Czeglédi Csaba
Publication venue: Eszterházy Károly Főiskola Líceum Kiadó
Publication date
Field of study

EKE Repository of Publications