1,573 research outputs found
Evolving fuzzy grammar for crime texts categorization
Text mining refers to the activity of identifying useful information from natural language text. This is one of the criteria practiced in automated text categorization. Machine learning (ML) based methods are the popular solution for this problem. However, the developed models typically provide low expressivity and lacking in human-understandable representation. In spite of being highly efficient, the ML based methods are established in train–test setting, and when the existing model is found insufficient, the whole processes need to be reinvented which implies train–test–retrain and is typically time consuming. Furthermore, retraining the model is not usually practical and feasible option whenever there is continuous change. This paper introduces the evolving fuzzy grammar (EFG) method for crime texts categorization. In this method, the learning model is built based on a set of selected text fragments which are then transformed into their underlying structure called fuzzy grammars. The fuzzy notion is used because the grammar matching, parsing and derivation involve uncertainty. Fuzzy union operator is also used to combine and transform individual text fragment grammars into more general representations of the learned text fragments. The set of learned fuzzy grammars is influenced by the evolution in the seen pattern; the learned model is slightly changed (incrementally) as adaptation, which does not require the conventional redevelopment. The performance of EFG in crime texts categorization is evaluated against expert-tagged real incidents summaries and compared against C4.5, support vector machines, naïve Bayes, boosting, and k-nearest neighbour methods. Results show that the EFG algorithm produces results that are close in performance with the other ML methods while being highly interpretable, easily integrated into a more comprehensive grammar system and with lower model retraining adaptability time
Bibliometric Survey on Incremental Learning in Text Classification Algorithms for False Information Detection
The false information or misinformation over the web has severe effects on people, business and society as a whole. Therefore, detection of misinformation has become a topic of research among many researchers. Detecting misinformation of textual articles is directly connected to text classification problem. With the massive and dynamic generation of unstructured textual documents over the web, incremental learning in text classification has gained more popularity. This survey explores recent advancements in incremental learning in text classification and review the research publications of the area from Scopus, Web of Science, Google Scholar, and IEEE databases and perform quantitative analysis by using methods such as publication statistics, collaboration degree, research network analysis, and citation analysis. The contribution of this study in incremental learning in text classification provides researchers insights on the latest status of the research through literature survey, and helps the researchers to know the various applications and the techniques used recently in the field
Exploring Text Mining and Analytics for Applications in Public Security: An in-depth dive into a systematic literature review
Text mining and related analytics emerge as a technological approach to support human activities in extracting useful knowledge through texts in several formats. From a managerial point of view, it can help organizations in planning and decision-making processes, providing information that was not previously evident through textual materials produced internally or even externally. In this context, within the public/governmental scope, public security agencies are great beneficiaries of the tools associated with text mining, in several aspects, from applications in the criminal area to the collection of people's opinions and sentiments about the actions taken to promote their welfare. This article reports details of a systematic literature review focused on identifying the main areas of text mining application in public security, the most recurrent technological tools, and future research directions. The searches covered four major article bases (Scopus, Web of Science, IEEE Xplore, and ACM Digital Library), selecting 194 materials published between 2014 and the first half of 2021, among journals, conferences, and book chapters. There were several findings concerning the targets of the literature review, as presented in the results of this article
L’INTELLECT INCARNÉ: Sur les interprétations computationnelles, évolutives et philosophiques de la connaissance
Modern cognitive science cannot be understood without recent developments in computer science, artificial intelligence (AI), robotics, neuroscience, biology, linguistics, and psychology. Classic analytic philosophy as well as traditional AI assumed that all kinds of knowledge must eplicitly be represented by formal or programming languages. This assumption is in contradiction to recent insights into the biology of evolution and developmental psychology of the human organism. Most of our knowledge is implicit and unconscious. It is not formally represented, but embodied knowledge which is learnt by doing and understood by bodily interacting with ecological niches and social environments. That is true not only for low-level skills, but even for high-level domains of categorization, language, and abstract thinking. Embodied cognitive science, AI, and robotics try to build the embodied mind in an artificial evolution. From a philosophical point of view, it is amazing that the new ideas of embodied mind and robotics have deep roots in 20th-century philosophy.Die moderne Kognitionswissenschaft kann nicht verstanden werden ohne Einbeziehung der neuesten Errungenschaften aus der Computerwissenschaft, künstlichen Intelligenz (AI), Robotik, Neurowissenschaft, Biologie, Linguistik und Psychologie. Die klassische analytische Philosophie, wie auch die traditionelle AI, setzten voraus, dass alle Arten des Wissens explizit durch formale oder Programmsprachen dargestellt werden müssen. Diese Annahme steht im Widerspruch zu den rezenten Einsichten in die Evolutionsbiologie und Entwicklungspsychologie des menschlichen Organismus. Der größte Teil unseres Wissens ist implizit und unbewusst. Es ist kein formal repräsentiertes, sondern ein verkörpertes Wissen, das durch Handeln gelernt und durch körperliche Interaktion mit ökologischen Nischen und gesellschaftlichen Umgebungen verstanden wird. Dies gilt nicht nur für niedere Fertigkeiten, sondern auch für höher gestellte Domänen: Kategorisierung, Sprache und abstraktes Denken. Die verkörperte Erkenntniswissenschaft, AI und Robotik versuchen, den verkörperten Geist in einer artifiziellen Evolution zu bilden. Vom philosophischen Standpunkt gesehen ist es erstaunlich, wie tief die neuen Ideen des verkörperten Geistes und der Robotik in der Philosophie des 20. Jahrhunderts verankert sind.La science cognitive moderne ne peut être comprise sans les progrès récents en informatique, intelligence artificielle, robotique, neuroscience, biologie, linguistique et psychologie. La philosophie analytique classique et l’intelligence artificielle traditionnelle présumaient que toutes les sortes de savoir devaient être représentées explicitement par des langages formels ou programmatiques. Cette thèse est en contradiction avec les découvertes récentes en biologie de l’évolution et en psychologie évolutive de l’organisme humain. La majeure partie de notre savoir est implicite et inconsciente. Elle n’est pas représentée formellement, mais constitue un savoir incarné, qui s’acquiert par l’action et se comprend en interaction corporelle avec nos niches écologiques et nos environnements sociaux. Cela n’est pas seulement vrai pour nos aptitudes élémentaires, mais aussi pour nos facultés supérieures de catégorisation, de langage et de pensée abstraite. Science cognitive incarnée, l’intelligence artificielle, ainsi que la robotique, tentent de construire un intellect incarné en évolution artificielle. Du point de vue philosophique, il est admirable de voir à quel point les nouvelles idées d’intellect incarné et de robotique sont ancrées dans la philosophie du XXe siècle
Text Classification: A Review, Empirical, and Experimental Evaluation
The explosive and widespread growth of data necessitates the use of text
classification to extract crucial information from vast amounts of data.
Consequently, there has been a surge of research in both classical and deep
learning text classification methods. Despite the numerous methods proposed in
the literature, there is still a pressing need for a comprehensive and
up-to-date survey. Existing survey papers categorize algorithms for text
classification into broad classes, which can lead to the misclassification of
unrelated algorithms and incorrect assessments of their qualities and behaviors
using the same metrics. To address these limitations, our paper introduces a
novel methodological taxonomy that classifies algorithms hierarchically into
fine-grained classes and specific techniques. The taxonomy includes methodology
categories, methodology techniques, and methodology sub-techniques. Our study
is the first survey to utilize this methodological taxonomy for classifying
algorithms for text classification. Furthermore, our study also conducts
empirical evaluation and experimental comparisons and rankings of different
algorithms that employ the same specific sub-technique, different
sub-techniques within the same technique, different techniques within the same
category, and categorie
Development of Adjectival Use and Meaning Structures in Swedish Students' Written production
This thesis is about the development of adjective use and meaning structures examined from a cognitive linguistic perspective. Adjectives modify nominal meanings and it is in context, in the interaction with the noun that the adjective meaning and configuration is determined. Nearly 13,000 adjective-noun combinations from texts written by Swedish students in grades 3, 5, 9, and 11/12 were analysed according to the LOC model (Ontologies and Construals in Lexical Semantics, Paradis, 2005) with regard to domains, noun ontology, adjective gradability, adjective position, and adjective function. Furthermore, the use of figurative language was studied. The results show a development from adjectives predominantly modifying concrete nouns to increasingly abstract meanings from a broad range of adjective and noun domains. The younger students use adjectives predominantly in the predicative position but there is a gradual shift towards attributive use, and attributive uses are the most common in the highest grade. Adjectives are primarily used in a descriptive function, but in the highest grade approximately one third of all adjectives are used in a classifying function. Scalar adjective construal is the most common in all grades, but the proportion of scalar uses decreases in favour of an increase in non-gradable uses. Figurative language is rare in all grades, but there is an increase in metaphorical language over the school years
Design of a Controlled Language for Critical Infrastructures Protection
We describe a project for the construction of controlled language for critical infrastructures protection (CIP). This project originates
from the need to coordinate and categorize the communications on CIP at the European level. These communications can be physically
represented by official documents, reports on incidents, informal communications and plain e-mail. We explore the application of
traditional library science tools for the construction of controlled languages in order to achieve our goal. Our starting point is an
analogous work done during the sixties in the field of nuclear science known as the Euratom Thesaurus.JRC.G.6-Security technology assessmen
- …