201,333 research outputs found

    Data integration with data warehousing and data mining in database environments

    Full text link
    The topics of data warehousing and data mining encompasses architectures, algorithms and tools for bringing together selected data from multiple databases or other information sources into a single repository called a data warehouse which is suitable for direct querying or analysis. The querying and analysis can be implemented with any of the data mining tools being developed. Data mining is the non-trivial extraction of implicit, previously unknown and potentially useful information from information sources; In this thesis, we will define a specific data warehousing architecture, its components and give an explanation of the responsibilities of the components in the data warehousing system are defined. The new data warehousing system and it components will also provide suitable topics for exploratory research into their implementation. We will also explain how data mining techniques will be used to extract data from multiple information sources to place data into the central data warehouse of the system and how data mining tools will be used to query and analysis the data warehouse system

    Artificial intelligence and data mining: algorithms and applications

    Get PDF
    Artificial intelligence and data mining techniques have been used in many domains to solve classification, segmentation, association, diagnosis, and prediction problems. The overall aim of this special issue is to open a discussion among researchers actively working on algorithms and applications. The issue covers a wide variety of problems for computational intelligence, machine learning, time series analysis, remote sensing image mining, and pattern recognition. After a rigorous peer review process, 20 papers have been selected from 38 submissions. The accepted papers in this issue addressed the following topics: (i) advanced artificial intelligence and data mining techniques; (ii) computational intelligence in dynamic and uncertain environments; (iii) machine learning on massive datasets; (iv) time series data analysis; (v) Spatial data mining: algorithms and applications

    Cluster Analysis in Online Learning Communities: A Text Mining Approach

    Get PDF
    This paper presents a theory-informed blueprint for mining unstructured text data using mixed- and multi-methods to improve understanding of collaboration in asynchronous online discussions (AOD). Grounded in a community of inquiry theoretical framework to systematically combine established research techniques, we investigated how AOD topics and individual reflections on those topics affect formation of clusters or groups in a community. The data for the investigation came from 54 participants and 470 messages. Data analysis combined the analytical efficiency and scalability of topic modeling, social network analysis, and cluster analysis with qualitative content analysis. The cluster analysis found three clusters and that members of the intermediate cluster (i.e., middle of three clusters) played a pivotal role in this community by expressing uncertainty statements, which facilitated a collective sense-making process to resolve misunderstandings. Furthermore, we found that participants’ selected discussion topics and how they discussed those topics influenced cluster formations. Theoretical, practical, and methodological implications are discussed in depth

    Agents in Bioinformatics

    No full text
    The scope of the Technical Forum Group (TFG) on Agents in Bioinformatics (BIOAGENTS) was to inspire collaboration between the agent and bioinformatics communities with the aim of creating an opportunity to propose a different (agent-based) approach to the development of computational frameworks both for data analysis in bioinformatics and for system modelling in computational biology. During the day, the participants examined the future of research on agents in bioinformatics primarily through 12 invited talks selected to cover the most relevant topics. From the discussions, it became clear that there are many perspectives to the field, ranging from bio-conceptual languages for agent-based simulation, to the definition of bio-ontology-based declarative languages for use by information agents, and to the use of Grid agents, each of which requires further exploration. The interactions between participants encouraged the development of applications that describe a way of creating agent-based simulation models of biological systems, starting from an hypothesis and inferring new knowledge (or relations) by mining and analysing the huge amount of public biological data. In this report we summarise and reflect on the presentations and discussions

    Selected papers from the 14th Annual Bio-Ontologies Special Interest Group Meeting

    Get PDF
    Over the 14 years, the Bio-Ontologies SIG at ISMB has provided a forum for discussion of the latest and most innovative research in the bio-ontologies development, its applications to biomedicine and more generally the organisation, presentation and dissemination of knowledge in biomedicine and the life sciences. The seven papers selected for this supplement span a wide range of topics including: web-based querying over multiple ontologies, integration of data from wikis, innovative methods of annotating and mining electronic health records, advances in annotating web documents and biomedical literature, quality control of ontology alignments, and the ontology support for predictive models about toxicity and open access to the toxicity data

    Semantic Systems. The Power of AI and Knowledge Graphs

    Get PDF
    This open access book constitutes the refereed proceedings of the 15th International Conference on Semantic Systems, SEMANTiCS 2019, held in Karlsruhe, Germany, in September 2019. The 20 full papers and 8 short papers presented in this volume were carefully reviewed and selected from 88 submissions. They cover topics such as: web semantics and linked (open) data; machine learning and deep learning techniques; semantic information management and knowledge integration; terminology, thesaurus and ontology management; data mining and knowledge discovery; semantics in blockchain and distributed ledger technologies

    Exploring Topics and Genres in Storytime Books: A Text Mining Approach

    Get PDF
    Objective – While storytime programs for preschool children are offered in nearly all public libraries in the United States, little is known about the books librarians use in these programs. This study employed text analysis to explore topics and genres of books recommended for public library storytime programs. Methods – In the study, the researchers randomly selected 429 children books recommended for preschool storytime programs. Two corpuses of text were extracted from the titles, abstracts, and subject terms from bibliographic data. Multiple text mining methods were employed to investigate the content of the selected books, including term frequency, bi-gram analysis, topic modeling, and sentiment analysis. Results – The findings revealed popular topics in storytime books, including animals/creatures, color, alphabet, nature, movements, families, friends, and others. The analysis of bibliographic data described various genres and formats of storytime books, such as juvenile fiction, rhymes, board books, pictorial work, poetry, folklore, and nonfiction. Sentiment analysis results reveal that storytime books included a variety of words representing various dimensions of sentiment. Conclusion – The findings suggested that books recommended for storytime programs are centered around topics of interest to children that also support school readiness. In addition to selecting fictionalized stories that will support children in developing the academic concepts and socio-emotional skills necessary for later success, librarians should also be mindful of integrating informational texts into storytime programs

    Machine learning paradigms for modeling spatial and temporal information in multimedia data mining

    Get PDF
    Multimedia data mining and knowledge discovery is a fast emerging interdisciplinary applied research area. There is tremendous potential for effective use of multimedia data mining (MDM) through intelligent analysis. Diverse application areas are increasingly relying on multimedia under-standing systems. Advances in multimedia understanding are related directly to advances in signal processing, computer vision, machine learning, pattern recognition, multimedia databases, and smart sensors. The main mission of this special issue is to identify state-of-the-art machine learning paradigms that are particularly powerful and effective for modeling and combining temporal and spatial media cues such as audio, visual, and face information and for accomplishing tasks of multimedia data mining and knowledge discovery. These models should be able to bridge the gap between low-level audiovisual features which require signal processing and high-level semantics. A number of papers have been submitted to the special issue in the areas of imaging, artificial intelligence; and pattern recognition and five contributions have been selected covering state-of-the-art algorithms and advanced related topics. The first contribution by D. Xiang et al. “Evaluation of data quality and drought monitoring capability of FY-3A MERSI data” describes some basic parameters and major technical indicators of the FY-3A, and evaluates data quality and drought monitoring capability of the Medium-Resolution Imager (MERSI) onboard the FY-3A. The second contribution by A. Belatreche et al. “Computing with biologically inspired neural oscillators: application to color image segmentation” investigates the computing capabilities and potential applications of neural oscillators, a biologically inspired neural model, to gray scale and color image segmentation, an important task in image understanding and object recognition. The major contribution of this paper is the ability to use neural oscillators as a learning scheme for solving real world engineering problems. The third paper by A. Dargazany et al. entitled “Multibandwidth Kernel-based object tracking” explores new methods for object tracking using the mean shift (MS). A bandwidth-handling MS technique is deployed in which the tracker reach the global mode of the density function not requiring a specific staring point. It has been proven via experiments that the Gradual Multibandwidth Mean Shift tracking algorithm can converge faster than the conventional kernel-based object tracking (known as the mean shift). The fourth contribution by S. Alzu’bi et al. entitled “3D medical volume segmentation using hybrid multi-resolution statistical approaches” studies new 3D volume segmentation using multiresolution statistical approaches based on discrete wavelet transform and hidden Markov models. This system commonly reduced the percentage error achieved using the traditional 2D segmentation techniques by several percent. Furthermore, a contribution by G. Cabanes et al. entitled “Unsupervised topographic learning for spatiotemporal data mining” proposes a new unsupervised algorithm, suitable for the analysis of noisy spatiotemporal Radio Frequency Identification (RFID) data. The new unsupervised algorithm depicted in this article is an efficient data mining tool for behavioral studies based on RFID technology. It has the ability to discover and compare stable patterns in a RFID signal, and is appropriate for continuous learning. Finally, we would like to thank all those who helped to make this special issue possible, especially the authors and the reviewers of the articles. Our thanks go to the Hindawi staff and personnel, the journal Manager in bringing about the issue and giving us the opportunity to edit this special issue

    The 20th anniversary of EMBnet: 20 years of bioinformatics for the Life Sciences community

    Get PDF
    The EMBnet Conference 2008, focusing on 'Leading Applications and Technologies in Bioinformatics', was organized by the European Molecular Biology network (EMBnet) to celebrate its 20th anniversary. Since its foundation in 1988, EMBnet has been working to promote collaborative development of bioinformatics services and tools to serve the European community of molecular biology laboratories. This conference was the first meeting organized by the network that was open to the international scientific community outside EMBnet. The conference covered a broad range of research topics in bioinformatics with a main focus on new achievements and trends in emerging technologies supporting genomics, transcriptomics and proteomics analyses such as high-throughput sequencing and data managing, text and data-mining, ontologies and Grid technologies. Papers selected for publication, in this supplement to BMC Bioinformatics, cover a broad range of the topics treated, providing also an overview of the main bioinformatics research fields that the EMBnet community is involved in

    Innovations in design and construction of granular pavements and railways

    Get PDF
    This paper describes some of the work related with the International Technical Committee TC3 – Geotechnics of Pavements of ISSMGE. For brevity, some topics are selected to be described in some detail, while others are acknowledged for reference purposes. These topics cover: (1) Data Mining tools in transporttation geotechnics showing the capabilities to predict real-value from several attributes and also the possibility to develop a formal updating framework to reduce uncertainty and increase reliability of deformability modulus updating in pavement and rail track structures; (2) Design aspects related with the mechanist approach in the framework of soil mechanics; (3) construction and quality control aspects covering compactor technologies and advanced tools for continuous compaction control and bearing capacity surveys during and after construction. These contributions aim to promote the use of some powerful available tools in the design and construction processes with impacts in the reduction of maintenance costs.International Society for Soil Mechanics & Geotechnical EngineeringThe British Geotechnical Associatio
    • …
    corecore