75 research outputs found

    Stability in N-Layer recurrent neural networks

    Get PDF
    Starting with the theory developed by Hopfield, Cohen-Grossberg and Kosko, the study of associative memories is extended to N - layer re-current neural networks. The stability of different multilayer networks is demonstrated under specified bounding hypotheses. The analysis involves theorems for the additive as well as the multiplicative models for continuous and discrete N - layer networks. These demonstrations are based on contin-uous and discrete Liapunov theory. The thesis develops autoassociative and heteroassociative memories. It points out the link between all recurrent net-works of this type. The discrete case is analyzed using the threshold signal function as the activation function. A general approach for studying the sta-bility and convergence of the multilayer recurrent networks is developed

    Statistical Mechanics Approach to Inverse Problems on Networks

    Get PDF
    Statistical Mechanics has gained a central role in modern Inference and Computer Science. Many optimization and inference problems can be cast in a Statistical Mechanics framework, and various concepts and methods developed in this area of Physics can be very helpful not only in the theoretical analysis, but also constitute valuable tools for solving single instance cases of hard inference and computational tasks. In this work, I address various inverse problems on networks, from models of epidemic spreading to learning in neural networks, and apply a variety of methods which have been developed in the context of Disordered Systems, namely Replica and Cavity methods from the theoretical side, and their algorithmic incarnation, Belief Propagation, to solve hard inverse problems which can be formulated in a Bayesian framework

    Applications of Artificial Neural Networks (ANNs) in exploring materials property-property correlations

    Get PDF
    The copyright of this thesis rests with the author and no quotation from it or information derived from it may be published without the prior written consent of the authorThe discoveries of materials property-property correlations usually require prior knowledge or serendipity, the process of which can be time-consuming, costly, and labour-intensive. On the other hand, artificial neural networks (ANNs) are intelligent and scalable modelling techniques that have been used extensively to predict properties from materials’ composition or processing parameters, but are seldom used in exploring materials property-property correlations. The work presented in this thesis has employed ANNs combinatorial searches to explore the correlations of different materials properties, through which, ‘known’ correlations are verified, and ‘unknown’ correlations are revealed. An evaluation criterion is proposed and demonstrated to be useful in identifying nontrivial correlations. The work has also extended the application of ANNs in the fields of data corrections, property predictions and identifications of variables’ contributions. A systematic ANN protocol has been developed and tested against the known correlating equations of elastic properties and the experimental data, and is found to be reliable and effective to correct suspect data in a complicated situation where no prior knowledge exists. Moreover, the hardness increments of pure metals due to HPT are accurately predicted from shear modulus, melting temperature and Burgers vector. The first two variables are identified to have the largest impacts on hardening. Finally, a combined ANN-SR (symbolic regression) method is proposed to yield parsimonious correlating equations by ruling out redundant variables through the partial derivatives method and the connection weight approach, which are based on the analysis of the ANNs weight vectors. By applying this method, two simple equations that are at least as accurate as other models in providing a rapid estimation of the enthalpies of vaporization for compounds are obtained.School of Engineering and Materials Science of Queen Mary, University of London and China Scholarship Council (CSC), for providing Queen Mary - China Scholarship Council Joint PhD Scholarsh

    Implementation of ANN in Software Effort Estimation: Boundary Value Effort Forecast: A novel Artificial Neural Networks model to improve the accuracy of Effort Estimation in Software Development Projects

    Get PDF
    Dissertation presented as the partial requirement for obtaining a Master's degree in Information Management, specialization in Information Systems and Technologies ManagementSoftware Development consistently accommodates a variety of unstable scenarios. Good planning always stands behind well-defined requirements. Hence, the consistency of the effort estimation plays a special role in the traditional Business-Consumer relationship. While the proposed models may provide high accuracy in predicting specific data sets, it’s still difficult for IT specialists/organizations to find the best method for evaluating certain functionalities. The challenge of the project; initiated programming language, project infrastructure, and/or staff experimentation are just a few of the reasons that lead to inequality in these terms. Conceptually, the planned work going to explicate the main correlations. It will contain historical background - as to how was the industrial lifecycle before pre-processing progress/what was the necessity for them to exist, as well as modern usage area of BPM and Project Management – like how managers and owners’ moves are intending to keep the consumer’s satisfaction in higher level while increasing the revenue. Taking the most failure causes of projects into consideration, the research will capture some components of Software Project Management to clarify developed approaches and their advantages and/or disadvantages. The study may also lead somehow to the Business Process Management to see the alignments of required tasks in a rigorous way. The research is generally intending to define the key features of the Project Effort Estimation as usage of the datasets, evaluating the architectures, etc. The investigation also aims to find effective causes of poor effort estimation and analyze how those improvable points may be developed to ensure a highly accurate Artificial Neural Networks model

    Advances of Machine Learning in Materials Science: Ideas and Techniques

    Full text link
    In this big data era, the use of large dataset in conjunction with machine learning (ML) has been increasingly popular in both industry and academia. In recent times, the field of materials science is also undergoing a big data revolution, with large database and repositories appearing everywhere. Traditionally, materials science is a trial-and-error field, in both the computational and experimental departments. With the advent of machine learning-based techniques, there has been a paradigm shift: materials can now be screened quickly using ML models and even generated based on materials with similar properties; ML has also quietly infiltrated many sub-disciplinary under materials science. However, ML remains relatively new to the field and is expanding its wing quickly. There are a plethora of readily-available big data architectures and abundance of ML models and software; The call to integrate all these elements in a comprehensive research procedure is becoming an important direction of material science research. In this review, we attempt to provide an introduction and reference of ML to materials scientists, covering as much as possible the commonly used methods and applications, and discussing the future possibilities.Comment: 80 pages; 22 figures. To be published in Frontiers of Physics, 18, xxxxx, (2023

    Computational aspects of cellular intelligence and their role in artificial intelligence.

    Get PDF
    The work presented in this thesis is concerned with an exploration of the computational aspects of the primitive intelligence associated with single-celled organisms. The main aim is to explore this Cellular Intelligence and its role within Artificial Intelligence. The findings of an extensive literature search into the biological characteristics, properties and mechanisms associated with Cellular Intelligence, its underlying machinery - Cell Signalling Networks and the existing computational methods used to capture it are reported. The results of this search are then used to fashion the development of a versatile new connectionist representation, termed the Artificial Reaction Network (ARN). The ARN belongs to the branch of Artificial Life known as Artificial Chemistry and has properties in common with both Artificial Intelligence and Systems Biology techniques, including: Artificial Neural Networks, Artificial Biochemical Networks, Gene Regulatory Networks, Random Boolean Networks, Petri Nets, and S-Systems. The thesis outlines the following original work: The ARN is used to model the chemotaxis pathway of Escherichia coli and is shown to capture emergent characteristics associated with this organism and Cellular Intelligence more generally. The computational properties of the ARN and its applications in robotic control are explored by combining functional motifs found in biochemical network to create temporal changing waveforms which control the gaits of limbed robots. This system is then extended into a complete control system by combining pattern recognition with limb control in a single ARN. The results show that the ARN can offer increased flexibility over existing methods. Multiple distributed cell-like ARN based agents termed Cytobots are created. These are first used to simulate aggregating cells based on the slime mould Dictyostelium discoideum. The Cytobots are shown to capture emergent behaviour arising from multiple stigmergic interactions. Applications of Cytobots within swarm robotics are investigated by applying them to benchmark search problems and to the task of cleaning up a simulated oil spill. The results are compared to those of established optimization algorithms using similar cell inspired strategies, and to other robotic agent strategies. Consideration is given to the advantages and disadvantages of the technique and suggestions are made for future work in the area. The report concludes that the Artificial Reaction Network is a versatile and powerful technique which has application in both simulation of chemical systems, and in robotic control, where it can offer a higher degree of flexibility and computational efficiency than benchmark alternatives. Furthermore, it provides a tool which may possibly throw further light on the origins and limitations of the primitive intelligence associated with cells

    An intelligent classification system for land use and land cover mapping using spaceborne remote sensing and GIS

    Get PDF
    The objectives of this study were to experiment with and extend current methods of Synthetic Aperture Rader (SAR) image classification, and to design and implement a prototype intelligent remote sensing image processing and classification system for land use and land cover mapping in wet season conditions in Bangladesh, which incorporates SAR images and other geodata. To meet these objectives, the problem of classifying the spaceborne SAR images, and integrating Geographic Information System (GIS) data and ground truth data was studied first. In this phase of the study, an extension to traditional techniques was made by applying a Self-Organizing feature Map (SOM) to include GIS data with the remote sensing data during image segmentation. The experimental results were compared with those of traditional statistical classifiers, such as Maximum Likelihood, Mahalanobis Distance, and Minimum Distance classifiers. The performances of the classifiers were evaluated in terms of the classification accuracy with respect to the collected real-time ground truth data. The SOM neural network provided the highest overall accuracy when a GIS layer of land type classification (with respect to the period of inundation by regular flooding) was used in the network. Using this method, the overall accuracy was around 15% higher than the previously mentioned traditional classifiers. It also achieved higher accuracies for more classes in comparison to the other classifiers. However, it was also observed that different classifiers produced better accuracy for different classes. Therefore, the investigation was extended to consider Multiple Classifier Combination (MCC) techniques, which is a recently emerging research area in pattern recognition. The study has tested some of these techniques to improve the classification accuracy by harnessing the goodness of the constituent classifiers. A Rule-based Contention Resolution method of combination was developed, which exhibited an improvement in the overall accuracy of about 2% in comparison to its best constituent (SOM) classifier. The next phase of the study involved the design of an architecture for an intelligent image processing and classification system (named ISRIPaC) that could integrate the extended methodologies mentioned above. Finally, the architecture was implemented in a prototype and its viability was evaluated using a set of real data. The originality of the ISRIPaC architecture lies in the realisation of the concept of a complete system that can intelligently cover all the steps of image processing classification and utilise standardised metadata in addition to a knowledge base in determining the appropriate methods and course of action for the given task. The implemented prototype of the ISRIPaC architecture is a federated system that integrates the CLIPS expert system shell, the IDRISI Kilimanjaro image processing and GIS software, and the domain experts' knowledge via a control agent written in Visual C++. It starts with data assessment and pre-processing and ends up with image classification and accuracy assessment. The system is designed to run automatically, where the user merely provides the initial information regarding the intended task and the source of available data. The system itself acquires necessary information about the data from metadata files in order to make decisions and perform tasks. The test and evaluation of the prototype demonstrates the viability of the proposed architecture and the possibility of extending the system to perform other image processing tasks and to use different sources of data. The system design presented in this study thus suggests some directions for the development of the next generation of remote sensing image processing and classification systems
    corecore