187 research outputs found

    Learning to Select Cuts for Efficient Mixed-Integer Programming

    Get PDF
    Cutting plane methods play a significant role in modern solvers for tackling mixed-integer programming (MIP) problems. Proper selection of cuts would remove infeasible solutions in the early stage, thus largely reducing the computational burden without hurting the solution accuracy. However, the major cut selection approaches heavily rely on heuristics, which strongly depend on the specific problem at hand and thus limit their generalization capability. In this paper, we propose a data-driven and generalizable cut selection approach, named Cut Ranking, in the settings of multiple instance learning. To measure the quality of the candidate cuts, a scoring function, which takes the instance-specific cut features as inputs, is trained and applied in cut ranking and selection. In order to evaluate our method, we conduct extensive experiments on both synthetic datasets and real-world datasets. Compared with commonly used heuristics for cut selection, the learning-based policy has shown to be more effective, and is capable of generalizing over multiple problems with different properties. Cut Ranking has been deployed in an industrial solver for large-scale MIPs. In the online A/B testing of the product planning problems with more than 10710^7 variables and constraints daily, Cut Ranking has achieved the average speedup ratio of 12.42% over the production solver without any accuracy loss of solution.Comment: Paper accepted at Pattern Recognition journa

    Inventory-Constrained Structural Design

    Get PDF

    A Memetic Algorithm for whole test suite generation

    Get PDF
    The generation of unit-level test cases for structural code coverage is a task well-suited to Genetic Algorithms. Method call sequences must be created that construct objects, put them into the right state and then execute uncovered code. However, the generation of primitive values, such as integers and doubles, characters that appear in strings, and arrays of primitive values, are not so straightforward. Often, small local changes are required to drive the value toward the one needed to execute some target structure. However, global searches like Genetic Algorithms tend to make larger changes that are not concentrated on any particular aspect of a test case. In this paper, we extend the Genetic Algorithm behind the EvoSuiTE test generation tool into a Memetic Algorithm, by equipping it with several local search operators. These operators are designed to efficiently optimize primitive values and other aspects of a test suite that allow the search for test cases to function more effectively. We evaluate our operators using a rigorous experimental methodology on over 12,000 Java classes, comprising open source classes of various different kinds, including numerical applications and text processors. Our study shows that increases in branch coverage of up to 53% are possible for an individual class in practice

    Ant Colony Optimisation – A Proposed Solution Framework for the Capacitated Facility Location Problem

    Get PDF
    This thesis is a critical investigation into the development, application and evaluation of ant colony optimisation metaheuristics, with a view to solving a class of capacitated facility location problems. The study is comprised of three phases. The first sets the scene and motivation for research, which includes; key concepts of ant colony optimisation, a review of published academic materials and a research philosophy which provides a justification for a deductive empirical mode of study. This phase reveals that published results for existing facility location metaheuristics are often ambiguous or incomplete and there is no clear evidence of a dominant method. This clearly represents a gap in the current knowledge base and provides a rationale for a study that will contribute to existing knowledge, by determining if ant colony optimisation is a suitable solution technique for solving capacitated facility location problems. The second phase is concerned with the research, development and application of a variety of ant colony optimisation algorithms. Solution methods presented include combinations of approximate and exact techniques. The study identifies a previously untried ant hybrid scheme, which incorporates an exact method within it, as the most promising of techniques that were tested. Also a novel local search initialisation which relies on memory is presented. These hybridisations successfully solve all of the capacitated facility location test problems available in the OR-Library. The third phase of this study conducts an extensive series of run-time analyses, to determine the prowess of the derived ant colony optimisation algorithms against a contemporary cross-entropy technique. This type of analysis for measuring metaheuristic performance for the capacitated facility location problem is not evident within published materials. Analyses of empirical run-time distributions reveal that ant colony optimisation is superior to its contemporary opponent. All three phases of this thesis provide their own individual contributions to existing knowledge bases: the production of a series of run-time distributions will be a valuable resource for future researchers; results demonstrate that hybridisation of metaheuristics with exact solution methods is an area not to be ignored; the hybrid methods employed in this study ten years ago would have been impractical or infeasible; ant colony optimisation is shown to be a very flexible metaheuristic that can easily be adapted to solving mixed integer problems using hybridisation techniques

    Dynamically Testing Graphical User Interfaces

    Get PDF
    Software test generation for GUIs is a hard problem. The goal of this thesis is to investigate different methods for dynamically generating tests for GUIs. We introduce the concept of an event-pair graph, which is used to represent and measure test suites, and show how it can be used to generate tests and measure GUI coverage. Before we can begin generating tests, we first want to determine which is better: a small test suite with a few long tests or a large test suite with many short tests. Therefore, we designed and conducted a study to determine which is more effective. We found that moderate to long tests perform better than short tests. We then move on to discuss seven test generation algorithms. Two are based on random selection, two are based on greedy selection, one is based on Q-Learning, and the last two are based on ant colony optimization. We conducted a study in order to compare the performance of each algorithm. We measured code coverage, GUI coverage, time to run, and faults found. The results show that the greedy algorithms performed the best. Finally, we conducted a study in order to determine if any of the GUI coverage metrics can be used to predict code coverage, and we conducted a study to determine if any of the coverage metrics can be used to predict the faults found. The results show that event pairs are good at predicting code coverage, and that predicting faults is difficult

    Critical Scenario Identification for Testing of Autonomous Driving Systems

    Get PDF
    Background: Autonomous systems have received considerable attention from academia and are adopted by various industrial domains, such as automotive, avionics, etc. As many of them are considered safety-critical, testing is indispensable to verify their reliability and safety. However, there is no common standard for testing autonomous systems efficiently and effectively. Thus new approaches for testing such systems must be developed.Aim: The objective of this thesis is two-fold. First, we want to present an overview of software testing of autonomous systems, i.e., relevant concepts, challenges, and techniques available in academic research and industry practice. Second, we aim to establish a new approach for testing autonomous driving systems and demonstrate its effectiveness by using real autonomous driving systems from industry.Research Methodology: We conducted the research in three steps using the design science paradigm. First, we explored the existing literature and industry practices to understand the state of the art for testing of autonomous systems. Second, we focused on a particular sub-domain - autonomous driving - and proposed a systematic approach for critical test scenario identification. Lastly, we validated our approach and employed it for testing real autonomous driving systems by collaborating with Volvo Cars.Results: We present the results as four papers in this thesis. First, we conceptualized a definition of autonomous systems and classified challenges and approaches, techniques, and practices for testing autonomous systems in general. Second, we designed a systematic approach for critical test scenario identification. We employed the approach for testing two real autonomous driving systems from the industry and have effectively identified critical test scenarios. Lastly, we established a model for predicting the distribution of vehicle-pedestrian interactions for realistic test scenario generation for autonomous driving systems. Conclusion: Critical scenario identification is a favorable approach to generate test scenarios and facilitate the testing of autonomous driving systems in an efficient way. Future improvement of the approach includes (1) evaluating the effectiveness of the generated critical scenarios for testing; (2) extending the sub-components in this approach; (3) combining different testing approaches, and (4) exploring the application of the approach to test different autonomous systems

    Ant colony optimisation : a proposed solution framework for the capacitated facility location problem

    Get PDF
    This thesis is a critical investigation into the development, application and evaluation of ant colony optimisation metaheuristics, with a view to solving a class of capacitated facility location problems. The study is comprised of three phases. The first sets the scene and motivation for research, which includes; key concepts of ant colony optimisation, a review of published academic materials and a research philosophy which provides a justification for a deductive empirical mode of study. This phase reveals that published results for existing facility location metaheuristics are often ambiguous or incomplete and there is no clear evidence of a dominant method. This clearly represents a gap in the current knowledge base and provides a rationale for a study that will contribute to existing knowledge, by determining if ant colony optimisation is a suitable solution technique for solving capacitated facility location problems. The second phase is concerned with the research, development and application of a variety of ant colony optimisation algorithms. Solution methods presented include combinations of approximate and exact techniques. The study identifies a previously untried ant hybrid scheme, which incorporates an exact method within it, as the most promising of techniques that were tested. Also a novel local search initialisation which relies on memory is presented. These hybridisations successfully solve all of the capacitated facility location test problems available in the OR-Library. The third phase of this study conducts an extensive series of run-time analyses, to determine the prowess of the derived ant colony optimisation algorithms against a contemporary cross-entropy technique. This type of analysis for measuring metaheuristic performance for the capacitated facility location problem is not evident within published materials. Analyses of empirical run-time distributions reveal that ant colony optimisation is superior to its contemporary opponent. All three phases of this thesis provide their own individual contributions to existing knowledge bases: the production of a series of run-time distributions will be a valuable resource for future researchers; results demonstrate that hybridisation of metaheuristics with exact solution methods is an area not to be ignored; the hybrid methods employed in this study ten years ago would have been impractical or infeasible; ant colony optimisation is shown to be a very flexible metaheuristic that can easily be adapted to solving mixed integer problems using hybridisation techniques.EThOS - Electronic Theses Online ServiceGBUnited Kingdo

    Ant colony optimisation : a proposed solution framework for the capacitated facility location problem

    Get PDF
    This thesis is a critical investigation into the development, application and evaluation of ant colony optimisation metaheuristics, with a view to solving a class of capacitated facility location problems. The study is comprised of three phases. The first sets the scene and motivation for research, which includes; key concepts of ant colony optimisation, a review of published academic materials and a research philosophy which provides a justification for a deductive empirical mode of study. This phase reveals that published results for existing facility location metaheuristics are often ambiguous or incomplete and there is no clear evidence of a dominant method. This clearly represents a gap in the current knowledge base and provides a rationale for a study that will contribute to existing knowledge, by determining if ant colony optimisation is a suitable solution technique for solving capacitated facility location problems. The second phase is concerned with the research, development and application of a variety of ant colony optimisation algorithms. Solution methods presented include combinations of approximate and exact techniques. The study identifies a previously untried ant hybrid scheme, which incorporates an exact method within it, as the most promising of techniques that were tested. Also a novel local search initialisation which relies on memory is presented. These hybridisations successfully solve all of the capacitated facility location test problems available in the OR-Library. The third phase of this study conducts an extensive series of run-time analyses, to determine the prowess of the derived ant colony optimisation algorithms against a contemporary cross-entropy technique. This type of analysis for measuring metaheuristic performance for the capacitated facility location problem is not evident within published materials. Analyses of empirical run-time distributions reveal that ant colony optimisation is superior to its contemporary opponent. All three phases of this thesis provide their own individual contributions to existing knowledge bases: the production of a series of run-time distributions will be a valuable resource for future researchers; results demonstrate that hybridisation of metaheuristics with exact solution methods is an area not to be ignored; the hybrid methods employed in this study ten years ago would have been impractical or infeasible; ant colony optimisation is shown to be a very flexible metaheuristic that can easily be adapted to solving mixed integer problems using hybridisation techniques.EThOS - Electronic Theses Online ServiceGBUnited Kingdo

    Reinforcement learning in large state action spaces

    Get PDF
    Reinforcement learning (RL) is a promising framework for training intelligent agents which learn to optimize long term utility by directly interacting with the environment. Creating RL methods which scale to large state-action spaces is a critical problem towards ensuring real world deployment of RL systems. However, several challenges limit the applicability of RL to large scale settings. These include difficulties with exploration, low sample efficiency, computational intractability, task constraints like decentralization and lack of guarantees about important properties like performance, generalization and robustness in potentially unseen scenarios. This thesis is motivated towards bridging the aforementioned gap. We propose several principled algorithms and frameworks for studying and addressing the above challenges RL. The proposed methods cover a wide range of RL settings (single and multi-agent systems (MAS) with all the variations in the latter, prediction and control, model-based and model-free methods, value-based and policy-based methods). In this work we propose the first results on several different problems: e.g. tensorization of the Bellman equation which allows exponential sample efficiency gains (Chapter 4), provable suboptimality arising from structural constraints in MAS(Chapter 3), combinatorial generalization results in cooperative MAS(Chapter 5), generalization results on observation shifts(Chapter 7), learning deterministic policies in a probabilistic RL framework(Chapter 6). Our algorithms exhibit provably enhanced performance and sample efficiency along with better scalability. Additionally, we also shed light on generalization aspects of the agents under different frameworks. These properties have been been driven by the use of several advanced tools (e.g. statistical machine learning, state abstraction, variational inference, tensor theory). In summary, the contributions in this thesis significantly advance progress towards making RL agents ready for large scale, real world applications

    Building Information Filtering Networks with Topological Constraints: Algorithms and Applications

    Get PDF
    We propose a new methodology for learning the structure of sparse networks from data; in doing so we adopt a dual perspective where we consider networks both as weighted graphs and as simplicial complexes. The proposed learning methodology belongs to the family of preferential attachment algorithms, where a network is extended by iteratively adding new vertices. In the conventional preferential attachment algorithm a new vertex is added to the network by adding a single edge to another existing vertex; in our approach a new vertex is added to a set of vertices by adding one or more new simplices to the simplicial complex. We propose the use of a score function to quantify the strength of the association between the new vertex and the attachment points. The methodology performs a greedy optimisation of the total score by selecting, at each step, the new vertex and the attachment points that maximise the gain in the score. Sparsity is enforced by restricting the space of the feasible configurations through the imposition of topological constraints on the candidate networks; the constraint is fulfilled by allowing only topological operations that are invariant with respect to the required property. For instance, if the topological constraint requires the constructed network to be be planar, then only planarity-invariant operations are allowed; if the constraint is that the network must be a clique forest, then only simplicial vertices can be added. At each step of the algorithm, the vertex to be added and the attachment points are those that provide the maximum increase in score while maintaining the topological constraints. As a concrete but general realisation we propose the clique forest as a possible topological structure for the representation of sparse networks, and we allow to specify further constraints such as the allowed range of clique sizes and the saturation of the attachment points. In this thesis we originally introduce the Maximally Filtered Clique Forest (MFCF) algorithm: the MFCF builds a clique forest by repeated application of a suitably invariant operation that we call Clique Expansion operator and adds vertices according to a strategy that greedily maximises the gain in a local score function. The gains produced by the Clique Expansion operator can be validated in a number of ways, including statistical testing, cross-validation or value thresholding. The algorithm does not prescribe a specific form for the gain function, but allows the use of any number of gain functions as long as they are consistent with the Clique Expansion operator. We describe several examples of gain functions suited to different problems. As a specific practical realisation we study the extraction of planar networks with the Triangulated Maximally Filtered Graph (TMFG). The TMFG, in its simplest form, is a specialised version of the MFCF, but it can be made more powerful by allowing the use of specialised planarity invariant operators that are not based on the Clique Expansion operator. We provide applications to two well known applied problems: the Maximum Weight Planar Subgraph Problem (MWPSP) and the Covariance Selection problem. With regards to the Covariance Selection problem we compare our results to the state of the art solution (the Graphical Lasso) and we highlight the benefits of our methodology. Finally, we study the geometry of clique trees as simplicial complexes and note how the statistics based on cliques and separators provides information equivalent to the one that can be achieved by means of homological methods, such as the analysis of Betti numbers, however with our approach being computationally more efficient and intuitively simpler. Finally, we use the geometric tools developed to provide a possible methodology for inferring the size of a dataset generated by a factor model. As an example we show that our tools provide a solution for inferring the size of a dataset generated by a factor model
    corecore