6,269 research outputs found

    Remote sensing applications to resource problems in South Dakota

    Get PDF
    Cooperative projects between RSI and numerous South Dakota agencies have provided a means of incorporating remote sensing techniques into operational programs. Eight projects discussed in detail are: (1) detection of high moisture zones near interstate 90; (2) thermal infrared census of Canada geese in South Dakota; (3) dutch elm disease detection in urban environment; (4) a feasibility study for monitoring effective precipitation in South Dakota using TIROS-N; (5) open and abandoned dump sites in Spink county; (6) the influence of soil reflectance on LANDSAT signatures of crops; (7) A model implementation program for Lake Herman watershed; and (8) the Six-Mile Creek investigation follow-on

    Learning Sparse Classifiers: Continuous and Mixed Integer Optimization Perspectives

    Full text link
    We consider a discrete optimization formulation for learning sparse classifiers, where the outcome depends upon a linear combination of a small subset of features. Recent work has shown that mixed integer programming (MIP) can be used to solve (to optimality) 0\ell_0-regularized regression problems at scales much larger than what was conventionally considered possible. Despite their usefulness, MIP-based global optimization approaches are significantly slower compared to the relatively mature algorithms for 1\ell_1-regularization and heuristics for nonconvex regularized problems. We aim to bridge this gap in computation times by developing new MIP-based algorithms for 0\ell_0-regularized classification. We propose two classes of scalable algorithms: an exact algorithm that can handle p50,000p\approx 50,000 features in a few minutes, and approximate algorithms that can address instances with p106p\approx 10^6 in times comparable to the fast 1\ell_1-based algorithms. Our exact algorithm is based on the novel idea of \textsl{integrality generation}, which solves the original problem (with pp binary variables) via a sequence of mixed integer programs that involve a small number of binary variables. Our approximate algorithms are based on coordinate descent and local combinatorial search. In addition, we present new estimation error bounds for a class of 0\ell_0-regularized estimators. Experiments on real and synthetic data demonstrate that our approach leads to models with considerably improved statistical performance (especially, variable selection) when compared to competing methods.Comment: To appear in JML

    An automated exact solution framework towards solving the logistic regression best subset selection problem

    Get PDF
    An automated logistic regression solution framework (ALRSF) is proposed to solve a mixed integer programming (MIP) formulation of the well known logistic regression best subset selection problem. The solution framework firstly determines the optimal number of independent variables that should be included in the model using an automated cardinality parameter selection procedure. The cardinality parameter dictates the size of the subset of variables and can be problem-specific. A novel regression parameter fixing heuristic that utilises a Benders decomposition algorithm is applied to prune the solution search space such that the optimal regression parameter values are found faster. An optimality gap is subsequently calculated to quantify the quality of the final regression model by considering the distance between the best possible log-likelihood value and a log-likelihood value that is calculated using the current parameter values. Attempts are then made to reduce the optimality gap by adjusting regression parameter values. The ALRSF serves as a holistic variable selection framework that enables the user to consider larger datasets when solving the best subset selection logistic regression problem by significantly reducing the memory requirements associated with its mixed integer programming formulation. Furthermore, the automated framework requires minimal user intervention during model training and hyperparameter tuning. Improvements in quality of the final model (when considering both the optimality gap and computing resources required to achieve a result) are observed when the ALRSF is applied to well-known real-world UCI machine learning datasets

    Optimal mathematical programming and variable neighborhood search for k-modes categorical data clustering

    Get PDF
    The conventional k-modes algorithm and its variants have been extensively used for categorical data clustering. However, these algorithms have some drawbacks, e.g., they can be trapped into local optima and sensitive to initial clusters/modes. Our numerical experiments even showed that the k-modes algorithm could not identify the optimal clustering results for some special datasets regardless the selection of the initial centers. In this paper, we developed an integer linear programming (ILP) approach for the k-modes clustering, which is independent to the initial solution and can obtain directly the optimal results for small-sized datasets. We also developed a heuristic algorithm that implements iterative partial optimization in the ILP approach based on a framework of variable neighborhood search, known as IPO-ILP-VNS, to search for near-optimal results of medium and large sized datasets with controlled computing time. Experiments on 38 datasets, including 27 synthesized small datasets and 11 known benchmark datasets from the UCI site were carried out to test the proposed ILP approach and the IPO-ILP-VNS algorithm. The experimental results outperformed the conventional and other existing enhanced k-modes algorithms in literature, updated 9 of the UCI benchmark datasets with new and improved results

    Feasibility and Coverage of Implementing Intermittent Preventive Treatment of Malaria in Pregnant women Contacting Private or Public Clinics in Tanzania: Experience-based Viewpoints of Health Managers in Mkuranga and Mufindi districts.

    Get PDF
    Evidence on healthcare managers' experience on operational feasibility of malaria intermittent preventive treatment for malaria during pregnancy (IPTp) using sulphadoxine-pyrimethamine (SP) in Africa is systematically inadequate. This paper elucidates the perspectives of District Council Health Management Team (CHMT)s regarding the feasibility of IPTp with SP strategy, including its acceptability and ability of district health care systems to cope with the contemporary and potential challenges. The study was conducted in Mkuranga and Mufindi districts. Data were collected between November 2005 and December 2007, involving focus group discussion (FGD) with Mufindi CHMT and in-depth interviews were conducted with few CHMT members in Mkuranga where it was difficult to summon all members for FGD. Participants in both districts acknowledged the IPTp strategy, considering the seriousness of malaria in pregnancy problem; government allocation of funds to support healthcare staff training programmes in focused antenatal care (fANC) issues, procuring essential drugs distributed to districts, staff remuneration, distribution of fANC guidelines, and administrative activities performed by CHMTs. The identified weaknesses include late arrival of funds from central level weakening CHMT's performance in health supervision, organising outreach clinics, distributing essential supplies, and delivery of IPTp services. Participants anticipated the public losing confidence in SP for IPTp after government announced artemither-lumefantrine (ALu) as the new first-line drug for uncomplicated malaria replacing SP. Role of private healthcare staff in IPTp services was acknowledged cautiously because CHMTs rarely supplied private clinics with SP for free delivery in fear that clients would be required to pay for the SP contrary to government policy. In Mufindi, the District Council showed a strong political support by supplementing ANC clinics with bottled water; in Mkuranga such support was not experienced. A combination of health facility understaffing, water scarcity and staff non-adherence to directly observed therapy instructions forced healthcare staff to allow clients to take SP at home. Need for investigating in improving adherence to IPTp administration was emphasised. High acceptability of the IPTp strategy at district level is meaningless unless necessary support is assured in terms of number, skills and motivation of caregivers and availability of essential supplies

    An interactive graphics program to retrieve, display, compare, manipulate, curve fit, difference and cross plot wind tunnel data

    Get PDF
    The Aerodynamic Data Analysis and Integration System (ADAIS), developed as a highly interactive computer graphics program capable of manipulating large quantities of data such that addressable elements of a data base can be called up for graphic display, compared, curve fit, stored, retrieved, differenced, etc., was described. The general nature of the system is evidenced by the fact that limited usage has already occurred with data bases consisting of thermodynamic, basic loads, and flight dynamics data. Productivity using ADAIS of five times that for conventional manual methods of wind tunnel data analysis is routinely achieved. In wind tunnel data analysis, data from one or more runs of a particular test may be called up and displayed along with data from one or more runs of a different test. Curves may be faired through the data points by any of four methods, including cubic spline and least squares polynomial fit up to seventh order
    corecore