6,269 research outputs found
Remote sensing applications to resource problems in South Dakota
Cooperative projects between RSI and numerous South Dakota agencies have provided a means of incorporating remote sensing techniques into operational programs. Eight projects discussed in detail are: (1) detection of high moisture zones near interstate 90; (2) thermal infrared census of Canada geese in South Dakota; (3) dutch elm disease detection in urban environment; (4) a feasibility study for monitoring effective precipitation in South Dakota using TIROS-N; (5) open and abandoned dump sites in Spink county; (6) the influence of soil reflectance on LANDSAT signatures of crops; (7) A model implementation program for Lake Herman watershed; and (8) the Six-Mile Creek investigation follow-on
Learning Sparse Classifiers: Continuous and Mixed Integer Optimization Perspectives
We consider a discrete optimization formulation for learning sparse
classifiers, where the outcome depends upon a linear combination of a small
subset of features. Recent work has shown that mixed integer programming (MIP)
can be used to solve (to optimality) -regularized regression problems
at scales much larger than what was conventionally considered possible. Despite
their usefulness, MIP-based global optimization approaches are significantly
slower compared to the relatively mature algorithms for -regularization
and heuristics for nonconvex regularized problems. We aim to bridge this gap in
computation times by developing new MIP-based algorithms for
-regularized classification. We propose two classes of scalable
algorithms: an exact algorithm that can handle features in a
few minutes, and approximate algorithms that can address instances with
in times comparable to the fast -based algorithms. Our
exact algorithm is based on the novel idea of \textsl{integrality generation},
which solves the original problem (with binary variables) via a sequence of
mixed integer programs that involve a small number of binary variables. Our
approximate algorithms are based on coordinate descent and local combinatorial
search. In addition, we present new estimation error bounds for a class of
-regularized estimators. Experiments on real and synthetic data
demonstrate that our approach leads to models with considerably improved
statistical performance (especially, variable selection) when compared to
competing methods.Comment: To appear in JML
An automated exact solution framework towards solving the logistic regression best subset selection problem
An automated logistic regression solution framework (ALRSF) is proposed to solve a mixed integer programming (MIP) formulation of the well known logistic regression best subset selection problem. The solution framework firstly determines the optimal number of independent variables that should be included in the model using an automated cardinality parameter selection procedure. The cardinality parameter dictates the size of the subset of variables and can be problem-specific. A novel regression parameter fixing heuristic that utilises a Benders decomposition algorithm is applied to prune the solution search space such that the optimal regression parameter values are found faster. An optimality gap is subsequently calculated to quantify the quality of the final regression model by considering the distance between the best possible log-likelihood value and a log-likelihood value that is calculated using the current parameter values. Attempts are then made to reduce the optimality gap by adjusting regression parameter values. The ALRSF serves as a holistic variable selection framework that enables the user to consider larger datasets when solving the best subset selection logistic regression problem by significantly reducing the memory requirements associated with its mixed integer programming formulation. Furthermore, the automated framework requires minimal user intervention during model training and hyperparameter tuning. Improvements in quality of the final model (when considering both the optimality gap and computing resources required to achieve a result) are observed when the ALRSF is applied to well-known real-world UCI machine learning datasets
Optimal mathematical programming and variable neighborhood search for k-modes categorical data clustering
The conventional k-modes algorithm and its variants have been extensively used for categorical data clustering. However, these algorithms have some drawbacks, e.g., they can be trapped into local optima and sensitive to initial clusters/modes. Our numerical experiments even showed that the k-modes algorithm could not identify the optimal clustering results for some special datasets regardless the selection of the initial centers. In this paper, we developed an integer linear programming (ILP) approach for the k-modes clustering, which is independent to the initial solution and can obtain directly the optimal results for small-sized datasets. We also developed a heuristic algorithm that implements iterative partial optimization in the ILP approach based on a framework of variable neighborhood search, known as IPO-ILP-VNS, to search for near-optimal results of medium and large sized datasets with controlled computing time. Experiments on 38 datasets, including 27 synthesized small datasets and 11 known benchmark datasets from the UCI site were carried out to test the proposed ILP approach and the IPO-ILP-VNS algorithm. The experimental results outperformed the conventional and other existing enhanced k-modes algorithms in literature, updated 9 of the UCI benchmark datasets with new and improved results
Recommended from our members
Evaluation of current algorithms for segmentation of scar tissue from late Gadolinium enhancement cardiovascular magnetic resonance of the left atrium: an open-access grand challenge
Background: Late Gadolinium enhancement (LGE) cardiovascular magnetic resonance (CMR) imaging can be used to visualise regions of fibrosis and scarring in the left atrium (LA) myocardium. This can be important for treatment stratification of patients with atrial fibrillation (AF) and for assessment of treatment after radio frequency catheter ablation (RFCA). In this paper we present a standardised evaluation benchmarking framework for algorithms segmenting fibrosis and scar from LGE CMR images. The algorithms reported are the response to an open challenge that was put to the medical imaging community through an ISBI (IEEE International Symposium on Biomedical Imaging) workshop. Methods: The image database consisted of 60 multicenter, multivendor LGE CMR image datasets from patients with AF, with 30 images taken before and 30 after RFCA for the treatment of AF. A reference standard for scar and fibrosis was established by merging manual segmentations from three observers. Furthermore, scar was also quantified using 2, 3 and 4 standard deviations (SD) and full-width-at-half-maximum (FWHM) methods. Seven institutions responded to the challenge: Imperial College (IC), Mevis Fraunhofer (MV), Sunnybrook Health Sciences (SY), Harvard/Boston University (HB), Yale School of Medicine (YL), King’s College London (KCL) and Utah CARMA (UTA, UTB). There were 8 different algorithms evaluated in this study. Results: Some algorithms were able to perform significantly better than SD and FWHM methods in both pre- and post-ablation imaging. Segmentation in pre-ablation images was challenging and good correlation with the reference standard was found in post-ablation images. Overlap scores (out of 100) with the reference standard were as follows: Pre: IC = 37, MV = 22, SY = 17, YL = 48, KCL = 30, UTA = 42, UTB = 45; Post: IC = 76, MV = 85, SY = 73, HB = 76, YL = 84, KCL = 78, UTA = 78, UTB = 72. Conclusions: The study concludes that currently no algorithm is deemed clearly better than others. There is scope for further algorithmic developments in LA fibrosis and scar quantification from LGE CMR images. Benchmarking of future scar segmentation algorithms is thus important. The proposed benchmarking framework is made available as open-source and new participants can evaluate their algorithms via a web-based interface
Feasibility and Coverage of Implementing Intermittent Preventive Treatment of Malaria in Pregnant women Contacting Private or Public Clinics in Tanzania: Experience-based Viewpoints of Health Managers in Mkuranga and Mufindi districts.
Evidence on healthcare managers' experience on operational feasibility of malaria intermittent preventive treatment for malaria during pregnancy (IPTp) using sulphadoxine-pyrimethamine (SP) in Africa is systematically inadequate. This paper elucidates the perspectives of District Council Health Management Team (CHMT)s regarding the feasibility of IPTp with SP strategy, including its acceptability and ability of district health care systems to cope with the contemporary and potential challenges. The study was conducted in Mkuranga and Mufindi districts. Data were collected between November 2005 and December 2007, involving focus group discussion (FGD) with Mufindi CHMT and in-depth interviews were conducted with few CHMT members in Mkuranga where it was difficult to summon all members for FGD. Participants in both districts acknowledged the IPTp strategy, considering the seriousness of malaria in pregnancy problem; government allocation of funds to support healthcare staff training programmes in focused antenatal care (fANC) issues, procuring essential drugs distributed to districts, staff remuneration, distribution of fANC guidelines, and administrative activities performed by CHMTs. The identified weaknesses include late arrival of funds from central level weakening CHMT's performance in health supervision, organising outreach clinics, distributing essential supplies, and delivery of IPTp services. Participants anticipated the public losing confidence in SP for IPTp after government announced artemither-lumefantrine (ALu) as the new first-line drug for uncomplicated malaria replacing SP. Role of private healthcare staff in IPTp services was acknowledged cautiously because CHMTs rarely supplied private clinics with SP for free delivery in fear that clients would be required to pay for the SP contrary to government policy. In Mufindi, the District Council showed a strong political support by supplementing ANC clinics with bottled water; in Mkuranga such support was not experienced. A combination of health facility understaffing, water scarcity and staff non-adherence to directly observed therapy instructions forced healthcare staff to allow clients to take SP at home. Need for investigating in improving adherence to IPTp administration was emphasised. High acceptability of the IPTp strategy at district level is meaningless unless necessary support is assured in terms of number, skills and motivation of caregivers and availability of essential supplies
An interactive graphics program to retrieve, display, compare, manipulate, curve fit, difference and cross plot wind tunnel data
The Aerodynamic Data Analysis and Integration System (ADAIS), developed as a highly interactive computer graphics program capable of manipulating large quantities of data such that addressable elements of a data base can be called up for graphic display, compared, curve fit, stored, retrieved, differenced, etc., was described. The general nature of the system is evidenced by the fact that limited usage has already occurred with data bases consisting of thermodynamic, basic loads, and flight dynamics data. Productivity using ADAIS of five times that for conventional manual methods of wind tunnel data analysis is routinely achieved. In wind tunnel data analysis, data from one or more runs of a particular test may be called up and displayed along with data from one or more runs of a different test. Curves may be faired through the data points by any of four methods, including cubic spline and least squares polynomial fit up to seventh order
- …