3,027 research outputs found

Nonparametric Predictive Inference for System Reliability

Author: ABOALKHAIR AHMAD,MOHAMMAD,ABDALMONEM
Publication venue
Publication date: 01/01/2012
Field of study

This thesis provides a new method for statistical inference on system reliability on the basis of limited information resulting from component testing. This method is called Nonparametric Predictive Inference (NPI). We present NPI for system reliability, in particular NPI for k-out-of-m systems, and for systems that consist of multiple k_i-out-of-m_i subsystems in series configuration. The algorithm for optimal redundancy allocation, with additional components added to subsystems one at a time, is presented. We also illustrate redundancy allocation for the same system in case the costs of additional components differ per subsystem. Then NPI is presented for system reliability in a similar setting, but with all subsystems consisting of the same single type of component. As a further step in the development of NPI for system reliability, where more general system structures can be considered, nonparametric predictive inference for reliability of voting systems with multiple component types is presented. We start with a single voting system with multiple component types, then we extend to a series configuration of voting subsystems with multiple component types. Throughout this thesis we assume information from tests of n_t components of type t

Durham e-Theses

CiteSeerX

Classification of clinical outcomes using high-throughput and clinical informatics.

Author: Cambon Alexander Carswell
Publication venue: ThinkIR: The University of Louisville's Institutional Repository
Publication date: 01/12/2014
Field of study

It is widely recognized that many cancer therapies are effective only for a subset of patients. However, clinical studies are most often powered to detect an overall treatment effect. To address this issue, classification methods are increasingly being used to predict a subset of patients who respond differently to treatment. This study begins with a brief history of classification methods with an emphasis on applications involving melanoma. Nonparametric methods suitable for predicting subsets of patients responding differently to treatment are then reviewed. Each method has different ways of incorporating continuous, categorical, clinical and high-throughput covariates. For nonparametric and parametric methods, distance measures specific to the method are used to make classification decisions. Approaches are outlined which employ these distances to measure treatment interactions and predict patients more sensitive to treatment. Simulations are also carried out to examine the empirical power of some of these classification methods in an adaptive signature design. Results were compared with logistic regression models. It was found that parametric and nonparametric methods performed reasonably well. Relative performance of the methods depends on the simulation scenario. Finally, a method was developed to evaluate the power and sample size needed for an adaptive signature design in order to predict the subset of patients sensitive to treatment. It is hoped that this study will stimulate more development of nonparametric and parametric methods to predict subsets of patients responding differently to treatment.

University of Louisville

Philosophy and the practice of Bayesian statistics

Author: Gelman Andrew
Shalizi Cosma Rohilla
Publication venue: Wiley
Publication date: 01/01/2010
Field of study

A substantial school in the philosophy of science identifies Bayesian inference with inductive inference and even rationality as such, and seems to be strengthened by the rise and practical success of Bayesian statistics. We argue that the most successful forms of Bayesian statistics do not actually support that particular philosophy but rather accord much better with sophisticated forms of hypothetico-deductivism. We examine the actual role played by prior distributions in Bayesian models, and the crucial aspects of model checking and model revision, which fall outside the scope of Bayesian confirmation theory. We draw on the literature on the consistency of Bayesian updating and also on our experience of applied work in social science. Clarity about these matters should benefit not just philosophy of science, but also statistical practice. At best, the inductivist view has encouraged researchers to fit and compare models without checking them; at worst, theorists have actively discouraged practitioners from performing model checking because it does not fit into their framework. Comment: 36 pages, 5 figures. v2: Fixed typo in caption of figure 1. v3: Further typo fixes. v4: Revised in response to referee

arXiv.org e-Print Archive

CiteSeerX

Crossref

Quantitative Analysis of Judicial Processes: Some Practical and Theoretical Applications

Author: Ulmer S. Sidney
Publication venue: Duke University School of Law
Publication date: 01/01/1963
Field of study

bepress Legal Repository

Duke Law Scholarship Repository

Contributions to reasoning on imprecise data

Author: Fink Paul
Publication venue: Ludwig-Maximilians-Universität München
Publication date: 15/06/2018
Field of study

This thesis contains four contributions which advocate cautious statistical modelling and inference. They achieve it by taking sets of models into account, either directly or indirectly by looking at compatible data situations. Special care is taken to avoid assumptions which are technically convenient, but reduce the uncertainty involved in an unjustified manner. This thesis provides methods for cautious statistical modelling and inference, which are able to exhaust the potential of precise and vague data, motivated by different fields of application, ranging from political science to official statistics. First, the inherently imprecise Nonparametric Predictive Inference model is involved in the cautious selection of splitting variables in the construction of imprecise classification trees, which are able to describe a structure and allow for a reasonably high predictive power. Depending on the interpretation of vagueness, different strategies for vague data are then discussed in terms of finite random closed sets: On the one hand, the data to be analysed are regarded as set-valued answers to an item in a questionnaire, where each possible answer corresponding to a subset of the sample space is interpreted as a separate entity. By this the finite random set is reduced to an (ordinary) random variable on a transformed sample space. The context of application is the analysis of voting intentions, where it is shown that the presented approach is able to characterise the undecided in a more detailed way than common approaches can. Although the presented analysis, regarded as a first step, is carried out on set-valued data, which are suitably self-constructed with respect to the scientific research question, it still clearly demonstrates that the full potential of this quite general framework is not exhausted. It is capable of dealing with more complex applications. 
On the other hand, the vague data are produced by set-valued single imputation (imprecise imputation) where the finite random sets are interpreted as being the result of some (unspecified) coarsening. The approach is presented within the context of statistical matching, which is used to gain joint knowledge on features that were not jointly collected in the initial data production. This is especially relevant in data production, e.g. in official statistics, as it allows the information of already accessible data sets to be fused into a new one, without the requirement of actual data collection in the field. Finally, in order to share data, they need to be suitably anonymised. For the specific class of anonymisation techniques of microaggregation, its suitability for inference in generalised linear regression models is evaluated. To this end, the microaggregated data are regarded as a set of compatible, unobserved underlying data situations. Two strategies to follow are proposed. First, a maximax-like optimisation strategy is pursued, in which the underlying unobserved data are incorporated into the regression model as nuisance parameters, providing a concise yet over-optimistic estimation of the regression coefficients. Secondly, an approach in terms of partial identification, which is inherently more cautious than the previous one, is applied to estimate the set of all regression coefficients that are obtained by performing the estimation on each compatible data situation. Vague data are deemed preferable to precise data as they additionally encompass the uncertainty of the individual observation, and therefore they have a higher informational value. However, to the present day, there are few (credible) statistical models that are able to deal with vague or set-valued data. For this reason, the collection of such data is neglected in data production, preventing such models from exhausting their full potential. 
This in turn prevents a thorough evaluation, negatively affecting the (further) development of such models. This situation is a variant of the chicken-or-egg dilemma. The ambition of this thesis is to break this cycle by providing actual methods for dealing with vague data in relevant situations in practice, to stimulate the required data production.

The safety case and the lessons learned for the reliability and maintainability case

Author: Bedford T.J.
Revie Matthew
Walls L.A.
Publication venue
Publication date: 01/01/2005
Field of study

This paper examines the safety case and the lessons learned for the reliability and maintainability case.

University of Strathclyde Institutional Repository

A survey on utilization of data mining approaches for dermatological (skin) diseases prediction

Author: Adibi N
Ahmadzadeh MR
Barati E
Mohammadi A
Saraee MH
Publication venue: Cyber Journals
Publication date: 01/03/2011
Field of study

Due to recent technological advances, large volumes of medical data are obtained. These data contain valuable information. Therefore, data mining techniques can be used to extract useful patterns. This paper is intended to introduce data mining and its various techniques and to survey the available literature on medical data mining. We mainly emphasize the application of data mining to skin diseases. A categorization has been provided based on the different data mining techniques. The utility of the various data mining methodologies is highlighted. Generally, association mining is suitable for extracting rules. It has been used especially in cancer diagnosis. Classification is a robust method in medical mining. In this paper, we have summarized the different uses of classification in dermatology. It is one of the most important methods for diagnosis of erythemato-squamous diseases. There are different methods like Neural Networks, Genetic Algorithms and fuzzy classification in this topic. Clustering is a useful method in medical image mining. The purpose of clustering techniques is to find a structure for the given data by finding similarities between data according to data characteristics. Clustering has some applications in dermatology. Besides introducing different mining methods, we have investigated some challenges which exist in mining skin data.

University of Salford Institutional Repository

The motivation to express prejudice

Author: Cox William T.L.
Devine Patricia G.
Forscher Patrick S.
Graetz Nicholas
Publication venue: ScholarWorks@UARK
Publication date: 01/01/2015
Field of study

Contemporary prejudice research focuses primarily on people who are motivated to respond without prejudice and the ways in which unintentional bias can cause these people to act inconsistently with this motivation. However, some real-world phenomena (e.g., hate speech, hate crimes) and experimental findings (e.g., Plant & Devine, 2001; 2009) suggest that some expressions of prejudice are intentional. These phenomena and findings are difficult to explain solely from the motivations to respond without prejudice. We argue that some people are motivated to express prejudice, and we develop the motivation to express prejudice (MP) scale to measure this motivation. In seven studies involving more than 6,000 participants, we demonstrate that, across scale versions targeted at Black people and gay men, the MP scale has good reliability and convergent, discriminant, and predictive validity. In normative climates that prohibit prejudice, the internal and external motivations to express prejudice are functionally non-independent, but they become more independent when normative climates permit more prejudice toward a target group. People high in the motivation to express prejudice are relatively likely to resist pressure to support programs promoting intergroup contact and vote for political candidates who support oppressive policies. The motivation to express prejudice predicted these outcomes even when controlling for attitudes and the motivations to respond without prejudice. This work encourages contemporary prejudice researchers to broaden the range of samples, target groups, and phenomena that they study, and more generally to consider the intentional aspects of negative intergroup behavior.

ScholarWorks@UARK

UARK (University of Arkansas)

PubMed Central