312 research outputs found

    Dealing with Missing Data and Uncertainty in the Context of Data Mining

    Get PDF
    Missing data is an issue in many real-world datasets yet robust methods for dealing with missing data appropriately still need development. In this paper we conduct an investigation of how some methods for handling missing data perform when the uncertainty increases. Using benchmark datasets from the UCI Machine Learning repository we generate datasets for our experimentation with increasing amounts of data Missing Completely At Random (MCAR) both at the attribute level and at the record level. We then apply four classification algorithms: C4.5, Random Forest, Naïve Bayes and Support Vector Machines (SVMs). We measure the performance of each classifiers on the basis of complete case analysis, simple imputation and then we study the performance of the algorithms that can handle missing data. We find that complete case analysis has a detrimental effect because it renders many datasets infeasible when missing data increases, particularly for high dimensional data. We find that increasing missing data does have a negative effect on the performance of all the algorithms tested but the different algorithms tested either using preprocessing in the form of simple imputation or handling the missing data do not show a significant difference in performance

    Guidelines on the diagnosis, clinical assessments, treatment and management for CLN2 disease patients.

    Get PDF
    BACKGROUND: CLN2 disease (Neuronal Ceroid Lipofuscinosis Type 2) is an ultra-rare, neurodegenerative lysosomal storage disease, caused by an enzyme deficiency of tripeptidyl peptidase 1 (TPP1). Lack of disease awareness and the non-specificity of presenting symptoms often leads to delayed diagnosis. These guidelines provide robust evidence-based, expert-agreed recommendations on the risks/benefits of disease-modifying treatments and the medical interventions used to manage this condition. METHODS: An expert mapping tool process was developed ranking multidisciplinary professionals, with knowledge of CLN2 disease, diagnostic or management experience of CLN2 disease, or family support professionals. Individuals were sequentially approached to identify two chairs, ensuring that the process was transparent and unbiased. A systematic literature review of published evidence using Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidance was independently and simultaneously conducted to develop key statements based upon the strength of the publications. Clinical care statements formed the basis of an international modified Delphi consensus determination process using the virtual meeting (Within3) online platform which requested experts to agree or disagree with any changes. Statements reaching the consensus mark became the guiding statements within this manuscript, which were subsequently assessed against the Appraisal of Guidelines for Research and Evaluation (AGREEII) criteria. RESULTS: Twenty-one international experts from 7 different specialities, including a patient advocate, were identified. Fifty-three guideline statements were developed covering 13 domains: General Description and Statements, Diagnostics, Clinical Recommendations and Management, Assessments, Interventions and Treatment, Additional Care Considerations, Social Care Considerations, Pain Management, Epilepsy / Seizures, Nutritional Care Interventions, Respiratory Health, Sleep and Rest, and End of Life Care. Consensus was reached after a single round of voting, with one exception which was revised, and agreed by 100% of the SC and achieved 80% consensus in the second voting round. The overall AGREE II assessment score obtained for the development of the guidelines was 5.7 (where 1 represents the lowest quality, and 7 represents the highest quality). CONCLUSION: This program provides robust evidence- and consensus-driven guidelines that can be used by all healthcare professionals involved in the management of patients with CLN2 disease and other neurodegenerative disorders. This addresses the clinical need to complement other information available

    Genome-wide association study: Exploring the genetic basis for responsiveness to ketogenic dietary therapies for drug-resistant epilepsy

    Get PDF
    OBJECTIVE: With the exception of specific metabolic disorders, predictors of response to ketogenic dietary therapies (KDTs) are unknown. We aimed to determine whether common variation across the genome influences the response to KDT for epilepsy. METHODS: We genotyped individuals who were negative for glucose transporter type 1 deficiency syndrome or other metabolic disorders, who received KDT for epilepsy. Genotyping was performed with the Infinium HumanOmniExpressExome Beadchip. Hospital records were used to obtain demographic and clinical data. KDT response (≥50% seizure reduction) at 3-month follow-up was used to dissect out nonresponders and responders. We then performed a genome-wide association study (GWAS) in nonresponders vs responders, using a linear mixed model and correcting for population stratification. Variants with minor allele frequency <0.05 and those that did not pass quality control filtering were excluded. RESULTS: After quality control filtering, the GWAS of 112 nonresponders vs 123 responders revealed an association locus at 6p25.1, 61 kb upstream of CDYL (rs12204701, P = 3.83 × 10-8 , odds ratio [A] = 13.5, 95% confidence interval [CI] 4.07-44.8). Although analysis of regional linkage disequilibrium around rs12204701 did not strengthen the likelihood of CDYL being the candidate gene, additional bioinformatic analyses suggest it is the most likely candidate. SIGNIFICANCE: CDYL deficiency has been shown to disrupt neuronal migration and to influence susceptibility to epilepsy in mice. Further exploration with a larger replication cohort is warranted to clarify whether CDYL is the causal gene underlying the association signal

    Loss of neuronal network resilience precedes seizures and determines the ictogenic nature of interictal synaptic perturbations

    Get PDF
    The mechanisms of seizure emergence, and the role of brief interictal epileptiform discharges (IEDs) in seizure generation are two of the most important unresolved issues in modern epilepsy research. Our study shows that the transition to seizure is not a sudden phenomenon,but a slow process characterized by the progressive loss of neuronal network resilience. From a dynamical perspective, the slow transition is governed by the principles of critical slowing, a robust natural phenomenon observable in systems characterized by transitions between dynamical regimes. In epilepsy, this process is modulated by the synchronous synaptic input from IEDs. IEDs are external perturbations that produce phasic changes in the slow transition process and exert opposing effects on the dynamics of a seizure-generating network, causing either anti-seizure or pro-seizure effects. We show that the multifaceted nature of IEDs is defined by the dynamical state of the network at the moment of the discharge occurrence

    Higuchi Dimension of Digital Images

    Get PDF
    There exist several methods for calculating the fractal dimension of objects represented as 2D digital images. For example, Box counting, Minkowski dilation or Fourier analysis can be employed. However, there appear to be some limitations. It is not possible to calculate only the fractal dimension of an irregular region of interest in an image or to perform the calculations in a particular direction along a line on an arbitrary angle through the image. The calculations must be made for the whole image. In this paper, a new method to overcome these limitations is proposed. 2D images are appropriately prepared in order to apply 1D signal analyses, originally developed to investigate nonlinear time series. The Higuchi dimension of these 1D signals is calculated using Higuchi's algorithm, and it is shown that both regions of interests and directional dependencies can be evaluated independently of the whole picture. A thorough validation of the proposed technique and a comparison of the new method to the Fourier dimension, a common two dimensional method for digital images, are given. The main result is that Higuchi's algorithm allows a direction dependent as well as direction independent analysis. Actual values for the fractal dimensions are reliable and an effective treatment of regions of interests is possible. Moreover, the proposed method is not restricted to Higuchi's algorithm, as any 1D method of analysis, can be applied

    The Pioneer Anomaly

    Get PDF
    Radio-metric Doppler tracking data received from the Pioneer 10 and 11 spacecraft from heliocentric distances of 20-70 AU has consistently indicated the presence of a small, anomalous, blue-shifted frequency drift uniformly changing with a rate of ~6 x 10^{-9} Hz/s. Ultimately, the drift was interpreted as a constant sunward deceleration of each particular spacecraft at the level of a_P = (8.74 +/- 1.33) x 10^{-10} m/s^2. This apparent violation of the Newton's gravitational inverse-square law has become known as the Pioneer anomaly; the nature of this anomaly remains unexplained. In this review, we summarize the current knowledge of the physical properties of the anomaly and the conditions that led to its detection and characterization. We review various mechanisms proposed to explain the anomaly and discuss the current state of efforts to determine its nature. A comprehensive new investigation of the anomalous behavior of the two Pioneers has begun recently. The new efforts rely on the much-extended set of radio-metric Doppler data for both spacecraft in conjunction with the newly available complete record of their telemetry files and a large archive of original project documentation. As the new study is yet to report its findings, this review provides the necessary background for the new results to appear in the near future. In particular, we provide a significant amount of information on the design, operations and behavior of the two Pioneers during their entire missions, including descriptions of various data formats and techniques used for their navigation and radio-science data analysis. As most of this information was recovered relatively recently, it was not used in the previous studies of the Pioneer anomaly, but it is critical for the new investigation.Comment: 165 pages, 40 figures, 16 tables; accepted for publication in Living Reviews in Relativit

    Tipping points in the dynamics of speciation.

    Get PDF
    Speciation can be gradual or sudden and involve few or many genetic changes. Inferring the processes generating such patterns is difficult, and may require consideration of emergent and non-linear properties of speciation, such as when small changes at tipping points have large effects on differentiation. Tipping points involve positive feedback and indirect selection stemming from associations between genomic regions, bi-stability due to effects of initial conditions and evolutionary history, and dependence on modularity of system components. These features are associated with sudden 'regime shifts' in other cellular, ecological, and societal systems. Thus, tools used to understand other complex systems could be fruitfully applied in speciation research
    • …
    corecore