265 research outputs found
System log pre-processing to improve failure prediction
Log preprocessing, a process applied on the raw log be-fore applying a predictive method, is of paramount impor-tance to failure prediction and diagnosis. While existing fil-tering methods have demonstrated good compression rate, they fail to preserve important failure patterns that are cru-cial for failure analysis. To address the problem, in this paper we present a log preprocessing method. It consists of three integrated steps: (1) event categorization to uni-formly classify system events and identify fatal events; (2) event filtering to remove temporal and spatial redundant records, while also preserving necessary failure patterns for failure analysis; (3) causality-related filtering to com-bine correlated events for filtering through apriori associ-ation rule mining. We demonstrate the effectiveness of our preprocessing method by using real failure logs collected from the Cray XT4 at ORNL and the Blue Gene/L system at SDSC. Experiments show that our method can preserve more failure patterns for failure analysis, thereby improv-ing failure prediction by up to 174%
Recommended from our members
Data Standards for the Genomes to Life Program
Existing GTL Projects already have produced volumes of dataand, over the course of the next five years, will produce an estimatedhundreds, or possibly thousands, of terabytes of data from hundreds ofexperiments conducted at dozens of laboratories in National Labs anduniversities across the nation. These data will be the basis forpublications by individual researchers, research groups, andmulti-institutional collaborations, and the basis for future DOEdecisions on funding further research in bioremediation. The short-termand long-term value of the data to project participants, to the DOE, andto the nation depends, however, on being able to access the data and onhow, or whether, the data are archived. The ability to access data is thestarting point for data analysis and interpretation, data integration,data mining, and development of data-driven models. Limited orinefficient data access means that less data are analyzed in acost-effective and timely manner. Data production in the GTL Program willlikely outstrip, or may have already outstripped, the ability to analyzethe data. Being able to access data depends on two key factors: datastandards and implementation of the data standards. For the purpose ofthis proposal, a data standard is defined as a standard, documented wayin which data and information about the data are describe. The attributesof the experiment in which the data were collected need to be known andthe measurements corresponding to the data collected need to bedescribed. In general terms, a data standard could be a form (electronicor paper) that is completed by a researcher or a document that prescribeshow a protocol or experiment should be described in writing.Datastandards are critical to data access because they provide a frameworkfor organizing and managing data. Researchers spend significant amountsof time managing data and information about experiments using labnotebooks, computer files, Excel spreadsheets, etc. In addition, dataoutput format varies for different equipment and usually need to beformatted differently for the variety of computer programs used todisplay and analyze the data. If, however, data for a given type ofexperiment were converted from vendor format to a format defined by adata standard, then researchers and software developers could save time.In addition, if data and information describing how they were obtainedwere available in a consistent format throughout the GTL Program,comparison and integration of results would be facilitated and a datarepository could be built to encourage project-wide data mining.Datastandards also are essential for archiving data sets. If data are storedtogether with the experiment metadata (i.e., information about the data)in an 'information/data package', then the data retain their value due tothe accessibility of information about measurement and analysisprocedures.DOE's commitment to developing data standards for the GTLProgram is needed to ensure that the most value is obtained from DOE'sexpenditures on experimental work and to provide a data repository thatcan be used as the basis for on-going model development. By developingdata standards for experiments conducted as part of the GTL Program, DOEhas the opportunity to facilitate data sharing not only within the DOEcommunity, but also with research institutes through theworld
Dynamics of the Drosophila Circadian Clock: Theoretical Anti-Jitter Network and Controlled Chaos
Background: Electronic clocks exhibit undesirable jitter or time variations in periodic signals. The circadian clocks of humans, some animals, and plants consist of oscillating molecular networks with peak-to-peak time of approximately 24 hours. Clockwork orange (CWO) is a transcriptional repressor of Drosophila direct target genes. Methodology/Principal Findings: Theory and data from a model of the Drosophila circadian clock support the idea that CWO controls anti-jitter negative circuits that stabilize peak-to-peak time in light-dark cycles (LD). The orbit is confined to chaotic attractors in both LD and dark cycles and is almost periodic in LD; furthermore, CWO diminishes the Euclidean dimension of the chaotic attractor in LD. Light resets the clock each day by restricting each molecular peak to the proximity of a prescribed time. Conclusions/Significance: The theoretical results suggest that chaos plays a central role in the dynamics of the Drosophila circadian clock and that a single molecule, CWO, may sense jitter and repress it by its negative loops
Carbon Sequestration in Synechococcus Sp.: From Molecular Machines to Hierarchical Modeling
The U.S. Department of Energy recently announced the first five grants for the Genomes to Life (GTL) Program. The goal of this program is to "achieve the most far-reaching of all biological goals: a fundamental, comprehensive, and systematic understanding of life." While more information about the program can be found at the GTL website (www.doegenomestolife.org), this paper provides an overview of one of the five GTL projects funded, "Carbon Sequestration in Synechococcus Sp.: From Molecular Machines to Hierarchical Modeling." This project is a combined experimental and computational effort emphasizing developing, prototyping, and applying new computational tools and methods to ellucidate the biochemical mechanisms of the carbon sequestration of Synechococcus Sp., an abundant marine cyanobacteria known to play an important role in the global carbon cycle. Understanding, predicting, and perhaps manipulating carbon fixation in the oceans has long been a major focus of biological oceanography and has more recently been of interest to a broader audience of scientists and policy makers. It is clear that the oceanic sinks and sources of CO2 are important terms in the global environmental response to anthropogenic atmospheric inputs of CO2 and that oceanic microorganisms play a key role in this response. However, the relationship between this global phenomenon and the biochemical mechanisms of carbon fixation in these microorganisms is poorly understood. The project includes five subprojects: an experimental investigation, three computational biology efforts, and a fifth which deals with addressing computational infrastructure challenges of relevance to this project and the Genomes to Life program as a whole. Our experimental effort is designed to provide biology and data to drive the computational efforts and includes significant investment in developing new experimental methods for uncovering protein partners, characterizing protein complexes, identifying new binding domains. We will also develop and apply new data measurement and statistical methods for analyzing microarray experiments. Our computational efforts include coupling molecular simulation methods with knowledge discovery from diverse biological data sets for high-throughput discovery and characterization of protein-protein complexes and developing a set of novel capabilities for inference of regulatory pathways in microbial genomes across multiple sources of information through the integration of computational and experimental technologies. These capabilities will be applied to Synechococcus regulatory pathways to characterize their interaction map and identify component proteins in these pathways. We will also investigate methods for combining experimental and computational results with visualization and natural language tools to accelerate discovery of regulatory pathways. Furthermore, given that the ultimate goal of this effort is to develop a systems-level of understanding of how the Synechococcus genome affects carbon fixation at the global scale, we will develop and apply a set of tools for capturing the carbon fixation behavior of complex of Synechococcus at different levels of resolution. Finally, because the explosion of data being produced by high-throughput experiments requires data analysis and models which are more computationally complex, more heterogeneous, and require coupling to ever increasing amounts of experimentally obtained data in varying formats, we have also established a companion computational infrastructure to support this effort as well as the Genomes to Life program as a whole.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/63164/1/153623102321112746.pd
Advice on assistance and protection from the Scientific Advisory Board of the Organisation for the Prohibition of Chemical Weapons : Part 2. On preventing and treating health effects from acute, prolonged, and repeated nerve agent exposure, and the identification of medical countermeasures able to reduce or eliminate the longer term health effects of nerve agents
The Scientific Advisory Board (SAB) of the Organisation for the Prohibition of Chemical Weapons (OPCW) has provided advice in relation to the Chemical Weapons Convention on assistance and protection. We present the SAB’s response to a request from the OPCW Director-General in 2014 for information on the best practices for preventing and treating the health effects from acute, prolonged, and repeated organophosphorus nerve agent (NA) exposure. The report summarises pre- and post-exposure treatments, and developments in decontaminants and adsorbing materials, that at the time of the advice, were available for NAs. The updated information provided could assist medics and emergency responders unfamiliar with treatment and decontamination options related to exposure to NAs. The SAB recommended that developments in research on medical countermeasures and decontaminants for NAs should be monitored by the OPCW, and used in assistance and protection training courses and workshops organised through its capacity building programmes.Peer reviewe
Advice from the Scientific Advisory Board of the Organisation for the Prohibition of Chemical Weapons on riot control agents in connection to the Chemical Weapons Convention
Compounds that cause powerful sensory irritation to humans were reviewed by the Scientific Advisory Board (SAB) of the Organisation for the Prohibition of Chemical Weapons (OPCW) in response to requests in 2014 and 2017 by the OPCW Director-General to advise which riot control agents (RCAs) might be subject to declaration under the Chemical Weapons Convention (the Convention). The chemical and toxicological properties of 60 chemicals identified from a survey by the OPCW of RCAs that had been researched or were available for purchase, and additional chemicals recognised by the SAB as having potential RCA applications, were considered. Only 17 of the 60 chemicals met the definition of a RCA under the Convention. These findings were provided to the States Parties of the Convention to inform the implementation of obligations pertaining to RCAs under this international chemical disarmament and non-proliferation treaty.Peer reviewe
The Long Term Response of Birds to Climate Change: New Results from a Cold Stage Avifauna in Northern England
The early MIS 3 (55–40 Kyr BP associated with Middle Palaeolithic archaeology) bird remains from Pin Hole, Creswell Crags, Derbyshire, England are analysed in the context of the new dating of the site’s stratigraphy. The analysis is restricted to the material from the early MIS 3 level of the cave because the upper fauna is now known to include Holocene material as well as that from the Late Glacial. The results of the analysis confirm the presence of the taxa, possibly unexpected for a Late Pleistocene glacial deposit including records such as Alpine swift, demoiselle crane and long-legged buzzard with southern and/or eastern distributions today. These taxa are accompanied by more expected ones such as willow ptarmigan /red grouse and rock ptarmigan living today in northern and montane areas. Finally, there are temperate taxa normally requiring trees for nesting such as wood pigeon and grey heron. Therefore, the result of the analysis is that the avifauna of early MIS 3 in England included taxa whose ranges today do not overlap making it a non-analogue community similar to the many steppe-tundra mammalian faunas of the time. The inclusion of more temperate and woodland taxa is discussed in the light that parts of northern Europe may have acted as cryptic northern refugia for some such taxa during the last glacial. These records showing former ranges of taxa are considered in the light of modern phylogeographic studies as these often assume former ranges without considering the fossil record of those taxa. In addition to the anomalous combination of taxa during MIS 3 living in Derbyshire, the individuals of a number of the taxa are different in size and shape to members of the species today probably due to the high carrying capacity of the steppe-tundra
Variability of Female Responses to Conspecific vs. Heterospecific Male Mating Calls in Polygynous Deer: An Open Door to Hybridization?
Males of all polygynous deer species (Cervinae) give conspicuous calls during the reproductive season. The extreme interspecific diversity that characterizes these vocalizations suggests that they play a strong role in species discrimination. However, interbreeding between several species of Cervinae indicates permeable interspecific reproductive barriers. This study examines the contribution of vocal behavior to female species discrimination and mating preferences in two closely related polygynous deer species known to hybridize in the wild after introductions. Specifically, we investigate the reaction of estrous female red deer (Cervus elaphus) to playbacks of red deer vs. sika deer (Cervus nippon) male mating calls, with the prediction that females will prefer conspecific calls. While on average female red deer preferred male red deer roars, two out of twenty females spent more time in close proximity to the speaker broadcasting male sika deer moans. We suggest that this absence of strict vocal preference for species-specific mating calls may contribute to the permeability of pre-zygotic reproductive barriers observed between these species. Our results also highlight the importance of examining inter-individual variation when studying the role of female preferences in species discrimination and intraspecific mate selection
Complexity Variability Assessment of Nonlinear Time-Varying Cardiovascular Control
The application of complex systems theory to physiology and medicine has provided meaningful information about the nonlinear aspects underlying the dynamics of a wide range of biological processes and their disease-related aberrations. However, no studies have investigated whether meaningful information can be extracted by quantifying second-order moments of time-varying cardiovascular complexity. To this extent, we introduce a novel mathematical framework termed complexity variability, in which the variance of instantaneous Lyapunov spectra estimated over time serves as a reference quantifier. We apply the proposed methodology to four exemplary studies involving disorders which stem from cardiology, neurology and psychiatry: Congestive Heart Failure (CHF), Major Depression Disorder (MDD), Parkinson?s Disease (PD), and Post-Traumatic Stress Disorder (PTSD) patients with insomnia under a yoga training regime. We show that complexity assessments derived from simple time-averaging are not able to discern pathology-related changes in autonomic control, and we demonstrate that between-group differences in measures of complexity variability are consistent across pathologies. Pathological states such as CHF, MDD, and PD are associated with an increased complexity variability when compared to healthy controls, whereas wellbeing derived from yoga in PTSD is associated with lower time-variance of complexity
- …