305 research outputs found

    Learning from networked examples

    Get PDF
    Many machine learning algorithms are based on the assumption that training examples are drawn independently. However, this assumption does not hold anymore when learning from a networked sample because two or more training examples may share some common objects, and hence share the features of these shared objects. We show that the classic approach of ignoring this problem potentially can have a harmful effect on the accuracy of statistics, and then consider alternatives. One of these is to only use independent examples, discarding other information. However, this is clearly suboptimal. We analyze sample error bounds in this networked setting, providing significantly improved results. An important component of our approach is formed by efficient sample weighting schemes, which leads to novel concentration inequalities

    The archive solution for distributed workflow management agents of the CMS experiment at LHC

    Full text link
    The CMS experiment at the CERN LHC developed the Workflow Management Archive system to persistently store unstructured framework job report documents produced by distributed workflow management agents. In this paper we present its architecture, implementation, deployment, and integration with the CMS and CERN computing infrastructures, such as central HDFS and Hadoop Spark cluster. The system leverages modern technologies such as a document oriented database and the Hadoop eco-system to provide the necessary flexibility to reliably process, store, and aggregate O\mathcal{O}(1M) documents on a daily basis. We describe the data transformation, the short and long term storage layers, the query language, along with the aggregation pipeline developed to visualize various performance metrics to assist CMS data operators in assessing the performance of the CMS computing system.Comment: This is a pre-print of an article published in Computing and Software for Big Science. The final authenticated version is available online at: https://doi.org/10.1007/s41781-018-0005-

    The CMS DBS Query Language

    Get PDF
    The CMS experiment has implemented a flexible and powerful system enabling users to find data within the CMS physics data catalog. The Dataset Bookkeeping Service (DBS) comprises a database and the services used to store and access metadata related to CMS physics data. To this, we have added a generalized query system in addition to the existing web and programmatic interfaces to the DBS. This query system is based on a query language that hides the complexity of the underlying database structure by discovering the join conditions between database tables. This provides a way of querying the system that is simple and straightforward for CMS data managers and physicists to use without requiring knowledge of the database tables or keys. The DBS Query Language uses the ANTLR tool to build the input query parser and tokenizer, followed by a query builder that uses a graph representation of the DBS schema to construct the SQL query sent to underlying database. We will describe the design of the query system, provide details of the language components and overview of how this component fits into the overall data discovery system architecture

    Which horticultural activities are more effective for children’s recovery from stress and mental fatigue? A quasi-experimental study

    Get PDF
    IntroductionStudies have established the benefits of horticultural therapy and activities for human health and well-being. Nonetheless, limited research has been conducted on the potential restorative advantages and distinctions between different types of horticultural activities in terms of stress reduction.MethodsThis study employed a quantitative research method to investigate the stress recovery benefits of five horticultural activities (flower arrangement, sowing and transplanting seeds, kokedama crafting, pressed flower card making, and decorative bottle painting with dried flowers) and one reference activity (short composition writing) for children. The experiment was conducted in a children’s activity center’s multi-purpose classroom with 48 elementary students aged 9–12 years. The subjects first took a stress test to induce stress and then engaged in horticultural activities for 20 min. Physiological stress was assessed using electrocardiograms and electroencephalograms as feedback indicators. Psychological and emotional changes were determined using the Positive and Negative Affect Schedule for Children and Self-Assessment Manikin scales.ResultsThe results demonstrated that horticultural activities greatly reduced physiological fatigue, and their recovery benefits were significantly greater than those of the reference activity. The recovery effects from different horticultural activities were similar across physiological indicators, although flower arrangement and sowing and transplanting seeds exhibited relatively robust recovery benefits. The heart rate and α-EEG-based generalized estimating equation revealed that horticultural activities offered significantly better relative recovery at each time phase of operation than the reference activity, with girls showing a 3.68% higher relative recovery value than boys. Flower arrangement and kokedama crafting offered better physiological recovery for students with prior horticultural experience, and these two activities received the highest scores in terms of positive effects and the “pleasure” dimension. Students believed that participating in horticultural activities resulted in a noteworthy increase in personal confidence and a greater sense of achievement.ConclusionThe study suggests that horticultural activities that involve real and vibrant plants or natural materials and are more attractive have more stress-relieving benefits. We conclude that horticultural activities are beneficial leisure activities that aid in stress relief for children and that it is important to consider the attributes of activities when developing horticultural programs for elementary students

    Antibiotics and antibiotic resistance genes in global lakes:A review and meta-analysis

    Get PDF
    Lakes are an important source of freshwater, containing nearly 90% of the liquid surface fresh water worldwide. Long retention times in lakes mean pollutants from discharges slowly circulate around the lakes and may lead to high ecological risk for ecosystem and human health. In recent decades, antibiotics and antibiotic resistance genes (ARGs) have been regarded as emerging pollutants. The occurrence and distribution of antibiotics and ARGs in global freshwater lakes are summarized to show the pollution level of antibiotics and ARGs and to identify some of the potential risks to ecosystem and human health. Fifty-seven antibiotics were reported at least once in the studied lakes. Our meta-analysis shows that sulfamethoxazole, sulfamerazine, sulfameter, tetracycline, oxytetracycline, erythromycin, and roxithromycin were found at high concentrations in both lake water and lake sediment. There is no significant difference in the concentration of sulfonamides in lake water from China and that from other countries worldwide; however, there was a significant difference in quinolones. Erythromycin had the lowest predicted hazardous concentration for 5% of the species (HC5) and the highest ecological risk in lakes. There was no significant difference in the concentration of sulfonamide resistance genes (sul1 and sul2) in lake water and river water. There is surprisingly limited research on the role of aquatic biota in propagation of ARGs in freshwater lakes. As an environment that is susceptible to cumulative build-up of pollutants, lakes provide an important environment to study the fate of antibiotics and transport of ARGs with a broad range of niches including bacterial community, aquatic plants and animals

    Classical Simulation of Relativistic Quantum Mechanics in Periodic Optical Structures

    Full text link
    Spatial and/or temporal propagation of light waves in periodic optical structures offers a rather unique possibility to realize in a purely classical setting the optical analogues of a wide variety of quantum phenomena rooted in relativistic wave equations. In this work a brief overview of a few optical analogues of relativistic quantum phenomena, based on either spatial light transport in engineered photonic lattices or on temporal pulse propagation in Bragg grating structures, is presented. Examples include spatial and temporal photonic analogues of the Zitterbewegung of a relativistic electron, Klein tunneling, vacuum decay and pair-production, the Dirac oscillator, the relativistic Kronig-Penney model, and optical realizations of non-Hermitian extensions of relativistic wave equations.Comment: review article (invited), 14 pages, 7 figures, 105 reference

    Real-time Monitoring for the Next Core-Collapse Supernova in JUNO

    Full text link
    Core-collapse supernova (CCSN) is one of the most energetic astrophysical events in the Universe. The early and prompt detection of neutrinos before (pre-SN) and during the SN burst is a unique opportunity to realize the multi-messenger observation of the CCSN events. In this work, we describe the monitoring concept and present the sensitivity of the system to the pre-SN and SN neutrinos at the Jiangmen Underground Neutrino Observatory (JUNO), which is a 20 kton liquid scintillator detector under construction in South China. The real-time monitoring system is designed with both the prompt monitors on the electronic board and online monitors at the data acquisition stage, in order to ensure both the alert speed and alert coverage of progenitor stars. By assuming a false alert rate of 1 per year, this monitoring system can be sensitive to the pre-SN neutrinos up to the distance of about 1.6 (0.9) kpc and SN neutrinos up to about 370 (360) kpc for a progenitor mass of 30MM_{\odot} for the case of normal (inverted) mass ordering. The pointing ability of the CCSN is evaluated by using the accumulated event anisotropy of the inverse beta decay interactions from pre-SN or SN neutrinos, which, along with the early alert, can play important roles for the followup multi-messenger observations of the next Galactic or nearby extragalactic CCSN.Comment: 24 pages, 9 figure

    Large expert-curated database for benchmarking document similarity detection in biomedical literature search

    Get PDF
    Document recommendation systems for locating relevant literature have mostly relied on methods developed a decade ago. This is largely due to the lack of a large offline gold-standard benchmark of relevant documents that cover a variety of research fields such that newly developed literature search techniques can be compared, improved and translated into practice. To overcome this bottleneck, we have established the RElevant LIterature SearcH consortium consisting of more than 1500 scientists from 84 countries, who have collectively annotated the relevance of over 180 000 PubMed-listed articles with regard to their respective seed (input) article/s. The majority of annotations were contributed by highly experienced, original authors of the seed articles. The collected data cover 76% of all unique PubMed Medical Subject Headings descriptors. No systematic biases were observed across different experience levels, research fields or time spent on annotations. More importantly, annotations of the same document pairs contributed by different scientists were highly concordant. We further show that the three representative baseline methods used to generate recommended articles for evaluation (Okapi Best Matching 25, Term Frequency-Inverse Document Frequency and PubMed Related Articles) had similar overall performances. Additionally, we found that these methods each tend to produce distinct collections of recommended articles, suggesting that a hybrid method may be required to completely capture all relevant articles. The established database server located at https://relishdb.ict.griffith.edu.au is freely available for the downloading of annotation data and the blind testing of new methods. We expect that this benchmark will be useful for stimulating the development of new powerful techniques for title and title/abstract-based search engines for relevant articles in biomedical research.Peer reviewe
    corecore