763 research outputs found

    Erschließen von Freitextfeldern mittels Text Mining und die Qualität der gewonnenen Informationen

    Get PDF
    Vermehrt fallen innerhalb von Firmen neben den einfach auszuwertenden strukturierten Daten, auch unstrukturierte Daten in Form von Freitexten an. In dieser Ausarbeitung werden Techniken zur Strukturierung von Freitexten sowie verwandte Arbeiten und Vor- und Nachteile der Nutzung von Freitexten vorgestellt. Der Fokus liegt auf der Repräsentation der Daten als Vektoren und der Filterung von Stoppwörtern. Außerdem wird ein Prototyp zum Clustern von Freitextfeldern vorgestellt und auf einen Datensatz der NHTSA angewendet. Durch die Anwendung des Prototyps auf den NHTSA-Datensatz wird geklärt, inwiefern dieser Informationen in den Freitextfelder enthält, die nicht in den strukturierten Daten enthalten sind. Und ob das Clustering zu vollständigeren Informationen, das heißt zur erhöhter Datenqualität führt. Die Beantwortung geschieht durch Datenanalysen auf den vom Prototyp erweiterten Datensatz. Eine zusätzliche Anwendung und Auswertung des Prototyps, findet auf einen Datensatz aus der Industrie statt

    Injuries prevalence in elite male artistic gymnasts

    Get PDF
    A proposta deste estudo foi investigar a prevalência de lesões na ginástica artística masculina de alto rendimento. Vinte ginastas brasileiros da categoria adulta, com 23,1 ± 6,5 anos, 13,9 ± 5,0 anos de prá- tica no esporte e 36,5 ± 4,7 horas de treino semanais, participaram do estudo. Os atletas responderam o questionário de lesões referidas, formulado a partir de estudos da literatura, para obter informações sobre as características e circunstâncias das lesões. Foram considerados dados sobre o aparelho ou evento do treinamento em que ocorreu a lesão, o local anatômico lesionado, o tecido biológico afetado e o retorno às atividades após a lesão. Os dados foram avaliados por meio de estatística descritiva, a partir das frequências absolutas e relativas. A sobrecarga de treinamento e os aparelhos solo, cavalo com alças e salto sobre a mesa foram os eventos com maior frequência de lesões. Em relação ao local anatômico, tornozelo, mãos/dedos e ombro foram as regiões mais citadas. Os tecidos ligamentares e ósseo, bem como a cápsula articular foram os tecidos biológicos mais afetados. Em relação ao retorno às atividades, 56% relataram melhora, 33% reportaram retorno ao mesmo nível prévio à lesão e 10% relataram piora do desempenho após retorno às atividades. As lesões na ginástica artística masculina estão associadas às demandas mecânicas do esporte. A análise dos fatores de risco contribui no entendimento dos mecanismos das lesões na ginástica, bem como pode auxiliar em estratégias efetivas de prevenção.The purpose of this study was to investigate the injuries prevalence in men elite artistic gymnasts. Twenty Brazilian senior gymnasts, aged 23.1 ± 6.5 years, 13.9 ± 5.0 years of practice and 36.5 ± 4.7 hours per week training, participated in this study. The athletes answered a morbidity questionnaire, formulated according to studies from the literature, for information on the injuries’ characteristics and circumstances. Information about the injury circumstances (gymnastic apparatus, overload training and physical exercises), the anatomic site injured, the affect biological tissue and the return to training after injury treatment were evaluated. Data were analyzed by descriptive statistics, absolute and relative frequencies. The training overload, and floor, pommel horse and vault were the events that presented higher injuries frequency. In relation to anatomic site, ankle, hands/fingers and shoulder were the most cited regions. The ligament, bone and articular capsule were the most affected biological tissues. In relation to gymnasts’ return to their sports activities, 56% of them reported a better condition at return, 33% reported to have returned at the same fitness level and 10% indicated that they were in a worse condition when they returned to the sports activities. The men’s artistic gymnastics injuries are related to the mechanical demands of this sport. The analysis of risk factors helps in understanding the injuries mechanisms in gymnastics, and provides relevant information that can assist in effective prevention strategies

    Complex matching of RDF datatype properties

    Get PDF
    Property mapping is a fundamental component of ontology matching, and yet there is little support that goes beyond the identification of single property matches. Real data often requires some degree of composition, trivially exemplified by the mapping of "first name" and "last name" to "full name" on one end, to complex matchings, such as parsing and pairing symbol/digit strings to SSN numbers, at the other end of the spectrum. In this paper, we propose a two-phase instance-based technique for complex datatype property matching. Phase 1 computes the Estimate Mutual Information matrix of the property values to (1) find simple, 1:1 matches, and (2) compute a list of possible complex matches. Phase 2 applies Genetic Programming to the much reduced search space of candidate matches to find complex matches. We conclude with experimental results that illustrate how the technique works. Furthermore, we show that the proposed technique greatly improves results over those obtained if the Estimate Mutual Information matrix or the Genetic Programming techniques were to be used independently. The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-40285-2_18

    International criteria for electrocardiographic interpretation in athletes: Consensus statement.

    Get PDF
    Sudden cardiac death (SCD) is the leading cause of mortality in athletes during sport. A variety of mostly hereditary, structural or electrical cardiac disorders are associated with SCD in young athletes, the majority of which can be identified or suggested by abnormalities on a resting 12-lead electrocardiogram (ECG). Whether used for diagnostic or screening purposes, physicians responsible for the cardiovascular care of athletes should be knowledgeable and competent in ECG interpretation in athletes. However, in most countries a shortage of physician expertise limits wider application of the ECG in the care of the athlete. A critical need exists for physician education in modern ECG interpretation that distinguishes normal physiological adaptations in athletes from distinctly abnormal findings suggestive of underlying pathology. Since the original 2010 European Society of Cardiology recommendations for ECG interpretation in athletes, ECG standards have evolved quickly, advanced by a growing body of scientific data and investigations that both examine proposed criteria sets and establish new evidence to guide refinements. On 26-27 February 2015, an international group of experts in sports cardiology, inherited cardiac disease, and sports medicine convened in Seattle, Washington (USA), to update contemporary standards for ECG interpretation in athletes. The objective of the meeting was to define and revise ECG interpretation standards based on new and emerging research and to develop a clear guide to the proper evaluation of ECG abnormalities in athletes. This statement represents an international consensus for ECG interpretation in athletes and provides expert opinion-based recommendations linking specific ECG abnormalities and the secondary evaluation for conditions associated with SCD

    Measurement of the cosmic ray spectrum above 4×10184{\times}10^{18} eV using inclined events detected with the Pierre Auger Observatory

    Full text link
    A measurement of the cosmic-ray spectrum for energies exceeding 4×10184{\times}10^{18} eV is presented, which is based on the analysis of showers with zenith angles greater than 6060^{\circ} detected with the Pierre Auger Observatory between 1 January 2004 and 31 December 2013. The measured spectrum confirms a flux suppression at the highest energies. Above 5.3×10185.3{\times}10^{18} eV, the "ankle", the flux can be described by a power law EγE^{-\gamma} with index γ=2.70±0.02(stat)±0.1(sys)\gamma=2.70 \pm 0.02 \,\text{(stat)} \pm 0.1\,\text{(sys)} followed by a smooth suppression region. For the energy (EsE_\text{s}) at which the spectral flux has fallen to one-half of its extrapolated value in the absence of suppression, we find Es=(5.12±0.25(stat)1.2+1.0(sys))×1019E_\text{s}=(5.12\pm0.25\,\text{(stat)}^{+1.0}_{-1.2}\,\text{(sys)}){\times}10^{19} eV.Comment: Replaced with published version. Added journal reference and DO

    Energy Estimation of Cosmic Rays with the Engineering Radio Array of the Pierre Auger Observatory

    Full text link
    The Auger Engineering Radio Array (AERA) is part of the Pierre Auger Observatory and is used to detect the radio emission of cosmic-ray air showers. These observations are compared to the data of the surface detector stations of the Observatory, which provide well-calibrated information on the cosmic-ray energies and arrival directions. The response of the radio stations in the 30 to 80 MHz regime has been thoroughly calibrated to enable the reconstruction of the incoming electric field. For the latter, the energy deposit per area is determined from the radio pulses at each observer position and is interpolated using a two-dimensional function that takes into account signal asymmetries due to interference between the geomagnetic and charge-excess emission components. The spatial integral over the signal distribution gives a direct measurement of the energy transferred from the primary cosmic ray into radio emission in the AERA frequency range. We measure 15.8 MeV of radiation energy for a 1 EeV air shower arriving perpendicularly to the geomagnetic field. This radiation energy -- corrected for geometrical effects -- is used as a cosmic-ray energy estimator. Performing an absolute energy calibration against the surface-detector information, we observe that this radio-energy estimator scales quadratically with the cosmic-ray energy as expected for coherent emission. We find an energy resolution of the radio reconstruction of 22% for the data set and 17% for a high-quality subset containing only events with at least five radio stations with signal.Comment: Replaced with published version. Added journal reference and DO

    Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis.

    Get PDF
    Multiple sclerosis is a common disease of the central nervous system in which the interplay between inflammatory and neurodegenerative processes typically results in intermittent neurological disturbance followed by progressive accumulation of disability. Epidemiological studies have shown that genetic factors are primarily responsible for the substantially increased frequency of the disease seen in the relatives of affected individuals, and systematic attempts to identify linkage in multiplex families have confirmed that variation within the major histocompatibility complex (MHC) exerts the greatest individual effect on risk. Modestly powered genome-wide association studies (GWAS) have enabled more than 20 additional risk loci to be identified and have shown that multiple variants exerting modest individual effects have a key role in disease susceptibility. Most of the genetic architecture underlying susceptibility to the disease remains to be defined and is anticipated to require the analysis of sample sizes that are beyond the numbers currently available to individual research groups. In a collaborative GWAS involving 9,772 cases of European descent collected by 23 research groups working in 15 different countries, we have replicated almost all of the previously suggested associations and identified at least a further 29 novel susceptibility loci. Within the MHC we have refined the identity of the HLA-DRB1 risk alleles and confirmed that variation in the HLA-A gene underlies the independent protective effect attributable to the class I region. Immunologically relevant genes are significantly overrepresented among those mapping close to the identified loci and particularly implicate T-helper-cell differentiation in the pathogenesis of multiple sclerosis

    Measurement of the Radiation Energy in the Radio Signal of Extensive Air Showers as a Universal Estimator of Cosmic-Ray Energy

    Full text link
    We measure the energy emitted by extensive air showers in the form of radio emission in the frequency range from 30 to 80 MHz. Exploiting the accurate energy scale of the Pierre Auger Observatory, we obtain a radiation energy of 15.8 \pm 0.7 (stat) \pm 6.7 (sys) MeV for cosmic rays with an energy of 1 EeV arriving perpendicularly to a geomagnetic field of 0.24 G, scaling quadratically with the cosmic-ray energy. A comparison with predictions from state-of-the-art first-principle calculations shows agreement with our measurement. The radiation energy provides direct access to the calorimetric energy in the electromagnetic cascade of extensive air showers. Comparison with our result thus allows the direct calibration of any cosmic-ray radio detector against the well-established energy scale of the Pierre Auger Observatory.Comment: Replaced with published version. Added journal reference and DOI. Supplemental material in the ancillary file
    corecore