4,631 research outputs found

    Analyzing How BERT Performs Entity Matching

    Get PDF
    State-of-the-art Entity Matching (EM) approaches rely on transformer architectures, such as BERT, for generating highly contextualized embeddings of terms. The embeddings are then used to predict whether pairs of entity descriptions refer to the same real-world entity. BERT-based EM models demonstrated to be effective, but act as black-boxes for the users, who have limited insight into the motivations behind their decisions. In this paper, we perform a multi-facet analysis of the components of pre-trained and fine-tuned BERT architectures applied to an EM task. The main findings resulting from our extensive experimental evaluation are (1) the fine-tuning process applied to the EM task mainly modifies the last layers of the BERT components, but in a different way on tokens belonging to descriptions of matching / non-matching entities; (2) the special structure of the EM datasets, where records are pairs of entity descriptions is recognized by BERT; (3) the pair-wise semantic similarity of tokens is not a key knowledge exploited by BERT-based EM models

    Random forests highlight the combined effect of environmental heavy metals exposure and genetic damages for cardiovascular diseases

    Get PDF
    Heavy metals are a dangerous source of pollution due to their toxicity, permanence in the environment and chemical nature. It is well known that long-term exposure to heavy metals is related to several chronic degenerative diseases (cardiovascular diseases, neoplasms, neurodegenerative syndromes, etc.). In this work, we propose a machine learning framework to evaluate the severity of cardiovascular diseases (CVD) from Human scalp hair analysis (HSHA) tests and genetic analysis and identify a small group of these clinical features mostly associated with the CVD risk. Using a private dataset provided by the DD Clinic foundation in Caserta, Italy, we cross-validated the classification performance of a Random Forests model with 90 subjects affected by CVD. The proposed model reached an AUC of 0.78 ± 0.01 on a three class classification problem. The robustness of the predictions was assessed by comparison with different cross-validation schemes and two state-ofthe-art classifiers, such as Artificial Neural Network and General Linear Model. Thus, is the first work that studies, through a machine learning approach, the tight link between CVD severity, heavy metal concentrations and SNPs. Then, the selected features appear highly correlated with the CVD phenotype, and they could represent targets for future CVD therapies

    Hormonal therapy with megestrol in inoperable hepatocellular carcinoma characterized by variant oestrogen receptors

    Get PDF
    Variant liver oestrogen receptor transcripts in hepatocellular carcinoma are associated with aggressive clinical course and unresponsiveness to tamoxifen. To evaluate the impact on survival and on tumour growth of megestrol (progestin drug acting at post-receptorial level) we enrolled 45 patients with HCC characterized by variant liver oestrogen receptors in a prospective, randomized study with megestrol vs. placebo. Presence of variant oestrogen receptors was determined by RT/PCR. 24 patients were randomized to no treatment and 21 to therapy with megestrol 160 mg day−1. Results were analysed by Kaplan-Meier and Cox methods. Survival of hepatocellular carcinoma characterized by variant oestrogen receptors was extremely poor (median survival 7 months); megestrol significantly improved survival (18 months) (P = 0.0090). Tumour growth at one year was significantly slowed down in megestrol-treated patients (P = 0.0212). Bilirubin levels, presence of portal thrombosis, HBV aetiology and treatment were identified at univariate analysis as factors significantly associated with survival; at multivariate analysis, only megestrol therapy (P = 0.0003), presence of HBV infection (P = 0.0009) and presence of portal vein thrombosis (P = 0.0051) were factors independently related with survival. (1) Megestrol slows down the aggressive tumour growth of patients with hepatocellular carcinoma characterized by variant estrogen receptors and (2) is also able to favourably influence the course of disease, more than doubling median survival. © 2001 Cancer Research Campaignhttp://www.bjcancer.co

    Evaluation of microbial contamination of air in two haematology departments equipped with ventilation systems with different filtration devices

    Get PDF
    Background. Nosocomial infections (NI) are above all due to health-care workers practices, but also the contamination of the environment could lead to their rise in health-care facilities. Introduction. In the last years, the incidence of NI has increased due to a substantial rise in the number of immuno-compromised patients. These patients are often gathered in hospital areas declared at ?high risk? of infection such as Hematology and Bone Marrow Transplant ward. In this study, we evaluated microbial contamination of the air in two divisions with high risk patients, focusing on the validity of the air system with correla- tion to the presence or not of the HEPA absolute filters. Methods. An environmental surveillance study has been carried out in two Divisions of Haematology, in two different Hospitals. Investigations have been performed by sampling air and by analyzing bacterial and fungal growth on microbiology plates after an incubation period. Results. Unit A, without HEPA filters in the ventilation systems, showed a gradual increase in the bacterial load 20 and 60 days after cleaning of the ventilation system. Mycetes and Aspergilli were not present in basal conditions, at 20 or 60 days after decontamination. Unit B, equipped with HEPA filters placed at the inlet vents, showed extremely low values of the bacterial load either in basal conditions or upon inspection 60 days after cleaning. No mycetes were present. Discussion. From the results obtained, it was evident that fol- lowing the cleaning operation, the quality of the air is excellent in both types of equipment, since no mycetes were present and the bacterial load was inf. 20 CFU/mc in all the sites tested. However, although in subsequent controls mycetes were absent in both types of equipment, a great difference in the suspended bacterial load was found: Unit B was close to sterility whereas in Unit A a progressive increase was observed

    Sparse Exploratory Factor Analysis

    Get PDF
    Sparse principal component analysis is a very active research area in the last decade. It produces component loadings with many zero entries which facilitates their interpretation and helps avoid redundant variables. The classic factor analysis is another popular dimension reduction technique which shares similar interpretation problems and could greatly benefit from sparse solutions. Unfortunately, there are very few works considering sparse versions of the classic factor analysis. Our goal is to contribute further in this direction. We revisit the most popular procedures for exploratory factor analysis, maximum likelihood and least squares. Sparse factor loadings are obtained for them by, first, adopting a special reparameterization and, second, by introducing additional [Formula: see text]-norm penalties into the standard factor analysis problems. As a result, we propose sparse versions of the major factor analysis procedures. We illustrate the developed algorithms on well-known psychometric problems. Our sparse solutions are critically compared to ones obtained by other existing methods

    Piecewise smooth systems near a co-dimension 2 discontinuity manifold: can one say what should happen?

    Full text link
    We consider a piecewise smooth system in the neighborhood of a co-dimension 2 discontinuity manifold Σ\Sigma. Within the class of Filippov solutions, if Σ\Sigma is attractive, one should expect solution trajectories to slide on Σ\Sigma. It is well known, however, that the classical Filippov convexification methodology is ambiguous on Σ\Sigma. The situation is further complicated by the possibility that, regardless of how sliding on Σ\Sigma is taking place, during sliding motion a trajectory encounters so-called generic first order exit points, where Σ\Sigma ceases to be attractive. In this work, we attempt to understand what behavior one should expect of a solution trajectory near Σ\Sigma when Σ\Sigma is attractive, what to expect when Σ\Sigma ceases to be attractive (at least, at generic exit points), and finally we also contrast and compare the behavior of some regularizations proposed in the literature. Through analysis and experiments we will confirm some known facts, and provide some important insight: (i) when Σ\Sigma is attractive, a solution trajectory indeed does remain near Σ\Sigma, viz. sliding on Σ\Sigma is an appropriate idealization (of course, in general, one cannot predict which sliding vector field should be selected); (ii) when Σ\Sigma loses attractivity (at first order exit conditions), a typical solution trajectory leaves a neighborhood of Σ\Sigma; (iii) there is no obvious way to regularize the system so that the regularized trajectory will remain near Σ\Sigma as long as Σ\Sigma is attractive, and so that it will be leaving (a neighborhood of) Σ\Sigma when Σ\Sigma looses attractivity. We reach the above conclusions by considering exclusively the given piecewise smooth system, without superimposing any assumption on what kind of dynamics near Σ\Sigma (or sliding motion on Σ\Sigma) should have been taking place.Comment: 19 figure

    Optimal stent design for high bleeding risk patients: Evidence from a network meta-analysis

    Get PDF
    Objective. To determine the best stent design for high bleeding risk (HBR) patients. Background. Polymer-free (PF) drug eluting stent (DES) devices have a proven benefit over bare-metal stent (BMS) devices in previous trials. It is unknown, however, whether polymer-based (PB)-DES devices are as safe as PF-DES devices. Methods. A network meta-analysis including all randomized controlled trials (RCTs) that compared different stent technology in HBR patients with a 1-month course of dual-antiplatelet therapy (DAPT) was performed. The main efficacy outcome was major adverse cardiac event (MACE) rate, defined as the composite of all-cause mortality, myocardial infarction (MI), and target-lesion revascularization (TLR). Secondary efficacy events included all-cause and cardiac mortality, MI, stroke, TLR, and target-vessel revascularization (TVR). Safety outcomes included all bleeding, major bleeding, and stent thrombosis (ST). Results. A total of 4 RCTs with 6456 patients were included. PF-DES and PB-DES yielded a reduced rate of MACE, MI, TLR, and TVR events compared with BMS (all P<.05). ST events were reduced in PB-DES compared with BMS (P=.01). No differences were found in all-cause death, cardiac death, or stroke events in PF-DES and PB-DES compared with BMS. Furthermore, no differences were found between PF-DES and PB-DES regarding any of the outcomes. Conclusion. DES devices were associated with lower MACE and TVR rates compared with BMS, whereas there were no statistical differences in other efficacy endpoints. Also, PB-DES were associated with fewer ST events compared with BMS. There were no statistical differences between PB-DES and PF-DES with regard to any of the endpoints. t 2021 HMP Comm Personal Use Onl

    Global sexual fertility in the opportunistic pathogen aspergillus fumigatus and identification of new supermater strains

    Get PDF
    © 2020 by the authors. Licensee MDPI, Basel, Switzerland. A sexual cycle in Aspergillus fumigatus was first described in 2009 with isolates from Dublin, Ireland. However, the extent to which worldwide isolates can undergo sexual reproduction has remained unclear. In this study a global collection of 131 isolates was established with a near 1:1 ratio of mating types. All isolates were crossed to MAT1-1 or MAT1-2 Irish strains, and a subset of isolates from different continents were crossed together. Ninety seven percent of isolates were found to produce cleistothecia with at least one mating partner, showing that sexual fertility is not limited to the Irish population but is a characteristic of global A. fumigatus. However, large variation was seen in numbers of cleistothecia produced per cross, suggesting differences in the possibility for genetic exchange between strains in nature. The majority of crosses produced ascospores with >50% germination rates, but with wide variation evident. A high temperature heat shock was required to induce ascospore germination. Finally, a new set of highly fertile MAT1-1 and MAT1-2 supermater strains were identified and pyrimidine auxotrophs generated for community use. Results provide insights into the potential for the A. fumigatus sexual cycle to generate genetic variation and allow gene flow of medically important traits
    • …
    corecore