54 research outputs found

    Measuring Similarity in Large-Scale Folksonomies

    Get PDF
    Social (or folksonomic) tagging has become a very popular way to describe content within Web 2.0 websites. Unlike\ud taxonomies, which overimpose a hierarchical categorisation of content, folksonomies enable end-users to freely create and choose the categories (in this case, tags) that best\ud describe some content. However, as tags are informally de-\ud fined, continually changing, and ungoverned, social tagging\ud has often been criticised for lowering, rather than increasing, the efficiency of searching, due to the number of synonyms, homonyms, polysemy, as well as the heterogeneity of\ud users and the noise they introduce. To address this issue, a\ud variety of approaches have been proposed that recommend\ud users what tags to use, both when labelling and when looking for resources. As we illustrate in this paper, real world\ud folksonomies are characterized by power law distributions\ud of tags, over which commonly used similarity metrics, including the Jaccard coefficient and the cosine similarity, fail\ud to compute. We thus propose a novel metric, specifically\ud developed to capture similarity in large-scale folksonomies,\ud that is based on a mutual reinforcement principle: that is,\ud two tags are deemed similar if they have been associated to\ud similar resources, and vice-versa two resources are deemed\ud similar if they have been labelled by similar tags. We offer an efficient realisation of this similarity metric, and assess its quality experimentally, by comparing it against cosine similarity, on three large-scale datasets, namely Bibsonomy, MovieLens and CiteULike

    Effective Retrieval of Resources in Folksonomies Using a New Tag Similarity Measure

    Full text link
    Social (or folksonomic) tagging has become a very popular way to describe content within Web 2.0 websites. However, as tags are informally defined, continually changing, and ungoverned, it has often been criticised for lowering, rather than increasing, the efficiency of searching. To address this issue, a variety of approaches have been proposed that recommend users what tags to use, both when labeling and when looking for resources. These techniques work well in dense folksonomies, but they fail to do so when tag usage exhibits a power law distribution, as it often happens in real-life folksonomies. To tackle this issue, we propose an approach that induces the creation of a dense folksonomy, in a fully automatic and transparent way: when users label resources, an innovative tag similarity metric is deployed, so to enrich the chosen tag set with related tags already present in the folksonomy. The proposed metric, which represents the core of our approach, is based on the mutual reinforcement principle. Our experimental evaluation proves that the accuracy and coverage of searches guaranteed by our metric are higher than those achieved by applying classical metrics.Comment: 6 pages, 2 figures, CIKM 2011: 20th ACM Conference on Information and Knowledge Managemen

    PCDH19 mutations in female patients from Southern Italy

    Get PDF
    AbstractPurposeMutations in PCDH19, encoding protocadherin 19 on chromosome X, cause familial epilepsy and mental retardation limited to females or Dravet-like syndrome. We wished to explore the causative role of PCDH19 gene (Xq22) in female patients with epilepsy, from Southern Italy.MethodsDirect sequencing of PCDH19 gene was conducted in 31 unrelated female patients with early onset (<1 year of age) epilepsy and a wide spectrum of phenotypes including febrile seizures, focal and generalized forms, with either sporadic or familial distribution.ResultsWe identified two de novo heterozygous novel mutations of PCDH19 gene (p.Arg550Pro, Ile508ProfsX59) in two of 31 unrelated female patients. We also identified a novel silent mutation p.Ser856=.ConclusionsThe present findings confirm that PCDH19 is a major causative gene for infantile onset familial or sporadic epilepsy in female patients with or without mental retardation

    A functional gene expression analysis in epithelial sinonasal cancer: Biology and clinical relevance behind three histological subtypes

    Get PDF
    Epithelial sinonasal cancers (SNCs) are rare diseases with overlapping morphological features and a dismal prognosis. We aimed to investigate the expression differences among the histological subtypes for discerning their molecular characteristics. We selected 47 SNCs: (i) 21 nonkeratinizing squamous cell carcinomas (NKSCCs), (ii) 13 sinonasal neuroendocrine cancers (SNECs), and (iii) 13 sinonasal undifferentiated cancers (SNUCs). Gene expression profiling was performed by DASL (cDNA-mediated annealing, selection, extension, and ligation) microarray analysis with internal validation by quantitative RT-PCR (RT-qPCR). Relevant molecular patterns were uncovered by sparse partial-least squares discriminant analysis (sPLS-DA), microenvironment cell type (xCell), CIBERSORT, and gene set enrichment (GSEA) analyses. The first two sPLS-DA components stratified samples by histological subtypes. xCell highlighted increased expression of immune components (CD8 + effector memory cells, in SNUC) and \u201cother cells\u201d: keratinocytes and neurons in NKSCC and SNEC, respectively. Pathway enrichment was observed in NKSCC (six gene sets, proliferation related), SNEC (one gene set, pancreatic \u3b2-cells), and SNUC (twenty gene sets, some of them immune-system related). Major neuroendocrine involvement was observed in all the SNEC samples. Our high-throughput analysis revealed a good diagnostic ability to differentiate NKSCC, SNEC, and SNUC, but indicated that the neuroendocrine pathway, typical and pathognomonic of SNEC is also present at lower expression levels in the other two histological subtypes. The different and specific profiles may be exploited for elucidating their biology and could help to identify prognostic and therapeutic opportunities

    HER2 status in recurrent/metastatic androgen receptor overexpressing salivary gland carcinoma patients

    Get PDF
    BackgroundOverexpression of human epidermal growth factor receptor type 2 (HER2) occurs in almost 25-30% of androgen receptor (AR)-positive salivary gland carcinomas (SGCs), notably salivary duct carcinoma (SDC) and adenocarcinoma not otherwise specified (NOS). In the last years, several studies have reported the clinical benefit of HER2 directed therapies in this setting. This work aims at describing the natural history of AR-positive recurrent/metastatic (R/M) SGC patients, based on HER2 amplification status.MethodsConsecutive R/M AR-positive SGC patients accessing our Institution from 2010 to 2021 were analyzed. Descriptive statistics and survival analyses were performed to present the clinical characteristics of the selected patients and the outcomes, based on HER2 status. A specific focus was dedicated to patients developing metastases to the central nervous system (CNS).ResultsSeventy-four R/M AR-positive SGC patients (72 men) were analyzed. Median follow-up was 36.18 months (95% CI 30.19-42.66). HER2 status was available in 62 cases (84%) and in 42% the protein was overexpressed (HER2+). Compared with patients with HER2- SGCs, in patients with HER2+ disease, HR for disease recurrence was 2.97 (95% CI 1.44-6.1, p=0.003), and HR for death from R/M disease was 3.22 (95% CI 1.39-7.49, p=0.007). Moreover, the HER2+ group showed a non-significant trend towards a higher prevalence of CNS metastases (40% vs. 24%, p=0.263). Patients developing CNS metastases had shorter survival than those who did not; at bivariate analysis (covariates: CNS disease and HER2 status), HER2 status demonstrated its independent prognostic significance.DiscussionIn our patient population, HER2 amplification was a negative prognostic factor, and it was associated with a non-statistically significant higher risk of developing CNS metastasis. Further studies are needed to explore the potential clinical benefit of tackling the two biological pathways (AR and HER2) in patients affected by this rare and aggressive malignancy

    Circulating pre-treatment Epstein-Barr virus DNA as prognostic factor in locally-advanced nasopharyngeal cancer in a nonendemic area

    Get PDF
    The prognostic value of pre-treatment Epstein-Barr Virus (EBV) DNA viral load for non-endemic, locally-advanced, EBV-related nasopharyngeal cancer (NPC) patients is yet to be defined. All patients with EBV encoded RNA (EBER)-positive NPC treated at our Institution from 2005 to 2014 with chemotherapy (CT) concurrent with radiation (RT) +/- induction chemotherapy (ICT) were retrospectively reviewed. Pre-treatment baseline plasma EBV DNA (b-EBV DNA) viral load was detected and quantified by PCR. Median b-EBV DNA value was correlated to potential influencing factors by univariate analysis. Significant variables were then extrapolated and included in a multivariate linear regression model. The same variables, including b-EBV DNA, were correlated with Disease Free Survival (DFS) and Overall Survival (OS) by univariate and multivariate analysis. A total of 130 locally-advanced EBER positive NPC patients were evaluated. Overall, b-EBV DNA was detected in 103 patients (79.2%). Median viral load was 554 copies/mL (range 50-151075), and was positively correlated with T stage (p= 0.002), N3a-b vs N0-1-2 stage (p= 0.048), type of treatment (ICT followed by CTRT, p= 0.006) and locoregional and/or distant disease recurrence (p= 0.034). In the overall population, DFS and OS were significantly longer in patients with pre-treatment negative EBV DNA than in positive subjects at the multivariate analysis. Negative b-EBV DNA can be considered as prognostic biomarker of longer DFS and OS in NPC in non-endemic areas. This finding needs confirmation in larger prospective series, with standardized and inter-laboratory harmonized method of plasma EBV DNA quantificatio

    Exploring the middle way: The Istituto per la Ricostruzione Industriale (IRI) in between corporate capitalism and planned economy (1948–1973)

    No full text
    This paper narrates the efforts made in Italy after the end of the Second World War to develop an economic and social model in between corporate capitalism and planned economy, on the one hand, and shareholder versus stakeholder capitalism on the other. The result was the institutional infrastructure that supported the Istituto per la Ricostruzione industriale (IRI), a state-owned public holding in charge of managing the funds of the so-called Marshall Plan. The history of IRI illustrates the importance of a pragmatic approach to dealing with institutional constraints and opportunities when faced with the need to reconstruct destroyed economies in a context of very fragmented societies such as those of post-war Italy. The result was the resistance to an acritical adoption of the corporate American model and the definition of a more balanced form of capitalism. In an era of new recovery plans, there is a lot we can learn from IRI’s history

    There’s no such thing as the perfect map: quantifying bias in spatial crowd-sourcing datasets

    Get PDF
    Crowd-sourcing has become a popular form of computer mediated collaborative work and OpenStreetMap represents one of the most successful crowd-sourcing systems, where the goal of building and maintaining an accurate global map of the world is being accomplished by means of contributions made by over 1.2M citizens. However, within this apparently large crowd, a tiny group of highly active users is responsible for the mapping of almost all the content. One may thus wonder to what extent the information being mapped is biased towards the interests and agenda of this group of users. In this paper, we present a method to quantitatively measure content bias in crowd-sourced geographic information. We then apply the method to quantify content bias across a three-year period of OpenStreetMap mapping in 40 countries. We find almost no content bias in terms of what is being mapped, but significant geographic bias; furthermore, we find that bias in terms of meticulousness varies with culture
    corecore