
    Evaluating Similarity Metrics for Latent Twitter Topics

    Topic modelling approaches such as LDA, when applied to a tweet corpus, can often generate a topic model containing redundant topics. To evaluate the quality of a topic model in terms of redundancy, topic similarity metrics can be applied to estimate the similarity among topics in a topic model. There are various topic similarity metrics in the literature, e.g. the Jensen-Shannon (JS) divergence-based metric. In this paper, we evaluate the performance of four distance/divergence-based topic similarity metrics and examine how they align with human judgements, including a newly proposed similarity metric that is based on computing word semantic similarity using word embeddings (WE). To obtain human judgements, we conduct a user study through crowdsourcing. Among various insights, our study shows that in general the cosine similarity (CS) and WE-based metrics perform better and appear to be complementary. However, we also find that the human assessors cannot easily distinguish between the distance/divergence-based and the semantic similarity-based metrics when identifying similar latent Twitter topics.
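
As a rough illustration of two of the metrics named in this abstract, the sketch below computes the Jensen-Shannon divergence and the cosine similarity between topics represented as word-probability vectors over a shared vocabulary. The function names and the toy topic vectors are assumptions made for this example; they are not taken from the paper, and the paper's exact formulation of each metric may differ.

```python
import math


def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2) between two discrete distributions.

    p and q are equal-length sequences of probabilities that each sum to 1.
    The result lies in [0, 1]: 0 for identical topics, 1 for disjoint ones.
    """
    # Mixture distribution used by both KL terms.
    m = [(a + b) / 2 for a, b in zip(p, q)]

    def kl(x, y):
        # Terms with x_i == 0 contribute nothing to the sum.
        return sum(a * math.log2(a / b) for a, b in zip(x, y) if a > 0)

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def cosine_similarity(p, q):
    """Cosine of the angle between two non-zero topic vectors."""
    dot = sum(a * b for a, b in zip(p, q))
    norm = math.sqrt(sum(a * a for a in p)) * math.sqrt(sum(b * b for b in q))
    return dot / norm


# Hypothetical topics over a four-word vocabulary (illustrative values only).
topic_a = [0.5, 0.3, 0.2, 0.0]
topic_b = [0.4, 0.4, 0.2, 0.0]
topic_c = [0.0, 0.0, 0.1, 0.9]

if __name__ == "__main__":
    print("JS(a, b):", js_divergence(topic_a, topic_b))
    print("JS(a, c):", js_divergence(topic_a, topic_c))
    print("CS(a, b):", cosine_similarity(topic_a, topic_b))
    print("CS(a, c):", cosine_similarity(topic_a, topic_c))
```

Note the opposite orientation of the two measures: a low JS divergence and a high cosine similarity both indicate that two topics are likely redundant.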

    A customisable pipeline for continuously harvesting socially-minded Twitter users

    On social media platforms and Twitter in particular, specific classes of users such as influencers have been given satisfactory operational definitions in terms of network and content metrics. Others, for instance online activists, are no less important, but their characterisation still requires experimentation. We hypothesise that such interesting users can be found within temporally and spatially localised contexts, i.e., small but topical fragments of the network containing interactions about social events or campaigns with a significant footprint on Twitter. To explore this hypothesis, we have designed a continuous user profile discovery pipeline that produces an ever-growing dataset of user profiles by harvesting and analysing contexts from the Twitter stream. The profiles dataset includes key network and content-based user metrics, enabling experimentation with user-defined score functions that characterise specific classes of online users. The paper describes the design and implementation of the pipeline and its empirical evaluation on a case study consisting of healthcare-related campaigns in the UK, showing how it supports the operational definitions of online activism by comparing three experimental ranking functions. The code is publicly available. Comment: Procs. ICWE 2019, June 2019, Kore

    Phylogeographic analysis and genetic structure of an endemic Sino-Japanese disjunctive genus Diabelia (Caprifoliaceae)

    The Sino-Japanese Floristic Region (SJFR) is a key area for plant phylogeographical research, due to its very high species diversity and the disjunct distributions of a large number of species and genera. At present, the root cause and temporal origin of the discontinuous distribution of many plants in the Sino-Japanese flora are still unclear. Diabelia (Caprifoliaceae; Linnaeoideae) is a genus endemic to Asia, mostly in Japan, but two recent discoveries in China raised questions over the role of the East China Sea (ECS) in these species' disjunctions. Chloroplast DNA sequence data were generated from 402 population samples for two regions (rpl32-trnL and trnH-psbA), and 11 nuclear microsatellite loci were screened for 549 individuals. Haplotype and population-level structure analyses, combined analyses of ecological niche modeling, and reconstruction of ancestral state in phylogenies were also performed. During the Last Glacial Maximum (LGM) period after the Tertiary, Diabelia was potentially widely distributed in southeastern China, the continental shelf of the East China Sea and Japan (excluding Hokkaido). After the LGM, all populations in China disappeared except those in Zhejiang, which may represent a glacial refuge. Populations of Diabelia in Japan have not experienced significant bottleneck effects and have remained relatively stable. The observed discontinuous distribution of Diabelia species between China and Japan is interpreted as the result of relatively ancient divergence. The phylogenetic tree of chloroplast fragments shows the characteristics of multi-origin evolution (except for D. sanguinea). STRUCTURE analysis of nuclear Simple Sequence Repeats (nSSR) showed that Diabelia plants were divided into five gene pools: D. serrata, D. spathulata, D. sanguinea, D. ionostachya (D. spathulata var. spathulata-Korea), and populations of D. ionostachya var. ionostachya in Yamagata prefecture, northern Japan.
    Molecular evidence provides new insights into the biogeography of Diabelia, a potential glacial refuge, and population-level genetic structure within species. In the process of species differentiation, the ECS acts as a corridor for two-way migration of animals and plants between China and Japan during glacial maxima, providing the possibility of secondary contact for discontinuously distributed species between China and Japan, or as a filter (creating isolation) during glacial minima. The influence of the ECS on the speciation and biogeography of Diabelia in the Tertiary remains unresolved in this study. Understanding origins, evolutionary histories, and speciation will provide a framework for the conservation and cultivation of Diabelia.

    Synthesis and electromagnetic wave absorption property of amorphous carbon nanotube networks on a 3D graphene aerogel/BaFe12O19 nanocomposite

    Homogeneous amorphous carbon nanotube (ACNT) networks have been synthesized using a floating catalyst chemical vapor deposition method on a 3D graphene aerogel (GA)/BaFe12O19 (BF) nanocomposite, which was prepared by a self-propagating combustion process. The as-synthesized ACNT/GA/BF nanocomposite with 3D network structures could be directly used as a good absorber material for electromagnetic wave absorption. The experimental results indicated that the minimum reflection loss of the ACNT/GA/BF composite with a thickness of 2 mm was -18.35 dB at 10.64 GHz in the frequency range of 2-18 GHz. The frequency bandwidth of the reflection loss below -10 dB was 3.32 GHz, and that below -5 dB was 6.24 GHz. The 3D graphene aerogel structures, which are composed of dense interlinked tubes and an amorphous structure bearing quantities of dihedral angles, could consume the incident waves through multiple reflection and scattering inside the 3D web structures. The interlinked ACNTs have the virtues of both amorphous CNTs (multiple reflection inside the wall) and crystalline CNTs (high conductivity), consuming the electromagnetic wave as resistance heat. The ACNT/GA/BF composite has a good electromagnetic wave absorption performance. Institute of Textiles and Clothing; 2016-2017 > Academic research: refereed > Publication in refereed journal
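
To put the reflection loss figures quoted in this abstract in perspective, the sketch below converts a reflection loss in dB into the fraction of incident power that is reflected and the fraction that is not. It assumes the usual convention that reflection loss is a power ratio expressed in dB, and that the absorber is metal-backed so that all non-reflected power is absorbed; neither assumption is stated in the abstract, and the function names are illustrative.

```python
def reflected_fraction(reflection_loss_db):
    """Fraction of incident power reflected, for a reflection loss in dB.

    Assumes reflection loss is a power ratio in dB (negative values mean
    less reflection), so the linear ratio is 10 ** (RL / 10).
    """
    return 10 ** (reflection_loss_db / 10)


def absorbed_fraction(reflection_loss_db):
    """Fraction of incident power not reflected.

    Equals the absorbed fraction only under the assumption of a
    metal-backed absorber with no transmission.
    """
    return 1 - reflected_fraction(reflection_loss_db)


if __name__ == "__main__":
    # Thresholds and minimum value quoted in the abstract.
    for rl_db in (-5.0, -10.0, -18.35):
        print(f"RL = {rl_db} dB -> absorbed fraction {absorbed_fraction(rl_db):.4f}")
```

Under these assumptions, the -10 dB threshold corresponds to 90% of the incident power being absorbed, which is why the bandwidth below -10 dB is commonly reported as the effective absorption bandwidth.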

    In situ evidence for the structure of the magnetic null in a 3D reconnection event in the Earth's magnetotail

    Magnetic reconnection is one of the most important processes in astrophysical, space and laboratory plasmas. Identifying the structure around the point at which the magnetic field lines break and subsequently reform, known as the magnetic null point, is crucial to improving our understanding of reconnection. But owing to the inherently three-dimensional nature of this process, magnetic nulls are only detectable through measurements obtained simultaneously from at least four points in space. Using data collected by the four spacecraft of the Cluster constellation as they traversed a diffusion region in the Earth's magnetotail on 15 September 2001, we report here the first in situ evidence for the structure of an isolated magnetic null. The results indicate that it has a positive-spiral structure whose spatial extent is of the same order as the local ion inertial length scale, suggesting that the Hall effect could play an important role in 3D reconnection dynamics. Comment: 14 pages, 4 figures