630 research outputs found

    Text stream to temporal network - A dynamic heartbeat graph to detect emerging events on twitter

    Full text link
    © 2018, Springer International Publishing AG, part of Springer Nature. Huge mounds of data are generated every second on the Internet. People around the globe publish and share information related to real-world events they experience every day. This provides a valuable opportunity to analyze the content of this information to detect real-world happenings, however, it is quite challenging task. In this work, we propose a novel graph-based approach named the Dynamic Heartbeat Graph (DHG) that not only detects the events at an early stage, but also suppresses them in the upcoming adjacent data stream in order to highlight new emerging events. This characteristic makes the proposed method interesting and efficient in finding emerging events and related topics. The experiment results on real-world datasets (i.e. FA Cup Final and Super Tuesday 2012) show a considerable improvement in most cases, while time complexity remains very attractive

    Enhanced Heartbeat Graph for emerging event detection on Twitter using time series networks

    Full text link
    © 2019 Elsevier Ltd With increasing popularity of social media, Twitter has become one of the leading platforms to report events in real-time. Detecting events from Twitter stream requires complex techniques. Event-related trending topics consist of a group of words which successfully detect and identify events. Event detection techniques must be scalable and robust, so that they can deal with the huge volume and noise associated with social media. Existing event detection methods mostly rely on burstiness, mainly the frequency of words and their co-occurrences. However, burstiness sometimes dominates other relevant details in the data which could be equally significant. Besides, the topological and temporal relationships in the data are often ignored. In this work, we propose a novel graph-based approach, called the Enhanced Heartbeat Graph (EHG), which detects events efficiently. EHG suppresses dominating topics in the subsequent data stream, after their first detection. Experimental results on three real-world datasets (i.e., Football Association Challenge Cup Final, Super Tuesday, and the US Election 2012) show superior performance of the proposed approach in comparison to the state-of-the-art techniques

    What’s Happening Around the World? A Survey and Framework on Event Detection Techniques on Twitter

    Full text link
    © 2019, Springer Nature B.V. In the last few years, Twitter has become a popular platform for sharing opinions, experiences, news, and views in real-time. Twitter presents an interesting opportunity for detecting events happening around the world. The content (tweets) published on Twitter are short and pose diverse challenges for detecting and interpreting event-related information. This article provides insights into ongoing research and helps in understanding recent research trends and techniques used for event detection using Twitter data. We classify techniques and methodologies according to event types, orientation of content, event detection tasks, their evaluation, and common practices. We highlight the limitations of existing techniques and accordingly propose solutions to address the shortcomings. We propose a framework called EDoT based on the research trends, common practices, and techniques used for detecting events on Twitter. EDoT can serve as a guideline for developing event detection methods, especially for researchers who are new in this area. We also describe and compare data collection techniques, the effectiveness and shortcomings of various Twitter and non-Twitter-based features, and discuss various evaluation measures and benchmarking methodologies. Finally, we discuss the trends, limitations, and future directions for detecting events on Twitter

    Rapid Synchronization for Ultra-Wideband Communication Systems

    Full text link
    Very high data rate packet systems, such as those based on ultra-wideband (UWB) signaling, face an increasingly important challenge – UWB radio uses sub-nanosecond pulses to transmit information, resulting is high resolution in time implying that the acquisition algorithm must employ sub-pulse duration steps, thereby leading to a large search space, which consequently leads to large mean acquisition time (MAT). The role of synchronization is essentially to determine the relative delay of the received signal with respect to a template signal in the receiver. This paper addresses coarse synchronization in UWB multipath environments taking into account the specific properties of UWB signals. Since we are interested in low signalto-noise ratio environments, the serial search technique is considered and the performance measure is the MAT. This shows how the design of the correlation parameters affects the time to achieve synchronization

    PropertyDAG: Multi-objective Bayesian optimization of partially ordered, mixed-variable properties for biological sequence design

    Full text link
    Bayesian optimization offers a sample-efficient framework for navigating the exploration-exploitation trade-off in the vast design space of biological sequences. Whereas it is possible to optimize the various properties of interest jointly using a multi-objective acquisition function, such as the expected hypervolume improvement (EHVI), this approach does not account for objectives with a hierarchical dependency structure. We consider a common use case where some regions of the Pareto frontier are prioritized over others according to a specified partial ordering\textit{partial ordering} in the objectives. For instance, when designing antibodies, we would like to maximize the binding affinity to a target antigen only if it can be expressed in live cell culture -- modeling the experimental dependency in which affinity can only be measured for antibodies that can be expressed and thus produced in viable quantities. In general, we may want to confer a partial ordering to the properties such that each property is optimized conditioned on its parent properties satisfying some feasibility condition. To this end, we present PropertyDAG, a framework that operates on top of the traditional multi-objective BO to impose this desired ordering on the objectives, e.g. expression →\rightarrow affinity. We demonstrate its performance over multiple simulated active learning iterations on a penicillin production task, toy numerical problem, and a real-world antibody design task.Comment: 9 pages, 7 figures. Submitted to NeurIPS 2022 AI4Science Worksho

    ULTRA-WIDEBAND (UWB) FOR MULTIMEDIA APPLICATIONS

    Full text link
    UWB communication refers to impulse radio technology, in which wireless data is transferred using time domain modulation of data and extremely narrow radio impulses (i.e. nanosecond duration) that occupy typically several GHz of bandwidth. In this paper, we simulate an indoor environment whereby the channel characteristics model of UWB is observed - Saleh- Valenzuela-4 channel model is adopted-, and tested for the feasibility of UWB system in transmitting real time multimedia as incorporating a wireless link, which UWB is the first candidate to transfer these types of data due to its features, i.e. very high data rate (up to 500Mbps), multipath immunity, LPI. Certain aspects were emphasized such as multiple user and channel effects. Designing a wireless link for a streaming video and audio with a wire-like quality was the main objective of this paper

    μ-CS: An extension of the TM4 platform to manage Affymetrix binary data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>A main goal in understanding cell mechanisms is to explain the relationship among genes and related molecular processes through the combined use of technological platforms and bioinformatics analysis. High throughput platforms, such as microarrays, enable the investigation of the whole genome in a single experiment. There exist different kind of microarray platforms, that produce different types of binary data (images and raw data). Moreover, also considering a single vendor, different chips are available. The analysis of microarray data requires an initial preprocessing phase (i.e. normalization and summarization) of raw data that makes them suitable for use on existing platforms, such as the TIGR M4 Suite. Nevertheless, the annotations of data with additional information such as gene function, is needed to perform more powerful analysis. Raw data preprocessing and annotation is often performed in a manual and error prone way. Moreover, many available preprocessing tools do not support annotation. Thus novel, platform independent, and possibly open source tools enabling the semi-automatic preprocessing and annotation of microarray data are needed.</p> <p>Results</p> <p>The paper presents <it>μ</it>-CS (Microarray Cel file Summarizer), a cross-platform tool for the automatic normalization, summarization and annotation of Affymetrix binary data. <it>μ</it>-CS is based on a client-server architecture. The <it>μ</it>-CS client is provided both as a plug-in of the TIGR M4 platform and as a Java standalone tool and enables users to read, preprocess and analyse binary microarray data, avoiding the manual invocation of external tools (e.g. the Affymetrix Power Tools), the manual loading of preprocessing libraries, and the management of intermediate files. The <it>μ</it>-CS server automatically updates the references to the summarization and annotation libraries that are provided to the <it>μ</it>-CS client before the preprocessing. The <it>μ</it>-CS server is based on the web services technology and can be easily extended to support more microarray vendors (e.g. Illumina).</p> <p>Conclusions</p> <p>Thus <it>μ</it>-CS users can directly manage binary data without worrying about locating and invoking the proper preprocessing tools and chip-specific libraries. Moreover, users of the <it>μ</it>-CS plugin for TM4 can manage Affymetrix binary files without using external tools, such as APT (Affymetrix Power Tools) and related libraries. Consequently, <it>μ</it>-CS offers four main advantages: (i) it avoids to waste time for searching the correct libraries, (ii) it reduces possible errors in the preprocessing and further analysis phases, e.g. due to the incorrect choice of parameters or the use of old libraries, (iii) it implements the annotation of preprocessed data, and finally, (iv) it may enhance the quality of further analysis since it provides the most updated annotation libraries. The <it>μ</it>-CS client is freely available as a plugin of the TM4 platform as well as a standalone application at the project web site (<url>http://bioingegneria.unicz.it/M-CS</url>).</p

    Examining smoking-induced differential gene expression changes in buccal mucosa

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Gene expression changes resulting from conditions such as disease, environmental stimuli, and drug use, can be monitored in the blood. However, a less invasive method of sample collection is of interest because of the discomfort and specialized personnel necessary for blood sampling especially if multiple samples are being collected. Buccal mucosa cells are easily collected and may be an alternative sample material for biomarker testing. A limited number of studies, primarily in the smoker/oral cancer literature, address this tissue's efficacy as an RNA source for expression analysis. The current study was undertaken to determine if total RNA isolated from buccal mucosa could be used as an alternative tissue source to assay relative gene expression.</p> <p>Methods</p> <p>Total RNA was isolated from swabs, reverse transcribed and amplified. The amplified cDNA was used in RT-qPCR and microarray analyses to evaluate gene expression in buccal cells. Initially, RT-qPCR was used to assess relative transcript levels of four genes from whole blood and buccal cells collected from the same seven individuals, concurrently. Second, buccal cell RNA was used for microarray-based differential gene expression studies by comparing gene expression between a group of female smokers and nonsmokers.</p> <p>Results</p> <p>An amplification protocol allowed use of less buccal cell total RNA (50 ng) than had been reported previously with human microarrays. Total RNA isolated from buccal cells was degraded but was of sufficient quality to be used with RT-qPCR to detect expression of specific genes. We report here the finding of a small number of statistically significant differentially expressed genes between smokers and nonsmokers, using buccal cells as starting material. Gene Set Enrichment Analysis confirmed that these genes had a similar expression pattern to results from another study.</p> <p>Conclusions</p> <p>Our results suggest that despite a high degree of degradation, RNA from buccal cells from cheek mucosa could be used to detect differential gene expression between smokers and nonsmokers. However the RNA degradation, increase in sample variability and microarray failure rate show that buccal samples should be used with caution as source material in expression studies.</p

    Epidemiology of childhood Guillan-Barre syndrome in the north west of Iran

    Get PDF
    <p>Abstract</p> <p>Background and aims</p> <p>This study was carried out to investigate the incidence, annual time trend and some epidemiological and clinical features of Guillain-Barre syndrome in children in the north west of Iran.</p> <p>Materials and methods</p> <p>In this population-based cross sectional research, epidemiological and clinical features of 143 cases with Guillain-Barre syndrome between 2001 and 2006 were studied. The setting of the study was Tabriz Children Medical Centre, the major University-Hospital located in Tabriz city of the East Azarbaijan province covering whole region. Data collected included age, gender, chronological information, preceding events, functional grade of motor deficit.</p> <p>Results</p> <p>The mean age (standard deviation) of subjects was 5.4 (3.6) years. The male/female ratio was 1.3. The average annual incidence rate was 2.27 per 100 000 population of 15 years children (CI95%: 1.9–2.6). The majority of cases occurred in March, July and November and the highest proportion of the syndrome was observed in winter (29 percent, P > 0.10).</p> <p>Conclusion</p> <p>The results indicated that an unexpected high incidence of Guillain-Barre syndrome has occurred in 2003 in the region. We concluded that a monitoring and surveillance system for Guillain-Barre syndrome is essential to set up in this region.</p

    Psychiatric rating scales in Urdu: a systematic review

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Researchers setting out to conduct research employing questionnaires in non-English speaking populations need instruments that have been validated in the indigenous languages. In this study we have tried to review the literature on the status of cross-cultural and/or criterion validity of all the questionnaires measuring psychiatric symptoms available in Urdu language.</p> <p>Methods</p> <p>A search of Medline, Embase, PsycINFO and <url>http://www.pakmedinet.com</url> was conducted using the search terms; Urdu psychiatric rating scale, and Urdu and Psychiatry. References of retrieved articles were searched. Only studies describing either cross-cultural or criterion validation of a questionnaire in Urdu measuring psychiatric symptoms were included.</p> <p>Results</p> <p>Thirty two studies describing validation of 19 questionnaires were identified. Six of these questionnaires were developed indigenously in Urdu while thirteen had been translated from English. Of the six indigenous questionnaires five had had their criterion validity examined. Of the thirteen translated questionnaires only four had had both their cross-cultural and criterion validity assessed.</p> <p>Conclusion</p> <p>There is a paucity of validated questionnaires assessing psychiatric symptoms in Urdu. The BSI, SRQ and AKUADS are the questionnaires that have been most thoroughly evaluated in Urdu.</p
    • …
    corecore