381 research outputs found

    Extracting Entities of Interest from Comparative Product Reviews

    Full text link
    This paper presents a deep learning based approach to extract product comparison information out of user reviews on various e-commerce websites. Any comparative product review has three major entities of information: the names of the products being compared, the user opinion (predicate) and the feature or aspect under comparison. All these informing entities are dependent on each other and bound by the rules of the language, in the review. We observe that their inter-dependencies can be captured well using LSTMs. We evaluate our system on existing manually labeled datasets and observe out-performance over the existing Semantic Role Labeling (SRL) framework popular for this task.Comment: Source Code: https://github.com/jatinarora2702/Review-Information-Extractio

    Exploration and visualization of gene expression with neuroanatomy in the adult mouse brain

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Spatially mapped large scale gene expression databases enable quantitative comparison of data measurements across genes, anatomy, and phenotype. In most ongoing efforts to study gene expression in the mammalian brain, significant resources are applied to the mapping and visualization of data. This paper describes the implementation and utility of Brain Explorer, a 3D visualization tool for studying <it>in situ </it>hybridization-based (ISH) expression patterns in the Allen Brain Atlas, a genome-wide survey of 21,000 expression patterns in the C57BL6J adult mouse brain.</p> <p>Results</p> <p>Brain Explorer enables users to visualize gene expression data from the C57Bl/6J mouse brain in 3D at a resolution of 100 μm<sup>3</sup>, allowing co-display of several experiments as well as 179 reference neuro-anatomical structures. Brain Explorer also allows viewing of the original ISH images referenced from any point in a 3D data set. Anatomic and spatial homology searches can be performed from the application to find data sets with expression in specific structures and with similar expression patterns. This latter feature allows for anatomy independent queries and genome wide expression correlation studies.</p> <p>Conclusion</p> <p>These tools offer convenient access to detailed expression information in the adult mouse brain and the ability to perform data mining and visualization of gene expression and neuroanatomy in an integrated manner.</p

    TRScore: A Novel GPT-based Readability Scorer for ASR Segmentation and Punctuation model evaluation and selection

    Full text link
    Punctuation and Segmentation are key to readability in Automatic Speech Recognition (ASR), often evaluated using F1 scores that require high-quality human transcripts and do not reflect readability well. Human evaluation is expensive, time-consuming, and suffers from large inter-observer variability, especially in conversational speech devoid of strict grammatical structures. Large pre-trained models capture a notion of grammatical structure. We present TRScore, a novel readability measure using the GPT model to evaluate different segmentation and punctuation systems. We validate our approach with human experts. Additionally, our approach enables quantitative assessment of text post-processing techniques such as capitalization, inverse text normalization (ITN), and disfluency on overall readability, which traditional word error rate (WER) and slot error rate (SER) metrics fail to capture. TRScore is strongly correlated to traditional F1 and human readability scores, with Pearson's correlation coefficients of 0.67 and 0.98, respectively. It also eliminates the need for human transcriptions for model selection

    Smart Speech Segmentation using Acousto-Linguistic Features with look-ahead

    Full text link
    Segmentation for continuous Automatic Speech Recognition (ASR) has traditionally used silence timeouts or voice activity detectors (VADs), which are both limited to acoustic features. This segmentation is often overly aggressive, given that people naturally pause to think as they speak. Consequently, segmentation happens mid-sentence, hindering both punctuation and downstream tasks like machine translation for which high-quality segmentation is critical. Model-based segmentation methods that leverage acoustic features are powerful, but without an understanding of the language itself, these approaches are limited. We present a hybrid approach that leverages both acoustic and language information to improve segmentation. Furthermore, we show that including one word as a look-ahead boosts segmentation quality. On average, our models improve segmentation-F0.5 score by 9.8% over baseline. We show that this approach works for multiple languages. For the downstream task of machine translation, it improves the translation BLEU score by an average of 1.05 points

    Single cell fertilizer (SCF): Evidence to prove that bio-molecules are potent nutrient for plant growth

    Get PDF
    Fertilizers of various kinds are used for the cultivation of crop plants for hyper production of plant based food materials. The study used bio-molecules made in a bacterial cell. The experimental results showed tremendous effect on plant growth. These cellular molecules were made by treating the bacterial cells with lysozyme and protenase K. The wet/weight was increased in multiple folds compared to that of control sets. The fold of increase was 4.79 for rice, 2.77 for wheat, 1.89 for gram and 1.89 for pea when bacterial cellular molecules were used as fertilizer

    An anatomic gene expression atlas of the adult mouse brain

    Get PDF
    Studying gene expression provides a powerful means of understanding structure-function relationships in the nervous system. The availability of genome-scale in situ hybridization datasets enables new possibilities for understanding brain organization based on gene expression patterns. The Anatomic Gene Expression Atlas (AGEA) is a new relational atlas revealing the genetic architecture of the adult C57Bl/6J mouse brain based on spatial correlations across expression data for thousands of genes in the Allen Brain Atlas (ABA). The AGEA includes three discovery tools for examining neuroanatomical relationships and boundaries: (1) three-dimensional expression-based correlation maps, (2) a hierarchical transcriptome-based parcellation of the brain and (3) a facility to retrieve from the ABA specific genes showing enriched expression in local correlated domains. The utility of this atlas is illustrated by analysis of genetic organization in the thalamus, striatum and cerebral cortex. The AGEA is a publicly accessible online computational tool integrated with the ABA (http://mouse.brain-map.org/agea)

    Mechanistic Insight into the Reactivation of BCAII Enzyme from Denatured and Molten Globule States by Eukaryotic Ribosomes and Domain V rRNAs

    Get PDF
    In all life forms, decoding of messenger-RNA into polypeptide chain is accomplished by the ribosome. Several protein chaperones are known to bind at the exit of ribosomal tunnel to ensure proper folding of the nascent chain by inhibiting their premature folding in the densely crowded environment of the cell. However, accumulating evidence suggests that ribosome may play a chaperone role in protein folding events in vitro. Ribosome-mediated folding of denatured proteins by prokaryotic ribosomes has been studied extensively. The RNA-assisted chaperone activity of the prokaryotic ribosome has been attributed to the domain V, a span of 23S rRNA at the intersubunit side of the large subunit encompassing the Peptidyl Transferase Centre. Evidently, this functional property of ribosome is unrelated to the nascent chain protein folding at the exit of the ribosomal tunnel. Here, we seek to scrutinize whether this unique function is conserved in a primitive kinetoplastid group of eukaryotic species Leishmania donovani where the ribosome structure possesses distinct additional features and appears markedly different compared to other higher eukaryotic ribosomes. Bovine Carbonic Anhydrase II (BCAII) enzyme was considered as the model protein. Our results manifest that domain V of the large subunit rRNA of Leishmania ribosomes preserves chaperone activity suggesting that ribosome-mediated protein folding is, indeed, a conserved phenomenon. Further, we aimed to investigate the mechanism underpinning the ribosome-assisted protein reactivation process. Interestingly, the surface plasmon resonance binding analyses exhibit that rRNA guides productive folding by directly interacting with molten globule-like states of the protein. In contrast, native protein shows no notable affinity to the rRNA. Thus, our study not only confirms conserved, RNA-mediated chaperoning role of ribosome but also provides crucial insight into the mechanism of the process

    Predicting Group Success in Meetup

    No full text
    Success of groups in Meetup is of utmost importance for members who organize them. However, measures of group success in Meetup is quite vague till now. In this paper, we take a step to quantify the success of Meetup groups. Driven by a comprehensive study of our Meetup dataset, we handpick a set of key properties which can potentially regulate a group’s success. Finally, we develop a machine learning model leveraging on these features which can predict success of Meetup groups early with high accuracy

    On the Role of Micro-categories to Characterize Event Popularity in Meetup

    No full text
    Event-based social networking platforms such as Meetup have recently witnessed a huge growth. However, with the rise in the volume of groups and events, making individual events attractive has become increasingly challenging for its organizers. As a result, we find that events hosted by groups of same category at similar venues and similar times, also widely differ in their popularity. Data study reveals that the topics specified in textual descriptions of events may be key to their popularity. In this paper, we introduce a novel concept of topical micro-categories in the context of EBSNs for accurately characterizing events, such that events belonging to the same micro-category exhibit similar popularity profile. We develop a principled method to detect such micro-categories from the textual descriptions of individual events. Our experiments reveal the significance of the detected micro-categories in determining the popularity of associated Meetup events and groups. We also investigate the effectiveness of the micro-categories in a real-world application scenario by developing a recommendation model; this model recommends relevant micro-categories to a group for hosting its future events with enhanced popularity. Notably, our model achieves an average NDCG score of around 0.75 showing a straight 5% improvement over the best performing competing method

    On the Splitting Dynamics of Meetup Social Groups

    No full text
    Groups in online social networks witness continuous evolution by loss of existing members and gain of new members. In this paper, we present a study of group split in Meetup, where a major fraction of members leave the existing group together and join a newly formed group. We identify pivotal group members, called splitters, playing key roles in group split by influencing the existing members to leave the group. We provide an in-depth analysis of the empirical data to reveal key motivating factors leading to a group split and its subsequent impact. Finally, we develop a prediction model for early detection of splitters, as well as the group members likely to be influenced by the splitter to leave the group
    • …
    corecore