437 research outputs found

Extracting Entities of Interest from Comparative Product Reviews

Author: Agrawal Sumit
Arora Jatin
Goyal Pawan
Pathak Sayan
Publication date: 31/10/2023

This paper presents a deep learning based approach to extract product comparison information from user reviews on various e-commerce websites. Any comparative product review has three major entities of information: the names of the products being compared, the user opinion (predicate), and the feature or aspect under comparison. These entities depend on each other and are bound by the rules of the language used in the review. We observe that their inter-dependencies can be captured well using LSTMs. We evaluate our system on existing manually labeled datasets and observe that it outperforms the existing Semantic Role Labeling (SRL) framework popular for this task. Comment: Source Code: https://github.com/jatinarora2702/Review-Information-Extractio

arXiv.org e-Print Archive

Exploration and visualization of gene expression with neuroanatomy in the adult mouse brain

Author: Hawrylycz Mike
Jones Allan
Kuan Leonard
Lau Christopher
Ng Lydia
Pathak Sayan
Thompson Carol
Publication venue: BioMed Central
Publication date: 01/01/2008

Background: Spatially mapped large scale gene expression databases enable quantitative comparison of data measurements across genes, anatomy, and phenotype. In most ongoing efforts to study gene expression in the mammalian brain, significant resources are applied to the mapping and visualization of data. This paper describes the implementation and utility of Brain Explorer, a 3D visualization tool for studying in situ hybridization-based (ISH) expression patterns in the Allen Brain Atlas, a genome-wide survey of 21,000 expression patterns in the C57BL/6J adult mouse brain. Results: Brain Explorer enables users to visualize gene expression data from the C57BL/6J mouse brain in 3D at a resolution of 100 μm³, allowing co-display of several experiments as well as 179 reference neuro-anatomical structures. Brain Explorer also allows viewing of the original ISH images referenced from any point in a 3D data set. Anatomic and spatial homology searches can be performed from the application to find data sets with expression in specific structures and with similar expression patterns. This latter feature allows for anatomy-independent queries and genome-wide expression correlation studies. Conclusion: These tools offer convenient access to detailed expression information in the adult mouse brain and the ability to perform data mining and visualization of gene expression and neuroanatomy in an integrated manner.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

TRScore: A Novel GPT-based Readability Scorer for ASR Segmentation and Punctuation model evaluation and selection

Author: Basoglu Chris
Behre Piyush
Chang Shuangyu
Kesavamoorthy Harini
Pathak Sayan
Shah Amy
Tan Sharman
Zuo Fei
Publication date: 26/10/2022

Punctuation and Segmentation are key to readability in Automatic Speech Recognition (ASR), often evaluated using F1 scores that require high-quality human transcripts and do not reflect readability well. Human evaluation is expensive, time-consuming, and suffers from large inter-observer variability, especially in conversational speech devoid of strict grammatical structures. Large pre-trained models capture a notion of grammatical structure. We present TRScore, a novel readability measure using the GPT model to evaluate different segmentation and punctuation systems. We validate our approach with human experts. Additionally, our approach enables quantitative assessment of text post-processing techniques such as capitalization, inverse text normalization (ITN), and disfluency on overall readability, which traditional word error rate (WER) and slot error rate (SER) metrics fail to capture. TRScore is strongly correlated to traditional F1 and human readability scores, with Pearson's correlation coefficients of 0.67 and 0.98, respectively. It also eliminates the need for human transcriptions for model selection
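
The Pearson's correlation coefficients quoted in the abstract above measure the linear agreement between two sets of scores. The following is a minimal sketch of that statistic only; the function name `pearson_r` and the sample numbers are hypothetical and are not taken from the paper or its data.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations (numerator) and sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical scores for illustration only (not from the paper).
metric_scores = [1.0, 2.0, 3.0, 4.0]
human_scores = [2.0, 4.0, 6.0, 8.0]
print(pearson_r(metric_scores, human_scores))
```

A value near 1 indicates that the two score lists rise and fall together; a value near 0 indicates little linear relationship.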

arXiv.org e-Print Archive

Smart Speech Segmentation using Acousto-Linguistic Features with look-ahead

Author: Basoglu Chris
Behre Piyush
Chang Shuangyu
Khalil Hosam
Liu Geoffrey
Parihar Naveen
Pathak Sayan
Shah Amy
Sharma Eva
Tan Sharman
Publication date: 27/10/2022

Segmentation for continuous Automatic Speech Recognition (ASR) has traditionally used silence timeouts or voice activity detectors (VADs), which are both limited to acoustic features. This segmentation is often overly aggressive, given that people naturally pause to think as they speak. Consequently, segmentation happens mid-sentence, hindering both punctuation and downstream tasks like machine translation for which high-quality segmentation is critical. Model-based segmentation methods that leverage acoustic features are powerful, but without an understanding of the language itself, these approaches are limited. We present a hybrid approach that leverages both acoustic and language information to improve segmentation. Furthermore, we show that including one word as a look-ahead boosts segmentation quality. On average, our models improve segmentation-F0.5 score by 9.8% over baseline. We show that this approach works for multiple languages. For the downstream task of machine translation, it improves the translation BLEU score by an average of 1.05 points
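
The segmentation-F0.5 score named in the abstract above is presumably an instance of the standard F-beta score with beta = 0.5, which weights precision more heavily than recall. The sketch below shows that standard formula only; how the paper counts correct segment boundaries is not stated in the abstract, and the function name `f_beta` and the example values are hypothetical.

```python
def f_beta(precision: float, recall: float, beta: float = 0.5) -> float:
    """Standard F-beta score; beta < 1 favors precision, beta > 1 favors recall."""
    if precision == 0.0 and recall == 0.0:
        return 0.0  # Avoid division by zero when nothing is correct.
    beta_sq = beta * beta
    return (1 + beta_sq) * precision * recall / (beta_sq * precision + recall)


# Hypothetical precision and recall values, for illustration only.
print(f_beta(0.8, 0.5))
```

With beta = 0.5, a system that rarely inserts a wrong boundary scores higher than one with the same harmonic mean but lower precision.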

arXiv.org e-Print Archive

Single cell fertilizer (SCF): Evidence to prove that bio-molecules are potent nutrient for plant growth

Author: Amit Kr. Roy
Deben Pathak
Dipankar Dey
Isita Chattopadhya
Rajib Ghosh
Sanjeev Upadhyay
Sayan Choudhury
Shibu Ghosh
Sohini Roy
Somenath Das
Sribir Sen
Publication date: 25/01/2009

Fertilizers of various kinds are used in the cultivation of crop plants to boost production of plant-based food materials. The study used bio-molecules made in a bacterial cell. The experimental results showed a marked effect on plant growth. These cellular molecules were obtained by treating the bacterial cells with lysozyme and proteinase K. The wet weight increased several-fold compared to that of control sets: 4.79-fold for rice, 2.77-fold for wheat, 1.89-fold for gram, and 1.89-fold for pea when bacterial cellular molecules were used as fertilizer.

Nature Precedings

An anatomic gene expression atlas of the adult mouse brain

Author: A Alvarez-Buylla
A MacKenzie-Graham
A Visel
Allan R Jones
Amy Bernard
AW Toga
Caroline C Overly
CE Gee
Chihchau Kuan
Chinh Dang
Chris Lau
David J Anderson
DC Van Essen
DN Abrous
Ed S Lein
ES Lein
G Marini
G Paxinos
Hemant Bokil
Hong-Wei Dong
HW Dong
Jason W Bohland
JB Kruskal
JL Price
John Hohmann
K Brodmann
LL Ng
Luis Puelles
LW Swanson
Lydia Ng
MA Zapala
MH DeGroot
Michael Hawrylycz
N Flames
P Voorn
PA Yushkevich
Partha P Mitra
RF Bonner
Sayan Pathak
SM Sherman
SM Sunkin
Susan M Sunkin
T Yamamori
VB Mountcastle
Y Ma
Y Nakamura
Publication venue: Nature Publishing Group
Publication date: 15/02/2009

Studying gene expression provides a powerful means of understanding structure-function relationships in the nervous system. The availability of genome-scale in situ hybridization datasets enables new possibilities for understanding brain organization based on gene expression patterns. The Anatomic Gene Expression Atlas (AGEA) is a new relational atlas revealing the genetic architecture of the adult C57Bl/6J mouse brain based on spatial correlations across expression data for thousands of genes in the Allen Brain Atlas (ABA). The AGEA includes three discovery tools for examining neuroanatomical relationships and boundaries: (1) three-dimensional expression-based correlation maps, (2) a hierarchical transcriptome-based parcellation of the brain and (3) a facility to retrieve from the ABA specific genes showing enriched expression in local correlated domains. The utility of this atlas is illustrated by analysis of genetic organization in the thalamus, striatum and cerebral cortex. The AGEA is a publicly accessible online computational tool integrated with the ABA (http://mouse.brain-map.org/agea)

Crossref

Cold Spring Harbor Laboratory Institutional Repository

Caltech Authors

Mechanistic Insight into the Reactivation of BCAII Enzyme from Denatured and Molten Globule States by Eukaryotic Ribosomes and Domain V rRNAs

Author: A Basu
A Ben-Shem
A Das
A Ghosh
A Jansens
A Katranidis
A Naeem
AA Komar
B Das
Biprashekhar Chakraborty
BK Pathak
C Voisset
CB Anfinsen
D Das
D Malakar
D Pal
D Samanta
FU Hartl
Hans-Joachim Wieden
HP Sorensen
Jayati Sengupta
JL Cleland
JR Warner
K Rajaraman
M Selmer
MA Algire
MS Svetlov
N Barbezier
N Bonander
NA Bushmarina
P Hammarstrom
R Santucci
R Singh
RH Argent
S Chattopadhyay
S Michaeli
Sayan Bhakta
SD Reis
SI Choi
SK Jha
TE Creighton
V Ramakrishnan
VN Uversky
W Kudlicki
W Wong
Y Hashem
Y Pang
Publication venue: Public Library of Science (PLoS)
Publication date: 21/04/2016

In all life forms, decoding of messenger RNA into a polypeptide chain is accomplished by the ribosome. Several protein chaperones are known to bind at the exit of the ribosomal tunnel to ensure proper folding of the nascent chain by inhibiting its premature folding in the densely crowded environment of the cell. However, accumulating evidence suggests that the ribosome may play a chaperone role in protein folding events in vitro. Ribosome-mediated folding of denatured proteins by prokaryotic ribosomes has been studied extensively. The RNA-assisted chaperone activity of the prokaryotic ribosome has been attributed to domain V, a span of 23S rRNA at the intersubunit side of the large subunit encompassing the Peptidyl Transferase Centre. Evidently, this functional property of the ribosome is unrelated to nascent chain protein folding at the exit of the ribosomal tunnel. Here, we examine whether this unique function is conserved in Leishmania donovani, a species from a primitive kinetoplastid group of eukaryotes, where the ribosome structure possesses distinct additional features and appears markedly different from other higher eukaryotic ribosomes. Bovine Carbonic Anhydrase II (BCAII) was used as the model protein. Our results show that domain V of the large subunit rRNA of Leishmania ribosomes preserves chaperone activity, suggesting that ribosome-mediated protein folding is indeed a conserved phenomenon. Further, we investigated the mechanism underpinning the ribosome-assisted protein reactivation process. Interestingly, surface plasmon resonance binding analyses show that rRNA guides productive folding by directly interacting with molten globule-like states of the protein. In contrast, the native protein shows no notable affinity for the rRNA. Thus, our study not only confirms a conserved, RNA-mediated chaperoning role of the ribosome but also provides crucial insight into the mechanism of the process.

Crossref

Directory of Open Access Journals

PubMed Central

EPrints@IICB Welcomes! - EPrints@IICB

FigShare

Predicting Group Success in Meetup

Author: Gundapuneni Midhun
Mitra Bivas
Pathak Sayan
Pramanik Soumajit
Publication venue: Association for the Advancement of Artificial Intelligence (AAAI)
Publication date: 31/03/2016

Success of groups in Meetup is of utmost importance to the members who organize them. However, measures of group success in Meetup have so far been quite vague. In this paper, we take a step toward quantifying the success of Meetup groups. Driven by a comprehensive study of our Meetup dataset, we handpick a set of key properties that can potentially regulate a group’s success. Finally, we develop a machine learning model leveraging these features that can predict the success of Meetup groups early with high accuracy.

Association for the Advancement of Artificial Intelligence: AAAI Publications

On the Role of Micro-categories to Characterize Event Popularity in Meetup

Author: Bhowmick Ayan Kumar
Mitra Bivas
Pathak Sayan
Pramanik Soumajit
Publication venue: Association for the Advancement of Artificial Intelligence
Publication date: 22/05/2021

Event-based social networking (EBSN) platforms such as Meetup have recently witnessed huge growth. However, with the rise in the volume of groups and events, making individual events attractive has become increasingly challenging for their organizers. As a result, we find that events hosted by groups of the same category at similar venues and similar times also differ widely in their popularity. Our data study reveals that the topics specified in textual descriptions of events may be key to their popularity. In this paper, we introduce a novel concept of topical micro-categories in the context of EBSNs for accurately characterizing events, such that events belonging to the same micro-category exhibit a similar popularity profile. We develop a principled method to detect such micro-categories from the textual descriptions of individual events. Our experiments reveal the significance of the detected micro-categories in determining the popularity of associated Meetup events and groups. We also investigate the effectiveness of the micro-categories in a real-world application scenario by developing a recommendation model; this model recommends relevant micro-categories to a group for hosting its future events with enhanced popularity. Notably, our model achieves an average NDCG score of around 0.75, a 5% improvement over the best performing competing method.
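
The NDCG score reported in the abstract above is a ranking metric that rewards placing the most relevant items near the top of a recommendation list. The sketch below uses one common formulation (linear gain with a log2 rank discount); the abstract does not say which variant the paper uses, and the function names and relevance values here are hypothetical.

```python
import math


def dcg(relevances):
    """Discounted cumulative gain: each relevance is divided by log2(rank + 1)."""
    return sum(rel / math.log2(rank + 1)
               for rank, rel in enumerate(relevances, start=1))


def ndcg(relevances):
    """DCG of the given ranking divided by the DCG of the ideal (sorted) ranking."""
    ideal = dcg(sorted(relevances, reverse=True))
    return dcg(relevances) / ideal if ideal > 0 else 0.0


# Hypothetical relevance grades of recommended items, in ranked order.
print(ndcg([3, 2, 0, 1]))
```

A perfectly ordered list scores 1.0, and the score drops as relevant items are pushed further down the ranking.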

Association for the Advancement of Artificial Intelligence: AAAI Publications

On the Splitting Dynamics of Meetup Social Groups

Author: Bhowmick Ayan Kumar
Mitra Bivas
Pathak Sayan
Pramanik Soumajit
Publication venue: Association for the Advancement of Artificial Intelligence
Publication date: 26/05/2020

Groups in online social networks witness continuous evolution by loss of existing members and gain of new members. In this paper, we present a study of group split in Meetup, where a major fraction of members leave the existing group together and join a newly formed group. We identify pivotal group members, called splitters, playing key roles in group split by influencing the existing members to leave the group. We provide an in-depth analysis of the empirical data to reveal key motivating factors leading to a group split and its subsequent impact. Finally, we develop a prediction model for early detection of splitters, as well as the group members likely to be influenced by the splitter to leave the group

Association for the Advancement of Artificial Intelligence: AAAI Publications