79 research outputs found
SENTIMENT ANALYSIS OF CHINESE MICROBLOG MESSAGE USING NEURAL NETWORK-BASED VECTOR REPRESENTATION FOR MEASURING REGIONAL PREJUDICE
Regional prejudice is prevalent in Chinese cities in which native residents and migrants lack a basic level of trust in the other group. Like Twitter, Sina Weibo is a social media platform where people actively engage in discussions on various social issues. Thus, it provides a good data source for measuring individuals’ regional prejudice on a large scale. We find that a resentful tone dominates in Weibo messages related to migrants. In this paper, we propose a novel approach, named DKV, for recognizing polarity and direction of sentiment for Weibo messages using distributed real-valued vector representation of keywords learned from neural networks. Such a representation can project rich context information (or embedding) into the vector space, and subsequently be used to infer similarity measures among words, sentences, and even documents. We provide a comprehensive performance evaluation to demonstrate that by exploiting the keyword embeddings, DKV paired with support vector machines can effectively recognize a Weibo message into the predefined sentiment and its direction. Results demonstrate that our method can achieve the best performances compared to other approaches
Molecular epidemiology and emergence of worldwide epidemic clones of Neisseria meningitidis in Taiwan
BACKGROUND: Meningococcal disease is infrequently found in Taiwan, a country with 23 million people. Between 1996 and 2002, 17 to 81 clinical cases of the disease were reported annually. Reported cases dramatically increased in 2001–2002. Our record shows that only serogroup B and W135 meningococci have been isolated from patients with meningococcal disease until 2000. However, serogroup A, C and Y meningococci were detected for the first time in 2001 and continued to cause disease through 2002. Most of serogroup Y meningococcus infections localized in Central Taiwan in 2001, indicating that a small-scale outbreak of meningococcal disease had occurred. The occurrence of a meningococcal disease outbreak and the emergence of new meningococcal strains are of public health concern. METHODS: Neisseria meningitidis isolates from patients with meningococcal disease from 1996 to 2002 were collected and characterized by serogrouping, pulsed-field gel electrophoresis (PFGE) and multilocus sequence typing (MLST). The genetic relatedness and clonal relationship between the isolates were analyzed by using the PFGE patterns and the allelic profiles of the sequence types (STs). RESULTS: Serogroups A, B, C, W135, Y, and non-serogroupable Neisseria meningitidis were, respectively, responsible for 2%, 50%, 2%, 35%, 9%, and 2% of 158 culture-confirmed cases of meningococcal disease in 1996–2002. Among 100 N. meningitidis isolates available for PFGE and MLST analyses, 51 different PFGE patterns and 30 STs were identified with discriminatory indices of 0.95 and 0.87, respectively. Of the 30 STs, 21 were newly identified and of which 19 were found in serogroup B isolates. A total of 40 PFGE patterns were identified in 52 serogroup B isolates with the patterns distributed over several distinct clusters. In contrast, the isolates within each of the serogroups A, C, W135, and Y shared high levels of PFGE pattern similarity. Analysis of the allelic profile of the 30 STs suggested the serogroup B isolates be assigned into 5 clonally related groups/ clonal complexes and 7 unique clones. The ST-41/44 complex/Lineage 3, and the ST-3439 and ST-3200 groups represented 79% of the serogroup B meningococci. In contrast, isolates within serogroups A, serogroup W135 (and C), and serogroup Y, respectively, simply belonged to ST-7, ST-11, and ST-23 clones. CONCLUSION: Our data suggested that serogroup B isolates were derived from several distinct lineages, most of which could either be indigenous or were introduced into Taiwan a long time ago. The serogroup A, W135 (and C), and Y isolates, respectively, belonged to the ST-7, ST-11, and ST-23, and the represented clones that are currently the major circulating clones in the world and are introduced into Taiwan more recently. The emergence of serogroup A, C and Y strains contributed partly to the increase in cases of meningococcal disease in 2001–2002
Genome-Wide Gene Expression Analysis Implicates the Immune Response and Lymphangiogenesis in the Pathogenesis of Fetal Chylothorax
Fetal chylothorax (FC) is a rare condition characterized by lymphocyte-rich pleural effusion. Although its pathogenesis remains elusive, it may involve inflammation, since there are increased concentrations of proinflammatory mediators in pleural fluids. Only a few hereditary lymphedema-associated gene loci, e.g. VEGFR3, ITGA9 and PTPN11, were detected in human fetuses with this condition; these cases had a poorer prognosis, due to defective lymphangiogenesis. In the present study, genome-wide gene expression analysis was conducted, comparing pleural and ascitic fluids in three hydropic fetuses, one with and two without the ITGA9 mutation. One fetus (the index case), from a dizygotic pregnancy (the cotwin was unaffected), received antenatal OK-432 pleurodesis and survived beyond the neonatal stage, despite having the ITGA9 mutation. Genes and pathways involved in the immune response were universally up-regulated in fetal pleural fluids compared to those in ascitic fluids. Furthermore, genes involved in the lymphangiogenesis pathway were down-regulated in fetal pleural fluids (compared to ascitic fluid), but following OK-432 pleurodesis, they were up-regulated. Expression of ITGA9 was concordant with overall trends of lymphangiogenesis. In conclusion, we inferred that both the immune response and lymphangiogenesis were implicated in the pathogenesis of fetal chylothorax. Furthermore, genome-wide gene expression microarray analysis may facilitate personalized medicine by selecting the most appropriate treatment, according to the specific circumstances of the patient, for this rare, but heterogeneous disease
Urinary levels of organophosphate flame retardants metabolites in a young population from Southern Taiwan and potential health effects
BackgroundOrganophosphate flame retardants (OPFRs) are widely distributed in the environment and their metabolites are observed in urine, but little is known regarding OPFRs in a broad-spectrum young population from newborns to those aged 18 years.ObjectivesInvestigate urinary levels of OPFRs and OPFR metabolites in Taiwanese infants, young children, schoolchildren, and adolescents within the general population.MethodsDifferent age groups of subjects (n=136) were recruited from southern Taiwan to detect 10 OPFR metabolites in urine samples. Associations between urinary OPFRs and their corresponding metabolites and potential health status were also examined.ResultsThe mean level of urinary Σ10 OPFR in this broad-spectrum young population is 2.25 μg/L (standard deviation (SD) of 1.91 μg/L). Σ10 OPFR metabolites in urine are 3.25 ± 2.84, 3.06 ± 2.21, 1.75 ± 1.10, and 2.32 ± 2.29 μg/L in the age groups comprising of newborns, 1-5 year-olds, 6-10 year-olds, and 11-18 year-olds, respectively, and borderline significant differences were found in the different age groups (p=0.125). The OPFR metabolites of TCEP, BCEP, DPHP, TBEP, DBEP, and BDCPP predominate in urine and comprise more than 90% of the total. TBEP was highly correlated with DBEP in this population (r=0.845, p<0.001). The estimated daily intake (EDI) of Σ5OPFRs (TDCPP, TCEP, TBEP, TNBP, and TPHP) was 2,230, 461, 130, and 184 ng/kg bw/day for newborns, 1-5 yr children, 6-10 yr children, and 11-17 yr adolescents, respectively. The EDI of Σ5OPFRs for newborns was 4.83-17.2 times higher than the other age groups. Urinary OPFR metabolites are significantly correlated with birth length and chest circumference in newborns.ConclusionTo our knowledge, this is the first investigation of urinary OPFR metabolite levels in a broad-spectrum young population. There tended to be higher exposure rates in both newborns and pre-schoolers, though little is known about their exposure levels or factors leading to exposure in the young population. Further studies should clarify the exposure levels and factor relationships
Genome-Wide Association Study of Lung Adenocarcinoma in East Asia and Comparison With a European Population
Lung adenocarcinoma is the most common type of lung cancer. Known risk variants explain only a small fraction of lung adenocarcinoma heritability. Here, we conducted a two-stage genome-wide association study of lung adenocarcinoma of East Asian ancestry (21,658 cases and 150,676 controls; 54.5% never-smokers) and identified 12 novel susceptibility variants, bringing the total number to 28 at 25 independent loci. Transcriptome-wide association analyses together with colocalization studies using a Taiwanese lung expression quantitative trait loci dataset (n = 115) identified novel candidate genes, including FADS1 at 11q12 and ELF5 at 11p13. In a multi-ancestry meta-analysis of East Asian and European studies, four loci were identified at 2p11, 4q32, 16q23, and 18q12. At the same time, most of our findings in East Asian populations showed no evidence of association in European populations. In our studies drawn from East Asian populations, a polygenic risk score based on the 25 loci had a stronger association in never-smokers vs. individuals with a history of smoking (Pinteraction = 0.0058). These findings provide new insights into the etiology of lung adenocarcinoma in individuals from East Asian populations, which could be important in developing translational applications
Genome-wide association study of lung adenocarcinoma in East Asia and comparison with a European population
Lung adenocarcinoma is the most common type of lung cancer. Known risk variants explain only a small fraction of lung adenocarcinoma heritability. Here, we conducted a two-stage genome-wide association study of lung adenocarcinoma of East Asian ancestry (21,658 cases and 150,676 controls; 54.5% never-smokers) and identified 12 novel susceptibility variants, bringing the total number to 28 at 25 independent loci. Transcriptome-wide association analyses together with colocalization studies using a Taiwanese lung expression quantitative trait loci dataset (n = 115) identified novel candidate genes, including FADS1 at 11q12 and ELF5 at 11p13. In a multi-ancestry meta-analysis of East Asian and European studies, four loci were identified at 2p11, 4q32, 16q23, and 18q12. At the same time, most of our findings in East Asian populations showed no evidence of association in European populations. In our studies drawn from East Asian populations, a polygenic risk score based on the 25 loci had a stronger association in never-smokers vs. individuals with a history of smoking (P interaction = 0.0058). These findings provide new insights into the etiology of lung adenocarcinoma in individuals from East Asian populations, which could be important in developing translational applications
Genome-wide association study of lung adenocarcinoma in East Asia and comparison with a European population.
Lung adenocarcinoma is the most common type of lung cancer. Known risk variants explain only a small fraction of lung adenocarcinoma heritability. Here, we conducted a two-stage genome-wide association study of lung adenocarcinoma of East Asian ancestry (21,658 cases and 150,676 controls; 54.5% never-smokers) and identified 12 novel susceptibility variants, bringing the total number to 28 at 25 independent loci. Transcriptome-wide association analyses together with colocalization studies using a Taiwanese lung expression quantitative trait loci dataset (n = 115) identified novel candidate genes, including FADS1 at 11q12 and ELF5 at 11p13. In a multi-ancestry meta-analysis of East Asian and European studies, four loci were identified at 2p11, 4q32, 16q23, and 18q12. At the same time, most of our findings in East Asian populations showed no evidence of association in European populations. In our studies drawn from East Asian populations, a polygenic risk score based on the 25 loci had a stronger association in never-smokers vs. individuals with a history of smoking (Pinteraction = 0.0058). These findings provide new insights into the etiology of lung adenocarcinoma in individuals from East Asian populations, which could be important in developing translational applications
Retrospective evaluation of whole exome and genome mutation calls in 746 cancer samples
Funder: NCI U24CA211006Abstract: The Cancer Genome Atlas (TCGA) and International Cancer Genome Consortium (ICGC) curated consensus somatic mutation calls using whole exome sequencing (WES) and whole genome sequencing (WGS), respectively. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2,658 cancers across 38 tumour types, we compare WES and WGS side-by-side from 746 TCGA samples, finding that ~80% of mutations overlap in covered exonic regions. We estimate that low variant allele fraction (VAF < 15%) and clonal heterogeneity contribute up to 68% of private WGS mutations and 71% of private WES mutations. We observe that ~30% of private WGS mutations trace to mutations identified by a single variant caller in WES consensus efforts. WGS captures both ~50% more variation in exonic regions and un-observed mutations in loci with variable GC-content. Together, our analysis highlights technological divergences between two reproducible somatic variant detection efforts
- …