421 research outputs found
Inferential considerations for low-count RNA-seq transcripts: a case study on the dominant prairie grass Andropogon gerardii
Citation: Raithel, S., Johnson, L., Galliart, M., Brown, S., Shelton, J., Herndon, N., & Bello, N. M. (2016). Inferential considerations for low-count RNA-seq transcripts: a case study on the dominant prairie grass Andropogon gerardii. Bmc Genomics, 17, 16. doi:10.1186/s12864-016-2442-7Background: Differential expression (DE) analysis of RNA-seq data still poses inferential challenges, such as handling of transcripts characterized by low expression levels. In this study, we use a plasmode-based approach to assess the relative performance of alternative inferential strategies on RNA-seq transcripts, with special emphasis on transcripts characterized by a small number of read counts, so-called low-count transcripts, as motivated by an ecological application in prairie grasses. Big bluestem (Andropogon gerardii) is a wide-ranging dominant prairie grass of ecological and agricultural importance to the US Midwest while edaphic subspecies sand bluestem (A. gerardii ssp. Hallii) grows exclusively on sand dunes. Relative to big bluestem, sand bluestem exhibits qualitative phenotypic divergence consistent with enhanced drought tolerance, plausibly associated with transcripts of low expression levels. Our dataset consists of RNA-seq read counts for 25,582 transcripts (60 % of which are classified as low-count) collected from leaf tissue of individual plants of big bluestem (n = 4) and sand bluestem (n = 4). Focused on low-count transcripts, we compare alternative ad-hoc data filtering techniques commonly used in RNA-seq pipelines and assess the inferential performance of recently developed statistical methods for DE analysis, namely DESeq2 and edgeR robust. These methods attempt to overcome the inherently noisy behavior of low-count transcripts by either shrinkage or differential weighting of observations, respectively. Results: Both DE methods seemed to properly control family-wise type 1 error on low-count transcripts, whereas edgeR robust showed greater power and DESeq2 showed greater precision and accuracy. However, specification of the degree of freedom parameter under edgeR robust had a non-trivial impact on inference and should be handled carefully. When properly specified, both DE methods showed overall promising inferential performance on low-count transcripts, suggesting that ad-hoc data filtering steps at arbitrary expression thresholds may be unnecessary. A note of caution is in order regarding the approximate nature of DE tests under both methods. Conclusions: Practical recommendations for DE inference are provided when low-count RNA-seq transcripts are of interest, as is the case in the comparison of subspecies of bluestem grasses. Insights from this study may also be relevant to other applications focused on transcripts of low expression levels
TweetLID : a benchmark for tweet language identification
Language identification, as the task of determining the language a given text is written in, has progressed substantially in recent decades. However, three main issues remain still unresolved: (1) distinction of similar languages, (2) detection of multilingualism in a single document, and (3) identifying the language of short texts. In this paper, we describe our work on the development of a benchmark to encourage further research in these three directions, set forth an evaluation framework suitable for the task, and make a dataset of annotated tweets publicly available for research purposes. We also describe the shared task we organized to validate and assess the evaluation framework and dataset with systems submitted by seven different participants, and analyze the performance of these systems. The evaluation of the results submitted by the participants of the shared task helped us shed some light on the shortcomings of state-of-the-art language identification systems, and gives insight into the extent to which the brevity, multilingualism, and language similarity found in texts exacerbate the performance of language identifiers. Our dataset with nearly 35,000 tweets and the evaluation framework provide researchers and practitioners with suitable resources to further study the aforementioned issues on language identification within a common setting that enables to compare results with one another
All clinically-relevant blood components transmit prion disease following a single blood transfusion: a sheep model of vCJD
Variant CJD (vCJD) is an incurable, infectious human disease, likely arising from the consumption of BSE-contaminated meat products. Whilst the epidemic appears to be waning, there is much concern that vCJD infection may be perpetuated in humans by the transfusion of contaminated blood products. Since 2004, several cases of transfusion-associated vCJD transmission have been reported and linked to blood collected from pre-clinically affected donors. Using an animal model in which the disease manifested resembles that of humans affected with vCJD, we examined which blood components used in human medicine are likely to pose the greatest risk of transmitting vCJD via transfusion. We collected two full units of blood from BSE-infected donor animals during the pre-clinical phase of infection. Using methods employed by transfusion services we prepared red cell concentrates, plasma and platelets units (including leucoreduced equivalents). Following transfusion, we showed that all components contain sufficient levels of infectivity to cause disease following only a single transfusion and also that leucoreduction did not prevent disease transmission. These data suggest that all blood components are vectors for prion disease transmission, and highlight the importance of multiple control measures to minimise the risk of human to human transmission of vCJD by blood transfusion
Gauge invariant definition of the jet quenching parameter
In the framework of Soft-Collinear Effective Theory, the jet quenching
parameter, , has been evaluated by adding the effect of Glauber gluon
interactions to the propagation of a highly-energetic collinear parton in a
medium. The result, which holds in covariant gauges, has been expressed in
terms of the expectation value of two Wilson lines stretching along the
direction of the four-momentum of the parton. In this paper, we show how that
expression can be generalized to an arbitrary gauge by the addition of
transverse Wilson lines. The transverse Wilson lines are explicitly computed by
resumming interactions of the parton with Glauber gluons that appear only in
non-covariant gauges. As an application of our result, we discuss the
contribution to coming from transverse momenta of order in a
medium that is a weakly-coupled quark-gluon plasma.Comment: 31 pages, 7 figures; journal versio
Prevalence of MRI lesions in men responding to a GP-led invitation for a prostate health check: a prospective cohort study
OBJECTIVE: In men with a raised prostate-specific antigen (PSA), MRI increases the detection of clinically significant cancer and reduces overdiagnosis, with fewer biopsies. MRI as a screening tool has not been assessed independently of PSA in a formal screening study. We report a systematic community-based assessment of the prevalence of prostate MRI lesions in an age-selected population.
METHODS AND ANALYSIS: Men aged 50â75 were identified from participating general practice (GP) practices and randomly selected for invitation to a screening MRI and PSA. Men with a positive MRI or a raised PSA density (â„0.12âng/mL2) were recommended for standard National Health Service (NHS) prostate cancer assessment. RESULTS: Eight GP practices sent invitations to 2096 men. 457 men (22%) responded and 303 completed both screening tests. Older white men were most likely to respond to the invitation, with black men having 20% of the acceptance rate of white men. One in six men (48/303 men, 16%) had a positive screening MRI, and an additional 1 in 20 men (16/303, 5%) had a raised PSA density alone. After NHS assessment, 29 men (9.6%) were diagnosed with clinically significant cancer and 3 men (1%) with clinically insignificant cancer. Two in three men with a positive MRI, and more than half of men with clinically significant disease had a PSA <3âng/mL. CONCLUSIONS: Prostate MRI may have value in screening independently of PSA. These data will allow modelling of the use of MRI as a primary screening tool to inform larger prostate cancer screening studies. TRIAL REGISTRATION NUMBER:
NCT04063566
The zoo of models of deliberate ignorance
This chapter looks at deliberate ignorance from a modeling perspective. Standard economic models cannot produce deliberate ignorance in a meaningful way; if there were no cost for acquisition and processing, data could be looked at privately and processed perfectly. Here the focus is on cases where the standard assumptions are violated in some way. Cases are considered from an individualâs perspective, without game-theoretic (strategic) aspects. Different classes of ânot wanting to knowâ something are identified: aside from the boring case of the cost of information acquisition being too high, an individual may prefer to not know some information (e.g., when knowledge would reduce the enjoyment of other experiences) or may want to not use some information (e.g., relating to a lack of self-control). In addition, strategic cases of deliberate ignorance are reviewed, where obtaining information would also signal to others that information acquisition has occurred, and thus it may be better to remain ignorant. Finally, the possibility of deliberate ignorance emerging in population-level models is discussed, where there seems to be a relative dearth of models of the phenomenon at present. Throughout, the authors make use of examples to summarize different classes of models, ideas for how deliberate ignorance can make sense, and gaps in the literature for future modeling
Recommended from our members
Multi-ancestry study of blood lipid levels identifies four loci interacting with physical activity.
Many genetic loci affect circulating lipid levels, but it remains unknown whether lifestyle factors, such as physical activity, modify these genetic effects. To identify lipid loci interacting with physical activity, we performed genome-wide analyses of circulating HDL cholesterol, LDL cholesterol, and triglyceride levels in up to 120,979 individuals of European, African, Asian, Hispanic, and Brazilian ancestry, with follow-up of suggestive associations in an additional 131,012 individuals. We find four loci, in/near CLASP1, LHX1, SNTA1, and CNTNAP2, that are associated with circulating lipid levels through interaction with physical activity; higher levels of physical activity enhance the HDL cholesterol-increasing effects of the CLASP1, LHX1, and SNTA1 loci and attenuate the LDL cholesterol-increasing effect of the CNTNAP2 locus. The CLASP1, LHX1, and SNTA1 regions harbor genes linked to muscle function and lipid metabolism. Our results elucidate the role of physical activity interactions in the genetic contribution to blood lipid levels
The PrPC Cl fragment derived from the ovine A(136)R(154)R(171) PRNP allele is highly abundant in sheep brain and inhibits fibrillisation of full-length PrPC protein in vitro
AbstractExpression of the cellular prion protein (PrPC) is crucial for the development of prion diseases. Resistance to prion diseases can result from reduced availability of the prion protein or from amino acid changes in the prion protein sequence. We propose here that increased production of a natural PrP α-cleavage fragment, C1, is also associated with resistance to disease. We show, in brain tissue, that ARR homozygous sheep, associated with resistance to disease, produced PrPC comprised of 25% more C1 fragment than PrPC from the disease-susceptible ARQ homozygous and highly susceptible VRQ homozygous animals. Only the C1 fragment derived from the ARR allele inhibits in-vitro fibrillisation of other allelic PrPC variants. We propose that the increased α-cleavage of ovine ARR PrPC contributes to a dominant negative effect of this polymorphism on disease susceptibility. Furthermore, the significant reduction in PrPC ÎČ-cleavage product C2 in sheep of the ARR/ARR genotype compared to ARQ/ARQ and VRQ/VRQ genotypes, may add to the complexity of genetic determinants of prion disease susceptibility
Interaction of mumps virus V protein variants with STAT1-STAT2 heterodimer: experimental and theoretical studies
<p>Abstract</p> <p>Background</p> <p>Mumps virus V protein has the ability to inhibit the interferon-mediated antiviral response by inducing degradation of STAT proteins. Two virus variants purified from Urabe AM9 mumps virus vaccine differ in their replication and transcription efficiency in cells primed with interferon. Virus susceptibility to IFN was associated with insertion of a non-coded glycine at position 156 in the V protein (VGly) of one virus variant, whereas resistance to IFN was associated with preservation of wild-type phenotype in the V protein (VWT) of the other variant.</p> <p>Results</p> <p>VWT and VGly variants of mumps virus were cloned and sequenced from Urabe AM9 vaccine strain. VGly differs from VWT protein because it possesses an amino acid change Gln<sub>103</sub>Pro (Pro<sup>103</sup>) and the Gly<sup>156 </sup>insertion. The effect of V protein variants on components of the interferon-stimulated gene factor 3 (ISGF3), STAT1 and STAT2 proteins were experimentally tested in cervical carcinoma cell lines. Expression of VWT protein decreased STAT1 phosphorylation, whereas VGly had no inhibitory effect on either STAT1 or STAT2 phosphorylation. For theoretical analysis of the interaction between V proteins and STAT proteins, 3D structural models of VWT and VGly were predicted by comparing with simian virus 5 (SV5) V protein structure in complex with STAT1-STAT2 heterodimer. <it>In silico </it>analysis showed that VWT-STAT1-STAT2 complex occurs through the V protein Trp-motif (W<sup>174</sup>, W<sup>178</sup>, W<sup>189</sup>) and Glu<sup>95 </sup>residue close to the Arg<sup>409 </sup>and Lys<sup>415 </sup>of the nuclear localization signal (NLS) of STAT2, leaving exposed STAT1 Lys residues (K<sup>85</sup>, K<sup>87</sup>, K<sup>296</sup>, K<sup>413</sup>, K<sup>525</sup>, K<sup>679</sup>, K<sup>685</sup>), which are susceptible to proteasome degradation. In contrast, the interaction between VGly and STAT1-STAT2 heterodimer occurs in a region far from the NLS of STAT2 without blocking of Lys residues in both STAT1 and STAT2.</p> <p>Conclusions</p> <p>Our results suggest that VWT protein of Urabe AM9 strain of mumps virus may be more efficient than VGly to inactivate both the IFN signaling pathway and antiviral response due to differences in their finest molecular interaction with STAT proteins.</p
Genome-wide Association Study of Platelet Count Identifies Ancestry-Specific Loci in Hispanic/Latino Americans
Platelets play an essential role in hemostasis and thrombosis. We performed a genome-wide association study of platelet count in 12,491 participants of the Hispanic Community Health Study/Study of Latinos by using a mixed-model method that accounts for admixture and family relationships. We discovered and replicated associations with five genes (ACTN1, ETV7, GABBR1-MOG, MEF2C, and ZBTB9-BAK1). Our strongest association was with Amerindian-specific variant rs117672662 (p value = 1.16 Ă 10â28) in ACTN1, a gene implicated in congenital macrothrombocytopenia. rs117672662 exhibited allelic differences in transcriptional activity and protein binding in hematopoietic cells. Our results underscore the value of diverse populations to extend insights into the allelic architecture of complex traits
- âŠ