76 research outputs found
Automated and Interpretable Patient ECG Profiles for Disease Detection, Tracking, and Discovery
The electrocardiogram or ECG has been in use for over 100 years and remains
the most widely performed diagnostic test to characterize cardiac structure and
electrical activity. We hypothesized that parallel advances in computing power,
innovations in machine learning algorithms, and availability of large-scale
digitized ECG data would enable extending the utility of the ECG beyond its
current limitations, while at the same time preserving interpretability, which
is fundamental to medical decision-making. We identified 36,186 ECGs from the
UCSF database that were 1) in normal sinus rhythm and 2) would enable training
of specific models for estimation of cardiac structure or function or detection
of disease. We derived a novel model for ECG segmentation using convolutional
neural networks (CNN) and Hidden Markov Models (HMM) and evaluated its output
by comparing electrical interval estimates to 141,864 measurements from the
clinical workflow. We built a 725-element patient-level ECG profile using
downsampled segmentation data and trained machine learning models to estimate
left ventricular mass, left atrial volume, mitral annulus e' and to detect and
track four diseases: pulmonary arterial hypertension (PAH), hypertrophic
cardiomyopathy (HCM), cardiac amyloid (CA), and mitral valve prolapse (MVP).
CNN-HMM derived ECG segmentation agreed with clinical estimates, with median
absolute deviations (MAD) as a fraction of observed value of 0.6% for heart
rate and 4% for QT interval. Patient-level ECG profiles enabled quantitative
estimates of left ventricular and mitral annulus e' velocity with good
discrimination in binary classification models of left ventricular hypertrophy
and diastolic function. Models for disease detection ranged from AUROC of 0.94
to 0.77 for MVP. Top-ranked variables for all models included known ECG
characteristics along with novel predictors of these traits/diseases.Comment: 13 pages, 6 figures, 1 Table + Supplemen
Recognition of Polyadenylate RNA by the Poly(A)-Binding Protein
AbstractThe cocrystal structure of human poly(A)-binding protein (PABP) has been determined at 2.6 Å resolution. PABP recognizes the 3′ mRNA poly(A) tail and plays critical roles in eukaryotic translation initiation and mRNA stabilization/degradation. The minimal PABP used in this study consists of the N-terminal two RRM-type RNA-binding domains connected by a short linker (RRM1/2). These two RRMs form a continuous RNA-binding trough, lined by an antiparallel β sheet backed by four α helices. The polyadenylate RNA adopts an extended conformation running the length of the molecular trough. Adenine recognition is primarily mediated by contacts with conserved residues found in the RNP motifs of the two RRMs. The convex dorsum of RRM1/2 displays a phylogenetically conserved hydrophobic/acidic portion, which may interact with translation initiation factors and regulatory proteins
Recommended from our members
Adipocyte JAK2 Regulates Hepatic Insulin Sensitivity Independently of Body Composition, Liver Lipid Content, and Hepatic Insulin Signaling.
Disruption of hepatocyte growth hormone (GH) signaling through disruption of Jak2 (JAK2L) leads to fatty liver. Previously, we demonstrated that development of fatty liver depends on adipocyte GH signaling. We sought to determine the individual roles of hepatocyte and adipocyte Jak2 on whole-body and tissue insulin sensitivity and liver metabolism. On chow, JAK2L mice had hepatic steatosis and severe whole-body and hepatic insulin resistance. However, concomitant deletion of Jak2 in hepatocytes and adipocytes (JAK2LA) completely normalized insulin sensitivity while reducing liver lipid content. On high-fat diet, JAK2L mice had hepatic steatosis and insulin resistance despite protection from diet-induced obesity. JAK2LA mice had higher liver lipid content and no protection from obesity but retained exquisite hepatic insulin sensitivity. AKT activity was selectively attenuated in JAK2L adipose tissue, whereas hepatic insulin signaling remained intact despite profound hepatic insulin resistance. Therefore, JAK2 in adipose tissue is epistatic to liver with regard to insulin sensitivity and responsiveness, despite fatty liver and obesity. However, hepatocyte autonomous JAK2 signaling regulates liver lipid deposition under conditions of excess dietary fat. This work demonstrates how various tissues integrate JAK2 signals to regulate insulin/glucose and lipid metabolism
Pattern Specification and Immune Response Transcriptional Signatures of Pericardial and Subcutaneous Adipose Tissue
Cardiovascular disease (CVD) remains the leading cause of morbidity and mortality in the United States. Recent studies suggest that pericardial adipose tissue (PCAT) secretes inflammatory factors that contribute to the development of CVD. To better characterize the role of PCAT in the pathogenesis of disease, we performed a large-scale unbiased analysis of the transcriptional differences between PCAT and subcutaneous adipose tissue, analysing 53 microarrays across 19 individuals. As it was unknown whether PCAT-secreted factors are produced by adipocytes or cells in the supporting stromal fraction, we also sought to identify differentially expressed genes in isolated pericardial adipocytes vs. isolated subcutaneous adipocytes. Using microarray analysis, we found that: 1) pericardial adipose tissue and isolated pericardial adipocytes both overexpress atherosclerosis-promoting chemokines and 2) pericardial and subcutaneous fat depots, as well as isolated pericardial adipocytes and subcutaneous adipocytes, express specific patterns of homeobox genes. In contrast, a core set of lipid processing genes showed no significant overlap with differentially expressed transcripts. These depot-specific homeobox signatures and transcriptional profiles strongly suggest different functional roles for the pericardial and subcutaneous adipose depots. Further characterization of these inter-depot differences should be a research priority
Single-Nucleotide Polymorphisms in LPA Explain Most of the Ancestry-Specific Variation in Lp(a) Levels in African Americans
Lipoprotein(a) (Lp(a)) is an important causal cardiovascular risk factor, with serum Lp(a) levels predicting atherosclerotic heart disease and genetic determinants of Lp(a) levels showing association with myocardial infarction. Lp(a) levels vary widely between populations, with African-derived populations having nearly 2-fold higher Lp(a) levels than European Americans. We investigated the genetic basis of this difference in 4464 African Americans from the Jackson Heart Study (JHS) using a panel of up to 1447 ancestry informative markers, allowing us to accurately estimate the African ancestry proportion of each individual at each position in the genome. In an unbiased genome-wide admixture scan for frequency-differentiated genetic determinants of Lp(a) level, we found a convincing peak (LOD = 13.6) at 6q25.3, which spans the LPA locus. Dense fine-mapping of the LPA locus identified a number of strongly associated, common biallelic SNPs, a subset of which can account for up to 7% of the variation in Lp(a) level, as well as >70% of the African-European population differences in Lp(a) level. We replicated the association of the most strongly associated SNP, rs9457951 (p = 6×10−22, 27% change in Lp(a) per allele, ∼5% of Lp(a) variance explained in JHS), in 1,726 African Americans from the Dallas Heart Study and found an even stronger association after adjustment for the kringle(IV) repeat copy number. Despite the strong association with Lp(a) levels, we find no association of any LPA SNP with incident coronary heart disease in 3,225 African Americans from the Atherosclerosis Risk in Communities Study
Recommended from our members
Prioritizing causal disease genes using unbiased genomic features
Background: Cardiovascular disease (CVD) is the leading cause of death in the developed world. Human genetic studies, including genome-wide sequencing and SNP-array approaches, promise to reveal disease genes and mechanisms representing new therapeutic targets. In practice, however, identification of the actual genes contributing to disease pathogenesis has lagged behind identification of associated loci, thus limiting the clinical benefits. Results: To aid in localizing causal genes, we develop a machine learning approach, Objective Prioritization for Enhanced Novelty (OPEN), which quantitatively prioritizes gene-disease associations based on a diverse group of genomic features. This approach uses only unbiased predictive features and thus is not hampered by a preference towards previously well-characterized genes. We demonstrate success in identifying genetic determinants for CVD-related traits, including cholesterol levels, blood pressure, and conduction system and cardiomyopathy phenotypes. Using OPEN, we prioritize genes, including FLNC, for association with increased left ventricular diameter, which is a defining feature of a prevalent cardiovascular disorder, dilated cardiomyopathy or DCM. Using a zebrafish model, we experimentally validate FLNC and identify a novel FLNC splice-site mutation in a patient with severe DCM. Conclusion: Our approach stands to assist interpretation of large-scale genetic studies without compromising their fundamentally unbiased nature. Electronic supplementary material The online version of this article (doi:10.1186/s13059-014-0534-8) contains supplementary material, which is available to authorized users
An internal promoter underlies the difference in disease severity between N- and C-terminal truncation mutations of Titin in zebrafish
Truncating mutations in the giant sarcomeric protein Titin result in dilated cardiomyopathy and skeletal myopathy. The most severely affected dilated cardiomyopathy patients harbor Titin truncations in the C-terminal two-thirds of the protein, suggesting that mutation position might influence disease mechanism. Using CRISPR/Cas9 technology, we generated six zebrafish lines with Titin truncations in the N-terminal and C-terminal regions. Although all exons were constitutive, C-terminal mutations caused severe myopathy whereas N-terminal mutations demonstrated mild phenotypes. Surprisingly, neither mutation type acted as a dominant negative. Instead, we found a conserved internal promoter at the precise position where divergence in disease severity occurs, with the resulting protein product partially rescuing N-terminal truncations. In addition to its clinical implications, our work may shed light on a long-standing mystery regarding the architecture of the sarcomere
Genetic Differences between the Determinants of Lipid Profile Phenotypes in African and European Americans: The Jackson Heart Study
Genome-wide association analysis in populations of European descent has recently found more than a hundred genetic variants affecting risk for common disease. An open question, however, is how relevant the variants discovered in Europeans are to other populations. To address this problem for cardiovascular phenotypes, we studied a cohort of 4,464 African Americans from the Jackson Heart Study (JHS), in whom we genotyped both a panel of 12 recently discovered genetic variants known to predict lipid profile levels in Europeans and a panel of up to 1,447 ancestry informative markers allowing us to determine the African ancestry proportion of each individual at each position in the genome. Focusing on lipid profiles—HDL-cholesterol (HDL-C), LDL-cholesterol (LDL-C), and triglycerides (TG)—we identified the lipoprotein lipase (LPL) locus as harboring variants that account for interethnic variation in HDL-C and TG. In particular, we identified a novel common variant within LPL that is strongly associated with TG (p = 2.7×10−6) and explains nearly 1% of the variability in this phenotype, the most of any variant in African Americans to date. Strikingly, the extensively studied “gain-of-function” S447X mutation at LPL, which has been hypothesized to be the major determinant of the LPL-TG genetic association and is in trials for human gene therapy, has a significantly diminished strength of biological effect when it is found on a background of African rather than European ancestry. These results suggest that there are other, yet undiscovered variants at the locus that are truly causal (and are in linkage disequilibrium with S447X) or that work synergistically with S447X to modulate TG levels. Finally, we find systematically lower effect sizes for the 12 risk variants discovered in European populations on the African local ancestry background in JHS, highlighting the need for caution in the use of genetic variants for risk assessment across different populations
Recommended from our members
Interpreting Cancer Genomes Using Systematic Host Perturbations by Tumour Virus Proteins
Genotypic differences greatly influence susceptibility and resistance to disease. Understanding genotype-phenotype relationships requires that phenotypes be viewed as manifestations of network properties, rather than simply as the result of individual genomic variations. Genome sequencing efforts have identified numerous germline mutations associated with cancer predisposition and large numbers of somatic genomic alterations. However, it remains challenging to distinguish between background, or “passenger” and causal, or “driver” cancer mutations in these datasets. Human viruses intrinsically depend on their host cell during the course of infection and can elicit pathological phenotypes similar to those arising from mutations. To test the hypothesis that genomic variations and tumour viruses may cause cancer via related mechanisms, we systematically examined host interactome and transcriptome network perturbations caused by DNA tumour virus proteins. The resulting integrated viral perturbation data reflects rewiring of the host cell networks, and highlights pathways that go awry in cancer, such as Notch signalling and apoptosis. We show that systematic analyses of host targets of viral proteins can identify cancer genes with a success rate on par with their identification through functional genomics and large-scale cataloguing of tumour mutations. Together, these complementary approaches result in increased specificity for cancer gene identification. Combining systems-level studies of pathogen-encoded gene products with genomic approaches will facilitate prioritization of cancer-causing driver genes so as to advance understanding of the genetic basis of human cancer
- …