42,178 research outputs found
Integration and mining of malaria molecular, functional and pharmacological data: how far are we from a chemogenomic knowledge space?
The organization and mining of malaria genomic and post-genomic data is
highly motivated by the necessity to predict and characterize new biological
targets and new drugs. Biological targets are sought in a biological space
designed from the genomic data from Plasmodium falciparum, but using also the
millions of genomic data from other species. Drug candidates are sought in a
chemical space containing the millions of small molecules stored in public and
private chemolibraries. Data management should therefore be as reliable and
versatile as possible. In this context, we examined five aspects of the
organization and mining of malaria genomic and post-genomic data: 1) the
comparison of protein sequences including compositionally atypical malaria
sequences, 2) the high throughput reconstruction of molecular phylogenies, 3)
the representation of biological processes particularly metabolic pathways, 4)
the versatile methods to integrate genomic data, biological representations and
functional profiling obtained from X-omic experiments after drug treatments and
5) the determination and prediction of protein structures and their molecular
docking with drug candidate structures. Progresses toward a grid-enabled
chemogenomic knowledge space are discussed.Comment: 43 pages, 4 figures, to appear in Malaria Journa
Vesivirus 2117 capsids more closely resemble sapovirus and lagovirus particles than other known vesivirus structures
Vesivirus 2117 is an adventitious agent that in 2009, was identified as a contaminant of CHO cells propagated in bioreactors at a pharmaceutical manufacturing plant belonging to Genzyme. The consequent interruption in supply of Fabrazyme and Cerezyme (drugs used to treat Fabry and Gaucher disease respectively), caused significant economic losses. Vesivirus 2117 is a member of the Caliciviridae; a family of small icosahedral viruses encoding a positive sense RNA genome. We have used cryo-electron microscopy and three dimensional image reconstruction to calculate a structure of vesivirus 2117 virus like particles as well as feline calicivirus and a chimeric sapovirus. We present a structural comparison of several members of the Caliciviridae, showing that the distal P domain of vesivirus 2117 is morphologically distinct from that seen in other known vesivirus structures. Furthermore, at intermediate resolutions we found a high level of structural similarity between vesivirus 2117 and Caliciviridae from other genera, such as sapovirus and rabbit haemorrhagic disease virus. Phylogenetic analysis confirms vesivirus 2117 as a vesivirus closely related to canine vesiviruses. We postulate that morphological differences in virion structure seen between vesivirus clades may reflect differences in receptor usage
Computational Identification of Four Spliceosomal snRNAs from the Deep-Branching Eukaryote Giardia intestinalis
Funding: Marsden Fund New Zealand Allan Wilson Centre The funders had no role in study design, data collection and analysis, decision to publish, or
preparation of the manuscript.RNAs processing other RNAs is very general in eukaryotes, but is not clear to what extent it is ancestral to eukaryotes. Here
we focus on pre-mRNA splicing, one of the most important RNA-processing mechanisms in eukaryotes. In most eukaryotes
splicing is predominantly catalysed by the major spliceosome complex, which consists of five uridine-rich small nuclear
RNAs (U-snRNAs) and over 200 proteins in humans. Three major spliceosomal introns have been found experimentally in
Giardia; one Giardia U-snRNA (U5) and a number of spliceosomal proteins have also been identified. However, because of
the low sequence similarity between the Giardia ncRNAs and those of other eukaryotes, the other U-snRNAs of Giardia had
not been found. Using two computational methods, candidates for Giardia U1, U2, U4 and U6 snRNAs were identified in this
study and shown by RT-PCR to be expressed. We found that identifying a U2 candidate helped identify U6 and U4 based on
interactions between them. Secondary structural modelling of the Giardia U-snRNA candidates revealed typical features of
eukaryotic U-snRNAs. We demonstrate a successful approach to combine computational and experimental methods to
identify expected ncRNAs in a highly divergent protist genome. Our findings reinforce the conclusion that spliceosomal
small-nuclear RNAs existed in the last common ancestor of eukaryotes
The Carcinoembryonic Antigen Gene Family
The molecular cloning of carcinoembryonic antigen (CEA) and several cross-reacting antigens reveals a basic domain structure for the whole family, which shows structural similarities to the immunoglobulin superfamily. The CEA family consists of approximately 10 genes which are localized in two clusters on chromosome 19. So far, mRNA species for five of these genes have been identified which show tissue variability in their transcriptional activity. Expression of some of these genes in heterologous systems has been achieved, allowing the localization of some epitopes. The characterization of a CEA gene family in the rat and a comparison with its human counterpart has been utilized in the development of an evolutionary model
Path Similarity Analysis: a Method for Quantifying Macromolecular Pathways
Diverse classes of proteins function through large-scale conformational
changes; sophisticated enhanced sampling methods have been proposed to generate
these macromolecular transition paths. As such paths are curves in a
high-dimensional space, they have been difficult to compare quantitatively, a
prerequisite to, for instance, assess the quality of different sampling
algorithms. The Path Similarity Analysis (PSA) approach alleviates these
difficulties by utilizing the full information in 3N-dimensional trajectories
in configuration space. PSA employs the Hausdorff or Fr\'echet path
metrics---adopted from computational geometry---enabling us to quantify path
(dis)similarity, while the new concept of a Hausdorff-pair map permits the
extraction of atomic-scale determinants responsible for path differences.
Combined with clustering techniques, PSA facilitates the comparison of many
paths, including collections of transition ensembles. We use the closed-to-open
transition of the enzyme adenylate kinase (AdK)---a commonly used testbed for
the assessment enhanced sampling algorithms---to examine multiple microsecond
equilibrium molecular dynamics (MD) transitions of AdK in its substrate-free
form alongside transition ensembles from the MD-based dynamic importance
sampling (DIMS-MD) and targeted MD (TMD) methods, and a geometrical targeting
algorithm (FRODA). A Hausdorff pairs analysis of these ensembles revealed, for
instance, that differences in DIMS-MD and FRODA paths were mediated by a set of
conserved salt bridges whose charge-charge interactions are fully modeled in
DIMS-MD but not in FRODA. We also demonstrate how existing trajectory analysis
methods relying on pre-defined collective variables, such as native contacts or
geometric quantities, can be used synergistically with PSA, as well as the
application of PSA to more complex systems such as membrane transporter
proteins.Comment: 9 figures, 3 tables in the main manuscript; supplementary information
includes 7 texts (S1 Text - S7 Text) and 11 figures (S1 Fig - S11 Fig) (also
available from journal site
An automatic method for assessing structural importance of amino acid positions
Background: A great deal is known about the qualitative aspects of the sequence-structure relationship, for example that buried residues are usually more conserved between structurally similar homologues, but no attempts have been made to quantitate the relationship between evolutionary conservation at a sequence position and change to global tertiary structure. In this paper we demonstrate that the Spearman correlation between sequence and structural change is suitable for this purpose.
Results:
Buried residues, bends, cysteines, prolines and leucines were significantly more likely to occupy positions highly correlated with structural change than expected by chance. Some buried residues were found to be less informative than expected, particularly residues involved in active sites and the binding of small molecules.
Conclusion:
The correlation-based method generates predictions of structural importance for superfamily positions which agree well with previous results of manual analyses, and may be of use in automated residue annotation piplines. A PERL script which implements the method is provided
- …