4,001 research outputs found
Recommended from our members
Networks Underpinning Symbiosis Revealed Through Cross-Species eQTL Mapping.
Organisms engage in extensive cross-species molecular dialog, yet the underlying molecular actors are known for only a few interactions. Many techniques have been designed to uncover genes involved in signaling between organisms. Typically, these focus on only one of the partners. We developed an expression quantitative trait locus (eQTL) mapping-based approach to identify cause-and-effect relationships between genes from two partners engaged in an interspecific interaction. We demonstrated the approach by assaying expression of 98 isogenic plants (Medicago truncatula), each inoculated with a genetically distinct line of the diploid parasitic nematode Meloidogyne hapla With this design, systematic differences in gene expression across host plants could be mapped to genetic polymorphisms of their infecting parasites. The effects of parasite genotypes on plant gene expression were often substantial, with up to 90-fold (P = 3.2 Γ 10-52) changes in expression levels caused by individual parasite loci. Mapped loci included a number of pleiotropic sites, including one 87-kb parasite locus that modulated expression of >60 host genes. The 213 host genes identified were substantially enriched for transcription factors. We distilled higher-order connections between polymorphisms and genes from both species via network inference. To replicate our results and test whether effects were conserved across a broader host range, we performed a confirmatory experiment using M. hapla-infected tomato. This revealed that homologous genes were similarly affected. Finally, to validate the broader utility of cross-species eQTL mapping, we applied the strategy to data from a Salmonella infection study, successfully identifying polymorphisms in the human genome affecting bacterial expression
Species-level functional profiling of metagenomes and metatranscriptomes.
Functional profiles of microbial communities are typically generated using comprehensive metagenomic or metatranscriptomic sequence read searches, which are time-consuming, prone to spurious mapping, and often limited to community-level quantification. We developed HUMAnN2, a tiered search strategy that enables fast, accurate, and species-resolved functional profiling of host-associated and environmental communities. HUMAnN2 identifies a community's known species, aligns reads to their pangenomes, performs translated search on unclassified reads, and finally quantifies gene families and pathways. Relative to pure translated search, HUMAnN2 is faster and produces more accurate gene family profiles. We applied HUMAnN2 to study clinal variation in marine metabolism, ecological contribution patterns among human microbiome pathways, variation in species' genomic versus transcriptional contributions, and strain profiling. Further, we introduce 'contributional diversity' to explain patterns of ecological assembly across different microbial community types
Composite structural motifs of binding sites for delineating biological functions of proteins
Most biological processes are described as a series of interactions between
proteins and other molecules, and interactions are in turn described in terms
of atomic structures. To annotate protein functions as sets of interaction
states at atomic resolution, and thereby to better understand the relation
between protein interactions and biological functions, we conducted exhaustive
all-against-all atomic structure comparisons of all known binding sites for
ligands including small molecules, proteins and nucleic acids, and identified
recurring elementary motifs. By integrating the elementary motifs associated
with each subunit, we defined composite motifs which represent
context-dependent combinations of elementary motifs. It is demonstrated that
function similarity can be better inferred from composite motif similarity
compared to the similarity of protein sequences or of individual binding sites.
By integrating the composite motifs associated with each protein function, we
define meta-composite motifs each of which is regarded as a time-independent
diagrammatic representation of a biological process. It is shown that
meta-composite motifs provide richer annotations of biological processes than
sequence clusters. The present results serve as a basis for bridging atomic
structures to higher-order biological phenomena by classification and
integration of binding site structures.Comment: 34 pages, 7 figure
Fine-Scale Haplotype Structure Reveals Strong Signatures of Positive Selection in a Recombining Bacterial Pathogen
Identifying genetic variation in bacteria that has been shaped by ecological differences remains an important challenge. For recombining bacteria, the sign and strength of linkage provide a unique lens into ongoing selection. We show that derived allelesPeer reviewe
μ μ 체 λΉκ΅λΆμμ ν΅ν ν¬μ λ₯ κ°μΌμ± λ°μ΄λ¬μ€μ μ§νμ λν ν΅μ°°
νμλ
Όλ¬Έ(μμ¬)--μμΈλνκ΅ λνμ :λμ
μλͺ
κ³Όνλν λμλͺ
곡νλΆ,2019. 8. κΉν¬λ°.κ°μΌμ± λ°μ΄λ¬μ€λ μΈκ°μ λΉλ‘―ν λ§μ μ’
μ λλ¬Όμ κ°μΌμμΌ λμ΄ν¬ μ μλ κ²°κ³Όλ₯Ό μ΄λνκΈ°λ ν©λλ€. μλ§μ μ¬λμ μ£½μμ μ΄λ₯΄κ² νλ κ²μ λ¬Όλ‘ , 맀 ν΄λ§λ€ λκ·λͺ¨ κ°μΆ κ°μΌμ¬λ‘λ‘ μΈνμ¬ μΆμ°μ
μ 컀λ€λ κ²½μ μ νΌν΄λ₯Ό λΌμΉκ³ μμ΅λλ€. κ·Έλ κΈ° λλ¬Έμ κ°μΌμ± λ°μ΄λ¬μ€μ λν μΆ©λΆν μ°κ΅¬κ° νμν©λλ€. λ°μ΄λ¬μ€λ λ€λ₯Έ λ―Έμλ¬Όμ΄λ μλͺ
체μ λΉνμ¬ μ μ μ λ³νμ΄ λ³΄λ€ λΉ λ₯΄κ³ 무μμλ‘ μ΄λ£¨μ΄μ§λ νΉμ§μ΄ μμ΅λλ€. λλΆλΆμ λ°μ΄λ¬μ€λ μμ£Όμ μ’
μ λ°λΌ κ°μΌ μ¬λΆκ° λ¬λΌμ§μ§λ§, λ΄ν΄λ μ€νμ΄λμ μλ―Έλ
Έμ° μμ΄ νλμ λ³νμΌλ‘λ μλ‘μ΄ μ’
μ μμ£Όλ₯Ό κ°μΌμν€κ±°λ κ·Έ λ
μ±μ΄ λ¬λΌμ§κΈ°λ νκΈ° λλ¬Έμ κ·Έλ€μ μ μ 체 μ°¨μμμμ νΉμ§μ λ°κ²¬νκ³ λΆμνλ κ²μ μμ
μ λ° κ³Όνμ μ£Όμν κ°μΉλ₯Ό μ 곡ν©λλ€. μ΄λ¬ν μ μ 체 νΉμ§ μ€μμ λ¨μΌ μ μ μ λ³μ΄μ²΄(Single Nucleotide and Amino acid variant)λ λ§μ μ°κ΅¬μμ μ°κ΅¬ λμμΌλ‘ μ¬μ©λκ³ μμ΅λλ€. μ€μ μ μΌλ‘ λ°μ΄λ¬μ€ μ°κ΅¬μμ λ°μ΄λ¬μ€μ μ’
μ λμ νκ±°λ λ°±μ κ°λ° λ± λ€μν λΆμΌμ μ¬μ©λκ³ μμ΅λλ€.
μ±ν° 2μ§μΉ΄λ°μ΄λ¬μ€λ μΌλ°μ μΈ μ±μΈμ΄ κ°μΌλμμ μμλ μ§μΉ΄μ΄, λν΅ λ° κ΄μ ν΅ λ±μ μ¦μμ μ λ°νμ§λ§ μμ°λΆκ° κ°μΌλμμ μμλ νμμ μλμ¦μ μΌμΌν€λ κ²κ³Ό μ°κ΄μ΄ μλ€κ³ μλ €μ Έ μμ΅λλ€. μ§λ 10λ
κ° μ μΈκ³μ νλ°μ μΌλ‘ νΌμ Έ λκ°μΌλ©° λ§μ νμλ€μ΄ μ§μΉ΄λ°μ΄λ¬μ€μ λΆμ λ©μ»€λμ¦μ λν μ°κ΅¬λ₯Ό μννμ΅λλ€. κ·Έλ¬λ μΉλ£μ μλ°©μ μν μμ½ν λ° λ°±μ κ°λ°μ μμ§κΉμ§ μ§ν μ€μ΄λ©° λ³΄λ€ λ§μ μ μ 체 μμ€μμμ μ°κ΅¬κ° νμν©λλ€.
μ΄ μ°κ΅¬μμ 곡κ°λ°μ΄ν°λ² μ΄μ€λ‘λΆν° μ΄μ© κ°λ₯ν μ§μΉ΄λ°μ΄λ¬μ€μ NGS μ μ 체 λ°μ΄ν°λ₯Ό μμ§νκ³ λΆμμ ν΅νμ¬ μ§λ¦¬μ , μκΈ°μ κ΄μ μ κ³ λ €ν μ§μ νΉμ΄μ μ μ 체 λ³μ΄(Single Nucleotide and Amino Acid variants)λ₯Ό μ μ μ λ§μ»€λ‘μ¨ μ μνμμ΅λλ€. μ§νμ μ°κ΄λΆμκ³Ό μμ¨νμ΅ k-means ν΄λ¬μ€ν°λ§ μκ³ λ¦¬μ¦μ μ΄μ©νμ¬ 4κ°μ λνκ·Έλ£Ήμ μ μ νμμ΅λλ€. λν 4κ·Έλ£Ήμ μ΄μ μ λ§μΆμ΄ ν΅κ³μ μΌλ‘ μ μλ―Έν μ μ 체 λ³μ΄λ€μ μ°Ύμλ΄κ³ dN/dS μ§ν λΆμμΌλ‘ μ§νμ μΌλ‘ κ°μνλ λ¨λ°±μ§ μνΈν μμμ νμΈνμ΅λλ€. μ΄ν κ·Έλ£Ή κΈ°λ₯μ± λ¨λ°±μ§ μμκ³Ό B-cell, T-cell νΉμ΄μ νμκ²°μ κΈ° ν보λ₯Ό μμΈ‘νμ¬ μ°ΎμλΈ μ μ 체 λ³μ΄λ€μ΄ λ¨λ°±μ§ λ° νμκ²°μ κΈ° νμ±μ κ²°μ μ μΈ μν μ νμΈνμ¬ κ·Έλ£Ήλ³ μ£Όμ μ μ μ λ§μ»€λ‘μ¨ μ μνμμ΅λλ€.
μ±ν° 3μΈν루μμμ μλ‘μ΄ νμ
μΌλ‘ λΆλ₯λ μΈν루μμ D λ°μ΄λ¬μ€λ μλ₯Ό λΉλ‘―ν λ°μΆλλ¬Όμ κ°μΌμν€λ νΈν‘κΈ°μ± λ°μ΄λ¬μ€μ
λλ€. κ°μΌ μ¦μμ κ²½λ―Ένμ§λ§ λ€λ₯Έ μΉλͺ
μ μΈ νΈν‘κΈ°μ± λ°μ΄λ¬μ€ κ°μΌμ μ λ°νκ³ μΈκ°μκ²λ κ°μΌλ μ μλ μ μ¬μ±μ΄ μκΈ° λλ¬Έμ μ μ 체 μ°¨μμμμ μ°κ΅¬λ₯Ό μννμμ΅λλ€. μΈν루μμ D λ°μ΄λ¬μ€μ λͺ¨λ μ μ μ λ¨νΈ NGSλ°μ΄ν°λ₯Ό μ΄μ©ν μ μ 체 νΉμ± λ° μ§νμ μκ΄κ΄κ³ λΆμμΌλ‘ νλμ μ μ μ λ¨νΈμ ν΅ν λΆμμ κ²°κ³Όμμ μ°¨μ΄μ μ λ°νλμ΅λλ€. κ·Έ κ²°κ³Όλ₯Ό ν λλ‘ μ μ ν λν κ·Έλ£Ήμ μ΄μ μΌλ‘, ν΅κ³μ μΌλ‘ μ μλ―Έν νΉμ΄μ μ μ 체 λ³μ΄λ₯Ό μ°Ύμλμ΅λλ€. μ΄ν dN/dS μ§ν λΆμκ³Ό λ¨λ°±μ§ μ½λ©μμ, B-cell νΉμ΄μ νμκ²°μ κΈ° μμΈ‘ λΆμ κ²°κ³Όμ λΉκ΅νμ¬ κ·Έλ£Ή νΉμ΄μ μ μ μ λ§μ»€λ‘μ¨ μ μνμμ΅λλ€.
μ΄ μ°κ΅¬λ₯Ό ν΅νμ¬ κ°μΌμ± λ°μ΄λ¬μ€μ κ·Έλ£Ήλ³ νΉμ΄μ μ μ μ λ§μ»€λ₯Ό μ μνκ³ μ΄ λ§μ»€κ° μλ‘μ΄ λ°μ΄λ¬μ€ μ’
μ λμ κ³Ό λ³λ
μ± μ§νμ λν ν΅μ°°, κ·Έλ¦¬κ³ λ°±μ κ°λ°μ λμμ μ€ μ μμ κ²μ
λλ€.Infectious viruses infect many species of animal, including human, and cause irreversible consequence. They bring fetal death to human and cause massive economic losses to livestock industry due to the large-scale infection. Therefore, we need more research on infectious viruses. Viruses have faster and random genetic variable features than other organisms. Most viruses are susceptible to infection depending on the host species. However, since a single nucleotide and amino acid sequence variation leads infection to a new species or alter its toxicity, genomic level of virus research provides major commercial and scientific value. Therefore, many researchers focus on the single genetic variation for identification of a new virus species or vaccine study.
Chapter 1Zika virus (ZIKV) is known to be associated with a serious brain disease, fetal microcephaly in pregnant women, and has been explosively spread throughout the world over the last decade. Virologists of most countries attempted investigations of ZIKV molecular mechanisms to prevent the worldwide proliferation. However, only few genetic variants in several regions were anticipated as targets of vaccines and medicines. Here, I analyzed all of available ZIKV complete genomes from the Virus Pathogen Resource (ViPR) database to identify novel genetic markers by considering geographical and temporal perspectives. By principal component and phylogenetic analysis, ZIKV strains formed four clusters according to collected continent. Focusing on the major groups in African, Asian, Central America and Caribbean, I found single nucleotide variants (SNVs) supported by statistical significance. From the dN/dS analysis, I identified the protein coding regions that were evolutionary accelerated in each group. Out of the intercontinental SNVs, non-synonymous and synonymous variants on functional protein domains and predicted B-cell and T-cell epitopes were suggested as regional markers. I believe these local genetic markers can improve medical strategies for ZIKV prevention, diagnosis, and treatment.
Chapter 2Influenza D virus (IDV), a new type of influenza, is a respiratory virus that infects ruminants, including cattle. Because the infection symptoms of IDV are mild, but, causes fatal infection of other respiratory viruses and have potential for infection in human, I conducted researches at the genomic level. Using the results of phylogeny and principal coordinate analysis (PCoA), we compared concatenated all of coding sequence dataset and each of genes coding sequence dataset. I confirmed that concatenated dataset results were more appropriately clustered into four groups with isolated region, and I selected the main three groups. Focusing on the main three groups, I found statistically significant genetic markers in comparison with dN/dS analysis, searching protein coding region, and B-cell epitope prediction analysis.
Through this study, I suggest local-specific genetic markers of infectious virus, and these markers will give a deep insight for further studies.ABSTRACT IV
CONTENTS VII
LIST OF TABLES VIII
LIST OF FIGURES IX
CHAPTER 1. LITERATURE REVIEW 1
CHAPTER 2. IDENTIFICATION OF LOCAL-SPECIFIC GENETIC MARKERS OF ZIKA VIRUS ACROSS THE ENTIRE GLOBE 7
2.1 ABSTRACT 8
2.2 INTRODUCTION 9
2.3 MATERIALS AND METHODS 12
2.4 RESULTS 18
2.5 DISCUSSION 26
CHAPTER 3. LOCAL GENETIC MARKERS CLUSTERED BY CODING SEQUENCES OF INFLUENZA D VIRUS 56
3.1 ABSTRACT 57
3.2 INTRODUCTION 59
3.3 MATERIALS AND METHODS 61
3.4 RESULTS 66
3.5 DISCUSSION 72
REFERENCES 93
μμ½(κ΅λ¬Έμ΄λ‘) 100Maste
A bioinformatics toolkit: in silico tools and online resources for investigating genetic variation
With the advent of large-scale next-generation sequencing initiatives, there is an increasing importance to interpret and understand the potential phenotypic influence of identified genetic variation and its significance in the human genome. Bioinformatics analyses can provide useful information to assist with variant interpretation. This review provides an overview of tools/resources currently available, and how they can help predict the impact of genetic variation at the deoxyribonucleic acid, ribonucleic acid, and protein level
Understanding The Intra And Inter-Cellular Interaction Complexities And Flexibilities Using Systems And Sequence Analysis Approach
The present thesis work has been undertaken to gain an understanding of intra-cellular
or inter-cellular interactions between bio-molecular entities utilizing either a systems
analysis based perspective or different sequence analysis approaches. During this study
different principles likely to be prevalent among intra-cellular and inter-cellular
interactions have been studied with the help of computational approaches. Broadly, the
complexities in intra-cellular interactions have been studied by determining the effect
of perturbations such as over-expression or down-regulation of a key regulator on the
intra-cellular interaction network architecture or its components. In particular, network
analysis of regulatory network proteins in association with the intra-cellular proteinprotein
interaction network, led to a key observation that topologically important
effector proteins in the regulatory network could be important signaling proteins.
Identification of such important effector proteins essential for the regulatory network
integrity of a key regulator may be performed by network analysis. It is likely that
alterations in these important effector proteins may lead to disruptions in cellular
physiology and as such in this manner probable disease associated entities can be
determined. Alternately, the flexibility among protein-protein interactions has been
studied by analyzing homologous sequence families of interacting proteins with the
help of information theory based measures like mutual information and Bhattacharyya
co-efficient. Since interacting proteins may co-evolve, co-variation may allow the
preservation of a functional interaction between co-evolving proteins and interdependent
residue pair alterations may occur as a result of evolutionary pressure.
Analysis of molecular co-evolution in inter-cellular protein interaction complexes
determined that co-evolutionary pairings may be present among interface and noninterface
residue pairs and such positions are likely to be crucial for a functional
interaction between these sets of proteins. Therefore, utilising information contained in
biological sequences, co-evolutionary pairings involving structurally or functionally
crucial residue positions in disease associated inter-cellular protein-protein interaction
complexes were predicted. Thus, different computational approaches have been utilised
to study a particular hypothesis in a disease scenario in order to delineate certain
themes prevalent in intra-cellular or inter-cellular interactions among bio-molecular
entities while predicting disease associated entities or studying interaction patterns
among them
- β¦