19 research outputs found
The accuracy of several multiple sequence alignment programs for proteins
BACKGROUND: There have been many algorithms and software programs implemented for the inference of multiple sequence alignments of protein and DNA sequences. The "true" alignment is usually unknown due to the incomplete knowledge of the evolutionary history of the sequences, making it difficult to gauge the relative accuracy of the programs. RESULTS: We tested nine of the most often used protein alignment programs and compared their results using sequences generated with the simulation software Simprot which creates known alignments under realistic and controlled evolutionary scenarios. We have simulated more than 30000 alignment sets using various evolutionary histories in order to define strengths and weaknesses of each program tested. We found that alignment accuracy is extremely dependent on the number of insertions and deletions in the sequences, and that indel size has a weaker effect. We also considered benchmark alignments from the latest version of BAliBASE and the results relative to BAliBASE- and Simprot-generated data sets were consistent in most cases. CONCLUSION: Our results indicate that employing Simprot's simulated sequences allows the creation of a more flexible and broader range of alignment classes than the usual methods for alignment accuracy assessment. Simprot also allows for a quick and efficient analysis of a wider range of possible evolutionary histories that might not be present in currently available alignment sets. Among the nine programs tested, the iterative approach available in Mafft (L-INS-i) and ProbCons were consistently the most accurate, with Mafft being the faster of the two
Low-voltage characteristic voltage based fault distance estimation method of distribution network
The traditional medium-voltage distribution network fault location method uses mainly the voltage and current measurements of the medium-voltage side, which results in problems such as high installation costs at the measuring points and complicated postoperation and maintenance work. Therefore, a fault location idea based on the distributed measurement of low-voltage side voltage is proposed in this paper. First, the characteristic voltage is adaptively selected according to the fault type. Second, the suspected fault section is determined by comparing the characteristic voltage amplitude of each measuring point. Third, the fault section is located using the section unit characteristic voltage drop defined for each suspected fault section. Finally, fault distance estimation is achieved based on the voltage difference matrix and characteristic voltage analysis. This method achieves accurate fault distance identification based on the distribution difference of the characteristic voltage of the low-voltage side under the fault state. This work provides a new economical and practical idea for determining the fault locations of distribution networks. The effectiveness of this method is evaluated by considering a 10Â kV distribution network in Guangdong Province built in PSCAD/EMTDC
Contribution of copy number variants to schizophrenia from a genome-wide study of 41,321 subjects
Copy number variants (CNVs) have been strongly implicated in the genetic etiology of schizophrenia (SCZ). However, genome-wide investigation of the contribution of CNV to risk has been hampered by limited sample sizes. We sought to address this obstacle by applying a centralized analysis pipeline to a SCZ cohort of 21,094 cases and 20,227 controls. A global enrichment of CNV burden was observed in cases (OR=1.11, P=5.7×10−15), which persisted after excluding loci implicated in previous studies (OR=1.07, P=1.7 ×10−6). CNV burden was enriched for genes associated with synaptic function (OR = 1.68, P = 2.8 ×10−11) and neurobehavioral phenotypes in mouse (OR = 1.18, P= 7.3 ×10−5). Genome-wide significant evidence was obtained for eight loci, including 1q21.1, 2p16.3 (NRXN1), 3q29, 7q11.2, 15q13.3, distal 16p11.2, proximal 16p11.2 and 22q11.2. Suggestive support was found for eight additional candidate susceptibility and protective loci, which consisted predominantly of CNVs mediated by non-allelic homologous recombination
Schizophrenia-associated somatic copy-number variants from 12,834 cases reveal recurrent NRXN1 and ABCB11 disruptions
While germline copy-number variants (CNVs) contribute to schizophrenia (SCZ) risk, the contribution of somatic CNVs (sCNVs)—present in some but not all cells—remains unknown. We identified sCNVs using blood-derived genotype arrays from 12,834 SCZ cases and 11,648 controls, filtering sCNVs at loci recurrently mutated in clonal blood disorders. Likely early-developmental sCNVs were more common in cases (0.91%) than controls (0.51%, p = 2.68e−4), with recurrent somatic deletions of exons 1–5 of the NRXN1 gene in five SCZ cases. Hi-C maps revealed ectopic, allele-specific loops forming between a potential cryptic promoter and non-coding cis-regulatory elements upon 5′ deletions in NRXN1. We also observed recurrent intragenic deletions of ABCB11, encoding a transporter implicated in anti-psychotic response, in five treatment-resistant SCZ cases and showed that ABCB11 is specifically enriched in neurons forming mesocortical and mesolimbic dopaminergic projections. Our results indicate potential roles of sCNVs in SCZ risk
Copy number variation in fetal alcohol spectrum disorder
Fetal alcohol spectrum disorder (FASD) is characterized by a combination of neurological, developmental, and congenital defects that may occur as a consequence of prenatal alcohol exposure. Earlier reports showed that large chromosomal anomalies may link to FASD. Here, we examined the prevalence and types of copy number variations (CNVs) in FASD cases previously diagnosed by a multidisciplinary FASD team in sites across Canada. We genotyped 95 children with FASD and 87 age-matched, typically developing controls on the Illumina Human Omni2.5 SNP array platform. We compared their CNVs to those of 10,851 population controls, in order to identify rare CNVs (The accepted manuscript in pdf format is listed with the files at the bottom of this page. The presentation of the authors' names and (or) special characters in the title of the manuscript may differ slightly between what is listed on this page and what is listed in the pdf file of the accepted manuscript; that in the pdf file of the accepted manuscript is what was submitted by the author
Contribution of copy number variants to schizophrenia from a genome-wide study of 41,321 subjects
Copy number variants (CNVs) have been strongly implicated in the genetic etiology of schizophrenia (SCZ). However, genome-wide investigation of the contribution of CNV to risk has been hampered by limited sample sizes. We sought to address this obstacle by applying a centralized analysis pipeline to a SCZ cohort of 21,094 cases and 20,227 controls. A global enrichment of CNV burden was observed in cases (odds ratio (OR) = 1.11, P = 5.7 x 10(-15)), which persisted after excluding loci implicated in previous studies (OR = 1.07, P = 1.7 x 10(-6)). CNV burden was enriched for genes associated with synaptic function (OR = 1.68, P = 2.8 x 10(-11)) and neurobehavioral phenotypes in mouse (OR = 1.18, P = 7.3 x 10(-5)). Genome-wide significant evidence was obtained for eight loci, including 1q21.1, 2p16.3 (NRXN1), 3q29, 7q11.2, 15q13.3, distal 16p11.2, proximal 16p11.2 and 22q11.2. Suggestive support was found for eight additional candidate susceptibility and protective loci, which consisted predominantly of CNVs mediated by nonallelic homologous recombination
Contribution of copy number variants to schizophrenia from a genome-wide study of 41,321 subjects
Copy number variants (CNVs) have been strongly implicated in the genetic etiology of schizophrenia (SCZ). However, genome-wide investigation of the contribution of CNV to risk has been hampered by limited sample sizes. We sought to address this obstacle by applying a centralized analysis pipeline to a SCZ cohort of 21,094 cases and 20,227 controls. A global enrichment of CNV burden was observed in cases (odds ratio (OR) = 1.11, P = 5.7 x 10(-15)), which persisted after excluding loci implicated in previous studies (OR = 1.07, P = 1.7 x 10(-6)). CNV burden was enriched for genes associated with synaptic function (OR = 1.68, P = 2.8 x 10(-11)) and neurobehavioral phenotypes in mouse (OR = 1.18, P = 7.3 x 10(-5)). Genome-wide significant evidence was obtained for eight loci, including 1q21.1, 2p16.3 (NRXN1), 3q29, 7q11.2, 15q13.3, distal 16p11.2, proximal 16p11.2 and 22q11.2. Suggestive support was found for eight additional candidate susceptibility and protective loci, which consisted predominantly of CNVs mediated by nonallelic homologous recombination