308 research outputs found

    Linguistic feature analysis for protein interaction extraction

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The rapid growth of the amount of publicly available reports on biomedical experimental results has recently caused a boost of text mining approaches for protein interaction extraction. Most approaches rely implicitly or explicitly on linguistic, i.e., lexical and syntactic, data extracted from text. However, only few attempts have been made to evaluate the contribution of the different feature types. In this work, we contribute to this evaluation by studying the relative importance of deep syntactic features, i.e., grammatical relations, shallow syntactic features (part-of-speech information) and lexical features. For this purpose, we use a recently proposed approach that uses support vector machines with structured kernels.</p> <p>Results</p> <p>Our results reveal that the contribution of the different feature types varies for the different data sets on which the experiments were conducted. The smaller the training corpus compared to the test data, the more important the role of grammatical relations becomes. Moreover, deep syntactic information based classifiers prove to be more robust on heterogeneous texts where no or only limited common vocabulary is shared.</p> <p>Conclusion</p> <p>Our findings suggest that grammatical relations play an important role in the interaction extraction task. Moreover, the net advantage of adding lexical and shallow syntactic features is small related to the number of added features. This implies that efficient classifiers can be built by using only a small fraction of the features that are typically being used in recent approaches.</p

    Breeding histories and selection criteria for oilseed rape in Europe and China identified by genome wide pedigree dissection

    Get PDF
    Selection breeding has played a key role in the improvement of seed yield and quality in oilseed rape (Brassica napus L.). We genotyped Tapidor (European), Ningyou7 (Chinese) and their progenitors with the Brassica 60 K Illumina Infinium SNP array and mapped a total of 29,347 SNP markers onto the reference genome of Darmor-bzh. Identity by descent (IBD) refers to a haplotype segment of a chromosome inherited from a shared common ancestor. IBDs identified on the C subgenome were larger than those on the A subgenome within both the Tapidor and Ningyou7 pedigrees. IBD number and length were greater in the Ningyou7 pedigree than in the Tapidor pedigree. Seventy nine QTLs for flowering time, seed quality and root morphology traits were identified in the IBDs of Tapidor and Ningyou7. Many more candidate genes had been selected within the Ningyou7 pedigree than within the Tapidor pedigree. These results highlight differences in the transfer of favorable gene clusters controlling key traits during selection breeding in Europe and China

    The Persistency of the India-Pakistan Conflict: Chances and Obstacles of the Bilateral Composite Dialogue

    Full text link
    This article investigates the underlying causes for the persistency of the India–Pakistan conflict and, on this basis, the chances and obstacles of the bilateral composite dialogue initiated in 2004. In particular, it wants to provide a theoretically grounded account of the factors that facilitated and constrained the bilateral composite dialogue process. Drawing on the regional security complex theory, this article examines the rivalry between the two South Asian nuclear powers on four levels of analysis: the domestic, the regional, the interregional and the global level. The analysis shows that there have been some substantial changes on all four levels in the recent decade or so and that these changes have provided more beneficial conditions for a peace process. These changes include, inter alia, India’s new regional policy, the consequences of the 9/11 terrorist attacks for the region and India’s growing power capacities. However, major obstacles to the India–Pakistan dialogue and a permanent conflict resolution continue to persist: the dominant role of the military in Pakistan, conflicting national identities and the still partially contested nature of statehood in India and Pakistan, which is in the case of Pakistan linked to the growing power of Islamic fundamentalists

    Efficient quantitative assessment of facial paralysis using iris segmentation and active contour-based key points detection with hybrid classifier

    Full text link
    BACKGROUND: Facial palsy or paralysis (FP) is a symptom that loses voluntary muscles movement in one side of the human face, which could be very devastating in the part of the patients. Traditional methods are solely dependent to clinician’s judgment and therefore time consuming and subjective in nature. Hence, a quantitative assessment system becomes apparently invaluable for physicians to begin the rehabilitation process; and to produce a reliable and robust method is challenging and still underway. METHODS: We introduce a novel approach for a quantitative assessment of facial paralysis that tackles classification problem for FP type and degree of severity. Specifically, a novel method of quantitative assessment is presented: an algorithm that extracts the human iris and detects facial landmarks; and a hybrid approach combining the rule-based and machine learning algorithm to analyze and prognosticate facial paralysis using the captured images. A method combining the optimized Daugman’s algorithm and Localized Active Contour (LAC) model is proposed to efficiently extract the iris and facial landmark or key points. To improve the performance of LAC, appropriate parameters of initial evolving curve for facial features’ segmentation are automatically selected. The symmetry score is measured by the ratio between features extracted from the two sides of the face. Hybrid classifiers (i.e. rule-based with regularized logistic regression) were employed for discriminating healthy and unhealthy subjects, FP type classification, and for facial paralysis grading based on House-Brackmann (H-B) scale. RESULTS: Quantitative analysis was performed to evaluate the performance of the proposed approach. Experiments show that the proposed method demonstrates its efficiency. CONCLUSIONS: Facial movement feature extraction on facial images based on iris segmentation and LAC-based key point detection along with a hybrid classifier provides a more efficient way of addressing classification problem on facial palsy type and degree of severity. Combining iris segmentation and key point-based method has several merits that are essential for our real application. Aside from the facial key points, iris segmentation provides significant contribution as it describes the changes of the iris exposure while performing some facial expressions. It reveals the significant difference between the healthy side and the severe palsy side when raising eyebrows with both eyes directed upward, and can model the typical changes in the iris region

    Reconstruction of major maternal and paternal lineages of the Cape Muslim population

    Get PDF
    The earliest Cape Muslims were brought to the Cape (Cape Town - South Africa) from Africa and Asia from 1652 to 1834. They were part of an involuntary migration of slaves, political prisoners and convicts, and they contributed to the ethnic diversity of the present Cape Muslim population of South Africa. The history of the Cape Muslims has been well documented and researched however no in-depth genetic studies have been undertaken. The aim of the present study was to determine the respective African, Asian and European contributions to the mtDNA (maternal) and Y-chromosomal (paternal) gene pool of the Cape Muslim population, by analyzing DNA samples of 100 unrelated Muslim males born in the Cape Metropolitan area. A panel of six mtDNA and eight Y-chromosome SNP markers were screened using polymerase chain reaction-restriction fragment length polymorphisms (PCR-RFLP). Overall admixture estimates for the maternal line indicated Asian (0.4168) and African mtDNA (0.4005) as the main contributors. The admixture estimates for the paternal line, however, showed a predominance of the Asian contribution (0.7852). The findings are in accordance with historical data on the origins of the early Cape Muslims.Web of Scienc

    Standardization of the NEO-PI-3 in the Greek general population

    Get PDF
    BACKGROUND: The revised NEO Personality Inventory (NEO-PI-3) includes 240 items corresponding to the Big Five personality traits (Extraversion, Agreeableness, Conscientiousness, Neuroticism, and Openness to Experience) and subordinate dimensions (facets). It is suitable for use with adolescents and adults (12 years or older). The aim of the current study was to validate the Greek translation of the NEO-PI-3 in the general Greek population. MATERIAL AND METHODS: The study sample included 734 subjects from the general Greek population of whom 59.4% were females and 40.6% males aged 40.80 +/- 11.48. The NEO-PI-3 was translated into Greek and back-translated into English, and the accuracy of the translation was confirmed and established. The statistical analysis included descriptive statistics, confirmatory factorial analysis (CFA), the calculation of Cronbach's alpha, and the calculation of Pearson product-moment correlations. Sociodemographics groups were compared by ANOVA. RESULTS: Most facets had Cronbach's alpha above 0.60. Confirmatory factor analysis showed acceptable loading of the facets on their own hypothesized factors and very good estimations of Cronbach's alphas for the hypothesized factors, so it was partially supportive of the five-factor structure of the NEO-PI-3.The factors extracted with Procrustes rotation analysis can be considered reasonably homologous to the factors of the American normative sample. Correlations between dimensions were as expected and similar to those reported in the literature. DISCUSSION: The literature suggests that overall, the psychometric properties of NEO-PI-3 scales have been found to generalize across ages, cultures, and methods of measurement. In accord with this, the results of the current study confirm the reliability of the Greek translation and adaptation of the NEO-PI-3. The inventory has comparable psychometric properties in its Greek version in comparison to the original and other national translations, and it is suitable for clinical as well as research use

    Physical properties of naked DNA influence nucleosome positioning and correlate with transcription start and termination sites in yeast

    Get PDF
    Abstract Background In eukaryotic organisms, DNA is packaged into chromatin structure, where most of DNA is wrapped into nucleosomes. DNA compaction and nucleosome positioning have clear functional implications, since they modulate the accessibility of genomic regions to regulatory proteins. Despite the intensive research effort focused in this area, the rules defining nucleosome positioning and the location of DNA regulatory regions still remain elusive. Results Naked (histone-free) and nucleosomal DNA from yeast were digested by microccocal nuclease (MNase) and sequenced genome-wide. MNase cutting preferences were determined for both naked and nucleosomal DNAs. Integration of their sequencing profiles with DNA conformational descriptors derived from atomistic molecular dynamic simulations enabled us to extract the physical properties of DNA on a genomic scale and to correlate them with chromatin structure and gene regulation. The local structure of DNA around regulatory regions was found to be unusually flexible and to display a unique pattern of nucleosome positioning. Ab initio physical descriptors derived from molecular dynamics were used to develop a computational method that accurately predicts nucleosome enriched and depleted regions. Conclusions Our experimental and computational analyses jointly demonstrate a clear correlation between sequence-dependent physical properties of naked DNA and regulatory signals in the chromatin structure. These results demonstrate that nucleosome positioning around TSS (Transcription Start Site) and TTS (Transcription Termination Site) (at least in yeast) is strongly dependent on DNA physical properties, which can define a basal regulatory mechanism of gene expression

    Using a limited mapping strategy to identify major QTLs for resistance to grapevine powdery mildew (Erysiphe necator) and their use in marker-assisted breeding

    Get PDF
    A limited genetic mapping strategy based on simple sequence repeat (SSR) marker data was used with five grape populations segregating for powdery mildew (Erysiphe necator) resistance in an effort to develop genetic markers from multiple sources and enable the pyramiding of resistance loci. Three populations derived their resistance from Muscadinia rotundifolia ‘Magnolia’. The first population (06708) had 97 progeny and was screened with 137 SSR markers from seven chromosomes (4, 7, 9, 12, 13, 15, and 18) that have been reported to be associated with powdery or downy mildew resistance. A genetic map was constructed using the pseudo-testcross strategy and QTL analysis was carried out. Only markers from chromosome 13 and 18 were mapped in the second (04327) and third (06712) populations, which had 47 and 80 progeny, respectively. Significant QTLs for powdery mildew resistance with overlapping genomic regions were identified for different tissue types (leaf, stem, rachis, and berry) on chromosome 18, which distinguishes the resistance in ‘Magnolia’ from that present in other accessions of M. rotundifolia and controlled by the Run1 gene on chromosome 12. The ‘Magnolia’ resistance locus was termed as Run2.1. Powdery mildew resistance was also mapped in a fourth population (08391), which had 255 progeny and resistance from M. rotundifolia ‘Trayshed’. A locus accounting for 50% of the phenotypic variation mapped to chromosome 18 and was named Run2.2. This locus overlapped the region found in the ‘Magnolia’-based populations, but the allele sizes of the flanking markers were different. ‘Trayshed’ and ‘Magnolia’ shared at least one allele for 68% of the tested markers, but alleles of the other 32% of the markers were not shared indicating that the two M. rotundifolia selections were very different. The last population, 08306 with 42 progeny, derived its resistance from a selection Vitis romanetii C166-043. Genetic mapping discovered a major powdery mildew resistance locus termed Ren4 on chromosome 18, which explained 70% of the phenotypic variation in the same region of chromosome 18 found in the two M. rotundifolia resistant accessions. The mapping results indicate that powdery mildew resistance genes from different backgrounds reside on chromosome 18, and that genetic markers can be used as a powerful tool to pyramid these loci and other powdery mildew resistance loci into a single line
    corecore