148 research outputs found
Steel corrosion in reinforced alkali-activated materials
The development of alkali-activated materials (AAMs) as an alternative to Portland cement (PC) has seen significant progress in the past decades. However, there still remains significant uncertainty regarding their long term performance when used in steel-reinforced structures. The durability of AAMs in such applications depends strongly on the corrosion behaviour of the embedded steel reinforcement, and the experimental data in the literature are limited and in some cases inconsistent. This letter elucidates the role of the chemistry of AAMs on the mechanisms governing passivation and chloride-induced corrosion of the steel reinforcement, to bring a better understanding of the durability of AAM structures exposed to chloride. The corrosion of the steel reinforcement in AAMs differs significantly from observations in PC; the onset of pitting (or the chloride ‘threshold’ value) depends strongly on the alkalinity, and the redox environment, of these binders. Classifications or standards used to assess the severity of steel corrosion in PC appear not to be directly applicable to AAMs due to important differences in pore solution chemistry and phase assemblage
Identification of disease-causing genes using microarray data mining and gene ontology
Background: One of the best and most accurate methods for identifying disease-causing genes is monitoring gene expression values in different samples using microarray technology. One of the shortcomings of microarray data is that they provide a small quantity of samples with respect to the number of genes. This problem reduces the classification accuracy of the methods, so gene selection is essential to improve the predictive accuracy and to identify potential marker genes for a disease. Among numerous existing methods for gene selection, support vector machine-based recursive feature elimination (SVMRFE) has become one of the leading methods, but its performance can be reduced because of the small sample size, noisy data and the fact that the method does not remove redundant genes.
Methods: We propose a novel framework for gene selection which uses the advantageous features of conventional methods and addresses their weaknesses. In fact, we have combined the Fisher method and SVMRFE to utilize the advantages of a filtering method as well as an embedded method. Furthermore, we have added a redundancy reduction stage to address the weakness of the Fisher method and SVMRFE. In addition to gene expression values, the proposed method uses Gene Ontology which is a reliable source of information on genes. The use of Gene Ontology can compensate, in part, for the limitations of microarrays, such as having a small number of samples and erroneous measurement results.
Results: The proposed method has been applied to colon, Diffuse Large B-Cell Lymphoma (DLBCL) and prostate cancer datasets. The empirical results show that our method has improved classification performance in terms of accuracy, sensitivity and specificity. In addition, the study of the molecular function of selected genes strengthened the hypothesis that these genes are involved in the process of cancer growth.
Conclusions: The proposed method addresses the weakness of conventional methods by adding a redundancy reduction stage and utilizing Gene Ontology information. It predicts marker genes for colon, DLBCL and prostate cancer with a high accuracy. The predictions made in this study can serve as a list of candidates for subsequent wet-lab verification and might help in the search for a cure for cancers
Machine learning on normalized protein sequences
<p>Abstract</p> <p>Background</p> <p>Machine learning techniques have been widely applied to biological sequences, e.g. to predict drug resistance in HIV-1 from sequences of drug target proteins and protein functional classes. As deletions and insertions are frequent in biological sequences, a major limitation of current methods is the inability to handle varying sequence lengths.</p> <p>Findings</p> <p>We propose to normalize sequences to uniform length. To this end, we tested one linear and four different non-linear interpolation methods for the normalization of sequence lengths of 19 classification datasets. Classification tasks included prediction of HIV-1 drug resistance from drug target sequences and sequence-based prediction of protein function. We applied random forests to the classification of sequences into "positive" and "negative" samples. Statistical tests showed that the linear interpolation outperforms the non-linear interpolation methods in most of the analyzed datasets, while in a few cases non-linear methods had a small but significant advantage. Compared to other published methods, our prediction scheme leads to an improvement in prediction accuracy by up to 14%.</p> <p>Conclusions</p> <p>We found that machine learning on sequences normalized by simple linear interpolation gave better or at least competitive results compared to state-of-the-art procedures, and thus, is a promising alternative to existing methods, especially for protein sequences of variable length.</p
GÊNEROS DISCURSIVOS E ENSINO: UMA PROPOSTA DE APLICAÇÃO EM SALA DE AULA
Os gêneros discursivos são formas de agir e interagir discursivamente e são inerentes à comunicação humana. Neste artigo, nos propomos, a partir de um percurso teórico, discutir sobre o conceito de gênero discursivo com base nas reflexões de Bakhtin (2000) e Marcuschi (2003, 2005), considerando sua aplicabilidade no ensino como condição para assegurar à construção de conhecimentos fundamentais para as práticas sociais de linguagem. Para isso, refletimos sobre o gênero discursivo como atividade sociocomunicativa de interação social, produzido para as necessidades de comunicação, constituído de componentes sociais, históricos, culturais e cognitivos. Além disso, analisamos a sequência didática na perspectiva de Dolz e Schneuwly (2004) como possibilidade de auxiliar o ensino através dos gêneros. Entendemos ser essencial, por essa razão, que as aulas de língua portuguesa centrem-se, nos diferentes níveis de ensino, nas dinâmicas sociais de interação por meio dos gêneros discursivos.
 
Prediction of Protein Domain with mRMR Feature Selection and Analysis
The domains are the structural and functional units of proteins. With the avalanche of protein sequences generated in the postgenomic age, it is highly desired to develop effective methods for predicting the protein domains according to the sequences information alone, so as to facilitate the structure prediction of proteins and speed up their functional annotation. However, although many efforts have been made in this regard, prediction of protein domains from the sequence information still remains a challenging and elusive problem. Here, a new method was developed by combing the techniques of RF (random forest), mRMR (maximum relevance minimum redundancy), and IFS (incremental feature selection), as well as by incorporating the features of physicochemical and biochemical properties, sequence conservation, residual disorder, secondary structure, and solvent accessibility. The overall success rate achieved by the new method on an independent dataset was around 73%, which was about 28–40% higher than those by the existing method on the same benchmark dataset. Furthermore, it was revealed by an in-depth analysis that the features of evolution, codon diversity, electrostatic charge, and disorder played more important roles than the others in predicting protein domains, quite consistent with experimental observations. It is anticipated that the new method may become a high-throughput tool in annotating protein domains, or may, at the very least, play a complementary role to the existing domain prediction methods, and that the findings about the key features with high impacts to the domain prediction might provide useful insights or clues for further experimental investigations in this area. Finally, it has not escaped our notice that the current approach can also be utilized to study protein signal peptides, B-cell epitopes, HIV protease cleavage sites, among many other important topics in protein science and biomedicine
Global patterns in endemicity and vulnerability of soil fungi
Fungi are highly diverse organisms, which provide multiple ecosystem services. However, compared with charismatic animals and plants, the distribution patterns and conservation needs of fungi have been little explored. Here, we examined endemicity patterns, global change vulnerability and conservation priority areas for functional groups of soil fungi based on six global surveys using a high-resolution, long-read metabarcoding approach. We found that the endemicity of all fungi and most functional groups peaks in tropical habitats, including Amazonia, Yucatan, West-Central Africa, Sri Lanka, and New Caledonia, with a negligible island effect compared with plants and animals. We also found that fungi are predominantly vulnerable to drought, heat and land-cover change, particularly in dry tropical regions with high human population density. Fungal conservation areas of highest priority include herbaceous wetlands, tropical forests, and woodlands. We stress that more attention should be focused on the conservation of fungi, especially root symbiotic arbuscular mycorrhizal and ectomycorrhizal fungi in tropical regions as well as unicellular early-diverging groups and macrofungi in general. Given the low overlap between the endemicity of fungi and macroorganisms, but high conservation needs in both groups, detailed analyses on distribution and conservation requirements are warranted for other microorganisms and soil organisms
Modeling of Ti-W Solidification Microstructures Under Additive Manufacturing Conditions
Additive manufacturing (AM) processes have many benefits for the fabrication of alloy parts, including the potential for greater microstructural control and targeted properties than traditional metallurgy processes. To accelerate utilization of this process to produce such parts, an effective computational modeling approach to identify the relationships between material and process parameters, microstructure, and part properties is essential. Development of such a model requires accounting for the many factors in play during this process, including laser absorption, material addition and melting, fluid flow, various modes of heat transport, and solidification. In this paper, we start with a more modest goal, to create a multiscale model for a specific AM process, Laser Engineered Net Shaping (LENS™), which couples a continuum-level description of a simplified beam melting problem (coupling heat absorption, heat transport, and fluid flow) with a Lattice Boltzmann-cellular automata (LB-CA) microscale model of combined fluid flow, solute transport, and solidification. We apply this model to a binary Ti-5.5 wt pct W alloy and compare calculated quantities, such as dendrite arm spacing, with experimental results reported in a companion paper
- …