457 research outputs found
Recommended from our members
Acquiring verb classes through bottom-up semantic verb clustering
In this paper, we present the first analysis of bottom-up manual semantic clustering of verbs in three languages, English, Polish and Croatian. Verb classes including syntactic and semantic information have been shown to support many NLP tasks by allowing abstraction from individual words and thereby alleviating data sparseness. The availability of such classifications is however still non-existent or limited in most languages. While a range of automatic verb classification approaches have been proposed, high-quality resources and gold standards are needed for evaluation and to improve the performance of NLP systems. We investigate whether semantic verb classes in three different languages can be reliably obtained from native speakers without linguistics training. The analysis of inter-annotator agreement shows an encouraging degree of overlap in the classifications produced for each language individually, as well as across all three languages. Comparative examination of the resultant classifications provides interesting insights into cross-linguistic semantic commonalities and patterns of ambiguity
Immobilization of Yeast on Polymeric Supports
Biocatalysts (enzymes and whole cells) play a crucial role in industrial processes allowing for efficient production of many important compounds, but their use has been limited because of the considerably unstable nature of enzymes. Immobilization often
protects enzymes from environmental stresses such as pH, temperature, salts, solvents, inhibitors and poisons. Immobilization of cells containing specific enzymes has further advantages such as elimination of long and expensive procedures for enzymes separation
and purification and it is vital to expand their application by enabling easy separation and purification of products from reaction mixtures and efficient recovery of catalyst. This review focuses on organic polymers (natural and synthetic) used as matrices for immobilization
of microorganisms, mainly baker’s yeasts and potential application of immobilized cells in the chemical, pharmaceutical, biomedical and food industries
Semantic data set construction from human clustering and spatial arrangement
Abstract
Research into representation learning models of lexical semantics usually utilizes some form of intrinsic evaluation to ensure that the learned representations reflect human semantic judgments. Lexical semantic similarity estimation is a widely used evaluation method, but efforts have typically focused on pairwise judgments of words in isolation, or are limited to specific contexts and lexical stimuli. There are limitations with these approaches that either do not provide any context for judgments, and thereby ignore ambiguity, or provide very specific sentential contexts that cannot then be used to generate a larger lexical resource. Furthermore, similarity between more than two items is not considered. We provide a full description and analysis of our recently proposed methodology for large-scale data set construction that produces a semantic classification of a large sample of verbs in the first phase, as well as multi-way similarity judgments made within the resultant semantic classes in the second phase. The methodology uses a spatial multi-arrangement approach proposed in the field of cognitive neuroscience for capturing multi-way similarity judgments of visual stimuli. We have adapted this method to handle polysemous linguistic stimuli and much larger samples than previous work. We specifically target verbs, but the method can equally be applied to other parts of speech. We perform cluster analysis on the data from the first phase and demonstrate how this might be useful in the construction of a comprehensive verb resource. We also analyze the semantic information captured by the second phase and discuss the potential of the spatially induced similarity judgments to better reflect human notions of word similarity. We demonstrate how the resultant data set can be used for fine-grained analyses and evaluation of representation learning models on the intrinsic tasks of semantic clustering and semantic similarity. In particular, we find that stronger static word embedding methods still outperform lexical representations emerging from more recent pre-training methods, both on word-level similarity and clustering. Moreover, thanks to the data set’s vast coverage, we are able to compare the benefits of specializing vector representations for a particular type of external knowledge by evaluating FrameNet- and VerbNet-retrofitted models on specific semantic domains such as “Heat” or “Motion.”</jats:p
A terahertz vibrational molecular clock with systematic uncertainty at the level
Neutral quantum absorbers in optical lattices have emerged as a leading
platform for achieving clocks with exquisite spectroscopic resolution. However,
the studies of these clocks and their systematic shifts have so far been
limited to atoms. Here, we extend this architecture to an ensemble of diatomic
molecules and experimentally realize an accurate lattice clock based on pure
molecular vibration. We evaluate the leading systematics, including the
characterization of nonlinear trap-induced light shifts, achieving a total
systematic uncertainty of . The absolute frequency of the
vibrational splitting is measured to be 31 825 183 207 592.8(5.1) Hz, enabling
the dissociation energy of our molecule to be determined with record accuracy.
Our results represent an important milestone in molecular spectroscopy and
THz-frequency standards, and may be generalized to other neutral molecular
species with applications for fundamental physics, including tests of molecular
quantum electrodynamics and the search for new interactions.Comment: 17 pages, 8 figure
Folding mechanisms steer the amyloid fibril formation propensity of highly homologous proteins
Significant advances in the understanding of the molecular determinants of fibrillogenesis can be expected from comparative studies of the aggregation propensities of proteins with highly homologous structures but different folding pathways. Here, we fully characterize, by means of stopped-flow, T-jump, CD and DSC experiments, the unfolding mechanisms of three highly homologous proteins, zinc binding Ros87 and Ml153-149 and zinc-lacking Ml452-151. The results indicate that the three proteins significantly differ in terms of stability and (un)folding mechanisms. Particularly, Ros87 and Ml153-149 appear to be much more stable to guanidine denaturation and are characterized by folding mechanisms including the presence of an intermediate. On the other hand, metal lacking Ml452-151 folds according to a classic two-state model. Successively, we have monitored the capabilities of Ros87, Ml452-151 and Ml153-149 to form amyloid fibrils under native conditions. Particularly, we show, by CD, fluorescence, DLS, TEM and SEM experiments, that after 168 hours, amyloid formation of Ros87 has started, while Ml153-149 has formed only amorphous aggregates and Ml452-151 is still monomeric in solution. This study shows how metal binding can influence protein folding pathways and thereby control conformational accessibility to aggregation-prone states, which in turn changes aggregation kinetics, shedding light on the role of metal ions in the development of protein deposition diseases
- …