2,835 research outputs found
Chinese unknown word identification as known word tagging
This paper presents a tagging approach to Chinese unknown word identification based on lexicalized hidden Markov models (LHMMs). In this work, Chinese unknown word identification is represented as a tagging task on a sequence of known words by introducing word-formation patterns and part-of-speech. Based on the lexicalized HMMs, a statistical tagger is further developed to assign each known word an appropriate tag that indicates its pattern in forming a word and the part-of-speech of the formed word. The experimental results on the Peking University corpus indicate that the use of lexicalization technique and the introduction of part-of-speech are helpful to unknown word identification. The experiment on the SIGHAN-PK open test data also shows that our system can achieve state-of-art performance.published_or_final_versio
Recommended from our members
Study on thermal conductivity of gas phase in nano-porous aerogel
This paper was presented at the 4th Micro and Nano Flows Conference (MNF2014), which was held at University College, London, UK. The conference was organised by Brunel University and supported by the Italian Union of Thermofluiddynamics, IPEM, the Process Intensification Network, the Institution of Mechanical Engineers, the Heat Transfer Society, HEXAG - the Heat Exchange Action Group, and the Energy Institute, ASME Press, LCN London Centre for Nanotechnology, UCL University College London, UCL Engineering, the International NanoScience Community, www.nanopaprika.eu.Nano-porous aerogel has an ultra low thermal conductivity and is usually used as the super
insulator. To evaluate the insulation performance of the aerogel, we focus on studying the thermal
conductivity of gas phase in the aerogel. We present a modified model to take into account the effect of nonuniform
pore-size distribution on the gaseous thermal conductivity, and the present model predicts more
agreement results with available data than the existing models. The gaseous thermal conductivity of the
aerogel at high temperature gradient condition is also numerically studied. We also study the effect of the
thermal transpiration flow on the gaseous thermal conductivity, and the results shows that the thermal
transpiration flow effect leads to a reduction of the gaseous thermal conductivity
SLC4A1 (solute carrier family 4, anion exchanger, member 1 (erythrocyte membrane protein band 3, Diego blood group))
Review on SLC4A1 (solute carrier family 4, anion exchanger, member 1 (erythrocyte membrane protein band 3, Diego blood group)), with data on DNA, on the protein encoded, and where the gene is implicated
Chinese text chunking using lexicalized HMMS
This paper presents a lexicalized HMM-based approach to Chinese text chunking. To tackle the problem of unknown words, we formalize Chinese text chunking as a tagging task on a sequence of known words. To do this, we employ the uniformly lexicalized HMMs and develop a lattice-based tagger to assign each known word a proper hybrid tag, which involves four types of information: word boundary, POS, chunk boundary and chunk type. In comparison with most previous approaches, our approach is able to integrate different features such as part-of-speech information, chunk-internal cues and contextual information for text chunking under the framework of HMMs. As a result, the performance of the system can be improved without losing its efficiency in training and tagging. Our preliminary experiments on the PolyU Shallow Treebank show that the use of lexicalization technique can substantially improve the performance of a HMM-based chunking system. © 2005 IEEE.published_or_final_versio
Crystal Structures of the structure-selective nuclease Mus81-Eme1 bound to flap DNA substrates
The Mus81-Eme1 complex is a structure-selective endonuclease with a critical role in the resolution of recombination intermediates during DNA repair after interstrand cross-links, replication fork collapse, or double-strand breaks. To explain the molecular basis of 3 ' flap substrate recognition and cleavage mechanism by Mus81-Eme1, we determined crystal structures of human Mus81-Eme1 bound to various flap DNA substrates. Mus81-Eme1 undergoes gross substrate-induced conformational changes that reveal two key features: (i) a hydrophobic wedge of Mus81 that separates pre- and post-nick duplex DNA and (ii) a 5 ' end binding pocket that hosts the 5 ' nicked end of post-nick DNA. These features are crucial for comprehensive protein-DNA interaction, sharp bending of the 3 ' flap DNA substrate, and incision strand placement at the active site. While Mus81-Eme1 unexpectedly shares several common features with members of the 5 ' flap nuclease family, the combined structural, biochemical, and biophysical analyses explain why Mus81-Eme1 preferentially cleaves 3 ' flap DNA substrates with 5 ' nicked ends.X11119Ysciescopu
Knocking down 10-formyltetrahydrofolate dehydrogenase increased oxidative stress and impeded zebrafish embryogenesis by obstructing morphogenetic movement
[[incitationindex]]SC
- …