11,275 research outputs found
CSCD-IME: Correcting Spelling Errors Generated by Pinyin IME
Chinese Spelling Correction (CSC) is a task to detect and correct spelling
mistakes in texts. In fact, most of Chinese input is based on pinyin input
method, so the study of spelling errors in this process is more practical and
valuable. However, there is still no research dedicated to this essential
scenario. In this paper, we first present a Chinese Spelling Correction Dataset
for errors generated by pinyin IME (CSCD-IME), including 40,000 annotated
sentences from real posts of official media on Sina Weibo. Furthermore, we
propose a novel method to automatically construct large-scale and high-quality
pseudo data by simulating the input through pinyin IME. A series of analyses
and experiments on CSCD-IME show that spelling errors produced by pinyin IME
hold a particular distribution at pinyin level and semantic level and are
challenging enough. Meanwhile, our proposed pseudo-data construction method can
better fit this error distribution and improve the performance of CSC systems.
Finally, we provide a useful guide to using pseudo data, including the data
scale, the data source, and the training strategy
3-Hydroxy-2-[(4-hydroxy-3,5-dimethoxyphenyl)(2-hydroxy-4,4-dimethyl-6-oxocyclohex-1-en-1-yl)methyl]-5,5-dimethylcyclohex-2-en-1-one
In the title compound, C25H32O7, the 3-hydroxy-5,5-dimethylcyclohex-2-enone rings adopt slightly distorted envelope conformations with the two planes at the base of the envelope forming dihedral angles of 57.6 (4) and 53.9 (9)° with the benzene ring. There is an intramolecular hydroxy–ketone O—H⋯O interaction between the two substituted cyclohexane rings as well as a short intramolecular phenol–methoxy O—H⋯O interaction
Recommended from our members
Quadruple bonding between iron and boron in the BFe(CO)3- complex.
While main group elements have four valence orbitals accessible for bonding, quadruple bonding to main group elements is extremely rare. Here we report that main group element boron is able to form quadruple bonding interactions with iron in the BFe(CO)3- anion complex, which has been revealed by quantum chemical investigation and identified by mass-selected infrared photodissociation spectroscopy in the gas phase. The complex is characterized to have a B-Fe(CO)3- structure of C3v symmetry and features a B-Fe bond distance that is much shorter than that expected for a triple bond. Various chemical bonding analyses indicate that the complex involves unprecedented B≣Fe quadruple bonding interactions. Besides the common one electron-sharing σ bond and two Fe→B dative π bonds, there is an additional weak B→Fe dative σ bonding interaction. This finding of the new quadruple bonding indicates that there might exist a wide range of boron-metal complexes that contain such high multiplicity of chemical bonds
Discrete Element Simulation of the Relationship between Composition, ITZ Property, and Tensile Behavior of Eco-Friendly UHPC Matrix
To combat global warming, the development of eco-friendly ultra-high performance concrete (UHPC) has become one of the current research hotspots. Understanding the relationship between composition and performance of eco-friendly UHPC from a meso-mechanical point will be of great significance in proposing a more scientific and effective mix design theory. In this paper, the 3D discrete element model (DEM) of an eco-friendly UHPC matrix was constructed. The mechanism of the effect of the interface transition zone (ITZ) properties on the tensile behavior of an eco-friendly UHPC matrix was studied. The relationship between composition, ITZ property, and tensile behavior of eco-friendly UHPC matrix was analyzed. The results show that ITZ strength influences the tensile strength and cracking behavior of eco-friendly UHPC matrix. The effect of ITZ on the tensile properties of eco-friendly UHPC matrix is more significant than that of normal concrete. The tensile strength of UHPC will be increased by 48% when the ITZ property is changed from normal condition to perfect. Improving the reactivity of the binder system of UHPC will improve the performance of ITZ. The cement content in UHPC was reduced from 80% to 35%, and the σITZ/σPaste was reduced from 0.7 to 0.32. Both nanomaterials and chemical activators can promote the hydration reaction of the binder material, which in turn leads to better ITZ strength and tensile properties for an eco-friendly UHPC matrix
Computational Simulations to Predict Creatine Kinase-Associated Factors: Protein-Protein Interaction Studies of Brain and Muscle Types of Creatine Kinases
Creatine kinase (CK; EC 2.7.3.2) is related to several skin diseases such as psoriasis and dermatomyositis. CK is important in skin energy homeostasis because it catalyzes the reversible transfer of a phosphoryl group from MgATP to creatine. In this study, we predicted CK binding proteins via the use of bioinformatic tools such as protein-protein interaction (PPI) mappings and suggest the putative hub proteins for CK interactions. We obtained 123 proteins for brain type CK and 85 proteins for muscle type CK in the interaction networks. Among them, several hub proteins such as NFKB1, FHL2, MYOC, and ASB9 were predicted. Determination of the binding factors of CK can further promote our understanding of the roles of CK in physiological conditions
- …