21,970 research outputs found

    2kenize: Tying Subword Sequences for Chinese Script Conversion

    Full text link
    Simplified Chinese to Traditional Chinese character conversion is a common preprocessing step in Chinese NLP. Despite this, current approaches have poor performance because they do not take into account that a simplified Chinese character can correspond to multiple traditional characters. Here, we propose a model that can disambiguate between mappings and convert between the two scripts. The model is based on subword segmentation, two language models, as well as a method for mapping between subword sequences. We further construct benchmark datasets for topic classification and script conversion. Our proposed method outperforms previous Chinese Character conversion approaches by 6 points in accuracy. These results are further confirmed in a downstream application, where 2kenize is used to convert pretraining dataset for topic classification. An error analysis reveals that our method's particular strengths are in dealing with code-mixing and named entities.Comment: Accepted to ACL 202

    Long-lived non-thermal states realized by atom losses in one-dimensional quasi-condensates

    Get PDF
    We investigate the cooling produced by a loss process non selective in energy on a one-dimensional (1D) Bose gas with repulsive contact interactions in the quasi-condensate regime. By performing nonlinear classical field calculations for a homogeneous system, we show that the gas reaches a non-thermal state where different modes have acquired different temperatures. After losses have been turned off, this state is robust with respect to the nonlinear dynamics, described by the Gross-Pitaevskii equation. We argue that the integrability of the Gross-Pitaevskii equation is linked to the existence of such long-lived non-thermal states, and illustrate this by showing that such states are not supported within a non-integrable model of two coupled 1D gases of different masses. We go beyond a classical field analysis, taking into account the quantum noise introduced by the discreteness of losses, and show that the non-thermal state is still produced and its non-thermal character is even enhanced. Finally, we extend the discussion to gases trapped in a harmonic potential and present experimental observations of a long-lived non-thermal state within a trapped 1D quasi-condensate following an atom loss process

    Spatial structure and composition of polysaccharide-protein complexes from Small Angle Neutron Scattering

    Get PDF
    We use Small Angle Neutron Scattering (SANS), with an original analysis method, to obtain both the characteristic sizes and the inner composition of lysozyme-pectin complexes depending on the charge density. Lysozyme is a globular protein and pectin a natural anionic semiflexible polysaccharide with a degree of methylation (DM) 0, 43 and 74. For our experimental conditions (buffer ionic strength I = 2.5 10-2 mol/L and pH between 3 and 7), the electrostatic charge of lysozyme is always positive (from 8 to 17 depending on pH). The pectin charge per elementary chain segment is negative and can be varied from almost zero to one through the change of DM and pH. The weight molar ratio of lysozyme on pectin monomers is kept constant. The ratio of negative charge content per volume to positive charge content per volume, -/+, is varied between 10 and 0.007. On a local scale, for all charged pectins, a correlation peak appears at 0.2 {\AA}-1 due to proteins clustering inside the complexes. On a large scale, the complexes appear as formed of spherical globules with a well defined radius of 10 to 50 nm, containing a few thousands proteins. The volume fraction Phi of organic matter within the globules derived from SANS absolute cross-sections is around 0.1. The protein stacking, which occurs inside the globules, is enhanced when pectin is more charged, due to pH or DM. The linear charge density of the pectin determines the size of the globules for pectin chains of comparable molecular weights whether it is controlled by the pH or the DM. The radius of the globules varies between 10 nm and 50 nm. In conclusion the structure is driven by electrostatic interactions and not by hydrophobic interactions. The molecular weight also has a large influence on the structure of the complexes since long chains tend to form larger globules. This maybe one reason why DM and pH are not completely equivalent in our system since DM 0 has a short mass, but this may not be the only one. For very low pectin charge (-/+ = 0.07), globules do not appear and the scattering signals a gel-like structure. We did not observe any beads-on-a-string structure

    Foaming properties of protein/pectin electrostatic complexes and foam structure at the nanoscale

    Get PDF
    The foaming properties, foaming capacity and foam stability, of soluble complexes of pectin and a globular protein, napin, have been investigated with a "Foamscan" apparatus. Complementary, we also used SANS with a recent method consisting in an analogy between the SANS by foams and the neutron reflectivity of films to measure in situ film thickness of foams. The effect of ionic strength, of protein concentration and of charge density of the pectin has been analysed. Whereas the foam stability is improved for samples containing soluble complexes, no effect has been noticed on the foam film thickness, which is almost around 315 {\AA} whatever the samples. These results let us specify the role of each specie in the mixture: free proteins contribute to the foaming capacity, provided the initial free protein content in the bulk is sufficient to allow the foam formation, and soluble complexes slow down the drainage by their presence in the Plateau borders, which finally results in the stabilisation of foams

    Losing sight of atmospheric sounds in televised nature documentary

    Get PDF
    The production of soundtracks for televised nature documentaries involves complexities in balancing the audience's sonic perceptions and emotions with audio content and scientific rigour. In addition, soundtracks need to be congruent with audience expectations and commercial imperatives. Popular televised nature documentaries often appear to be narrative melodramas with environmental soundscapes submerged by narration and music. This paper examines the correlations between perceptual agency, educational practices and production constraints with regards to sound production in nature documentaries. The purpose is a clarification surrounding the causative factors and results of the curious neglect for the sound of our natural world within an industry dedicated to the sensory portrayal of nature

    Population Synthesis of Radio and Gamma-ray Pulsars in the Fermi Era

    Full text link
    We present results of our pulsar population synthesis of normal pulsars from the Galactic disk using our previously developed computer code. On the same footing, we use slot gap and outer gap models for gamma-ray emission from normal pulsars to obtain statistics of radio-loud and radio-quiet gamma-ray pulsars. From recently improved understanding of HII and star forming regions in the Galaxy, we develop a new surface density model of the birth location of neutron stars. We explore models of neutron star evolution with magnetic field-decay, and with different initial period and magnetic field distributions. We present preliminary results including simulated population statistics that are compared with recent detections by Fermi of normal, isolated pulsars.Comment: 4 pages, 3 figures, to appear in the proceedings of the Pulsar 2010 Conference, Italy, 10 - 15 October 201

    An Analytical Model of a Thermally Excited Microcantilever Vibrating Laterally in a Viscous Fluid

    Get PDF
    To achieve higher quality factors (Q) for microcantilevers used in liquid-phase sensing applications, recent studies have explored the use of the lateral (in-plane) flexural mode. In particular, we have recently shown that this mode may be excited electrothermally using integrated heating resistors near the micro cantilever support, and that the resulting increase in Q helps to make low-ppb limits of detection a possibility in liquids. However, because the use of electrothermally excited, liquid-phase, microcantilever-based sensors in lateral flexure is relatively new, theoretical models are lacking. Therefore, we present here a new analytical model for predicting the vibratory response of these devices. The model is also used to successfully confirm the validity of our previously derived Q formula, which was based on a single-degree-of-freedom (SDOF) model and a harmonic tip force. Comparisons with experimental data show that the present model and, thus, the analytical formula provide excellent Q estimates for sufficiently thin beams vibrating laterally in water and reasonable upper-bound estimates for thicker beams
    corecore