58,940 research outputs found

    Automatic Translating Between Ancient Chinese and Contemporary Chinese with Limited Aligned Corpora

    Full text link
    The Chinese language has evolved a lot during the long-term development. Therefore, native speakers now have trouble in reading sentences written in ancient Chinese. In this paper, we propose to build an end-to-end neural model to automatically translate between ancient and contemporary Chinese. However, the existing ancient-contemporary Chinese parallel corpora are not aligned at the sentence level and sentence-aligned corpora are limited, which makes it difficult to train the model. To build the sentence level parallel training data for the model, we propose an unsupervised algorithm that constructs sentence-aligned ancient-contemporary pairs by using the fact that the aligned sentence pair shares many of the tokens. Based on the aligned corpus, we propose an end-to-end neural model with copying mechanism and local attention to translate between ancient and contemporary Chinese. Experiments show that the proposed unsupervised algorithm achieves 99.4% F1 score for sentence alignment, and the translation model achieves 26.95 BLEU from ancient to contemporary, and 36.34 BLEU from contemporary to ancient.Comment: Acceptted by NLPCC 201

    Modeling Spacing Distribution of Queuing Vehicles in Front of a Signalized Junction Using Random-Matrix Theory

    Full text link
    Modeling of headway/spacing between two consecutive vehicles has many applications in traffic flow theory and transport practice. Most known approaches only study the vehicles running on freeways. In this paper, we propose a model to explain the spacing distribution of queuing vehicles in front of a signalized junction based on random-matrix theory. We show that the recently measured spacing distribution data well fit the spacing distribution of a Gaussian symplectic ensemble (GSE). These results are also compared with the spacing distribution observed for car parking problem. Why vehicle-stationary-queuing and vehicle-parking have different spacing distributions (GSE vs GUE) seems to lie in the difference of driving patterns
    • …
    corecore