57 research outputs found
Punctuation effects in English and Esperanto texts
A statistical physics study of punctuation effects on sentence lengths is
presented for written texts: {\it Alice in wonderland} and {\it Through a
looking glass}. The translation of the first text into esperanto is also
considered as a test for the role of punctuation in defining a style, and for
contrasting natural and artificial, but written, languages. Several log-log
plots of the sentence length-rank relationship are presented for the major
punctuation marks. Different power laws are observed with characteristic
exponents. The exponent can take a value much less than unity ( 0.50 or
0.30) depending on how a sentence is defined. The texts are also mapped into
time series based on the word frequencies. The quantitative differences between
the original and translated texts are very minutes, at the exponent level. It
is argued that sentences seem to be more reliable than word distributions in
discussing an author style.Comment: 13 pages, 7 figures (3x2+1), 60 reference
Equilibrium (Zipf) and Dynamic (Grasseberg-Procaccia) method based analyses of human texts. A comparison of natural (english) and artificial (esperanto) languages
A comparison of two english texts from Lewis Carroll, one (Alice in
wonderland), also translated into esperanto, the other (Through a looking
glass) are discussed in order to observe whether natural and artificial
languages significantly differ from each other. One dimensional time series
like signals are constructed using only word frequencies (FTS) or word lengths
(LTS). The data is studied through (i) a Zipf method for sorting out
correlations in the FTS and (ii) a Grassberger-Procaccia (GP) technique based
method for finding correlations in LTS. Features are compared : different power
laws are observed with characteristic exponents for the ranking properties, and
the {\it phase space attractor dimensionality}. The Zipf exponent can take
values much less than unity ( 0.50 or 0.30) depending on how a sentence is
defined. This non-universality is conjectured to be a measure of the author
. Moreover the attractor dimension is a simple function of the so
called phase space dimension , i.e., , with . Such an exponent should also conjecture to be a measure of the author
. However, even though there are quantitative differences between
the original english text and its esperanto translation, the qualitative
differences are very minutes, indicating in this case a translation relatively
well respecting, along our analysis lines, the content of the author writing.Comment: 22 pages, 87 references, 5 tables, 8 figure
Verb Morphology of Modern Greek: a Descriptive Analysis.
Ph.D.LinguisticsUniversity of Michiganhttp://deepblue.lib.umich.edu/bitstream/2027.42/156749/1/6101755.pd
Michael Kenstowicz and Charles Kisseberth, Generative phonology: description and theory. New York: Academic Press, 1979. Pp. xiv+459.
Two papers on language translation by computers
http://deepblue.lib.umich.edu/bitstream/2027.42/6059/5/bac5848.0001.001.pdfhttp://deepblue.lib.umich.edu/bitstream/2027.42/6059/4/bac5848.0001.001.tx
- …