2,976 research outputs found

    Inferring inflection classes with description length

    We discuss the notion of an inflection class system, a traditional ingredient of the description of inflection systems of nontrivial complexity. We distinguish systems of microclasses, which partition a set of lexemes into classes with identical behavior, and systems of macroclasses, which group lexemes that are similar enough into a few larger classes. On the basis of the intuition that macroclasses should contribute to a concise description of the system, we propose one algorithmic method for inferring macroclasses from raw inflectional paradigms, based on minimisation of the description length of the system under a given strategy of identifying morphological alternations in paradigms. We then exhibit classifications produced by our implementation on French and European Portuguese conjugation data and argue that they constitute an appropriate systematisation of traditional classifications. To arrive at such a convincing systematisation, it was crucial for us to use a local approach to inflection class similarity (based on pairwise comparisons of paradigm cells) rather than a global approach (based on the simultaneous comparison of all cells). We conclude that it is indeed possible to infer inflectional macroclasses objectively.

    A generalised alignment template formalism and its application to the inference of shallow-transfer machine translation rules from scarce bilingual corpora

    Statistical and rule-based methods are complementary approaches to machine translation (MT) that have different strengths and weaknesses. This complementarity has, over the last few years, resulted in the consolidation of a growing interest in hybrid systems that combine both data-driven and linguistic approaches. In this paper, we address the situation in which the amount of bilingual resources that is available for a particular language pair is not sufficiently large to train a competitive statistical MT system, but the cost and slow development cycles of rule-based MT systems cannot be afforded either. In this context, we formalise a new method that uses scarce parallel corpora to automatically infer a set of shallow-transfer rules to be integrated into a rule-based MT system, thus avoiding the need for human experts to handcraft these rules. Our work is based on the alignment template approach to phrase-based statistical MT, but the definition of the alignment template is extended to encompass different generalisation levels. It is also greatly inspired by the work of Sánchez-Martínez and Forcada (2009) in which alignment templates were also considered for shallow-transfer rule inference. However, our approach overcomes many relevant limitations of that work, principally those related to the inability to find the correct generalisation level for the alignment templates, and to select the subset of alignment templates that ensures an adequate segmentation of the input sentences by the rules eventually obtained. Unlike previous approaches in the literature, our formalism does not require linguistic knowledge about the languages involved in the translation. Moreover, it is the first time that conflicts between rules are resolved by choosing the most appropriate ones according to a global minimisation function rather than proceeding in a pairwise greedy fashion.
Experiments conducted using five different language pairs with the free/open-source rule-based MT platform Apertium show that translation quality significantly improves when compared to the method proposed by Sánchez-Martínez and Forcada (2009), and is close to that obtained using handcrafted rules. For some language pairs, our approach is even able to outperform them. Moreover, the resulting number of rules is considerably smaller, which eases human revision and maintenance. Research funded by Universitat d’Alacant through project GRE11-20, by the Spanish Ministry of Economy and Competitiveness through projects TIN2009-14009-C02-01 and TIN2012-32615, by Generalitat Valenciana through grant ACIF/2010/174, and by the European Union Seventh Framework Programme FP7/2007-2013 under grant agreement PIAP-GA-2012-324414 (Abu-MaTran).

    Time as It Could Be Measured in Artificial Living Systems

    Being able to measure time, whether directly or indirectly, is a significant advantage for an organism. It allows the organism to predict regular events and prepare for them on time. Thus, clocks are ubiquitous in biology. In the present paper, we consider the most minimal abstract pure clocks and investigate their characteristics with respect to their ability to measure time. Among other things, we find diametrically opposed clock characteristics, such as oscillatory behaviour for local time measurement or decay-based clocks measuring time periods in scales global to the problem. We also include cascades of independent clocks (“clock bags”) and composite clocks with controlled dependency; the latter show various regimes of markedly different dynamics.

    Ozone exchange within and above an irrigated Californian orchard

    In this study, the canopy effects on the vertical ozone exchange within and above a Californian orchard are investigated. We examined the comprehensive dataset obtained from the Canopy Horizontal Array Turbulence Study (CHATS). CHATS typifies a rural central Californian site, with O3 mixing ratios of less than 60 ppb and moderate NOx mixing ratios. The CHATS campaign covered a complete irrigation cycle, with our analysis including periods before and after irrigation. Lower O3 mixing ratios were found following irrigation, together with increased wind speeds, decreased air temperatures and increased specific humidity. Friction velocity, sensible heat and gas fluxes above the canopy were estimated using variations on the flux-gradient method, including a method which accounts for the roughness sublayer (RSL). These methods were compared to fluxes derived from observed eddy diffusivities of heat and friction velocity. We found that the use of the RSL parameterization, which accounts for the canopy-induced turbulent mixing above the canopy, resulted in stronger momentum, heat, and ozone exchange fluxes above this orchard, compared to the method which omits the RSL. This was quantified by the increased friction velocity, heat flux and ozone deposition flux of up to 12, 29, and 35% at 2.5 m above the canopy, respectively. Within the canopy, vertical fluxes, as derived from local gradients and eddy diffusivity of heat, were compared to fluxes calculated using the Lagrangian inverse theory. Both methods showed the presence of vertical flux divergence of friction velocity, heat and ozone, suggesting that turbulent mixing was inefficient in homogenizing the effects driven by local sources and sinks on vertical exchange of those quantities. This weak mixing within the canopy was also corroborated by the eddy diffusivities of friction velocity and heat, which were calculated directly from the observations.
Finally, the influence of water stress on the O3 budget was examined by comparing the results before and after the irrigation. Although the analysis is limited to the local conditions, our in situ measurements indicated differences in the O3 mixing ratio before and after irrigation during CHATS. We attribute these O3 mixing ratio changes to enhanced biological emission of volatile organic compounds (VOCs), driven by water stress.

    String Inflation After Planck 2013

    We briefly summarize the impact of the recent Planck measurements for string inflationary models, and outline what might be expected to be learned in the near future from the expected improvement in sensitivity to the primordial tensor-to-scalar ratio. We comment on whether these models provide sufficient added value to compensate for their complexity, and ask how they fare in the face of the new constraints on non-gaussianity and dark radiation. We argue that as a group the predictions made before Planck agree well with what has been seen, and draw conclusions from this about what this is likely to mean as sensitivity to primordial gravitational waves improves.

    The effects of concordance-based electronic glosses on L2 vocabulary learning

    The present study investigates the effects of two different vocabulary learning conditions in digital reading environments equipped with electronic textual glossing. The first condition presents the concordance lines of a target lexical item, thereby making learners infer its meaning by reading the referenced sentences. The second condition additionally offers the definition of a target lexical item after learners consult the concordance lines, thus enabling learners to confirm their meaning inference. A total of 138 English as a Foreign Language students completed a meaning-recall vocabulary pre-test, and three different reading tasks, which were followed by meaning-recall vocabulary post-tests in a repeated measures design with a control condition. Overall, the findings showed that the second condition resulted in higher vocabulary gains than both the first condition and the control condition. Yet, a closer look at the interactions of (a) the participants’ clicking behaviors, (b) the difficulty of selected concordance lines, (c) the surrounding contexts around target lexical items, and (d) the participants’ prior knowledge of the target lexical items showed that each target lexical item may require different treatments for it to be recalled most efficiently and effectively. Through this investigation, the present study suggests that glossary information, such as concordance lines, may involve more complex and unexpected learner interactions.

    Gene surfing

    Spatially resolved genetic data is increasingly used to reconstruct the migrational history of species. To assist such inference, we study, by means of simulations and analytical methods, the dynamics of neutral gene frequencies in a population undergoing a continual range expansion in one dimension. During such a colonization period, lineages can fix at the wave front by means of a "surfing" mechanism [Edmonds C.A., Lillie A.S. & Cavalli-Sforza L.L. (2004) Proc Natl Acad Sci USA 101: 975-979]. We quantify this phenomenon in terms of (i) the spatial distribution of lineages that reach fixation and, closely related, (ii) the continual loss of genetic diversity (heterozygosity) at the wave front, characterizing the approach to fixation. Our simulations show that an effective population size can be assigned to the wave that controls the (observable) gradient in heterozygosity left behind the colonization process. This effective population size is markedly higher in pushed waves than in pulled waves, and increases only sub-linearly with deme size. To explain these and other findings, we develop a versatile analytical approach, based on the physics of reaction-diffusion systems, that yields simple predictions for any deterministic population dynamics.