Statistična analiza izgovora črke l v Slovenskem oblikoslovnem leksikonu Sloleks

Abstract

The ambiguous pronunciation of the letter l before consonant graphemes (e.g., polža, alge, volilca) poses a problem for grapheme-to-phoneme conversion for Slovenian. Despite having been addressed in multiple Slovenian language resources, the problem is still unresolved. Because of a lack of empirical and machine-readable data on the pronunciation of native speakers, a dataset of approximately six thousand lexemes (and their inflected forms) was manually annotated from the Sloleks Morphological Lexicon of Slovene according to their pronunciation (/l/, /u̯/, or both) and a statistical analysis was performed to reveal the most problematic points. The findings will be useful in developing a model to predict the pronunciation of the letter l before a consonant grapheme.Dvoumnost izgovora črke l v položaju pred soglasniškim grafemom (polža, alge, volilca) predstavlja problem v grafemsko-fonemski pretvorbi za slovenščino in kljub večkratni obravnavi v slovenskih jezikovnih virih še vedno ni razrešena. Zaradi pomanjkanja empiričnih strojno berljivih podatkov o izgovoru rojenih govorcev in govork smo ročno označili množico približno 6.000 leksemov (in njihovih pregibnih oblik) iz Slovenskega oblikoslovnega leksikona Sloleks glede na izgovor (/l/, /u̯/ ali variantno) ter opravili statistično analizo, ki razkrije najbolj problematične točke. Izsledki bodo v pomoč pri razvoju modela, ki napoveduje izgovor črke l pred soglasniškim grafemom

Similar works

Full text

thumbnail-image

ZRC SAZU Publishing (Znanstvenoraziskovalni center - Slovenske akademije znanosti in umetnosti)

redirect
Last time updated on 04/07/2025

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.

Licence: https://creativecommons.org/licenses/by-sa/4.0