26 research outputs found

    Conceptual design of a database to store recordings from articulographic studies of Polish speech

    No full text
    W artykule opisano struktur臋 i funkcjonalno艣膰 bazy danych artykulograficznych do przechowywania danych z bada艅 przeprowadzanych z wykorzystaniem artykulografu elektromagnetycznego, kamery akustycznej i 3 kamer wideo. Baza danych umo偶liwia selektywne pobieranie r贸偶nych typ贸w danych, w szczeg贸lno艣ci dotycz膮cych m贸wcy, sesji nagraniowej, nagra艅 oraz eksperyment贸w. Opisano struktur臋 i budow臋 bazy danych. Przedstawiono r贸wnie偶 potencjalne przysz艂e zastosowania do przeprowadzania analiz statystycznych oraz w eksperymentach dotycz膮cych inwersji mowy z wykorzystaniem modeli sieci Bayesa.The article describes the structure and functionality of the articulographic database for storing data from articulographic research using an electromagnetic articulograph, an acoustic camera and 3 video cameras. The database enables selective extraction of various types of data for scientific research and interoperates with programs that carry out experiments. Structure and construction of the database is described. Potential future application in statistical analysis and experiments on speech inversion using dynamic Bayesian networks (DBN) was also presented

    Fusing the electromagnetic articulograph, high-speed video cameras and a 16-channel microphone array for speech analysis

    No full text
    Electromagnetic articulography (EMA) is one of the instrumental phonetic research methods used for recording and assessing articulatory movements. Usually, articulographic data are analysed together with standard audio recordings. This paper, however, demonstrates how coupling the articulograph with devices providing other types of information may be used in more advanced speech research. A novel measurement system is presented that consists of the AG 500 electromagnetic articulograph, a 16-channel microphone array with a dedicated audio recorder and a video module consisting of 3 high-speed cameras. It is argued that synchronization of all these devices allows for comparative analyses of results obtained with the three components. To complement the description of the system, the article presents innovative data analysis techniques developed by the authors as well as preliminary results of the system鈥檚 accuracy

    Application Hidden Markov Models to Automatic Detection of Speech Disorder

    No full text
    W artykule przedstawiono wyniki bada艅 dotycz膮cych automatycznej detekcji wad wymowy u dzieci. Jako materia艂 badawczy zosta艂y wykorzystane nagrania pochodz膮ce od dzieci z wadami wymowy. Zadanie polega艂o na rozpoznaniu nieprawid艂owo realizowanego fonemu w wybranych s艂owach testowych. Detekcja by艂a dokonywana za pomoc膮 metod rozpoznawania mowy, w kt贸rych jako cec sygna艂u mowy u偶yto dw贸ch najbardziej obiecuj膮cych rodzaj贸w cech: wsp贸艂czynnika MFCC praz wsp贸艂czynnik贸w HFCC. Jako klasyfikatora u偶yto metody niejawnych modeli Markowa (HMM), gdzie modelowanymi jednostkami fonetycznimi by艂y zar贸wno fonemy jak i ca艂e s艂owa. W badanych metodach dobrano ich parametry w celu zmaksymalizowania skuteczno艣ci rozpoznawania. W artykule zaprezentowano r贸wnie偶 analiz臋 por贸wnawcz膮 wynik贸w rozpoznawania otrzymanych z wykorzystaniem metody HMM oraz testowanej w poprzednich pracach metody nieliniowej transformacji czasowej (DTW).The results of research on automatic detection of the pathological phoneme pronunciation are presented in the paper. Speech samples came from speech impaired children and persons who imitated pathological phoneme pronunciation. The recognition task was to find wrongly realized phoneme in the selected test utterances. At the reature extraction stage the most effective features` types have been used: standard Mel-Frequency Cepstral Coefficients (MFCC) and recently proposed Human Factor Cepstral Coefficients (HFCC). As a classificator hidden Markov models, with modeled speech unit being a phoneme as well as a whole word, have been used. The parameters of the HMMs were adjusted in order to achieve the best recognition accuracy. Comparision of the HMM and DTW methods is also presented in the paper
    corecore