21 research outputs found

    献辞(中間敬文教授古希記念特集)

    Get PDF
    © Published under licence by IOP Publishing Ltd. This paper describes the analysis and modelling of word usage frequency time series. During one of previous studies, an assumption was put forward that all word usage frequencies have uniform dynamics approaching the shape of a Gaussian function. This assumption can be checked using the frequency dictionaries of the Google Books Ngram database. This database includes 5.2 million books published between 1500 and 2008. The corpus contains over 500 billion words in American English, British English, French, German, Spanish, Russian, Hebrew, and Chinese. We clustered time series of word usage frequencies using a Kohonen neural network. The similarity between input vectors was estimated using several algorithms. As a result of the neural network training procedure, more than ten different forms of time series were found. They describe the dynamics of word usage frequencies from birth to death of individual words. Different groups of word forms were found to have different dynamics of word usage frequency variations

    Modelling of nonlinear filtering Poisson time series

    Get PDF
    © Published under licence by IOP Publishing Ltd.In this article, algorithms of non-linear filtering of Poisson time series are tested using statistical modelling. The objective is to find a representation of a time series as a wavelet series with a small number of non-linear coefficients, which allows distinguishing statistically significant details. There are well-known efficient algorithms of non-linear wavelet filtering for the case when the values of a time series have a normal distribution. However, if the distribution is not normal, good results can be expected using the maximum likelihood estimations. The filtration is studied according to the criterion of maximum likelihood by the example of Poisson time series. For direct optimisation of the likelihood function, different stochastic (genetic algorithms, annealing method) and deterministic optimization algorithms are used. Testing of the algorithm using both simulated series and empirical data (series of rare words frequencies according to the Google Books Ngram data were used) showed that filtering based on the criterion of maximum likelihood has a great advantage over well-known algorithms for the case of Poisson series. Also, the most perspective methods of optimisation were selected for this problem

    Cluster analysis of word frequency dynamics

    Get PDF
    © Published under licence by IOP Publishing Ltd. This paper describes the analysis and modelling of word usage frequency time series. During one of previous studies, an assumption was put forward that all word usage frequencies have uniform dynamics approaching the shape of a Gaussian function. This assumption can be checked using the frequency dictionaries of the Google Books Ngram database. This database includes 5.2 million books published between 1500 and 2008. The corpus contains over 500 billion words in American English, British English, French, German, Spanish, Russian, Hebrew, and Chinese. We clustered time series of word usage frequencies using a Kohonen neural network. The similarity between input vectors was estimated using several algorithms. As a result of the neural network training procedure, more than ten different forms of time series were found. They describe the dynamics of word usage frequencies from birth to death of individual words. Different groups of word forms were found to have different dynamics of word usage frequency variations

    Methods of verification of compliance with an asymptotically power law

    Get PDF
    © Published under licence by IOP Publishing Ltd. This paper describes methods of the power law verification using different empirical data. We do not analyse the value of deviations from the model but try to found out whether these deviations are regular or random. The suggested approach is based on the idea of finding local power approximation of the considered series for each range of ranks, after which one or another trend criterion is applied to the obtained series of local exponents. Application of the runs test is also discussed. The suggested methods were tested using 10 sets of empirical data, which are available for free. It was shown that compliance with the power law is satisfactory only in one case

    Non-hematopoietic erythropoietin-derived peptides for atheroprotection and treatment of cardiovascular diseases

    Get PDF
    Research in recent decades has shown that erythropoietin has a high cytoprotective activity, which is mainly associated with exposure to the mitochondrial link and has been confirmed in various experimental models. There is also a short-chain derivative, the 11-amino acid pyroglutamate helix B surface peptide (PHBSP), which selectively binds to the erythropoietin heterodymic receptor and reproduces its cytoprotective propertie

    Modelling of nonlinear filtering Poisson time series

    No full text
    © Published under licence by IOP Publishing Ltd.In this article, algorithms of non-linear filtering of Poisson time series are tested using statistical modelling. The objective is to find a representation of a time series as a wavelet series with a small number of non-linear coefficients, which allows distinguishing statistically significant details. There are well-known efficient algorithms of non-linear wavelet filtering for the case when the values of a time series have a normal distribution. However, if the distribution is not normal, good results can be expected using the maximum likelihood estimations. The filtration is studied according to the criterion of maximum likelihood by the example of Poisson time series. For direct optimisation of the likelihood function, different stochastic (genetic algorithms, annealing method) and deterministic optimization algorithms are used. Testing of the algorithm using both simulated series and empirical data (series of rare words frequencies according to the Google Books Ngram data were used) showed that filtering based on the criterion of maximum likelihood has a great advantage over well-known algorithms for the case of Poisson series. Also, the most perspective methods of optimisation were selected for this problem

    Modelling of nonlinear filtering Poisson time series

    Get PDF
    © Published under licence by IOP Publishing Ltd.In this article, algorithms of non-linear filtering of Poisson time series are tested using statistical modelling. The objective is to find a representation of a time series as a wavelet series with a small number of non-linear coefficients, which allows distinguishing statistically significant details. There are well-known efficient algorithms of non-linear wavelet filtering for the case when the values of a time series have a normal distribution. However, if the distribution is not normal, good results can be expected using the maximum likelihood estimations. The filtration is studied according to the criterion of maximum likelihood by the example of Poisson time series. For direct optimisation of the likelihood function, different stochastic (genetic algorithms, annealing method) and deterministic optimization algorithms are used. Testing of the algorithm using both simulated series and empirical data (series of rare words frequencies according to the Google Books Ngram data were used) showed that filtering based on the criterion of maximum likelihood has a great advantage over well-known algorithms for the case of Poisson series. Also, the most perspective methods of optimisation were selected for this problem

    Modelling of nonlinear filtering Poisson time series

    No full text
    © Published under licence by IOP Publishing Ltd.In this article, algorithms of non-linear filtering of Poisson time series are tested using statistical modelling. The objective is to find a representation of a time series as a wavelet series with a small number of non-linear coefficients, which allows distinguishing statistically significant details. There are well-known efficient algorithms of non-linear wavelet filtering for the case when the values of a time series have a normal distribution. However, if the distribution is not normal, good results can be expected using the maximum likelihood estimations. The filtration is studied according to the criterion of maximum likelihood by the example of Poisson time series. For direct optimisation of the likelihood function, different stochastic (genetic algorithms, annealing method) and deterministic optimization algorithms are used. Testing of the algorithm using both simulated series and empirical data (series of rare words frequencies according to the Google Books Ngram data were used) showed that filtering based on the criterion of maximum likelihood has a great advantage over well-known algorithms for the case of Poisson series. Also, the most perspective methods of optimisation were selected for this problem

    Methods of verification of compliance with an asymptotically power law

    No full text
    © Published under licence by IOP Publishing Ltd. This paper describes methods of the power law verification using different empirical data. We do not analyse the value of deviations from the model but try to found out whether these deviations are regular or random. The suggested approach is based on the idea of finding local power approximation of the considered series for each range of ranks, after which one or another trend criterion is applied to the obtained series of local exponents. Application of the runs test is also discussed. The suggested methods were tested using 10 sets of empirical data, which are available for free. It was shown that compliance with the power law is satisfactory only in one case

    Cluster analysis of word frequency dynamics

    Get PDF
    © Published under licence by IOP Publishing Ltd. This paper describes the analysis and modelling of word usage frequency time series. During one of previous studies, an assumption was put forward that all word usage frequencies have uniform dynamics approaching the shape of a Gaussian function. This assumption can be checked using the frequency dictionaries of the Google Books Ngram database. This database includes 5.2 million books published between 1500 and 2008. The corpus contains over 500 billion words in American English, British English, French, German, Spanish, Russian, Hebrew, and Chinese. We clustered time series of word usage frequencies using a Kohonen neural network. The similarity between input vectors was estimated using several algorithms. As a result of the neural network training procedure, more than ten different forms of time series were found. They describe the dynamics of word usage frequencies from birth to death of individual words. Different groups of word forms were found to have different dynamics of word usage frequency variations
    corecore