On the role of autocorrelations in texts

Author: Lande D. V.
Snarskii A. A.
Publication date: 01/10/2007

The task of finding a criterion that distinguishes a text from an arbitrary set of words is relevant in its own right, for instance for developing tools for indexing internet content or for separating signal from noise in communication channels. Zipf's law is currently considered the most reliable criterion of this kind [3]. At any rate, conventional stochastic word sets do not obey this law. The present paper deals with one possible criterion based on determining the degree of data compression.
Comment: 5 pages, 4 figures, 5 references

arXiv.org e-Print Archive
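
As a rough illustration of a compression-based criterion like the one this abstract mentions, the sketch below compares how well a word sequence compresses in its original order against a shuffled copy of the same words. The abstract does not specify the procedure, so the use of `zlib`, the function names, and the shuffling baseline are all assumptions.

```python
import random
import zlib


def compression_ratio(words):
    """Compressed size divided by raw size for a space-joined word sequence."""
    raw = " ".join(words).encode("utf-8")
    return len(zlib.compress(raw, 9)) / len(raw)


def shuffled_ratio(words, seed=0):
    """Compression ratio of the same words in random order (hypothetical baseline)."""
    shuffled = list(words)
    random.Random(seed).shuffle(shuffled)
    return compression_ratio(shuffled)


# Toy input: an ordered, repetitive word sequence (an assumption, not real data).
sample = ("the quick brown fox jumps over the lazy dog and runs far away "
          "from the angry farmer").split() * 50
ordered = compression_ratio(sample)
baseline = shuffled_ratio(sample)
```

A sequence with word-order correlations should give a lower ratio than its shuffled copy, since shuffling preserves word frequencies but destroys the ordering that the compressor exploits.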

Diagram of measurement series elements deviation from local linear approximations

Author: Lande D. V.
Snarskii A. A.
Publication date: 19/03/2009

A method for detecting and visualizing trends, periodicities, and local peculiarities in measurement series (the dL-method), based on detrended fluctuation analysis (DFA), is proposed. The essence of the method is to display the absolute deviations of the points of the accumulated measurement series from the corresponding values of a local linear approximation. It is shown that in some cases the dL-method identifies local peculiarities better than wavelet analysis. The proposed approach is easy to implement and can be used to analyze time series in fields such as economics and sociology.
Comment: 4 pages, 6 figures

arXiv.org e-Print Archive
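
The quantity described in this abstract can be sketched for a single window size as follows. This is a minimal sketch: the non-overlapping windows, the mean removal before accumulation (as in standard DFA), and the function name are assumptions, and the paper's diagram presumably covers a range of window sizes.

```python
def local_linear_deviation(series, window):
    """Absolute deviation of the accumulated series from per-window linear fits.

    Assumes window >= 2; trailing points that do not fill a window are dropped.
    """
    mean = sum(series) / len(series)
    # Accumulated (profile) series: running sum of mean-removed values.
    profile = []
    total = 0.0
    for x in series:
        total += x - mean
        profile.append(total)

    deviations = []
    t = list(range(window))
    t_mean = sum(t) / window
    denom = sum((ti - t_mean) ** 2 for ti in t)
    for start in range(0, len(profile) - window + 1, window):
        segment = profile[start:start + window]
        y_mean = sum(segment) / window
        # Ordinary least-squares line through the points of this window.
        slope = sum((ti - t_mean) * (yi - y_mean)
                    for ti, yi in zip(t, segment)) / denom
        intercept = y_mean - slope * t_mean
        deviations.extend(abs(yi - (slope * ti + intercept))
                          for ti, yi in zip(t, segment))
    return deviations
```

Plotting the returned values against the point index, for several window sizes, would give a picture in which local anomalies stand out as regions of large deviation.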

Self-similarity properties in a queuing network model

Author: Lande D. V.
Snarskii A. A.
Publication date: 05/10/2009

In this paper, a model of a subscriber telephone network based on the concept of cellular automata is developed. The model exhibits fractal properties that vary with the parameters defining its operating rules. The main advantage of the model is its compatibility with algorithmic methods: a finite set of formal rules, defined on a finite set of elements (cells), allows a precise implementation in the form of algorithms.
Comment: 5 pages, 5 figures

arXiv.org e-Print Archive

Compactified Horizontal Visibility Graph for the Language Network

Author: Lande D. V.
Snarskii A. A.
Publication date: 19/02/2013

A compactified horizontal visibility graph for the language network is proposed. It was found that networks constructed in this way are scale-free, and that the nodes with the largest degrees include words that determine not only the communicative structure of a text but also its informational structure.
Comment: 9 pages, 3 figures, 2 appendix tables

arXiv.org e-Print Archive
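
For readers unfamiliar with the construction, here is a minimal sketch of a standard horizontal visibility graph followed by a merging step. The abstract does not say how words are mapped to numbers or how compactification is done; the sketch assumes each word occurrence carries some numeric weight and that compactification merges all occurrences of the same word into one node. Both function names are hypothetical.

```python
def horizontal_visibility_edges(values):
    """Edges (i, j), i < j, where every value strictly between positions
    i and j is lower than both values[i] and values[j]."""
    edges = set()
    n = len(values)
    for i in range(n - 1):
        highest_between = float("-inf")
        for j in range(i + 1, n):
            if highest_between < min(values[i], values[j]):
                edges.add((i, j))
            highest_between = max(highest_between, values[j])
            # Once something at least as tall as values[i] appears,
            # nothing further to the right can see position i.
            if highest_between >= values[i]:
                break
    return edges


def compactify(edges, labels):
    """Merge positions that share a label (assumed: the same word),
    dropping self-loops and duplicate edges."""
    merged = set()
    for i, j in edges:
        a, b = labels[i], labels[j]
        if a != b:
            merged.add((min(a, b), max(a, b)))
    return merged


# Toy example with made-up weights and labels.
weights = [3, 1, 2, 4]
words = ["a", "b", "a", "c"]
hvg = horizontal_visibility_edges(weights)
word_graph = compactify(hvg, words)
```

Node degrees in `word_graph` can then be counted per word to look for the high-degree words the abstract refers to.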

Power law in website ratings

Author: Lande D. V.
Snarskii A. A.
Publication date: 17/06/2010

In the practical work of promoting websites and analyzing their efficiency and download activity, it is of key importance to take web-rating data into account. The main indicators of website traffic include the number of unique hosts from which the analyzed website was accessed and the number of web pages served (hits) per unit time (for example, per day, month, or year). Of particular interest is the relation between the number of hits (S) and the number of hosts (H). In practice, the notion of an "average number of viewed pages" (S/H) is even used, which by default assumes a linear dependence of S on H. In fact, a linear dependence is observed only as a special case of a power-law dependence, and not always. Thus another power law has been discovered on the Internet, in particular on the WWW.
Comment: 4 pages, 5 figures

arXiv.org e-Print Archive
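
One common way to check a relation of the form S = c * H^alpha is a least-squares fit in log-log coordinates; alpha = 1 recovers the linear case. The sketch below assumes that approach, which the abstract does not confirm, and the function name and sample numbers are invented for illustration.

```python
import math


def fit_power_law(hosts, hits):
    """Least-squares fit of log(S) = log(c) + alpha * log(H).

    Returns (c, alpha). All inputs must be positive.
    """
    xs = [math.log(h) for h in hosts]
    ys = [math.log(s) for s in hits]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    alpha = (sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
             / sum((x - x_mean) ** 2 for x in xs))
    c = math.exp(y_mean - alpha * x_mean)
    return c, alpha


# Synthetic data generated from S = 2 * H**1.2 (not real web statistics).
example_hosts = [10, 100, 1000, 10000]
example_hits = [2 * h ** 1.2 for h in example_hosts]
c_est, alpha_est = fit_power_law(example_hosts, example_hits)
```

An estimated alpha that differs clearly from 1 would indicate that the ratio S/H is not constant across sites, so a single "average number of viewed pages" would be misleading.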

Ranking of nodes of networks taking into account the power function of its weight of connections

Author: Lande D. V.
Soboliev A. M.
Publication date: 14/11/2017

To rank nodes in quasi-hierarchical networks of a social nature, it is necessary to analyze the network in detail, evaluate the results against all the given criteria, and identify the most influential nodes. The overwhelming majority of existing ranking algorithms evaluate such networks as a whole, which does not allow the influence of nodes on one another to be clearly determined. The study analyzes the results of the known node-ranking algorithms HITS and PageRank and compares the obtained data with an expert evaluation of the network. For effective analysis of quasi-hierarchical networks, the basic HITS algorithm is modified so that nodes are evaluated and ranked according to the given criteria (the number of incoming and outgoing links among them), in agreement with the results of the expert evaluation. It is shown that in some cases the resulting method gives results that correspond to the real social relations, and that the authority indices of the nodes correspond to pre-assigned social roles.
Comment: 11 pages, 2 fi

arXiv.org e-Print Archive
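
For reference, the unmodified HITS iteration that this abstract takes as its starting point can be sketched as below. The modification proposed in the paper is not described in the abstract and is not reproduced here; the function name, the fixed iteration count, and the toy graph are assumptions.

```python
import math


def hits(edges, iterations=50):
    """Basic HITS on a directed edge list of (source, target) pairs.

    Returns (hub, authority) dictionaries with L2-normalized scores.
    """
    nodes = sorted({node for edge in edges for node in edge})
    hub = {node: 1.0 for node in nodes}
    authority = {node: 1.0 for node in nodes}
    for _ in range(iterations):
        # Authority update: sum of hub scores of nodes linking in.
        authority = {node: 0.0 for node in nodes}
        for src, dst in edges:
            authority[dst] += hub[src]
        norm = math.sqrt(sum(v * v for v in authority.values())) or 1.0
        authority = {node: v / norm for node, v in authority.items()}
        # Hub update: sum of authority scores of nodes linked to.
        hub = {node: 0.0 for node in nodes}
        for src, dst in edges:
            hub[src] += authority[dst]
        norm = math.sqrt(sum(v * v for v in hub.values())) or 1.0
        hub = {node: v / norm for node, v in hub.items()}
    return hub, authority


# Toy graph: "a" links to "c" and "d", "b" links to "c".
toy_hub, toy_authority = hits([("a", "c"), ("b", "c"), ("a", "d")])
```

In the toy graph, "c" receives the most incoming links from good hubs and "a" points to the most authorities, which is the kind of ranking an expert evaluation could be compared against.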

Decomposing an information stream into the principal components

Author: Hraivoronska A. M.
Lande D. V.
Publication date: 16/07/2018

We propose an approach to decomposing a thematic information stream into principal components. Each principal component is related to a narrow topic extracted from the information stream. The essence of the approach arises from an analogy with the Fourier transform. We examine methods for analyzing the principal components and propose using multifractal analysis to identify similar topics. The decomposition technique is applied to an information stream dedicated to Brexit. We compare the principal components obtained by applying the decomposition to the Brexit stream with the related topics extracted by Google Trends.
Comment: 14 pages; 4 figures; 4 tables

arXiv.org e-Print Archive

On the fractal nature of mutual relevance sequences in the Internet news message flows

Author: Braichevsky S.
Lande D.
Snarskii A.
Publication date: 01/10/2007

In information retrieval, the term relevance is taken to mean the formal conformity of a document returned by the retrieval system to the user's information query. As a rule, the documents found by the retrieval system should be presented to the user in a certain order. Therefore, retrieval, understood as the selection of documents that formally satisfy the user's query, should be supplemented with a procedure for processing the relevant set. It is natural to introduce a quantitative measure of a document's conformity to the query, i.e. a relevance measure. Since no single rule exists for determining the relevance measure, we consider two that are, in our opinion, the simplest. The proposed approach imposes no restrictions and can be applied to other relevance measures.
Comment: 6 pages, 56 figures

arXiv.org e-Print Archive

Detection Implicit Links and G-betweenness

Author: Lande D. V.
Snarskii A. A.
Zhenirovskyy M. I.
Publication date: 24/08/2010

The concept of implicit links in complex networks is defined, and a new quantity, the cohesion factor, is introduced that makes it possible to evaluate numerically the presence of such a connection between any two nodes. We also introduce a generalization of betweenness that allows network nodes to be ranked in more detail. The effectiveness of the proposed concepts is demonstrated with numerical examples.
Comment: 11 pages, 3 figures

arXiv.org e-Print Archive

"Conjectural" links in complex networks

Author: Lande D. V.
Snarskii A. A.
Zorinets D. I.
Publication venue: 'Elsevier BV'
Publication date: 16/03/2016

This paper introduces the concept of a Conjectural Link for Complex Networks, in particular social networks. By a Conjectural Link we mean an implicit link that is absent from the network but is presumed to exist on the basis of the network's topology. This can happen, for example, when some connections are omitted from the formal description of the network because of errors, are deliberately hidden, or have been removed (e.g. when the network is partially destroyed). A parameter is introduced that allows Conjectural Links to be ranked: the larger this parameter, the more likely it is that the connection should be present in the network. The paper presents a method for recovering partially destroyed Complex Networks by finding Conjectural Links. Two methods are presented for finding pairs of nodes that are not directly linked but are highly likely to be joined by a Conjectural Link: one based on determining the resistance between two nodes, and one based on computing the lengths of routes between two nodes. Several examples of real networks are reviewed, and a comparison is made with known link prediction methods, which are not intended to find missing links in already formed networks.
Comment: 12 pages, 3 figures

arXiv.org e-Print Archive