29 research outputs found
Intelligent search in Big Data
Β© Published under licence by IOP Publishing Ltd. An approach to data integration, aimed on the ontology-based intelligent search in Big Data, is considered in the case when information objects are represented in the form of relational databases (RDB), structurally marked by their schemes. The source of information for constructing an ontology and, later on, the organization of the search are texts in natural language, treated as semi-structured data. For the RDBs, these are comments on the names of tables and their attributes. Formal definition of RDBs integration model in terms of ontologies is given. Within framework of the model universal RDB representation ontology, oil production subject domain ontology and linguistic thesaurus of subject domain language are built. Technique of automatic SQL queries generation for subject domain specialists is proposed. On the base of it, information system for TATNEFT oil-producing company RDBs was implemented. Exploitation of the system showed good relevance with majority of queries
On Semantic Search Algorithm Optimization
Β© 2019, Springer Nature Switzerland AG. In the article we consider, on the example of development of a relational database (RDB) information system for Tatneft oil and gas company, an approach to organization of effective search in large arrays of heterogeneous data, satisfying the following essential requirements. On the one hand, the data is integrated at the semantic level, i.e. the system supports the presentation of data, describing its semantic properties within an unified subject domain ontology. Accordingly, end userβs request are formulated exclusively in the subject domain terminology. On the other hand, the system generates unregulated SQL-queries, i.e. the full text of possible SQL-queries, not just values of particular parameters, predefined by the system developers. Considered approach includes both the possibilities of increasing the reactivity of the universal SQL queries generation scheme as well as more specific optimization possibilities, arising from the particular system usage context
On Semantic Search Algorithm Optimization
Β© 2019, Springer Nature Switzerland AG. In the article we consider, on the example of development of a relational database (RDB) information system for Tatneft oil and gas company, an approach to organization of effective search in large arrays of heterogeneous data, satisfying the following essential requirements. On the one hand, the data is integrated at the semantic level, i.e. the system supports the presentation of data, describing its semantic properties within an unified subject domain ontology. Accordingly, end userβs request are formulated exclusively in the subject domain terminology. On the other hand, the system generates unregulated SQL-queries, i.e. the full text of possible SQL-queries, not just values of particular parameters, predefined by the system developers. Considered approach includes both the possibilities of increasing the reactivity of the universal SQL queries generation scheme as well as more specific optimization possibilities, arising from the particular system usage context
On ontology based data integration: Problems and solutions
Β© Published under licence by IOP Publishing Ltd. In the article essential problems of integrating heterogeneous data, arising in development of corporate databases intellectual access systems, are considered. In addition to the common structural problems, caused by variety of data organization, special attention is paid to the less obvious linguistic problems, caused by differences in data notation. A unified approach to overcoming such problems by sequential application of explicit definition of semantics, is described. This approach was tested in development of an intelligent search system for the TATNEFT oil-producing corporation; the system implementation showed high relevance of search results together with an adequate reactivity
Building subject domain ontology for a corporate web application
Copyright Β© 2020 for this paper by its authors. The technology of automated construction of the subject domain ontology, based on information extracted from the comments of the TATNEFT oil company relational databases, is considered. The technology is based on building a converter (compiler) translating the logical data model of Epicenter Petrotechnical Open Software Corporation (POSC), presented in the form of ER diagrams and a set of the EXPRESS object-oriented language descriptions, into the OWL ontology description language, recommended by the W3C consortium. The basic syntactic and semantic aspects of the transformation are described
Intelligent search in Big Data
Β© Published under licence by IOP Publishing Ltd. An approach to data integration, aimed on the ontology-based intelligent search in Big Data, is considered in the case when information objects are represented in the form of relational databases (RDB), structurally marked by their schemes. The source of information for constructing an ontology and, later on, the organization of the search are texts in natural language, treated as semi-structured data. For the RDBs, these are comments on the names of tables and their attributes. Formal definition of RDBs integration model in terms of ontologies is given. Within framework of the model universal RDB representation ontology, oil production subject domain ontology and linguistic thesaurus of subject domain language are built. Technique of automatic SQL queries generation for subject domain specialists is proposed. On the base of it, information system for TATNEFT oil-producing company RDBs was implemented. Exploitation of the system showed good relevance with majority of queries
Revisiting the βWest-Balticβ Type Hydronymy in Central Russia
Π ΡΠΊΠΎΠΏΠΈΡΡ ΠΏΠΎΡΡΡΠΏΠΈΠ»Π° Π² ΡΠ΅Π΄Π°ΠΊΡΠΈΡ 06.10.2020.Received on 6 October 2020.Π ΡΡΠ°ΡΡΠ΅ ΡΠ°ΡΡΠΌΠΎΡΡΠ΅Π½Π° ΡΡΠ±ΡΡΡΠ°ΡΠ½Π°Ρ Π³ΠΈΠ΄ΡΠΎΠ½ΠΈΠΌΠΈΡ ΠΠΎΠΎΡΡΡ ΠΈ ΠΠΎΠ΄Π½Π΅ΠΏΡΠΎΠ²ΡΡ (Ρ ΡΠΈΠ½Π°Π»ΡΠΌΠΈ -Π²Π°, -Π΄Π° ΠΈ Π΄Ρ.), ΠΎΠ±ΡΡΠ½ΠΎ ΠΎΡΠ½ΠΎΡΠΈΠΌΠ°Ρ ΠΊ Π·Π°ΠΏΠ°Π΄Π½ΠΎΠ±Π°Π»ΡΠΈΠΉΡΠΊΠΎΠΌΡ ΡΠΎΠΏΠΎΠ½ΠΈΠΌΠΈΡΠ΅ΡΠΊΠΎΠΌΡ ΡΠ»ΠΎΡ ΠΈ ΡΠ²ΡΠ·ΡΠ²Π°Π΅ΠΌΠ°Ρ Ρ ΡΠ·ΡΠΊΠΎΠΌ Π½ΠΎΡΠΈΡΠ΅Π»Π΅ΠΉ ΠΌΠΎΡΠΈΠ½ΡΠΊΠΎΠΉ Π°ΡΡ
Π΅ΠΎΠ»ΠΎΠ³ΠΈΡΠ΅ΡΠΊΠΎΠΉ ΠΊΡΠ»ΡΡΡΡΡ ΠΈ Π±Π»ΠΈΠ·ΠΊΠΈΡ
ΠΊ Π½Π΅ΠΉ Π³ΡΡΠΏΠΏ ΠΏΠ°ΠΌΡΡΠ½ΠΈΠΊΠΎΠ². ΠΠ²ΡΠΎΡΠΎΠΌ ΠΏΡΠΎΠ²Π΅Π΄Π΅Π½ Π°Π½Π°Π»ΠΈΠ· Π΅Π΅ ΠΏΡΠΎΡΡΡΠ°Π½ΡΡΠ²Π΅Π½Π½ΠΎΠ³ΠΎ ΡΠ°ΡΠΏΡΠ΅Π΄Π΅Π»Π΅Π½ΠΈΡ Π½Π° ΠΠΎΡΡΠΎΡΠ½ΠΎ-ΠΠ²ΡΠΎΠΏΠ΅ΠΉΡΠΊΠΎΠΉ ΡΠ°Π²Π½ΠΈΠ½Π΅. ΠΡΡΠ»Π΅Π΄ΠΎΠ²Π°Π½ΠΈΠ΅ ΠΏΠΎΠΊΠ°Π·Π°Π»ΠΎ, ΡΡΠΎ: 1) Π³ΠΈΠ΄ΡΠΎΠ½ΠΈΠΌΠΈΡ Π½Π° -Π²Π° ΡΠΎΠΎΡΠ½ΠΎΡΠΈΡΡΡ Ρ Π°ΡΠ΅Π°Π»ΠΎΠΌ ΡΡΠ±ΡΡΡΠ°ΡΠ½ΠΎΠΉ Π±Π°Π»ΡΠΈΠΉΡΠΊΠΎΠΉ Π³ΠΈΠ΄ΡΠΎΠ½ΠΈΠΌΠΈΠΈ Π² ΡΠ΅Π»ΠΎΠΌ ΠΈ ΡΠ΅Π³ΠΈΠΎΠ½Π°ΠΌΠΈ ΡΠ°ΡΠΏΡΠΎΡΡΡΠ°Π½Π΅Π½ΠΈΡ ΠΏΠ°ΠΌΡΡΠ½ΠΈΠΊΠΎΠ² Π΄Π½Π΅ΠΏΡΠΎ-Π΄Π²ΠΈΠ½ΡΠΊΠΎΠΉ, ΡΡ
Π½ΠΎΠ²ΡΠΊΠΎΠΉ, ΠΏΠΎΠ·Π΄Π½Π΅Π΄ΡΡΠΊΠΎΠ²ΡΠΊΠΎΠΉ ΠΊΡΠ»ΡΡΡΡ ΡΠ°Π½Π½Π΅Π³ΠΎ ΠΆΠ΅Π»Π΅Π·Π½ΠΎΠ³ΠΎ Π²Π΅ΠΊΠ°; 2) Π°ΡΠ΅Π°Π»Ρ Π³ΠΈΠ΄ΡΠΎΠ½ΠΈΠΌΠΎΠ² Ρ ΠΌΠ΅Π½ΠΎΠΉ ΠΆ/Π· ΠΈ Π³ΠΈΠ΄ΡΠΎΠ½ΠΈΠΌΠΎΠ², ΡΠ°Π·Π»ΠΈΡΠ°ΡΡΠΈΡ
ΡΡ ΡΠΎΠ»ΡΠΊΠΎ ΡΡΠΈΠΌΠΈ Π·Π²ΡΠΊΠ°ΠΌΠΈ, ΡΠΎΠΎΡΠ½ΠΎΡΡΡΡΡ Ρ Π°ΡΠ΅Π°Π»Π°ΠΌΠΈ Π΄ΡΠ΅Π²Π½ΠΎΡΡΠ΅ΠΉ ΠΊΡΠΈΠ²ΠΈΡΠ΅ΠΉ ΠΈ ΡΠ°Π΄ΠΈΠΌΠΈΡΠ΅ΠΉ, ΠΎΠ±Π»Π°ΡΡΡΠΌΠΈ Π½Π΅ΡΠ°Π·Π»ΠΈΡΠ΅Π½ΠΈΡ ΡΠ²ΠΈΡΡΡΡΠΈΡ
ΠΈ ΡΠΈΠΏΡΡΠΈΡ
ΠΈ ΡΠ΅ΠΏΠ΅Π»ΡΠ²ΠΎΠ³ΠΎ ΠΏΡΠΎΠΈΠ·Π½ΠΎΡΠ΅Π½ΠΈΡ ΡΠ²ΠΈΡΡΡΡΠΈΡ
Π² ΡΡΡΡΠΊΠΈΡ
Π΄ΠΈΠ°Π»Π΅ΠΊΡΠ°Ρ
; 3) Π±Π°Π»ΡΠΈΠΉΡΠΊΠ°Ρ Π³ΠΈΠ΄ΡΠΎΠ½ΠΈΠΌΠΈΡ Ρ ΡΠΈΠ½Π°Π»ΡΠ½ΡΠΌ ΠΊΠΎΠΌΠΏΠΎΠ½Π΅Π½ΡΠΎΠΌ -Π΄Π° ΠΎΡΡΡΡΡΡΠ²ΡΠ΅Ρ Π½Π° ΡΠ΅ΡΡΠΈΡΠΎΡΠΈΠΈ ΠΌΠΎΡΠΈΠ½ΡΠΊΠΎΠΉ ΠΊΡΠ»ΡΡΡΡΡ ΠΈ ΡΠ²ΡΠ·Π°Π½Π½ΡΡ
Ρ Π½Π΅ΠΉ Π³ΡΡΠΏΠΏ ΠΏΠ°ΠΌΡΡΠ½ΠΈΠΊΠΎΠ²; 4) ΠΈΠ· Π΄ΠΎΡΡΠΎΠ²Π΅ΡΠ½ΡΡ
Π½Π°Π·Π²Π°Π½ΠΈΠΉ Ρ ΠΊΠΎΡΠ½Π΅ΠΌ ape-/upe- Π² ΠΌΠΎΡΠΈΠ½ΡΠΊΠΎΠΌ Π°ΡΠ΅Π°Π»Π΅ ΠΎΡΠΌΠ΅ΡΠ΅Π½Ρ Π»ΠΈΡΡ ΡΠΎΠ΄Π΅ΡΠΆΠ°ΡΠΈΠ΅ Π²ΠΎΡΡΠΎΡΠ½ΠΎΠ±Π°Π»ΡΠΈΠΉΡΠΊΠΈΠΉ Π²Π°ΡΠΈΠ°Π½Ρ; 5) Π²ΠΎΡΡΠΎΡΠ½ΠΎΠ±Π°Π»ΡΠΈΠΉΡΠΊΠΎΠ΅ ΠΎΠ±ΡΡΡΠ½Π΅Π½ΠΈΠ΅ Π½Π°Π·Π²Π°Π½ΠΈΠΉ Ρ ΠΊΠΎΡΠ½Π΅ΠΌ stab- Π½Π΅ ΡΡΡΡΠΏΠ°Π΅Ρ Π·Π°ΠΏΠ°Π΄Π½ΠΎ-Π±Π°Π»ΡΠΈΠΉΡΠΊΠΎΠΌΡ; 6) ΠΎΡΡΡΡΡΡΠ²ΡΡΡ Π΄ΠΎΡΡΠ°ΡΠΎΡΠ½ΡΠ΅ ΠΎΡΠ½ΠΎΠ²Π°Π½ΠΈΡ Π΄Π»Ρ Π²ΠΎΠ·Π²Π΅Π΄Π΅Π½ΠΈΡ Π½Π°Π·Π²Π°Π½ΠΈΠΉ Π½Π΅ΠΊΠΎΡΠΎΡΡΡ
ΡΠ΅ΠΊ ΠΊ ΠΏΡΡΡΡΠΊΠΈΠΌ ΡΠ»ΠΎΠ²Π°ΠΌ pannean ΠΈ suge: Π±Γ³Π»ΡΡΠ°Ρ ΡΠ°ΡΡΡ ΡΡΠΈΡ
Π³ΠΈΠ΄ΡΠΎΠ½ΠΈΠΌΠΎΠ² Π½ΠΎΡΠΈΡ ΠΏΠΎΠ·Π΄Π½ΠΈΠΉ Ρ
Π°ΡΠ°ΠΊΡΠ΅Ρ, Π° Π΄Π»Ρ ΠΌΠ΅Π½ΡΡΠ΅ΠΉ Π±ΠΎΠ»Π΅Π΅ Π²Π΅ΡΠΎΡΡΠ½Ρ ΠΈΠ½ΡΠ΅ ΠΎΠ±ΡΡΡΠ½Π΅Π½ΠΈΡ; 7) Π³ΠΈΠΏΠΎΡΠ΅Π·Π° ΠΎ ΠΏΡΠΈΠ½Π°Π΄Π»Π΅ΠΆΠ½ΠΎΡΡΠΈ Π½Π΅ΠΊΠΎΡΠΎΡΡΡ
Π³ΠΈΠ΄ΡΠΎΠ½ΠΈΠΌΠΎΠ² ΠΊ ΡΠΈΡΠ»Ρ ΡΡΠ±ΡΡΡΠ°ΡΠ½ΡΡ
(ΠΠ΅ΡΠ½Π°, ΠΠΏΠΎΡΠΈΠ½ΠΊΠ°, ΠΠΎΠ½Ρ, Π‘Π΅ΠΆΠΈΠΊΠΎΠ²ΠΊΠ° ΠΈ Π΄Ρ.) Π½Π΅ ΠΏΠΎΠ΄ΡΠ²Π΅ΡΠΆΠ΄Π°Π΅ΡΡΡ. ΠΡΡ
ΠΎΠ΄Ρ ΠΈΠ· Π²ΡΡΠ΅ΠΏΠ΅ΡΠ΅ΡΠΈΡΠ»Π΅Π½Π½ΡΡ
Π½Π°Π±Π»ΡΠ΄Π΅Π½ΠΈΠΉ ΠΎΡΠΏΠΎΡΠ΅Π½Π° Π³ΠΈΠΏΠΎΡΠ΅Π·Π° ΠΎ Π½Π°Π»ΠΈΡΠΈΠΈ Π·Π°ΠΏΠ°Π΄Π½ΠΎΠ±Π°Π»ΡΠΈΠΉΡΠΊΠΎΠ³ΠΎ ΡΠ»ΠΎΡ Π³ΠΈΠ΄ΡΠΎΠ½ΠΈΠΌΠΈΠΈ Π² ΠΠΎΠΎΡΡΠ΅ ΠΈ Π²ΡΡΠ΅ΠΊΠ°ΡΡΠ΅Π΅ ΠΈΠ· Π½Π΅Π΅ ΠΏΡΠ΅Π΄ΠΏΠΎΠ»ΠΎΠΆΠ΅Π½ΠΈΠ΅ ΠΎ Π·Π°ΠΏΠ°Π΄Π½ΠΎΠ±Π°Π»ΡΠΈΠΉΡΠΊΠΎΠΉ ΠΈΠ΄Π΅Π½ΡΠΈΡΠ½ΠΎΡΡΠΈ Π½ΠΎΡΠΈΡΠ΅Π»Π΅ΠΉ ΠΌΠΎΡΠΈΠ½ΡΠΊΠΎΠΉ ΠΊΡΠ»ΡΡΡΡΡ.The article examines the substrate hydronymy of the middle Oka and the Dnieper regions (ending in -va, -da, etc.) that is typically attributed to the West-Baltic toponymic stratum and associated with the language of the Moschinskaya archaeological culture and the related archaeological sites. The author analyzed its spatial distribution in the East European Plain. The study has found that: 1) the spread of names of waterbodies ending in -va correlates with the distribution scheme of substrate Baltic hydronymy in general and the monuments of the Dnieper-Dvina, Yukhnovskaya, and Late Dyakovo cultures of the Early Iron Age; 2) the spread of hydronyms with zh/z sound variation (including as a distinctive feature) correlates with the Krivich and Radimich culture areas, and the range of Russian dialects with lisping pronunciation which makes no diff erence between sibilants and hushing sounds; 3) Baltic hydronymy ending in -da is not attested in the area of the Moschinskaya culture and related archaeological sites; 4) among the names with the root ape-/upe- found in the same cultural milieu, only those containing Eastern Baltic variant are verifi able; 5) the hypothesis for East Baltic origination of the names with the root stab- is not inferior to the West Baltic; 6) there are no suffi cient grounds for tracing some river names to the Prussian words pannean and sug since most of these hydronyms refer to a later period while the others have more plausible explanations; 7) for some hydronyms (Zerna, Opochinka, Ponya, Sezhikovka, etc.) the substrate origin is not confi rmed. Based on the above observations, the hypothesis for the presence of a West-Baltic layer of hydronymy in the middle Oka region and the consequent assumption of the West-Baltic origin of the Moshinskaya culture were disputed
On ontology based data integration: Problems and solutions
Β© Published under licence by IOP Publishing Ltd. In the article essential problems of integrating heterogeneous data, arising in development of corporate databases intellectual access systems, are considered. In addition to the common structural problems, caused by variety of data organization, special attention is paid to the less obvious linguistic problems, caused by differences in data notation. A unified approach to overcoming such problems by sequential application of explicit definition of semantics, is described. This approach was tested in development of an intelligent search system for the TATNEFT oil-producing corporation; the system implementation showed high relevance of search results together with an adequate reactivity