5 research outputs found

    Keyphrases analysis of BIM standards through occurrence of most common BIM uses

    Get PDF
    The 8th PSU-UNS International Conference on Engineering and Technology (ICET-2017), Novi Sad, Serbia, June 8-10, 2017 University of Novi Sad, Faculty of Technical Sciences Abstract: Building Information Modeling (BIM) does not represent only the virtual model of the facility but a comperhensive approach consisting of technology, processes, stakeholders' behavior and accompanying standards.Given the fast evolution of BIM, this paper is analysing trends of development of BIM standards throughout the years by applying the keyphrases analysis method, for some of the most common BIM uses and recognizable phrases in BIM industry

    Detection of Sociolinguistic Features in Digital Social Networks for the Detection of Communities

    Get PDF
    The emergence of digital social networks has transformed society, social groups, and institutions in terms of the communi cation and expression of their opinions. Determining how language variations allow the detection of communities, together with the relevance of specifc vocabulary (proposed by the National Council of Accreditation of Colombia (Consejo Nacional de AcreditaciΓ³n - CNA) to determine the quality evaluation parameters for universities in Colombia) in digital assemblages could lead to a better understanding of their dynamics and social foundations, thus resulting in better communication policies and intervention where necessary. The approach presented in this paper intends to determine what are the semantic spaces (sociolinguistic features) shared by social groups in digital social networks. It includes fve layers based on Design Science Research, which are integrated with Natural Language Processing techniques (NLP), Computational Linguistics (CL), and Artifcial Intelligence (AI). The approach is validated through a case study wherein the semantic values of a series of β€œTwit ter” institutional accounts belonging to Colombian Universities are analyzed in terms of the 12 quality factors established by CNA. In addition, the topics and the sociolect used by diferent actors in the university communities are also analyzed. The current approach allows determining the sociolinguistic features of social groups in digital social networks. Its application allows detecting the words or concepts to which each actor of a social group (university) gives more importance in terms of vocabular

    Extracting Food Substitutes From Food Diary via Distributional Similarity

    Get PDF
    Genetic ancestry admixture of patients infected with Influenza A(H1N1)pdm09 sorted by African ancestry. Each individual ancestry is depicted as a column, whereas color represents the proportion of ancestry estimated for that individual (AfricanΒ =Β blue; EuropeanΒ =Β brown; Native AmericanΒ =Β green). (A) Non-hospitalized patients and (B) Hospitalized patients

    Knowledge Extraction and Visualization from Textual Sources Intended for Construction Project Management

    Get PDF
    Π’ΠΎΠΊΠΎΠΌ ΠΆΠΈΠ²ΠΎΡ‚Π½ΠΎΠ³ циклуса инвСстиционог ΠΏΡ€ΠΎΡ˜Π΅ΠΊΡ‚Π° ствара сС Π²Π΅Π»ΠΈΠΊΠΈ корпус нСструктуираних ΠΈ полуструктуираних Π΄ΠΎΠΊΡƒΠΌΠ΅Π½Π°Ρ‚Π°. Π’Ρ€Π°Π΄ΠΈΡ†ΠΈΠΎΠ½Π°Π»Π½ΠΈ приступи Ρƒ ΡΠΊΠ»Π°Π΄ΠΈΡˆΡ‚Π΅ΡšΡƒ ΠΈ ΠΎΡ€Π³Π°Π½ΠΈΠ·ΠΎΠ²Π°ΡšΡƒ ΠΈΠ½Ρ„ΠΎΡ€ΠΌΠ°Ρ†ΠΈΡ˜Π° ΠΈΠ· нСструктуираних ΠΏΠΎΠ΄Π°Ρ‚ΠΊΠ° су ΠΎΡ€ΠΈΡ˜Π΅Π½Ρ‚ΠΈΡΠ°Π½ΠΈ Π½Π° Ρ€Π°Π΄ са Π΄ΠΎΠΊΡƒΠΌΠ΅Π½Ρ‚ΠΈΠΌΠ°, ΡˆΡ‚ΠΎ ΠΈΡ… Ρ‡ΠΈΠ½ΠΈ нСподСсним Π·Π° Π°Π½Π°Π»ΠΈΠ·Ρƒ ΠΈ издвајањС знања. Π£ нСструктуираним Π΄ΠΎΠΊΡƒΠΌΠ΅Π½Ρ‚ΠΈΠΌΠ° јС ΠΎΡ‚Π΅ΠΆΠ°Π½ΠΎ ΠΏΡ€ΠΈΠΊΡƒΠΏΡ™Π°ΡšΠ΅, Π°Π½Π°Π»ΠΈΠ·Π° ΠΈ ΠΏΠΎΠ½ΠΎΠ²Π½ΠΎ ΠΊΠΎΡ€ΠΈΡˆΡ›Π΅ΡšΠ΅ Ρ€Π΅Π»Π΅Π²Π°Π½Ρ‚Π½ΠΈΡ… ΠΈΠ½Ρ„ΠΎΡ€ΠΌΠ°Ρ†ΠΈΡ˜Π° Ρƒ ΠΈΠ½Ρ‚Π΅Π³Ρ€Π°Π»Π½ΠΎΠΌ ΠΎΠ±Π»ΠΈΠΊΡƒ, ΡˆΡ‚ΠΎ ΠΌΠΎΠΆΠ΅ ΠΈΠ·Π°Π·Π²Π°Ρ‚ΠΈ ΠΏΡ€ΠΎΠ±Π»Π΅ΠΌΠ΅ Π½Π° ΠΏΡ€ΠΎΡ˜Π΅ΠΊΡ‚Ρƒ услСд Π½Π΅Π±Π»Π°Π³ΠΎΠ²Ρ€Π΅ΠΌΠ΅Π½ΠΈΡ… ΠΈΠ»ΠΈ Π½Π΅ΠΎΠ΄Π³ΠΎΠ²Π°Ρ€Π°Ρ˜ΡƒΡ›ΠΈΡ… ΠΎΠ΄Π»ΡƒΠΊΠ°. Π£ овој Π΄ΠΈΡΠ΅Ρ€Ρ‚Π°Ρ†ΠΈΡ˜ΠΈ јС ΠΏΡ€ΠΈΠΊΠ°Π·Π°Π½Π° Ρ€Π΅ΠΏΡ€Π΅Π·Π΅Π½Ρ‚Π°Ρ†ΠΈΡ˜Π° ΠΈΠ½Ρ„ΠΎΡ€ΠΌΠ°Ρ†ΠΈΡ˜Π° ΠΈΠ·Π΄Π²ΠΎΡ˜Π΅Π½ΠΈΡ… ΠΈΠ· нСструктуираних тСкстуалних Π΄ΠΎΠΊΡƒΠΌΠ΅Π½Π°Ρ‚Π° Ρƒ ΠΎΠ±Π»ΠΈΠΊΡƒ Π³Ρ€Π°Ρ„Π° Π·Π½Π°Ρ‡Π°Ρ˜Π½ΠΈΡ… Ρ„Ρ€Π°Π·Π°, који корисницима Ρ‚Ρ€Π΅Π±Π° Π΄Π° ΠΎΠΌΠΎΠ³ΡƒΡ›ΠΈ Π²ΠΈΠ·ΡƒΠ΅Π»ΠΈΠ·Π°Ρ†ΠΈΡ˜Ρƒ ΠΈ Π°Π½Π°Π»ΠΈΠ·Ρƒ Π·Π½Π°Ρ‡Π°Ρ˜Π½ΠΈΡ… Ρ‡ΠΈΡšΠ΅Π½ΠΈΡ†Π° Π½Π° ΠΏΡ€ΠΎΡ˜Π΅ΠΊΡ‚Ρƒ са ΠΌΠΈΠ½ΠΈΠΌΠ°Π»Π½ΠΎΠΌ ΠΊΠΎΠ»ΠΈΡ‡ΠΈΠ½ΠΎΠΌ ΡƒΠ»ΠΎΠΆΠ΅Π½ΠΎΠ³ Ρ‚Ρ€ΡƒΠ΄Π°. Π‘Π° Ρ†ΠΈΡ™Π΅ΠΌ Π΄Π° сС ΠΊΠΎΠ½ΡΡ‚Ρ€ΡƒΠΈΡˆΠ΅ домСнски нСзависна Ρ€Π΅ΠΏΡ€Π΅Π·Π΅Π½Ρ‚Π°Ρ†ΠΈΡ˜Π° са ΠΌΠΈΠ½ΠΈΠΌΠ°Π»Π½ΠΈΠΌ Ρ‚Ρ€ΡƒΠ΄ΠΎΠΌ СкспСрта Π·Π° ΠΏΡ€Π΅Ρ‚Ρ…ΠΎΠ΄Π½ΠΎ ΠΊΠΎΠ½Ρ„ΠΈΠ³ΡƒΡ€ΠΈΡΠ°ΡšΠ΅, Π·Π½Π°Ρ‡Π°Ρ˜Π½Π΅ Ρ„Ρ€Π°Π·Π΅ су Π΄Π΅Ρ‚Π΅ΠΊΡ‚ΠΎΠ²Π°Π½Π΅ Ρƒ Π²ΠΈΡˆΠ΅Ρ˜Π΅Π·ΠΈΡ‡Π½ΠΎΠΌ ΠΎΠΊΡ€ΡƒΠΆΠ΅ΡšΡƒ ΠΏΡ€ΠΈΠΌΠ΅Π½ΠΎΠΌ статистичких ΠΌΠ΅Ρ€Π° Π·Π° ΠΎΠ΄Ρ€Π΅Ρ’ΠΈΠ²Π°ΡšΠ΅ корСлисаности ΠΏΠ°Ρ€Π° Ρ€Π΅Ρ‡ΠΈ. Π“Ρ€Π°Ρ„ садрТи аутоматски издвојСнС Π·Π½Π°Ρ‡Π°Ρ˜Π½Π΅ Ρ„Ρ€Π°Π·Π΅ којС су ΠΏΠΎΠ²Π΅Π·Π°Π½Π΅ Π½Π° основу сличности сСмантичких контСкста. Π Π΅ΠΏΡ€Π΅Π·Π΅Π½Ρ‚Π°Ρ†ΠΈΡ˜Π° јС ΠΈΠΌΠΏΠ»Π΅ΠΌΠ΅Π½Ρ‚ΠΈΡ€Π°Π½Π° Ρƒ Π³Ρ€Π°Ρ„ΠΎΠ²ΡΠΊΠΎΡ˜ Π±Π°Π·ΠΈ ΠΏΠΎΠ΄Π°Ρ‚Π°ΠΊΠ° ΡˆΡ‚ΠΎ корисницима ΠΎΠΌΠΎΠ³ΡƒΡ›Π°Π²Π° Π΄Π° Π΄Π΅Ρ‚Π΅ΠΊΡ‚ΡƒΡ˜Ρƒ ΠΈ Π²ΠΈΠ·ΡƒΠ΅Π»ΠΈΠ·ΡƒΡ˜Ρƒ Ρ€Π°Π·Π»ΠΈΡ‡ΠΈΡ‚Π΅ скривСнС обрасцС Ρƒ ΠΏΠΎΠ΄Π°Ρ†ΠΈΠΌΠ°. НСинформативнС Ρ„Ρ€Π°Π·Π΅ су Ρ„ΠΈΠ»Ρ‚Ρ€ΠΈΡ€Π°Π½Π΅ ΠΊΡ€ΠΎΠ· поступкС ΠΎΠ΄Ρ€Π΅Ρ’ΠΈΠ²Π°ΡšΠ° Π΅Π½Ρ‚Ρ€ΠΎΠΏΠΈΡ˜Π΅ скупа контСкста ΠΈ динамичности сусСдства Ρ„Ρ€Π°Π·Π΅ ΠΊΡ€ΠΎΠ· вишС Π³Ρ€Π°Ρ„ΠΎΠ²Π° који ΠΏΡ€Π΅Π΄ΡΡ‚Π°Π²Ρ™Π°Ρ˜Ρƒ Ρ‚Ρ€Π΅Π½ΡƒΡ‚ΠΊΠ΅ Ρƒ Π²Ρ€Π΅ΠΌΠ΅Π½Ρƒ. ΠŸΡ€ΠΈΠΊΠ°Π·Π°Π½Π° јС хСуристика Π·Π° издвајањС комплСксних ΠΊΠΎΠ½Ρ†Π΅ΠΏΠ°Ρ‚Π°, заснована Π½Π° ΠΈΡ‚Π΅Ρ€Π°Ρ‚ΠΈΠ²Π½ΠΎΡ˜ ΠΏΡ€ΠΎΡ†Π΅Π΄ΡƒΡ€ΠΈ Π·Π° Π΄Π΅Ρ‚Π΅ΠΊΡ†ΠΈΡ˜Ρƒ блиских Ρ„Ρ€Π°Π·Π° којС ΠΏΡ€ΠΈΠΏΠ°Π΄Π°Ρ˜Ρƒ истом сСмантичком ΠΏΠΎΠ΄Π³Ρ€Π°Ρ„Ρƒ. ΠœΠΎΠ³ΡƒΡ›Π½ΠΎΡΡ‚ΠΈ ΠΏΡ€ΠΈΠΌΠ΅Π½Π΅ ΠΏΡ€Π΅Π΄Π»ΠΎΠΆΠ΅Π½Π΅ Ρ€Π΅ΠΏΡ€Π΅Π·Π΅Π½Ρ‚Π°Ρ†ΠΈΡ˜Π΅ су дСмонстриранС Π½Π° Π³Ρ€Π°Ρ„Ρƒ конструисаном Π·Π° ΠΏΠΎΡΡ‚ΠΎΡ˜Π΅Ρ›ΠΈ корпус Π΄ΠΎΠΊΡƒΠΌΠ΅Π½Π°Ρ‚Π° са ΠΌΠ΅Ρ’ΡƒΠ½Π°Ρ€ΠΎΠ΄Π½ΠΎΠ³ инвСстиционог ΠΏΡ€ΠΎΡ˜Π΅ΠΊΡ‚Π°.During a construction project lifecycle, an extensive corpus of unstructured or semi-structured text documents is generated. Traditional approaches for information storing and organizing are document-oriented, which is highly inconvenient for data analysis and knowledge extraction. The nature of unstructured sources impedes users’ acquisition, analysis, and reuse of relevant information, leading to possible negative effects in the project management process. This dissertation suggests a procedure for automatic extraction of relevant project concepts from unstructured text documents. Concepts are organized in the form of a key-phrase network, intended to provide users with the possibility to visualize and analyze valuable project facts with less effort. With the objective of constructing a domain-independent and language-independent key-phrase network, with minimal expert involvement for configuration, an approach to detect key phrases was examined by using measures of correlation for word pairs. A network contains key phrases automatically extracted from various types of unstructured documents, with relations based on the similarity of semantic contexts. The representation was implemented as a graph database, enabling project participants to extract and visualize various patterns in data. The problem of noisy key phrases was reduced by introducing the entropy score for a set of co-occurring contexts and the measure of phrase neighborhood dynamics throughout construction project lifecycle. A heuristic for extraction of complex concepts is presented, based on the iterative procedure for detection of adjacent key phrases belonging to a same semantic subnetwork. Possible applications, such as concept tracking through time or determination of communication patterns between project participants, is demonstrated using a key-phrase network generated for the existing document corpus from an international construction project

    Appropriately Incorporating Statistical Significance in PMI

    No full text
    Two recent measures incorporate the notion of statistical significance in basic PMI formulation. In some tasks, we find that the new measures perform worse than the PMI. Our analysis shows that while the basic ideas in incorporating statistical significance in PMI are reasonable, they have been applied slightly inappropriately. By fixing this, we get new measures that improve performance over not just PMI but on other popular co-occurrence measures as well. In fact, the revised measures perform reasonably well compared with more resource intensive non co-occurrence based methods also.
    corecore