Abstract

An increasing amount of social networks usersgenerated data is the most remarkable research challenge nowadays. Despite the progress in the field of semistructured data processing algorithms creation, even initial data collection could not be treated as issues that have been optimally solved. The paper covers a high-level overview of the automated social media content search system. The proposed structure enables to implement instruments for multisource content extraction tasks as well as supporting of identification processes of new patterns, which describe a certain type of content. Issues of Search engine organization, logically unified extracted data repository and possible content classification techniques with the appropriate knowledge base's application are considered. Under the work, existing approaches and automated web-data extraction methods have been analyzed; social media API's functions and limits, as well as ways of semistructured data storage system organization, have been studied. The planned result's application area is automation and informational support of sociological research based on the social media content analysis techniques namely a content propagation simulation in interconnected groups; social and personal anomy study; clarification of the weak linkage's strength concept

    Similar works