1,077,753 research outputs found

    Disambiguation of Large-Scale Educational Network Data for Social Network Analysis

    Get PDF
    Social Network Analysis (SNA) is a research method which quantitatively maps qualitative social interactions between individuals who comprise a ‘network’. Delineating these social networks yields highly valuable data, including SNA measures like centrality, which can be used to measure social influence, connectivity, and more. Further, these networks can also be visualized by graphing individuals as ‘nodes’, and then by drawing ‘edges’ (the lines that connect them) to produce sociograms. With these sociograms, researchers can concurrently conduct visual and statistical analysis of relations between node and network traits of interest. As a result of these capabilities and the growth of social learning, SNA has become increasingly popular in educational settings. However, the difficulties in consolidating students’ interaction data into quantitative networks has steered SNA researchers towards oversimplified social environments which do not exhibit ambiguous connections. For example, research is well established in observing students’ online interactions, where participant information is collected concurrent with interaction data. Similarly, SNA studies in face-to-face contexts are typically bounded to single classrooms which greatly reduces the number of participants’ possible ties. These examples observe environments that are easily monitored, and bar the observation of the true underlying social networks. Hence, a gap exists for those hoping to understand the true, non-course-bounded networks of undergraduate students. To this end, our research group is currently conducting a study comparing all participating freshmen and sophomore engineering students’ interactions to academic outcomes at USU using social network analysis. The current disambiguation process for this large (1000+ nodes), ambiguous (open response name identified ties), interaction data is manually intensive, intricate, and takes careful organization--increasing with network size. Therefore, charged with the task of interaction data disambiguation, I have organized the overarching disambiguation task into a hybrid blend of automation and manual stages, to take advantage of emerging network information throughout the process (i.e., previously ambiguous responses are connected to an entity). The procedural analysis I present in this poster then highlights the technicalities of these stages, which begin with simple spelling checks, and end with a sub-network comparison process (similar to agglomerative hierarchical clustering) to yield accurate and complete network data. My methodology proved effective in matching many ambiguous names with their counterparts found elsewhere in the network. However, several names were still unable to be consolidated and had to be de-identified in their present conditions. Therefore, this presentation also highlights procedures that could be refined through modern SNA clustering methods, including possibilities for defining supervised algorithm parameters and comparing our manual to automated results for such algorithms. As the need for understanding the relationship between interpersonal interactions and educational outcomes expands, so too are the needs for improved SNA methods. To meet these growing needs, researchers must develop new and effective procedures for the disambiguation of authentic interaction data. This presentation provides an example of such research, developing and disseminating more effective and efficient approaches for network development.https://digitalcommons.usu.edu/fsrs2021/1005/thumbnail.jp

    Early Warning Analysis for Social Diffusion Events

    Get PDF
    There is considerable interest in developing predictive capabilities for social diffusion processes, for instance to permit early identification of emerging contentious situations, rapid detection of disease outbreaks, or accurate forecasting of the ultimate reach of potentially viral ideas or behaviors. This paper proposes a new approach to this predictive analytics problem, in which analysis of meso-scale network dynamics is leveraged to generate useful predictions for complex social phenomena. We begin by deriving a stochastic hybrid dynamical systems (S-HDS) model for diffusion processes taking place over social networks with realistic topologies; this modeling approach is inspired by recent work in biology demonstrating that S-HDS offer a useful mathematical formalism with which to represent complex, multi-scale biological network dynamics. We then perform formal stochastic reachability analysis with this S-HDS model and conclude that the outcomes of social diffusion processes may depend crucially upon the way the early dynamics of the process interacts with the underlying network's community structure and core-periphery structure. This theoretical finding provides the foundations for developing a machine learning algorithm that enables accurate early warning analysis for social diffusion events. The utility of the warning algorithm, and the power of network-based predictive metrics, are demonstrated through an empirical investigation of the propagation of political memes over social media networks. Additionally, we illustrate the potential of the approach for security informatics applications through case studies involving early warning analysis of large-scale protests events and politically-motivated cyber attacks

    Multi-Scale Link Prediction

    Full text link
    The automated analysis of social networks has become an important problem due to the proliferation of social networks, such as LiveJournal, Flickr and Facebook. The scale of these social networks is massive and continues to grow rapidly. An important problem in social network analysis is proximity estimation that infers the closeness of different users. Link prediction, in turn, is an important application of proximity estimation. However, many methods for computing proximity measures have high computational complexity and are thus prohibitive for large-scale link prediction problems. One way to address this problem is to estimate proximity measures via low-rank approximation. However, a single low-rank approximation may not be sufficient to represent the behavior of the entire network. In this paper, we propose Multi-Scale Link Prediction (MSLP), a framework for link prediction, which can handle massive networks. The basis idea of MSLP is to construct low rank approximations of the network at multiple scales in an efficient manner. Based on this approach, MSLP combines predictions at multiple scales to make robust and accurate predictions. Experimental results on real-life datasets with more than a million nodes show the superior performance and scalability of our method.Comment: 20 pages, 10 figure

    X-Vine: Secure and Pseudonymous Routing Using Social Networks

    Full text link
    Distributed hash tables suffer from several security and privacy vulnerabilities, including the problem of Sybil attacks. Existing social network-based solutions to mitigate the Sybil attacks in DHT routing have a high state requirement and do not provide an adequate level of privacy. For instance, such techniques require a user to reveal their social network contacts. We design X-Vine, a protection mechanism for distributed hash tables that operates entirely by communicating over social network links. As with traditional peer-to-peer systems, X-Vine provides robustness, scalability, and a platform for innovation. The use of social network links for communication helps protect participant privacy and adds a new dimension of trust absent from previous designs. X-Vine is resilient to denial of service via Sybil attacks, and in fact is the first Sybil defense that requires only a logarithmic amount of state per node, making it suitable for large-scale and dynamic settings. X-Vine also helps protect the privacy of users social network contacts and keeps their IP addresses hidden from those outside of their social circle, providing a basis for pseudonymous communication. We first evaluate our design with analysis and simulations, using several real world large-scale social networking topologies. We show that the constraints of X-Vine allow the insertion of only a logarithmic number of Sybil identities per attack edge; we show this mitigates the impact of malicious attacks while not affecting the performance of honest nodes. Moreover, our algorithms are efficient, maintain low stretch, and avoid hot spots in the network. We validate our design with a PlanetLab implementation and a Facebook plugin.Comment: 15 page

    On the discovery of social roles in large scale social systems

    Get PDF
    The social role of a participant in a social system is a label conceptualizing the circumstances under which she interacts within it. They may be used as a theoretical tool that explains why and how users participate in an online social system. Social role analysis also serves practical purposes, such as reducing the structure of complex systems to rela- tionships among roles rather than alters, and enabling a comparison of social systems that emerge in similar contexts. This article presents a data-driven approach for the discovery of social roles in large scale social systems. Motivated by an analysis of the present art, the method discovers roles by the conditional triad censuses of user ego-networks, which is a promising tool because they capture the degree to which basic social forces push upon a user to interact with others. Clusters of censuses, inferred from samples of large scale network carefully chosen to preserve local structural prop- erties, define the social roles. The promise of the method is demonstrated by discussing and discovering the roles that emerge in both Facebook and Wikipedia. The article con- cludes with a discussion of the challenges and future opportunities in the discovery of social roles in large social systems

    Community Graph Sequence with Sequence Data of Network Structured Data

    Get PDF
    Recently, there has been increasing interest in data analysis for network structured data. The network structured data is represented the relation between one data and other data by graph structure. There are many network structured data such as social networks, biological networks in the real world. In this study, we will analysis the network structured data that has dynamic relation and complex interact with each data. And, we will approach the problem that is to extract transition pattern from the history of temporal change in their network structured data. Especially, in this paper, we will apply community graph sequences to graph sequences of network structured data that has large-scale and complex changes, and propose the method of extracting transition pattern of network structured data. We used social bookmark data as the data streams of analysis object and verified that social bookmark data is the network structured data that has large-scale and complex change
    • …
    corecore