2 research outputs found

    Automatic Categorization of LGBT User Profiles on Twitter with Machine Learning

    Get PDF
    Privacy needs and stigma pose significant barriers to lesbian, gay, bisexual, and transgender (LGBT) people sharing information related to their identities in traditional settings and research methods such as surveys and interviews. Fortunately, social media facilitates people’s belonging to and exchanging information within online LGBT communities. Compared to heterosexual respondents, LGBT users are also more likely to have accounts on social media websites and access social media daily. However, the current relevant LGBT studies on social media are not efficient or assume that any accounts that utilize LGBT-related words in their profile belong to individuals who identify as LGBT. Our human coding of over 16,000 accounts instead proposes the following three categories of LGBT Twitter users: individual, sexual worker/porn, and organization. This research develops a machine learning classifier based on the profile and bio features of these Twitter accounts. To have an efficient and effective process, we use a feature selection method to reduce the number of features and improve the classifier’s performance. Our approach achieves a promising result with around 88% accuracy. We also develop statistical analyses to compare the three categories based on the average weight of top features

    Managing Bigger Online Data

    No full text
    q90004170000C1 -- EL_35_4_Text_V02 -- Social science data repositories in data deluge -- 1. Introduction -- 2. Literature review -- 3. Methodology -- 4. Findings -- 5. Discussion and implications -- 6. Conclusion -- References -- Impact of device on search pattern transitions -- Introduction -- Literature review -- Methodology -- Results analysis -- Query reformulation patterns -- Discussion -- Conclusion -- References -- Exploring topics related to data mining on Wikipedia -- Introduction -- Literature review -- Research methodology -- Results and discussion -- Conclusion -- References -- A paper-text perspective -- 1. Introduction -- 2. Related works -- 3. Methodology -- 4. Distribution of various granularity features -- 5. CDC analysis for various granularity features -- 6. Effectiveness analysis for features' CDC in application -- 7. Conclusion -- References -- The exploration of information extraction and analysis about science and technology policy in China -- 1. Introduction -- 2. Literature review -- 3. Analysis model and method for science and technology policy -- 4. Realization of information extraction and analysis from science and technology policies -- 5. Conclusions -- References -- Semantically linking events for massive scientific literature research -- 1. Introduction -- 2. Related work -- 3. Event representation and operation -- 4. Event space model and operation -- 5. Discussion -- 6. Conclusion -- References -- Information diffusion on communication networks based on Big Data analysis -- Introduction -- Literature review -- Model description -- Communication network construction -- Results and discussion -- Conclusion -- References -- Predicting users' demographic characteristics in a Chinese social media network -- Introduction -- Literature review -- Data and methodology -- Experiments and results analysis -- DiscussionConclusion -- References -- Emotion evolutions of sub-topics about popular events on microblogs -- Introduction -- Literature review -- Methodology -- Experiments and results analysis -- Discussion -- Conclusion and future work -- References -- The role of academic libraries in research data service (RDS) provision -- Introduction -- Methodology -- Literature review -- Findings and discussion -- Conclusion -- References -- A belief-desire-intention model for blog users' negative emotional norm com ... -- Introduction -- Literature review -- Definition of the model -- Experiments -- Discussion -- Conclusions and future research -- References -- Measuring global research activities using geographic data of scholarly article visits -- Introduction -- Literature review -- Data -- Results -- Discussion -- Conclusions -- References -- Guest editorial: managing bigger online data -- Introduction -- Articles in this issue -- Summary -- ReferencesDescription based on publisher supplied metadata and other sources.Electronic reproduction. Ann Arbor, Michigan : ProQuest Ebook Central, YYYY. Available via World Wide Web. Access may be limited to ProQuest Ebook Central affiliated libraries
    corecore