
    Social media analytics: a survey of techniques, tools and platforms

    This paper is written for (social science) researchers seeking to analyze the wealth of social media now available. It presents a comprehensive review of software tools for social networking media, wikis, really simple syndication feeds, blogs, newsgroups, chat and news feeds. For completeness, it also includes introductions to social media scraping, storage, data cleaning and sentiment analysis. Although principally a review, the paper also provides a methodology and a critique of social media tools. Analyzing social media, in particular Twitter feeds for sentiment analysis, has become a major research and business activity due to the availability of web-based application programming interfaces (APIs) provided by Twitter, Facebook and news services. This has led to an ‘explosion’ of data services, software tools for scraping and analysis, and social media analytics platforms. It is also a research area undergoing rapid change and evolution due to commercial pressures and the potential for using social media data for computational (social science) research. Using a simple taxonomy, this paper provides a review of leading software tools and how to use them to scrape, cleanse and analyze the spectrum of social media. In addition, it discusses the requirement for an experimental computational environment for social media research and presents, as an illustration, the system architecture of a social media (analytics) platform built by University College London. The principal contribution of this paper is to provide an overview (including code fragments) for scientists seeking to utilize social media scraping and analytics either in their research or business. The data retrieval techniques presented in this paper are valid at the time of writing (June 2014) but are subject to change, since social media data scraping APIs are evolving rapidly.
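    As an illustration of the simplest family of sentiment analysis techniques the paper surveys, a lexicon-based scorer can be sketched in a few lines. The tiny word lists below are illustrative assumptions, not a real sentiment lexicon and not code from the paper:

```python
# Minimal lexicon-based sentiment scorer: counts positive and negative
# word hits and normalizes the difference. The word sets are toy
# placeholders standing in for a real sentiment lexicon.
POSITIVE = {"good", "great", "excellent", "love", "happy"}
NEGATIVE = {"bad", "terrible", "awful", "hate", "sad"}

def sentiment_score(text: str) -> float:
    """Return a score in [-1, 1]: (positive hits - negative hits) / total
    hits, or 0.0 when no lexicon word appears in the text."""
    tokens = [t.strip(".,!?;:").lower() for t in text.split()]
    pos = sum(t in POSITIVE for t in tokens)
    neg = sum(t in NEGATIVE for t in tokens)
    total = pos + neg
    return 0.0 if total == 0 else (pos - neg) / total

print(sentiment_score("I love this great tool"))   # 1.0
print(sentiment_score("terrible, awful service"))  # -1.0
```

    Production systems replace the hand-built word sets with large curated lexicons or trained classifiers, but the scoring skeleton is the same.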

    Text Mining with HathiTrust: Empowering Librarians to Support Digital Scholarship Research

    This workshop will introduce attendees to text analysis research and the common methods and tools used in this emerging area of scholarship, with particular attention to the HathiTrust Research Center. The workshop's train-the-trainer curriculum will provide a framework for how librarians can support text data mining, as well as teach transferable skills useful for many other areas of digital scholarly inquiry. Topics include: an introduction to gathering, managing, analyzing, and visualizing textual data; hands-on experience with text analysis tools, including the HTRC's off-the-shelf algorithms and datasets, such as the HTRC Extracted Features; and using the command line to run basic text analysis processes. No experience necessary! Attendees must bring a laptop.
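    The "basic text analysis processes" such workshops run from the command line can be sketched with a word-frequency count. This is a generic stdlib illustration only; it does not use the HTRC Extracted Features or any HTRC API:

```python
# Basic text analysis: tokenize a text and count word frequencies.
# Generic sketch of a workshop-style exercise, not an HTRC workflow.
from collections import Counter
import re

def word_frequencies(text: str, top_n: int = 3):
    """Lowercase the text, tokenize on letters/apostrophes, and return
    the top_n most common words as (word, count) pairs."""
    tokens = re.findall(r"[a-z']+", text.lower())
    return Counter(tokens).most_common(top_n)

sample = "To be or not to be, that is the question."
print(word_frequencies(sample))  # [('to', 2), ('be', 2), ('or', 1)]
```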

    CcNav: Understanding Compiler Optimizations in Binary Code

    Program developers spend significant time on optimizing and tuning programs. During this iterative process, they apply optimizations, analyze the resulting code, and modify the compilation until they are satisfied. Understanding what the compiler did with the code is crucial to this process but is very time-consuming and labor-intensive. Users need to navigate through thousands of lines of binary code and correlate it to source code concepts to understand the results of the compilation and to identify optimizations. We present a design study in collaboration with program developers and performance analysts. Our collaborators work with various artifacts related to the program, such as binary code, source code, control flow graphs, and call graphs. Through interviews, feedback, and pair-analytics sessions, we analyzed their tasks and workflow. Based on this task analysis and through a human-centric design process, we designed a visual analytics system, Compilation Navigator (CcNav), to aid exploration of the effects of compiler optimizations on the program. CcNav provides a streamlined workflow and a unified context that integrates disparate artifacts. CcNav supports consistent interactions across all the artifacts, making it easy to correlate binary code with source code concepts. CcNav enables users to navigate and filter large bodies of binary code to identify and summarize optimizations such as inlining, vectorization, loop unrolling, and code hoisting. We evaluate CcNav through guided sessions and semi-structured interviews. We reflect on our design process, particularly the immersive elements, and on the transferability of design studies through our experience with a previous design study on program analysis. Comment: IEEE VIS VAST 202

    Data analytics 2016: proceedings of the fifth international conference on data analytics


    Using Random Forests to Describe Equity in Higher Education: A Critical Quantitative Analysis of Utah’s Postsecondary Pipelines

    The following work examines the Random Forest (RF) algorithm as a tool for predicting student outcomes and interrogating the equity of postsecondary education pipelines. The RF model, created using longitudinal data of 41,303 students from Utah's 2008 high school graduation cohort, is compared to logistic and linear models, which are commonly used to predict college access and success. Substantively, this work finds high school GPA to be the best predictor of postsecondary GPA, whereas commonly used ACT and AP test scores are not nearly as important. Each model identified several demographic disparities in higher education access, most significantly the effects of individual-level economic disadvantage. District- and school-level factors such as the proportion of low-income students and the proportion of Underrepresented Racial Minority (URM) students were important and negatively associated with postsecondary success. Methodologically, the RF model was able to capture non-linearity in the predictive power of school- and district-level variables, a key finding which was undetectable using linear models. The RF algorithm outperforms logistic models in predicting student enrollment, performs similarly to linear models in predicting postsecondary GPA, and surpasses both models in its descriptions of non-linear variable relationships. RF provides novel interpretations of data, challenges conclusions from linear models, and has enormous potential to further the literature around equity in postsecondary pipelines.
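    The kind of model comparison described above can be sketched with scikit-learn on synthetic data. The two features and the threshold rule below are invented stand-ins for predictors like high school GPA and ACT scores, not the Utah cohort data:

```python
# Sketch of a random-forest vs. logistic-regression comparison on
# synthetic enrollment data with a deliberate non-linear (threshold)
# effect, which a linear decision boundary cannot fully capture.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(0)
n = 2000
hs_gpa = rng.uniform(1.5, 4.0, n)      # stand-in for high school GPA
test_score = rng.uniform(10.0, 36.0, n)  # stand-in for ACT score
# Enrollment here follows a threshold rule: a non-linear effect.
enrolled = ((hs_gpa > 3.0) | (test_score > 30.0)).astype(int)

X = np.column_stack([hs_gpa, test_score])
X_tr, X_te, y_tr, y_te = train_test_split(X, enrolled, random_state=0)

rf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X_tr, y_tr)
lr = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)

rf_acc = accuracy_score(y_te, rf.predict(X_te))
lr_acc = accuracy_score(y_te, lr.predict(X_te))
print(f"RF accuracy: {rf_acc:.3f}")
print(f"LR accuracy: {lr_acc:.3f}")
# Trees split on thresholds directly, so the forest recovers the rule;
# the single linear boundary of logistic regression cannot.
```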

    Visualizing genome and systems biology: technologies, tools, implementation techniques and trends, past, present and future.

    "A picture is worth a thousand words." This widely used adage sums up in a few words the notion that a successful visual representation of a concept should enable easy and rapid absorption of large amounts of information. Although, in general, the notion of capturing complex ideas using images is very appealing, would 1000 words be enough to describe the unknown in a research field such as the life sciences? Life sciences is one of the biggest generators of enormous datasets, mainly as a result of recent and rapid technological advances; their complexity can make these datasets incomprehensible without effective visualization methods. Here we discuss the past, present and future of genomic and systems biology visualization. We briefly comment on many visualization and analysis tools and the purposes that they serve. We focus on the latest libraries and programming languages that enable more effective, efficient and faster approaches for visualizing biological concepts, and also comment on future human-computer interaction trends that could enhance visualization further.

    Geovisual analytics for spatial decision support: Setting the research agenda

    This article summarizes the results of the workshop on Visualization, Analytics & Spatial Decision Support, which took place at the GIScience conference in September 2006. The discussions at the workshop and an analysis of the state of the art have revealed a need for concerted cross‐disciplinary efforts to achieve substantial progress in supporting space‐related decision making. The size and complexity of real‐life problems, together with their ill‐defined nature, call for a true synergy between the power of computational techniques and the human capabilities to analyze, envision, reason, and deliberate. Existing methods and tools are as yet far from enabling this synergy. Appropriate methods can only appear as a result of focused research building on the achievements in the fields of geovisualization and information visualization, human‐computer interaction, geographic information science, operations research, data mining and machine learning, decision science, cognitive science, and other disciplines. The name ‘Geovisual Analytics for Spatial Decision Support’, suggested for this new research direction, emphasizes the importance of visualization and interactive visual interfaces and the link with the emerging research discipline of Visual Analytics. This article, as well as the whole special issue, is meant to attract the attention of scientists with relevant expertise and interests to the major challenges requiring multidisciplinary efforts, and to promote the establishment of a dedicated research community where an appropriate range of competences is combined with an appropriate breadth of thinking.

    Introductory programming: a systematic literature review

    As computing becomes a mainstream discipline embedded in the school curriculum and acts as an enabler for an increasing range of academic disciplines in higher education, the literature on introductory programming is growing. Although there have been several reviews that focus on specific aspects of introductory programming, there has been no broad overview of the literature exploring recent trends across the breadth of introductory programming. This paper is the report of an ITiCSE working group that conducted a systematic review in order to gain an overview of the introductory programming literature. Partitioning the literature into papers addressing the student, teaching, the curriculum, and assessment, we explore trends, highlight advances in knowledge over the past 15 years, and indicate possible directions for future research.