117 research outputs found
The Types, Roles, and Practices of Documentation in Data Analytics Open Source Software Libraries: A Collaborative Ethnography of Documentation Work
Computational research and data analytics increasingly relies on complex
ecosystems of open source software (OSS) "libraries" -- curated collections of
reusable code that programmers import to perform a specific task. Software
documentation for these libraries is crucial in helping programmers/analysts
know what libraries are available and how to use them. Yet documentation for
open source software libraries is widely considered low-quality. This article
is a collaboration between CSCW researchers and contributors to data analytics
OSS libraries, based on ethnographic fieldwork and qualitative interviews. We
examine several issues around the formats, practices, and challenges around
documentation in these largely volunteer-based projects. There are many
different kinds and formats of documentation that exist around such libraries,
which play a variety of educational, promotional, and organizational roles. The
work behind documentation is similarly multifaceted, including writing,
reviewing, maintaining, and organizing documentation. Different aspects of
documentation work require contributors to have different sets of skills and
overcome various social and technical barriers. Finally, most of our
interviewees do not report high levels of intrinsic enjoyment for doing
documentation work (compared to writing code). Their motivation is affected by
personal and project-specific factors, such as the perceived level of credit
for doing documentation work versus more "technical" tasks like adding new
features or fixing bugs. In studying documentation work for data analytics OSS
libraries, we gain a new window into the changing practices of data-intensive
research, as well as help practitioners better understand how to support this
often invisible and infrastructural work in their projects
iEEG-BIDS, extending the Brain Imaging Data Structure specification to human intracranial electrophysiology
MNE-BIDS: Organizing electrophysiological data into the BIDS format and facilitating their analysis
Beyond advertising: New infrastructures for publishing integrated research objects
ABSTRACT: Moving beyond static text and illustrations is a central challenge for scientific publishing in the 21st century. As early as 1995, Donoho and Buckheit paraphrased John Claerbout that “an article about [a] computational result is advertising, not scholarship. The actual scholarship is the full software environment, code and data, that produced the result” [1]. Awareness of this problem has only grown over the last 25 years; nonetheless, scientific publishing infrastructures remain remarkably resistant to change [2]. Even as these infrastructures have largely stagnated, the internet has ushered in a transition “from the wet lab to the web lab” [3]. New expectations have emerged in this shift, but these expectations must play against the reality of currently available infrastructures and associated sociological pressures. Here, we compare current scientific publishing norms against those associated with online content more broadly, and we argue that meeting the “Claerbout challenge” of providing the full software environment, code, and data supporting a scientific result will require open infrastructure development to create environments for authoring, reviewing, and accessing interactive research objects
Multivoxel codes for representing and integrating acoustic features in human cortex
Using fMRI and multivariate pattern analysis, we determined whether acoustic features are represented by independent or integrated neural codes in human cortex. Male and female listeners heard band-pass noise varying simultaneously in spectral (frequency) and temporal (amplitude-modulation [AM] rate) features. In the superior temporal plane, changes in multivoxel activity due to frequency were largely invariant with respect to AM rate (and vice versa), consistent with an independent representation. In contrast, in posterior parietal cortex, neural representation was exclusively integrated and tuned to specific conjunctions of frequency and AM features. Direct between-region comparisons show that whereas independent coding of frequency and AM weakened with increasing levels of the hierarchy, integrated coding strengthened at the transition between non-core and parietal cortex. Our findings support the notion that primary auditory cortex can represent component acoustic features in an independent fashion and suggest a role for parietal cortex in feature integration and the structuring of acoustic input.
Significance statement A major goal for neuroscience is discovering the sensory features to which the brain is tuned and how those features are integrated into cohesive perception. We used whole-brain human fMRI and a statistical modeling approach to quantify the extent to which sound features are represented separately or in an integrated fashion in cortical activity patterns. We show that frequency and AM rate, two acoustic features that are fundamental to characterizing biological important sounds such as speech, are represented separately in primary auditory cortex but in an integrated fashion in parietal cortex. These findings suggest that representations in primary auditory cortex can be simpler than previously thought and also implicate a role for parietal cortex in integrating features for coherent perception
iEEG-BIDS, extending the Brain Imaging Data Structure specification to human intracranial electrophysiology
The Brain Imaging Data Structure (BIDS) is a community-driven specification for organizing neuroscience data and metadata with the aim to make datasets more transparent, reusable, and reproducible. Intracranial electroencephalography (iEEG) data offer a unique combination of high spatial and temporal resolution measurements of the living human brain. To improve internal (re)use and external sharing of these unique data, we present a specification for storing and sharing iEEG data: iEEG-BIDS
Encoding and decoding models in cognitive electrophysiology
Contains fulltext :
179395.pdf (publisher's version ) (Open Access)Cognitive neuroscience has seen rapid growth in the size and complexity of data recorded from the human brain as well as in the computational tools available to analyze this data. This data explosion has resulted in an increased use of multivariate, model-based methods for asking neuroscience questions, allowing scientists to investigate multiple hypotheses with a single dataset, to use complex, time-varying stimuli, and to study the human brain under more naturalistic conditions. These tools come in the form of "Encoding" models, in which stimulus features are used to model brain activity, and "Decoding" models, in which neural features are used to generated a stimulus output. Here we review the current state of encoding and decoding models in cognitive electrophysiology and provide a practical guide toward conducting experiments and analyses in this emerging field. Our examples focus on using linear models in the study of human language and audition. We show how to calculate auditory receptive fields from natural sounds as well as how to decode neural recordings to predict speech. The paper aims to be a useful tutorial to these approaches, and a practical introduction to using machine learning and applied statistics to build models of neural activity. The data analytic approaches we discuss may also be applied to other sensory modalities, motor systems, and cognitive systems, and we cover some examples in these areas. In addition, a collection of Jupyter notebooks is publicly available as a complement to the material covered in this paper, providing code examples and tutorials for predictive modeling in python. The aim is to provide a practical understanding of predictive modeling of human brain data and to propose best-practices in conducting these analyses.24 p
- …
