Search CORE

952 research outputs found

Identification-method research for open-source software ecosystems

Author: Liao Zhifang
Liu Hui
Liu Shengzong
Wang Ningwei
Zhang Qi
Zhang Yan
Publication venue: 'MDPI AG'
Publication date: 01/02/2019
Field of study

In recent years, open-source software (OSS) development has grown, with many developers around the world working on different OSS projects. A variety of open-source software ecosystems have emerged, for instance, GitHub, StackOverflow, and SourceForge. One of the most typical social-programming and code-hosting sites, GitHub, has amassed numerous open-source-software projects and developers in the same virtual collaboration platform. Since GitHub itself is a large open-source community, it hosts a collection of software projects that are developed together and coevolve. The great challenge here is how to identify the relationship between these projects, i.e., project relevance. Software-ecosystem identification is the basis of other studies in the ecosystem. Therefore, how to extract useful information in GitHub and identify software ecosystems is particularly important, and it is also a research area in symmetry. In this paper, a Topic-based Project Knowledge Metrics Framework (TPKMF) is proposed. By collecting the multisource dataset of an open-source ecosystem, project-relevance analysis of the open-source software is carried out on the basis of software-ecosystem identification. Then, we used our Spectral Clustering algorithm based on Core Project (CP-SC) to identify software-ecosystem projects and further identify software ecosystems. We verified that most software ecosystems usually contain a core software project, and most other projects are associated with it. Furthermore, we analyzed the characteristics of the ecosystem, and we also found that interactive information has greater impact on project relevance. Finally, we summarize the Topic-based Project Knowledge Metrics Framework

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals

ResearchOnline@GCU

Estimating causal networks in biosphere–atmosphere interaction with the PCMCI approach

Author: Carrara A.
El-Madany T.
Krich C.
Mahecha M.
Migliavacca M.
Miralles D.
Perez‑Priego O.
Runge J.
Publication venue: 'Copernicus GmbH'
Publication date: 01/01/2020
Field of study

Local meteorological conditions and biospheric activity are tightly coupled. Understanding these links is an essential prerequisite for predicting the Earth system under climate change conditions. However, many empirical studies on the interaction between the biosphere and the atmosphere are based on correlative approaches that are not able to deduce causal paths, and only very few studies apply causal discovery methods. Here, we use a recently proposed causal graph discovery algorithm, which aims to reconstruct the causal dependency structure underlying a set of time series. We explore the potential of this method to infer temporal dependencies in biosphere-atmosphere interactions. Specifically we address the following questions: How do periodicity and heteroscedasticity influence causal detection rates, i.e. the detection of existing and non-existing links? How consistent are results for noise-contaminated data? Do results exhibit an increased information content that justifies the use of this causal-inference method? We explore the first question using artificial time series with well known dependencies that mimic real-world biosphere-atmosphere interactions. The two remaining questions are addressed jointly in two case studies utilizing observational data. Firstly, we analyse three replicated eddy covariance datasets from a Mediterranean ecosystem at half hourly time resolution allowing us to understand the impact of measurement uncertainties. Secondly, we analyse global NDVI time series (GIMMS 3g) along with gridded climate data to study large-scale climatic drivers of vegetation greenness. Overall, the results confirm the capacity of the causal discovery method to extract time-lagged linear dependencies under realistic settings. The violation of the method's assumptions increases the likelihood to detect false links. Nevertheless, we consistently identify interaction patterns in observational data. Our findings suggest that estimating a directed biosphere-atmosphere network at the ecosystem level can offer novel possibilities to unravel complex multi-directional interactions. Other than classical correlative approaches, our findings are constrained to a few meaningful set of relations which can be powerful insights for the evaluation of terrestrial ecosystem models

Institute of Transport Research:Publications

Ghent University Academic Bibliography

MPG.PuRe

An Introduction to Software Ecosystems

Author: De Roover Coen
Mens Tom
Publication venue
Publication date: 28/07/2023
Field of study

This chapter defines and presents different kinds of software ecosystems. The focus is on the development, tooling and analytics aspects of software ecosystems, i.e., communities of software developers and the interconnected software components (e.g., projects, libraries, packages, repositories, plug-ins, apps) they are developing and maintaining. The technical and social dependencies between these developers and software components form a socio-technical dependency network, and the dynamics of this network change over time. We classify and provide several examples of such ecosystems. The chapter also introduces and clarifies the relevant terms needed to understand and analyse these ecosystems, as well as the techniques and research methods that can be used to analyse different aspects of these ecosystems.Comment: Preprint of chapter "An Introduction to Software Ecosystems" by Tom Mens and Coen De Roover, published in the book "Software Ecosystems: Tooling and Analytics" (eds. T. Mens, C. De Roover, A. Cleve), 2023, ISBN 978-3-031-36059-6, reproduced with permission of Springer. The final authenticated version of the book and this chapter is available online at: https://doi.org/10.1007/978-3-031-36060-

arXiv.org e-Print Archive

A Cross-Repository Model for Predicting Popularity in GitHub

Author: Bidoki Neda Hajiakhoond
Garibay Ivan
Keathley Heather
Sukthankar Gita
Publication venue
Publication date: 01/12/2018
Field of study

Social coding platforms, such as GitHub, can serve as natural laboratories for studying the diffusion of innovation through tracking the pattern of code adoption by programmers. This paper focuses on the problem of predicting the popularity of software repositories over time; our aim is to forecast the time series of popularity-related events (code forks and watches). In particular, we are interested in cross-repository patterns-how do events on one repository affect other repositories? Our proposed LSTM (Long Short-Term Memory) recurrent neural network integrates events across multiple active repositories, outperforming a standard ARIMA (Auto-Regressive Integrated Moving Average) time series prediction based on the single repository. The ability of the LSTM to leverage cross-repository information gives it a significant edge over standard time series forecasting.Comment: 6 page

arXiv.org e-Print Archive

Crossref

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)

A large-scale empirical exploration on refactoring activities in open source software projects

Author: Bacchelli Alberto
Gall Harald C
Grano Giovanni
Palomba Fabio
Vassallo Carmine
Publication venue: 'Elsevier BV'
Publication date: 01/07/2019
Field of study

Refactoring is a well-established practice that aims at improving the internal structure of a software system without changing its external behavior. Existing literature provides evidence of how and why developers perform refactoring in practice. In this paper, we continue on this line of research by performing a large-scale empirical analysis of refactoring practices in 200 open source systems. Specifically, we analyze the change history of these systems at commit level to investigate: (i) whether developers perform refactoring operations and, if so, which are more diffused and (ii) when refactoring operations are applied, and (iii) which are the main developer-oriented factors leading to refactoring. Based on our results, future research can focus on enabling automatic support for less frequent refactorings and on recommending refactorings based on the developer's workload, project's maturity and developer's commitment to the project

Crossref

ZORA

Three Essays on Growth and Innovation of Digital Platforms

Author: Hukal Philipp
Publication venue
Publication date
Field of study

Digital platforms are complex digital technology arrangements that enable the interaction of otherwise unaffiliated organisations. This interaction often generates novel outputs and as a result digital platforms are seen as a powerful driver of digital innovation. Yet exactly how digital platforms generate innovations by facilitating interaction merits further investigation. This dissertation illustrates aspects of how platforms grow and innovate using the case of the open-geo data platform OpenStreetMap. The study draws from both quantitative as well as qualitative analysis techniques applied to highly detailed data capturing the use, design, and operation of the platform over more than ten years. A series of computationally-intensive, mixedmethods studies were conducted to utilise the full scale of available empirical material while maintaining contextual richness relevant to the case. Embedded in recent topics on digital platforms, three empirical studies are presented. Each study focuses on one aspect of growth and innovation on digital platforms. The studies specifically examine; (i) how platform operators can stimulate generativity, that is the generation of novel outputs without direct input by the operator, (ii), how the unique attributes of digital technologies enable the creation of complex ecosystems that allow for highpaced changes in a platform’s architecture even if that increases the structural complexity of a platform, and, (iii) how participants coordinate contributions to a platform’s operation when they cannot rely on stable interfaces. Collectively these studies contribute to the understanding of how platforms generate new digital innovations

Warwick Research Archives Portal Repository