2,540 research outputs found
Evaluating Maintainability Prejudices with a Large-Scale Study of Open-Source Projects
Exaggeration or context changes can render maintainability experience into
prejudice. For example, JavaScript is often seen as least elegant language and
hence of lowest maintainability. Such prejudice should not guide decisions
without prior empirical validation. We formulated 10 hypotheses about
maintainability based on prejudices and test them in a large set of open-source
projects (6,897 GitHub repositories, 402 million lines, 5 programming
languages). We operationalize maintainability with five static analysis
metrics. We found that JavaScript code is not worse than other code, Java code
shows higher maintainability than C# code and C code has longer methods than
other code. The quality of interface documentation is better in Java code than
in other code. Code developed by teams is not of higher and large code bases
not of lower maintainability. Projects with high maintainability are not more
popular or more often forked. Overall, most hypotheses are not supported by
open-source data.Comment: 20 page
Recommended from our members
Technology entrepreneurship and value creation on open innovation platforms
This dissertation studies how entrepreneurial firms create economic value from open source technology platforms, interfaces on which firms disclose knowledge and distribute innovation for free without retaining any proprietary rights. Despite their increasing importance in innovation and growing popularity among profit-seeking new ventures, open source platforms present a major challenge for value creation, as they lack price signals to guide ventures’ transactions and forfeit ventures’ control over key resources and knowledge for innovation. Those features are in contrast with the fundamental assumption about price and revenue in economics. They also run counter to the central tenet in strategy research that private knowledge and rare resources are central to competitive advantage and profiting from innovation.
To address this puzzle about value creation from free technologies base on free knowledge and resources, this dissertation specifically focus on the economic implications of strategies ventures can leverage within and across open source development communities. Chapter I reviews the literature relevant to entrepreneurship in an open and inter-dependent innovation environment. Exploring research opportunities emerged from the literature review, Chapter II explores the possibility that multihoming, a critical growth strategy of ventures as open source complementors in platform competition, allows ventures to reinforce their existing user base – a prerequisite of value creation from open source. Chapter III directly addresses value creation by investigating how collaborating with external contributors, another critical open source strategy, influences venture capital investment. Both essays highlight how platform network effects unfold without price signals and proprietary rights of the technologies in shaping the outcome for ventures’ strategies. They also emphasize those strategies’ demand side implications on users, participants on another side of open source platforms.
The empirical analyses of this dissertation are based on multiple open source technologies platforms, with data obtained from on GitHub, the worlds’ largest open source software storage provider, containing 5 Terabytes of information on 2.1 million ventures, 96 million technologies and over 2 billion development activities, under research designs for deriving causal references. Overall, the dissertation seeks to advance the understanding of value creation in entrepreneurship through open source platforms, an increasingly important phenomenon in contemporary economy.Managemen
How diverse is your team? Investigating gender and nationality diversity in GitHub teams
Open Access: This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.Background Building an effective team of developers is a complex task faced by both software companies and open source communities. The problem of forming a “dream” team involves many variables, including consideration of human factors and it is not a dilemma solvable in a mathematical way. Empirical studies might provide interesting insights to explain which factors need to be taken into account in building a team of developers and which levers act to optimise productivity among developers. Aim In this paper, we present the results of an empirical study aimed at investigating the link between team diversity (i.e., gender, nationality) and productivity (issue fixing time). Method We consider issues solved from the GHTorrent dataset inferring gender and nationality of each team’s members. We also evaluate the politeness of all comments involved in issue resolution. Results Results show that higher gender diversity is linked with a lower team average issue fixing time (higher productivity), that nationality diversity is linked with lower team politeness and that gender diversity is linked with higher sentiment.Peer reviewedFinal Published versio
Healthy or Not: A Way to Predict Ecosystem Health in GitHub
With the development of open source community, through the interaction of developers, the collaborative development of software, and the sharing of software tools, the formation of open source software ecosystem has matured. Natural ecosystems provide ecological services on which human beings depend. Maintaining a healthy natural ecosystem is a necessity for the sustainable development of mankind. Similarly, maintaining a healthy ecosystem of open source software is also a prerequisite for the sustainable development of open source communities, such as GitHub. This paper takes GitHub as an example to analyze the health condition of open source ecosystem and, also, it is a research area in Symmetry. Firstly, the paper presents the healthy definition of GitHub open source ecosystem health and, then, according to the main components of natural ecosystem health, the paper proposes the health indicators and health indicators evaluation method. Based on the above, the GitHub ecosystem health prediction method is proposed. By analyzing the projects and data collected in GitHub, it is found that, using the proposed evaluation indicators and method, we can analyze the healthy development trend of the GitHub ecosystem and contribute to the stability of ecosystem development
Herding a Deluge of Good Samaritans: How GitHub Projects Respond to Increased Attention
Collaborative crowdsourcing is a well-established model of work, especially in the case of open source software development. The structure and operation of these virtual and loosely-knit teams differ from traditional organizations. As such, little is known about how their behavior may change in response to an increase in external attention. To understand these dynamics, we analyze millions of actions of thousands of contributors in over 1100 open source software projects that topped the GitHub Trending Projects page and thus experienced a large increase in attention, in comparison to a control group of projects identified through propensity score matching. In carrying out our research, we use the lens of organizational change, which considers the challenges teams face during rapid growth and how they adapt their work routines, organizational structure, and management style. We show that trending results in an explosive growth in the effective team size. However, most newcomers make only shallow and transient contributions. In response, the original team transitions towards administrative roles, responding to requests and reviewing work done by newcomers. Projects evolve towards a more distributed coordination model with newcomers becoming more central, albeit in limited ways. Additionally, teams become more modular with subgroups specializing in different aspects of the project. We discuss broader implications for collaborative crowdsourcing teams that face attention shocks.National Science Foundation Grant No. IIS-1617820.Peer Reviewedhttps://deepblue.lib.umich.edu/bitstream/2027.42/153786/1/Maldeniya et al. 2020.pdfhttps://deepblue.lib.umich.edu/bitstream/2027.42/153786/4/Maldeniya et al. 2020 Published Version.pdfDescription of Maldeniya et al. 2020.pdf : Main ArticleDescription of Maldeniya et al. 2020 Published Version.pdf : Published Versio
An Event-based Analysis Framework for Open Source Software Development Projects
The increasing popularity and success of Open Source Software (OSS) development projects has drawn significant attention of academics and open source participants over the last two decades. As one of the key areas in OSS research, assessing and predicting OSS performance is of great value to both OSS communities and organizations who are interested in investing in OSS projects. Most existing research, however, has considered OSS project performance as the outcome of static cross-sectional factors such as number of developers, project activity level, and license choice. While variance studies can identify some predictors of project outcomes, they tend to neglect the actual process of development. Without a closer examination of how events occur, an understanding of OSS projects is incomplete. This dissertation aims to combine both process and variance strategy, to investigate how OSS projects change over time through their development processes; and to explore how these changes affect project performance. I design, instantiate, and evaluate a framework and an artifact, EventMiner, to analyze OSS projects’ evolution through development activities. This framework integrates concepts from various theories such as distributed cognition (DCog) and complexity theory, applying data mining techniques such as decision trees, motif analysis, and hidden Markov modeling to automatically analyze and interpret the trace data of 103 OSS projects from an open source repository. The results support the construction of process theories on OSS development. The study contributes to literature in DCog, design routines, OSS development, and OSS performance. The resulting framework allows OSS researchers who are interested in OSS development processes to share and reuse data and data analysis processes in an open-source manner
- …