Search CORE

17,170 research outputs found

Stack Overflow: A Code Laundering Platform?

Author: An Le
Antoniol Giuliano
Khomh Foutse
Mlouki Ons
Publication venue
Publication date: 01/01/2017
Field of study

Developers use Question and Answer (Q&A) websites to exchange knowledge and expertise. Stack Overflow is a popular Q&A website where developers discuss coding problems and share code examples. Although all Stack Overflow posts are free to access, code examples on Stack Overflow are governed by the Creative Commons Attribute-ShareAlike 3.0 Unported license that developers should obey when reusing code from Stack Overflow or posting code to Stack Overflow. In this paper, we conduct a case study with 399 Android apps, to investigate whether developers respect license terms when reusing code from Stack Overflow posts (and the other way around). We found 232 code snippets in 62 Android apps from our dataset that were potentially reused from Stack Overflow, and 1,226 Stack Overflow posts containing code examples that are clones of code released in 68 Android apps, suggesting that developers may have copied the code of these apps to answer Stack Overflow questions. We investigated the licenses of these pieces of code and observed 1,279 cases of potential license violations (related to code posting to Stack overflow or code reuse from Stack overflow). This paper aims to raise the awareness of the software engineering community about potential unethical code reuse activities taking place on Q&A websites like Stack Overflow.Comment: In proceedings of the 24th IEEE International Conference on Software Analysis, Evolution, and Reengineering (SANER

arXiv.org e-Print Archive

Crossref

PolyPublie

オープンソースソフトウェアノサイリヨウ二オケルイクツカノカダイ二カンスルケンキュウ

Author: キュウジツ
仇実
Publication venue
Publication date
Field of study

Osaka University Knowledge Archive

Standardization of electroencephalography for multi-site, multi-platform and multi-investigator studies: Insights from the canadian biomarker integration network in depression

Author: Alonso Esther
Arnott Stephen R.
Atluri Sravya
Blumberger Daniel
Brenner Colleen A.
Daskalakis Zafiris J.
Dhami Prabhjot
Dharsee Moyez
Evans Kenneth R.
Farzan Faranak
Frehlich Matthew
Frey Benicio N.
Kennedy Sidney H.
Kleffner Killian
Lam Raymond W.
Liotti Mario
Mcandrews Mary Pat
Milev Roumen
Price Rae
Ravindran Arun
Rotzinger Susan
Vila-Rodriguez Fidel
Wong Willy
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Subsequent to global initiatives in mapping the human brain and investigations of neurobiological markers for brain disorders, the number of multi-site studies involving the collection and sharing of large volumes of brain data, including electroencephalography (EEG), has been increasing. Among the complexities of conducting multi-site studies and increasing the shelf life of biological data beyond the original study are timely standardization and documentation of relevant study parameters. We presentthe insights gained and guidelines established within the EEG working group of the Canadian Biomarker Integration Network in Depression (CAN-BIND). CAN-BIND is a multi-site, multi-investigator, and multiproject network supported by the Ontario Brain Institute with access to Brain-CODE, an informatics platform that hosts a multitude of biological data across a growing list of brain pathologies. We describe our approaches and insights on documenting and standardizing parameters across the study design, data collection, monitoring, analysis, integration, knowledge-translation, and data archiving phases of CAN-BIND projects. We introduce a custom-built EEG toolbox to track data preprocessing with open-access for the scientific community. We also evaluate the impact of variation in equipment setup on the accuracy of acquired data. Collectively, this work is intended to inspire establishing comprehensive and standardized guidelines for multi-site studies

Archivio istituzionale della ricerca - Università di Padova

Recommended from our members

Assessment of work-based reports: an analysis of assessment frameworks

Author: Monk John
Publication venue
Publication date: 01/01/2005
Field of study

In Britain engineering professional development has traditionally been seen as a three phase process consisting of a period of engineering formation, a period of training and a period during which engineering responsibilities are demonstrated. An individual could submit evidence of these activities and become registered as a Professional Engineer. Increasing numbers of people employed in the role of engineer do not have formal engineering qualifications and a part or all their engineering formation is carried out within engineering companies or organizations. These people therefore do not have the academically authenticated credentials to register as professional engineers but if they are ignored then the pool of registered engineers will cease to be representative of the profession. The Engineering Council, the body responsible for registering engineers in the UK, has acknowledged the changes in the structure of the profession and has introduced an alternative route for assessing the knowledge and understanding that underpins the competence of a professional engineer. Individual engineers can demonstrate that they have an adequate engineering formation through any combination of academic qualifications and a technical report on some aspect of their professional engineering work. The introduction of the technical report requires the Professional Engineering Bodies to carry out an assessment outside the traditional assessment framework of the Universities. This paper reviews and analyses the requirements of assessment systems and derives the components of such a system that will ensure that the results of the assessment of a work-based technical report will be respected and be seen as assuring comparable standards to the academic routes to engineering formation. By examining assessment separately from the processes of teaching and learning, the paper also reveals the extent of an assessment process and its costs

Open Research Online

Identifying Bugs in Make and JVM-Oriented Builds

Author: Chaliasos Stefanos
Mitropoulos Dimitris
Sotiropoulos Thodoris
Spinellis Diomidis
Publication venue
Publication date: 14/05/2020
Field of study

Incremental and parallel builds are crucial features of modern build systems. Parallelism enables fast builds by running independent tasks simultaneously, while incrementality saves time and computing resources by processing the build operations that were affected by a particular code change. Writing build definitions that lead to error-free incremental and parallel builds is a challenging task. This is mainly because developers are often unable to predict the effects of build operations on the file system and how different build operations interact with each other. Faulty build scripts may seriously degrade the reliability of automated builds, as they cause build failures, and non-deterministic and incorrect build results. To reason about arbitrary build executions, we present buildfs, a generally-applicable model that takes into account the specification (as declared in build scripts) and the actual behavior (low-level file system operation) of build operations. We then formally define different types of faults related to incremental and parallel builds in terms of the conditions under which a file system operation violates the specification of a build operation. Our testing approach, which relies on the proposed model, analyzes the execution of single full build, translates it into buildfs, and uncovers faults by checking for corresponding violations. We evaluate the effectiveness, efficiency, and applicability of our approach by examining hundreds of Make and Gradle projects. Notably, our method is the first to handle Java-oriented build systems. The results indicate that our approach is (1) able to uncover several important issues (245 issues found in 45 open-source projects have been confirmed and fixed by the upstream developers), and (2) orders of magnitude faster than a state-of-the-art tool for Make builds

arXiv.org e-Print Archive

Assisting Software Developers With License Compliance

Author: Vendome Christopher
Publication venue: W&M ScholarWorks
Publication date: 01/01/2018
Field of study

Open source licensing determines how open source systems are reused, distributed, and modified from a legal perspective. While it facilitates rapid development, it can present difficulty for developers in understanding due to the legal language of these licenses. Because of misunderstandings, systems can incorporate licensed code in a way that violates the terms of the license. Such incompatibilities between licensing can result in the inability to reuse a particular library without either relicensing the system or redesigning the architecture of the system. Prior efforts have predominantly focused on license identification or understanding the underlying phenomena without reasoning about compatibility in a broad scale. The work in this dissertation first investigates the rationale of developers and identifies the areas that developers struggle with respect to free/open source software licensing. First, we investigate the diffusion of licenses and the prevalence of license changes in a large scale empirical study of 16,221 Java systems. We observed a clear lack of traceability and a lack of standardized licensing that led to difficulties and confusion for developers trying to reuse source code. We further investigated the difficulty by surveying the developers of the systems with license changes to understand why they first adopted a license and then changed licenses. Additionally, we performed an analysis on issue trackers and legal mailing lists to extract licensing bugs. From these works, we identified key areas in which developers struggled and needed support. While developers need support to identify license incompatibilities and understand both the cause and implications of the incompatibilities, we observed that state-of-the-art license identification tools did not identify license exceptions. Since these exceptions directly modify the license terms (either the permissions granted by the license or the restrictions imposed by the license), we proposed an approach to complement current license identification techniques in order to classify license exceptions. The approach relies on supervised machine learners to classify the licensing text to identify the particular license exceptions or the lack of a license exception. Subsequently, we built an infrastructure to assist developers with evaluating license compliance warnings for their system. The infrastructure evaluates compliance across the dependency tree of a system to ensure it is compliant with all of the licenses of the dependencies. When an incompatibility is present, it notes the specific library/libraries and the conflicting license(s) so that the developers can investigate these compliance warnings, which would prevent distribution of their software, in their system. We conduct a study on 121,094 open source projects spanning 6 programming languages, and we demonstrate that the infrastructure is able to identify license incompatibilities between these projects and their dependencies

College of William & Mary: W&M Publish