Demystifying Developers' Issues in Distributed Training of Deep Learning Software
Deep learning (DL) has become pervasive across a wide spectrum of today's software
systems and applications. The rich features of these DL-based software
applications (i.e., DL software) usually rely on powerful DL models. To train
powerful DL models with large datasets efficiently, it has been a common
practice for developers to parallelize and distribute the computation and
memory over multiple devices in the training process, which is known as
distributed training. However, existing efforts in the software engineering
(SE) research community mainly focus on issues in the general process of
training DL models. In contrast, to the best of our knowledge, issues that
developers encounter in distributed training have never been well studied.
Given the surging importance of distributed training in the current practice of
developing DL software, this paper fills in the knowledge gap and presents the
first comprehensive study on developers' issues in distributed training. To
this end, we extract and analyze 1,054 real-world developers' issues in
distributed training from Stack Overflow and GitHub, two commonly used data
sources for studying software issues. We construct a fine-grained taxonomy
consisting of 30 categories regarding the fault symptoms and summarize common
fix patterns for different symptoms. Based on the results, we suggest
actionable implications and research avenues that can potentially facilitate
the future development of distributed training.
Grand Challenges of Traceability: The Next Ten Years
In 2007, the software and systems traceability community met at the first
Natural Bridge symposium on the Grand Challenges of Traceability to establish
and address research goals for achieving effective, trustworthy, and ubiquitous
traceability. Ten years later, in 2017, the community came together to evaluate
a decade of progress towards achieving these goals. These proceedings document
some of that progress. They include a series of short position papers,
representing current work in the community organized across four process axes
of traceability practice. The sessions covered topics including Trace Strategizing,
Trace Link Creation and Evolution, Trace Link Usage, real-world applications of
Traceability, and Traceability Datasets and benchmarks. Two breakout groups
focused on the importance of creating and sharing traceability datasets within
the research community, and discussed challenges related to the adoption of
tracing techniques in industrial practice. Members of the research community
are engaged in many active, ongoing, and impactful research projects. Our hope
is that ten years from now we will be able to look back at a productive decade
of research and claim that we have achieved the overarching Grand Challenge of
Traceability, which seeks for traceability to be always present, built into the
engineering process, and for it to have "effectively disappeared without a
trace". We hope that others will see the potential that traceability has for
empowering software and systems engineers to develop higher-quality products at
increasing levels of complexity and scale, and that they will join the active
community of Software and Systems traceability researchers as we move forward
into the next decade of research.