90 research outputs found
GitDelver Enterprise Dataset (GDED):An Industrial Closed-source Dataset for Socio-Technical Research
User Review-Based Change File Localization for Mobile Applications
In the current mobile app development, novel and emerging DevOps practices
(e.g., Continuous Delivery, Integration, and user feedback analysis) and tools
are becoming more widespread. For instance, the integration of user feedback
(provided in the form of user reviews) in the software release cycle represents
a valuable asset for the maintenance and evolution of mobile apps. To fully
make use of these assets, it is highly desirable for developers to establish
semantic links between the user reviews and the software artefacts to be
changed (e.g., source code and documentation), and thus to localize the
potential files to change for addressing the user feedback. In this paper, we
propose RISING (Review Integration via claSsification, clusterIng, and
linkiNG), an automated approach to support the continuous integration of user
feedback via classification, clustering, and linking of user reviews. RISING
leverages domain-specific constraint information and semi-supervised learning
to group user reviews into multiple fine-grained clusters concerning similar
users' requests. Then, by combining the textual information from both commit
messages and source code, it automatically localizes potential change files to
accommodate the users' requests. Our empirical studies demonstrate that the
proposed approach outperforms the state-of-the-art baseline work in terms of
clustering and localization accuracy, and thus produces more reliable results.Comment: 15 pages, 3 figures, 8 table
Are We Building on the Rock? On the Importance of Data Preprocessing for Code Summarization
Code summarization, the task of generating useful comments given the code,
has long been of interest. Most of the existing code summarization models are
trained and validated on widely-used code comment benchmark datasets. However,
little is known about the quality of the benchmark datasets built from
real-world projects. Are the benchmark datasets as good as expected? To bridge
the gap, we conduct a systematic research to assess and improve the quality of
four benchmark datasets widely used for code summarization tasks. First, we
propose an automated code-comment cleaning tool that can accurately detect
noisy data caused by inappropriate data preprocessing operations from existing
benchmark datasets. Then, we apply the tool to further assess the data quality
of the four benchmark datasets, based on the detected noises. Finally, we
conduct comparative experiments to investigate the impact of noisy data on the
performance of code summarization models. The results show that these data
preprocessing noises widely exist in all four benchmark datasets, and removing
these noisy data leads to a significant improvement on the performance of code
summarization. We believe that the findings and insights will enable a better
understanding of data quality in code summarization tasks, and pave the way for
relevant research and practice
Baseline characteristics and comparability of older multimorbid patients with polypharmacy and general practitioners participating in a randomized controlled primary care trial.
OBJECTIVES
Recruiting general practitioners (GPs) and their multimorbid older patients for trials is challenging for multiple reasons (e.g., high workload, limited mobility). The comparability of study participants is important for interpreting study findings. This manuscript describes the baseline characteristics of GPs and patients participating in the 'Optimizing PharmacoTherapy in older multimorbid adults In primary CAre' (OPTICA) trial, a study of optimization of pharmacotherapy for multimorbid older adults. The overall aim of this study was to determine if the GPs and patients participating in the OPTICA trial are comparable to the real-world population in Swiss primary care.
DESIGN
Analysis of baseline data from GPs and patients in the OPTICA trial and a reference cohort from the FIRE ('Family medicine ICPC Research using Electronic medical records') project.
SETTING
Primary care, Switzerland.
PARTICIPANTS
Three hundred twenty-three multimorbid (≥ 3 chronic conditions) patients with polypharmacy (≥ 5 regular medications) aged ≥ 65 years and 43 GPs recruited for the OPTICA trial were compared to 22,907 older multimorbid patients with polypharmacy and 227 GPs from the FIRE database.
METHODS
We compared the characteristics of GPs and patients participating in the OPTICA trial with other GPs and other older multimorbid adults with polypharmacy in the FIRE database. We described the baseline willingness to have medications deprescribed of the patients participating in the OPTICA trial using the revised Patients' Attitudes Towards Deprescribing (rPATD) questionnaire.
RESULTS
The GPs in the FIRE project and OPTICA were similar in terms of sociodemographic characteristics and their work as a GP (e.g. aged in their fifties, ≥ 10 years of experience, ≥ 60% are self-employed, ≥ 80% work in a group practice). The median age of patients in the OPTICA trial was 77 years and 45% of trial participants were women. Patients participating in the OPTICA trial and patients in the FIRE database were comparable in terms of age, certain clinical characteristics (e.g. systolic blood pressure, body mass index) and health services use (e.g. selected lab and vital data measurements). More than 80% of older multimorbid patients reported to be willing to stop ≥ 1 of their medications if their doctor said that this would be possible.
CONCLUSION
The characteristics of patients and GPs recruited into the OPTICA trial are relatively comparable to characteristics of a real-world Swiss population, which indicates that recruiting a generalizable patient sample is possible in the primary care setting. Multimorbid patients in the OPTICA trial reported a high willingness to have medications deprescribed.
TRIAL REGISTRATION
Clinicaltrials.gov ( NCT03724539 ), KOFAM (Swiss national portal) ( SNCTP000003060 ), Universal Trial Number (U1111-1226-8013)
- …