    Is "Better Data" Better than "Better Data Miners"? (On the Benefits of Tuning SMOTE for Defect Prediction)

    We report and fix an important systematic error in prior studies that ranked classifiers for software analytics. Those studies did not (a) assess classifiers on multiple criteria, nor did they (b) study how variations in the data affect the results. Hence, this paper applies (a) multi-criteria tests while (b) fixing the weaker regions of the training data (using SMOTUNED, which is a self-tuning version of SMOTE). This approach leads to dramatically large improvements in software defect prediction. When applied in a 5*5 cross-validation study of 3,681 Java classes (containing over a million lines of code) from open source systems, SMOTUNED increased AUC and recall by 60% and 20% respectively. These improvements are independent of the classifier used to predict for quality. The same pattern of improvement was observed when SMOTE and SMOTUNED were compared against the most recent class imbalance technique. In conclusion, for software analytic tasks like defect prediction, (1) data pre-processing can be more important than classifier choice, (2) ranking studies are incomplete without such pre-processing, and (3) SMOTUNED is a promising candidate for pre-processing. Comment: 10 pages + 2 references. Accepted to International Conference of Software Engineering (ICSE), 201

    Internal Controls After Sarbanes-Oxley: Revisiting Corporate Law's Duty of Care as Responsibility for Systems

    Revisiting section 3.4.2 of Clark's Corporate Law ("Duty of Care as Responsibility for Systems") reminds us, however, that the internal controls story actually goes back many decades, and that many of the strategic issues that are at the heart of section 404 have long been contentious. My Article will briefly update Clark's account through the late 1980s and 1990s before returning to Sarbanes-Oxley and rulemaking thereunder by the SEC and the newly created Public Company Accounting Oversight Board ("PCAOB"). My main point builds on one of Clark's but digs deeper. Internal controls requirements, whether federal or state, are incoherent unless and until one articulates clearly for whose benefit they exist, and to what end. There are, in fact, a number of competing articulations. The failure to identify a single and coherent rationale creates significant uncertainty, which has been exploited by players in the legal, accounting, consulting, and information technology fields. Companies are probably spending more time and resources on 404 compliance than a reasonable reading of the legislation and the rules necessarily requires, heavily influenced by those who gain from issuer over-compliance. This rent-seeking compromises the political viability and substantive quality of what is at heart a beneficial statutory reform.

    Oil spill detection using optical sensors: a multi-temporal approach

    Oil pollution is one of the most destructive consequences of human activity in the marine environment. Oil wastes come from many sources and take decades to be disposed of. Satellite-based remote sensing systems can be integrated into a surveillance and monitoring network. In this study, a multi-temporal approach to the oil spill detection problem is investigated. Change Detection (CD) analysis was applied to MODIS/Terra and Aqua and OLI/Landsat 8 images of several reported oil spill events, characterized by different geographic location, sea conditions, source and extension of the spill. Toward the development of an automatic detection algorithm, a Change Vector Analysis (CVA) technique was implemented to compare the current image of the area of interest with a dataset of reference images, statistically analyzed to reduce the sea spectral variability between different dates. The proposed approach highlights the optical sensors’ capabilities in detecting oil spills at sea. The effectiveness of different sensors’ resolution in detecting spills of different sizes, and the relevance of the sensors’ revisiting time for tracking and monitoring the evolution of the event, are also investigated.

    Committing to Equal Opportunity

    The main goal of this research paper was to examine whether New Hampshire's funding system for public education is effective in providing equal educational opportunities to all children. The findings from the quantitative and qualitative analyses of this research study suggest that, despite a historically increasing role of the state government, New Hampshire’s funding system for public education has not proven to be effective in providing the opportunity of an adequate education to students in poor districts, and gaps in student achievement persist between poor and wealthier districts. The underlying problems of the funding structure are explored, and, finally, this report suggests a list of policy recommendations to make the New Hampshire funding system more effective in providing an adequate education to all children.

    Written report in learning geometry: explanation and argumentation

    In this article, we examine how the written report, within the context of assessment for learning, helps students in learning geometry and in developing their explanation and argumentation skills. We present the results of a qualitative case study involving Portuguese 8th-grade students. This study suggests that using written reports improves those capabilities and, therefore, the comprehension of geometric concepts and processes. These benefits for learning are enhanced through the implementation of some assessment strategies, namely oral and written feedback.

    The global warming hiatus: Slowdown or redistribution?

    Global mean surface temperatures (GMST) exhibited a smaller rate of warming during 1998-2013 compared to the warming in the latter half of the 20th century. Although not a "true" hiatus in the strict sense of the word, this has been termed the "global warming hiatus" by IPCC (2013). Other periods have also been defined as the "hiatus," depending on the analysis. There are a number of uncertainties and knowledge gaps regarding the "hiatus." This report reviews these issues and also draws insights from a diverse, collective set of information that helps us understand what we do and do not know. One salient insight is that the GMST phenomenon is a surface characteristic that does not represent a slowdown in warming of the climate system but rather is an energy redistribution within the oceans. Improved understanding of the ocean distribution and redistribution of heat will help to better monitor Earth's energy budget and its consequences. A review of recent scientific publications on the "hiatus" shows the difficulty and complexities in pinpointing the oceanic sink of the "missing heat" from the atmosphere and the upper layer of the oceans, which defines the "hiatus." Advances in "hiatus" research and outlooks (recommendations) are given in this report.

    Engaging students in the curriculum through the use of blogs; how and why?

    This paper presents an academic case for the use of blogs in higher education, and some key considerations for those planning and designing blogging activities in an HE setting. Focusing on the roles of action/activity and experience, reflection and community in learning, this paper suggests how the blogging process can engage students and enhance learning, and how specific features of blogs might be used to bring maximum benefit to the learner.