Is "Better Data" Better than "Better Data Miners"? (On the Benefits of Tuning SMOTE for Defect Prediction)
We report and fix an important systematic error in prior studies that ranked
classifiers for software analytics. Those studies did not (a) assess
classifiers on multiple criteria and they did not (b) study how variations in
the data affect the results. Hence, this paper applies (a) multi-criteria tests
while (b) fixing the weaker regions of the training data (using SMOTUNED, which
is a self-tuning version of SMOTE). This approach leads to dramatically large
increases in software defect predictions. When applied in a 5*5
cross-validation study for 3,681 JAVA classes (containing over a million lines
of code) from open source systems, SMOTUNED increased AUC and recall by 60% and
20% respectively. These improvements are independent of the classifier used to
predict for quality. The same kind of pattern (improvement) was observed when a
comparative analysis of SMOTE and SMOTUNED was done against the most recent
class imbalance technique. In conclusion, for software analytic tasks like
defect prediction, (1) data pre-processing can be more important than
classifier choice, (2) ranking studies are incomplete without such
pre-processing, and (3) SMOTUNED is a promising candidate for pre-processing.
Comment: 10 pages + 2 references. Accepted to International Conference of
Software Engineering (ICSE), 201
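For readers unfamiliar with the pre-processing step named in the abstract above, the following is a minimal sketch of plain SMOTE-style oversampling: each synthetic minority example is an interpolation between a minority example and one of its nearest minority neighbours. The function name `smote_sketch` and its parameters (`n_synthetic`, `k`, `seed`) are illustrative assumptions, not the authors' code, and the self-tuning step that distinguishes SMOTUNED is not reproduced here.

```python
import numpy as np

def smote_sketch(minority, n_synthetic, k=5, seed=0):
    """Return n_synthetic points interpolated between minority-class neighbours.

    Hypothetical helper for illustration; needs at least two minority rows.
    """
    rng = np.random.default_rng(seed)
    minority = np.asarray(minority, dtype=float)
    n = len(minority)
    k = min(k, n - 1)  # cannot have more neighbours than other points

    # Pairwise Euclidean distances between minority examples.
    dists = np.linalg.norm(minority[:, None, :] - minority[None, :, :], axis=2)
    np.fill_diagonal(dists, np.inf)  # a point is not its own neighbour
    neighbours = np.argsort(dists, axis=1)[:, :k]

    synthetic = np.empty((n_synthetic, minority.shape[1]))
    for i in range(n_synthetic):
        base = rng.integers(n)                     # random minority example
        other = neighbours[base, rng.integers(k)]  # one of its k nearest neighbours
        gap = rng.random()                         # interpolation factor in [0, 1)
        synthetic[i] = minority[base] + gap * (minority[other] - minority[base])
    return synthetic
```

The neighbour count `k` and the number of synthetic examples are the kind of settings a tuner could search over instead of leaving at defaults; which settings SMOTUNED actually tunes is not stated in the abstract.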
Internal Controls After Sarbanes-Oxley: Revisiting Corporate Law's Duty of Care as Responsibility for Systems
Revisiting section 3.4.2 of Clark's Corporate Law ('Duty of Care as Responsibility for Systems') reminds us, however, that the internal controls story actually goes back many decades, and that many of the strategic issues that are at the heart of section 404 have long been contentious. My Article will briefly update Clark's account through the late 1980s and 1990s before returning to Sarbanes-Oxley and rulemaking thereunder by the SEC and the newly created Public Company Accounting Oversight Board (PCAOB). My main point builds on one of Clark's but digs deeper. Internal controls requirements, whether federal or state, are incoherent unless and until one articulates clearly for whose benefit they exist, and to what end. There are, in fact, a number of competing articulations. The failure to identify a single and coherent rationale creates significant uncertainty, which has been exploited by players in the legal, accounting, consulting, and information technology fields. Companies are probably spending more time and resources on 404 compliance than a reasonable reading of the legislation and the rules necessarily requires, heavily influenced by those who gain from issuer over-compliance. This rent-seeking compromises the political viability and substantive quality of what is at heart a beneficial statutory reform
Oil spill detection using optical sensors: a multi-temporal approach
Oil pollution is one of the most destructive consequences of human activities in the marine environment. Oil wastes come from many sources and take decades to be disposed of. Satellite-based remote sensing systems can be integrated into a surveillance and monitoring network. In this study, a multi-temporal approach to the oil spill detection problem is investigated. Change Detection (CD) analysis was applied to MODIS/Terra and Aqua and OLI/Landsat 8 images of several reported oil spill events, characterized by different geographic location, sea conditions, source and extension of the spill. Toward the development of an automatic detection algorithm, a Change Vector Analysis (CVA) technique was implemented to carry out the comparison between the current image of the area of interest and a dataset of reference images, statistically analyzed to reduce the sea spectral variability between different dates. The proposed approach highlights the optical sensors’ capabilities in detecting oil spills at sea. The effectiveness of different sensors’ resolution towards the detection of spills of different size, and the relevance of the sensors’ revisiting time to track and monitor the evolution of the event are also investigated
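As a rough illustration of the Change Vector Analysis step mentioned in the abstract above, the sketch below computes a per-pixel change magnitude between a current multi-band image and the mean of a stack of reference images. The array layout, the use of a simple mean as the statistical reference, the fixed threshold, and the names `change_vector_magnitude` and `flag_changes` are all assumptions made for illustration; the abstract does not specify how the reference dataset was analyzed.

```python
import numpy as np

def change_vector_magnitude(current, reference_stack):
    """Per-pixel Euclidean length of the spectral change vector.

    current:         array of shape (bands, height, width)
    reference_stack: array of shape (dates, bands, height, width)
    """
    # Assumption: the reference is the per-pixel mean over the reference dates.
    reference = np.asarray(reference_stack, dtype=float).mean(axis=0)
    delta = np.asarray(current, dtype=float) - reference  # change vector per pixel
    return np.sqrt((delta ** 2).sum(axis=0))              # collapse the band axis

def flag_changes(magnitude, threshold):
    """Boolean mask of pixels whose change magnitude exceeds the threshold."""
    return magnitude > threshold
```

A larger magnitude means the pixel's spectrum differs more from its reference; choosing the threshold is the part that would need the scene statistics the abstract alludes to.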
Committing to Equal Opportunity
The main goal of this research paper was to examine whether the New Hampshire funding system of public education is effective in providing equal educational opportunities to all children. The findings from the quantitative and qualitative analyses of this research study suggest that, despite a historically increasing role of the state government, New Hampshire’s funding system of public education has not proven to be effective in providing the opportunity of an adequate education to students in poor districts, and gaps in student achievement persist between poor and wealthier districts. The underlying problems of the funding structure are explored, and, finally, this report suggests a list of policy recommendations to make the New Hampshire funding system more effective in providing an adequate education to all children
Written report in learning geometry: explanation and argumentation
In this article, we examine how the written report, within the context of assessment for learning, helps students in learning geometry and in developing their explanation and argumentation skills. We present the results of a qualitative case study involving Portuguese students of the 8th grade. This study suggests that using written reports improves those capabilities and, therefore, the comprehension of geometric concepts and processes. These benefits for learning are enhanced through the implementation of some assessment strategies, namely oral and written feedback
Key Skills: making connections between HE and the workplace
This paper draws on a recent research project into high-level key skills links between HE and employment. The project has worked with groups in several universities and companies to explore how the developmental model embedded in the QCA key skills national standards can be used to support learning and assessment of higher level (QCA levels 4 and 5) key skills. Employers increasingly value skills such as teamworking, communicating effectively with partners and customers, and being able to adapt to new situations and develop new capabilities. Within organisations individuals may be expected to move from project to project and job to job. They may be expected to identify their own particular training needs, work within the company business goals and develop their own individual skills portfolio to satisfy professional recognition requirements. HE currently appears to offer relatively little support or training to develop the key skills needed in such environments. As part of the project students in HE have used a framework of planning, monitoring progress, presenting outcomes and reviewing progress to develop their skills. The model encourages learners to recognise and articulate their own capabilities more clearly, and offers an assessment structure for profiling achievement. It is this 'meta-skills' approach that is used to bridge the gap between HE and employment by encouraging learners to be actively aware of the context in which they are currently situated, and to make connections with experience, skills and knowledge they have gained elsewhere. The paper presents some preliminary findings and comments from the project
The global warming hiatus: Slowdown or redistribution?
Global mean surface temperatures (GMST) exhibited a smaller rate of warming during 1998-2013, compared to the warming in the latter half of the 20th Century. Although not a "true" hiatus in the strict sense of the word, this has been termed the "global warming hiatus" by IPCC (2013). There have been other periods that have also been defined as the "hiatus" depending on the analysis. There are a number of uncertainties and knowledge gaps regarding the "hiatus." This report reviews these issues and also posits insights from a collective set of diverse information that helps us understand what we do and do not know. One salient insight is that the GMST phenomenon is a surface characteristic that does not represent a slowdown in warming of the climate system but rather is an energy redistribution within the oceans. Improved understanding of the ocean distribution and redistribution of heat will help better monitor Earth's energy budget and its consequences. A review of recent scientific publications on the "hiatus" shows the difficulty and complexities in pinpointing the oceanic sink of the "missing heat" from the atmosphere and the upper layer of the oceans, which defines the "hiatus." Advances in "hiatus" research and outlooks (recommendations) are given in this report
Engaging students in the curriculum through the use of blogs; how and why?
This paper presents an academic case for the use of blogs in higher education, and some key considerations for those planning and designing blogging activities in an HE setting. Focusing on the roles of action/activity and experience, reflection and community in learning, this paper suggests how the blogging process can engage students and enhance learning, and how specific features of blogs might be used to bring maximum benefit to the learner