4,874,073 research outputs found

    Corrections of the NIST Statistical Test Suite for Randomness

    Get PDF
    It is well known that the NIST statistical test suite was used for the evaluation of AES candidate algorithms. We have found that the test setting of Discrete Fourier Transform test and Lempel-Ziv test of this test suite are wrong. We give four corrections of mistakes in the test settings. This suggests that re-evaluation of the test results should be needed

    The sit up test to exhaustion as a test for muscular endurance evaluation

    Get PDF
    AIMS/HYPOTHESIS The aim of this study was to examine the sit up test to exhaustion as a field test for muscular endurance evaluation in a sample of sedentary people of both sexes. METHODS: A cross-sectional study was performed. Three-hundred-eighty-one participants volunteered for the study (28.5 \ub1 10.0 years; 168.2 \ub1 8.9 cm; 65.1 \ub1 11.1 kg), of which 194 males (27.5 \ub1 10.2 years; 173.6 \ub1 7.0 cm; 71.2 \ub1 5.2 kg) and 187 females (29.6 \ub1 10.1 years; 162.6 \ub1 7.1 cm; 58.7 \ub1 8.9 kg). Each subject voluntarily and randomly performed: a sit up test (SUT), a push up test (PUT), and a free weight squat test (ST), all till exhaustion. A multiple regression analysis was adopted for data analysis. Subsequently a percentile model for muscle endurance was developed. The 25th, 50th, and 75th percentile were identified as upper limit for low muscular endurance, average muscular endurance, and lower limit for high muscular endurance, respectively. RESULTS: Considering the sit up test as the dependent variable, the coefficients (R(2) = 0.23; r = 0.49; p < 0.001), and (R(2) = 0.31; r = 0.57; p < 0.001) emerged from a multiple regression analysis applied with respect to the push up test and the squat test, respectively. Gender stratification showed regression coefficients of (R(2) = 0.19; r = 0.44; p < 0.001) for SUT vs. PUT, and (R(2) = 0.30; r = 0.56; p < 0.001) for SUT vs. ST in male; and (R(2) = 0.23; r = 0.49; p < 0.001) for SUT vs. PUT, and (R(2) = 0.34; r = 0.59; p < 0.001) for SUT vs. ST in female. CONCLUSIONS/INTERPRETATION: The SUT showed low inter-relation with the other proposed tests indicating that the adoption of a single test for the global evaluation of muscle endurance is not the optimal approach. Moreover, the SUT was found to be inexpensive, safe, and appropriate for core muscle endurance measurement for both male and female

    EFFECTIVENESS RATIO SYSTEM IMPLEMENTATION EVALUATION OF ELECTRONIC TEST USING WONDERSHARE QUIZ CREATOR AND PAPER TEST RESULTS OF STUDENT LEARNING ON WORD PROCESSING APPLICATION MATERIALS IN MAN 1 YOGYAKARTA

    Get PDF
    This research to determine ratio of the implementation of electronic test using Wondershare quiz creator and paper test on the material word processing applications and to determine the effectiveness of electronic test using Wondershare quiz creator and paper test in terms of student learning outcomes in the test word processor application materials in MAN 1 Yogyakarta. This research is a comparative study. The population in this study were all students of class X which totaled 212 students. The sample in this study was a class XB and XE, each class totaled 30 students. Data collection techniques using test instruments and questionnaires (questionnaire) using four Likert scale. Test the validity of the instrument using expert judgment and product moment formula, while the instrument reliability testing using Cronbach alpha formula. The test instrument in the research done on the subject of evaluation XC class electronic test and evaluation imposed XF grade test paper, amounted to 30 respondents are not used as a sample. The data analysis technique using the average difference test or t-test. The results showed that there is the ratio of the implementation of electronic forms evaluation system test using Wondershare quiz creator and paper test on the material word processing application on MAN 1 Yogykarta. This is shown by the value of t test was 2.071744 at 5% significance level. There is a system of evaluation of the effectiveness ratio test electronic form using Wondershare quiz creator and paper test in terms of student achievement test on the material word processing application on MAN 1 Yogyakarta. This is shown by the value of t test for evaluating the effectiveness was 3.052142 at 5% significance level and the value of achievement test for the difference was 2.240424 5% significance level. Keywords: wondershare quiz creator, electronic test and paper tes

    Empirical Evaluation of Test Coverage for Functional Programs

    Get PDF
    The correlation between test coverage and test effectiveness is important to justify the use of coverage in practice. Existing results on imperative programs mostly show that test coverage predicates effectiveness. However, since functional programs are usually structurally different from imperative ones, it is unclear whether the same result may be derived and coverage can be used as a prediction of effectiveness on functional programs. In this paper we report the first empirical study on the correlation between test coverage and test effectiveness on functional programs. We consider four types of coverage: as input coverages, statement/branch coverage and expression coverage, and as oracle coverages, count of assertions and checked coverage. We also consider two types of effectiveness: raw effectiveness and normalized effectiveness. Our results are twofold. (1) In general the findings on imperative programs still hold on functional programs, warranting the use of coverage in practice. (2) On specific coverage criteria, the results may be unexpected or different from the imperative ones, calling for further studies on functional programs

    Empirical Evaluation of Mutation-based Test Prioritization Techniques

    Full text link
    We propose a new test case prioritization technique that combines both mutation-based and diversity-based approaches. Our diversity-aware mutation-based technique relies on the notion of mutant distinguishment, which aims to distinguish one mutant's behavior from another, rather than from the original program. We empirically investigate the relative cost and effectiveness of the mutation-based prioritization techniques (i.e., using both the traditional mutant kill and the proposed mutant distinguishment) with 352 real faults and 553,477 developer-written test cases. The empirical evaluation considers both the traditional and the diversity-aware mutation criteria in various settings: single-objective greedy, hybrid, and multi-objective optimization. The results show that there is no single dominant technique across all the studied faults. To this end, \rev{we we show when and the reason why each one of the mutation-based prioritization criteria performs poorly, using a graphical model called Mutant Distinguishment Graph (MDG) that demonstrates the distribution of the fault detecting test cases with respect to mutant kills and distinguishment

    HALOE test and evaluation software

    Get PDF
    Computer programming, system development and analysis efforts during this contract were carried out in support of the Halogen Occultation Experiment (HALOE) at NASA/Langley. Support in the major areas of data acquisition and monitoring, data reduction and system development are described along with a brief explanation of the HALOE project. Documented listings of major software are located in the appendix

    Test and inspection for process control of monolithic circuits

    Get PDF
    Report details the test and inspection procedures for the mass production of high reliability integrated circuits. It covers configuration control, basic fundamentals of quality control, control charts, wafer process evaluation, general process evaluation, evaluation score system, and diffusion evaluation

    Effect of Statistical Fluctuation in Monte Carlo Based Photon Beam Dose Calculation on Gamma Index Evaluation

    Full text link
    The gamma-index test has been commonly adopted to quantify the degree of agreement between a reference dose distribution and an evaluation dose distribution. Monte Carlo (MC) simulation has been widely used for the radiotherapy dose calculation for both clinical and research purposes. The goal of this work is to investigate both theoretically and experimentally the impact of the MC statistical fluctuation on the gamma-index test when the fluctuation exists in the reference, the evaluation, or both dose distributions. To the first order approximation, we theoretically demonstrated in a simplified model that the statistical fluctuation tends to overestimate gamma-index values when existing in the reference dose distribution and underestimate gamma-index values when existing in the evaluation dose distribution given the original gamma-index is relatively large for the statistical fluctuation. Our numerical experiments using clinical photon radiation therapy cases have shown that 1) when performing a gamma-index test between an MC reference dose and a non-MC evaluation dose, the average gamma-index is overestimated and the passing rate decreases with the increase of the noise level in the reference dose; 2) when performing a gamma-index test between a non-MC reference dose and an MC evaluation dose, the average gamma-index is underestimated when they are within the clinically relevant range and the passing rate increases with the increase of the noise level in the evaluation dose; 3) when performing a gamma-index test between an MC reference dose and an MC evaluation dose, the passing rate is overestimated due to the noise in the evaluation dose and underestimated due to the noise in the reference dose. We conclude that the gamma-index test should be used with caution when comparing dose distributions computed with Monte Carlo simulation

    Vibroacoustic test plan evaluation: Parameter variation study

    Get PDF
    Statistical decision models are shown to provide a viable method of evaluating the cost effectiveness of alternate vibroacoustic test plans and the associated test levels. The methodology developed provides a major step toward the development of a realistic tool to quantitatively tailor test programs to specific payloads. Testing is considered at the no test, component, subassembly, or system level of assembly. Component redundancy and partial loss of flight data are considered. Most and probabilistic costs are considered, and incipient failures resulting from ground tests are treated. Optimums defining both component and assembly test levels are indicated for the modified test plans considered. modeling simplifications must be considered in interpreting the results relative to a particular payload. New parameters introduced were a no test option, flight by flight failure probabilities, and a cost to design components for higher vibration requirements. Parameters varied were the shuttle payload bay internal acoustic environment, the STS launch cost, the component retest/repair cost, and the amount of redundancy in the housekeeping section of the payload reliability model
    corecore