174 research outputs found

    Aspects of categorical data analysis.

    Get PDF
    Thesis (M.Sc.)-University of Natal, Durban, 1998.The purpose of this study is to investigate and understand data which are grouped into categories. At the onset, the study presents a review of early research contributions and controversies surrounding categorical data analysis. The concept of sparseness in a contingency table refers to a table where many cells have small frequencies. Previous research findings showed that incorrect results were obtained in the analysis of sparse tables. Hence, attention is focussed on the effect of sparseness on modelling and analysis of categorical data in this dissertation. Cressie and Read (1984) suggested a versatile alternative, the power divergence statistic, to statistics proposed in the past. This study includes a detailed discussion of the power-divergence goodness-of-fit statistic with areas of interest covering a review on the minimum power divergence estimation method and evaluation of model fit. The effects of sparseness are also investigated for the power-divergence statistic. Comparative reviews on the accuracy, efficiency and performance of the power-divergence family of statistics under large and small sample cases are presented. Statistical applications on the power-divergence statistic have been conducted in SAS (Statistical Analysis Software). Further findings on the effect of small expected frequencies on accuracy of the X2 test are presented from the studies of Tate and Hyer (1973) and Lawal and Upton (1976). Other goodness-of-fit statistics which bear relevance to the sparse multino-mial case are discussed. They include Zelterman's (1987) D2 goodness-of-fit statistic, Simonoff's (1982, 1983) goodness-of-fit statistics as well as Koehler and Larntz's tests for log-linear models. On addressing contradictions for the sparse sample case under asymptotic conditions and an increase in sample size, discussions are provided on Simonoff's use of nonparametric techniques to find the variances as well as his adoption of the jackknife and bootstrap technique

    A Psychometric Investigation of a Mathematics Placement Test at a Science, Technology, Engineering, and Mathematics (STEM) Gifted Residential High School

    Get PDF
    Educational institutions, at all levels, must justify their use of placement testing and confront questions of their impact on students’ educational outcomes to assure all stakeholders that students are being enrolled in courses appropriate with their ability in order to maximize their chances of success (Linn, 1994; Mattern & Packman, 2009; McFate & Olmsted III, 1999; Norman, Medhanie, Harwell, Anderson, & Post, 2011; Wiggins, 1989). The aims of this research were to (1) provide evidence of Content Validity, (2) provide evidence of Construct Validity and Internal Consistency Reliability, (3) examine the item characteristics and potential bias of the items between males and females, and (4) provide evidence of Criterion-Related Validity by investigating the ability of the mathematics placement test scores to predict future performance in an initial mathematics course. Students’ admissions portfolios and scores from the mathematics placement test were used to examine the aims of this research. Content Validity was evidenced through the use of a card-sorting task by internal and external subject matter experts. Results from Multidimensional Scaling and Hierarchical Cluster Analysis revealed a congruence of approximately 63 percent between the two group configurations. Next, an Exploratory Factor Analysis was used to investigate the underlying factor structure of the mathematics placement test. Findings indicated a three factor structure of PreCalculus, Geometry, and Algebra 1, with moderate correlations between factors. Thirdly, an item analysis was conducted to explore the item parameters (i.e., item difficulty, and item discrimination) and to test for gender biases. Results from the item analysis suggested that the Algebra 1 and Geometry items were generally easy for the population of interest, while the PreCalculus items presented more of a challenge. Furthermore, the mathematics placement test was optimized by removing eleven items from the Algebra 1 factor and two items from the PreCalculus factor. All Internal Consistency Reliability estimates remained strong and ranged from .736 to .950. Finally, Hierarchical Multiple Linear Regressions were used to examine the relationship between students’ total and factor scores from the mathematics placement test with students’ performance in their first semester mathematics course. Findings from the four Hierarchical Multiple Linear Regressions demonstrate that the total score students’ receive on the mathematics placement test predicts their achievement in their initial mathematics course, above and beyond the contributions of their demographic information and previous academic background. More specifically, the Algebra 1 Factor Score from the mathematics placement test was the strongest predictor of student success among the lower level mathematics courses (i.e., Mathematical Investigations I or II). Similarly, both the Algebra 1 and PreCalculus Factor Scores from the mathematics placement test were significant predictors of students’ grades in their first upper level mathematics course (i.e., Mathematical Investigations III or IV), providing evidence of Predictive Validity. The current mathematics placement test and procedures appear appropriate for the population of interest given the empirical evidence demonstrated in this research study regarding the psychometric properties of the exam. The continued use of the revised mathematics placement test in the course placement decision-making process is advisable

    Training Manual on Advanced Analytical Tools for Social Science Research Vol.1

    Get PDF
    Applying appropriate analytical techniques form the backbone of any research endeavor in agriculture, fisheries, and allied sciences. Without proper knowledge of applying statistical/econometric tools, software, and derivation of inferences from the same, it would not be possible to gather relevant interpretations of the investigation. Hence, the importance of well-designed data collection protocol, analysis, and interpretation cannot be underestimated. Such inferences form the basis of sound policy planning and resource management. Technology advancements and the development of analytical software have made the data analysis process less laborious. A basic understanding of the application of advanced analytical tools and their interpretation increases the productivity and efficiency of social science researchers engaged in agriculture/animal/fisheries science research. Hence the Winter School on Advanced Analytical Tools for Social Science Research is designed to enhance the analytical skills of social science researchers from NARES by allowing them to familiarize with advanced analytical procedures and their practical applications. This Winter School is a step towards familiarizing recent analytical techniques in social science to derive quality research outputs. The course is designed to acquaint the participants with areas such as exploratory data analysis, sampling techniques, data classificatory techniques, non–parametric methods, econometric analysis, and time series modeling, etc. Lectures on GIS/Spatial modeling, scaling techniques, data mining and big data analytics, machine learning techniques, and ecosystem evaluation have also been touched upon. The course is more practical-oriented, with a greater emphasis on interpreting the results. It employs a combination of lectures and exercises using statistical software

    Vol. 15, No. 1 (Full Issue)

    Get PDF

    Essentials of Business Analytics

    Get PDF

    Vol. 1, No. 2 (Full Issue)

    Get PDF

    Feature Screening of Ultrahigh Dimensional Feature Spaces With Applications in Interaction Screening

    Get PDF
    Data for which the number of predictors exponentially exceeds the number of observations is becoming increasingly prevalent in fields such as bioinformatics, medical imaging, computer vision, And social network analysis. One of the leading questions statisticians must answer when confronted with such “big data” is how to reduce a set of exponentially many predictors down to a set of a mere few predictors which have a truly causative effect on the response being modelled. This process is often referred to as feature screening. In this work we propose three new methods for feature screening. The first method we propose (TC-SIS) is specifically intended for use with data having both categorical response and predictors. The second method we propose (JCIS) is meant for feature screening for interactions between predictors. JCIS is rare among interaction screening methods in that it does not require first finding a set of causative main effects before screening for interactive effects. Our final method (GenCorr) is intended for use with data having a multivariate response. GenCorr is the only method for multivariate screening which can screen for both causative main effects and causative interactions. Each of these aforementioned methods will be shown to possess both theoretical robustness as well as empirical agility
    corecore