Latent Fisher Discriminant Analysis

Author: Chen Gang
Publication date: 20/09/2013

Linear Discriminant Analysis (LDA) is a well-known method for dimensionality reduction and classification. Previous studies have also extended the binary-class case to the multi-class case. However, many applications, such as object detection and keyframe extraction, cannot provide consistent instance-label pairs, while LDA requires instance-level labels for training. Thus it cannot be directly applied to semi-supervised classification problems. In this paper, we overcome this limitation and propose a latent variable Fisher discriminant analysis model. We relax instance-level labeling to bag-level labeling, which is a kind of semi-supervision (video-level labels of event type are required for semantic frame extraction), and incorporate a data-driven prior over the latent variables. Hence, our method combines latent variable inference and dimension reduction in a unified Bayesian framework. We test our method on the MUSK and Corel data sets and obtain competitive results compared to the baseline approach. We also demonstrate its capacity on the challenging TRECVID MED11 dataset for semantic keyframe extraction and conduct a human-factors ranking-based experimental evaluation, which clearly demonstrates that our proposed method consistently extracts more semantically meaningful keyframes than challenging baselines. Comment: 12 pages
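For orientation, the classical two-class Fisher discriminant that this abstract builds on can be sketched as follows. This is a minimal illustration of the standard supervised method only, which needs a label for every instance; it is not the latent, bag-level model the paper proposes. The toy 2-D points and all function names are assumptions made for the example.

```python
# Two-class Fisher discriminant in 2-D, standard library only.
# Toy data and all names are illustrative assumptions, not from the paper.

def mean(points):
    n = len(points)
    return [sum(p[0] for p in points) / n, sum(p[1] for p in points) / n]

def scatter(points, m):
    # 2x2 scatter matrix: sum of outer products of the centred points.
    s = [[0.0, 0.0], [0.0, 0.0]]
    for p in points:
        d = [p[0] - m[0], p[1] - m[1]]
        for i in range(2):
            for j in range(2):
                s[i][j] += d[i] * d[j]
    return s

def fisher_direction(class0, class1):
    m0, m1 = mean(class0), mean(class1)
    s0, s1 = scatter(class0, m0), scatter(class1, m1)
    # Within-class scatter S_W = S_0 + S_1.
    sw = [[s0[i][j] + s1[i][j] for j in range(2)] for i in range(2)]
    det = sw[0][0] * sw[1][1] - sw[0][1] * sw[1][0]
    inv = [[sw[1][1] / det, -sw[0][1] / det],
           [-sw[1][0] / det, sw[0][0] / det]]
    diff = [m1[0] - m0[0], m1[1] - m0[1]]
    # Fisher direction w = S_W^{-1} (m1 - m0).
    w = [inv[0][0] * diff[0] + inv[0][1] * diff[1],
         inv[1][0] * diff[0] + inv[1][1] * diff[1]]
    return w, m0, m1

def project(w, p):
    return w[0] * p[0] + w[1] * p[1]

class0 = [(1.0, 2.0), (2.0, 3.0), (3.0, 3.0), (2.0, 1.0)]
class1 = [(6.0, 5.0), (7.0, 8.0), (8.0, 7.0), (7.0, 6.0)]

w, m0, m1 = fisher_direction(class0, class1)
# Classify by comparing the 1-D projection with the midpoint of the
# projected class means.
threshold = (project(w, m0) + project(w, m1)) / 2
predictions = [int(project(w, p) > threshold) for p in class0 + class1]
print(predictions)
```

Every training point here carries its own class label, which is exactly the requirement the abstract says is unavailable in bag-level settings such as video-level event labels.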

arXiv.org e-Print Archive

CiteSeerX

Financial Literacy: The Impact of Financial Training in High School on the Credit Behavior of College Students

Author: Tenaglia Lisa
Publication venue: Bryant Digital Repository
Publication date: 01/04/2010

Managing credit is increasingly important not only for adults, but also for college students. In recent years, with skyrocketing tuition and easily available credit, college students find themselves with increasing debt burdens that result in serious and lasting financial problems. In response, financial literacy programs are emerging in hopes that better educated people will make healthy financial decisions, as well as responsibly manage credit. Research suggests that financial education should begin in high school so that young adults can effectively manage credit during the college years. This study assesses both college students’ financial knowledge and their credit management practices. Specifically, it examines whether Bryant University students retain and use the financial training from high school when making financial decisions and managing credit. The findings from this study illustrate that almost 75% of the 345 students who manage their own credit in college received financial training in high school and that, although this training is negatively correlated with poor credit management behavior in college, the association is weak. This study further suggests that even with additional financial literacy training available in college, almost 60% of these students demonstrate poor credit management behavior. As a result, this study suggests that young people need to improve their credit management skills by setting budgets and employing good credit management techniques.

DigitalCommons@Bryant University

Optimal classifier selection and negative bias in error rate estimation: An empirical study on high-dimensional prediction

Authors: Boulesteix Anne-Laure; Strobl Carolin
Publication date: 01/01/2009

In biometric practice, researchers often apply a large number of different methods in a "trial-and-error" strategy to get as much as possible out of their data and, due to publication pressure or pressure from the consulting customer, present only the most favorable results. This strategy may induce a substantial optimistic bias in prediction error estimation, which is quantitatively assessed in the present manuscript. The focus of our work is on class prediction based on high-dimensional data (e.g. microarray data), since such analyses are particularly exposed to this kind of bias. In our study we consider a total of 124 variants of classifiers (possibly including variable selection or tuning steps) within a cross-validation evaluation scheme. The classifiers are applied to original and modified real microarray data sets, some of which are obtained by randomly permuting the class labels to mimic non-informative predictors while preserving their correlation structure. We then assess the minimal misclassification rate over the different variants of classifiers in order to quantify the bias arising when the optimal classifier is selected a posteriori in a data-driven manner. The bias resulting from the parameter tuning (including gene selection parameters as a special case) and the bias resulting from the choice of the classification method are examined both separately and jointly. We conclude that the strategy of presenting only the optimal result is not acceptable, and suggest alternative approaches for properly reporting classification accuracy.
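The selection effect described in this abstract can be illustrated with a small simulation. This is a sketch under stated assumptions, not the authors' protocol: the data are synthetic Gaussian noise rather than microarray data, each "variant" is a hypothetical nearest-centroid rule on a single feature (standing in for the 124 classifier variants), and the sample size, fold count, and seed are arbitrary choices.

```python
import random

def cv_error(features, labels, feature_idx, folds=5):
    """Cross-validated error of a nearest-centroid rule on one feature."""
    n = len(labels)
    errors = 0
    for f in range(folds):
        test = [i for i in range(n) if i % folds == f]
        train = [i for i in range(n) if i % folds != f]
        # Class centroids are estimated on the training fold only.
        centroid = {}
        for k in (0, 1):
            vals = [features[i][feature_idx] for i in train if labels[i] == k]
            centroid[k] = sum(vals) / len(vals)
        for i in test:
            x = features[i][feature_idx]
            pred = 0 if abs(x - centroid[0]) <= abs(x - centroid[1]) else 1
            errors += pred != labels[i]
    return errors / n

rng = random.Random(0)   # fixed seed (assumption) for repeatability
n_samples = 40           # assumed sample size
n_variants = 50          # assumed number of candidate "classifiers"

# Pure-noise features: no feature carries information about the label.
features = [[rng.gauss(0.0, 1.0) for _ in range(n_variants)]
            for _ in range(n_samples)]
# Balanced labels in random order, mimicking permuted class labels.
labels = [0] * (n_samples // 2) + [1] * (n_samples // 2)
rng.shuffle(labels)

errors = [cv_error(features, labels, j) for j in range(n_variants)]
mean_error = sum(errors) / len(errors)
min_error = min(errors)  # the figure reported if only the best is shown

print(f"mean CV error over variants: {mean_error:.3f}")
print(f"min CV error over variants:  {min_error:.3f}")
```

Because the labels carry no signal, no variant can truly beat chance, yet the minimum over variants is never above their average. The gap between the two numbers is a rough picture of the optimistic bias that arises when the best classifier is picked a posteriori.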

Springer - Publisher Connector

Directory of Open Access Journals

Open Access LMU

PubMed Central

Statistical Workflow for Feature Selection in Human Metabolomics Data.

Authors: Antonelli Joseph; Cheng Susan; Claggett Brian L; Demler Olga V; Deng Katherine; Henglin Mir; Hushcha Pavel V; Jain Mohit; Kim Andy; Kim Nicole; Lagerborg Kim A; Mora Samia; Niiranen Teemu J; Ovsak Gavin; Pereira Alexandre C; Rao Kevin; Tyagi Octavia; Watrous Jeramie D
Publication venue: eScholarship, University of California
Publication date: 01/07/2019

High-throughput metabolomics investigations, when conducted in large human cohorts, represent a potentially powerful tool for elucidating the biochemical diversity underlying human health and disease. Large-scale metabolomics data sources, generated using either targeted or nontargeted platforms, are becoming more common. Appropriate statistical analysis of these complex high-dimensional data will be critical for extracting meaningful results from such large-scale human metabolomics studies. Therefore, we consider the statistical analytical approaches that have been employed in prior human metabolomics studies. Based on the lessons learned and collective experience to date in the field, we offer a step-by-step framework for pursuing statistical analyses of cohort-based human metabolomics data, with a focus on feature selection. We discuss the range of options and approaches that may be employed at each stage of data management, analysis, and interpretation and offer guidance on the analytical decisions that need to be considered over the course of implementing a data analysis workflow. Certain pervasive analytical challenges facing the field warrant ongoing focused research. Addressing these challenges, particularly those related to analyzing human metabolomics data, will allow for more standardization of, as well as advances in, how research in the field is practiced. In turn, such major analytical advances will lead to substantial improvements in the overall contributions of human metabolomics investigations.

eScholarship - University of California