
    Number of non-zero coefficients for each syndrome for the best glmnet model (α = .11, using all features).

    t: total, p: points, d: distances, ar: areas, an: angles.
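    As a reproducibility sketch rather than the authors' code, counts of this kind can be obtained from a multinomial elastic-net fit; the scikit-learn snippet below uses l1_ratio in place of glmnet's α, and all data and dimensions are placeholders.

```python
# Sketch of counting non-zero coefficients per syndrome from a multinomial
# elastic-net fit. scikit-learn's l1_ratio plays the role of glmnet's mixing
# parameter alpha (alpha = .11 in the paper); data and sizes are placeholders.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 500))          # stand-in for the derived features
y = rng.integers(0, 4, size=200)         # stand-in syndrome labels

model = LogisticRegression(penalty="elasticnet", solver="saga",
                           l1_ratio=0.11, C=1.0, max_iter=5000).fit(X, y)

# coef_ has shape (n_syndromes, n_features); count non-zeros per syndrome.
for label, n in zip(model.classes_, np.count_nonzero(model.coef_, axis=1)):
    print(f"syndrome {label}: {n} non-zero coefficients")
```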

    Importance plots, glmnet.

    Visualization of simultaneous classification for syndromes. For each syndrome, an importance plot (row I) and a plot visualizing classification features (row F) are provided. The importance plot assigns an importance with respect to classification to each point, as described in the text. Feature plots visualize absolute regression coefficients by the thickness of line segments (distances), the size of points (coordinates), the color of areas (areas; dark red more important than light red), and small triangles (angles; dark red more important than light red).
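    The encoding above can be mocked up along the following lines; this is a hypothetical matplotlib sketch, not the paper's plotting code, with random landmarks and coefficients, and the angle glyphs left out for brevity.

```python
# Hypothetical mock-up of the feature-plot encoding: |coefficient| mapped to
# line thickness for distances, marker size for points, and color intensity
# for areas. Landmarks and coefficients are random stand-ins.
import numpy as np
import matplotlib.pyplot as plt
from matplotlib.patches import Polygon

rng = np.random.default_rng(0)
pts = rng.uniform(size=(6, 2))            # placeholder landmark coordinates

fig, ax = plt.subplots()

# Distances: thicker segments indicate larger absolute coefficients.
for (i, j), coef in [((0, 1), 0.9), ((2, 3), 0.3)]:
    ax.plot([pts[i, 0], pts[j, 0]], [pts[i, 1], pts[j, 1]],
            "k-", linewidth=1 + 6 * coef)

# Points (coordinates): marker size scales with the absolute coefficient.
point_coefs = rng.uniform(size=len(pts))
ax.scatter(pts[:, 0], pts[:, 1], s=40 + 200 * point_coefs, zorder=3)

# Areas: darker red marks a more important triangle.
ax.add_patch(Polygon(pts[[0, 2, 4]], color=(0.8, 0.0, 0.0, 0.6)))

ax.set_aspect("equal")
plt.show()
```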

    Importance weighting.

    Illustration of the procedure to compute importance for point δ. Contributions of point p1, area of triangle t1, distance d1, and angle a1 (blue) are weighted according to their distance to δ (red). The distances to p1, centroid c1, midpoint m1, and vertex v1 are used for p1, t1, d1, and a1, respectively.
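    A minimal sketch of this computation, assuming a Gaussian distance decay (the paper defines its own weighting function; the anchors and coefficients below are hypothetical):

```python
# Sketch of the distance-weighted importance for a query location delta.
# Each feature contributes its absolute coefficient, attenuated by the
# distance from delta to the feature's anchor: the point itself (p1), the
# triangle centroid (c1), the segment midpoint (m1), or the angle vertex
# (v1). The Gaussian decay is an assumed stand-in for the paper's weighting.
import numpy as np

def importance(delta, anchors, coefs, sigma=1.0):
    """delta: (2,) query point; anchors: (n_features, 2) anchor locations;
    coefs: (n_features,) regression coefficients."""
    d = np.linalg.norm(anchors - delta, axis=1)   # distance to each anchor
    w = np.exp(-(d ** 2) / (2 * sigma ** 2))      # assumed Gaussian kernel
    return float(np.sum(np.abs(coefs) * w))

# Hypothetical anchors: a point, a centroid, a midpoint, and a vertex.
anchors = np.array([[0.0, 0.0], [0.5, 1.0], [1.0, 0.5], [2.0, 2.0]])
coefs = np.array([0.8, 0.5, 0.3, 0.1])
print(importance(np.array([0.2, 0.1]), anchors, coefs))
```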

    Confusion matrix for the best glmnet model, α = .11, using all features.

    Rows indicate the percentages of predicted syndromes for each of the syndromes in the study.
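    Row percentages of this kind can be computed from cross-validated predictions as in the sketch below; y_true and y_pred are placeholder labels, not the study's data.

```python
# Row-normalized confusion matrix: each row gives the percentage of cases
# of a true syndrome assigned to each predicted syndrome.
import numpy as np
from sklearn.metrics import confusion_matrix

y_true = [0, 0, 1, 1, 2, 2, 2]   # placeholder true syndrome labels
y_pred = [0, 1, 1, 1, 2, 0, 2]   # placeholder predicted labels

cm = confusion_matrix(y_true, y_pred)
row_pct = 100 * cm / cm.sum(axis=1, keepdims=True)
print(np.round(row_pct, 1))
```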

    Classification and Visualization Based on Derived Image Features: Application to Genetic Syndromes

    Data transformations prior to analysis may be beneficial in classification tasks. In this article we investigate a set of such transformations on 2D graph data derived from facial images and their effect on classification accuracy in a high-dimensional setting. These transformations are low-variance in the sense that each involves only a fixed, small number of input features. We show that classification accuracy can be improved when penalized regression techniques are employed, as compared to a principal component analysis (PCA) pre-processing step. In our data example, classification accuracy improves from 47% to 62% when switching from PCA to penalized regression. A second goal is to visualize the resulting classifiers. We develop importance plots highlighting the influence of coordinates in the original 2D space. Features used for classification are mapped to coordinates in the original images and combined into an importance measure for each pixel. These plots assist in assessing the plausibility of classifiers, the interpretation of classifiers, and the determination of the relative importance of different features.
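    The comparison described here, PCA pre-processing versus penalized regression on the raw derived features, can be sketched in scikit-learn as follows; the data, the number of components, and the hyperparameters are illustrative assumptions, not the paper's setup.

```python
# Sketch of the comparison in the abstract: PCA pre-processing followed by a
# classifier versus elastic-net-penalized regression on the derived features
# directly. Data, component count, and hyperparameters are illustrative.
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.decomposition import PCA
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
X = rng.normal(size=(150, 400))          # placeholder derived features
y = rng.integers(0, 3, size=150)         # placeholder syndrome labels

pca_clf = make_pipeline(PCA(n_components=20),
                        LogisticRegression(max_iter=2000))
enet_clf = LogisticRegression(penalty="elasticnet", solver="saga",
                              l1_ratio=0.11, max_iter=5000)

print("PCA + logistic:", cross_val_score(pca_clf, X, y, cv=5).mean())
print("elastic net:   ", cross_val_score(enet_clf, X, y, cv=5).mean())
```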

    Average misclassification error, glmnet.

    Average misclassification error with 95% confidence intervals across leave-one-out cross-validation for models with different values of the mixing parameter α. In (a), all features (red) and only points (blue) were used; in (b), all features and their squares (red) and only points and their squares (blue) were used.
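    A hedged sketch of such an α search, scored by leave-one-out cross-validation, follows; the grid, data, and model settings are placeholders, with scikit-learn's l1_ratio standing in for glmnet's α.

```python
# Sketch of the search over the mixing parameter alpha (l1_ratio in
# scikit-learn; alpha in glmnet) scored by leave-one-out cross-validation.
# Grid, data, and model settings are placeholders; the paper reports
# alpha = .11 as the best value.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import LeaveOneOut, cross_val_score

rng = np.random.default_rng(0)
X = rng.normal(size=(60, 100))
y = rng.integers(0, 3, size=60)

for alpha in [0.0, 0.11, 0.5, 1.0]:
    clf = LogisticRegression(penalty="elasticnet", solver="saga",
                             l1_ratio=alpha, max_iter=5000)
    acc = cross_val_score(clf, X, y, cv=LeaveOneOut()).mean()
    print(f"alpha = {alpha:.2f}  misclassification = {1 - acc:.3f}")
```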

    Importance plots, PCA.

    Visualizations analogous to figure 5 (http://www.plosone.org/article/info:doi/10.1371/journal.pone.0109033#pone-0109033-g005) for PCA-based classification.

    Illustration of data set.

    (a) Example of registered nodes. (b) Distances between coordinate pairs, excluding symmetries. Numbers 1 to 48 correspond to landmarks; red: pairwise edges, excluding symmetries; black: Delaunay triangulation. Example of symmetric distances: (25, 24) and (23, 24).
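    The derived feature types behind these figures, pairwise distances plus areas and angles from a Delaunay triangulation, can be computed along these lines; the landmarks below are random stand-ins, and the exclusion of symmetric pairs is a data-specific step not shown.

```python
# Sketch of deriving the feature types from registered 2D landmarks:
# pairwise distances between coordinate pairs, plus triangle areas and
# angles from a Delaunay triangulation. The 48 landmarks are random
# placeholders; excluding symmetric pairs is a data-specific step not shown.
import numpy as np
from itertools import combinations
from scipy.spatial import Delaunay

rng = np.random.default_rng(0)
pts = rng.normal(size=(48, 2))           # stand-in for the 48 landmarks

# Distances between all coordinate pairs.
dists = {(i, j): float(np.linalg.norm(pts[i] - pts[j]))
         for i, j in combinations(range(len(pts)), 2)}

# Areas and one interior angle per triangle of the Delaunay triangulation.
areas, angles = [], []
for a, b, c in Delaunay(pts).simplices:
    u, v = pts[b] - pts[a], pts[c] - pts[a]
    areas.append(0.5 * abs(u[0] * v[1] - u[1] * v[0]))   # triangle area
    cos_a = np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))
    angles.append(float(np.arccos(np.clip(cos_a, -1.0, 1.0))))  # angle at a
```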