63 research outputs found

    Analysis of the rate of convergence of an over-parametrized deep neural network estimate learned by gradient descent

    Estimation of a regression function from independent and identically distributed random variables is considered. The $L_2$ error, integrated with respect to the design measure, is used as the error criterion. Over-parametrized deep neural network estimates are defined in which all the weights are learned by gradient descent. It is shown that the expected $L_2$ error of these estimates converges to zero at a rate close to $n^{-1/(1+d)}$ when the regression function is Hölder smooth with Hölder exponent $p \in [1/2, 1]$. For an interaction model, where the regression function is assumed to be a sum of Hölder smooth functions each of which depends on only $d^*$ of the $d$ components of the design variable, it is shown that these estimates achieve the corresponding $d^*$-dimensional rate of convergence.
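
    Purely as an illustration of the setting described above (not the paper's construction or proof technique), the following PyTorch sketch fits an over-parametrized network to i.i.d. regression data by plain full-batch gradient descent and then estimates the $L_2$ error with respect to the design measure by Monte Carlo; the architecture, learning rate, and step count are all assumptions.

        # Illustrative sketch only: over-parametrized regression by gradient descent.
        import torch

        torch.manual_seed(0)
        n, d = 200, 2                                  # sample size, input dimension
        X = torch.rand(n, d)                           # i.i.d. design
        m = lambda x: torch.sin(3 * x[:, 0]) + x[:, 1] # a smooth regression function (assumed)
        Y = m(X) + 0.1 * torch.randn(n)                # noisy observations

        # Over-parametrized: far more weights than samples.
        net = torch.nn.Sequential(
            torch.nn.Linear(d, 512), torch.nn.ReLU(),
            torch.nn.Linear(512, 512), torch.nn.ReLU(),
            torch.nn.Linear(512, 1),
        )
        opt = torch.optim.SGD(net.parameters(), lr=1e-2)

        for step in range(5000):                       # full-batch gradient descent
            opt.zero_grad()
            loss = ((net(X).squeeze(1) - Y) ** 2).mean()
            loss.backward()
            opt.step()

        # Monte Carlo estimate of the L2 error w.r.t. the design measure.
        X_test = torch.rand(10000, d)
        with torch.no_grad():
            l2_err = ((net(X_test).squeeze(1) - m(X_test)) ** 2).mean()
        print(f"estimated L2 error: {l2_err:.4f}")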

    Estimation of a function of low local dimensionality by deep neural networks

    Deep neural networks (DNNs) achieve impressive results on complicated tasks like object detection in images and speech recognition. Motivated by this practical success, there is now strong interest in establishing good theoretical properties of DNNs. Understanding their performance is a key challenge in describing for which tasks DNNs perform well and when they fail. The aim of this paper is to contribute to the current statistical theory of DNNs. We apply DNNs to high-dimensional data and show that least squares regression estimates based on DNNs achieve dimensionality reduction when the regression function has locally low dimensionality. Consequently, the rate of convergence of the estimate does not depend on the input dimension $d$, but on the local dimension $d^*$, and DNNs are able to circumvent the curse of dimensionality when $d^*$ is much smaller than $d$. In a simulation study we provide numerical experiments supporting our theoretical result and compare our estimate with other conventional nonparametric regression estimates. The performance of our estimates is also validated in experiments with real data.
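
    To make the notion of locally low dimensionality concrete, here is a hypothetical numpy construction (ours, not taken from the paper) of a function on a 10-dimensional domain that on each region of the input space depends on only one coordinate, so $d^* = 1$ while $d = 10$.

        # Illustrative only: a regression function with local dimensionality d* = 1.
        import numpy as np

        rng = np.random.default_rng(0)
        d = 10

        def m(x):
            # x: array of shape (n, d); on each region, m depends on one coordinate
            left = x[:, 0] < 0.5
            out = np.empty(len(x))
            out[left] = np.sin(5 * x[left, 1])    # depends only on coordinate 1 here
            out[~left] = np.cos(5 * x[~left, 2])  # depends only on coordinate 2 here
            return out

        X = rng.uniform(size=(1000, d))
        Y = m(X) + 0.1 * rng.standard_normal(1000)
        # Per the paper's result, a DNN least squares estimate fitted to (X, Y)
        # can attain a rate governed by d* = 1 rather than by d = 10.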

    Influence of nuclei segmentation on breast cancer malignancy classification

    Breast cancer is one of the most deadly cancers affecting middle-aged women. Accurate diagnosis and prognosis are crucial to reduce the high death rate. Nowadays there are numerous diagnostic tools for breast cancer diagnosis. In this paper we discuss the role of nuclei segmentation from fine needle aspiration biopsy (FNA) slides and its influence on malignancy classification. Classification of malignancy plays a very important role in the diagnosis of breast cancer. Of all cancer diagnostic tools, FNA slides provide the most valuable information about the cancer malignancy grade, which helps in choosing an appropriate treatment. This process involves assessing numerous nuclear features, so precise segmentation of nuclei is very important. In this work we compare three powerful segmentation approaches and test their impact on the classification of breast cancer malignancy. The studied approaches are level set segmentation, fuzzy c-means segmentation, and textural segmentation based on the co-occurrence matrix. Segmented nuclei were used to extract nuclear features for malignancy classification. For classification purposes four different classifiers were trained and tested on the previously extracted features: a Multilayer Perceptron (MLP), Self-Organizing Maps (SOM), a Principal Component-based Neural Network (PCA) and Support Vector Machines (SVM). The presented results show that level set segmentation yields the best results of the three compared approaches and leads to good feature extraction, with the lowest average error rate of 6.51% over the four classifiers. The best single performance was recorded for the multilayer perceptron, with an error rate of 3.07% using fuzzy c-means segmentation.
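
    As a flavor of one of the three compared approaches, the following numpy sketch implements plain fuzzy c-means on pixel intensities; the cluster count, fuzzifier, iteration budget, and synthetic data are illustrative assumptions, not the paper's settings.

        # Minimal fuzzy c-means sketch for intensity-based two-class segmentation.
        import numpy as np

        def fuzzy_cmeans(x, k=2, m=2.0, iters=100, seed=0):
            """Cluster 1-D samples x into k fuzzy clusters; returns memberships, centers."""
            rng = np.random.default_rng(seed)
            u = rng.dirichlet(np.ones(k), size=len(x))          # initial memberships
            for _ in range(iters):
                um = u ** m
                centers = um.T @ x / um.sum(axis=0)             # membership-weighted means
                dist = np.abs(x[:, None] - centers[None, :]) + 1e-12
                u = 1.0 / (dist ** (2 / (m - 1)))               # standard FCM update
                u /= u.sum(axis=1, keepdims=True)               # rows sum to one
            return u, centers

        # Example: segment a synthetic intensity histogram (hypothetical data).
        rng = np.random.default_rng(1)
        pixels = np.concatenate([rng.normal(0.3, 0.05, 5000),   # darker "nuclei" pixels
                                 rng.normal(0.8, 0.05, 15000)]) # brighter background
        u, centers = fuzzy_cmeans(pixels)
        labels = u.argmax(axis=1)                               # hard segmentation
        print("cluster centers:", centers)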

    Estimating quantiles in imperfect simulation models using conditional density estimation


    Estimation of extreme quantiles in a simulation model


    Invariant Pattern Recognition with Log-Polar Transform and Dual-Tree Complex Wavelet-Fourier Features

    In this paper, we propose a novel method for 2D pattern recognition that extracts features with the log-polar transform, the dual-tree complex wavelet transform (DTCWT), and the 2D fast Fourier transform (FFT2). Our new method is invariant to translation, rotation, and scaling of the input 2D pattern images in a multiresolution way, which is very important for invariant pattern recognition. Very low-resolution sub-bands lose important features of the pattern images, while very high-resolution sub-bands contain significant amounts of noise; intermediate-resolution sub-bands are therefore well suited for invariant pattern recognition. Experiments on a printed Chinese character dataset and a 2D aircraft dataset show that our new method outperforms two existing methods in most testing cases, across combinations of rotation angles, scaling factors, and noise levels in the input pattern images.
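
    The invariance mechanism here is essentially the classic Fourier-Mellin recipe: an FFT magnitude removes translation, a log-polar resampling turns rotation and scaling into shifts, and a second FFT magnitude removes those shifts. The sketch below implements only this backbone in numpy/scipy; it omits the paper's DTCWT stage and sub-band selection, and the grid sizes are assumptions.

        # Fourier-Mellin-style invariant features (backbone only, DTCWT omitted).
        import numpy as np
        from scipy.ndimage import map_coordinates

        def log_polar(img, n_r=64, n_theta=64):
            """Resample img onto a log-polar grid centered at the image center."""
            h, w = img.shape
            cy, cx = (h - 1) / 2, (w - 1) / 2
            rho = np.linspace(0, np.log(min(cy, cx)), n_r)
            theta = np.linspace(0, 2 * np.pi, n_theta, endpoint=False)
            R, T = np.meshgrid(np.exp(rho), theta, indexing="ij")
            coords = np.stack([cy + R * np.sin(T), cx + R * np.cos(T)])
            return map_coordinates(img, coords, order=1, mode="nearest")

        def invariant_features(img):
            spec = np.abs(np.fft.fftshift(np.fft.fft2(img)))  # translation-invariant
            lp = log_polar(np.log1p(spec))                    # rotation/scale -> shifts
            return np.abs(np.fft.fft2(lp)).ravel()            # shift-invariant

        img = np.random.default_rng(0).random((128, 128))     # stand-in pattern image
        print(invariant_features(img).shape)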

    Asymptotic confidence intervals for Poisson regression

    Let $(X,Y)$ be an $\mathbb{R}^d \times \mathbb{N}_0$-valued random vector where the conditional distribution of $Y$ given $X = x$ is a Poisson distribution with mean $m(x)$. We estimate $m$ by a local polynomial kernel estimate defined by maximizing a localized log-likelihood function. We use this estimate of $m(x)$ to estimate the conditional distribution of $Y$ given $X = x$ by a corresponding Poisson distribution and to construct confidence intervals of level $\alpha$ for $Y$ given $X = x$. Under mild regularity conditions on $m(x)$ and on the distribution of $X$ we show strong convergence of the integrated $L_1$ distance between the Poisson distribution and its estimate. We also demonstrate that the corresponding confidence interval has asymptotically (i.e., for sample size tending to infinity) level $\alpha$, and that the probability that the length of this confidence interval deviates from the optimal length by more than one converges to zero as the sample size tends to infinity.
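
    A minimal sketch of the idea, under the simplifying assumption of a local constant (degree-zero local polynomial) fit: in that special case the localized Poisson log-likelihood is maximized in closed form by a kernel-weighted mean, and an interval for $Y$ given $X = x$ is read off the quantiles of the fitted Poisson distribution. Bandwidth, kernel, level, and data are illustrative choices.

        # Local-constant kernel Poisson regression with a plug-in interval for Y.
        import numpy as np
        from scipy.stats import poisson

        rng = np.random.default_rng(0)
        m = lambda x: 2 + np.sin(2 * x) + x          # assumed true conditional mean
        X = rng.uniform(0, 3, size=500)
        Y = rng.poisson(m(X))

        def m_hat(x, h=0.2):
            w = np.exp(-0.5 * ((x - X) / h) ** 2)    # Gaussian kernel weights
            # argmax of sum_i w_i * (Y_i * log(lam) - lam) is the weighted mean:
            return w @ Y / w.sum()

        def interval(x, alpha=0.1):
            lam = m_hat(x)
            # central (1 - alpha) interval of the fitted Poisson distribution
            return poisson.ppf(alpha / 2, lam), poisson.ppf(1 - alpha / 2, lam)

        x0 = 1.5
        print(f"m_hat({x0}) = {m_hat(x0):.2f}, 90% interval = {interval(x0)}")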