65 research outputs found
Performance Enhancement of Logistic Regression for Big Data on Spark
This research proposes a new ftting algorithm of logistic regression on IRWLS that utilizes the procedure of scanning data row-by-row and has the ability to acquire an exact result with only a few iterations. Furthermore, this research also realizes the distributed parallelization of the proposed method on Spark and conducts various experiments to manifest its memory-wise advantage over the traditional methods such as Spark MLlib package. The results show that the proposed method can provide an exact result rather than an approximated one within 5 or 6 iterations; achieve a satisfying accuracy for fight delay prediction within 1 or 2 iterations; has a better potential for parallelization and a better performance than MLlib with a 3-4x faster speed without full optimizations; and its performance is not undermined by an increasing data memory ratio
Harnessing machine learning for fiber-induced nonlinearity mitigation in long-haul coherent optical OFDM
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).Coherent optical orthogonal frequency division multiplexing (CO-OFDM) has attracted a lot of interest in optical fiber communications due to its simplified digital signal processing (DSP) units, high spectral-efficiency, flexibility, and tolerance to linear impairments. However, CO-OFDM’s high peak-to-average power ratio imposes high vulnerability to fiber-induced non-linearities. DSP-based machine learning has been considered as a promising approach for fiber non-linearity compensation without sacrificing computational complexity. In this paper, we review the existing machine learning approaches for CO-OFDM in a common framework and review the progress in this area with a focus on practical aspects and comparison with benchmark DSP solutions.Peer reviewe
Measuring Blood Glucose Concentrations in Photometric Glucometers Requiring Very Small Sample Volumes
Glucometers present an important self-monitoring tool for diabetes patients
and therefore must exhibit high accu- racy as well as good usability features.
Based on an invasive, photometric measurement principle that drastically
reduces the volume of the blood sample needed from the patient, we present a
framework that is capable of dealing with small blood samples, while
maintaining the required accuracy. The framework consists of two major parts:
1) image segmentation; and 2) convergence detection. Step 1) is based on
iterative mode-seeking methods to estimate the intensity value of the region of
interest. We present several variations of these methods and give theoretical
proofs of their convergence. Our approach is able to deal with changes in the
number and position of clusters without any prior knowledge. Furthermore, we
propose a method based on sparse approximation to decrease the computational
load, while maintaining accuracy. Step 2) is achieved by employing temporal
tracking and prediction, herewith decreasing the measurement time, and, thus,
improving usability. Our framework is validated on several real data sets with
different characteristics. We show that we are able to estimate the underlying
glucose concentration from much smaller blood samples than is currently
state-of-the- art with sufficient accuracy according to the most recent ISO
standards and reduce measurement time significantly compared to
state-of-the-art methods
Sequential Dimensionality Reduction for Extracting Localized Features
Linear dimensionality reduction techniques are powerful tools for image
analysis as they allow the identification of important features in a data set.
In particular, nonnegative matrix factorization (NMF) has become very popular
as it is able to extract sparse, localized and easily interpretable features by
imposing an additive combination of nonnegative basis elements. Nonnegative
matrix underapproximation (NMU) is a closely related technique that has the
advantage to identify features sequentially. In this paper, we propose a
variant of NMU that is particularly well suited for image analysis as it
incorporates the spatial information, that is, it takes into account the fact
that neighboring pixels are more likely to be contained in the same features,
and favors the extraction of localized features by looking for sparse basis
elements. We show that our new approach competes favorably with comparable
state-of-the-art techniques on synthetic, facial and hyperspectral image data
sets.Comment: 24 pages, 12 figures. New numerical experiments on synthetic data
sets, discussion about the convergenc
- …