2 research outputs found
Evaluation of Non-linearity in MIR Spectroscopic Data for Compressed Learning
Mid-Infrared (MIR) spectroscopy has emerged as
the most economically viable technology to determine milk
values as well as to identify a set of animal phenotypes
related to health, feeding, well-being and environment.
However, Fourier transform-MIR spectra incurs a significant
amount of redundant data. This creates critical issues such
as increased learning complexity while performing Fog and
Cloud based data analytics in smart farming. These issues
can be resolved through data compression using unsupervisory
techniques like PCA, and perform analytics in the
compressed-domain i.e. without de-compressing. Compression
algorithms should preserve non-linearity of MIRS data
(if exists), since emerging advanced learning algorithms
can improve their prediction accuracy. This study has investigated
the non-linearity between the feature variables
in the measurement-domain as well as in two compressed
domains using standard Linear PCA and Kernel PCA. Also
the non-linearity between the feature variables and the
commonly used target milk quality parameters (Protein,
Lactose, Fat) has been analyzed. The study evaluates the
prediction accuracy using PLS and LS-SVM respectively as
linear and non-linear predictive models
Learning in the compressed data domain: Application to milk quality prediction
Smart dairy farming has become one of the most exciting and challenging area in cloud-based data analytics.
Transfer of raw data from all farms to a central cloud is currently not feasible as applications are generating
more data while internet connectivity is lacking in rural farms. As a solution, Fog computing has become a key
factor to process data near the farm and derive farm insights by exchanging data between on-farm applications
and transferring some data to the cloud. In this context, learning in the compressed data domain, where decompression
is not necessary, is highly desirable as it minimizes the energy used for communication/computation,
reduces required memory/storage, and improves application latency. Mid-infrared spectroscopy (MIRS) is used
globally to predict several milk quality parameters as well as deriving many animal-level phenotypes. Therefore,
compressed learning on MIRS data is beneficial both in terms of data processing in the Fog, as well as storing
large data sets in the cloud. In this paper, we used principal component analysis and wavelet transform as two
techniques for compressed learning to convert MIRS data into a compressed data domain. The study derives near
lossless compression parameters for both techniques to transform MIRS data without impacting the prediction
accuracy for a selection of milk quality traits