3 research outputs found
Distill Gold from Massive Ores: Efficient Dataset Distillation via Critical Samples Selection
Data-efficient learning has drawn significant attention, especially given the
current trend of large multi-modal models, where dataset distillation can be an
effective solution. However, the dataset distillation process itself is still
very inefficient. In this work, we model the distillation problem with
reference to information theory. Observing that severe data redundancy exists
in dataset distillation, we argue to put more emphasis on the utility of the
training samples. We propose a family of methods to exploit the most valuable
samples, which is validated by our comprehensive analysis of the optimal data
selection. The new strategy significantly reduces the training cost and extends
a variety of existing distillation algorithms to larger and more diversified
datasets, e.g. in some cases only 0.04% training data is sufficient for
comparable distillation performance. Moreover, our strategy consistently
enhances the performance, which may open up new analyses on the dynamics of
distillation and networks. Our method is able to extend the distillation
algorithms to much larger-scale datasets and more heterogeneous datasets, e.g.
ImageNet-1K and Kinetics-400. Our code will be made publicly available
A Short-Term Photovoltaic Power Forecasting Method Combining a Deep Learning Model with Trend Feature Extraction and Feature Selection
High precision short-term photovoltaic (PV) power prediction can reduce the damage associated with large-scale photovoltaic grid-connection to the power system. In this paper, a combination deep learning forecasting method based on variational mode decomposition (VMD), a fast correlation-based filter (FCBF) and bidirectional long short-term memory (BiLSTM) network is developed to minimize PV power forecasting error. In this model, VMD is used to extract the trend feature of PV power, then FCBF is adopted to select the optimal input-set to reduce the forecasting error caused by the redundant feature. Finally, the input-set is put into the BiLSTM network for training and testing. The performance of this model is tested by a case study using the public data-set provided by a PV station in Australia. Comparisons with common short-term PV power forecasting models are also presented. The results show that under the processing of trend feature extraction and feature selection, the proposed methodology provides a more stable and accurate forecasting effect than other forecasting models
Spatial transcriptomic profiling of isolated microregions in tissue sections utilizing laser-induced forward transfer.
Profiling gene expression while preserving cell locations aids in the comprehensive understanding of cell fates in multicellular organisms. However, simple and flexible isolation of microregions of interest (mROIs) for spatial transcriptomics is still challenging. We present a laser-induced forward transfer (LIFT)-based method combined with a full-length mRNA-sequencing protocol (LIFT-seq) for profiling region-specific tissues. LIFT-seq demonstrated that mROIs from two adjacent sections could reliably and sensitively detect and display gene expression. In addition, LIFT-seq can identify region-specific mROIs in the mouse cortex and hippocampus. Finally, LIFT-seq identified marker genes in different layers of the cortex with very similar expression patterns. These genes were then validated using in situ hybridization (ISH) results. Therefore, LIFT-seq will be a valuable and efficient technique for profiling the spatial transcriptome in various tissues