3 research outputs found

    Distill Gold from Massive Ores: Efficient Dataset Distillation via Critical Samples Selection

    Full text link
    Data-efficient learning has drawn significant attention, especially given the current trend of large multi-modal models, where dataset distillation can be an effective solution. However, the dataset distillation process itself is still very inefficient. In this work, we model the distillation problem with reference to information theory. Observing that severe data redundancy exists in dataset distillation, we argue to put more emphasis on the utility of the training samples. We propose a family of methods to exploit the most valuable samples, which is validated by our comprehensive analysis of the optimal data selection. The new strategy significantly reduces the training cost and extends a variety of existing distillation algorithms to larger and more diversified datasets, e.g. in some cases only 0.04% training data is sufficient for comparable distillation performance. Moreover, our strategy consistently enhances the performance, which may open up new analyses on the dynamics of distillation and networks. Our method is able to extend the distillation algorithms to much larger-scale datasets and more heterogeneous datasets, e.g. ImageNet-1K and Kinetics-400. Our code will be made publicly available

    A Short-Term Photovoltaic Power Forecasting Method Combining a Deep Learning Model with Trend Feature Extraction and Feature Selection

    No full text
    High precision short-term photovoltaic (PV) power prediction can reduce the damage associated with large-scale photovoltaic grid-connection to the power system. In this paper, a combination deep learning forecasting method based on variational mode decomposition (VMD), a fast correlation-based filter (FCBF) and bidirectional long short-term memory (BiLSTM) network is developed to minimize PV power forecasting error. In this model, VMD is used to extract the trend feature of PV power, then FCBF is adopted to select the optimal input-set to reduce the forecasting error caused by the redundant feature. Finally, the input-set is put into the BiLSTM network for training and testing. The performance of this model is tested by a case study using the public data-set provided by a PV station in Australia. Comparisons with common short-term PV power forecasting models are also presented. The results show that under the processing of trend feature extraction and feature selection, the proposed methodology provides a more stable and accurate forecasting effect than other forecasting models

    Spatial transcriptomic profiling of isolated microregions in tissue sections utilizing laser-induced forward transfer.

    No full text
    Profiling gene expression while preserving cell locations aids in the comprehensive understanding of cell fates in multicellular organisms. However, simple and flexible isolation of microregions of interest (mROIs) for spatial transcriptomics is still challenging. We present a laser-induced forward transfer (LIFT)-based method combined with a full-length mRNA-sequencing protocol (LIFT-seq) for profiling region-specific tissues. LIFT-seq demonstrated that mROIs from two adjacent sections could reliably and sensitively detect and display gene expression. In addition, LIFT-seq can identify region-specific mROIs in the mouse cortex and hippocampus. Finally, LIFT-seq identified marker genes in different layers of the cortex with very similar expression patterns. These genes were then validated using in situ hybridization (ISH) results. Therefore, LIFT-seq will be a valuable and efficient technique for profiling the spatial transcriptome in various tissues
    corecore