Search CORE

3 research outputs found

Distill Gold from Massive Ores: Efficient Dataset Distillation via Critical Samples Selection

Author: Cui Kaitong
Li Yong-Lu
Lu Cewu
Tai Yu-Wing
Tang Chi-Keung
Wang Ziyu
Xu Yue
Publication venue
Publication date: 28/05/2023
Field of study

Data-efficient learning has drawn significant attention, especially given the current trend of large multi-modal models, where dataset distillation can be an effective solution. However, the dataset distillation process itself is still very inefficient. In this work, we model the distillation problem with reference to information theory. Observing that severe data redundancy exists in dataset distillation, we argue to put more emphasis on the utility of the training samples. We propose a family of methods to exploit the most valuable samples, which is validated by our comprehensive analysis of the optimal data selection. The new strategy significantly reduces the training cost and extends a variety of existing distillation algorithms to larger and more diversified datasets, e.g. in some cases only 0.04% training data is sufficient for comparable distillation performance. Moreover, our strategy consistently enhances the performance, which may open up new analyses on the dynamics of distillation and networks. Our method is able to extend the distillation algorithms to much larger-scale datasets and more heterogeneous datasets, e.g. ImageNet-1K and Kinetics-400. Our code will be made publicly available

arXiv.org e-Print Archive

A Short-Term Photovoltaic Power Forecasting Method Combining a Deep Learning Model with Trend Feature Extraction and Feature Selection

Author: Chun Sing Lai
Haoliang Yuan
Kaitong Wu
Loi Lei Lai
Wenbo Cui
Xiangang Peng
Zilu Li
Publication venue: 'MDPI AG'
Publication date: 27/07/2022
Field of study

High precision short-term photovoltaic (PV) power prediction can reduce the damage associated with large-scale photovoltaic grid-connection to the power system. In this paper, a combination deep learning forecasting method based on variational mode decomposition (VMD), a fast correlation-based filter (FCBF) and bidirectional long short-term memory (BiLSTM) network is developed to minimize PV power forecasting error. In this model, VMD is used to extract the trend feature of PV power, then FCBF is adopted to select the optimal input-set to reduce the forecasting error caused by the redundant feature. Finally, the input-set is put into the BiLSTM network for training and testing. The performance of this model is tested by a case study using the public data-set provided by a PV station in Australia. Comparisons with common short-term PV power forecasting models are also presented. The results show that under the processing of trend feature extraction and feature selection, the proposed methodology provides a more stable and accurate forecasting effect than other forecasting models

Multidisciplinary Digital Publishing Institute

Spatial transcriptomic profiling of isolated microregions in tissue sections utilizing laser-induced forward transfer.

Author: Jitao Xu
Kaiqiang Ye
Kaitong Dang
Qiannan Cui
Qingyang Qin
Qinyu Ge
Wanqing Chang
Xiangwei Zhao
Xiaofeng Han
Xiaolei Zhu
Yun Xu
Yunxia Guo
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2024
Field of study

Profiling gene expression while preserving cell locations aids in the comprehensive understanding of cell fates in multicellular organisms. However, simple and flexible isolation of microregions of interest (mROIs) for spatial transcriptomics is still challenging. We present a laser-induced forward transfer (LIFT)-based method combined with a full-length mRNA-sequencing protocol (LIFT-seq) for profiling region-specific tissues. LIFT-seq demonstrated that mROIs from two adjacent sections could reliably and sensitively detect and display gene expression. In addition, LIFT-seq can identify region-specific mROIs in the mouse cortex and hippocampus. Finally, LIFT-seq identified marker genes in different layers of the cortex with very similar expression patterns. These genes were then validated using in situ hybridization (ISH) results. Therefore, LIFT-seq will be a valuable and efficient technique for profiling the spatial transcriptome in various tissues

Directory of Open Access Journals