24 research outputs found
Machine Unlearning Method Based On Projection Residual
Machine learning models (mainly neural networks) are used more and more in
real life. Users feed their data to the model for training. But these processes
are often one-way. Once trained, the model remembers the data. Even when data
is removed from the dataset, the effects of these data persist in the model.
With more and more laws and regulations around the world protecting data
privacy, it becomes even more important to make models forget this data
completely through machine unlearning.
This paper adopts the projection residual method based on Newton iteration
method. The main purpose is to implement machine unlearning tasks in the
context of linear regression models and neural network models. This method
mainly uses the iterative weighting method to completely forget the data and
its corresponding influence, and its computational cost is linear in the
feature dimension of the data. This method can improve the current machine
learning method. At the same time, it is independent of the size of the
training set. Results were evaluated by feature injection testing (FIT).
Experiments show that this method is more thorough in deleting data, which is
close to model retraining.Comment: This paper is accepted by DSAA2022. The 9th IEEE International
Conference on Data Science and Advanced Analytic
RL-MD: A Novel Reinforcement Learning Approach for DNA Motif Discovery
The extraction of sequence patterns from a collection of functionally linked
unlabeled DNA sequences is known as DNA motif discovery, and it is a key task
in computational biology. Several deep learning-based techniques have recently
been introduced to address this issue. However, these algorithms can not be
used in real-world situations because of the need for labeled data. Here, we
presented RL-MD, a novel reinforcement learning based approach for DNA motif
discovery task. RL-MD takes unlabelled data as input, employs a relative
information-based method to evaluate each proposed motif, and utilizes these
continuous evaluation results as the reward. The experiments show that RL-MD
can identify high-quality motifs in real-world data.Comment: This paper is accepted by DSAA2022. The 9th IEEE International
Conference on Data Science and Advanced Analytic
Exploring the Capabilities of ChatGPT in Ancient Chinese Translation and Person Name Recognition
ChatGPT's proficiency in handling modern standard languages suggests
potential for its use in understanding ancient Chinese. This paper explores
ChatGPT's capabilities on ancient Chinese via two tasks: translating ancient
Chinese to modern Chinese and recognizing ancient Chinese names. A comparison
of ChatGPT's output with human translations serves to evaluate its
comprehension of ancient Chinese. The findings indicate that: (1.)the
proficiency of ancient Chinese by ChatGPT is yet to reach a satisfactory level;
(2.) ChatGPT performs the best on ancient-to-modern translation when feeding
with three context sentences. To help reproduce our work, we display the python
code snippets used in this study.Comment: Technical repor
Revisiting the Role of Label Smoothing in Enhanced Text Sentiment Classification
Label smoothing is a widely used technique in various domains, such as text
classification, image classification and speech recognition, known for
effectively combating model overfitting. However, there is little fine-grained
analysis on how label smoothing enhances text sentiment classification. To fill
in the gap, this article performs a set of in-depth analyses on eight datasets
for text sentiment classification and three deep learning architectures:
TextCNN, BERT, and RoBERTa, under two learning schemes: training from scratch
and fine-tuning. By tuning the smoothing parameters, we can achieve improved
performance on almost all datasets for each model architecture. We further
investigate the benefits of label smoothing, finding that label smoothing can
accelerate the convergence of deep models and make samples of different labels
easily distinguishable.Comment: Technical Repor
Pose Guided Human Image Synthesis with Partially Decoupled GAN
Pose Guided Human Image Synthesis (PGHIS) is a challenging task of
transforming a human image from the reference pose to a target pose while
preserving its style. Most existing methods encode the texture of the whole
reference human image into a latent space, and then utilize a decoder to
synthesize the image texture of the target pose. However, it is difficult to
recover the detailed texture of the whole human image. To alleviate this
problem, we propose a method by decoupling the human body into several parts
(\eg, hair, face, hands, feet, \etc) and then using each of these parts to
guide the synthesis of a realistic image of the person, which preserves the
detailed information of the generated images. In addition, we design a
multi-head attention-based module for PGHIS. Because most convolutional neural
network-based methods have difficulty in modeling long-range dependency due to
the convolutional operation, the long-range modeling capability of attention
mechanism is more suitable than convolutional neural networks for pose transfer
task, especially for sharp pose deformation. Extensive experiments on
Market-1501 and DeepFashion datasets reveal that our method almost outperforms
other existing state-of-the-art methods in terms of both qualitative and
quantitative metrics.Comment: 16 pages, 14th Asian Conference on Machine Learning conferenc
A Hierarchical Model for the Ages of Galactic Halo White Dwarfs
In astrophysics, we often aim to estimate one or more parameters for each member object in a population and study the distribution of the fitted parameters across the population. In this paper, we develop novel methods that allow us to take advantage of existing software designed for such case-by-case analyses to simultaneously fit parameters of both the individual objects and the parameters that quantify their distribution across the population. Our methods are based on Bayesian hierarchical modelling which is known to produce parameter estimators for the individual objects that are on average closer to their true values than estimators based on case-by-case analyses. We verify this in the context of estimating ages of Galactic halo white dwarfs (WDs) via a series of simulation studies. Finally, we deploy our new techniques on optical and near-infrared photometry of ten candidate halo WDs to obtain estimates of their ages along with an estimate of the mean age of Galactic halo WDs of 12.11 +0.85-0.86 Gyr. Although this sample is small, our technique lays the ground work for large-scale studies using data from the Gaia mission