126 research outputs found
Class-Imbalanced Semi-Supervised Learning for Large-Scale Point Cloud Semantic Segmentation via Decoupling Optimization
Semi-supervised learning (SSL), thanks to the significant reduction of data
annotation costs, has been an active research topic for large-scale 3D scene
understanding. However, the existing SSL-based methods suffer from severe
training bias, mainly due to class imbalance and long-tail distributions of the
point cloud data. As a result, they lead to a biased prediction for the tail
class segmentation. In this paper, we introduce a new decoupling optimization
framework, which disentangles feature representation learning and classifier in
an alternative optimization manner to shift the bias decision boundary
effectively. In particular, we first employ two-round pseudo-label generation
to select unlabeled points across head-to-tail classes. We further introduce
multi-class imbalanced focus loss to adaptively pay more attention to feature
learning across head-to-tail classes. We fix the backbone parameters after
feature learning and retrain the classifier using ground-truth points to update
its parameters. Extensive experiments demonstrate the effectiveness of our
method outperforming previous state-of-the-art methods on both indoor and
outdoor 3D point cloud datasets (i.e., S3DIS, ScanNet-V2, Semantic3D, and
SemanticKITTI) using 1% and 1pt evaluation
A New Approach to Extract Fetal Electrocardiogram Using Affine Combination of Adaptive Filters
The detection of abnormal fetal heartbeats during pregnancy is important for
monitoring the health conditions of the fetus. While adult ECG has made several
advances in modern medicine, noninvasive fetal electrocardiography (FECG)
remains a great challenge. In this paper, we introduce a new method based on
affine combinations of adaptive filters to extract FECG signals. The affine
combination of multiple filters is able to precisely fit the reference signal,
and thus obtain more accurate FECGs. We proposed a method to combine the Least
Mean Square (LMS) and Recursive Least Squares (RLS) filters. Our approach found
that the Combined Recursive Least Squares (CRLS) filter achieves the best
performance among all proposed combinations. In addition, we found that CRLS is
more advantageous in extracting FECG from abdominal electrocardiograms (AECG)
with a small signal-to-noise ratio (SNR). Compared with the state-of-the-art
MSF-ANC method, CRLS shows improved performance. The sensitivity, accuracy and
F1 score are improved by 3.58%, 2.39% and 1.36%, respectively.Comment: 5 pages, 4 figures, 3 table
Spatio-Temporal AU Relational Graph Representation Learning For Facial Action Units Detection
This paper presents our Facial Action Units (AUs) recognition submission to
the fifth Affective Behavior Analysis in-the-wild Competition (ABAW). Our
approach consists of three main modules: (i) a pre-trained facial
representation encoder which produce a strong facial representation from each
input face image in the input sequence; (ii) an AU-specific feature generator
that specifically learns a set of AU features from each facial representation;
and (iii) a spatio-temporal graph learning module that constructs a
spatio-temporal graph representation. This graph representation describes AUs
contained in all frames and predicts the occurrence of each AU based on both
the modeled spatial information within the corresponding face and the learned
temporal dynamics among frames. The experimental results show that our approach
outperformed the baseline and the spatio-temporal graph representation learning
allows our model to generate the best results among all ablated systems. Our
model ranks at the 4th place in the AU recognition track at the 5th ABAW
Competition
ScribFormer: Transformer Makes CNN Work Better for Scribble-based Medical Image Segmentation
Most recent scribble-supervised segmentation methods commonly adopt a CNN
framework with an encoder-decoder architecture. Despite its multiple benefits,
this framework generally can only capture small-range feature dependency for
the convolutional layer with the local receptive field, which makes it
difficult to learn global shape information from the limited information
provided by scribble annotations. To address this issue, this paper proposes a
new CNN-Transformer hybrid solution for scribble-supervised medical image
segmentation called ScribFormer. The proposed ScribFormer model has a
triple-branch structure, i.e., the hybrid of a CNN branch, a Transformer
branch, and an attention-guided class activation map (ACAM) branch.
Specifically, the CNN branch collaborates with the Transformer branch to fuse
the local features learned from CNN with the global representations obtained
from Transformer, which can effectively overcome limitations of existing
scribble-supervised segmentation methods. Furthermore, the ACAM branch assists
in unifying the shallow convolution features and the deep convolution features
to improve model's performance further. Extensive experiments on two public
datasets and one private dataset show that our ScribFormer has superior
performance over the state-of-the-art scribble-supervised segmentation methods,
and achieves even better results than the fully-supervised segmentation
methods. The code is released at https://github.com/HUANGLIZI/ScribFormer.Comment: Accepted by IEEE Transactions on Medical Imaging (TMI
ScribFormer: Transformer Makes CNN Work Better for Scribble-based Medical Image Segmentation
Most recent scribble-supervised segmentation methods commonly adopt a CNN framework with an encoder-decoder architecture. Despite its multiple benefits, this framework generally can only capture small-range feature dependency for the convolutional layer with the local receptive field, which makes it difficult to learn global shape information from the limited information provided by scribble annotations. To address this issue, this paper proposes a new CNN-Transformer hybrid solution for scribble-supervised medical image segmentation called ScribFormer. The proposed ScribFormer model has a triple-branch structure, i.e., the hybrid of a CNN branch, a Transformer branch, and an attention-guided class activation map (ACAM) branch. Specifically, the CNN branch collaborates with the Transformer branch to fuse the local features learned from CNN with the global representations obtained from Transformer, which can effectively overcome limitations of existing scribble-supervised segmentation methods. Furthermore, the ACAM branch assists in unifying the shallow convolution features and the deep convolution features to improve model’s performance further. Extensive experiments on two public datasets and one private dataset show that our ScribFormer has superior performance over the state-of-the-art scribble-supervised segmentation methods, and achieves even better results than the fully-supervised segmentation methods. The code is released at https://github.com/HUANGLIZI/ScribFormer
Recommended from our members
Electroplating lithium transition metal oxides.
Materials synthesis often provides opportunities for innovation. We demonstrate a general low-temperature (260°C) molten salt electrodeposition approach to directly electroplate the important lithium-ion (Li-ion) battery cathode materials LiCoO2, LiMn2O4, and Al-doped LiCoO2. The crystallinities and electrochemical capacities of the electroplated oxides are comparable to those of the powders synthesized at much higher temperatures (700° to 1000°C). This new growth method significantly broadens the scope of battery form factors and functionalities, enabling a variety of highly desirable battery properties, including high energy, high power, and unprecedented electrode flexibility
Multimodal mechanisms of pathogenic variants in the signal peptide of FIX leading to hemophilia B
Signal peptide (SP) is essential for protein secretion, and pathogenic variants in the SP of factor IX (FIX) have been identified in hemophilia B (HB). However, the underlying mechanism for the genotype-phenotype correlation of these variants has not been well studied. Here, we systematically examined the effects of 13 pathogenic point variants in the SP of FIX using different approaches. Our results showed that these point variants lead to HB by missense variants and/or aberrant premessenger RNA (pre-mRNA) splicing. The missense variants in a hydrophobic core (h-region) mainly affected the cotranslational translocation function of the SP, and those in C-terminal containing cleavage site (c-region) caused FIX deficiency mainly by disturbing the cotranslational translocation and/or cleavage of the SP. Almost absolute aberrant pre-mRNA splicing was only observed in variants of c.82T\u3eG, but a slight change of splicing patterns was found in variants of c.53G\u3eT, c.77C\u3eA, c.82T\u3eC, and c.83G\u3eA, indicating that these variants might have different degrees of impact on pre-mRNA splicing. Although two 6-nt deletion aberrant pre-mRNA splicing products caused FIX deficiency by disturbing the SP cleavage, they could produce some functional mature FIX, and vitamin K could increase the secretion of functional FIX. Taken together, our data indicated that pathogenic variants in the SP of FIX caused HB through diverse molecular mechanisms or even a mixture of several mechanisms, and vitamin K availability could be partially attributed to varying bleeding tendencies in patients carrying the same variant in the SP
- …