289 research outputs found
Muti-Scale And Token Mergence: Make Your ViT More Efficient
Since its inception, Vision Transformer (ViT) has emerged as a prevalent
model in the computer vision domain. Nonetheless, the multi-head self-attention
(MHSA) mechanism in ViT is computationally expensive due to its calculation of
relationships among all tokens. Although some techniques mitigate computational
overhead by discarding tokens, this also results in the loss of potential
information from those tokens. To tackle these issues, we propose a novel token
pruning method that retains information from non-crucial tokens by merging them
with more crucial tokens, thereby mitigating the impact of pruning on model
performance. Crucial and non-crucial tokens are identified by their importance
scores and merged based on similarity scores. Furthermore, multi-scale features
are exploited to represent images, which are fused prior to token pruning to
produce richer feature representations. Importantly, our method can be
seamlessly integrated with various ViTs, enhancing their adaptability.
Experimental evidence substantiates the efficacy of our approach in reducing
the influence of token pruning on model performance. For instance, on the
ImageNet dataset, it achieves a remarkable 33% reduction in computational costs
while only incurring a 0.1% decrease in accuracy on DeiT-S
Forced Oscillation Source Location via Multivariate Time Series Classification
Precisely locating low-frequency oscillation sources is the prerequisite of
suppressing sustained oscillation, which is an essential guarantee for the
secure and stable operation of power grids. Using synchrophasor measurements, a
machine learning method is proposed to locate the source of forced oscillation
in power systems. Rotor angle and active power of each power plant are utilized
to construct multivariate time series (MTS). Applying Mahalanobis distance
metric and dynamic time warping, the distance between MTS with different phases
or lengths can be appropriately measured. The obtained distance metric,
representing characteristics during the transient phase of forced oscillation
under different disturbance sources, is used for offline classifier training
and online matching to locate the disturbance source. Simulation results using
the four-machine two-area system and IEEE 39-bus system indicate that the
proposed location method can identify the power system forced oscillation
source online with high accuracy.Comment: 5 pages, 3 figures. Accepted by 2018 IEEE/PES Transmission and
Distribution Conferenc
Temporal Subtyping of Alzheimer's Disease Using Medical Conditions Preceding Alzheimer's Disease Onset in Electronic Health Records
Subtyping of Alzheimer's disease (AD) can facilitate diagnosis, treatment,
prognosis and disease management. It can also support the testing of new
prevention and treatment strategies through clinical trials. In this study, we
employed spectral clustering to cluster 29,922 AD patients in the OneFlorida
Data Trust using their longitudinal EHR data of diagnosis and conditions into
four subtypes. These subtypes exhibit different patterns of progression of
other conditions prior to the first AD diagnosis. In addition, according to the
results of various statistical tests, these subtypes are also significantly
different with respect to demographics, mortality, and prescription medications
after the AD diagnosis. This study could potentially facilitate early detection
and personalized treatment of AD as well as data-driven generalizability
assessment of clinical trials for AD.Comment: 10 page
- …