368 research outputs found
Sequential Prediction of Social Media Popularity with Deep Temporal Context Networks
Prediction of popularity has profound impact for social media, since it
offers opportunities to reveal individual preference and public attention from
evolutionary social systems. Previous research, although achieves promising
results, neglects one distinctive characteristic of social data, i.e.,
sequentiality. For example, the popularity of online content is generated over
time with sequential post streams of social media. To investigate the
sequential prediction of popularity, we propose a novel prediction framework
called Deep Temporal Context Networks (DTCN) by incorporating both temporal
context and temporal attention into account. Our DTCN contains three main
components, from embedding, learning to predicting. With a joint embedding
network, we obtain a unified deep representation of multi-modal user-post data
in a common embedding space. Then, based on the embedded data sequence over
time, temporal context learning attempts to recurrently learn two adaptive
temporal contexts for sequential popularity. Finally, a novel temporal
attention is designed to predict new popularity (the popularity of a new
user-post pair) with temporal coherence across multiple time-scales.
Experiments on our released image dataset with about 600K Flickr photos
demonstrate that DTCN outperforms state-of-the-art deep prediction algorithms,
with an average of 21.51% relative performance improvement in the popularity
prediction (Spearman Ranking Correlation).Comment: accepted in IJCAI-1
HDIdx: High-Dimensional Indexing for Efficient Approximate Nearest Neighbor Search
Fast Nearest Neighbor (NN) search is a fundamental challenge in large-scale
data processing and analytics, particularly for analyzing multimedia contents
which are often of high dimensionality. Instead of using exact NN search,
extensive research efforts have been focusing on approximate NN search
algorithms. In this work, we present "HDIdx", an efficient high-dimensional
indexing library for fast approximate NN search, which is open-source and
written in Python. It offers a family of state-of-the-art algorithms that
convert input high-dimensional vectors into compact binary codes, making them
very efficient and scalable for NN search with very low space complexity
RIS-Aided MIMO Systems with Hardware Impairments: Robust Beamforming Design and Analysis
Reconfigurable intelligent surface (RIS) has been anticipated to be a novel
cost-effective technology to improve the performance of future wireless
systems. In this paper, we investigate a practical RIS-aided
multiple-input-multiple-output (MIMO) system in the presence of transceiver
hardware impairments, RIS phase noise and imperfect channel state information
(CSI). Joint design of the MIMO transceiver and RIS reflection matrix to
minimize the total average mean-square-error (MSE) of all data streams is
particularly considered. This joint design problem is non-convex and
challenging to solve due to the newly considered practical imperfections. To
tackle the issue, we first analyze the total average MSE by incorporating the
impacts of the above system imperfections. Then, in order to handle the tightly
coupled optimization variables and non-convex NP-hard constraints, an efficient
iterative algorithm based on alternating optimization (AO) framework is
proposed with guaranteed convergence, where each subproblem admits a
closed-form optimal solution by leveraging the majorization-minimization (MM)
technique. Moreover, via exploiting the special structure of the unit-modulus
constraints, we propose a modified Riemannian gradient ascent (RGA) algorithm
for the discrete RIS phase shift optimization. Furthermore, the optimality of
the proposed algorithm is validated under line-of-sight (LoS) channel
conditions, and the irreducible MSE floor effect induced by imperfections of
both hardware and CSI is also revealed in the high signal-to-noise ratio (SNR)
regime. Numerical results show the superior MSE performance of our proposed
algorithm over the adopted benchmark schemes, and demonstrate that increasing
the number of RIS elements is not always beneficial under the above system
imperfections.Comment: 30 pages, 8 figures. This paper has been submitted to IEEE journal
for possible publicatio
Deep Multi-View Spatial-Temporal Network for Taxi Demand Prediction
Taxi demand prediction is an important building block to enabling intelligent
transportation systems in a smart city. An accurate prediction model can help
the city pre-allocate resources to meet travel demand and to reduce empty taxis
on streets which waste energy and worsen the traffic congestion. With the
increasing popularity of taxi requesting services such as Uber and Didi Chuxing
(in China), we are able to collect large-scale taxi demand data continuously.
How to utilize such big data to improve the demand prediction is an interesting
and critical real-world problem. Traditional demand prediction methods mostly
rely on time series forecasting techniques, which fail to model the complex
non-linear spatial and temporal relations. Recent advances in deep learning
have shown superior performance on traditionally challenging tasks such as
image classification by learning the complex features and correlations from
large-scale data. This breakthrough has inspired researchers to explore deep
learning techniques on traffic prediction problems. However, existing methods
on traffic prediction have only considered spatial relation (e.g., using CNN)
or temporal relation (e.g., using LSTM) independently. We propose a Deep
Multi-View Spatial-Temporal Network (DMVST-Net) framework to model both spatial
and temporal relations. Specifically, our proposed model consists of three
views: temporal view (modeling correlations between future demand values with
near time points via LSTM), spatial view (modeling local spatial correlation
via local CNN), and semantic view (modeling correlations among regions sharing
similar temporal patterns). Experiments on large-scale real taxi demand data
demonstrate effectiveness of our approach over state-of-the-art methods.Comment: AAAI 2018 pape
Shengu'an exerts anti-osteoporotic effect in rats via TGFβ1-Smad2/3 signal pathway, and enhancement of bone and cartilage metabolism
Purpose: To study the anti-osteoporotic effect of Shengu'an in rats, and elucidate the mechanism of action involved.Methods: Forty healthy female SPF mice were randomly divided into control group, saline-treated group, TGFβRⅡ receptor inhibitor group, and shengu'an group. The expressions of type Ⅱ collagen (Co1-II) and platelet endothelial cell adhesion factor (CD-31) were determined. The expressions of transforming growth factor β1 (TGF-β1), p-smad2/3, matrix metalloproteinase-9 (MMP-9) and osteoblast specific transcription factor (osterix) were assayed by western blotting.Results: The expression of Co1-II in the vertebral body was significantly lower in model mice than in control mice, but was significantly higher in shengu'an mice when compared with model mice (p < 0.05). In shengu'an mice, CoI-I was markedly upregulated, relative to model mice, and the expressions of CD31 in TGFβRⅡreceptor inhibitor group and shengu'an group were lower than in model group (p < 0.05). There were significantly lower expressions of TGF-β1 and p-smad2/3 in the vertebral body of shengu'an group than in model mice, but osterix was upregulated relative to model mice (p < 0.05).Conclusion: Shengu'an exerts anti-osteoporotic effect by downregulating TGFβ/smad signal pathway. There is thus a potential for its clinical application in the management of osteoporosis.
Keywords: Shengu'an, TGFβ1-Smad2/3 signal, Bone cartilage metabolism, Osteoporosi
Seismic Foundation Model (SFM): a new generation deep learning model in geophysics
While computer science has seen remarkable advancements in foundation models,
which remain underexplored in geoscience. Addressing this gap, we introduce a
workflow to develop geophysical foundation models, including data preparation,
model pre-training, and adaption to downstream tasks. From 192 globally
collected 3-D seismic volumes, we create a carefully curated dataset of
2,286,422 2-D seismic images. Fully using these unlabeled images, we employ the
self-supervised learning to pre-train a Transformer-based Seismic Foundation
Model (SFM) for producing all-purpose seismic features that work across various
tasks and surveys. Through experiments on seismic facies classification,
geobody identification, interpolation, denoising, and inversion, our
pre-trained model demonstrates versatility, generalization, scalability, and
superior performance over baseline models. Conclusively, we provide a
foundation model and vast dataset to advance AI in geophysics, addressing
challenges (poor generalization, lacking labels, and repetitive training for
task-specified models) of applying AI in geophysics and paving the way for
future innovations in geoscience.Comment: 27 pages, 9 figures, and 4 table
U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation
Concept personalization methods enable large text-to-image models to learn
specific subjects (e.g., objects/poses/3D models) and synthesize renditions in
new contexts. Given that the image references are highly biased towards visual
attributes, state-of-the-art personalization models tend to overfit the whole
subject and cannot disentangle visual characteristics in pixel space. In this
study, we proposed a more challenging setting, namely fine-grained visual
appearance personalization. Different from existing methods, we allow users to
provide a sentence describing the desired attributes. A novel decoupled
self-augmentation strategy is proposed to generate target-related and
non-target samples to learn user-specified visual attributes. These augmented
data allow for refining the model's understanding of the target attribute while
mitigating the impact of unrelated attributes. At the inference stage,
adjustments are conducted on semantic space through the learned target and
non-target embeddings to further enhance the disentanglement of target
attributes. Extensive experiments on various kinds of visual attributes with
SOTA personalization methods show the ability of the proposed method to mimic
target visual appearance in novel contexts, thus improving the controllability
and flexibility of personalization.Comment: 14 pages, 13 figures, 2 table
Cognition Impairment Prior to Errors of Working Memory Based on Event-Related Potential
Cognitive impairment contributes to errors in different tasks. Poor attention and poor cognitive control are the two neural mechanisms for performance errors. A few studies have been conducted on the error mechanism of working memory. It is unclear whether the changes in memory updating, attention, and cognitive control can cause errors and, if so, whether they can be probed at the same time in one single task. Therefore, this study analyzed event-related potentials in a two-back working memory task. A total of 40 male participants finished the task. The differences between the error and the correct trials in amplitudes and latencies of N1, P2, N2, and P3 were analyzed. The P2 and P3 amplitudes decreased significantly in the error trials, while the N2 amplitude increased. The results showed that impaired attention, poor memory updating, and impaired cognitive control were consistently associated with the error in working memory. Furthermore, the results suggested that monitoring the neurophysiological characteristics associated with attention and cognitive control was important for studying the error mechanism and error prediction. The results also suggested that the P3 and N2 amplitudes could be used as indexes for error foreshadowing
- …