368 research outputs found

    Sequential Prediction of Social Media Popularity with Deep Temporal Context Networks

    Full text link
    Prediction of popularity has profound impact for social media, since it offers opportunities to reveal individual preference and public attention from evolutionary social systems. Previous research, although achieves promising results, neglects one distinctive characteristic of social data, i.e., sequentiality. For example, the popularity of online content is generated over time with sequential post streams of social media. To investigate the sequential prediction of popularity, we propose a novel prediction framework called Deep Temporal Context Networks (DTCN) by incorporating both temporal context and temporal attention into account. Our DTCN contains three main components, from embedding, learning to predicting. With a joint embedding network, we obtain a unified deep representation of multi-modal user-post data in a common embedding space. Then, based on the embedded data sequence over time, temporal context learning attempts to recurrently learn two adaptive temporal contexts for sequential popularity. Finally, a novel temporal attention is designed to predict new popularity (the popularity of a new user-post pair) with temporal coherence across multiple time-scales. Experiments on our released image dataset with about 600K Flickr photos demonstrate that DTCN outperforms state-of-the-art deep prediction algorithms, with an average of 21.51% relative performance improvement in the popularity prediction (Spearman Ranking Correlation).Comment: accepted in IJCAI-1

    HDIdx: High-Dimensional Indexing for Efficient Approximate Nearest Neighbor Search

    Get PDF
    Fast Nearest Neighbor (NN) search is a fundamental challenge in large-scale data processing and analytics, particularly for analyzing multimedia contents which are often of high dimensionality. Instead of using exact NN search, extensive research efforts have been focusing on approximate NN search algorithms. In this work, we present "HDIdx", an efficient high-dimensional indexing library for fast approximate NN search, which is open-source and written in Python. It offers a family of state-of-the-art algorithms that convert input high-dimensional vectors into compact binary codes, making them very efficient and scalable for NN search with very low space complexity

    RIS-Aided MIMO Systems with Hardware Impairments: Robust Beamforming Design and Analysis

    Full text link
    Reconfigurable intelligent surface (RIS) has been anticipated to be a novel cost-effective technology to improve the performance of future wireless systems. In this paper, we investigate a practical RIS-aided multiple-input-multiple-output (MIMO) system in the presence of transceiver hardware impairments, RIS phase noise and imperfect channel state information (CSI). Joint design of the MIMO transceiver and RIS reflection matrix to minimize the total average mean-square-error (MSE) of all data streams is particularly considered. This joint design problem is non-convex and challenging to solve due to the newly considered practical imperfections. To tackle the issue, we first analyze the total average MSE by incorporating the impacts of the above system imperfections. Then, in order to handle the tightly coupled optimization variables and non-convex NP-hard constraints, an efficient iterative algorithm based on alternating optimization (AO) framework is proposed with guaranteed convergence, where each subproblem admits a closed-form optimal solution by leveraging the majorization-minimization (MM) technique. Moreover, via exploiting the special structure of the unit-modulus constraints, we propose a modified Riemannian gradient ascent (RGA) algorithm for the discrete RIS phase shift optimization. Furthermore, the optimality of the proposed algorithm is validated under line-of-sight (LoS) channel conditions, and the irreducible MSE floor effect induced by imperfections of both hardware and CSI is also revealed in the high signal-to-noise ratio (SNR) regime. Numerical results show the superior MSE performance of our proposed algorithm over the adopted benchmark schemes, and demonstrate that increasing the number of RIS elements is not always beneficial under the above system imperfections.Comment: 30 pages, 8 figures. This paper has been submitted to IEEE journal for possible publicatio

    Deep Multi-View Spatial-Temporal Network for Taxi Demand Prediction

    Full text link
    Taxi demand prediction is an important building block to enabling intelligent transportation systems in a smart city. An accurate prediction model can help the city pre-allocate resources to meet travel demand and to reduce empty taxis on streets which waste energy and worsen the traffic congestion. With the increasing popularity of taxi requesting services such as Uber and Didi Chuxing (in China), we are able to collect large-scale taxi demand data continuously. How to utilize such big data to improve the demand prediction is an interesting and critical real-world problem. Traditional demand prediction methods mostly rely on time series forecasting techniques, which fail to model the complex non-linear spatial and temporal relations. Recent advances in deep learning have shown superior performance on traditionally challenging tasks such as image classification by learning the complex features and correlations from large-scale data. This breakthrough has inspired researchers to explore deep learning techniques on traffic prediction problems. However, existing methods on traffic prediction have only considered spatial relation (e.g., using CNN) or temporal relation (e.g., using LSTM) independently. We propose a Deep Multi-View Spatial-Temporal Network (DMVST-Net) framework to model both spatial and temporal relations. Specifically, our proposed model consists of three views: temporal view (modeling correlations between future demand values with near time points via LSTM), spatial view (modeling local spatial correlation via local CNN), and semantic view (modeling correlations among regions sharing similar temporal patterns). Experiments on large-scale real taxi demand data demonstrate effectiveness of our approach over state-of-the-art methods.Comment: AAAI 2018 pape

    Shengu'an exerts anti-osteoporotic effect in rats via TGFβ1-Smad2/3 signal pathway, and enhancement of bone and cartilage metabolism

    Get PDF
    Purpose: To study the anti-osteoporotic effect of Shengu'an in rats, and elucidate the mechanism of action involved.Methods: Forty healthy female SPF mice were randomly divided into control group, saline-treated group, TGFβRⅡ receptor inhibitor group, and shengu'an group. The expressions of type Ⅱ collagen (Co1-II) and platelet endothelial cell adhesion factor (CD-31) were determined. The expressions of transforming growth factor β1 (TGF-β1), p-smad2/3, matrix metalloproteinase-9 (MMP-9) and osteoblast specific transcription factor (osterix) were assayed by western blotting.Results: The expression of Co1-II in the vertebral body was significantly lower in model mice than in control mice, but was significantly higher in shengu'an mice when compared with model mice (p < 0.05). In shengu'an mice, CoI-I was markedly upregulated, relative to model mice, and the expressions of CD31 in TGFβRⅡreceptor inhibitor group and shengu'an group were lower than in model group (p < 0.05). There were significantly lower expressions of TGF-β1 and p-smad2/3 in the vertebral body of shengu'an group than in model mice, but osterix was upregulated relative to model mice (p < 0.05).Conclusion: Shengu'an exerts anti-osteoporotic effect by downregulating TGFβ/smad signal pathway. There is thus a potential for its clinical application in the management of osteoporosis. Keywords: Shengu'an, TGFβ1-Smad2/3 signal, Bone cartilage metabolism, Osteoporosi

    Seismic Foundation Model (SFM): a new generation deep learning model in geophysics

    Full text link
    While computer science has seen remarkable advancements in foundation models, which remain underexplored in geoscience. Addressing this gap, we introduce a workflow to develop geophysical foundation models, including data preparation, model pre-training, and adaption to downstream tasks. From 192 globally collected 3-D seismic volumes, we create a carefully curated dataset of 2,286,422 2-D seismic images. Fully using these unlabeled images, we employ the self-supervised learning to pre-train a Transformer-based Seismic Foundation Model (SFM) for producing all-purpose seismic features that work across various tasks and surveys. Through experiments on seismic facies classification, geobody identification, interpolation, denoising, and inversion, our pre-trained model demonstrates versatility, generalization, scalability, and superior performance over baseline models. Conclusively, we provide a foundation model and vast dataset to advance AI in geophysics, addressing challenges (poor generalization, lacking labels, and repetitive training for task-specified models) of applying AI in geophysics and paving the way for future innovations in geoscience.Comment: 27 pages, 9 figures, and 4 table

    U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation

    Full text link
    Concept personalization methods enable large text-to-image models to learn specific subjects (e.g., objects/poses/3D models) and synthesize renditions in new contexts. Given that the image references are highly biased towards visual attributes, state-of-the-art personalization models tend to overfit the whole subject and cannot disentangle visual characteristics in pixel space. In this study, we proposed a more challenging setting, namely fine-grained visual appearance personalization. Different from existing methods, we allow users to provide a sentence describing the desired attributes. A novel decoupled self-augmentation strategy is proposed to generate target-related and non-target samples to learn user-specified visual attributes. These augmented data allow for refining the model's understanding of the target attribute while mitigating the impact of unrelated attributes. At the inference stage, adjustments are conducted on semantic space through the learned target and non-target embeddings to further enhance the disentanglement of target attributes. Extensive experiments on various kinds of visual attributes with SOTA personalization methods show the ability of the proposed method to mimic target visual appearance in novel contexts, thus improving the controllability and flexibility of personalization.Comment: 14 pages, 13 figures, 2 table

    Cognition Impairment Prior to Errors of Working Memory Based on Event-Related Potential

    Get PDF
    Cognitive impairment contributes to errors in different tasks. Poor attention and poor cognitive control are the two neural mechanisms for performance errors. A few studies have been conducted on the error mechanism of working memory. It is unclear whether the changes in memory updating, attention, and cognitive control can cause errors and, if so, whether they can be probed at the same time in one single task. Therefore, this study analyzed event-related potentials in a two-back working memory task. A total of 40 male participants finished the task. The differences between the error and the correct trials in amplitudes and latencies of N1, P2, N2, and P3 were analyzed. The P2 and P3 amplitudes decreased significantly in the error trials, while the N2 amplitude increased. The results showed that impaired attention, poor memory updating, and impaired cognitive control were consistently associated with the error in working memory. Furthermore, the results suggested that monitoring the neurophysiological characteristics associated with attention and cognitive control was important for studying the error mechanism and error prediction. The results also suggested that the P3 and N2 amplitudes could be used as indexes for error foreshadowing
    • …
    corecore