162 research outputs found

    Auto-Encoding Scene Graphs for Image Captioning

    Full text link
    We propose Scene Graph Auto-Encoder (SGAE) that incorporates the language inductive bias into the encoder-decoder image captioning framework for more human-like captions. Intuitively, we humans use the inductive bias to compose collocations and contextual inference in discourse. For example, when we see the relation `person on bike', it is natural to replace `on' with `ride' and infer `person riding bike on a road' even the `road' is not evident. Therefore, exploiting such bias as a language prior is expected to help the conventional encoder-decoder models less likely overfit to the dataset bias and focus on reasoning. Specifically, we use the scene graph --- a directed graph (G\mathcal{G}) where an object node is connected by adjective nodes and relationship nodes --- to represent the complex structural layout of both image (I\mathcal{I}) and sentence (S\mathcal{S}). In the textual domain, we use SGAE to learn a dictionary (D\mathcal{D}) that helps to reconstruct sentences in the S→G→D→S\mathcal{S}\rightarrow \mathcal{G} \rightarrow \mathcal{D} \rightarrow \mathcal{S} pipeline, where D\mathcal{D} encodes the desired language prior; in the vision-language domain, we use the shared D\mathcal{D} to guide the encoder-decoder in the I→G→D→S\mathcal{I}\rightarrow \mathcal{G}\rightarrow \mathcal{D} \rightarrow \mathcal{S} pipeline. Thanks to the scene graph representation and shared dictionary, the inductive bias is transferred across domains in principle. We validate the effectiveness of SGAE on the challenging MS-COCO image captioning benchmark, e.g., our SGAE-based single-model achieves a new state-of-the-art 127.8127.8 CIDEr-D on the Karpathy split, and a competitive 125.5125.5 CIDEr-D (c40) on the official server even compared to other ensemble models

    Investigating the Effects of Dimension-Specific Sentiments on Product Sales: The Perspective of Sentiment Preferences

    Get PDF
    While the literature has reached a consensus on the awareness effect of online word-of-mouth (eWOM), this paper studies its persuasive effect—specifically, dimension-specific sentiment effects on product sales.We examine the sentiment information in eWOM along different product dimensions and reveal different persuasive effects on consumers’ purchase decisions based on consumers’ sentiment preference, which is defined as the relative importance that consumers place on various dimension-specific sentiments. We use an aspect-level sentiment analysis to derive dimension-specific sentiment and PVAR (panel vector auto-regression) models, and estimate their effects on product sales using a movie panel dataset. The findings show that three dimension-specific sentiments (star, genre, and plot) are positively related to movie sales.Regarding consumers’ sentiment preferences, we find a positive relationship to movie sales that is stronger for plot sentiment, relative to star sentiment for low-budget movies. For high-budget movies, we find a positive relationship to movie sales that is stronger for star sentiment, relative to plot or genre sentiment

    A Bayesian Approach for Localization of Acoustic Emission Source in Plate-Like Structures

    Get PDF
    This paper presents a Bayesian approach for localizing acoustic emission (AE) source in plate-like structures with consideration of uncertainties from modeling error and measurement noise. A PZT sensor network is deployed to monitor and acquire AE wave signals released by possible damage. By using continuous wavelet transform (CWT), the time-of-flight (TOF) information of the AE wave signals is extracted and measured. With a theoretical TOF model, a Bayesian parameter identification procedure is developed to obtain the AE source location and the wave velocity at a specific frequency simultaneously and meanwhile quantify their uncertainties. It is based on Bayes’ theorem that the posterior distributions of the parameters about the AE source location and the wave velocity are obtained by relating their priors and the likelihood of the measured time difference data. A Markov chain Monte Carlo (MCMC) algorithm is employed to draw samples to approximate the posteriors. Also, a data fusion scheme is performed to fuse results identified at multiple frequencies to increase accuracy and reduce uncertainty of the final localization results. Experimental studies on a stiffened aluminum panel with simulated AE events by pensile lead breaks (PLBs) are conducted to validate the proposed Bayesian AE source localization approach

    The GrĂŒnwald–Letnikov method for fractional differential equations

    Get PDF
    AbstractThis paper is devoted to the numerical treatment of fractional differential equations. Based on the GrĂŒnwald–Letnikov definition of fractional derivatives, finite difference schemes for the approximation of the solution are discussed. The main properties of these explicit and implicit methods concerning the stability, the convergence and the error behavior are studied related to linear test equations. The asymptotic stability and the absolute stability of these methods are proved. Error representations and estimates for the truncation, propagation and global error are derived. Numerical experiments are given

    MFI2-AS1 enhances the survival of esophageal cancer cell via regulation of miR-331-3p/SOX4

    Get PDF
    Purpose: To investigate the specific role of melanotransferrin antisense RNA (MFI2-AS1) in esophageal cancer (EC) progression. Methods: The differential expression of MFI2-AS1 in EC tissues and cells was determined using quantitative reverse transcription–polymerase chain reaction (qRT-PCR). Silencing MFI2-AS1 was performed by transfection with specific short hairpin RNAs targeting MFI2-AS1. The 3-(4,5- dimethylthiazol-2-yl)-2,5-diphenyltetrazolium bromide assay (MTT) and flow cytometry (FC) were used to assess cell viability and apoptosis of EC cells, respectively. The sponging microRNA (miRNA) of MFI2-AS1 was validated using luciferase activity and RNA immunoprecipitation assays while the downstream target gene of the sponging miRNA was evaluated by luciferase activity assay. Results: MFI2-AS1 was significantly enhanced in EC tissues (p < 0.01) and indicated a poor prognosis in EC patients. Knockdown of MFI2-AS1 in EC cells decreased cell viability and promoted cell apoptosis of EC cells. Functionally, MFI2-AS1 targeted miR-331-3p, and sex-determining region on Ychromosome-related high-mobility-group box4 (SOX4) was identified as a target gene of miR-331-3p. Ectopic expression of SOX4  counteracted the suppressive effect of MFI2-AS1 knockdown on EC cell viability and stimulative effect on EC cell apoptosis. Conclusion: The pro-oncogenic effect of MFI2-AS1 on EC progression occurs via the regulation of the miR-331-3p/SOX4 axis, providing a new potential therapeutic target for EC

    Parse and Recall: Towards Accurate Lung Nodule Malignancy Prediction like Radiologists

    Full text link
    Lung cancer is a leading cause of death worldwide and early screening is critical for improving survival outcomes. In clinical practice, the contextual structure of nodules and the accumulated experience of radiologists are the two core elements related to the accuracy of identification of benign and malignant nodules. Contextual information provides comprehensive information about nodules such as location, shape, and peripheral vessels, and experienced radiologists can search for clues from previous cases as a reference to enrich the basis of decision-making. In this paper, we propose a radiologist-inspired method to simulate the diagnostic process of radiologists, which is composed of context parsing and prototype recalling modules. The context parsing module first segments the context structure of nodules and then aggregates contextual information for a more comprehensive understanding of the nodule. The prototype recalling module utilizes prototype-based learning to condense previously learned cases as prototypes for comparative analysis, which is updated online in a momentum way during training. Building on the two modules, our method leverages both the intrinsic characteristics of the nodules and the external knowledge accumulated from other nodules to achieve a sound diagnosis. To meet the needs of both low-dose and noncontrast screening, we collect a large-scale dataset of 12,852 and 4,029 nodules from low-dose and noncontrast CTs respectively, each with pathology- or follow-up-confirmed labels. Experiments on several datasets demonstrate that our method achieves advanced screening performance on both low-dose and noncontrast scenarios.Comment: MICCAI 202
    • 

    corecore