91 research outputs found

    Pix2Vox: Context-aware 3D Reconstruction from Single and Multi-view Images

    Full text link
    Recovering the 3D representation of an object from single-view or multi-view RGB images by deep neural networks has attracted increasing attention in the past few years. Several mainstream works (e.g., 3D-R2N2) use recurrent neural networks (RNNs) to fuse multiple feature maps extracted from input images sequentially. However, when given the same set of input images with different orders, RNN-based approaches are unable to produce consistent reconstruction results. Moreover, due to long-term memory loss, RNNs cannot fully exploit input images to refine reconstruction results. To solve these problems, we propose a novel framework for single-view and multi-view 3D reconstruction, named Pix2Vox. By using a well-designed encoder-decoder, it generates a coarse 3D volume from each input image. Then, a context-aware fusion module is introduced to adaptively select high-quality reconstructions for each part (e.g., table legs) from different coarse 3D volumes to obtain a fused 3D volume. Finally, a refiner further refines the fused 3D volume to generate the final output. Experimental results on the ShapeNet and Pix3D benchmarks indicate that the proposed Pix2Vox outperforms state-of-the-arts by a large margin. Furthermore, the proposed method is 24 times faster than 3D-R2N2 in terms of backward inference time. The experiments on ShapeNet unseen 3D categories have shown the superior generalization abilities of our method.Comment: ICCV 201

    Influence and Role of Social Practice on the Development of Comprehensive Quality of University Students

    Get PDF
    With the development of society and the popularization of higher education, the cultivation of comprehensive quality of college students has become the focus of attention of the education sector and the community. This paper researches and discusses the influence and role of social practice on the development of comprehensive quality of college students. Firstly, it discusses the importance of social practice activities in enhancing the comprehensive quality of college students, including the role of cultivating practical ability, enhancing the sense of social responsibility and teamwork spirit. Secondly, it analyzes the influence of social practice on the cognitive level, emotional attitude and values of college students, and points out that social practice can promote the overall development of college students. Finally, the problems and challenges of the current social practice activities are discussed, and corresponding countermeasures and suggestions are put forward to further play the role of social practice in the comprehensive quality cultivation of college students, and to provide better support and guarantee for the growth and development of college students. (Bai, 2020

    Towards General Visual-Linguistic Face Forgery Detection

    Full text link
    Deepfakes are realistic face manipulations that can pose serious threats to security, privacy, and trust. Existing methods mostly treat this task as binary classification, which uses digital labels or mask signals to train the detection model. We argue that such supervisions lack semantic information and interpretability. To address this issues, in this paper, we propose a novel paradigm named Visual-Linguistic Face Forgery Detection(VLFFD), which uses fine-grained sentence-level prompts as the annotation. Since text annotations are not available in current deepfakes datasets, VLFFD first generates the mixed forgery image with corresponding fine-grained prompts via Prompt Forgery Image Generator (PFIG). Then, the fine-grained mixed data and coarse-grained original data and is jointly trained with the Coarse-and-Fine Co-training framework (C2F), enabling the model to gain more generalization and interpretability. The experiments show the proposed method improves the existing detection models on several challenging benchmarks. Furthermore, we have integrated our method with multimodal large models, achieving noteworthy results that demonstrate the potential of our approach. This integration not only enhances the performance of our VLFFD paradigm but also underscores the versatility and adaptability of our method when combined with advanced multimodal technologies, highlighting its potential in tackling the evolving challenges of deepfake detection

    Analysis of index gases of coal spontaneous combustion using fourier transform infrared spectrometer

    Get PDF
    Analysis of the index gases of coal for the prevention of spontaneous combustion is of great importance for the enhancement of coal mine safety. In this work, Fourier Transform Infrared Spectrometer (FTIRS) is presented to be used to analyze the index gases of coal in real time to monitor spontaneous combustion conditions. Both the instrument parameters and the analysis method are introduced at first by combining characteristics of the absorption spectra of the target analyte with the analysis requirements. Next, more than ten sets of the gas mixture containing ten components (CH 4 , C 2 H 6 , C 3 H 8 , iso-C 4 H 10 , n-C 4 H 10 , C 2 H 4 , C 3 H 6 , C 2 H 2 , CO, and CO 2 ) are included and analyzed with a Spectrum Two FTIRS made by Perkin Elmer. The testing results show that the detection limit of most analytes is less than 2 × 10 −6 . All the detection limits meet the monitoring requirements of coal spontaneous combustion in China, which means that FTIRS may be an ideal instrument and the analysis method used in this paper is sufficient for spontaneous combustion gas monitoring on-line and even in situ, since FTIRS has many advantages such as fast analysis, being maintenance-free, and good safety

    Gait parameter fitting and adaptive enhancement based on cerebral blood oxygen information

    Get PDF
    Accurate recognition of patients’ movement intentions and real-time adjustments are crucial in rehabilitation exoskeleton robots. However, some patients are unable to utilize electromyography (EMG) signals for this purpose due to poor or missing signals in their lower limbs. In order to address this issue, we propose a novel method that fits gait parameters using cerebral blood oxygen signals. Two types of walking experiments were conducted to collect brain blood oxygen signals and gait parameters from volunteers. Time domain, frequency domain, and spatial domain features were extracted from brain hemoglobin. The AutoEncoder-Decoder method is used for feature dimension reduction. A regression model based on the long short-term memory (LSTM) model was established to fit the gait parameters and perform incremental learning for new individual data. Cross-validation was performed on the model to enhance individual adaptivity and reduce the need for individual pre-training. The coefficient of determination (R2) for the gait parameter fit was 71.544%, with a mean square error (RMSE) of less than 3.321%. Following adaptive enhancement, the coefficient of R2 increased by 6.985%, while the RMSE decreased by 0.303%. These preliminary results indicate the feasibility of fitting gait parameters using cerebral blood oxygen information. Our research offers a new perspective on assisted locomotion control for patients who lack effective myoelectricity, thereby expanding the clinical application of rehabilitation exoskeleton robots. This work establishes a foundation for promoting the application of Brain-Computer Interface (BCI) technology in the field of sports rehabilitation

    A Novel Estimation Method for Temperature of Magnetic Nanoparticles Dominated by Brownian Relaxation Based on Magnetic Particle Spectroscopy

    Full text link
    This paper presents a novel method for estimating the temperature of magnetic nanoparticles (MNPs) based on AC magnetization harmonics of MNPs dominated by Brownian relaxation. The difference in the AC magnetization response and magnetization harmonic between the Fokker-Planck equation and the Langevin function was analyzed, and we studied the relationship between the magnetization harmonic and the key factors, such as Brownian relaxation time, temperature, magnetic field strength, core size and hydrodynamic size of MNPs, excitation frequency, and so on. We proposed a compensation function for AC magnetization harmonic with consideration of the key factors and the difference between the Fokker-Planck equation and the Langevin function. Then a temperature estimation model based on the compensation function and the Langevin function was established. By employing the least squares algorithm, the temperature was successfully calculated. The experimental results show that the temperature error is less than 0.035 K in the temperature range from 310 K to 320 K. The temperature estimation model is expected to improve the performance of the magnetic nanoparticle thermometer and be applied to magnetic nanoparticle-mediated hyperthermia

    RSQ: a statistical method for quantification of isoform-specific structurome using transcriptome-wide structural profiling data [preprint]

    Get PDF
    The structure of RNA, which is considered to be a second layer of information alongside the genetic code, provides fundamental insights into the cellular function of both coding and non-coding RNAs. Several high-throughput technologies have been developed to profile transcriptome-wide RNA structures, i.e., the structurome. However, it is challenging to interpret the profiling data because the observed data represent an average over different RNA conformations and isoforms with different abundance. To address this challenge, we developed an RNA structurome quantification method (RSQ) to statistically model the distribution of reads over both isoforms and RNA conformations, and thus provide accurate quantification of the isoform-specific structurome. The quantified RNA structurome enables the comparison of isoform-specific conformations between different conditions, the exploration of RNA conformation variation affected by single nucleotide polymorphism (SNP) , and the measurement of RNA accessibility for binding of either small RNAs in RNAi-based assays or RNA binding protein in transcriptional regulation. The model used in our method sheds new light on the potential impact of the RNA structurome on gene regulation

    Lessons learned from the NeurIPS 2021 MetaDL challenge:Backbone fine-tuning without episodic meta-learning dominates for few-shot learning image classification

    Get PDF
    Although deep neural networks are capable of achieving performance superior to humans on various tasks, they are notorious for requiring large amounts of data and computing resources, restricting their success to domains where such resources are available. Metalearning methods can address this problem by transferring knowledge from related tasks, thus reducing the amount of data and computing resources needed to learn new tasks. We organize the MetaDL competition series, which provide opportunities for research groups all over the world to create and experimentally assess new meta-(deep)learning solutions for real problems. In this paper, authored collaboratively between the competition organizers and the top-ranked participants, we describe the design of the competition, the datasets, the best experimental results, as well as the top-ranked methods in the NeurIPS 2021 challenge, which attracted 15 active teams who made it to the final phase (by outperforming the baseline), making over 100 code submissions during the feedback phase. The solutions of the top participants have been open-sourced. The lessons learned include that learning good representations is essential for effective transfer learnin
    • …
    corecore