Search CORE

91 research outputs found

Pix2Vox: Context-aware 3D Reconstruction from Single and Multi-view Images

Author: Sun Xiaoshuai
Xie Haozhe
Yao Hongxun
Zhang Shengping
Zhou Shangchen
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 28/07/2019
Field of study

Recovering the 3D representation of an object from single-view or multi-view RGB images by deep neural networks has attracted increasing attention in the past few years. Several mainstream works (e.g., 3D-R2N2) use recurrent neural networks (RNNs) to fuse multiple feature maps extracted from input images sequentially. However, when given the same set of input images with different orders, RNN-based approaches are unable to produce consistent reconstruction results. Moreover, due to long-term memory loss, RNNs cannot fully exploit input images to refine reconstruction results. To solve these problems, we propose a novel framework for single-view and multi-view 3D reconstruction, named Pix2Vox. By using a well-designed encoder-decoder, it generates a coarse 3D volume from each input image. Then, a context-aware fusion module is introduced to adaptively select high-quality reconstructions for each part (e.g., table legs) from different coarse 3D volumes to obtain a fused 3D volume. Finally, a refiner further refines the fused 3D volume to generate the final output. Experimental results on the ShapeNet and Pix3D benchmarks indicate that the proposed Pix2Vox outperforms state-of-the-arts by a large margin. Furthermore, the proposed method is 24 times faster than 3D-R2N2 in terms of backward inference time. The experiments on ShapeNet unseen 3D categories have shown the superior generalization abilities of our method.Comment: ICCV 201

arXiv.org e-Print Archive

Crossref

Influence and Role of Social Practice on the Development of Comprehensive Quality of University Students

Author: HE Guoqiang
LI Zheng
SUN Haozhe
WANG Zewen
Publication venue: Scholink Co., LTD
Publication date: 19/04/2024
Field of study

With the development of society and the popularization of higher education, the cultivation of comprehensive quality of college students has become the focus of attention of the education sector and the community. This paper researches and discusses the influence and role of social practice on the development of comprehensive quality of college students. Firstly, it discusses the importance of social practice activities in enhancing the comprehensive quality of college students, including the role of cultivating practical ability, enhancing the sense of social responsibility and teamwork spirit. Secondly, it analyzes the influence of social practice on the cognitive level, emotional attitude and values of college students, and points out that social practice can promote the overall development of college students. Finally, the problems and challenges of the current social practice activities are discussed, and corresponding countermeasures and suggestions are put forward to further play the role of social practice in the comprehensive quality cultivation of college students, and to provide better support and guarantee for the growth and development of college students. (Bai, 2020

Scholink Journals

Towards General Visual-Linguistic Face Forgery Detection

Author: Chen Shen
Ding Shouhong
Ji Rongrong
Sun Ke
Sun Xiaoshuai
Yang Haozhe
Yao Taiping
Publication venue
Publication date: 07/02/2024
Field of study

Deepfakes are realistic face manipulations that can pose serious threats to security, privacy, and trust. Existing methods mostly treat this task as binary classification, which uses digital labels or mask signals to train the detection model. We argue that such supervisions lack semantic information and interpretability. To address this issues, in this paper, we propose a novel paradigm named Visual-Linguistic Face Forgery Detection(VLFFD), which uses fine-grained sentence-level prompts as the annotation. Since text annotations are not available in current deepfakes datasets, VLFFD first generates the mixed forgery image with corresponding fine-grained prompts via Prompt Forgery Image Generator (PFIG). Then, the fine-grained mixed data and coarse-grained original data and is jointly trained with the Coarse-and-Fine Co-training framework (C2F), enabling the model to gain more generalization and interpretability. The experiments show the proposed method improves the existing detection models on several challenging benchmarks. Furthermore, we have integrated our method with multimodal large models, achieving noteworthy results that demonstrate the potential of our approach. This integration not only enhances the performance of our VLFFD paradigm but also underscores the versatility and adaptability of our method when combined with advanced multimodal technologies, highlighting its potential in tackling the evolving challenges of deepfake detection

arXiv.org e-Print Archive

Analysis of index gases of coal spontaneous combustion using fourier transform infrared spectrometer

Author: Haizhu Luo
Haozhe Dong
Xiaojun Tang
Yong Sun
Yuntao Liang
Publication venue
Publication date: 01/01/2014
Field of study

Analysis of the index gases of coal for the prevention of spontaneous combustion is of great importance for the enhancement of coal mine safety. In this work, Fourier Transform Infrared Spectrometer (FTIRS) is presented to be used to analyze the index gases of coal in real time to monitor spontaneous combustion conditions. Both the instrument parameters and the analysis method are introduced at first by combining characteristics of the absorption spectra of the target analyte with the analysis requirements. Next, more than ten sets of the gas mixture containing ten components (CH 4 , C 2 H 6 , C 3 H 8 , iso-C 4 H 10 , n-C 4 H 10 , C 2 H 4 , C 3 H 6 , C 2 H 2 , CO, and CO 2 ) are included and analyzed with a Spectrum Two FTIRS made by Perkin Elmer. The testing results show that the detection limit of most analytes is less than 2 × 10 −6 . All the detection limits meet the monitoring requirements of coal spontaneous combustion in China, which means that FTIRS may be an ideal instrument and the analysis method used in this paper is sufficient for spontaneous combustion gas monitoring on-line and even in situ, since FTIRS has many advantages such as fast analysis, being maintenance-free, and good safety

CiteSeerX

Directory of Open Access Journals

Gait parameter fitting and adaptive enhancement based on cerebral blood oxygen information

Author: Chunguang Li
Haozhe Ma
Lining Sun
Yaoxing Peng
Yufei Zhu
Publication venue: Frontiers Media S.A.
Publication date: 01/07/2023
Field of study

Accurate recognition of patients’ movement intentions and real-time adjustments are crucial in rehabilitation exoskeleton robots. However, some patients are unable to utilize electromyography (EMG) signals for this purpose due to poor or missing signals in their lower limbs. In order to address this issue, we propose a novel method that fits gait parameters using cerebral blood oxygen signals. Two types of walking experiments were conducted to collect brain blood oxygen signals and gait parameters from volunteers. Time domain, frequency domain, and spatial domain features were extracted from brain hemoglobin. The AutoEncoder-Decoder method is used for feature dimension reduction. A regression model based on the long short-term memory (LSTM) model was established to fit the gait parameters and perform incremental learning for new individual data. Cross-validation was performed on the model to enhance individual adaptivity and reduce the need for individual pre-training. The coefficient of determination (R2) for the gait parameter fit was 71.544%, with a mean square error (RMSE) of less than 3.321%. Following adaptive enhancement, the coefficient of R2 increased by 6.985%, while the RMSE decreased by 0.303%. These preliminary results indicate the feasibility of fitting gait parameters using cerebral blood oxygen information. Our research offers a new perspective on assisted locomotion control for patients who lack effective myoelectricity, thereby expanding the clinical application of rehabilitation exoskeleton robots. This work establishes a foundation for promoting the application of Brain-Computer Interface (BCI) technology in the field of sports rehabilitation

Directory of Open Access Journals

A Novel Estimation Method for Temperature of Magnetic Nanoparticles Dominated by Brownian Relaxation Based on Magnetic Particle Spectroscopy

Author: Du Zhongzhou
Han Shijie
Hua Zhanpeng
Liu Wenzhong
Sun Yi
Wang Haozhe
Wu Wenjie
Ye Na
Yoshida Takashi
Yu Longtu
Zhang Haochen
Zhao Gaoli
Publication venue
Publication date: 31/12/2023
Field of study

This paper presents a novel method for estimating the temperature of magnetic nanoparticles (MNPs) based on AC magnetization harmonics of MNPs dominated by Brownian relaxation. The difference in the AC magnetization response and magnetization harmonic between the Fokker-Planck equation and the Langevin function was analyzed, and we studied the relationship between the magnetization harmonic and the key factors, such as Brownian relaxation time, temperature, magnetic field strength, core size and hydrodynamic size of MNPs, excitation frequency, and so on. We proposed a compensation function for AC magnetization harmonic with consideration of the key factors and the difference between the Fokker-Planck equation and the Langevin function. Then a temperature estimation model based on the compensation function and the Langevin function was established. By employing the least squares algorithm, the temperature was successfully calculated. The experimental results show that the temperature error is less than 0.035 K in the temperature range from 310 K to 320 K. The temperature estimation model is expected to improve the performance of the magnetic nanoparticle thermometer and be applied to magnetic nanoparticle-mediated hyperthermia

arXiv.org e-Print Archive

RSQ: a statistical method for quantification of isoform-specific structurome using transcriptome-wide structural profiling data [preprint]

Author: Bai Boyang
Chen Yiwen
Chen Yong
Dai Qi
Sun Ming
Tu Shikui
University of Texas at Dallas
University of Texas at Dallas
Wang Haozhe
Wang Yunfei
Zhang Michael Q.
Zhu Xiaopeng
Publication venue: eScholarship@UMassChan
Publication date: 18/06/2016
Field of study

The structure of RNA, which is considered to be a second layer of information alongside the genetic code, provides fundamental insights into the cellular function of both coding and non-coding RNAs. Several high-throughput technologies have been developed to profile transcriptome-wide RNA structures, i.e., the structurome. However, it is challenging to interpret the profiling data because the observed data represent an average over different RNA conformations and isoforms with different abundance. To address this challenge, we developed an RNA structurome quantification method (RSQ) to statistically model the distribution of reads over both isoforms and RNA conformations, and thus provide accurate quantification of the isoform-specific structurome. The quantified RNA structurome enables the comparison of isoform-specific conformations between different conditions, the exploration of RNA conformation variation affected by single nucleotide polymorphism (SNP) , and the measurement of RNA accessibility for binding of either small RNAs in RNAi-based assays or RNA binding protein in transcriptional regulation. The model used in our method sheds new light on the potential impact of the RNA structurome on gene regulation

eScholarship@UMMS

Lessons learned from the NeurIPS 2021 MetaDL challenge:Backbone fine-tuning without episodic meta-learning dominates for few-shot learning image classification

Author: Alcobaça Edesio
Carvalho André C. P. L. F.
Chen Hong
El Baz Adrian
Ferreira Fabio
Gouk Henry
Guan Chaoyu
Guyon Isabelle
Hospedales Timothy
Hu Shell
Huisman Mike
Hutter Frank
Liu Zhengying
Mohr Felix
Oztürk Ekrem
Sun Haozhe
Ullah Ihsan
van Rijn Jan N.
Wang Xin
Zhu Wenwu
Publication venue
Publication date: 08/12/2021
Field of study

Although deep neural networks are capable of achieving performance superior to humans on various tasks, they are notorious for requiring large amounts of data and computing resources, restricting their success to domains where such resources are available. Metalearning methods can address this problem by transferring knowledge from related tasks, thus reducing the amount of data and computing resources needed to learn new tasks. We organize the MetaDL competition series, which provide opportunities for research groups all over the world to create and experimentally assess new meta-(deep)learning solutions for real problems. In this paper, authored collaboratively between the competition organizers and the top-ranked participants, we describe the design of the competition, the datasets, the best experimental results, as well as the top-ranked methods in the NeurIPS 2021 challenge, which attracted 15 active teams who made it to the final phase (by outperforming the baseline), making over 100 code submissions during the feedback phase. The solutions of the top participants have been open-sourced. The lessons learned include that learning good representations is essential for effective transfer learnin

Edinburgh Research Explorer