Search CORE

608 research outputs found

An Expressive Deep Model for Human Action Parsing from A Single Image

Author: Huang Rui
Liang Zhujin
Lin Liang
Wang Xiaolong
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 02/02/2015
Field of study

This paper aims at one newly raising task in vision and multimedia research: recognizing human actions from still images. Its main challenges lie in the large variations in human poses and appearances, as well as the lack of temporal motion information. Addressing these problems, we propose to develop an expressive deep model to naturally integrate human layout and surrounding contexts for higher level action understanding from still images. In particular, a Deep Belief Net is trained to fuse information from different noisy sources such as body part detection and object detection. To bridge the semantic gap, we used manually labeled data to greatly improve the effectiveness and efficiency of the pre-training and fine-tuning stages of the DBN training. The resulting framework is shown to be robust to sometimes unreliable inputs (e.g., imprecise detections of human parts and objects), and outperforms the state-of-the-art approaches.Comment: 6 pages, 8 figures, ICME 201

arXiv.org e-Print Archive

Crossref

3D Human Activity Recognition with Reconfigurable Convolutional Neural Networks

Author: Lin Liang
Wang Keze
Wang Meng
Wang Xiaolong
Zuo Wangmeng
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/02/2015
Field of study

Human activity understanding with 3D/depth sensors has received increasing attention in multimedia processing and interactions. This work targets on developing a novel deep model for automatic activity recognition from RGB-D videos. We represent each human activity as an ensemble of cubic-like video segments, and learn to discover the temporal structures for a category of activities, i.e. how the activities to be decomposed in terms of classification. Our model can be regarded as a structured deep architecture, as it extends the convolutional neural networks (CNNs) by incorporating structure alternatives. Specifically, we build the network consisting of 3D convolutions and max-pooling operators over the video segments, and introduce the latent variables in each convolutional layer manipulating the activation of neurons. Our model thus advances existing approaches in two aspects: (i) it acts directly on the raw inputs (grayscale-depth data) to conduct recognition instead of relying on hand-crafted features, and (ii) the model structure can be dynamically adjusted accounting for the temporal variations of human activities, i.e. the network configuration is allowed to be partially activated during inference. For model training, we propose an EM-type optimization method that iteratively (i) discovers the latent structure by determining the decomposed actions for each training example, and (ii) learns the network parameters by using the back-propagation algorithm. Our approach is validated in challenging scenarios, and outperforms state-of-the-art methods. A large human activity database of RGB-D videos is presented in addition.Comment: This manuscript has 10 pages with 9 figures, and a preliminary version was published in ACM MM'14 conferenc

arXiv.org e-Print Archive

Crossref

Recommended from our members

Interest rate regulation, earnings transparency and capital structure: evidence from China

Author: Cai Huifen
Han Liang
Li Xiaolong
Tian Lin
Publication venue: 'Emerald'
Publication date: 01/01/2019
Field of study

We use samples from Chinese listed companies to investigate the effects of interest rate deregulation and earnings transparency on company’s capital structure in China over the period of 2003-2015. In particular, we study the link between state-owned enterprises (SOEs), economic growth targets, and marketization in China's unique institutional context. The results show earnings transparency increases firm leverage and the additional tests suggest that such an effect takes place via a mechanism by reducing the cost of debt finance. However, information transparency could moderate the effects of interest rate deregulation on corporate capital structure. In addition, SOEs are less sensitive towards the changes of interest rates in China because lending to SOEs is policy-oriented and lacks of market evaluation of business risk. Government control is conducive to enhancing the transparency of the whole industry, however, market-oriented reform is conducive to enhancing the transparency of the company's own information. The results are robust to endogeneity tests and a variety of variable and model specifications. Lastly, we find that information transparency has little impact on equity financing because of IPO and SEO strictly controlled by the Chinese government. The paper makes contribution to the relationship between earnings disclosure quality and capital structure in the Chinese unique institutional context, such as taking the progressive interest rate reform, SOES, different economic growth target and different marketization level in each province of China. We suggest that investors will pay more attention to the company's own unique information transparency in the provinces with high degree of marketization. As a potential direction for future research, we will investigate how the earnings transparency has impact on capital structure, and how such impact would depend on the transparency of specific business, the cap of foreign shareholding and the convenience of investment

Central Archive at the University of Reading

Middlesex University Research Repository