Search CORE

273 research outputs found

Hierarchicality of Trade Flow Networks Reveals Complexity of Products

Author: Luo Jingfei
Shi Peiteng
Yang Bo
Zhang Jiang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 14/01/2014
Field of study

With globalization, countries are more connected than before by trading flows, which currently amount to at least 36 trillion dollars. Interestingly, approximately 30-60 percent of global exports consist of intermediate products. Therefore, the trade flow network of a particular product with high added values can be regarded as a value chain. The problem is weather we can discriminate between these products based on their unique flow network structure. This paper applies the flow analysis method developed in ecology to 638 trading flow networks of different products. We claim that the allometric scaling exponent

\eta

can be used to characterize the degree of hierarchicality of a flow network, i.e., whether the trading products flow on long hierarchical chains. Then, the flow networks of products with higher added values and complexity, such as machinery&transport equipment with larger exponents, are highlighted. These higher values indicate that their trade flow networks are more hierarchical. As a result, without extra data such as global input-output table, we can identify the product categories with higher complexity and the relative importance of a country in the global value chain solely by the trading network.Comment: 14 pages,7 figure

arXiv.org e-Print Archive

Directory of Open Access Journals

PubMed Central

The Francis Crick Institute

High-dimensional covariance regression with application to co-expression QTL detection

Author: Kim Rakheon
Zhang Jingfei
Publication venue
Publication date: 02/04/2024
Field of study

While covariance matrices have been widely studied in many scientific fields, relatively limited progress has been made on estimating conditional covariances that permits a large covariance matrix to vary with high-dimensional subject-level covariates. In this paper, we present a new sparse multivariate regression framework that models the covariance matrix as a function of subject-level covariates. In the context of co-expression quantitative trait locus (QTL) studies, our method can be used to determine if and how gene co-expressions vary with genetic variations. To accommodate high-dimensional responses and covariates, we stipulate a combined sparsity structure that encourages covariates with non-zero effects and edges that are modulated by these covariates to be simultaneously sparse. We approach parameter estimation with a blockwise coordinate descent algorithm, and investigate the

\ell_2

convergence rate of the estimated parameters. In addition, we propose a computationally efficient debiased inference procedure for uncertainty quantification. The efficacy of the proposed method is demonstrated through numerical experiments and an application to a gene co-expression network study with brain cancer patients

arXiv.org e-Print Archive

Learning to Diversify Web Search Results with a Document Repulsion Model

Author: Li Jingfei
Song Dawei
Wang Benyou
Wu Yue
Zhang Peng
Publication venue: 'Elsevier BV'
Publication date: 01/10/2017
Field of study

Search diversification (also called diversity search), is an important approach to tackling the query ambiguity problem in information retrieval. It aims to diversify the search results that are originally ranked according to their probabilities of relevance to a given query, by re-ranking them to cover as many as possible different aspects (or subtopics) of the query. Most existing diversity search models heuristically balance the relevance ranking and the diversity ranking, yet lacking an efficient learning mechanism to reach an optimized parameter setting. To address this problem, we propose a learning-to-diversify approach which can directly optimize the search diversification performance (in term of any effectiveness metric). We first extend the ranking function of a widely used learning-to-rank framework, i.e., LambdaMART, so that the extended ranking function can correlate relevance and diversity indicators. Furthermore, we develop an effective learning algorithm, namely Document Repulsion Model (DRM), to train the ranking function based on a Document Repulsion Theory (DRT). DRT assumes that two result documents covering similar query aspects (i.e., subtopics) should be mutually repulsive, for the purpose of search diversification. Accordingly, the proposed DRM exerts a repulsion force between each pair of similar documents in the learning process, and includes the diversity effectiveness metric to be optimized as part of the loss function. Although there have been existing learning based diversity search methods, they often involve an iterative sequential selection process in the ranking process, which is computationally complex and time consuming for training, while our proposed learning strategy can largely reduce the time cost. Extensive experiments are conducted on the TREC diversity track data (2009, 2010 and 2011). The results demonstrate that our model significantly outperforms a number of baselines in terms of effectiveness and robustness. Further, an efficiency analysis shows that the proposed DRM has a lower computational complexity than the state of the art learning-to-diversify methods

Crossref

Open Research Online