
22 research outputs found

Discovering items with potential popularity on social media

Author: Abbas Khushnood
Mingsheng Shang
Xin Luo
Publication date: 05/04/2016

Predicting the future popularity of online content is highly important in many applications. Preferential attachment is encountered in scale-free networks. Under its influence, popular items get more popular, resulting in a long-tailed distribution problem. Consequently, new items that could become popular (potential ones) are suppressed by the already popular items. This paper proposes a novel model that is able to identify potential items. It identifies potentially popular items by considering the number of links or ratings an item has received in the recent past along with its popularity decay. To obtain an efficient model, we consider only temporal features of the content, avoiding the cost of extracting other features. We have found that people follow the recent behaviours of their peers. In the presence of fit or quality items, already popular items lose their popularity. Prediction accuracy is measured on three industrial datasets, namely Movielens, Netflix and Facebook wall posts. Experimental results show that, compared to the state-of-the-art model, our model has better prediction accuracy. Comment: 7 pages in ACM style, 7 figures and 1 table.

arXiv.org e-Print Archive

Crossref

Variability of Contact Process in Complex Networks

Author: Anderson R. M.
Bailey N. T. J.
Dailey D. J.
Diekmann O.
Erdős P.
Hui Yang
Kai Gong
Ming Tang
Mingsheng Shang
Publication venue: 'AIP Publishing'
Publication date: 03/04/2012

We study numerically how the structures of distinct networks influence the epidemic dynamics of the contact process. We first find that the variability difference between homogeneous and heterogeneous networks is very narrow, although heterogeneous structures can induce a lighter prevalence. In contrast to non-community networks, strong community structures can cause a secondary outbreak of prevalence, and two peaks of variability appear. In particular, within the local community, the extraordinarily large variability in the early stage of the outbreak makes the prediction of epidemic spreading hard. Importantly, the bridgeness plays a significant role in the predictability: the farther the initial seed is from the bridgeness, the less accurate the prediction is. We also investigate the effect of different disease reaction mechanisms on variability, and find that different reaction mechanisms result in distinct variabilities at the end of epidemic spreading. Comment: 6 pages, 4 figures.

arXiv.org e-Print Archive

Crossref


Application of Bayesian network including Microcystis morphospecies for microcystin risk assessment in three cyanobacterial bloom-plagued lakes, China

Author: Li Lin
Shan Kun
Shang Mingsheng
Song Lirong
Wang Xiaoxiao
Yang Hong
Zhou Botian
Publication venue: 'Elsevier BV'
Publication date: 01/03/2019

Microcystis spp., which occur as colonies of different sizes under natural conditions, have expanded in temperate and tropical freshwater ecosystems and caused serious environmental and ecological problems. In the current study, a Bayesian network (BN) framework was developed to assess the probability of microcystins (MCs) risk in large shallow eutrophic lakes in China, namely, Taihu Lake, Chaohu Lake, and Dianchi Lake. Using a knowledge-supported approach, physicochemical factors, Microcystis morphospecies, and MCs were integrated into different network structures. The sensitivity analysis illustrated that Microcystis aeruginosa biomass was overall the best predictor of MCs risk, and its high biomass relied on the combined condition that water temperature exceeded 24 °C and total phosphorus was above 0.2 mg/L. Simulated scenarios suggested that the probability of hazardous MCs (≥1.0 μg/L) was higher under the interactive effect of temperature increase and nutrient (nitrogen and phosphorus) imbalance than under warming alone. Likewise, data-driven model development using a naïve Bayes classifier and equal frequency discretization resulted in a substantial technical performance (CCI = 0.83, K = 0.60), but the performance significantly decreased when the model excluded species-specific biomasses from the input variables (CCI = 0.76, K = 0.40). The BN framework provided a useful screening tool to evaluate cyanotoxin in the three studied lakes in China, and it can also be used in other lakes suffering from cyanobacterial blooms dominated by Microcystis.

Central Archive at the University of Reading

Institute of Hydrobiology, Chinese Academy Of Sciences


Use statistical machine learning to detect nutrient thresholds in Microcystis blooms and microcystin management

Author: Shan Kun
Shang Mingsheng
Song Lirong
Wang Xiaoxiao
Yang Hong
Zhou Botian
Publication venue: 'Elsevier BV'
Publication date: 01/04/2020

The frequency of toxin-producing cyanobacterial blooms has increased in recent decades due to nutrient enrichment and climate change. Because Microcystis blooms are related to different environmental conditions, identifying potential nutrient control targets can help water quality managers reduce the likelihood of microcystins (MCs) risk. However, complex biotic interactions and field data limitations have constrained our understanding of the nutrient-microcystin relationship. This study develops a Bayesian modelling framework with intracellular and extracellular MCs that characterizes the relationships between different environmental and biological factors. The model was fit to an across-lake dataset covering three bloom-plagued lakes in China and used to estimate the putative thresholds of total nitrogen (TN) and total phosphorus (TP). The lake-specific nutrient thresholds were estimated using a Bayesian updating process. Our results suggested dual N and P reduction in controlling cyanotoxin risks. The total Microcystis biomass can be substantially suppressed by achieving the putative threshold of TP (0.10 mg/L) in Lakes Taihu and Chaohu, but a stricter TP target (0.05 mg/L) is needed in Dianchi Lake. To maintain MCs concentrations below 1.0 μg/L, the estimated TN threshold in the three lakes was 1.8 mg/L, but the effect can be counteracted by an increase in temperature. Overall, the present approach provides an efficient way to integrate empirical knowledge into the data-driven model and is helpful for the management of water resources.

Central Archive at the University of Reading

Institute of Hydrobiology, Chinese Academy Of Sciences

Ultrafast and Sensitive Self-Powered Photodetector Featuring Self-Limited Depletion Region and Fully Depleted Channel with van der Waals Contacts

Author: Chen Hongyu
Dai Mingjin
Fu Yong Qing
Ge Chuanyang
Hu PingAn
Hu Yunxia
Li Wen
Long Mingsheng
Shang Huiming
Wang Fakun
Zhai Tianyou
Zhang Jia
Publication venue: 'American Chemical Society (ACS)'
Publication date: 28/07/2020

Self-powered photodetectors with great potential for implanted medical diagnosis and smart communications have been severely hindered by the difficulty of simultaneously achieving high sensitivity and fast response speed. Here, we report an ultrafast and highly sensitive self-powered photodetector based on two-dimensional (2D) InSe, which is achieved by applying a device architecture design and generating ideal Schottky or ohmic contacts on 2D layered semiconductors, which are difficult to realize in conventional semiconductors owing to their surface Fermi-level pinning. The as-fabricated InSe photodiode features a maximal lateral self-limited depletion region and a vertical fully depleted channel. It exhibits a high detectivity of 1.26 × 10^13 Jones and an ultrafast response speed of ∼200 ns, which breaks the response speed limit of reported self-powered photodetectors based on 2D semiconductors. The high sensitivity is achieved by an ultralow dark current noise generated from the robust van der Waals (vdW) Schottky junction and a high photoresponsivity due to the formation of a maximal lateral self-limited depletion region. The ultrafast response time is dominated by the fast carrier drift driven by a strong built-in electric field in the vertical fully depleted channel. This device architecture can help us to design high-performance photodetectors utilizing vdW layered semiconductors.

Northumbria Research Link

An efficient annealing-assisted differential evolution for multi-parameter adaptive latent factor analysis

Author: LI Qing
PANG Guansong
SHANG Mingsheng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/07/2022

A high-dimensional and incomplete (HDI) matrix is a typical representation of big data. However, advanced HDI data analysis models tend to have many extra parameters. Manual tuning of these parameters, generally relying on empirical knowledge, unavoidably leads to additional overhead. Although variable adaptive mechanisms have been proposed, they cannot balance exploration and exploitation with early convergence. Moreover, learning such multi-parameters brings high computational time, thereby suffering gross accuracy especially when solving a bilinear problem like conducting the commonly used latent factor analysis (LFA) on an HDI matrix. Herein, an efficient annealing-assisted differential evolution for multi-parameter adaptive latent factor analysis (ADMA) is proposed to address these problems. First, a periodic equilibrium mechanism is employed using the physical mechanism of annealing, which is embedded in the mutation operation of differential evolution (DE). Then, to further improve its efficiency, we adopt a probabilistic evaluation mechanism consistent with the crossover probability of DE. Experimental results of both adaptive and non-adaptive state-of-the-art methods on industrial HDI datasets illustrate that ADMA achieves a desirable global optimum with reasonable overhead and prevails over competing methods in terms of predicting the missing data in HDI matrices.

Institutional Knowledge at Singapore Management University

Directory of Open Access Journals

Joint hyperbolic and Euclidean geometry contrastive graph neural networks

Author: PANG Guansong
SHANG Mingsheng
WU Di
XU Xiaoyu
Publication venue: 'Elsevier BV'
Publication date: 01/09/2022

Institutional Knowledge at Singapore Management University

The Stock Market Model with Delayed Information Impact from a Socioeconomic View

Author: Guiyuan Shi
Mingsheng Shang
Yuxia Zhang
Zhiting Wang
Publication venue: 'MDPI AG'
Publication date: 01/07/2021

Finding the critical factor and possible “Newton’s laws” in financial markets has been an important issue. However, with the development of information and communication technologies, financial models are becoming more realistic but also more complex, contradicting the objective law “Greatest truths are the simplest.” Therefore, this paper presents an evolutionary model independent of micro features and attempts to discover the most critical factor. In the model, information is the only critical factor, and stock price emerges from collective behavior. The statistical properties of the model are significantly similar to those of the real market. The model also explains the correlations of stocks within an industry, which provides a new idea for studying critical factors and core structures in the financial markets.

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals

PubMed Central

Revealing Physiochemical Factors and Zooplankton Influencing Microcystis Bloom Toxicity in a Large-Shallow Lake Using Bayesian Machine Learning

Author: Kun Shan
Lan Wang
Lirong Song
Mingsheng Shang
Xiaoxiao Wang
Publication venue: MDPI AG
Publication date: 01/08/2022

Toxic cyanobacterial blooms have become a severe global hazard to human and environmental health. Most studies have focused on the relationships between cyanobacterial composition and cyanotoxin production. Yet, little is known about the environmental conditions influencing the hazard of cyanotoxins. Here, we analysed a unique 22-site dataset comprising monthly observations of water quality, cyanobacterial genera, zooplankton assemblages, and microcystins (MCs) quota and concentrations in a large-shallow lake. Missing values of MCs were imputed using a non-negative latent factor (NLF) analysis, and the results achieved a promising accuracy. Furthermore, we used the Bayesian additive regression tree (BART) to quantify how Microcystis bloom toxicity responds to relevant physicochemical characteristics and zooplankton assemblages. As expected, the BART model achieved better performance in Microcystis biomass and MCs concentration predictions than some comparative models, including random forest and multiple linear regression. The importance analysis via BART illustrated that the shade index was overall the best predictor of MCs concentrations, implying the predominant effects of light limitations on the MCs content of Microcystis. Variables of greatest significance to the toxicity of Microcystis also included pH and dissolved inorganic nitrogen. However, total phosphorus was found to be a strong predictor of the biomass of total Microcystis and toxic M. aeruginosa. Together with the partial dependence plot, the results revealed positive correlations between protozoa and Microcystis biomass. In contrast, copepod biomass may regulate the MC quota and concentrations. Overall, our observations highlight the need for machine-learning strategies to represent nonlinear relationships between harmful algal blooms and environmental covariates.

Directory of Open Access Journals

PubMed Central