Search CORE

132 research outputs found

Real-Time Construction Algorithm of Co-Occurrence Network Based on Inverted Index

Author: Cheng Jiahao
Publication venue
Publication date: 16/08/2023
Field of study

Co-occurrence networks are an important method in the field of natural language processing and text mining for discovering semantic relationships within texts. However, the traditional traversal algorithm for constructing co-occurrence networks has high time complexity and space complexity when dealing with large-scale text data. In this paper, we propose an optimized algorithm based on inverted indexing and breadth-first search to improve the efficiency of co-occurrence network construction and reduce memory consumption. Firstly, the traditional traversal algorithm is analyzed, and its performance issues in constructing co-occurrence networks are identified. Then, the detailed implementation process of the optimized algorithm is presented. Subsequently, the CSL large-scale Chinese scientific literature dataset is used for experimental validation, comparing the performance of the traditional traversal algorithm and the optimized algorithm in terms of running time and memory usage. Finally, using non-parametric test methods, the optimized algorithm is proven to have significantly better performance than the traditional traversal algorithm. The research in this paper provides an effective method for the rapid construction of co-occurrence networks, contributing to the further development of the Information Organization fields.Comment: 10 pages, 8 figure

arXiv.org e-Print Archive

A Study on Satisfaction and Willingness to Continuously Participate in Business Class Virtual Simulation Competitions -- An Empirical Analysis Based on Technology Acceptance Model (TAM)

Author: Jiahao Long
Shuang Cheng
Yejiang Yang
Publication venue: 'Scholink Co, Ltd.'
Publication date: 17/10/2022
Field of study

The combination of virtual simulation technology and innovation and entrepreneurship education can achieve the role of interaction between the environment and the real environment during students’ virtual simulation, which can attract active participation of users and help college students understand the risks and opportunities faced by enterprises in the process of operation and growth, so as to improve the ability of enterprise operation and management, and deepen students’ understanding of theory and practical ability. In this paper, we construct a technology acceptance model (TAM) through seven dimensions: perceived usefulness, perceived ease of use, external environment, teacher guidance, willingness to participate, satisfaction, and willingness to continue to use, and investigate students’ satisfaction and willingness to continue to participate in the virtual simulation competition in some universities. The results of the data analysis show that the satisfaction and willingness to continue to participate in the virtual simulation competition play a good role

Scholink Journals

Multi-Factor Spatio-Temporal Prediction based on Graph Decomposition Learning

Author: Ji Jiahao
Long Cheng
Mou Yu
Wang Jingyuan
Publication venue
Publication date: 07/03/2024
Field of study

Spatio-temporal (ST) prediction is an important and widely used technique in data mining and analytics, especially for ST data in urban systems such as transportation data. In practice, the ST data generation is usually influenced by various latent factors tied to natural phenomena or human socioeconomic activities, impacting specific spatial areas selectively. However, existing ST prediction methods usually do not refine the impacts of different factors, but directly model the entangled impacts of multiple factors. This amplifies the modeling complexity of ST data and compromises model interpretability. To this end, we propose a multi-factor ST prediction task that predicts partial ST data evolution under different factors, and combines them for a final prediction. We make two contributions to this task: an effective theoretical solution and a portable instantiation framework. Specifically, we first propose a theoretical solution called decomposed prediction strategy and prove its effectiveness from the perspective of information entropy theory. On top of that, we instantiate a novel model-agnostic framework, named spatio-temporal graph decomposition learning (STGDL), for multi-factor ST prediction. The framework consists of two main components: an automatic graph decomposition module that decomposes the original graph structure inherent in ST data into subgraphs corresponding to different factors, and a decomposed learning network that learns the partial ST data on each subgraph separately and integrates them for the final prediction. We conduct extensive experiments on four real-world ST datasets of two types of graphs, i.e., grid graph and network graph. Results show that our framework significantly reduces prediction errors of various ST models by 9.41% on average (35.36% at most). Furthermore, a case study reveals the interpretability potential of our framework

arXiv.org e-Print Archive

On the bubble trapped underneath a droplet impacting a moving hydrophilic surface: From perfect slip to no slip

Author: Cheng Jiahao
Floryan J. M.
Hao Jiguang
Li Yalei
Publication venue
Publication date: 13/02/2023
Field of study

The bubble trapped underneath a droplet impacting a moving hydrophilic surface was investigated using high-speed photography. The bubble diameter was found to depend weakly on the surface speed Vs, but strongly on the Weber number We. The bubble and the surrounding liquid slip on the surface while accelerating to Vs, with the slip velocity gradually decreasing to zero, demonstrating that the no-slip boundary condition does not apply during the acceleration period. The terminal slip distance, identifying the maximum distance between the bubble and the impact point, increases with an increase of Vs and weakly depends on We. Its observed length was up to 1.39 mm. An acceleration extracted from the experiments quantifies the slip and provides a simple tool for predicting the terminal slip distance

arXiv.org e-Print Archive

Anomalous Nernst effect in compensated ferrimagnetic CoxGd1-x films

Author: Cai Li
Cheng Yang
Jiang Wanjun
Liu Jiahao
Liu Ruihao
Xu Teng
Publication venue
Publication date: 28/11/2022
Field of study

The anomalous Nernst effect (ANE) is one of the most intriguing thermoelectric phenomena which has attracted growing interest both for its underlying physics and potential applications. Typically, a large ANE response is observed in magnets with pronounced magnetizations or nontrivial Berry curvature. Here, we report a significant ANE signal in compensated ferrimagnetic CoxGd1-x alloy films, which exhibit vanishingly small magnetization. In particular, we found that the polarity of ANE signal is dominated by the magnetization orientation of the transition metal Co sublattices, rather than the net magnetization of CoxGd1-x films. This observation is not expected from the conventional understanding of ANE but is analogous to the anomalous Hall effect in compensated ferrimagnets. We attribute the origin of ANE and its Co-dominant property to the Co-dominant Berry curvature. Our work could trigger a more comprehensive understanding of ANE and may be useful for building energy-harvesting devices by employing ANE in compensated ferrimagnets

arXiv.org e-Print Archive

AirPhyNet: Harnessing Physics-Guided Neural Networks for Air Quality Prediction

Author: Cong Gao
Hettige Kethmi Hirushini
Ji Jiahao
Long Cheng
Wang Jingyuan
Xiang Shili
Publication venue
Publication date: 06/02/2024
Field of study

Air quality prediction and modelling plays a pivotal role in public health and environment management, for individuals and authorities to make informed decisions. Although traditional data-driven models have shown promise in this domain, their long-term prediction accuracy can be limited, especially in scenarios with sparse or incomplete data and they often rely on black-box deep learning structures that lack solid physical foundation leading to reduced transparency and interpretability in predictions. To address these limitations, this paper presents a novel approach named Physics guided Neural Network for Air Quality Prediction (AirPhyNet). Specifically, we leverage two well-established physics principles of air particle movement (diffusion and advection) by representing them as differential equation networks. Then, we utilize a graph structure to integrate physics knowledge into a neural network architecture and exploit latent representations to capture spatio-temporal relationships within the air quality data. Experiments on two real-world benchmark datasets demonstrate that AirPhyNet outperforms state-of-the-art models for different testing scenarios including different lead time (24h, 48h, 72h), sparse data and sudden change prediction, achieving reduction in prediction errors up to 10%. Moreover, a case study further validates that our model captures underlying physical processes of particle movement and generates accurate predictions with real physical meaning.Comment: Accepted by the 12th International Conference on Learning Representations (ICLR 2024

arXiv.org e-Print Archive

Characterizing User Behaviors in Open-Source Software User Forums: An Empirical Study

Author: Chen Jiahao
Cheng Jinghui
Guo Jin L. C.
Hellman Jazlyn
Uddin Md. Sami
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2022
Field of study

User forums of Open Source Software (OSS) enable end-users to collaboratively discuss problems concerning the OSS applications. Despite decades of research on OSS, we know very little about how end-users engage with OSS communities on these forums, in particular, the challenges that hinder their continuous and meaningful participation in the OSS community. Many previous works are developer-centric and overlook the importance of end-user forums. As a result, end-users' expectations are seldom reflected in OSS development. To better understand user behaviors in OSS user forums, we carried out an empirical study analyzing about 1.3 million posts from user forums of four popular OSS applications: Zotero, Audacity, VLC, and RStudio. Through analyzing the contribution patterns of three common user types (end-users, developers, and organizers), we observed that end-users not only initiated most of the threads (above 96% of threads in three projects, 86% in the other), but also acted as the significant contributors for responding to other users' posts, even though they tended to lack confidence in their activities as indicated by psycho-linguistic analyses. Moreover, we found end-users more open, reflecting a more positive emotion in communication than organizers and developers in the forums. Our work contributes new knowledge about end-users' activities and behaviors in OSS user forums that the vital OSS stakeholders can leverage to improve end-user engagement in the OSS development process.Comment: 15th International Conference on Cooperative and Human Aspects of Softare Engineerin

arXiv.org e-Print Archive

PolyPublie