132 research outputs found
Real-Time Construction Algorithm of Co-Occurrence Network Based on Inverted Index
Co-occurrence networks are an important method in the field of natural
language processing and text mining for discovering semantic relationships
within texts. However, the traditional traversal algorithm for constructing
co-occurrence networks has high time complexity and space complexity when
dealing with large-scale text data. In this paper, we propose an optimized
algorithm based on inverted indexing and breadth-first search to improve the
efficiency of co-occurrence network construction and reduce memory consumption.
Firstly, the traditional traversal algorithm is analyzed, and its performance
issues in constructing co-occurrence networks are identified. Then, the
detailed implementation process of the optimized algorithm is presented.
Subsequently, the CSL large-scale Chinese scientific literature dataset is used
for experimental validation, comparing the performance of the traditional
traversal algorithm and the optimized algorithm in terms of running time and
memory usage. Finally, using non-parametric test methods, the optimized
algorithm is proven to have significantly better performance than the
traditional traversal algorithm. The research in this paper provides an
effective method for the rapid construction of co-occurrence networks,
contributing to the further development of the Information Organization fields.Comment: 10 pages, 8 figure
A Study on Satisfaction and Willingness to Continuously Participate in Business Class Virtual Simulation Competitions -- An Empirical Analysis Based on Technology Acceptance Model (TAM)
The combination of virtual simulation technology and innovation and entrepreneurship education can achieve the role of interaction between the environment and the real environment during students’ virtual simulation, which can attract active participation of users and help college students understand the risks and opportunities faced by enterprises in the process of operation and growth, so as to improve the ability of enterprise operation and management, and deepen students’ understanding of theory and practical ability. In this paper, we construct a technology acceptance model (TAM) through seven dimensions: perceived usefulness, perceived ease of use, external environment, teacher guidance, willingness to participate, satisfaction, and willingness to continue to use, and investigate students’ satisfaction and willingness to continue to participate in the virtual simulation competition in some universities. The results of the data analysis show that the satisfaction and willingness to continue to participate in the virtual simulation competition play a good role
Multi-Factor Spatio-Temporal Prediction based on Graph Decomposition Learning
Spatio-temporal (ST) prediction is an important and widely used technique in
data mining and analytics, especially for ST data in urban systems such as
transportation data. In practice, the ST data generation is usually influenced
by various latent factors tied to natural phenomena or human socioeconomic
activities, impacting specific spatial areas selectively. However, existing ST
prediction methods usually do not refine the impacts of different factors, but
directly model the entangled impacts of multiple factors. This amplifies the
modeling complexity of ST data and compromises model interpretability. To this
end, we propose a multi-factor ST prediction task that predicts partial ST data
evolution under different factors, and combines them for a final prediction. We
make two contributions to this task: an effective theoretical solution and a
portable instantiation framework. Specifically, we first propose a theoretical
solution called decomposed prediction strategy and prove its effectiveness from
the perspective of information entropy theory. On top of that, we instantiate a
novel model-agnostic framework, named spatio-temporal graph decomposition
learning (STGDL), for multi-factor ST prediction. The framework consists of two
main components: an automatic graph decomposition module that decomposes the
original graph structure inherent in ST data into subgraphs corresponding to
different factors, and a decomposed learning network that learns the partial ST
data on each subgraph separately and integrates them for the final prediction.
We conduct extensive experiments on four real-world ST datasets of two types of
graphs, i.e., grid graph and network graph. Results show that our framework
significantly reduces prediction errors of various ST models by 9.41% on
average (35.36% at most). Furthermore, a case study reveals the
interpretability potential of our framework
On the bubble trapped underneath a droplet impacting a moving hydrophilic surface: From perfect slip to no slip
The bubble trapped underneath a droplet impacting a moving hydrophilic
surface was investigated using high-speed photography. The bubble diameter was
found to depend weakly on the surface speed Vs, but strongly on the Weber
number We. The bubble and the surrounding liquid slip on the surface while
accelerating to Vs, with the slip velocity gradually decreasing to zero,
demonstrating that the no-slip boundary condition does not apply during the
acceleration period. The terminal slip distance, identifying the maximum
distance between the bubble and the impact point, increases with an increase of
Vs and weakly depends on We. Its observed length was up to 1.39 mm. An
acceleration extracted from the experiments quantifies the slip and provides a
simple tool for predicting the terminal slip distance
Anomalous Nernst effect in compensated ferrimagnetic CoxGd1-x films
The anomalous Nernst effect (ANE) is one of the most intriguing
thermoelectric phenomena which has attracted growing interest both for its
underlying physics and potential applications. Typically, a large ANE response
is observed in magnets with pronounced magnetizations or nontrivial Berry
curvature. Here, we report a significant ANE signal in compensated
ferrimagnetic CoxGd1-x alloy films, which exhibit vanishingly small
magnetization. In particular, we found that the polarity of ANE signal is
dominated by the magnetization orientation of the transition metal Co
sublattices, rather than the net magnetization of CoxGd1-x films. This
observation is not expected from the conventional understanding of ANE but is
analogous to the anomalous Hall effect in compensated ferrimagnets. We
attribute the origin of ANE and its Co-dominant property to the Co-dominant
Berry curvature. Our work could trigger a more comprehensive understanding of
ANE and may be useful for building energy-harvesting devices by employing ANE
in compensated ferrimagnets
AirPhyNet: Harnessing Physics-Guided Neural Networks for Air Quality Prediction
Air quality prediction and modelling plays a pivotal role in public health
and environment management, for individuals and authorities to make informed
decisions. Although traditional data-driven models have shown promise in this
domain, their long-term prediction accuracy can be limited, especially in
scenarios with sparse or incomplete data and they often rely on black-box deep
learning structures that lack solid physical foundation leading to reduced
transparency and interpretability in predictions. To address these limitations,
this paper presents a novel approach named Physics guided Neural Network for
Air Quality Prediction (AirPhyNet). Specifically, we leverage two
well-established physics principles of air particle movement (diffusion and
advection) by representing them as differential equation networks. Then, we
utilize a graph structure to integrate physics knowledge into a neural network
architecture and exploit latent representations to capture spatio-temporal
relationships within the air quality data. Experiments on two real-world
benchmark datasets demonstrate that AirPhyNet outperforms state-of-the-art
models for different testing scenarios including different lead time (24h, 48h,
72h), sparse data and sudden change prediction, achieving reduction in
prediction errors up to 10%. Moreover, a case study further validates that our
model captures underlying physical processes of particle movement and generates
accurate predictions with real physical meaning.Comment: Accepted by the 12th International Conference on Learning
Representations (ICLR 2024
Characterizing User Behaviors in Open-Source Software User Forums: An Empirical Study
User forums of Open Source Software (OSS) enable end-users to collaboratively
discuss problems concerning the OSS applications. Despite decades of research
on OSS, we know very little about how end-users engage with OSS communities on
these forums, in particular, the challenges that hinder their continuous and
meaningful participation in the OSS community. Many previous works are
developer-centric and overlook the importance of end-user forums. As a result,
end-users' expectations are seldom reflected in OSS development. To better
understand user behaviors in OSS user forums, we carried out an empirical study
analyzing about 1.3 million posts from user forums of four popular OSS
applications: Zotero, Audacity, VLC, and RStudio. Through analyzing the
contribution patterns of three common user types (end-users, developers, and
organizers), we observed that end-users not only initiated most of the threads
(above 96% of threads in three projects, 86% in the other), but also acted as
the significant contributors for responding to other users' posts, even though
they tended to lack confidence in their activities as indicated by
psycho-linguistic analyses. Moreover, we found end-users more open, reflecting
a more positive emotion in communication than organizers and developers in the
forums. Our work contributes new knowledge about end-users' activities and
behaviors in OSS user forums that the vital OSS stakeholders can leverage to
improve end-user engagement in the OSS development process.Comment: 15th International Conference on Cooperative and Human Aspects of
Softare Engineerin
- …