232 research outputs found
Multicast Network Coding and Field Sizes
In an acyclic multicast network, it is well known that a linear network
coding solution over GF() exists when is sufficiently large. In
particular, for each prime power no smaller than the number of receivers, a
linear solution over GF() can be efficiently constructed. In this work, we
reveal that a linear solution over a given finite field does \emph{not}
necessarily imply the existence of a linear solution over all larger finite
fields. Specifically, we prove by construction that: (i) For every source
dimension no smaller than 3, there is a multicast network linearly solvable
over GF(7) but not over GF(8), and another multicast network linearly solvable
over GF(16) but not over GF(17); (ii) There is a multicast network linearly
solvable over GF(5) but not over such GF() that is a Mersenne prime
plus 1, which can be extremely large; (iii) A multicast network linearly
solvable over GF() and over GF() is \emph{not} necessarily
linearly solvable over GF(); (iv) There exists a class of
multicast networks with a set of receivers such that the minimum field size
for a linear solution over GF() is lower bounded by
, but not every larger field than GF() suffices to
yield a linear solution. The insight brought from this work is that not only
the field size, but also the order of subgroups in the multiplicative group of
a finite field affects the linear solvability of a multicast network
Reproducibility Analysis and Enhancements for Multi-Aspect Dense Retriever with Aspect Learning
Multi-aspect dense retrieval aims to incorporate aspect information (e.g.,
brand and category) into dual encoders to facilitate relevance matching. As an
early and representative multi-aspect dense retriever, MADRAL learns several
extra aspect embeddings and fuses the explicit aspects with an implicit aspect
"OTHER" for final representation. MADRAL was evaluated on proprietary data and
its code was not released, making it challenging to validate its effectiveness
on other datasets. We failed to reproduce its effectiveness on the public
MA-Amazon data, motivating us to probe the reasons and re-examine its
components. We propose several component alternatives for comparisons,
including replacing "OTHER" with "CLS" and representing aspects with the first
several content tokens. Through extensive experiments, we confirm that learning
"OTHER" from scratch in aspect fusion is harmful. In contrast, our proposed
variants can greatly enhance the retrieval performance. Our research not only
sheds light on the limitations of MADRAL but also provides valuable insights
for future studies on more powerful multi-aspect dense retrieval models. Code
will be released at:
https://github.com/sunxiaojie99/Reproducibility-for-MADRAL.Comment: accepted by ecir2024 as a reproducibility pape
Feature-Enhanced Network with Hybrid Debiasing Strategies for Unbiased Learning to Rank
Unbiased learning to rank (ULTR) aims to mitigate various biases existing in
user clicks, such as position bias, trust bias, presentation bias, and learn an
effective ranker. In this paper, we introduce our winning approach for the
"Unbiased Learning to Rank" task in WSDM Cup 2023. We find that the provided
data is severely biased so neural models trained directly with the top 10
results with click information are unsatisfactory. So we extract multiple
heuristic-based features for multi-fields of the results, adjust the click
labels, add true negatives, and re-weight the samples during model training.
Since the propensities learned by existing ULTR methods are not decreasing
w.r.t. positions, we also calibrate the propensities according to the click
ratios and ensemble the models trained in two different ways. Our method won
the 3rd prize with a DCG@10 score of 9.80, which is 1.1% worse than the 2nd and
25.3% higher than the 4th.Comment: 5 pages, 1 figure, WSDM Cup 202
An indirect space-vector modulated three-phase AC-DC matrix converter for hybrid electric vehicles
This paper presents a bi-directional AC-DC matrix converter for the power supply systems of hybrid electric vehicles. Compared to conventional PWM rectifier and associated DC-DC converter, large DC bus capacitor banks are eliminated. The converter converts three-phase AC power into the desired DC output with higher energy density via a single power stage. A closed-loop control strategy based on indirect space vector modulation is proposed and the validity has been verified. The strategy ensures that the output voltage can be regulated tightly against different loads and wide input variation and high-quality input current can be achieved
Modeling and Analysis of Train Rear-End Collision Accidents Based on Stochastic Petri Nets
We proposed a model of the train rear-end collision accidents based on stochastic Petri nets (SPN) theory. By isomorphic Markov chain model of the proposed accident model, we provide the quantitative analysis of the train rear-end collision accidents. Fuzzy random method is also applied to analyze the performance of the proposed model. In addition, according to the data extracted from a large amount of historical data of the accident statistics, we present a case analysis and discussion. It showed that the results of the proposed train rear-end accident model based on SPN are reasonable in practical applications and can be used to effectively analyze the accidents and prevent loss, and the results may be a reference to the department of railway safety management
Pre-training with Aspect-Content Text Mutual Prediction for Multi-Aspect Dense Retrieval
Grounded on pre-trained language models (PLMs), dense retrieval has been
studied extensively on plain text. In contrast, there has been little research
on retrieving data with multiple aspects using dense models. In the scenarios
such as product search, the aspect information plays an essential role in
relevance matching, e.g., category: Electronics, Computers, and Pet Supplies. A
common way of leveraging aspect information for multi-aspect retrieval is to
introduce an auxiliary classification objective, i.e., using item contents to
predict the annotated value IDs of item aspects. However, by learning the value
embeddings from scratch, this approach may not capture the various semantic
similarities between the values sufficiently. To address this limitation, we
leverage the aspect information as text strings rather than class IDs during
pre-training so that their semantic similarities can be naturally captured in
the PLMs. To facilitate effective retrieval with the aspect strings, we propose
mutual prediction objectives between the text of the item aspect and content.
In this way, our model makes more sufficient use of aspect information than
conducting undifferentiated masked language modeling (MLM) on the concatenated
text of aspects and content. Extensive experiments on two real-world datasets
(product and mini-program search) show that our approach can outperform
competitive baselines both treating aspect values as classes and conducting the
same MLM for aspect and content strings. Code and related dataset will be
available at the URL \footnote{https://github.com/sunxiaojie99/ATTEMPT}.Comment: accepted by cikm202
- …