18,159 research outputs found
Sentence alignment of Hungarian-English parallel corpora using a hybrid algorithm
We present an efficient hybrid method for aligning sentences with their translations in a parallel bilingual corpus. The new algorithm is composed of a length-based and anchor matching method that uses Named Entity recognition. This algorithm combines the speed of length-based models with the accuracy of anchor finding methods. The accuracy of finding cognates for Hungarian-English language pair is extremely low, hence we thought of using a novel approach that includes Named Entity recognition. Due to the well selected anchors it was found to outperform the best two sentence alignment algorithms so far published for the Hungarian-English language pair
A GPU-enabled solver for time-constrained linear sum assignment problems
This paper deals with solving large instances of the Linear Sum Assignment Problems (LSAPs) under realtime constraints, using Graphical Processing Units (GPUs). The motivating scenario is an industrial application for P2P live streaming that is moderated by a central tracker that is periodically solving LSAP instances to optimize the connectivity of thousands of peers. However, our findings are generic enough to be applied in other contexts. Our main contribution is a parallel version of a heuristic algorithm called Deep Greedy Switching (DGS) on GPUs using the CUDA programming language. DGS sacrifices absolute optimality in favor of a substantial speedup in comparison to classical LSAP solvers like the Hungarian and auctioning methods. We show the modifications needed to parallelize the DGS algorithm and the performance gains of our approach compared to a sequential CPU-based implementation of DGS and a mixed CPU/GPU-based implementation of it
OS Scheduling Algorithms for Memory Intensive Workloads in Multi-socket Multi-core servers
Major chip manufacturers have all introduced multicore microprocessors.
Multi-socket systems built from these processors are routinely used for running
various server applications. Depending on the application that is run on the
system, remote memory accesses can impact overall performance. This paper
presents a new operating system (OS) scheduling optimization to reduce the
impact of such remote memory accesses. By observing the pattern of local and
remote DRAM accesses for every thread in each scheduling quantum and applying
different algorithms, we come up with a new schedule of threads for the next
quantum. This new schedule potentially cuts down remote DRAM accesses for the
next scheduling quantum and improves overall performance. We present three such
new algorithms of varying complexity followed by an algorithm which is an
adaptation of Hungarian algorithm. We used three different synthetic workloads
to evaluate the algorithm. We also performed sensitivity analysis with respect
to varying DRAM latency. We show that these algorithms can cut down DRAM access
latency by up to 55% depending on the algorithm used. The benefit gained from
the algorithms is dependent upon their complexity. In general higher the
complexity higher is the benefit. Hungarian algorithm results in an optimal
solution. We find that two out of four algorithms provide a good trade-off
between performance and complexity for the workloads we studied
Precoding by Pairing Subchannels to Increase MIMO Capacity with Discrete Input Alphabets
We consider Gaussian multiple-input multiple-output (MIMO) channels with
discrete input alphabets. We propose a non-diagonal precoder based on the
X-Codes in \cite{Xcodes_paper} to increase the mutual information. The MIMO
channel is transformed into a set of parallel subchannels using Singular Value
Decomposition (SVD) and X-Codes are then used to pair the subchannels. X-Codes
are fully characterized by the pairings and a real rotation matrix
for each pair (parameterized with a single angle). This precoding structure
enables us to express the total mutual information as a sum of the mutual
information of all the pairs. The problem of finding the optimal precoder with
the above structure, which maximizes the total mutual information, is solved by
{\em i}) optimizing the rotation angle and the power allocation within each
pair and {\em ii}) finding the optimal pairing and power allocation among the
pairs. It is shown that the mutual information achieved with the proposed
pairing scheme is very close to that achieved with the optimal precoder by Cruz
{\em et al.}, and is significantly better than Mercury/waterfilling strategy by
Lozano {\em et al.}. Our approach greatly simplifies both the precoder
optimization and the detection complexity, making it suitable for practical
applications.Comment: submitted to IEEE Transactions on Information Theor
A simple Havel-Hakimi type algorithm to realize graphical degree sequences of directed graphs
One of the simplest ways to decide whether a given finite sequence of
positive integers can arise as the degree sequence of a simple graph is the
greedy algorithm of Havel and Hakimi. This note extends their approach to
directed graphs. It also studies cases of some simple forbidden edge-sets.
Finally, it proves a result which is useful to design an MCMC algorithm to find
random realizations of prescribed directed degree sequences.Comment: 11 pages, 1 figure submitted to "The Electronic Journal of
Combinatorics
Search for the end of a path in the d-dimensional grid and in other graphs
We consider the worst-case query complexity of some variants of certain
\cl{PPAD}-complete search problems. Suppose we are given a graph and a
vertex . We denote the directed graph obtained from by
directing all edges in both directions by . is a directed subgraph of
which is unknown to us, except that it consists of vertex-disjoint
directed paths and cycles and one of the paths originates in . Our goal is
to find an endvertex of a path by using as few queries as possible. A query
specifies a vertex , and the answer is the set of the edges of
incident to , together with their directions. We also show lower bounds for
the special case when consists of a single path. Our proofs use the theory
of graph separators. Finally, we consider the case when the graph is a grid
graph. In this case, using the connection with separators, we give
asymptotically tight bounds as a function of the size of the grid, if the
dimension of the grid is considered as fixed. In order to do this, we prove a
separator theorem about grid graphs, which is interesting on its own right
- …