
527 research outputs found

Spectral approach to linear programming bounds on codes

Author: A. M. Barg
C. Bachoc
D. Yu. Nogin
F.R. Gantmakher
G. Szegö
G.A. Kabatianski
M.E. Ismail
R.J. McEliece
V.I. Levenshtein
V.I. Levenshtein
V.I. Levenshtein
W.H. Foster
Publication venue: 'Pleiades Publishing Ltd'
Publication date: 13/03/2006

We give new proofs of asymptotic upper bounds of coding theory obtained within the framework of Delsarte's linear programming method. The proofs rely on the analysis of eigenvectors of some finite-dimensional operators related to orthogonal polynomials. The examples of the method considered in the paper include binary codes, binary constant-weight codes, spherical codes, and codes in projective spaces. Comment: 11 pages, submitte

arXiv.org e-Print Archive

Crossref

Finding a Mate With No Social Skills

Author: Carter C.
Inada Y.
Levenshtein V. I.
Marriott C.
Penrose M.
Weis A.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 25/04/2015

Sexual reproductive behavior has a necessary social coordination component as willing and capable partners must both be in the right place at the right time. While there are many known social behavioral adaptations to support solutions to this problem, we explore the possibility and likelihood of solutions that rely only on non-social mechanisms. We find three kinds of social organization that help solve this social coordination problem (herding, assortative mating, and natal philopatry) emerge in populations of simulated agents with no social mechanisms available to support these organizations. We conclude that the non-social origins of these social organizations around sexual reproduction may provide the environment for the development of social solutions to the same and different problems. Comment: 8 pages, 5 figures, GECCO'1

arXiv.org e-Print Archive

Crossref

The benefits of using a walking interface to navigate virtual environments

Author: Bowman D. A.
Hollerbach J. M.
Levenshtein V. I.
Loomis J. M.
Razzaque S.
Roy A. Ruddle
Ruddle R. A.
Simon Lessels
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/04/2009

Navigation is the most common interactive task performed in three-dimensional virtual environments (VEs), but it is also a task that users often find difficult. We investigated how body-based information about the translational and rotational components of movement helped participants to perform a navigational search task (finding targets hidden inside boxes in a room-sized space). When participants physically walked around the VE while viewing it on a head-mounted display (HMD), they performed 90% of trials perfectly, comparable to participants who had performed an equivalent task in the real world during a previous study. By contrast, participants performed less than 50% of trials perfectly if they used a tethered HMD (moving by physically turning but pressing a button to translate) or a desktop display (no body-based information). This is the most complex navigational task in which a real-world level of performance has been achieved in a VE. Behavioral data indicate that both translational and rotational body-based information are required to accurately update one's position during navigation, and participants who walked tended to avoid obstacles, even though collision detection was not implemented and feedback was not provided. A walking interface would bring immediate benefits to a number of VE applications.

Crossref

White Rose Research Online

Fast phonetic similarity search over large repositories

Author: G.A. Miller
M. Paterson
P.A.V. Hall
R. Hamming
V.A. Mann
V.I. Levenshtein
W.H. Gomaa
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2014

Analysis of unstructured data may be inefficient in the presence of spelling errors. Existing approaches use string similarity methods to search for valid words within a text, with a supporting dictionary. However, they are not rich enough to encode phonetic information to assist the search. In this paper, we present a novel approach for efficiently performing phonetic similarity search over large data sources that uses a data structure called PhoneticMap to encode language-specific phonetic information. We validate our approach through an experiment over a data set using a Portuguese variant of a well-known repository, to automatically correct words with spelling errors.

Crossref

UCL Discovery

Suffix Tree of Alignment: An Efficient Index for Similar Data

Author: A. Amir
D. Gusfield
E. Ukkonen
E.M. McCreight
G. Navarro
H.H. Do
J. Ziv
K. Sadakane
M. Crochemore
M. Farach-Colton
P. Bille
R. Grossi
R.A. Baeza-Yates
S. Huang
S. Karlin
S. Kuruppu
V. Levenshtein
V. Mäkinen
V. Mäkinen
Publication date: 01/01/2013

We consider an index data structure for similar strings. The generalized suffix tree can be a solution for this. The generalized suffix tree of two strings A and B is a compacted trie representing all suffixes in A and B. It has |A|+|B| leaves and can be constructed in O(|A|+|B|) time. However, if the two strings are similar, the generalized suffix tree is not efficient because it does not exploit the similarity, which is usually represented as an alignment of A and B. In this paper we propose a space/time-efficient suffix tree of alignment which wisely exploits the similarity in an alignment. Our suffix tree for an alignment of A and B has |A| + l_d + l_1 leaves, where l_d is the sum of the lengths of all parts of B different from A and l_1 is the sum of the lengths of some common parts of A and B. We did not compromise the pattern search to reduce the space. Our suffix tree can be searched for a pattern P in O(|P|+occ) time, where occ is the number of occurrences of P in A and B. We also present an efficient algorithm to construct the suffix tree of alignment. When the suffix tree is constructed from scratch, the algorithm requires O(|A| + l_d + l_1 + l_2) time, where l_2 is the sum of the lengths of other common substrings of A and B. When the suffix tree of A is already given, it requires O(l_d + l_1 + l_2) time. Comment: 12 pages
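The baseline this abstract starts from, the generalized suffix tree of A and B with |A|+|B| leaves, can be illustrated with a small sketch. This is not the paper's suffix tree of alignment and not a linear-time construction: it builds an uncompacted suffix trie in quadratic time and space, which has the same leaves as the compacted tree. The function names and the terminator characters `$` and `#` are assumptions made for illustration, and the terminators are assumed not to occur in either string.

```python
LEAF = None  # dictionary key that cannot collide with a one-character edge label


def build_generalized_suffix_trie(a, b, end_a="$", end_b="#"):
    """Naive (uncompacted) trie of every suffix of a and b.

    Assumes end_a and end_b are distinct and occur in neither string.
    """
    root = {}
    for text, end, label in ((a, end_a, "A"), (b, end_b, "B")):
        for start in range(len(text)):
            node = root
            for ch in text[start:] + end:
                node = node.setdefault(ch, {})
            # The terminator makes every suffix end at its own leaf.
            node[LEAF] = (label, start)
    return root


def leaves_below(node):
    """Collect the (label, start) pairs stored at all leaves under node."""
    found, stack = [], [node]
    while stack:
        current = stack.pop()
        for key, child in current.items():
            if key is LEAF:
                found.append(child)
            else:
                stack.append(child)
    return found


def find_occurrences(root, pattern):
    """Walk the pattern from the root, then report every leaf below it."""
    node = root
    for ch in pattern:
        if ch not in node:
            return []
        node = node[ch]
    return sorted(leaves_below(node))


root = build_generalized_suffix_trie("banana", "bandana")
print(len(leaves_below(root)))        # 13, i.e. |A| + |B| = 6 + 7
print(find_occurrences(root, "ana"))  # [('A', 1), ('A', 3), ('B', 4)]
```

The walk down the pattern takes |P| steps; in a compacted suffix tree the leaves below the reached node can then be listed in time proportional to occ, whereas this uncompacted sketch may visit many more nodes.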

arXiv.org e-Print Archive

CiteSeerX

Crossref

King's Research Portal

Using Social Media to Promote STEM Education: Matching College Students with Role Models

Author: A Bandura
A Bandura
AB Heldman
C Merrill
CJ Owen
D Karunanayake
DM Blei
EA Ensher
G Salton
HV Emmerik
JE Lydon
K Weber
L Tsui
M Hall
P Jaccard
P Lockwood
S Metz
TA Judge
VI Levenshtein
Publication date: 01/07/2016

STEM (Science, Technology, Engineering, and Mathematics) fields have become increasingly central to U.S. economic competitiveness and growth. The shortage in the STEM workforce has brought the promotion of STEM education to the forefront. The rapid growth of social media usage provides a unique opportunity to predict users' real-life identities and interests from online texts and photos. In this paper, we propose an innovative approach that leverages social media to promote STEM education: matching Twitter college student users with diverse LinkedIn STEM professionals using a ranking algorithm based on the similarities of their demographics and interests. We share the belief that increasing STEM presence by introducing career role models who share similar interests and demographics will inspire students to develop interests in STEM-related fields and emulate their models. Our evaluation on 2,000 real college students demonstrated the accuracy of our ranking algorithm. We also design a novel implementation that recommends matched role models to the students. Comment: 16 pages, 8 figures, accepted by ECML/PKDD 2016, Industrial Track

arXiv.org e-Print Archive

Crossref

Escaping the Big Brother: an empirical study on factors influencing identification and information leakage on the Web

Author: Bartunov S
Bilgic M
Carmagnola F
Hay M
Iofciu T
Labitzke S
Levenshtein V
Mislove A
Monge AE
Shehab M
Shen W
Winkler WE
Zafarani R
Publication venue: 'SAGE Publications'
Publication date: 01/01/2014

This paper presents a study on factors that may increase the risks of personal information leakage, due to the possibility of connecting user profiles that are not explicitly linked together. First, we introduce a technique for user identification based on cross-site checking and linking of user attributes. Then, we describe the experimental evaluation of the identification technique both in a real setting and on an online sample, showing its accuracy in discovering unknown personal data. Finally, we combine the results on the accuracy of identification with the results of a questionnaire completed by the same subjects who performed the test in the real setting. The aim of the study was to discover possible factors that make users vulnerable to this kind of technique. We found that the number of social networks used, their features, and especially the number of profiles abandoned and forgotten by the user are factors that increase the likelihood of identification and the privacy risks.

Crossref

Open Research Online (The Open University)

Archivio istituzionale della ricerca - Università di Genova

On the most compact regular lattice in large dimensions: A statistical mechanical approach

Author: B. Derrida
C.A. Rogers
C.A. Rogers
C.A. Rogers
C.A. Rogers
C.L. Siegel
F.H. Stillinger
G. Parisi
G. Parisi
G. Parisi
G. Parisi
G.A. Kabatiansky
Giorgio Parisi
H. Cohn
H. Minkowski
H.L. Frisch
H.L. Frisch
H.L. Frisch
J.H. Conway
L. Angelani
M. Mézard
S. Torquato
V.I. Levenshtein
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 03/10/2007

In this paper I will approach the computation of the maximum density of regular lattices in large dimensions using a statistical mechanics approach. The starting point will be some theorems of Rogers, which are virtually unknown in the community of physicists. Using his approach one can see that there are many similarities (and differences) with the problem of computing the entropy of a liquid of perfect spheres. The relation between the two problems is investigated in detail. Some conjectures are presented that need further investigation in order to check their consistency. Comment: 27 pages

arXiv.org e-Print Archive

CiteSeerX

Crossref

Archivio della ricerca- Università di Roma La Sapienza

On the accuracy of language trees

Author: A Rambaut
C Christensen
CH Langley
D Bakker
D Bryant
D Bryant
D Robinson
EW Holmann
F Petroni
F Tria
F Tria
Francesca Tria
H Kishino
J Nerbonne
JL Thorne
M Dunn
M Randers
M Serva
M Swadesh
M Swadesh
Matjaz Perc
MJ Sanderson
MJ Sanderson
N Saitou
PMQ Atkinson
Q Atkinson
R Desper
RD Gray
RD Gray
S Pompei
S Wichmann
Simone Pompei
SJ Greenhill
VI Levenshtein
Vittorio Loreto
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2011

Historical linguistics aims at inferring the most likely language phylogenetic tree starting from information concerning the evolutionary relatedness of languages. The available information typically consists of lists of homologous (lexical, phonological, syntactic) features or characters for many different languages. From this perspective the reconstruction of language trees is an example of an inverse problem: starting from present, incomplete and often noisy information, one aims at inferring the most likely past evolutionary history. A fundamental issue in inverse problems is the evaluation of the inference made. A standard way of dealing with this question is to generate data with artificial models in order to have full access to the evolutionary process one is going to infer. This procedure presents an intrinsic limitation: when dealing with real data sets, one typically does not know which model of evolution is the most suitable for them. A possible way out is to compare algorithmic inference with expert classifications. This is the point of view we take here by conducting a thorough survey of the accuracy of reconstruction methods as compared with the Ethnologue expert classifications. We focus in particular on state-of-the-art distance-based methods for phylogeny reconstruction using worldwide linguistic databases. In order to assess the accuracy of the inferred trees we introduce and characterize two generalizations of standard definitions of distances between trees. Based on these scores we quantify the relative performances of the distance-based algorithms considered. Further, we quantify how the completeness and the coverage of the available databases affect the accuracy of the reconstruction. Finally, we draw some conclusions about where the accuracy of the reconstructions in historical linguistics stands and about the leading directions to improve it. Comment: 36 pages, 14 figures

arXiv.org e-Print Archive

Public Library of Science (PLOS)

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

Archivio della ricerca- Università di Roma La Sapienza