Search CORE

451 research outputs found

Performance Evaluation and Optimization of Math-Similarity Search

Author: C Spearman
D Hawking
J Bar-Ilan
L Vaughan
M Kendall
Q Zhang
R Fagin
W Hartmann
Publication venue
Publication date: 29/05/2015
Field of study

Similarity search in math is to find mathematical expressions that are similar to a user's query. We conceptualized the similarity factors between mathematical expressions, and proposed an approach to math similarity search (MSS) by defining metrics based on those similarity factors [11]. Our preliminary implementation indicated the advantage of MSS compared to non-similarity based search. In order to more effectively and efficiently search similar math expressions, MSS is further optimized. This paper focuses on performance evaluation and optimization of MSS. Our results show that the proposed optimization process significantly improved the performance of MSS with respect to both relevance ranking and recall.Comment: 15 pages, 8 figure

arXiv.org e-Print Archive

Crossref

Web citations in patents: Evidence of technological impact?

Author: Aguillo
Alcácer
Bar-Ilan
Bar-Ilan
Bar-Ilan
Brin
Cronin
Endres
Harries
Haustein
Ingwersen
Kim
Kousha
Kousha
Li
Looy
M
Marley
Meyer
Meyer
Meyer
Michel
Moskovkin
Narin
Narin
Narin
Oppenheim
Orduna-Malea
Orduna-Malea
Orduna-Malea
Ortega
Park
Priem
Seeber
Smith
Smith
Sud
Thelwall
Thelwall
Thelwall
Thelwall
Thelwall
Thelwall
Tijssen
Vaughan
Verbeek
Wilkinson
Publication venue: 'Wiley'
Publication date: 01/01/2017
Field of study

This is an accepted manuscript of an article published by Wiley Blackwell in Journal of the Association for Information Science and Technology on 17/07/2017, available online: https://doi.org/10.1002/asi.23821 The accepted version of the publication may differ from the final published version.Patents sometimes cite web pages either as general background to the problem being addressed or to identify prior publications that will limit the scope of the patent granted. Counts of the number of patents citing an organisation’s website may therefore provide an indicator of its technological capacity or relevance. This article introduces methods to extract URL citations from patents and evaluates the usefulness of counts of patent web citations as a technology indicator. An analysis of patents citing 200 US universities or 177 UK universities found computer science and engineering departments to be frequently cited, as well as research-related web pages, such as Wikipedia, YouTube or Internet Archive. Overall, however, patent URL citations seem to be frequent enough to be useful for ranking major US and the top few UK universities if popular hosted subdomains are filtered out, but the hit count estimates on the first search engine results page should not be relied upon for accuracy

Crossref

RiuNet

Wolverhampton Intellectual Repository and E-theses

Reconstruction of Network Evolutionary History from Extant Network Topology and Duplication History

Author: A. Barabasi
A. Bhan
A. Vazquez
A. Wagner
I. Ispolatov
J. Bar-Ilan
J. Dutkowski
J. Pinney
L. Hakes
M. Middendorf
M. Stumpf
N. Farid
R. Pastor-Satorras
R. Patro
R. Sole
S. Navlakha
T. Yamada
Publication venue
Publication date: 01/01/2012
Field of study

Genome-wide protein-protein interaction (PPI) data are readily available thanks to recent breakthroughs in biotechnology. However, PPI networks of extant organisms are only snapshots of the network evolution. How to infer the whole evolution history becomes a challenging problem in computational biology. In this paper, we present a likelihood-based approach to inferring network evolution history from the topology of PPI networks and the duplication relationship among the paralogs. Simulations show that our approach outperforms the existing ones in terms of the accuracy of reconstruction. Moreover, the growth parameters of several real PPI networks estimated by our method are more consistent with the ones predicted in literature.Comment: 15 pages, 5 figures, submitted to ISBRA 201

arXiv.org e-Print Archive

Crossref

ScholarBank@NUS

Search Engine Similarity Analysis: A Combined Content and Rankings Approach

Author: CS Wallace
E Enge
J Bar-Ilan
J Sachse
K Bharat
L Vaughan
M Gordon
MA Jaro
R Fagin
SH Lee
W Ding
W Webber
Y Wang
Z Bar-Yossef
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 06/11/2020
Field of study

How different are search engines? The search engine wars are a favorite topic of on-line analysts, as two of the biggest companies in the world, Google and Microsoft, battle for prevalence of the web search space. Differences in search engine popularity can be explained by their effectiveness or other factors, such as familiarity with the most popular first engine, peer imitation, or force of habit. In this work we present a thorough analysis of the affinity of the two major search engines, Google and Bing, along with DuckDuckGo, which goes to great lengths to emphasize its privacy-friendly credentials. To do so, we collected search results using a comprehensive set of 300 unique queries for two time periods in 2016 and 2019, and developed a new similarity metric that leverages both the content and the ranking of search responses. We evaluated the characteristics of the metric against other metrics and approaches that have been proposed in the literature, and used it to (1) investigate the similarities of search engine results, (2) the evolution of their affinity over time, (3) what aspects of the results influence similarity, and (4) how the metric differs over different kinds of search services. We found that Google stands apart, but Bing and DuckDuckGo are largely indistinguishable from each other.Comment: Shorter version of this paper was accepted in the 21st International Conference on Web Information Systems Engineering (WISE 2020). The final authenticated version is available online at https://doi.org/10.1007/978-3-030-62008-0_

arXiv.org e-Print Archive

Crossref

Estimating search engine index size variability: a 9-year longitudinal study

Author: A Anagnostopoulos
A Broder
A Kilgarriff
A Kilgarriff
A Spink
A Uyar
Antal van den Bosch
D Lewandowski
GK Zipf
H Turtle
J Bar-Ilan
J Bar-Ilan
J Bar-Ilan
J Rice
L Vaughan
M Henzinger
M Thelwall
M Thelwall
M Thelwall
M Zimmer
Maurice de Kunder
N Payne
R Rousseau
S Lawrence
S Lawrence
Toine Bogers
Y Hirate
Z Bar-Yossef
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Publication and patent analysis of European researchers in the field of production technology and manufacturing systems

Author: A Agrawal
A Geuna
A Molinari
A Schubert
B Looy van
C Labbé
D Czarnitzki
Domenico Maisano
F Franceschini
F Franceschini
F Franceschini
F Franceschini
F Franceschini
Fiorenzo Franceschini
J Bar-Ilan
J Guan
L Bornmann
M Calderini
P Criscuolo
P Mattson
PK Wong
PS Nagpaul
S Breschi
T Lazaridis
YH Cheng
Publication venue: Springer
Publication date: 01/01/2012
Field of study

This paper develops a structured comparison among a sample of European researchers in the field of Production Technology and Manufacturing Systems, on the basis of scientific publications and patents. Researchers are evaluated and compared by a variegated set of indicators concerning (1) the output of individual researchers and (2) that of groups of researchers from the same country. While not claiming to be exhaustive, the results of this preliminary study provide a rough indication of the publishing and patenting activity of researchers in the field of interest, identifying (dis)similarities between different countries. Of particular interest is a proposal for aggregating analysis results by means of maps based on publication and patent indicators. A large amount of empirical data are presented and discusse

CiteSeerX

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino

The hw-rank: an h-index variant for ranking web pages

Author: A Korn
A Schubert
AFJ Raan Van
E Garfield
F Ruane
G Pinski
G Salton
HF Moed
J Bar-Ilan
J Hauke
JD West
JE Hirsch
JM Kleinberg
Judit Bar-Ilan
L Bornmann
L Egghe
L Katz
M Thelwall
Mark Levene
P Ingwersen
R Costas
R Guns
S Brin
S Fortunato
SJ Carrière
SX Zhao
T Braun
VP Guerrero-Bote
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/03/2015
Field of study

We introduce a novel ranking of search results based on a variant of the h-index for directed information networks such as the Web. The h-index was originally introduced to measure an individual researcher’s scientific output and influence, but here a variant of it is applied to assess the ‘‘importance’’ of web pages. Like PageRank, the‘‘importance’’ of a page is defined by the ‘‘importance’’ of the pages linking to it. However, unlike the computation of PageRank which involves the whole web graph, computing the h-index for web pages (the hw-rank) is based on a local computation and only the neighbors of the neighbors of the given node are considered. Preliminary results show a strong correlation between ranking with the hw-rank and PageRank, and moreover its computation is simpler and less complex than computation of the PageRank. Further, larger scale experiments are needed in order to assess the applicability of the method

Crossref

Birkbeck Institutional Research Online

The carboligation reaction of acetohydroxyacid synthase II: Steady-state intermediate distributions in wild type and mutants by NMR

Author: Abell
Bar-Ilan
Barak
Bowen
Cleland
D. M. Chipman
Engel
Epelbaum
Epelbaum
G. Hubner
Gollop
Gollop
Green
Hopfield
Ibdah
K. Tittmann
Kern
M. Vyazmensky
Tittmann
Tittmann
Tittmann
Tse
Z. Barak
Publication venue: 'Proceedings of the National Academy of Sciences'
Publication date
Field of study

Crossref

Why are Websites co-linked? The case of Canadian universities

Author: B. L. Berg
D. Wilkinson
E. S. Allen
E. T. Jepsen
H. J. Kim
J. Bar-Ilan
J. Bar-Ilan
K. Crowston
L. Vaughan
L. Vaughan
L. Vaughan
Liwen Vaughan
M. Thelwall
M. Thelwall
M. Thelwall
M. Thelwall
Margaret E. I. Kipp
Yijun Gao
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Relationship among research collaboration, number of documents and number of citations. A case study in Spanish computer science production in 2000-2009.

Author: A. Gazni
A. VanRaan
Alfonso Ibáñez
B. Cronin
B. Lancho-Barrantes
B. Ponomariov
C. Bartneck
C. Laine
C. Liao
Concha Bielza
D. Archibugi
D. Beaver
D. Beaver
E. Garfield
E. Garfield
G. Abramo
G. Abramo
G. Bammer
G. Cabanac
G. Olson
H. Mann
J. Bar-Ilan
J. Katz
J. Levitt
J. Solomon
J. Wainer
L. Bornmann
L. Fortnow
L. Smolinsky
M. Franceschet
M. Franceschet
M. Franceschet
O. Persson
P. Cullen
P. Weingart
Pedro Larrañaga
R. Hauptman
R. Landry
R. Ruiz-Perez
R. Sooryamoorthy
R. Zetterstrom
S. Presser
W. Glanzel
W. Kruskal
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

This paper analyzes the relationship among research collaboration, number of documents and number of citations of computer science research activity. It analyzes the number of documents and citations and how they vary by number of authors. They are also analyzed (according to author set cardinality) under different circumstances, that is, when documents are written in different types of collaboration, when documents are published in different document types, when documents are published in different computer science subdisciplines, and, finally, when documents are published by journals with different impact factor quartiles. To investigate the above relationships, this paper analyzes the publications listed in the Web of Science and produced by active Spanish university professors between 2000 and 2009, working in the computer science field. Analyzing all documents, we show that the highest percentage of documents are published by three authors, whereas single-authored documents account for the lowest percentage. By number of citations, there is no positive association between the author cardinality and citation impact. Statistical tests show that documents written by two authors receive more citations per document and year than documents published by more authors. In contrast, results do not show statistically significant differences between documents published by two authors and one author. The research findings suggest that international collaboration results on average in publications with higher citation rates than national and institutional collaborations. We also find differences regarding citation rates between journals and conferences, across different computer science subdisciplines and journal quartiles as expected. Finally, our impression is that the collaborative level (number of authors per document) will increase in the coming years, and documents published by three or four authors will be the trend in computer science literature

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Archivo Digital UPM