Search CORE

65 research outputs found

Mean P(best > random) for conferences that took place in the indicated years, for both the Scholar and Scopus datasets.

Author: Anderson Rocha (531490)
Jacques Wainer (531488)
Michael Eckmann (706373)
Publication venue
Publication date
Field of study

Mean P(best > random) for conferences that took place in the indicated years, for both the Scholar and Scopus datasets.</p

The Francis Crick Institute

Specifics of the Scopus data.

Author: Anderson Rocha (531490)
Jacques Wainer (531488)
Michael Eckmann (706373)
Publication venue
Publication date
Field of study

The first figure in each cell is the number of non-best papers in the conference instance. The figure in parenthesis is the number of best papers.</p

The Francis Crick Institute

P(best > random) for the two datasets analyzed herein.

Author: Anderson Rocha (531490)
Jacques Wainer (531488)
Michael Eckmann (706373)
Publication venue
Publication date
Field of study

The entry “all” indicates the overall P(best > random). The error bar indicates the 95% confidence interval and the point at the center indicates the mean value of the probability that a best paper will receive more citations than a random non-best paper. The entries 2005 to 2011 indicate the mean and confidence interval of P(best > random) for conferences that took place in those years.</p

The Francis Crick Institute

Multiple Parenting Phylogeny Dataset 1.0

Author: Alberto Oliveira (570921)
Anderson Rocha (531490)
Pasquale Ferrara (574213)
Publication venue
Publication date
Field of study

This material contains the description and the links where to find the datasets used in the evaluation of Multiple Parenting Phylogeny Evaluation</p

The Francis Crick Institute

Specifics of the Google Scholar data.

Author: Anderson Rocha (531490)
Jacques Wainer (531488)
Michael Eckmann (706373)
Publication venue
Publication date
Field of study

The first figure in each cell is the number of non-best papers in the conference instance. The figure in parenthesis is the number of best papers.</p

The Francis Crick Institute

Using Visual Rhythms for Detecting Video-based Facial Spoof Attacks

Author: Allan Pinto (686499)
Anderson Rocha (531490)
Helio Pedrini (686835)
William Robson Schwartz (686834)
Publication venue
Publication date: 19/05/2015
Field of study

UVAD Dataset</p

The Francis Crick Institute

End User License Agreement

Author: Allan Pinto (686499)
Anderson Rocha (531490)
Helio Pedrini (686835)
William Robson Schwartz (686834)
Publication venue
Publication date
Field of study

End user license agreement required to download the Unicamp Video-Attack Dataset (UVAD)</p

The Francis Crick Institute

Composition of the cross-dataset training and testing.

Author: Anderson Rocha (531490)
Eduardo Valle (531489)
Herbert F. Jelinek (333014)
Jacques Wainer (531488)
Ramon Pires (5659174)
Publication venue
Publication date
Field of study

*The annotations SH and DH are added to form the training set in DR1, summing 180 images due to the overlap.</p

The Francis Crick Institute

Laser Printer Attribution: Exploring New Features and Beyond

Author: Anderson Rocha (531490)
Anselmo Ferreira (579035)
Giuliano Pinheiro (670886)
Jefersson Dos santos (596680)
Luiz Navarro (670892)
Publication venue
Publication date
Field of study

Dataset of the paper Laser Printer Attribution: Exploring New Features and Beyond </p

The Francis Crick Institute

On the Reconstruction of Text Phylogeny Trees: Evaluation and Analysis of Textual Relationships

Author: Anderson Rocha (531490)
Guilherme D. Marmerola (3361562)
Marina A. Oikawa (3592025)
Siome Goldenstein (613913)
Zanoni Dias (406615)
Publication venue
Publication date: 19/12/2016
Field of study

<div>Over the history of mankind, textual records change. Sometimes due to mistakes during transcription, sometimes on purpose, as a way to rewrite facts and reinterpret history. There are several classical cases, such as the logarithmic tables, and the transmission of antique and medieval scholarship. Today, text documents are largely edited and redistributed on the Web. Articles on news portals and collaborative platforms (such as Wikipedia), source code, posts on social networks, and even scientific publications or literary works are some examples in which textual content can be subject to changes in an evolutionary process. In this scenario, given a set of near-duplicate documents, it is worthwhile to find which one is the original and the history of changes that created the whole set. Such functionality would have immediate applications on news tracking services, detection of plagiarism, textual criticism, and copyright enforcement, for instance. However, this is not an easy task, as textual features pointing to the documents’ evolutionary direction may not be evident and are often dataset dependent. Moreover, side information, such as time stamps, are neither always available nor reliable. In this paper, we propose a framework for reliably reconstructing text phylogeny trees, and seamlessly exploring new approaches on a wide range of scenarios of text reusage. We employ and evaluate distinct combinations of dissimilarity measures and reconstruction strategies within the proposed framework, and evaluate each approach with extensive experiments, including a set of artificial near-duplicate documents with known phylogeny, and from documents collected from Wikipedia, whose modifications were made by Internet users. We also present results from qualitative experiments in two different applications: text plagiarism and reconstruction of evolutionary trees for manuscripts (stemmatology).</div

Directory of Open Access Journals

PubMed Central

Repositorio da Producao Cientifica e Intelectual da Unicamp

The Francis Crick Institute