Search CORE

73 research outputs found

Basement structure of the United Arab Emirates derived from an analysis of regional gravity and aeromagnetic database

Author: Ali MY
Fairhead JD
Green CM
Noufal A
Publication venue: 'Elsevier BV'
Publication date: 01/08/2017
Field of study

Gravity and aeromagnetic data covering the whole territory of the United Arab Emirates (UAE) have been used to evaluate both shallow and deep geological structures, in particular the depth to basement since it is not imaged by seismic data anywhere within the UAE. Thus, the aim has been to map the basement so that its structure can help to assess its control on the distribution of hydrocarbons within the UAE. Power spectrum analysis reveals gravity and magnetic signatures to have some similarities, in having two main density/susceptibility interfaces widely separated in depth such that regional-residual anomaly separation could effectively be undertaken. The upper density/susceptibility interface occurs at a depth of about 1.5 km while the deeper interface varies in depth throughout the UAE. For gravity, this deeper interface is assumed to be due to the combined effect of lateral changes in density structures within the sediments and in depth of basement while for magnetics it is assumed the sediments have negligible susceptibility and the anomalies unrelated to the volcanic/magmatic bodies result from only changes in depth to basement. The power spectrum analysis over the suspect volcanic/magmatic bodies indicates they occur at ~ 5 km depth. The finite tilt-depth and finite local wavenumber methods were used to estimate depth to source and only depths that agree to within 10% of each other were used to generate the depth to basement map. This depth to basement map, to the west of the UAE-Oman Mountains, varies in depth from 5 km to in excess of 15 km depth and is able to structurally account for the location of the shear structures, seen in the residual magnetic data, and the location of the volcanic/magmatic centres relative to a set of elongate N-S to NE-SW trending basement highs. The majority of oilfields in the UAE are located within these basement highs. Therefore, the hydrocarbon distribution in the UAE basin appears to be controlled by the location of the basement ridges

Biblioteca Digital de la Comunidad de Madrid

White Rose Research Online

Basement structure of the United Arab Emirates derived from an analysis of regional gravity and aeromagnetic database

Author: Ali MY
Fairhead JD
Green CM
Noufal A
Publication venue: 'Elsevier BV'
Publication date: 01/08/2017
Field of study

Crossref

White Rose Research Online

FastBLAST: Homology Relationships for Millions of Proteins

Author: A Marchler-Bauer
AA Schaffer
Adam P. Arkin
BE Suzek
Cecile Fairhead
CH Wu
CM Zmasek
D Wilson
F Pearl
H Mi
I Letunic
JD Selengut
LB Koski
M Remm
MN Price
Morgan N. Price
NJ Mulder
Paramvir S. Dehal
PS Dehal
R Durbin
RD Finn
RL Tatusov
S Yooseph
SF Altschul
W Gish
W Li
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

BackgroundAll-versus-all BLAST, which searches for homologous pairs of sequences in a database of proteins, is used to identify potential orthologs, to find new protein families, and to provide rapid access to these homology relationships. As DNA sequencing accelerates and data sets grow, all-versus-all BLAST has become computationally demanding.Methodology/principal findingsWe present FastBLAST, a heuristic replacement for all-versus-all BLAST that relies on alignments of proteins to known families, obtained from tools such as PSI-BLAST and HMMer. FastBLAST avoids most of the work of all-versus-all BLAST by taking advantage of these alignments and by clustering similar sequences. FastBLAST runs in two stages: the first stage identifies additional families and aligns them, and the second stage quickly identifies the homologs of a query sequence, based on the alignments of the families, before generating pairwise alignments. On 6.53 million proteins from the non-redundant Genbank database ("NR"), FastBLAST identifies new families 25 times faster than all-versus-all BLAST. Once the first stage is completed, FastBLAST identifies homologs for the average query in less than 5 seconds (8.6 times faster than BLAST) and gives nearly identical results. For hits above 70 bits, FastBLAST identifies 98% of the top 3,250 hits per query.Conclusions/significanceFastBLAST enables research groups that do not have supercomputers to analyze large protein sequence data sets. FastBLAST is open source software and is available at http://microbesonline.org/fastblast

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Assessing Performance of Orthology Detection Strategies Applied to Eukaryotic Genomes

Author: A Alexeyenko
A Hadgu
Aaron J. Mackey
AJ Enright
AJ Enright
CE Storm
CE Storm
Cecile Fairhead
CG Elsik
CM Zmasek
CM Zmasek
David S. Roos
DP Wall
EL Sonnhammer
EV Koonin
EV Koonin
F Chen
Feng Chen
H Hegyi
J Gouzy
J Magidson
JD Thompson
Jeroen K. Vermunt
JK Vermunt
JK Vermunt
KP O'Brien
L Li
LB Koski
M Remm
RF Doolittle
RL Tatusov
RL Tatusov
RL Tatusov
RL Tatusov
S Bandyopadhyay
S Henikoff
S Van Dongen
SF Altschul
SL Hui
T Hulsen
TF Deluca
WM Fitch
WM Fitch
Y Lee
Y Qu
Publication venue: Public Library of Science
Publication date: 01/01/2007
Field of study

Orthology detection is critically important for accurate functional annotation, and has been widely used to facilitate studies on comparative and evolutionary genomics. Although various methods are now available, there has been no comprehensive analysis of performance, due to the lack of a genomic-scale ‘gold standard’ orthology dataset. Even in the absence of such datasets, the comparison of results from alternative methodologies contains useful information, as agreement enhances confidence and disagreement indicates possible errors. Latent Class Analysis (LCA) is a statistical technique that can exploit this information to reasonably infer sensitivities and specificities, and is applied here to evaluate the performance of various orthology detection methods on a eukaryotic dataset. Overall, we observe a trade-off between sensitivity and specificity in orthology detection, with BLAST-based methods characterized by high sensitivity, and tree-based methods by high specificity. Two algorithms exhibit the best overall balance, with both sensitivity and specificity>80%: INPARANOID identifies orthologs across two species while OrthoMCL clusters orthologs from multiple species. Among methods that permit clustering of ortholog groups spanning multiple genomes, the (automated) OrthoMCL algorithm exhibits better within-group consistency with respect to protein function and domain architecture than the (manually curated) KOG database, and the homolog clustering algorithm TribeMCL as well. By way of using LCA, we are also able to comprehensively assess similarities and statistical dependence between various strategies, and evaluate the effects of parameter settings on performance. In summary, we present a comprehensive evaluation of orthology detection on a divergent set of eukaryotic genomes, thus providing insights and guides for method selection, tuning and development for different applications. Many biological questions have been addressed by multiple tests yielding binary (yes/no) outcomes but no clear definition of truth, making LCA an attractive approach for computational biology

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Tilburg University Repository

Comparative Genomics of Mycoplasma: Analysis of Conserved Essential Genes and Diversity of the Pan-Genome

Mycoplasma, the smallest self-replicating organism with a minimal metabolism and little genomic redundancy, is expected to be a close approximation to the minimal set of genes needed to sustain bacterial life. This study employs comparative evolutionary analysis of twenty Mycoplasma genomes to gain an improved understanding of essential genes. By analyzing the core genome of mycoplasmas, we finally revealed the conserved essential genes set for mycoplasma survival. Further analysis showed that the core genome set has many characteristics in common with experimentally identified essential genes. Several key genes, which are related to DNA replication and repair and can be disrupted in transposon mutagenesis studies, may be critical for bacteria survival especially over long period natural selection. Phylogenomic reconstructions based on 3,355 homologous groups allowed robust estimation of phylogenetic relatedness among mycoplasma strains. To obtain deeper insight into the relative roles of molecular evolution in pathogen adaptation to their hosts, we also analyzed the positive selection pressures on particular sites and lineages. There appears to be an approximate correlation between the divergence of species and the level of positive selection detected in corresponding lineages

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Warwick Research Archives Portal Repository

FigShare

Transcriptome of Aphanomyces euteiches: New Oomycete Putative Pathogenicity Factors and Metabolic Pathways

Author: A Bateman
A Gotesson
A McLeod
AH Fairlamb
AR Hardham
Arnaud Bottin
Arnaud Couloux
AV Robold
B Gornhardt
B Henrissat
Bernard Dumas
BM Tyler
BN Lee
Catherine Mathé
Cecile Fairhead
Christophe Jacquet
D Qutob
E Gaulin
E Gaulin
E Gaulin
E Susko
E Wicker
Elodie Gaulin
F Panabières
F Villalba-Mateos
G Fellbrich
G Papavizas
G Stacey
GE Crooks
H Tordai
HL Martin
HO Akamatsu
HS Judelson
HZ Yan
I Inoue
J Tovar
J Win
JD Bendtsen
JD Thompson
JE Mitchell
JM Bollinger Jr
JP Levenfors
JY Le Berre
K Gajendran
K Söderhäll
M Comini
M Hauser
M Larsson
M Le Jean
M Ponchet
M Tian
M Tian
M Waugh
MA Madoui
ML Pilet-Nayel
Mohammed-Amine Madoui
MR Yen
MS Alphey
N Séjalon-Delmas
NL Hiller
P Rice
P Wojtaszek
PA Delwich
Patrick Wincker
RD Johnson
RH Jiang
S Brecht
S Costanzo
S Kamoun
S Krieger
SC Whisson
SF Altschul
SL Oza
SR Eddy
T Bai
T Torto-Alalibo
TA Randall
TA Torto
TA Torto-Alalibo
V Anantharaman
V Mikes
VS Blazer
Y Benjamini
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

Aphanomyces euteiches is an oomycete pathogen that causes seedling blight and root rot of legumes, such as alfalfa and pea. The genus Aphanomyces is phylogenically distinct from well-studied oomycetes such as Phytophthora sp., and contains species pathogenic on plants and aquatic animals. To provide the first foray into gene diversity of A. euteiches, two cDNA libraries were constructed using mRNA extracted from mycelium grown in an artificial liquid medium or in contact to plant roots. A unigene set of 7,977 sequences was obtained from 18,864 high-quality expressed sequenced tags (ESTs) and characterized for potential functions. Comparisons with oomycete proteomes revealed major differences between the gene content of A. euteiches and those of Phytophthora species, leading to the identification of biosynthetic pathways absent in Phytophthora, of new putative pathogenicity genes and of expansion of gene families encoding extracellular proteins, notably different classes of proteases. Among the genes specific of A. euteiches are members of a new family of extracellular proteins putatively involved in adhesion, containing up to four protein domains similar to fungal cellulose binding domains. Comparison of A. euteiches sequences with proteomes of fully sequenced eukaryotic pathogens, including fungi, apicomplexa and trypanosomatids, allowed the identification of A. euteiches genes with close orthologs in these microorganisms but absent in other oomycetes sequenced so far, notably transporters and non-ribosomal peptide synthetases, and suggests the presence of a defense mechanism against oxidative stress which was initially characterized in the pathogenic trypanosomatids

Public Library of Science (PLOS)

HAL Evry

Crossref

Directory of Open Access Journals

PubMed Central

HAL-CEA

Cryptic Diversity of African Tigerfish (Genus Hydrocynus) Reveals Palaeogeographic Signatures of Linked Neogene Geotectonic Events

Author: A Drummond
AE Moore
AE Moore
AE Moore
AJ Drummond
AL Bazinet
AL du Toit
AR Rogers
AR Rogers
AS Goudie
B Brewster
B Lehner
BD Kinabo
C Badgley
C Lévêque
C Speigel
CA Hunn
CG Faulkes
CG Faulkes
CH Scholz
CJ Ebinger
Colleen O'Ryan
CP Burridge
CV Reeves
D Calcagnotto
D Craw
D Delvaux
D Paugy
D Paugy
D Posada
D Posada
DA Benson
DJ Fairhead
DJ Fairhead
DL Griffin
DL Swofford
E Hekkala
E Njonfang
EK Balon
F Dixey
F Ronquist
F Tajima
Fenton P. D. Cotterill
FPD Cotterill
FPD Cotterill
FPD Cotterill
FU Bauer
G Bell-Cross
G Bell-Cross
GC Johns
GG Teugels
GS Merron
GT WoldeGabriel
HC Harpending
J Arroyave
J Cracraft
J Cracraft
J Stankiewicz
JD Thompson
JEE Smedmark
JJ Day
JM Regnoult
JM Reid
KJ Brown
KM Stewart
KM Stewart
KO Winemiller
L Bromham
L Excoffier
L Excoffier
M De Wit
M Kimura
Maarten J. de Wit
MD Crisp
MJ Hickerson
MP Cummings
MP Modisi
MS Njome
NA Drake
Neil John Gemmell
O Otero
O Otero
O Otero
Paul H. Skelton
PB Berendzen
PH Skelton
PH Skelton
PNB Jackson
PNB Jackson
PS Walsh
R Guiraud
RA Jubb
RA Jubb
RL Bruhn
RR Sokal
S Nagaoka
S Roller
S Tavaré
S Wright
SAE Marijnissen
SAM Goodier
Sarah A. M. Goodier
T Abebe
T Hrbek
TA Hall
TM Berra
TR Roberts
U Ring
WH Li
Y Moodley
YX Fu
YX Fu
Publication venue: Public Library of Science
Publication date: 14/12/2011
Field of study

The geobiotic history of landscapes can exhibit controls by tectonics over biotic evolution. This causal relationship positions ecologically specialized species as biotic indicators to decipher details of landscape evolution. Phylogeographic statistics that reconstruct spatio-temporal details of evolutionary histories of aquatic species, including fishes, can reveal key events of drainage evolution, notably where geochronological resolution is insufficient. Where geochronological resolution is insufficient, phylogeographic statistics that reconstruct spatio-temporal details of evolutionary histories of aquatic species, notably fishes, can reveal key events of drainage evolution. This study evaluates paleo-environmental causes of mitochondrial DNA (mtDNA) based phylogeographic records of tigerfishes, genus Hydrocynus, in order to reconstruct their evolutionary history in relation to landscape evolution across Africa. Strong geographical structuring in a cytochrome b (cyt-b) gene phylogeny confirms the established morphological diversity of Hydrocynus and reveals the existence of five previously unknown lineages, with Hydrocynus tanzaniae sister to a clade comprising three previously unknown lineages (Groups B, C and D) and H. vittatus. The dated phylogeny constrains the principal cladogenic events that have structured Hydrocynus diversity from the late Miocene to the Plio-Pleistocene (ca. 0–16 Ma). Phylogeographic tests reveal that the diversity and distribution of Hydrocynus reflects a complex history of vicariance and dispersals, whereby range expansions in particular species testify to changes to drainage basins. Principal divergence events in Hydrocynus have interfaced closely with evolving drainage systems across tropical Africa. Tigerfish evolution is attributed to dominant control by pulses of geotectonism across the African plate. Phylogenetic relationships and divergence estimates among the ten mtDNA lineages illustrates where and when local tectonic events modified Africa's Neogene drainage. Haplotypes shared amongst extant Hydrocynus populations across northern Africa testify to recent dispersals that were facilitated by late Neogene connections across the Nilo-Sahelian drainage. These events in tigerfish evolution concur broadly with available geological evidence and reveal prominent control by the African Rift System, evident in the formative events archived in phylogeographic records of tigerfish

Public Library of Science (PLOS)

Cape Town University OpenUCT

Crossref

Directory of Open Access Journals

PubMed Central

Gene-Specific Signatures of Elevated Non-Synonymous Substitution Rates Correlate Poorly across the Plasmodium Genus

Author: A Kushwaha
AA Escalante
AA Sultan
AE Topolska
AF Cowman
AG Clark
AG Maier
AL Hughes
AM Tomas
BC van Schaijk
C Cerami
Cecile Fairhead
CG Black
CJ McCormick
CJ Stoeckert Jr
D Gaur
David J. Conway
DC Jeffares
DL Narum
Gareth D. Weedall
H Hisaeda
HM Muller
I Siden-Kiamos
IA Quakyi
IT Ling
IT Ling
J Mu
J Mu
J Stubbs
J Thompson
JA Pearce
JD Thompson
JK Thompson
JM Burns Jr
JM Carlton
JT Dessens
JT Dessens
JT Dessens
K Kadota
K Kato
KR Trenholme
M Ghai
M Suyama
M Yuda
M Yuda
MB Borre
MJ Gardner
MR van Dijk
N Arisue
N Hall
O Kaneko
O Kaneko
O Silvie
P Preiser
PR Sanders
PR Sanders
Q Shi
R Chattopadhyay
R Nielsen
RF Howard
RG Ridley
S Eksi
S Lustigman
SK Volkman
SL Perkins
SM Rich
SP Sidjanski
Spencer D. Polley
T Ishino
T Ishino
T Kariu
T Kariu
T Triglia
T-M Gilberger
U Frevert
VM Marshall
VM Marshall
X Li
Y Sterkers
YL Tsai
Z Yang
Z Yang
Publication venue: Public Library of Science
Publication date: 28/05/2008
Field of study

BACKGROUND: Comparative genome analyses of parasites allow large scale investigation of selective pressures shaping their evolution. An acute limitation to such analysis of Plasmodium falciparum is that there is only very partial low-coverage genome sequence of the most closely related species, the chimpanzee parasite P. reichenowi. However, if orthologous genes have been under similar selective pressures throughout the Plasmodium genus then positive selection on the P. falciparum lineage might be predicted to some extent by analysis of other lineages. PRINCIPAL FINDINGS: Here, three independent pairs of closely related species in different sub-generic clades (P. falciparum and P. reichenowi; P. vivax and P. knowlesi; P. yoelii and P. berghei) were compared for a set of 43 candidate ligand genes considered likely to be under positive directional selection and a set of 102 control genes for which there was no selective hypothesis. The ratios of non-synonymous to synonymous substitutions (dN/dS) were significantly elevated in the candidate ligand genes compared to control genes in each of the three clades. However, the rank order correlation of dN/dS ratios for individual candidate genes was very low, less than the correlation for the control genes. SIGNIFICANCE: The inability to predict positive selection on a gene in one lineage by identifying elevated dN/dS ratios in the orthologue within another lineage needs to be noted, as it reflects that adaptive mutations are generally rare events that lead to fixation in individual lineages. Thus it is essential to complete the genome sequences of particular species of phylogenetic importance, such as P. reichenowi

Public Library of Science (PLOS)

Crossref

LSHTM Research Online

Directory of Open Access Journals

PubMed Central

Genome Sequence of Fusobacterium nucleatum Subspecies Polymorphum — a Genetically Tractable Fusobacterium

Author: A Brenot
A Jewett
A Lukashin
A Mira
A Yoshida
AH Rogers
AH Rogers
AI Bolstad
AM Chryssagi
AP Ribeiro-Sobrinho
B Bassler
B Ewing
B Ewing
BJ Paster
BJ Shenker
C Bearfield
C Medigue
Cecile Fairhead
D Jean
D Kersulyte
DJ Bradshaw
DR Demuth
E Holst
E Kononen
ECC Lin
EJ Goldstein
F Feuille
G Conrads
GB Hill
George E. Fox
George M. Weinstock
H Bruggemann
H Jousimies-Somer
H Mikamo
H Mikamo
H Philippe
H Takada
Huaiyang Jiang
I Brook
I Moszer
J Frias
J Kaufman
J Mrazek
Jason Gioia
JD Thompson
JG Lawrence
JG Lawrence
JG Lawrence
JL Dzink
JL Ebersole
JL Siefert
Joseph F. Petrosino
K Li
KY King
L Lindahl
LA Ximénez-Fyvie
LJ Brown
M Carsiotis
M Desvaux
M Ozaki
M Zimmer
MA Ragan
ML Morris
MP McLeod
P Havlak
PE Kolenbrander
PI Diaz
PI Diaz
PJ Christie
R Chaudhry
R Civen
R Gmur
R Niederman
R Zhang
RJ Cahill
RS Tuttle
S Griffiths-Jones
S Hase
S Hunt Gerardo
S Karlin
S Kinder Haake
S Kinder Haake
S Sukupolvi
SA Leach
SA Robrish
SA Robrish
Sandor E. Karpathy
Sarah K. Highlander
Shailaja Yerrapragada
SK Haake
Susan Kinder Haake
T Bobik
T Jaeger
T Kuriyama
T Suzuki
T Takemoto
T Tobe
TA Bobik
TL McKay
TM Lowe
TR Cech
TV Ilyina
UE Schaible
V Braun
V Kapatral
V Kapatral
WE Moore
WS Hayes
Xiang Qin
Y Lehmann
Yamei Liu
YW Han
YW Han
Z-F Cheng
Publication venue: Public Library of Science
Publication date: 01/01/2007
Field of study

Fusobacterium nucleatum is a prominent member of the oral microbiota and is a common cause of human infection. F. nucleatum includes five subspecies: polymorphum, nucleatum, vincentii, fusiforme, and animalis. F. nucleatum subsp. polymorphum ATCC 10953 has been well characterized phenotypically and, in contrast to previously sequenced strains, is amenable to gene transfer. We sequenced and annotated the 2,429,698 bp genome of F. nucleatum subsp. polymorphum ATCC 10953. Plasmid pFN3 from the strain was also sequenced and analyzed. When compared to the other two available fusobacterial genomes (F. nucleatum subsp. nucleatum, and F. nucleatum subsp. vincentii) 627 open reading frames unique to F. nucleatum subsp. polymorphum ATCC 10953 were identified. A large percentage of these mapped within one of 28 regions or islands containing five or more genes. Seventeen percent of the clustered proteins that demonstrated similarity were most similar to proteins from the clostridia, with others being most similar to proteins from other gram-positive organisms such as Bacillus and Streptococcus. A ten kilobase region homologous to the Salmonella typhimurium propanediol utilization locus was identified, as was a prophage and integrated conjugal plasmid. The genome contains five composite ribozyme/transposons, similar to the CdISt IStrons described in Clostridium difficile. IStrons are not present in the other fusobacterial genomes. These findings indicate that F. nucleatum subsp. polymorphum is proficient at horizontal gene transfer and that exchange with the Firmicutes, particularly the Clostridia, is common

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

DigitalCommons@The Texas Medical Center

Large expert-curated database for benchmarking document similarity detection in biomedical literature search

Author: Aanei CM
Abid MB
Abramowitz MK
Abu-Zaid A
Afnan M
Agarabi C
Ahmad R
Aizat WM
Al-Farha AA
Al-Lawama M
Alanio A
Alaux C
Albiol J
Albrecht DR
Albuquerque LG
Alimba CG
Allardyce J
Almeida GMF
Alonso-Caneiro D
Alper OM
Amer SEDR
Amiya E
Ammerman BA
Amorim RM
An Q
Andersen SU
Aplin JD
Argyropoulos C
Armitage C
Ascher DB
Ashry M
Asmann YW
Assaeed AM
Atack JM
Atanasov AG
Atchison DA
Atkins GJ
Atlas L
Avery SV
Avillach P
Baade PD
Backman L
Badie C
Bae T
Baier D
Baker CI
Bakkach J
Baldi A
Ball E
Bannon R
Bansal A
Bardot O
Barnett AG
Barraud P
Basharat Z
Basner M
Batra J
Baumert P
Bazanova OM
Beale A
Beck CR
Becker D
Beddoe T
Bell ML
Benezeth Y
Bengtsson-Palme J
Berbesque C
Berezikov E
Bergsland N
Berners-Price S
Bernhardt P
Berrevoet F
Berry E
Berthold M
Bessa TB
Beyene TJ
Biedermann PHW
Bijleveld E
Billington C
Birch J
Bittner F
Bitzer M
Blakely RD
Blanck O
Blaskovich MAT
Bleackley M
Blombach F
Blum R
Boehme KA
Boelaert M
Bogdanos D
Bonvin AMJJ
Bosch C
Bosch O
Boudreau SA
Bourgoin T
Bourke E
Bouvard D
Boykin LM
Bradley G
Bradshaw W
Bramoweth AD
Brand T
Braubach O
Braun D
Braun RJ
Brenneisen P
Bridges KM
Brown JAL
Brown P
Browngardt C
Brownlie J
Bruhl A
Bukowy-Bieryllo Z
Bull JA
Burt A
Bush SJ
Butler LM
Byrareddy SN
Byrne HJ
Cabantous S
Cai Y
Calatayud S
Campana LG
Campbell M
Candal E
Cao Z
Cao Z
Cardoso P
Carlson K
Carter D
Cascella M
Casillas S
Castelvetro V
Caswell PT
Catry T
Cavalli G
Cernava T
Cerovsky V
Chacko G
Chagoyen M
Chakraborty S
Chan SS
Chandrasekaran AR
Chatzitheochari S
Chavez-Fumagalli MA
Chen B
Chen C-E
Chen C-S
Chen DF
Chen H
Chen H
Chen J-T
Chen X
Chen Y
Cheng C
Cheng J
Cheng S
Cheung JTK
Chinapaw M
Chinopoulos C
Cho WCS
Chong L
Chowdhury D
Chung H-J
Chwalibog A
Ciresi A
Cobine PA
Cockcroft S
Coelho LP
Colella V
Conesa A
Conway A
Cook PA
Cooper DN
Cooper J
Coqueret O
Corea EM
Cosacak MI
Costa BM
Costa E
Costa VD
Coupland C
Crawford SY
Cruz AD
Cui H
Cui Q
Cuiv PO
Culver DC
Cuypers M
Cyr N
D'Angiulli A
Dahms TES
Dai Z
Daigle F
Dalgleish R
Dalrymple BP
Danchin A
Danielsen HE
Darras S
Daulatabad SV
Davidson SM
Day DA
de Keersmaecker K
de Leeuw F-E
Dean LT
Debrabant B
Degirmenci V
del Tredici AL
Delahay RM
Demaison L
Denzel MS
Deschodt M
Devkota HP
Devriendt K
Dhariwal R
Diao J
Ding J
Dings RPM
Diouf B
Dixon R
Dlamini SV
Dogan Y
Domingues HS
Dong XC
Donner CF
Dono M
Doxey AC
Dressick W
Drevon CA
Duan H
Ducho C
Ducommun B
Dudley KJ
Dufies M
Duijf PHG
Dumaz N
Dwarakanath BS
Ebell MH
Echeverria N
Ecke T
Eckweiler D
Eerola T
Effiong A
Ehret F
Eisenhardt S
Eixarch E
El-Adawy H
El-Esawi MA
Elkum N
Emmrich JV
Engel MS
Engel N
Epp T
Erickson TB
Esfahlani SS
Eskelinen E-L
Eskew EA
Esnakul AK
Eustace AJ
Evangelou E
Fairhead M
Falk S
Fallah M
Falter-Wagner CM
Fan X
Farber DB
Faville MJ
Feghali KA
Fejzo MS
Fernandez-Triana J
Festa F
Feteira A
Feyerabend F
Fierz W
Filipp FV
Fiona .
Flegel WA
Flood-Page P
Florio T
Forano E
Forsayeth J
Fox SA
Franks SJ
Frentiu FD
Friebe M
Frilander MJ
Fu X
Fujita S
Furuta S
Fuss J
Gabrielsen M
Gajda M
Galea I
Galluzzi L
Gani F
Ganpule AP
Gao J
Garcia-Alix A
Gatchell M
Gaullier G
Gedye K
Gelfer Y
Ghelardi E
Gill MR
Gilliham M
Giordano M
Giunta C
Gladue DP
Gleeson PA
Gloyn L
Gnasso A
Goarant C
Gobet A
Goggs R
Gong H
Gonzalezlez-Prendes R
Goodin A
Goodyear CS
Gora D
Gough MJ
Govender P
Govinden U
Goyal R
Graham EB
Graham KE
Grande-Perez A
Graves PM
Greene G
Greenwald NF
Greidanus H
Greiff V
Grice D
Grimm DG
Groen EJN
Gruber J
Grunau C
Grundle DS
Gruneberg P
Grybos M
Guisado JL
Gumede N
Gumulya Y
Guo Y
Gurevich VV
Gurney-Champion OJ
Gusev O
Gutierrez-Sacristan A
Habes M
Hacker E
Hage SR
Hagen G
Hahn S
Haller DM
Hammerschmidt S
Han H
Han J
Han Q
Han R
Handfield M
Hanson J
Haore G
Hapuarachchi HC
Harder T
Hardingham JE
Harrison P
Hartmann MD
Harvey DJ
Haston S
Heck M
Heers M
Heffler E
Heinrich M
Helantera H
Herbelet S
Hew KF
Higginbottom DB
Higuchi Y
Hilton R
Hiroi N
Hobbs E
Hodzic E
Hoenner X
Hojsgaard D
Hone A
Hongoh Y
Honjo K
Horbar J
Hori H
Hu G
Hu P
Huber HP
Huber M
Hueso LE
Huirne J
Hurt L
Huttner FJ
Idborg H
Ide K
Ikeo K
Ikonomopoulou MP
Ingley E
Jakeman PM
Janga SC
Janzen T
Jayaraman J
Jeltsch A
Jensen A
Jeurissen P
Jia H
Jia H
Jia S
Jiang F
Jiang J
Jiang X
Jibb LA
Jin Y
Jo D
Johnson AM
Johnson DM
Johnston M
Jongen S
Jonscher KR
Jorens PG
Jorgensen JOL
Josse C
Joubert JW
Jung S-H
Junior AM
Jurman G
Kabra D
Kahan T
Kaiser S
Kamagata K
Kamboj SK
Kamiya H
Kane NC
Kang Y-K
Karamanos Y
Karmakar C
Karp NA
Kasian O
Kauppila JH
Kaye LK
Kelly R
Kelly S
Kenna R
Kennedy J
Kersten B
Khalaf RA
Khalid JM
Khan MM
Khatlani T
Khider T
Kijanka GS
Kim Y-M
King SRB
Kinyanjui T
Kish JK
Klempnauer K-H
Kleppe A
Klump H
Kluz T
Knox P
Kobayashi T
Kobold S
Koch K-W
Kohanbash G
Kohls G
Kohonen-Corish MRJ
Koleva-Kolarova RG
Kong X
Konkle-Parker D
Korpela KM
Kostrikis LG
Kraiczy P
Kratz H
Krause G
Krebsbach PH
Kristensen SR
Kristiansson E
Kueberuwa G
Kugler J-M
Kulkarni A
Kumar G
Kumar N
Kumar N
Kumari P
Kunimatsu A
Kurdak H
Kurgan L
Kurniawan NA
Kwon YD
Lachat C
Lacy-Colson J
Lagisz M
Lai HM
Laky B
Lalaouna D
Lammerding J
Lange M
Larrosa M
Laslett AL
Latif A
Lau CL
Lauschke VM
LeClair EE
Lee K-W
Lee M-S
Lee M-Y
Lee S
Li B
Li G
Li J
Li J
Li J
Li Z
Liang D
Liang S
Lidbury BA
Lieb K
Liehr T
Liew AWC
Lim CJ
Lim YY
Lin MZ
Lindsey ML
Line P-D
Liu D
Liu E
Liu F
Liu F
Liu H
Liu H
Liu S
Liu X
Liu Y-P
Lloyd VK
Lo T-W
Locci E
Loft ND
Loidl J
Lopez-Escamez JA
Lopez-Ruiz FJ
Lorenzen J
Lorkowski S
Lovell NH
Lu H
Lu J-J
Lu Q
Lu W
Lu Z
Luengo GS
Lund BA
Lundh L-G
Lussier AA
Luu AM
Lynch I
Lysy PA
Ma C
Ma L
Ma L
Ma L
Ma R
Ma W
Mabb A
Mack HG
Mackey DA
Mahavadi P
Mahdavi SR
Maher P
Maher T
Maibach EW
Maity SN
Malgrange B
Mamoulakis C
Mangoni AA
Manke T
Manstead ASR
Mantalaris A
Marchbank KJ
Marinello F
Marsal J
Marschalek R
Marschall H-U
Martin CS
Martin FL
Martinez-Raga J
Martinez-Salas E
Martis E
Marzocchi U
Mather DE
Mathieu D
Matsui Y
Maza E
McCrum C
McCutcheon JE
McGarrigle CA
Mckay GJ
McMillan B
McMillan N
Meads C
Medina L
Merrick BA
Meseko C
Metzger DW
Meule A
Meunier FA
Michaelis M
Micheau O
Miele AE
Mier P
Mihara H
Min R
Mintz EM
Miotla P
Mitchell KM
Mizukami T
Moal I
Moalic Y
Mohapatra DP
Molari M
Molleman L
Mondal SR
Montagutelli X
Monteiro A
Montes M
Moore MD
Moran JV
Morcillo E
Morozov SY
Mort M
Moss WN
Moultos OA
Moyer R
Mukherjee M
Murai N
Murphy DJ
Murphy SK
Murray SA
Muth T
Naganawa S
Nagler K
Nakayama K
Nammi S
Nandakumar KS
Narayan E
Nasios G
Natoli RM
Navaratnarajah .
Neumann P-A
Ng G
Nguyen F
Nicol C
Nicoletti R
Nie J
Nie Y
Niehof M
Niemeyer F
Nilsen EB
Nilsson H
Nixon B
Nobile CJ
Norris AD
Nwaiwu O
O'Mahony M
O'Toole R
Ogami K
Ohgami RS
Ohlsson S
Ohtomo T
Olatunbosun O
Oldenmenger WH
Olofsson P
Olumayede E
Orme MW
Ortiz A
Oster H
Ostrikov K
Otto S
Ou J
Outeiro TF
Ouyang S
Paganoni S
Page A
Pallebage-Gamarallage M
Palm C
Palma J-A
Pan Z
Panthee S
Paradies Y
Parchi P
Parsons JR
Parsons MH
Parsons N
Pascal P
Paterson R
Paul E
Pearce SP
Pearson JA
Peckham M
Pedemonte N
Peifer M
Pelkonen T
Pelleri MC
Pellizzon MA
Peng Y
Perco P
Pereira JL
Peres MA
Petrelli M
Pheko M
Pichugin A
Pinto CJC
Pinto IM
Pinto KA
Piotrowski M
Piovesan A
Plevris JN
Pluess M
Podolsky IM
Pollesello P
Polz M
Ponti G
Popoola SI
Porcelli P
Portilla M
Portillo MC
Pourret O
Prajapati AS
Pranata R
Prescott J
Prieto D
Prince M
Pritchard AL
Pusch S
Qi D
Qi X
Quinn GP
Quinn TJ
Raghava GPS
Rahimi F
Rahman MS
Raikou VD
Ramula S
Ranft A
Rappsilber J
Reddan T
Rehfeldt F
Reiling JH
Remacle C
Reschke CR
Rezaei M
Rhodes J
Riddick EW
Ritter U
Riva G
Roach NW
Roberts DD
Roberts NJ
Robles G
Rodrigues T
Rodriguez C
Roislien J
Roobol MJ
Ross K
Ross SA
Rotge J-Y
Rowe AD
Rowe JA
Ruepp A
Rust P
Saad S
Sabnis SC
Sack GH
Saggar M
Saito Y
Salama MF
Sallmon H
Santos M
Saudemont A
Sava G
Schrading S
Schramm A
Schreiber M
Schuele B
Schuler S
Schulte LN
Schuon RA
Schymkowitz J
Sczyrba A
Seib KL
Senghore T
Seow E
Sergeant K
Shabalin IG
Shahid S
Shalchyan V
Shen J
Shi H-P
Shimada T
Shin J-S
Shortt C
Siebers R
Sillanpaa E
Silveyra P
Skinner D
Small I
Smeets PAM
Smith SS
So P-W
Solano F
Sonenshine DE
Song H
Song J
Sorzano CO
Southall T
Speakman JR
Srinivasan MV
St Hilaire C
Stabile LP
Staege MS
Stasiak A
Steadman KJ
Stein N
Stella A
Stephens AW
Stevanovic D
Stewart CJ
Stewart DI
Stine K
Storlazzi C
Stoynova NV
Strzalka W
Suarez OM
Subhash S
Sukocheva O
Sultana T
Sumant AV
Summers MJ
Sun G
Sydes M
Tacon P
Tamaian R
Tan A-C
Tan E-C
Tan K-H
Tanaka K
Tang H
Tanino Y
Targett-Adams P
Tayebi M
Tayyem R
Tebbe CC
Telfer EE
Tempel W
Teodorczyk-Injeyan JA
Terrier O
Testoni I
Thijs G
Thorne S
Thrift AG
Tiffon C
Tinnefeld P
Tjahjono DH
Tofani M
Tolle F
Torga G
Toth E
Tressoldi P
Troder SE
Tsapas A
Tsirigotis K
Turak A
Tuttle N
Tzotzos G
Uchendu F
Udo EE
Uhle F
Utsumi T
Uversky VN
Vaidyanathan S
Vaillant M
Valsesia A
Van de Mortel T
Van den Bos W
van Meerten T
van Nieuwerburgh F
van Raaij MJ
van Ruitenbeek J
Vandenbroucke RE
Vanneste S
Veiga FH
Vendrell M
Verloh N
Vesk PA
Vickers P
Victor VM
Villemur R
Villet MH
Vindin H
Viveiros M
Vohl M-C
Voolstra CR
Vorholt JA
Voskarides K
Voutchkova DD
Vuillemin A
Wakelin S
Waldron L
Walsh LJ
Wang AY
Wang F
Wang Y
Watanabe Y
Weigert A
Weinstock C
Wen J-C
Werner GDA
Werten S
Westermair AL
Wham C
White EP
Widera D
Wiener J
Wilharm G
Wilkinson S
Williams R
Willmann R
Wilson C
Wirth B
Wojan TR
Woldesemayat AA
Wolff M
Wong A
Wong BM
Wu T-W
Wuerbel H
Xia W
Xiao X
Xu D
Xu H
Xu J
Xu J
Xu JW
Xue B
Xue Y
Yadollahpour A
Yalcin S
Yamato M
Yan H
Yang E-C
Yang H
Yang L
Yang S
Yang SY
Yang W
Yang Y
Ye Y
Ye Z-Q
Yeung AWK
Yin C-C
Yli-Kauhaluoma J
Yoneyama H
Yu Y
Yuan G-C
Yuh C-H
Zabetakis I
Zaccolo M
Zaucha J
Zeng C
Zeng E
Zevnik B
Zhang C
Zhang C
Zhang J
Zhang L
Zhang L
Zhang X
Zhang Y
Zhang Y
Zhang Z
Zhang Z
Zhang Z-Y
Zhao X
Zhao Y
Zhou K
Zhou M
Zhu S
Ziegler A
Zinke K
Zuberbier T
Publication venue: OXFORD UNIV PRESS
Publication date: 29/10/2019
Field of study

Document recommendation systems for locating relevant literature have mostly relied on methods developed a decade ago. This is largely due to the lack of a large offline gold-standard benchmark of relevant documents that cover a variety of research fields such that newly developed literature search techniques can be compared, improved and translated into practice. To overcome this bottleneck, we have established the RElevant LIterature SearcH consortium consisting of more than 1500 scientists from 84 countries, who have collectively annotated the relevance of over 180 000 PubMed-listed articles with regard to their respective seed (input) article/s. The majority of annotations were contributed by highly experienced, original authors of the seed articles. The collected data cover 76% of all unique PubMed Medical Subject Headings descriptors. No systematic biases were observed across different experience levels, research fields or time spent on annotations. More importantly, annotations of the same document pairs contributed by different scientists were highly concordant. We further show that the three representative baseline methods used to generate recommended articles for evaluation (Okapi Best Matching 25, Term Frequency–Inverse Document Frequency and PubMed Related Articles) had similar overall performances. Additionally, we found that these methods each tend to produce distinct collections of recommended articles, suggesting that a hybrid method may be required to completely capture all relevant articles. The established database server located at https://relishdb.ict.griffith.edu.au is freely available for the downloading of annotation data and the blind testing of new methods. We expect that this benchmark will be useful for stimulating the development of new powerful techniques for title and title/abstract-based search engines for relevant articles in biomedical research

UCL Discovery