Search CORE

Oxford University Research Archive

Apollo (Cambridge)

Ten Simple Rules for Getting Help from Online Scientific Communities

Author: Brandon M. Invergo
Colin S. Gillespie
ES Raymond
Giovanni M. Dall'Olio
Hafid Laayouni
Jacopo Marino
Jaume Bertranpetit
JE Stajich
Kevin L. Keys
Khader Shameer
Lars J. Jensen
M Ash
Melanie I. Stefan
Michael Schubert
PE Bourne
Philip E. Bourne
Pierre Poulain
Robert Sugar
W Miller
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/09/2011
Field of study

The increasing complexity of research requires scientists to work at the intersection of multiple fields and to face problems for which their formal education has not prepared them. For example, biologists with no or little background in programming are now often using complex scripts to handle the results from their experiments; vice versa, programmers wishing to enter the world of bioinformatics must know about biochemistry, genetics, and other fields. In this context, communication tools such as mailing lists, web forums, and online communities acquire increasing importance. These tools permit scientists to quickly contact people skilled in a specialized field. A question posed properly to the right online scientific community can help in solving difficult problems, often faster than screening literature or writing to publication authors. The growth of active online scientific communities, such as those listed in Table S1, demonstrates how these tools are becoming an important source of support for an increasing number of researchers. Nevertheless, making proper use of these resources is not easy. Adhering to the social norms of World Wide Web communication—loosely termed “netiquette”—is both important and non-trivial. In this article, we take inspiration from our experience on Internet-shared scientific knowledge, and from similar documents such as “Asking the Questions the Smart Way” and “Getting Answers”, to provide guidelines and suggestions on how to use online communities to solve scientific problems

HAL-Inserm

Copenhagen University Research Information System

Caltech Authors

Digital.CSIC

Hal-Diderot

Genetic variation in prehistoric Sardinia

Author: Barbujani G.
Bertorelle G
Bertranpetit J
Caramelli D
Casoli A
Castri L
Floris R
FRANCALACCI Paolo
Lalueza Fox C
Lari M
Sampietro L
Sanna S
Tykot R
Vernesi C
Vona G
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007
Field of study

We sampled teeth from 53 ancient Sardinian (Nuragic) individuals who lived in the Late Bronze Age and Iron Age, between 3,430 and 2,700 years ago. After eliminating the samples that, in preliminary biochemical tests, did not show a high probability to yield reproducible results, we obtained 23 sequences of the mitochondrial DNA control region, which were associated to haplogroups by comparison with a dataset of modern sequences. The Nuragic samples show a remarkably low genetic diversity, comparable to that observed in ancient Iberians, but much lower than among the Etruscans. Most of these sequences have exact matches in two modern Sardinian populations, supporting a clear genealogical continuity from the Late Bronze Age up to current times. The Nuragic populations appear to be part of a large and geographically unstructured cluster of modern European populations, thus making it difficult to infer their evolutionary relationships. However, the low levels of genetic diversity, both within and among ancient samples, as opposed to the sharp differences among modern Sardinian samples, support the hypothesis of the expansion of a small group of maternally related individuals, and of comparatively recent differentiation of the Sardinian gene pools. © Springer-Verlag 2007

Archivio istituzionale della ricerca - Università di Cagliari

Minimizing recombinations in consensus networks for phylogeographic studies

Author: Asif Javed
BME Moret
C Semple
D Gusfield
DH Huson
EO Wilson
Francesc Calafell
J Hein
Jaume Bertranpetit
L Parida
Laxmi Parida
MA Jobling
Marta Melé
S Arora
TH Cormen
V Vazirani
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background We address the problem of studying recombinational variations in (human) populations. In this paper, our focus is on one computational aspect of the general task: Given two networks <it>G</it>1 and <it>G</it>2, with both mutation and recombination events, defined on overlapping sets of extant units the objective is to compute a consensus network <it>G</it>3 with minimum number of additional recombinations. We describe a polynomial time algorithm with a guarantee that the number of computed new recombination events is within <it>ϵ </it>= <it>sz</it>(<it>G</it>1, <it>G</it>2) (function <it>sz </it>is a well-behaved function of the sizes and topologies of <it>G</it>1 and <it>G</it>2) of the optimal <it>number </it>of recombinations. To date, this is the best known result for a network consensus problem. Results Although the network consensus problem can be applied to a variety of domains, here we focus on structure of human populations. With our preliminary analysis on a segment of the human Chromosome X data we are able to infer ancient recombinations, population-specific recombinations and more, which also support the widely accepted 'Out of Africa' model. These results have been verified independently using traditional manual procedures. To the best of our knowledge, this is the first recombinations-based characterization of human populations. Conclusion We show that our mathematical model identifies recombination spots in the individual haplotypes; the aggregate of these spots over a set of haplotypes defines a recombinational landscape that has enough signal to detect continental as well as population divide based on a short segment of Chromosome X. In particular, we are able to infer ancient recombinations, population-specific recombinations and more, which also support the widely accepted 'Out of Africa' model. The agreement with mutation-based analysis can be viewed as an indirect validation of our results and the model. Since the model in principle gives us more information embedded in the networks, in our future work, we plan to investigate more non-traditional questions via these structures computed by our methodology.</p

Springer - Publisher Connector

Public Library of Science (PLOS)

ScholarlyCommons@Penn

The Genographic Project Public Participation Mitochondrial DNA Database

Author: Balanovska E.
Balanovsky O.
Behar D.
Bertranpetit J.
Blue-Smith J.
Comas D.
Cooper A.
Jin L.
Mitchell R.
Pitchappan R.
Quintana-Murci L.
Rosset S.
Royyuru A.
Santos F.
Schurr T.
Soodyall H.
Tyler-Smith C.
Tzur S.
Wells R.
Publication venue: Public Library of Science
Publication date: 01/01/2007
Field of study

The Genographic Project is studying the genetic signatures of ancient human migrations and creating an open-source research database. It allows members of the public to participate in a real-time anthropological genetics study by submitting personal samples for analysis and donating the genetic results to the database. We report our experience from the first 18 months of public participation in the Genographic Project, during which we have created the largest standardized human mitochondrial DNA (mtDNA) database ever collected, comprising 78,590 genotypes. Here, we detail our genotyping and quality assurance protocols including direct sequencing of the mtDNA HVS-I, genotyping of 22 coding-region SNPs, and a series of computational quality checks based on phylogenetic principles. This database is very informative with respect to mtDNA phylogeny and mutational dynamics, and its size allows us to develop a nearest neighbor–based methodology for mtDNA haplogroup prediction based on HVS-I motifs that is superior to classic rule-based approaches. We make available to the scientific community and general public two new resources: a periodically updated database comprising all data donated by participants, and the nearest neighbor haplogroup prediction tool

CiteSeerX

Adelaide Research & Scholarship

Elsevier - Publisher Connector

ScholarlyCommons@Penn

Y-Chromosomal Diversity in Lebanon Is Structured by Recent Historical Events

Author: Balanovska E.
Balanovsky O.
Behar D.
Bertranpetit J.
Blue-Smith J.
Comas D.
Debiane L.
et al.
Hernanz D.
Herrera R.
Khalife J.
Makhoul N.
Platt D.
Quintana-Murci L.
Royyuru A.
Santos F.
Schurr T.
Spencer Wells R.
Tyler-Smith C.
Xue Y.
Zalloua P.
Publication venue: American Society of Human Genetics
Publication date: 01/01/2008
Field of study

Lebanon is an eastern Mediterranean country inhabited by approximately four million people with a wide variety of ethnicities and religions, including Muslim, Christian, and Druze. In the present study, 926 Lebanese men were typed with Y-chromosomal SNP and STR markers, and unusually, male genetic variation within Lebanon was found to be more strongly structured by religious affiliation than by geography. We therefore tested the hypothesis that migrations within historical times could have contributed to this situation. Y-haplogroup J∗(xJ2) was more frequent in the putative Muslim source region (the Arabian Peninsula) than in Lebanon, and it was also more frequent in Lebanese Muslims than in Lebanese non-Muslims. Conversely, haplogroup R1b was more frequent in the putative Christian source region (western Europe) than in Lebanon and was also more frequent in Lebanese Christians than in Lebanese non-Christians. The most common R1b STR-haplotype in Lebanese Christians was otherwise highly specific for western Europe and was unlikely to have reached its current frequency in Lebanese Christians without admixture. We therefore suggest that the Islamic expansion from the Arabian Peninsula beginning in the seventh century CE introduced lineages typical of this area into those who subsequently became Lebanese Muslims, whereas the Crusader activity in the 11th–13th centuries CE introduced western European lineages into Lebanese Christians

Adelaide Research & Scholarship

From cheek swabs to consensus sequences : an A to Z protocol for high-throughput DNA sequencing of complete human mitochondrial genomes

Author: Adhikarla Syama
Adler Christina J.
Balanovska Elena
Balanovsky Oleg
Bertranpetit Jaume
Clarke Andrew C.
Comas David
Cooper Alan
Der Sarkissian Clio S.I.
Dulik Matthew C.
Gaieski Jill B.
GaneshPrasad Arun Kumar
Haak Wolfgang
Haber Marc
Hernanz Soria
Jin Li
Kaplan Matthew E.
Lacerda Daniela R.
Li Shilin
Martínez-Cruz Begoña
Matisoo-Smith Elizabeth A.
Merchant Nirav C.
Mitchell R. John
Owings Amanda C.
Parida Laxmi
Pitchappan Ramasamy
Platt Daniel E.
Prost Stefan
Quintana-Murci Lluis
Renfrew Colin
Royyuru Ajay K.
Santhakumari Arun Varatharajan
Santos Fabrício R.
Schurr Theodore G.
Soodyall Himla
Stanton Jo Ann L.
Swamikrishnan Pandikumar
Tyler-Smith Chris
Vieira Pedro Paulo
Vilar Miguel G.
Wells R. Spencer
White W. Timothy J.
Zalloua Pierre A.
Ziegle Janet S.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Background: Next-generation DNA sequencing (NGS) technologies have made huge impacts in many fields of biological research, but especially in evolutionary biology. One area where NGS has shown potential is for high-throughput sequencing of complete mtDNA genomes (of humans and other animals). Despite the increasing use of NGS technologies and a better appreciation of their importance in answering biological questions, there remain significant obstacles to the successful implementation of NGS-based projects, especially for new users. Results: Here we present an ‘A to Z’ protocol for obtaining complete human mitochondrial (mtDNA) genomes – from DNA extraction to consensus sequence. Although designed for use on humans, this protocol could also be used to sequence small, organellar genomes from other species, and also nuclear loci. This protocol includes DNA extraction, PCR amplification, fragmentation of PCR products, barcoding of fragments, sequencing using the 454 GS FLX platform, and a complete bioinformatics pipeline (primer removal, reference-based mapping, output of coverage plots and SNP calling). Conclusions: All steps in this protocol are designed to be straightforward to implement, especially for researchers who are undertaking next-generation sequencing for the first time. The molecular steps are scalable to large numbers (hundreds) of individuals and all steps post-DNA extraction can be carried out in 96-well plate format. Also, the protocol has been assembled so that individual ‘modules’ can be swapped out to suit available resources

Adelaide Research & Scholarship

Repository@Nottingham

Springer - Publisher Connector

University of Birmingham Research Portal

The University of Arizona

Warwick Research Archives Portal Repository

ScholarlyCommons@Penn

The genome sequencing of an albino Western lowland gorilla reveals inbreeding in the wild

Author: Abello T.
Alkan C.
Baeza-Delgado C.
Bertranpetit J.
Caceres M.
Casillas S.
Dabad M.
de la Calle-Mustienes E.
Eichler E.
Engelken J.
Estellé J.
Fernandez-Callejo M.
Gomez-Skarmeta J.
Gut I.
Gut M.
Hernando-Herraez I.
Hormozdiari F.
Lalueza-Fox C.
Lorente-Galdos B.
Marques-Bonet T.
Melé M.
Mingarro I.
Morcillo-Suarez C.
Navarro A.
Prado-Martinez J.
Raineri E.
Ramirez O.
Ritscher L.
Rubio-Acero R.
Schöneberg T.
Valles M.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Background The only known albino gorilla, named Snowflake, was a male wild born individual from Equatorial Guinea who lived at the Barcelona Zoo for almost 40 years. He was diagnosed with non-syndromic oculocutaneous albinism, i.e. white hair, light eyes, pink skin, photophobia and reduced visual acuity. Despite previous efforts to explain the genetic cause, this is still unknown. Here, we study the genetic cause of his albinism and making use of whole genome sequencing data we find a higher inbreeding coefficient compared to other gorillas. Results We successfully identified the causal genetic variant for Snowflake¿s albinism, a non-synonymous single nucleotide variant located in a transmembrane region of SLC45A2. This transporter is known to be involved in oculocutaneous albinism type 4 (OCA4) in humans. We provide experimental evidence that shows that this amino acid replacement alters the membrane spanning capability of this transmembrane region. Finally, we provide a comprehensive study of genome-wide patterns of autozygogosity revealing that Snowflake¿s parents were related, being this the first report of inbreeding in a wild born Western lowland gorilla. Conclusions In this study we demonstrate how the use of whole genome sequencing can be extended to link genotype and phenotype in non-model organisms and it can be a powerful tool in conservation genetics (e.g., inbreeding and genetic diversity) with the expected decrease in sequencing cost. Keywords: Gorilla; Albinism; Inbreeding; Genome; Conservatio

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Bilkent University Institutional Repository

Repositori d'Objectes Digitals per a l'Ensenyament la Recerca i la Cultura

Digital.CSIC

Diposit Digital de Documents de la UAB

MPG.PuRe

Similarity in Recombination Rate Estimates Highly Correlates with Genetic Differentiation in Humans

Author: A Auton
A Kong
A Kong
A Kong
AJ Jeffreys
Arcadi Navarro
Belén Lorente-Galdos
BS Weir
Carles Lalueza-Fox
D Serre
David Comas
DC Crawford
DF Conrad
DM Altshuler
DM Evans
Elena Bosch
F Baudat
Ferran Casals
Francesc Calafell
G Coop
G Coop
GA McVean
Giovanni Marco Dall'Olio
Hafid Laayouni
HM Cann
J Bertranpetit
J Graffelman
Jan Graffelman
Jaume Bertranpetit
JC Barrett
JE Stajich
JZ Li
KA Frazer
Kate M. McGee
Ludovica Montanucci
M Gardner
M Jakobsson
Marta Melé
Martin Sikora
MP Stumpf
NA Rosenberg
NG Smith
P Fearnhead
Philip Awadalla
PP Khil
RR Sokal
S Myers
S Myers
SE Ptak
SF Schaffner
SR Grossman
W Winckler
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Recombination varies greatly among species, as illustrated by the poor conservation of the recombination landscape between humans and chimpanzees. Thus, shorter evolutionary time frames are needed to understand the evolution of recombination. Here, we analyze its recent evolution in humans. We calculated the recombination rates between adjacent pairs of 636,933 common single-nucleotide polymorphism loci in 28 worldwide human populations and analyzed them in relation to genetic distances between populations. We found a strong and highly significant correlation between similarity in the recombination rates corrected for effective population size and genetic differentiation between populations. This correlation is observed at the genome-wide level, but also for each chromosome and when genetic distances and recombination similarities are calculated independently from different parts of the genome. Moreover, and more relevant, this relationship is robustly maintained when considering presence/absence of recombination hotspots. Simulations show that this correlation cannot be explained by biases in the inference of recombination rates caused by haplotype sharing among similar populations. This result indicates a rapid pace of evolution of recombination, within the time span of differentiation of modern humans

Public Library of Science (PLOS)

Digital.CSIC