Search CORE

67 research outputs found

A new pooling strategy for high-throughput screening: the Shifted Transversal Design

Author: Thierry-Mieg Nicolas
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: In binary high-throughput screening projects where the goal is the identification of low-frequency events, beyond the obvious issue of efficiency, false positives and false negatives are a major concern. Pooling constitutes a natural solution: it reduces the number of tests, while providing critical duplication of the individual experiments, thereby correcting for experimental noise. The main difficulty consists in designing the pools in a manner that is both efficient and robust: few pools should be necessary to correct the errors and identify the positives, yet the experiment should not be too vulnerable to biological shakiness. For example, some information should still be obtained even if there are slightly more positives or errors than expected. This is known as the group testing problem, or pooling problem. RESULTS: In this paper, we present a new non-adaptive combinatorial pooling design: the "shifted transversal design" (STD). It relies on arithmetics, and rests on two intuitive ideas: minimizing the co-occurrence of objects, and constructing pools of constant-sized intersections. We prove that it allows unambiguous decoding of noisy experimental observations. This design is highly flexible, and can be tailored to function robustly in a wide range of experimental settings (i.e., numbers of objects, fractions of positives, and expected error-rates). Furthermore, we show that our design compares favorably, in terms of efficiency, to the previously described non-adaptive combinatorial pooling designs. CONCLUSION: This method is currently being validated by field-testing in the context of yeast-two-hybrid interactome mapping, in collaboration with Marc Vidal's lab at the Dana Farber Cancer Institute. Many similar projects could benefit from using the Shifted Transversal Design

Hal - Université Grenoble Alpes

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

MatrixDB, a database focused on extracellular protein–protein and protein–carbohydrate interactions

Author: Bairoch
Breitkreutz
Chatr-Aryamontri
Davis
Degtyarenko
Dixelius
Durbin
Emilie Chautard
Hashimoto
Hermjakob
Hooper
Jokinen
Kerrien
Kerrien
Kielty
Lionel Ballut
Nicolas Thierry-Mieg
Orchard
Prasad
Ricard-Blum
Rodgers
Salwinski
Shannon
Stein
Sylvie Ricard-Blum
Publication venue: Oxford University Press
Publication date
Field of study

Summary: MatrixDB (http://matrixdb.ibcp.fr) is a database reporting mammalian protein–protein and protein–carbohydrate interactions involving extracellular molecules. It takes into account the full interaction repertoire of the extracellular matrix involving full-length molecules, fragments and multimers. The current version of MatrixDB contains 1972 interactions corresponding to 4412 experiments and involving 259 extracellular biomolecules

Crossref

PubMed Central

Mutations in DNAH1, which encodes an inner arm heavy chain dynein, lead to male infertility from multiple morphological abnormalities of the sperm flagella.

Author: Arnoult Christophe
Ben Khelifa Mariem
Bidart Marie
Coutton Charles
Delaroche Julie
Escalier Denise
Grunwald Didier
Hennebicq Sylviane
Jouk Pierre-Simon
Karaouzène Thomas
Pernet-Gallay Karine
Pierre Virginie
Ray Pierre F
Rendu John
Thierry-Mieg Nicolas
Touré Aminata
Yassine Sandra
Zouari Raoudha
Publication venue: 'Elsevier BV'
Publication date: 01/01/2014
Field of study

International audienceTen to fifteen percent of couples are confronted with infertility and a male factor is involved in approximately half the cases. A genetic etiology is likely in most cases yet only few genes have been formally correlated with male infertility. Homozygosity mapping was carried out on a cohort of 20 North African individuals, including 18 index cases, presenting with primary infertility resulting from impaired sperm motility caused by a mosaic of multiple morphological abnormalities of the flagella (MMAF) including absent, short, coiled, bent, and irregular flagella. Five unrelated subjects out of 18 (28%) carried a homozygous variant in DNAH1, which encodes an inner dynein heavy chain and is expressed in testis. RT-PCR, immunostaining, and electronic microscopy were carried out on samples from one of the subjects with a mutation located on a donor splice site. Neither the transcript nor the protein was observed in this individual, confirming the pathogenicity of this variant. A general axonemal disorganization including mislocalization of the microtubule doublets and loss of the inner dynein arms was observed. Although DNAH1 is also expressed in other ciliated cells, infertility was the only symptom of primary ciliary dyskinesia observed in affected subjects, suggesting that DNAH1 function in cilium is not as critical as in sperm flagellum

Elsevier - Publisher Connector

Crossref

Hal - Université Grenoble Alpes

HAL Descartes

PubMed Central

New insights into protein-protein interaction data lead to increased estimates of the S. cerevisiae interactome size

Author: A Grigoriev
A Vinayagam
AC Gavin
AS Schwartz
C Stark
C von Mering
E Sprinzak
GT Hart
H Huang
H Huang
H Jeong
H Sahai
H Yu
I Lee
I Xenarios
JDJ Han
JM Cherry
K Tarassov
K Venkatesan
L Salwinski
Laure Sambourg
M Costanzo
ME Cusick
ME Cusick
MJA Aryee
MPH Stumpf
Nicolas Thierry-Mieg
NJ Krogan
P Braun
P D'haeseleer
P Uetz
S Fields
SW Michnick
T Ito
T Reguly
X Xin
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background As protein interactions mediate most cellular mechanisms, protein-protein interaction networks are essential in the study of cellular processes. Consequently, several large-scale interactome mapping projects have been undertaken, and protein-protein interactions are being distilled into databases through literature curation; yet protein-protein interaction data are still far from comprehensive, even in the model organism <it>Saccharomyces cerevisiae</it>. Estimating the interactome size is important for evaluating the completeness of current datasets, in order to measure the remaining efforts that are required. Results We examined the yeast interactome from a new perspective, by taking into account how thoroughly proteins have been studied. We discovered that the set of literature-curated protein-protein interactions is qualitatively different when restricted to proteins that have received extensive attention from the scientific community. In particular, these interactions are less often supported by yeast two-hybrid, and more often by more complex experiments such as biochemical activity assays. Our analysis showed that high-throughput and literature-curated interactome datasets are more correlated than commonly assumed, but that this bias can be corrected for by focusing on well-studied proteins. We thus propose a simple and reliable method to estimate the size of an interactome, combining literature-curated data involving well-studied proteins with high-throughput data. It yields an estimate of at least 37, 600 direct physical protein-protein interactions in <it>S. cerevisiae</it>. Conclusions Our method leads to higher and more accurate estimates of the interactome size, as it accounts for interactions that are genuine yet difficult to detect with commonly-used experimental assays. This shows that we are even further from completing the yeast interactome map than previously expected.</p

Crossref

Hal - Université Grenoble Alpes

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

HAL Descartes

A new pooling strategy for high-throughput screening: the Shifted Transversal Design-0

Author: Nicolas Thierry-Mieg (72993)
Publication venue
Publication date
Field of study

Copyright information:Taken from "A new pooling strategy for high-throughput screening: the Shifted Transversal Design"BMC Bioinformatics 2006;7():28-28.Published online 19 Jan 2006PMCID:PMC1409803.Copyright © 2006 Thierry-Mieg; licensee BioMed Central Ltd.me number q and builds the set of pools STD(n; q; t·Γ+2·E+1), as specified in corollary 2. Recall that n is the total number of variables and Γ is the compression power, i.e. the smallest such that q≥ n. This figure summarizes the behavior of these pools when the actual number of errors exceeds E, and distinguishes between the two types of errors: false positives and false negatives. In the dark blue region, all errors are detected and corrected. In the intermediate blue rectangles, correction is not guaranteed but detection is: in an unfavorable conformation of positives and errors, correction of all errors may fail, but this failure cannot go unnoticed, and the user can therefore plan additional experiments. In the cyan square, detection is usually also guaranteed, except if E is very small (E < 2·Γ-1): in this case, the line y = 3·E+1-x splits the square in two, and detection is only guaranteed in the bottom left portion, where the total number of errors is at most 3·E+1. Finally, in the outer pale cyan zone, no guarantee is provided

FigShare

Interpool: interpreting smart-pooling results

Author: Barillot
Bruno
Gilles Bailly
Jin
Jin
Knill
Nicolas Thierry-Mieg
Thierry-Mieg
Thierry-Mieg
Vermeirssen
Publication venue: 'Oxford University Press (OUP)'
Publication date
Field of study

Crossref

Interpool: interpreting smart-pooling results

Author: Bailly Gilles
Thierry-Mieg Nicolas
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/03/2008
Field of study

International audienc

Hal - Université Grenoble Alpes

Modélisation informatique et analyse prédictive des interactions protéine-protéine chez Caenorhabditis elegans

Author: THIERRY-MIEG Nicolas
TRILLING Laurent
Publication venue
Publication date: 01/01/2001
Field of study

L'objectif de cette thèse consiste en la modélisation informatiqueet l'analyse prédictive d'interactions protéine-protéine chez Caenorhabditis elegans. L'approche adoptée est la suivante. Dans un premier temps, nous avons participé à la production de données d'interaction dans le cadre du projet de cartographie systématique par double hybride des interactions protéine-protéine chez C. elegans, dirigé par le professeur Marc Vidal au Dana Farber Cancer Institute, Boston. Nous avons été responsable de tous les aspects bioinformatiques : entre autres, conception d'amorces de PCR pour le clonage, développement d'algorithmes pour la production des données, conception d'une base de données pour le stockage, mise en place d'une interface web pour la publication des résultats. L'étape suivante a été consacrée à la construction d'une base de données d'interactions protéine-protéine multi-organismes, fédérant les données d'interaction du laboratoire de Marc Vidal avec d'autres données disponibles dans des bases spécialisées. Une attention particulière a été prêtée au choix des descripteurs retenus pour la caractérisation des protéines. Les descripteurs retenus sont éventuellement la localisation cellulaire et les mots-clés issus de SwissProt, ainsi que les domaines de Pfam, Prostite, et plus récemment InterPro. Enfin, une troisième partie a concerné la conception et la mise en oeuvre d'un système prédictif d'interactions protéine-protéine. Notre objectif est d'orienter les recherches menées dans le laboratoire du professeur Vidal, en proposant des paires de protéines susceptibles d'interagir. La méthode développée relève du domaine de l'Extraction de connaissances à partir de Données (KDD). La majeure partie du travail porte sur la conception de procédures originales de pré-traitement et de post-traitement pertinentes. Les résultats sont encourageants, et permettent d'envisager rapidement une validation biologique en collaboration avec le laboratoire du professeur Vidal.GRENOBLE1-BU Sciences (384212103) / SudocSudocFranceF

OpenGrey Repository