Search CORE

20 research outputs found

Scaling success: Linking public breeding with private enterprise

Author: Hirokazu Chiba (362223)
Hiroto Hyakkoku (362224)
Jean-François Pessiot (362222)
Takeaki Taniguchi (362225)
Wataru Fujibuchi (57484)
Publication date: 08/10/2015

The known Downstream Promoter Element and Initiator site motifs are shown in boldface.

CGSpace (CGIAR)

The Francis Crick Institute

PeakRegressor Identifies Composite Sequence Motifs Responsible for STAT1 Binding Sites and Their Potential rSNPs

Author: A Ameur
B Efron
BC Foat
CM Bishop
D Das
EM Conlon
F Gao
G Robertson
Hirokazu Chiba
Hiroto Hyakkoku
HJ Bussemaker
IE Frank
J Rozowsky
JE Butler
Jean-François Pessiot
R Tibshirani
Takeaki Taniguchi
Wataru Fujibuchi
Xiaolin Wu
Publication venue: Public Library of Science
Publication date: 01/01/2010

Identifying true transcription factor binding sites from sequence motif information (e.g., motif pattern, location, and combination) is an important question in bioinformatics. We present “PeakRegressor,” a system that identifies binding motifs by combining DNA-sequence data and ChIP-Seq data. PeakRegressor uses L1-norm log-linear regression to predict peak values from binding motif candidates. Our approach successfully predicts the peak values of STAT1 and RNA Polymerase II with correlation coefficients as high as 0.65 and 0.66, respectively. Using PeakRegressor, we identified composite motifs for STAT1, as well as potential regulatory SNPs (rSNPs) involved in regulating the transcription levels of neighboring genes. In addition, we show that among five regression methods, L1-norm log-linear regression achieves the best performance with respect to binding motif identification, biological interpretability, and computational efficiency.
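The regression idea in this abstract can be illustrated with a minimal sketch. It assumes that "L1-norm log-linear regression" means fitting the logarithm of each peak value as a linear function of motif features under an L1 penalty; the abstract does not spell out the exact formulation, so the function names, the coordinate-descent solver, and the synthetic data below are all hypothetical and not taken from PeakRegressor itself.

```python
import numpy as np

def soft_threshold(z, t):
    """Soft-thresholding operator used by L1-penalized coordinate descent."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def lasso_log_linear(X, peaks, lam=0.05, n_iter=200):
    """Fit log(peaks) ~ X @ w + b, minimizing
    (1 / 2n) * ||log(peaks) - X @ w - b||^2 + lam * ||w||_1
    by cyclic coordinate descent. X holds hypothetical motif features."""
    y = np.log(peaks)
    n, d = X.shape
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc, yc = X - x_mean, y - y_mean          # centering removes the intercept
    col_sq = (Xc ** 2).sum(axis=0)
    w = np.zeros(d)
    residual = yc.copy()                      # residual = yc - Xc @ w
    for _ in range(n_iter):
        for j in range(d):
            if col_sq[j] == 0.0:
                continue                      # constant feature, keep weight at 0
            residual += Xc[:, j] * w[j]       # remove feature j's contribution
            rho = Xc[:, j] @ residual
            w[j] = soft_threshold(rho, lam * n) / col_sq[j]
            residual -= Xc[:, j] * w[j]       # add the updated contribution back
    b = y_mean - x_mean @ w
    return w, b

# Synthetic demo: 10 candidate motifs, only motifs 0 and 3 affect the peak value.
rng = np.random.default_rng(0)
X = rng.poisson(1.0, size=(200, 10)).astype(float)   # motif counts per region
true_w = np.zeros(10)
true_w[0], true_w[3] = 0.8, -0.5
peaks = np.exp(X @ true_w + 1.0 + 0.1 * rng.normal(size=200))

w, b = lasso_log_linear(X, peaks)
predicted = np.exp(X @ w + b)
print("selected motifs:", np.flatnonzero(w))
print("correlation:", np.corrcoef(predicted, peaks)[0, 1])
```

The L1 penalty drives the weights of uninformative motif candidates to exactly zero, which is one plausible reading of why the abstract credits the method with good biological interpretability.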

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Apprentissage automatique pour l'extraction de caractéristiques (application au partitionnement de documents, au résumé automatique et au filtrage collaboratif)

Author: AMINI Massih-Reza
GALLINARI Patrick
PESSIOT Jean-François
Publication date: 01/01/2008

PARIS-BIUSJ-Thèses (751052125) / Sudoc; AVIGNON-Bibl. IUP-IUT (840072201) / Sudoc; PARIS-BIUSJ-Mathématiques rech (751052111) / Sudoc; France

OpenGrey Repository

Apprentissage non-supervisé pour la segmentation automatique de textes

Author: Amini Massih-Reza
Caillet Marc
Gallinari Patrick
Pessiot Jean-François
Publication venue: HAL CCSD
Publication date: 01/03/2004


Une extension du modèle sémantique latent probabiliste pour le partitionnement non-supervisé de documents textuels

Author: Amini Massih-Reza
Gallinari Patrick
Kim Young-Min
Pessiot Jean-François
Publication venue: HAL CCSD
Publication date: 01/05/2009

In this article, we propose an extension of the probabilistic latent semantic model (PLSA) for the task of document clustering. We show that this extended model is equivalent to a linear combination of non-negative matrix factorization models under the KL-divergence objective function. We validate our model on three document collections and show empirically that our approach performs statistically significantly better than the base PLSA model on the clustering task.

An Extension of PLSA for Document Clustering

Author: Amini Massih-Reza
Gallinari Patrick
Kim Young-Min
Pessiot Jean-François
Publication venue: Association for Computing Machinery (ACM)
Publication date: 01/01/2008

In this paper we propose an extension of the PLSA model in which an extra latent variable allows the model to co-cluster documents and terms simultaneously. We show on three datasets that our extended model produces statistically significant improvements with respect to two clustering measures over the original PLSA and the multinomial mixture (MM) models.
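For orientation, the baseline that this abstract compares against can be sketched as follows. This is standard PLSA fitted with EM, not the extended model with the extra latent variable, whose details the abstract does not give. The function name `plsa`, the toy count matrix, and the rule of assigning each document to its most probable topic are illustrative assumptions.

```python
import numpy as np

def plsa(counts, n_topics, n_iter=200, seed=0):
    """Standard PLSA via EM on a (n_docs, n_words) count matrix.
    Returns P(z|d) with shape (n_docs, n_topics) and
    P(w|z) with shape (n_topics, n_words)."""
    rng = np.random.default_rng(seed)
    n_docs, n_words = counts.shape
    p_z_d = rng.random((n_docs, n_topics))
    p_z_d /= p_z_d.sum(axis=1, keepdims=True)
    p_w_z = rng.random((n_topics, n_words))
    p_w_z /= p_w_z.sum(axis=1, keepdims=True)
    eps = 1e-12                                   # guards against division by zero
    for _ in range(n_iter):
        # E-step: P(z|d,w) is proportional to P(z|d) * P(w|z)
        joint = p_z_d[:, :, None] * p_w_z[None, :, :]      # (docs, topics, words)
        post = joint / (joint.sum(axis=1, keepdims=True) + eps)
        # M-step: re-estimate both distributions from expected counts
        weighted = counts[:, None, :] * post
        p_w_z = weighted.sum(axis=0)
        p_w_z /= p_w_z.sum(axis=1, keepdims=True) + eps
        p_z_d = weighted.sum(axis=2)
        p_z_d /= p_z_d.sum(axis=1, keepdims=True) + eps
    return p_z_d, p_w_z

# Toy corpus: documents 0-2 use only words 0-2, documents 3-5 only words 3-5.
counts = np.array([
    [4, 3, 2, 0, 0, 0],
    [2, 5, 3, 0, 0, 0],
    [3, 3, 4, 0, 0, 0],
    [0, 0, 0, 5, 2, 3],
    [0, 0, 0, 3, 4, 2],
    [0, 0, 0, 2, 3, 5],
], dtype=float)

p_z_d, p_w_z = plsa(counts, n_topics=2)
labels = p_z_d.argmax(axis=1)   # cluster = most probable topic per document
print("document clusters:", labels)
```

Using the most probable topic as a hard cluster label is a common way to turn PLSA into a clustering method; the extended model described above instead introduces a separate latent variable so that documents and terms are co-clustered.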

Crossref