Search CORE

20 research outputs found

Uncovering trends in gene naming

Author: Cayting Philip D
Gerstein Mark B
Seringhaus Michael R
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

A survey of unusual gene names reveals trends underlying their choice

Crossref

PubMed Central

PEMer: a computational framework with simulation-based error models for inferring genomic structural variants from massive paired-end sequencing data

Author: Abyzov Alexej
Carriero Nicholas
Cayting Philip
Gerstein Mark B
Korbel Jan O
Mu Xinmeng Jasmine
Snyder Michael
Zhang Zhengdong
Publication venue: BioMed Central
Publication date: 23/02/2009
Field of study

Paired-End Mapper (PEMer) enables mapping of genomic structural variants at considerably enhanced sensitivity, specificity and resolution over previous approaches

Springer - Publisher Connector

PubMed Central

Comparative analysis of processed ribosomal protein pseudogenes in four mammalian genomes

Author: Balasubramanian Suganthi
Carriero Nicholas
Cayting Philip
Fang Gang
Frankish Adam
Gerstein Mark
Liu Yuen-Jong
Robilotto Rebecca
Zheng Deyou
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

An analysis of ribosomal protein pseudogenes in the four mammalian genomes reveals no correlation between number of pseudogenes and mRNA abundance

Springer - Publisher Connector

PubMed Central

Pseudofam: the pseudogene families database

Author: Altschul
Altschul
Bailey
Bateman
Doxiadis
Durinck
Eisenberg
Ekta Khurana
Finn
Flicek
Gang Fang
Gerstein
Gonclaves
Gruber
Harrison
Hugo Y. K. Lam
Kei-Hoi Cheung
Kim
Liu
Mark B. Gerstein
Nicholas Carriero
Ortutay
Pearson
Philip Cayting
Sassi
Stoesser
Su
Svensson
Tam
Yao
Zhang
Zhang
Zheng
Publication venue: Oxford University Press
Publication date: 01/01/2009
Field of study

Pseudofam (http://pseudofam.pseudogene.org) is a database of pseudogene families based on the protein families from the Pfam database. It provides resources for analyzing the family structure of pseudogenes including query tools, statistical summaries and sequence alignments. The current version of Pseudofam contains more than 125 000 pseudogenes identified from 10 eukaryotic genomes and aligned within nearly 3000 families (approximately one-third of the total families in PfamA). Pseudofam uses a large-scale parallelized homology search algorithm (implemented as an extension of the PseudoPipe pipeline) to identify pseudogenes. Each identified pseudogene is assigned to its parent protein family and subsequently aligned to each other by transferring the parent domain alignments from the Pfam family. Pseudogenes are also given additional annotation based on an ontology, reflecting their mode of creation and subsequent history. In particular, our annotation highlights the association of pseudogene families with genomic features, such as segmental duplications. In addition, pseudogene families are associated with key statistics, which identify outlier families with an unusual degree of pseudogenization. The statistics also show how the number of genes and pseudogenes in families correlates across different species. Overall, they highlight the fact that housekeeping families tend to be enriched with a large number of pseudogenes

CiteSeerX

Crossref

PubMed Central

Pseudogene.org: a comprehensive database and comparison platform for pseudogene annotation

Author: Apweiler
Benson
Birney
Collins
Dennis
Deyou Zheng
Harrison
Harrison
Harrison
Hubbard
John E. Karro
Kent
Khelifi
Khelifi
Liu
Mark Gerstein
Nadkarni
Nicholas Carriero
Ohshima
Paul Harrrison
Philip Cayting
Torrents
Wang
Yangpan Yan
Zhang
Zhang
Zhang
Zhang
Zhang
Zhang
Zhang
Zhang
Zhang
Zhang
Zhaolei Zhang
Zheng
Zheng
Publication venue: Oxford University Press
Publication date: 11/11/2006
Field of study

The Pseudogene.org knowledgebase serves as a comprehensive repository for pseudogene annotation. The definition of a pseudogene varies within the literature, resulting in significantly different approaches to the problem of identification. Consequently, it is difficult to maintain a consistent collection of pseudogenes in detail necessary for their effective use. Our database is designed to address this issue. It integrates a variety of heterogeneous resources and supports a subset structure that highlights specific groups of pseudogenes that are of interest to the research community. Tools are provided for the comparison of sets and the creation of layered set unions, enabling researchers to derive a current ‘consensus’ set of pseudogenes. Additional features include versatile search, the capacity for robust interaction with other databases, the ability to reconstruct older versions of the database (accounting for changing genome builds) and an underlying object-oriented interface designed for researchers with a minimal knowledge of programming. At the present time, the database contains more than 100 000 pseudogenes spanning 64 prokaryote and 11 eukaryote genomes, including a collection of human annotations compiled from 16 sources

CiteSeerX

Crossref

PubMed Central

Segmental duplications in the human genome reveal details of pseudogene formation

Author: Bailey
Bailey
Bischof
Chao Cheng
Ekta Khurana
Glusman
Graur
Gupta
Harrison
Hugo Y. K. Lam
Innan
Jiang
Kent
Kim
Korbel
Kuehn
Lam
Li
Li
Lipman
Liu
Mark B. Gerstein
Marques-Bonet
Mighell
Nicholas Carriero
Ohno
Ohshima
Pearson
Philip Cayting
Sasidharan
Torrents
Vilella
Zhang
Zhang
Zhang
Zhang
Zhang
Zheng
Publication venue: Oxford University Press
Publication date: 01/01/2010
Field of study

Duplicated pseudogenes in the human genome are disabled copies of functioning parent genes. They result from block duplication events occurring throughout evolutionary history. Relatively recent duplications (with sequence similarity ≥90% and length ≥1 kb) are termed segmental duplications (SDs); here, we analyze the interrelationship of SDs and pseudogenes. We present a decision-tree approach to classify pseudogenes based on their (and their parents’) characteristics in relation to SDs. The classification identifies 140 novel pseudogenes and makes possible improved annotation for the 3172 pseudogenes located in SDs. In particular, it reveals that many pseudogenes in SDs likely did not arise directly from parent genes, but are the result of a multi-step process. In these cases, the initial duplication or retrotransposition of a parent gene gives rise to a ‘parent pseudogene’, followed by further duplication creating duplicated–duplicated or duplicated–processed pseudogenes, respectively. Moreover, we can precisely identify these parent pseudogenes by overlap with ancestral SD loci. Finally, a comparison of nucleotide substitutions per site in a pseudogene with its surrounding SD region allows us to estimate the time difference between duplication and disablement events, and this suggests that most duplicated pseudogenes in SDs were likely disabled around the time of the original duplication

CiteSeerX

Crossref

PubMed Central

Analysis of nuclear receptor pseudogenes in vertebrates : How the silent tell their stories

Author: Cayting Philip
Gerstein Mark
Weinstock George
Zhang Zhengdong D
Publication venue
Publication date: 10/05/2022
Field of study

Thư viện trường Đại học Đà Lạt