
ScholarBank@NUS

Novel computational methods for increasing PCR primer design effectiveness in directed sequencing

Author: A Stabenau
Anushka Brownley
AR Quinlan
D Shinde
Dana Busam
ET Bolton
F Dong
F Yao
G Sarkar
I Ovcharenko
J Sambrook
Karen Beeson
Kelvin Li
KJ Breslauer
M Stephens
MF Tsai
P Rice
S Rozen
S Weckx
SA Haas
Samuel Levy
Sean Murphy
SF Altschul
SH Chen
Steve Ferriera
T Sjöblom
Timothy B Stockwell
Tina C McIntosh
V Gorelenkov
V Rand
X Wu
Y Li
Publication venue: BioMed Central
Publication date: 01/01/2008

Abstract Background Polymerase chain reaction (PCR) is used in directed sequencing for the discovery of novel polymorphisms. As the first step in PCR directed sequencing, effective PCR primer design is crucial for obtaining high-quality sequence data for target regions. Since current computational primer design tools are not fully tuned with stable underlying laboratory protocols, researchers may still be forced to iteratively optimize protocols for failed amplifications after the primers have been ordered. Furthermore, potentially identifiable factors which contribute to PCR failures have yet to be elucidated. This inefficient approach to primer design is further intensified in a high-throughput laboratory, where hundreds of genes may be targeted in one experiment. Results We have developed a fully integrated computational PCR primer design pipeline that plays a key role in our high-throughput directed sequencing pipeline. Investigators may specify target regions defined through a rich set of descriptors, such as Ensembl accessions and arbitrary genomic coordinates. Primer pairs are then selected computationally to produce a minimal amplicon set capable of tiling across the specified target regions. As part of the tiling process, primer pairs are computationally screened to meet the criteria for success with one of two PCR amplification protocols. In the process of improving our sequencing success rate, which currently exceeds 95% for exons, we have discovered novel and accurate computational methods capable of identifying primers that may lead to PCR failures. We reveal the laboratory protocols and their associated, empirically determined computational parameters, as well as describe the novel computational methods which may benefit others in future primer design research. 
Conclusion The high-throughput PCR primer design pipeline has been very successful in providing the basis for high-quality directed sequencing results and for minimizing costs associated with labor and reprocessing. The modular architecture of the primer design software has made it possible to readily integrate additional primer critique tests based on iterative feedback from the laboratory. As a result, the primer design software, coupled with the laboratory protocols, serves as a powerful tool for low and high-throughput primer design to enable successful directed sequencing.

Springer - Publisher Connector

Public Library of Science (PLOS)

The Diploid Genome Sequence of an Individual Human

Presented here is a genome sequence of an individual human. It was produced from ∼32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb) of contiguous sequence with approximately 7.5-fold coverage for any given region. We developed a modified version of the Celera assembler to facilitate the identification and comparison of alternate alleles within this individual diploid genome. Comparison of this genome and the National Center for Biotechnology Information human reference assembly revealed more than 4.1 million DNA variants, encompassing 12.3 Mb. These variants (of which 1,288,319 were novel) included 3,213,401 single nucleotide polymorphisms (SNPs), 53,823 block substitutions (2–206 bp), 292,102 heterozygous insertion/deletion events (indels)(1–571 bp), 559,473 homozygous indels (1–82,711 bp), 90 inversions, as well as numerous segmental duplications and copy number variation regions. Non-SNP DNA variation accounts for 22% of all events identified in the donor, however they involve 74% of all variant bases. This suggests an important role for non-SNP genetic alterations in defining the diploid genome structure. Moreover, 44% of genes were heterozygous for one or more variants. Using a novel haplotype assembly strategy, we were able to span 1.5 Gb of genome sequence in segments >200 kb, providing further precision to the diploid nature of the genome. These data depict a definitive molecular portrait of a diploid human genome that provides a starting point for future genome comparisons and enables an era of individualized genomic information

CiteSeerX

Diposit Digital de la Universitat de Barcelona

ScholarBank@NUS

Genome sequence of the necrotrophic plant pathogen Pythium ultimum reveals original pathogenicity mechanisms and effector repertoire

Author: Beakes Gordon W.
Boore Jeffrey L.
Brouwer Henk
Buell C. Robin
Busam Dana
Cano Liliana
Coutinho Pedro M.
De Vries Ronald P.
Dumas Bernard
Ferriera Steve
Fuerstenberg Susan I.
Gachon Claire M. M.
Gaulin Elodie
Govers Francine
Grenville-Briggs Laura
Hamilton John P.
Henrissat Bernard
Holt Carson
Horner Neil
Hostetler Jessica
Huitema Edgar
Jiang Rays H. Y.
Johnson Justin
Kamoun Sophien
Krajaejun Theerapong
Levesque C. Andre
Lin Haining
Martin Frank
Meijer Harold J. G.
Moore Barry
Morris Paul
Phuntmart Vipaporn
Puiu Daniela
Raffaele Sylvain
Robideau Gregg P.
Shetty Jyoti
Stajich Jason E.
Thines Marco
Thomas Paul D.
Tisserat Ned
Tripathy Sucheta
Tyler Brett M.
van West Pieter
Wawra Stephan
Whitty Brett R.
Win Joe
Yandell Mark
Zerillo Marcelo M.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010

Background: Pythium ultimum (P. ultimum) is a ubiquitous oomycete plant pathogen responsible for a variety of diseases on a broad range of crop and ornamental species. Results: The P. ultimum genome (42.8 Mb) encodes 15,290 genes and has extensive sequence similarity and synteny with related Phytophthora species, including the potato blight pathogen Phytophthora infestans. Whole transcriptome sequencing revealed expression of 86% of genes, with detectable differential expression of suites of genes under abiotic stress and in the presence of a host. The predicted proteome includes a large repertoire of proteins involved in plant pathogen interactions although surprisingly, the P. ultimum genome does not encode any classical RXLR effectors and relatively few Crinkler genes in comparison to related phytopathogenic oomycetes. A lower number of enzymes involved in carbohydrate metabolism were present compared to Phytophthora species, with the notable absence of cutinases, suggesting a significant difference in virulence mechanisms between P. ultimum and more host specific oomycete species. Although we observed a high degree of orthology with Phytophthora genomes, there were novel features of the P. ultimum proteome including an expansion of genes involved in proteolysis and genes unique to Pythium. We identified a small gene family of cadherins, proteins involved in cell adhesion, the first report in a genome outside the metazoans. Conclusions: Access to the P. ultimum genome has revealed not only core pathogenic mechanisms within the oomycetes but also lineage specific genes associated with the alternative virulence and lifestyles found within the pythiaceous lineages compared to the Peronosporaceae

HAL AMU

Wageningen University & Research Publications

ProdInra

Hal-Diderot

Springer - Publisher Connector

eScholarship - University of California

University of Dundee Online Publications

Hochschulschriftenserver - Universität Frankfurt am Main

Distinguishing between cancer driver and passenger gene alteration candidates via cross-species comparison: a pilot study

Springer - Publisher Connector

A framework for human microbiome research

Author: Aagaard Kjersti M.
Abolude Olukemi O.
Abubucker Sahar
Allen-Vercoe Emma
Alm Eric J.
Alvarado Lucia
Andersen Gary L.
Anderson Scott
Appelbaum Elizabeth
Arachchi Harindra M.
Armitage Gary
Arze Cesar A.
Ayvaz Tulin
Badger Jonathan H.
Baker Carl C.
Begg Lisa
Belachew Tsegahiwot
Bhonagiri Veena
Bihan Monika
Birren Bruce W.
Blaser Martin J.
Bloom Toby
Brooks Paul
Buck Gregory A.
Buhay Christian J.
Busam Dana A.
Campbell Joseph L.
Canon Shane R.
Cantarel Brandi L.
Chain Patrick S.
Chen I-Min A.
Chen Lei
Chhibba Shaila
Chinwalla Asif T.
Chu Ken
Ciulla Dawn M.
Clemente Jose C.
Clifton Sandra W.
Conlan Sean
Crabtree Jonathan
Creasy Heather H.
Cutting Mary A.
Davidovics Noam J.
Davis Catherine C.
Deal Carolyn
Delehaunty Kimberley D.
DeSantis Todd Z.
Dewhirst Floyd Everett
Deych Elena
Di Francesco Valentina
Ding Yan
Dooling David J.
Dugan Shannon P.
Dunne Wm. Michael
Durkin A. Scott
Earl Ashlee M.
Edgar Robert C.
Erlich Rachel L.
Farmer Candace N.
Farrell Ruth M.
Faust Karoline
Feldgarden Michael
Felix Victor M.
Fisher Sheila
FitzGerald Michael G.
Fodor Anthony A.
Forney Larry
Foster Leslie
Friedman Jonathan
Friedrich Dennis C.
Fronick Catrina C.
Fulton Lucinda L.
Fulton Robert S.
Gao Hongyu
Garcia Nathalia
Gevers Dirk
Giannoukos Georgia
Gibbs Richard A.
Giblin Christina
Giglio Michelle G.
Giovanni Maria Y.
Goldberg Jonathan M.
Goll Johannes
Gonzalez Antonio
Griggs Allison
Gujja Sharvari
Haas Brian J.
Hallsworth-Pepin Kymberlie
Hamilton Holli A.
Harris Emily L.
Hepburn Theresa A.
Herter Brandi
Highlander Sarah K.
Hoffmann Diane E.
Holder Michael E.
Howarth Clinton
Huang Katherine H.
Huse Susan M.
Huttenhower Curtis
Izard Jacques Georges
Jansson Janet K.
Jiang Huaiyang
Jordan Catherine
Joshi Vandita
Katancik James A.
Keitel Wendy A.
Kelley Scott T.
Kells Cristyn
Kinder-Haake Susan
King Nicholas B.
Knight Rob
Knights Dan
Kong Heidi H.
Koren Omry
Koren Sergey
Kota Karthik C.
Kovar Christie L.
Kyrpides Nikos C.
La Rosa Patricio S.
Lee Sandra L.
Lemon Katherine Paige
Lennon Niall
Lewis Cecil M.
Lewis Lora
Ley Ruth E.
Li Kelvin
Liolios Konstantinos
Liu Bo
Liu Yue
Lo Chien-Chi
Lobos Elizabeth A.
Lozupone Catherine A.
Lunsford R. Dwayne
Madden Tessa
Madupu Ramana
Magrini Vincent
Mahurkar Anup A.
Mannon Peter J.
Mardis Elaine R.
Markowitz Victor M.
Martin John C.
Mavrommatis Konstantinos
McCorrison Jamison M.
McDonald Daniel
McEwen Jean
McGuire Amy L.
McInnes Pamela
Mehta Teena
Methé Barbara A.
Mihindukulasuriya Kathie A.
Miller Jason R.
Minx Patrick J.
Mitreva Makedonka
Muzny Donna M.
Nelson Karen E.
Newsham Irene
Nusbaum Chad
Orvis Joshua
O’Laughlin Michelle
Pagani Ioanna
Palaniappan Krishna
Pamela Sankar J.
Patel Shital M.
Pearson Matthew
Peterson Jane
Petrosino Joseph F.
Podar Mircea
Pohl Craig
Pollard Katherine S.
Pop Mihai
Priest Margaret E.
Proctor Lita M.
Qin Xiang
Raes Jeroen
Ravel Jacques
Reid Jeffrey G.
Rho Mina
Rhodes Rosamond
Riehle Kevin P.
Rivera Maria C.
Rodriguez-Mueller Beltran
Rogers Yu-Hui
Ross Matthew C.
Russ Carsten
Sanka Ravi K.
Sathirapongsasuti Fah
Schloss Jeffery A.
Schloss Patrick D.
Schmidt Thomas M.
Scholz Matthew
Schriml Lynn
Schubert Alyxandria M.
Segata Nicola
Segre Julia A.
Shannon William D.
Sharp Richard R.
Sharpton Thomas J.
Shenoy Narmada
Sheth Nihar U.
Simone Gina A.
Singh Indresh
Smillie Chris Scott
Sobel Jack D.
Sodergren Erica J.
Sommer Daniel D.
Spicer Paul
Sutton Granger G.
Sykes Sean M.
Tabbaa Diana G.
Thiagarajan Mathangi
Tomlinson Chad M.
Torralba Manolito
Treangen Todd J.
Truty Rebecca M.
Versalovic James
Vishnivetskaya Tatiana A.
Vivien Bonazzi J.
Walker Jason
Wang Lu
Wang Zhengyuan
Ward Doyle V.
Warren Wesley
Watson Mark A.
Weinstock George M.
Wellington Christopher
Wetterstrand Kris A.
White James R.
White Owen
Wilczek-Boney Katarzyna
Wilson Richard K.
Wollam Aye M.
Worley Kim C.
Wortman Jennifer R.
Wu Yuan Qing
Wylie Kristine M.
Wylie Todd
Yandava Chandri
Ye Liang
Ye Yuzhen
Yooseph Shibu
Youmans Bonnie P.
Young Sarah K.
Zeng Qiandong
Zhang Lan
Zhou Yanjiao
Zhu Yiming
Zoloth Laurie
Zucker Jeremy Daniel Hofeld
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/11/2011

A variety of microbial communities and their genes (the microbiome) exist throughout the human body, with fundamental roles in human health and disease. The National Institutes of Health (NIH)-funded Human Microbiome Project Consortium has established a population-scale framework to develop metagenomic protocols, resulting in a broad range of quality-controlled resources and data including standardized methods for creating, processing and interpreting distinct types of high-throughput metagenomic data available to the scientific community. Here we present resources from a population of 242 healthy adults sampled at 15 or 18 body sites up to three times, which have generated 5,177 microbial taxonomic profiles from 16S ribosomal RNA genes and over 3.5 terabases of metagenomic sequence so far. In parallel, approximately 800 reference strains isolated from the human body have been sequenced. Collectively, these data represent the largest resource describing the abundance and variety of the human microbiome, while providing a framework for current and future studies

CiteSeerX

DSpace@MIT

Harvard University - DASH

Structure, function and diversity of the healthy human microbiome

Author: Aagaard Kjersti M
Abolude Olukemi O
Abubucker Sahar
Allen-Vercoe Emma
Alm Eric J
Alvarado Lucia
Andersen Gary L
Anderson Scott
Appelbaum Elizabeth
Arachchi Harindra M
Armitage Gary
Arze Cesar A
Ayvaz Tulin
Badger Jonathan H
Baker Carl C
Begg Lisa
Belachew Tsegahiwot
Bhonagiri Veena
Bihan Monika
Birren Bruce W
Blaser Martin J
Bloom Toby
Bonazzi Vivien
Brooks J. Paul
Buck Gregory A
Buhay Christian J
Busam Dana A
Campbell Joseph L
Canon Shane R
Cantarel Brandi L
Chain Patrick S. G
Chen I-Min A
Chen Lei
Chhibba Shaila
Chinwalla Asif T
Chu Ken
Ciulla Dawn M
Clemente Jose C
Clifton Sandra W
Conlan Sean
Crabtree Jonathan
Creasy Heather H
Cutting Mary A
Davidovics Noam J
Davis Catherine C
Deal Carolyn
Delehaunty Kimberley D
DeSantis Todd Z
Dewhirst Floyd E
Deych Elena
Di Francesco Valentina
Ding Yannan
Dooling David J
Dugan Shannon P
Dunne Wm Michael
Durkin A. Scott
Earl Ashlee M
Edgar Robert C
Erlich Rachel L
Farmer Candace N
Farrell Ruth M
Faust Karoline
Feldgarden Michael
Felix Victor M
Fisher Sheila
FitzGerald Michael G
Fodor Anthony A
Forney Larry J
Foster Leslie
Friedman Jonathan
Friedrich Dennis C
Fronick Catrina C
Fulton Lucinda L
Fulton Robert S
Gao Hongyu
Garcia Nathalia
Gevers Dirk
Giannoukos Georgia
Gibbs Richard A
Giblin Christina
Giglio Michelle G
Giovanni Maria Y
Goldberg Jonathan M
Goll Johannes
Gonzalez Antonio
Griggs Allison
Gujja Sharvari
Haake Susan Kinder
Haas Brian J
Hallsworth-Pepin Kymberlie
Hamilton Holli A
Harris Emily L
Hepburn Theresa A
Herter Brandi
Highlander Sarah K
Hoffmann Diane E
Holder Michael E
Howarth Clinton
Huang Katherine H
Huse Susan M
Huttenhower Curtis
Izard Jacques
Jansson Janet K
Jiang Huaiyang
Jordan Catherine
Joshi Vandita
Katancik James A
Keitel Wendy A
Kelley Scott T
Kells Cristyn
King Nicholas B
Knight Rob
Knights Dan
Kong Heidi H
Koren Omry
Koren Sergey
Kota Karthik C
Kovar Christie L
Kyrpides Nikos C
La Rosa Patricio S
Lee Sandra L
Lemon Katherine P
Lennon Niall
Lewis Cecil M
Lewis Lora
Ley Ruth E
Li Kelvin
Liolios Konstantinos
Liu Bo
Liu Yue
Lo Chien-Chi
Lobos Elizabeth A
Lozupone Catherine A
Lunsford R. Dwayne
Madden Tessa
Madupu Ramana
Magrini Vincent
Mahurkar Anup A
Mannon Peter J
Mardis Elaine R
Markowitz Victor M
Martin John C
Mavromatis Konstantinos
McCorrison Jamison M
McDonald Daniel
McEwen Jean
McGuire Amy L
McInnes Pamela
Mehta Teena
Methe Barbara A
Mihindukulasuriya Kathie A
Miller Jason R
Minx Patrick J
Mitreva Makedonka
Muzny Donna M
Nelson Karen E
Newsham Irene
Nusbaum Chad
O'Laughlin Michelle
Orvis Joshua
Pagani Ioanna
Palaniappan Krishna
Patel Shital M
Pearson Matthew
Peterson Jane
Petrosino Joseph F
Podar Mircea
Pohl Craig
Pollard Katherine S
Pop Mihai
Priest Margaret E
Proctor Lita M
Qin Xiang
Raes Jeroen
Ravel Jacques
Reid Jeffrey G
Rho Mina
Rhodes Rosamond
Riehle Kevin P
Rivera Maria C
Rodriguez-Mueller Beltran
Rogers Yu-Hui
Ross Matthew C
Russ Carsten
Sanka Ravi K
Sankar Pamela
Sathirapongsasuti J. Fah
Schloss Jeffery A
Schloss Patrick D
Schmidt Thomas M
Scholz Matthew
Schriml Lynn
Schubert Alyxandria M
Segata Nicola
Segre Julia A
Shannon William D
Sharp Richard R
Sharpton Thomas J
Shenoy Narmada
Sheth Nihar U
Simone Gina A
Singh Indresh
Smillie Christopher S
Sobel Jack D
Sodergren Erica J
Sommer Daniel D
Spicer Paul
Sutton Granger G
Sykes Sean M
Tabbaa Diana G
Thiagarajan Mathangi
Tomlinson Chad M
Torralba Manolito
Treangen Todd J
Truty Rebecca M
Versalovic James
Vishnivetskaya Tatiana A
Walker Jason
Wang Lu
Wang Zhengyuan
Ward Doyle V
Warren Wesley
Watson Mark A
Weinstock George M
Wellington Christopher
Wetterstrand Kris A
White James R
White Owen
Wilczek-Boney Katarzyna
Wilson Richard K
Wollam Aye M
Worley Kim C
Wortman Jennifer R
Wu YuanQing
Wylie Kristine M
Wylie Todd
Yandava Chandri
Ye Liang
Ye Yuzhen
Yooseph Shibu
Youmans Bonnie P
Young Sarah K
Zeng Qiandong
Zhang Lan
Zhou Yanjiao
Zhu Yiming
Zoloth Laurie
Zucker Jeremy D
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/11/2011

Author Posting. © The Authors, 2012. This article is posted here by permission of Nature Publishing Group. The definitive version was published in Nature 486 (2012): 207-214, doi:10.1038/nature11234.
Studies of the human microbiome have revealed that even healthy individuals differ remarkably in the microbes that occupy habitats such as the gut, skin and vagina. Much of this diversity remains unexplained, although diet, environment, host genetics and early microbial exposure have all been implicated. Accordingly, to characterize the ecology of human-associated microbial communities, the Human Microbiome Project has analysed the largest cohort and set of distinct, clinically relevant body habitats so far. We found the diversity and abundance of each habitat’s signature microbes to vary widely even among healthy subjects, with strong niche specialization both within and among individuals. The project encountered an estimated 81–99% of the genera, enzyme families and community configurations occupied by the healthy Western microbiome. Metagenomic carriage of metabolic pathways was stable among individuals despite variation in community structure, and ethnic/racial background proved to be one of the strongest associations of both pathways and microbes with clinical metadata. These results thus delineate the range of structural and functional configurations normal in the microbial communities of a healthy population, enabling future characterization of the epidemiology, ecology and translational applications of the human microbiome.
This research was supported in part by National Institutes of Health grants U54HG004969 to B.W.B.; U54HG003273 to R.A.G.; U54HG004973 to R.A.G., S.K.H. and J.F.P.; U54HG003067 to E.S.Lander; U54AI084844 to K.E.N.; N01AI30071 to R.L.Strausberg; U54HG004968 to G.M.W.; U01HG004866 to O.R.W.; U54HG003079 to R.K.W.; R01HG005969 to C.H.; R01HG004872 to R.K.; R01HG004885 to M.P.; R01HG005975 to P.D.S.; R01HG004908 to Y.Y.; R01HG004900 to M.K.Cho and P. Sankar; R01HG005171 to D.E.H.; R01HG004853 to A.L.M.; R01HG004856 to R.R.; R01HG004877 to R.R.S. and R.F.; R01HG005172 to P. Spicer; R01HG004857 to M.P.; R01HG004906 to T.M.S.; R21HG005811 to E.A.V.; M.J.B. was supported by UH2AR057506; G.A.B. was supported by UH2AI083263 and UH3AI083263 (G.A.B., C. N. Cornelissen, L. K. Eaves and J. F. Strauss); S.M.H. was supported by UH3DK083993 (V. B. Young, E. B. Chang, F. Meyer, T. M. S., M. L. Sogin, J. M. Tiedje); K.P.R. was supported by UH2DK083990 (J. V.); J.A.S. and H.H.K. were supported by UH2AR057504 and UH3AR057504 (J.A.S.); DP2OD001500 to K.M.A.; N01HG62088 to the Coriell Institute for Medical Research; U01DE016937 to F.E.D.; S.K.H. was supported by RC1DE0202098 and R01DE021574 (S.K.H. and H. Li); J.I. was supported by R21CA139193 (J.I. and D. S. Michaud); K.P.L. was supported by P30DE020751 (D. J. Smith); Army Research Office grant W911NF-11-1-0473 to C.H.; National Science Foundation grants NSF DBI-1053486 to C.H. and NSF IIS-0812111 to M.P.; The Office of Science of the US Department of Energy under Contract No. DE-AC02-05CH11231 for P.S.C.; LANL Laboratory-Directed Research and Development grant 20100034DR and the US Defense Threat Reduction Agency grants B104153I and B084531I to P.S.C.; Research Foundation - Flanders (FWO) grant to K.F. and J.Raes; R.K. is an HHMI Early Career Scientist; Gordon & Betty Moore Foundation funding and institutional funding from the J. David Gladstone Institutes to K.S.P.; A.M.S. was supported by fellowships provided by the Rackham Graduate School and the NIH Molecular Mechanisms in Microbial Pathogenesis Training Grant T32AI007528; a Crohn’s and Colitis Foundation of Canada Grant in Aid of Research to E.A.V.; 2010 IBM Faculty Award to K.C.W.; analysis of the HMP data was performed using National Energy Research Scientific Computing resources, the BluBioU Computational Resource at Rice University

Lirias

DSpace@MIT

Woods Hole Open Access Server

eScholarship - University of California

Distinguishing between cancer driver and passenger gene alteration candidates via cross-species comparison: a pilot study

Author: Busam Dana
Ferriera Steve
Halberg Richard
Ji Xinglai
Peña Maria
Tang Jie
Venkataramu Chinnambally
Yeatman Timothy J
Zhao Shaying
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/08/2010

Abstract Background We are developing a cross-species comparison strategy to distinguish between cancer driver- and passenger gene alteration candidates, by utilizing the difference in genomic location of orthologous genes between the human and other mammals. As an initial test of this strategy, we conducted a pilot study with human colorectal cancer (CRC) and its mouse model C57BL/6J ApcMin/+, focusing on human 5q22.2 and 18q21.1-q21.2. Methods We first performed bioinformatics analysis on the evolution of 5q22.2 and 18q21.1-q21.2 regions. Then, we performed exon-targeted sequencing, real time quantitative polymerase chain reaction (qPCR), and real time quantitative reverse transcriptase PCR (qRT-PCR) analyses on a number of genes of both regions with both human and mouse colon tumors. Results These two regions (5q22.2 and 18q21.1-q21.2) are frequently deleted in human CRCs and encode genuine colorectal tumor suppressors APC and SMAD4. They also encode genes such as MCC (mutated in colorectal cancer) with their role in CRC etiology unknown. We have discovered that both regions are evolutionarily unstable, resulting in genes that are clustered in each human region being found scattered at several distinct loci in the genome of many other species. For instance, APC and MCC are within 200 kb apart in human 5q22.2 but are 10 Mb apart in the mouse genome. Importantly, our analyses revealed that, while known CRC driver genes APC and SMAD4 were disrupted in both human colorectal tumors and tumors from ApcMin/+ mice, the questionable MCC gene was disrupted in human tumors but appeared to be intact in mouse tumors. Conclusions These results indicate that MCC may not actually play any causative role in early colorectal tumorigenesis. We also hypothesize that its disruption in human CRCs is likely a mere result of its close proximity to APC in the human genome. 
Expanding this pilot study to the entire genome may identify more questionable genes like MCC, facilitating the discovery of new CRC driver gene candidates.