Search CORE

11 research outputs found

Genome Resources for Climate‐Resilient Cowpea, an Essential Crop for Food Security

Author: Alhakami Hind
Alpert Matthew
Atokple Ibrahim
Barkley Noelle A.
Batieno Benoit J.
Boukar Ousmane
Bozdag Serdar
Cisse Ndiaga
Close Timothy J.
Drabo Issa
Ehlers Jeffrey D.
Farmer Andrew
Fatokun Christian
Gu Yong Q.
Guo Yi-Ning
Huynh Bao-Lam
Jackson Scott A.
Kusi Francis
Lawley Cynthia T.
Lonardi Stefano
Lucas Mitchell R.
Luo Ming-Cheng
Ma Yaqin
Mirebrahim Hamid
Munoz-Amatriain Maria
Roberts Philip A.
Timko Michael P.
Wanamaker Steve
Wu Jiajie
Xu Pei
You Frank
Publication venue: e-Publications@Marquette
Publication date: 01/10/2016
Field of study

Cowpea (Vigna unguiculata L. Walp.) is a legume crop that is resilient to hot and drought‐prone climates, and a primary source of protein in sub‐Saharan Africa and other parts of the developing world. However, genome resources for cowpea have lagged behind most other major crops. Here we describe foundational genome resources and their application to the analysis of germplasm currently in use in West African breeding programs. Resources developed from the African cultivar IT97K‐499‐35 include a whole‐genome shotgun (WGS) assembly, a bacterial artificial chromosome (BAC) physical map, and assembled sequences from 4355 BACs. These resources and WGS sequences of an additional 36 diverse cowpea accessions supported the development of a genotyping assay for 51 128 SNPs, which was then applied to five bi‐parental RIL populations to produce a consensus genetic map containing 37 372 SNPs. This genetic map enabled the anchoring of 100 Mb of WGS and 420 Mb of BAC sequences, an exploration of genetic diversity along each linkage group, and clarification of macrosynteny between cowpea and common bean. The SNP assay enabled a diversity analysis of materials from West African breeding programs. Two major subpopulations exist within those materials, one of which has significant parentage from South and East Africa and more diversity. There are genomic regions of high differentiation between subpopulations, one of which coincides with a cluster of nodulin genes. The new resources and knowledge help to define goals and accelerate the breeding of improved varieties to address food security issues related to limited‐input small‐holder farming and climate stress

epublications@Marquette

Crossref

eScholarship - University of California

The genome of cowpea (Vigna unguiculata [L.] Walp.)

Author: Alhakami Hind
Cannon Steven B.
Close Timothy J.
Doležel Jaroslav
Farmer Andrew D.
Hasan Abid Md.
Hokin Samuel A.
Liang Qihua
Lo Sassoum
Lonardi Stefano
Luo Ming‐Cheng
Muñoz Amatriain María
Ndeve Arsenio
Ounit Rachid
Roberts Philip A.
Santos Jansen R.P.
Schulman Alan H.
Shu Shengqiang
Tanskanen Jaakko
Verdier Jerome
Vrána Jan
Wanamaker Steve I.
Zhu Tingting
Publication venue: John Wiley & Sons
Publication date: 08/02/2024
Field of study

[EN] Cowpea (Vigna unguiculata [L.] Walp.) is a major crop for worldwide food and nutritional security, especially in sub-Saharan Africa, that is resilient to hot and drought-prone environments. An assembly of the single-haplotype inbred genome of cowpea IT97K-499-35 was developed by exploiting the synergies between single-molecule real-time sequencing, optical and genetic mapping, and an assembly reconciliation algorithm. A total of 519 Mb is included in the assembled sequences. Nearly half of the assembled sequence is composed of repetitive elements, which are enriched within recombination-poor pericentromeric regions. A comparative analysis of these elements suggests that genome size differences between Vigna species are mainly attributable to changes in the amount of Gypsy retrotransposons. Conversely, genes are more abundant in more distal, high-recombination regions of the chromosomes; there appears to be more duplication of genes within the NBS-LRR and the SAUR-like auxin superfamilies compared with other warm-season legumes that have been sequenced. A surprising outcome is the identification of an inversion of 4.2 Mb among landraces and cultivars, which includes a gene that has been associated in other plants with interactions with the parasitic weed Striga gesnerioides. The genome sequence facilitated the identification of a putative syntelog for multiple organ gigantism in legumes. A revised numbering system has been adopted for cowpea chromosomes based on synteny with common bean (Phaseolus vulgaris). An estimate of nuclear genome size of 640.6 Mbp based on cytometry is presentedS

Leon University (Spain)

Alhakami/calculate-gene-percentage v1.0

Author: Hind Alhakami
Publication venue
Publication date
Field of study

A perl script to calculate gene percentage in a genome assembly

ZENODO

Algorithms and Data Structures for de novo Sequence Assembly

Author: Alhakami Hind
Publication venue: eScholarship, University of California
Publication date: 01/01/2017
Field of study

Despite the prodigious throughput of the sequencing instruments currently on the market, the assembly problem remains very challenging, mainly due to the repetitive content of large genomes, uneven sequencing coverage, and the presence of (non-uniform) sequencing errors and chimeric reads. The third generation of sequencing technology such as Pacific Biosciences and Oxford Nanopore offers very long reads at a higher cost per base, but sequencing error rate is much higher. As a consequence, the final assembly is very rarely entirely finished, with one solid sequence per chromosome. Instead, the typical output is an unordered/unoriented set of contiguous regions called contigs. We examine two different but related problems in this study; merging multiple assemblies produced using different assemblers/parameters, and stitching assembled BACs to create a genome-wide assembly.The contribution of this dissertation is twofold. First, compact encoding of finite sets of strings is a classic problem. The manipulation of large sets requires compact data structures that allow for efficient set operations. We defined sequence decision diagrams (SeqDDs), which can encode arbitrary finite sets of strings over an alphabet.Second, reassembly of existing overlapping contigs with the intent to produce a higher quality genome-wide assembly. Second, merge multiple assemblies to produce a higher quality consensus is a compelling problem. We conducted a comparative study of state of the art assembly reconciliation tools, with the intent to use them in assembling a set of approximately four thosands Vigna unguiculata (cowpea) assembled BACs. To accomplish this task, we developed Colored-Positioned de bruijn graph, a variant of the classic de bruijn graph to stitch overlapped assemblies.In this Dissertation we studied and developed data structures and algorithms to merge overlapping assemblies. In particular: (1) Introduced sequence decision diagrams (SeqDDs) to enable compact encoding of finite sets of strings that allow for efficient set operations, among which detecting overlaps. (2) carried a comparative study of state of the art assembly reconciliation tools. and (3) developed tools to cluster overlapped BACs and assemble said clusters. Our assembler implements colored-positioned de bruijn graph, an augmented variant of the classic de bruijn graph, defined in this study

Ezid

eScholarship - University of California

Recommended from our members

A comparative evaluation of genome assembly reconciliation tools.

Author: Alhakami Hind
Lonardi Stefano
Mirebrahim Hamid
Publication venue: eScholarship, University of California
Publication date: 01/05/2017
Field of study

BackgroundThe majority of eukaryotic genomes are unfinished due to the algorithmic challenges of assembling them. A variety of assembly and scaffolding tools are available, but it is not always obvious which tool or parameters to use for a specific genome size and complexity. It is, therefore, common practice to produce multiple assemblies using different assemblers and parameters, then select the best one for public release. A more compelling approach would allow one to merge multiple assemblies with the intent of producing a higher quality consensus assembly, which is the objective of assembly reconciliation.ResultsSeveral assembly reconciliation tools have been proposed in the literature, but their strengths and weaknesses have never been compared on a common dataset. We fill this need with this work, in which we report on an extensive comparative evaluation of several tools. Specifically, we evaluate contiguity, correctness, coverage, and the duplication ratio of the merged assembly compared to the individual assemblies provided as input.ConclusionsNone of the tools we tested consistently improved the quality of the input GAGE and synthetic assemblies. Our experiments show an increase in contiguity in the consensus assembly when the original assemblies already have high quality. In terms of correctness, the quality of the results depends on the specific tool, as well as on the quality and the ranking of the input assemblies. In general, the number of misassemblies ranges from being comparable to the best of the input assembly to being comparable to the worst of the input assembly

eScholarship - University of California

Additional file 1 of A comparative evaluation of genome assembly reconciliation tools

Author: Hamid Mirebrahim (4035023)
Hind Alhakami (4035020)
Stefano Lonardi (76597)
Publication venue
Publication date
Field of study

Contains Supplementary Notes 1â7, Supplementary Tables 1â19 and Supplementary Figures 1â13. (PDF 1250 kb

FigShare

The genome of cowpea (Vigna unguiculata [L.] Walp.)

Author: Alhakami Hind
Cannon Steven B.
Close Timothy J.
Doležel Jaroslav
Farmer Andrew D.
Hasan Abid Md.
Hokin Samuel A.
Liang Qihua
Lo Sassoum
Lonardi Stefano
Luo Ming-Cheng
Muñoz-Amatriaín María
Ndeve Arsenio
Ounit Rachid
Roberts Philip A.
Santos Jansen R.P.
Schulman Alan H.
Shu Shengqiang
Tanskanen Jaakko
Verdier Jerome
Vrána Jan
Wanamaker Steve I.
Zhu Tingting
Publication venue
Publication date: 01/01/2019
Field of study

Cowpea (Vigna unguiculata [L.] Walp.) is a major crop for worldwide food and nutritional security, especially in sub-Saharan Africa, that is resilient to hot and drought-prone environments. An assembly of the single-haplotype inbred genome of cowpea IT97K-499-35 was developed by exploiting the synergies between single-molecule real-time sequencing, optical and genetic mapping, and an assembly reconciliation algorithm. A total of 519 Mb is included in the assembled sequences. Nearly half of the assembled sequence is composed of repetitive elements, which are enriched within recombination-poor pericentromeric regions. A comparative analysis of these elements suggests that genome size differences between Vigna species are mainly attributable to changes in the amount of Gypsy retrotransposons. Conversely, genes are more abundant in more distal, high-recombination regions of the chromosomes; there appears to be more duplication of genes within the NBS-LRR and the SAUR-like auxin superfamilies compared with other warm-season legumes that have been sequenced. A surprising outcome is the identification of an inversion of 4.2 Mb among landraces and cultivars, which includes a gene that has been associated in other plants with interactions with the parasitic weed Striga gesnerioides. The genome sequence facilitated the identification of a putative syntelog for multiple organ gigantism in legumes. A revised numbering system has been adopted for cowpea chromosomes based on synteny with common bean (Phaseolus vulgaris). An estimate of nuclear genome size of 640.6 Mbp based on cytometry is presented.Peer reviewe

Jukuri

Crossref

eScholarship - University of California

HAL-INSA Toulouse

Helsingin yliopiston digitaalinen arkisto

ProdInra

Recommended from our members

The genome of cowpea (Vigna unguiculata [L.] Walp.).

Author: Alhakami Hind
Cannon Steven B
Close Timothy J
Doležel Jaroslav
Farmer Andrew D
Hasan Abid Md
Hokin Samuel A
Liang Qihua
Lo Sassoum
Lonardi Stefano
Luo Ming-Cheng
Muñoz-Amatriaín María
Ndeve Arsenio
Ounit Rachid
Roberts Philip A
Santos Jansen RP
Schulman Alan H
Shu Shengqiang
Tanskanen Jaakko
Verdier Jerome
Vrána Jan
Wanamaker Steve I
Zhu Tingting
Publication venue: eScholarship, University of California
Publication date: 01/06/2019
Field of study

Cowpea (Vigna unguiculata [L.] Walp.) is a major crop for worldwide food and nutritional security, especially in sub-Saharan Africa, that is resilient to hot and drought-prone environments. An assembly of the single-haplotype inbred genome of cowpea IT97K-499-35 was developed by exploiting the synergies between single-molecule real-time sequencing, optical and genetic mapping, and an assembly reconciliation algorithm. A total of 519 Mb is included in the assembled sequences. Nearly half of the assembled sequence is composed of repetitive elements, which are enriched within recombination-poor pericentromeric regions. A comparative analysis of these elements suggests that genome size differences between Vigna species are mainly attributable to changes in the amount of Gypsy retrotransposons. Conversely, genes are more abundant in more distal, high-recombination regions of the chromosomes; there appears to be more duplication of genes within the NBS-LRR and the SAUR-like auxin superfamilies compared with other warm-season legumes that have been sequenced. A surprising outcome is the identification of an inversion of 4.2 Mb among landraces and cultivars, which includes a gene that has been associated in other plants with interactions with the parasitic weed Striga gesnerioides. The genome sequence facilitated the identification of a putative syntelog for multiple organ gigantism in legumes. A revised numbering system has been adopted for cowpea chromosomes based on synteny with common bean (Phaseolus vulgaris). An estimate of nuclear genome size of 640.6 Mbp based on cytometry is presented

eScholarship - University of California

Recommended from our members

Genome resources for climate-resilient cowpea, an essential crop for food security.

Author: Alhakami Hind
Alpert Matthew
Atokple Ibrahim
Barkley Noelle A
Batieno Benoit J
Boukar Ousmane
Bozdag Serdar
Cisse Ndiaga
Close Timothy J
Drabo Issa
Ehlers Jeffrey D
Farmer Andrew
Fatokun Christian
Gu Yong Q
Guo Yi-Ning
Huynh Bao-Lam
Jackson Scott A
Kusi Francis
Lawley Cynthia T
Lonardi Stefano
Lucas Mitchell R
Luo MingCheng
Ma Yaqin
Mirebrahim Hamid
Muñoz-Amatriaín María
Roberts Philip A
Timko Michael P
Wanamaker Steve I
Wu Jiajie
Xu Pei
You Frank
Publication venue: eScholarship, University of California
Publication date: 01/03/2017
Field of study

Cowpea (Vigna unguiculata L. Walp.) is a legume crop that is resilient to hot and drought-prone climates, and a primary source of protein in sub-Saharan Africa and other parts of the developing world. However, genome resources for cowpea have lagged behind most other major crops. Here we describe foundational genome resources and their application to the analysis of germplasm currently in use in West African breeding programs. Resources developed from the African cultivar IT97K-499-35 include a whole-genome shotgun (WGS) assembly, a bacterial artificial chromosome (BAC) physical map, and assembled sequences from 4355 BACs. These resources and WGS sequences of an additional 36 diverse cowpea accessions supported the development of a genotyping assay for 51 128 SNPs, which was then applied to five bi-parental RIL populations to produce a consensus genetic map containing 37 372 SNPs. This genetic map enabled the anchoring of 100 Mb of WGS and 420 Mb of BAC sequences, an exploration of genetic diversity along each linkage group, and clarification of macrosynteny between cowpea and common bean. The SNP assay enabled a diversity analysis of materials from West African breeding programs. Two major subpopulations exist within those materials, one of which has significant parentage from South and East Africa and more diversity. There are genomic regions of high differentiation between subpopulations, one of which coincides with a cluster of nodulin genes. The new resources and knowledge help to define goals and accelerate the breeding of improved varieties to address food security issues related to limited-input small-holder farming and climate stress

eScholarship - University of California

A comparative evaluation of genome assembly reconciliation tools

Author: A Bankevich
A Giampetruzzi
A Gurevich
AH Wences
AJ Yañez
AP Florentino
AV Zimin
AV Zimin
AW Eastman
C Bartenhagen
C Vilo
DD Sommer
DR Zerbino
E Dordet-Frisoni
ES Wright
EW Myers
F Vezzi
G Narzisi
G Yao
H Chitsaz
H Dall’Agnol
H Hirakawa
H Soueidan
Hamid Mirebrahim
Hind Alhakami
J Clarke
J Eid
J Nijkamp
JA Rosenfeld
JL Argueso
JR Miller
JT Simpson
JT Simpson
KR Bradnam
L Mayela Soto-Jimenez
M Kolmogorov
M Pop
M Schartl
MC Walter
NJ Croucher
R Li
R Li
R Vicedomini
S Gnerre
S Kurtz
SH Lin
SL Salzberg
ST Ramírez-Puebla
Stefano Lonardi
T Magoc
TA Castoe
W Huang
X Huang
Y Peng
YM Jeong
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref