28 research outputs found
Constitutive overexpression of a novel 21 kDa protein by Hodgkin Lymphoma and Aggressive Non-Hodgkin Lymphomas
© 2008 Zhou et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution Licens
Sub-megabase resolution tiling (SMRT) array-based comparative genomic hybridization profiling reveals novel gains and losses of chromosomal regions in Hodgkin Lymphoma and Anaplastic Large Cell Lymphoma cell lines
<p>Abstract</p> <p>Background</p> <p>Hodgkin lymphoma (HL) and Anaplastic Large Cell Lymphoma (ALCL), are forms of malignant lymphoma defined by unique morphologic, immunophenotypic, genotypic, and clinical characteristics, but both overexpress CD30. We used sub-megabase resolution tiling (SMRT) array-based comparative genomic hybridization to screen HL-derived cell lines (KMH2 and L428) and ALCL cell lines (DEL and SR-786) in order to identify disease-associated gene copy number gains and losses.</p> <p>Results</p> <p>Significant copy number gains and losses were observed on several chromosomes in all four cell lines. Assessment of copy number alterations with 26,819 DNA segments identified an average of 20 genetic alterations. Of the recurrent minimally altered regions identified, 11 (55%) were within previously published regions of chromosomal alterations in HL and ALCL cell lines while 9 (45%) were novel alterations not previously reported. HL cell lines L428 and KMH2 shared gains in chromosome cytobands 2q23.1-q24.2, 7q32.2-q36.3, 9p21.3-p13.3, 12q13.13-q14.1, and losses in 13q12.13-q12.3, and 18q21.32-q23. ALCL cell lines SR-786 and DEL, showed gains in cytobands 5p15.32-p14.3, 20p12.3-q13.11, and 20q13.2-q13.32. Both pairs of HL and ALCL cell lines showed losses in 18q21.32-18q23.</p> <p>Conclusion</p> <p>This study is considered to be the first one describing HL and ALCL cell line genomes at sub-megabase resolution. This high-resolution analysis allowed us to propose novel candidate target genes that could potentially contribute to the pathogenesis of HL and ALCL. FISH was used to confirm the amplification of all three isoforms of the trypsin gene (PRSS1/PRSS2/PRSS3) in KMH2 and L428 (HL) and DEL (ALCL) cell lines. These are novel findings that have not been previously reported in the lymphoma literature, and opens up an entirely new area of research that has not been previously associated with lymphoma biology. The findings raise interesting possibilities about the role of signaling pathways triggered by membrane associated serine proteases in HL and aggressive NHL, similar to those described in epithelial tumors.</p
Constitutively overexpressed 21 kDa protein in Hodgkin lymphoma and aggressive non-Hodgkin lymphomas identified as cytochrome B5b (CYB5B)
<p>Abstract</p> <p>Background</p> <p>We have previously reported a novel constitutively overexpressed 21 kDa protein in Hodgkin Lymphoma (HL) and aggressive Non-Hodgkin Lymphomas (NHL). The objective of the current study was to 1) identify this protein using two independent methods, 2) study the expression of the protein and its encoding mRNA in reactive lymph nodes, normal lymphocytes and CD34+ bone marrow precursor cells, 3) analyse patterns of expression of the protein in tissue microarrays assembled from a large number of diagnostic clinical biopsies from patients with HL, and 4) determine the copy number variation and mutation status of the encoding gene in HL cell lines.</p> <p>Results</p> <p>Peptide sequencing by LC-MS/MS and protein identification by protein array screening identified a single protein, CYB5B. No mutations were detected in the <it>CYB5B </it>gene in HL cell lines. Quantitative PCR showed <it>CYB5B </it>gene expression was increased in HL and NHL cell lines. Array CGH using a submegabase resolution tiling array revealed gains in the <it>CYB5B </it>locus in HL cell lines KMH2 and L428. Membrane expression was seen in Reed-Sternberg cells in clinical biopsies from patients with HL but not in reactive lymph nodes. Bone marrow CD34+ precursor cells were CYB5B negative on the cell surface. RT-PCR assays of RNA extracted from T and B cell enriched fractions obtained from normal peripheral blood mononuclear cells, reactive lymph nodes, tonsils and normal bone marrow samples showed no evidence of increased mRNA levels of <it>CYB5B </it>in comparison to housekeeping gene <it>GAPDH</it>.</p> <p>Conclusions</p> <p>The 21 kDa protein overexpressed in HL and aggressive NHL is identical to CYB5B. <it>CYB5B </it>gene expression is increased in a subset of HL and NHL cell lines tested. This is associated with <it>CYB5B </it>gene amplification in HL cell lines KMH2 and L428. CYB5B may be a potential target for antibody-based therapy of HL and aggressive NHL as although cytoplasmic expression is present in reactive lymphocytes, it is not expressed on the cell surface of non-neoplastic lymphocytes or bone marrow precursor cells.</p
Dense sampling of ethnic groups within African countries reveals fine-scale genetic structure and extensive historical admixture
Previous studies have highlighted how African genomes have been shaped by a complex series of historical events. Despite this, genome-wide data have only been obtained from a small proportion of present-day ethnolinguistic groups. By analyzing new autosomal genetic variation data of 1333 individuals from over 150 ethnic groups from Cameroon, Republic of the Congo, Ghana, Nigeria, and Sudan, we demonstrate a previously underappreciated fine-scale level of genetic structure within these countries, for example, correlating with historical polities in western Cameroon. By comparing genetic variation patterns among populations, we infer that many northern Cameroonian and Sudanese groups share genetic links with multiple geographically disparate populations, likely resulting from long-distance migrations. In Ghana and Nigeria, we infer signatures of intermixing dated to over 2000 years ago, corresponding to reports of environmental transformations possibly related to climate change. We also infer recent intermixing signals in multiple African populations, including Congolese, that likely relate to the expansions of Bantu language-speaking peoples
Design considerations for workflow management systems use in production genomics research and the clinic
Abstract The changing landscape of genomics research and clinical practice has created a need for computational pipelines capable of efficiently orchestrating complex analysis stages while handling large volumes of data across heterogeneous computational environments. Workflow Management Systems (WfMSs) are the software components employed to fill this gap. This work provides an approach and systematic evaluation of key features of popular bioinformatics WfMSs in use today: Nextflow, CWL, and WDL and some of their executors, along with Swift/T, a workflow manager commonly used in high-scale physics applications. We employed two use cases: a variant-calling genomic pipeline and a scalability-testing framework, where both were run locally, on an HPC cluster, and in the cloud. This allowed for evaluation of those four WfMSs in terms of language expressiveness, modularity, scalability, robustness, reproducibility, interoperability, ease of development, along with adoption and usage in research labs and healthcare settings. This article is trying to answer, which WfMS should be chosen for a given bioinformatics application regardless of analysis type?. The choice of a given WfMS is a function of both its intrinsic language and engine features. Within bioinformatics, where analysts are a mix of dry and wet lab scientists, the choice is also governed by collaborations and adoption within large consortia and technical support provided by the WfMS team/community. As the community and its needs continue to evolve along with computational infrastructure, WfMSs will also evolve, especially those with permissive licenses that allow commercial use. In much the same way as the dataflow paradigm and containerization are now well understood to be very useful in bioinformatics applications, we will continue to see innovations of tools and utilities for other purposes, like big data technologies, interoperability, and provenance
The extent and impact of variation in ADME genes in sub-Saharan African populations
Investigating variation in genes involved in the absorption, distribution, metabolism, and excretion (ADME) of drugs are key to characterizing pharmacogenomic (PGx) relationships. ADME gene variation is relatively well characterized in European and Asian populations, but data from African populations are under-studied—which has implications for drug safety and effective use in Africa
Development of Bioinformatics Infrastructure for Genomics Research:
Although pockets of bioinformatics excellence have developed in Africa, generally, large-scale genomic data analysis has been limited by the availability of expertise and infrastructure. H3ABioNet, a pan-African bioinformatics network, was established to build capacity specifically to enable H3Africa (Human Heredity and Health in Africa) researchers to analyze their data in Africa. Since the inception of the H3Africa initiative, H3ABioNet's role has evolved in response to changing needs from the consortium and the African bioinformatics community
African Genomic Medicine Portal: A Web Portal for Biomedical Applications
Genomics data are currently being produced at unprecedented rates, resulting in increased knowledge discovery and submission to public data repositories. Despite these advances, genomic information on African-ancestry populations remains significantly low compared with European- and Asian-ancestry populations. This information is typically segmented across several different biomedical data repositories, which often lack sufficient fine-grained structure and annotation to account for the diversity of African populations, leading to many challenges related to the retrieval, representation and findability of such information. To overcome these challenges, we developed the African Genomic Medicine Portal (AGMP), a database that contains metadata on genomic medicine studies conducted on African-ancestry populations. The metadata is curated from two public databases related to genomic medicine, PharmGKB and DisGeNET. The metadata retrieved from these source databases were limited to genomic variants that were associated with disease aetiology or treatment in the context of African-ancestry populations. Over 2000 variants relevant to populations of African ancestry were retrieved. Subsequently, domain experts curated and annotated additional information associated with the studies that reported the variants, including geographical origin, ethnolinguistic group, level of association significance and other relevant study information, such as study design and sample size, where available. The AGMP functions as a dedicated resource through which to access African-specific information on genomics as applied to health research, through querying variants, genes, diseases and drugs. The portal and its corresponding technical documentation, implementation code and content are publicly available
African Genomic Medicine Portal: A Web Portal for Biomedical Applications
Genomics data are currently being produced at unprecedented rates, resulting in increased knowledge discovery and submission to public data repositories. Despite these advances, genomic information on African-ancestry populations remains significantly low compared with European- and Asian-ancestry populations. This information is typically segmented across several different biomedical data repositories, which often lack sufficient fine-grained structure and annotation to account for the diversity of African populations, leading to many challenges related to the retrieval, representation and findability of such information. To overcome these challenges, we developed the African Genomic Medicine Portal (AGMP), a database that contains metadata on genomic medicine studies conducted on African-ancestry populations. The metadata is curated from two public databases related to genomic medicine, PharmGKB and DisGeNET. The metadata retrieved from these source databases were limited to genomic variants that were associated with disease aetiology or treatment in the context of African-ancestry populations. Over 2000 variants relevant to populations of African ancestry were retrieved. Subsequently, domain experts curated and annotated additional information associated with the studies that reported the variants, including geographical origin, ethnolinguistic group, level of association significance and other relevant study information, such as study design and sample size, where available. The AGMP functions as a dedicated resource through which to access African-specific information on genomics as applied to health research, through querying variants, genes, diseases and drugs. The portal and its corresponding technical documentation, implementation code and content are publicly available
Developing reproducible bioinformatics analysis workflows for heterogeneous computing environments to support African genomics
Background: The Pan-African bioinformatics network, H3ABioNet, comprises 27 research institutions in 17 African
countries. H3ABioNet is part of the Human Health and Heredity in Africa program (H3Africa), an African-led research
consortium funded by the US National Institutes of Health and the UK Wellcome Trust, aimed at using genomics to
study and improve the health of Africans. A key role of H3ABioNet is to support H3Africa projects by building
bioinformatics infrastructure such as portable and reproducible bioinformatics workflows for use on heterogeneous
African computing environments. Processing and analysis of genomic data is an example of a big data application
requiring complex interdependent data analysis workflows. Such bioinformatics workflows take the primary and
secondary input data through several computationally-intensive processing steps using different software packages,
where some of the outputs form inputs for other steps. Implementing scalable, reproducible, portable and
easy-to-use workflows is particularly challenging.
Results: H3ABioNet has built four workflows to support (1) the calling of variants from high-throughput sequencing
data; (2) the analysis of microbial populations from 16S rDNA sequence data; (3) genotyping and genome-wide
association studies; and (4) single nucleotide polymorphism imputation. A week-long hackathon was organized in
August 2016 with participants from six African bioinformatics groups, and US and European collaborators. Two of the
workflows are built using the Common Workflow Language framework (CWL) and two using Nextflow. All the
workflows are containerized for improved portability and reproducibility using Docker, and are publicly available for
use by members of the H3Africa consortium and the international research community.
Conclusion: The H3ABioNet workflows have been implemented in view of offering ease of use for the end user and
high levels of reproducibility and portability, all while following modern state of the art bioinformatics data processing
protocols. The H3ABioNet workflows will service the H3Africa consortium projects and are currently in use.
All four workflows are also publicly available for research scientists worldwide to use and adapt for their respective
needs. The H3ABioNet workflows will help develop bioinformatics capacity and assist genomics research within Africa
and serve to increase the scientific output of H3Africa and its Pan-African Bioinformatics Network