52 research outputs found
Application of two machine learning algorithms to genetic association studies in the presence of covariates
BACKGROUND: Population-based investigations aimed at uncovering genotype-trait associations often involve high-dimensional genetic polymorphism data as well as information on multiple environmental and clinical parameters. Machine learning (ML) algorithms offer a straightforward analytic approach for selecting subsets of these inputs that are most predictive of a pre-defined trait. The performance of these algorithms, however, in the presence of covariates is not well characterized. METHODS AND RESULTS: In this manuscript, we investigate two approaches: Random Forests (RFs) and Multivariate Adaptive Regression Splines (MARS). Through multiple simulation studies, the performance under several underlying models is evaluated. An application to a cohort of HIV-1 infected individuals receiving anti-retroviral therapies is also provided. CONCLUSION: Consistent with more traditional regression modeling theory, our findings highlight the importance of considering the nature of underlying gene-covariate-trait relationships before applying ML algorithms, particularly when there is potential confounding or effect mediation
Project YES! Youth Engaging for Success: A randomized controlled trial testing a peer mentoring approach among HIV-positive adolescents and young adults in Ndola, Zambia
This research addresses the gaps in knowledge about how to best support adolescents and young adults transitioning to HIV self-management in the context of both child-focused and adult-focused HIV care settings. Johns Hopkins University, in partnership with the Arthur Davison Children’s Hospital, implemented this study through the USAID-funded Project SOAR (led by the Population Council)
Identification of broadly neutralizing antibody epitopes in the HIV-1 envelope glycoprotein using evolutionary models
Background: Identification of the epitopes targeted by antibodies that can neutralize diverse HIV-1 strains can provide important clues for the design of a preventative vaccine. Methods: We have developed a computational approach that can identify key amino acids within the HIV-1 envelope glycoprotein that influence sensitivity to broadly cross-neutralizing antibodies. Given a sequence alignment and neutralization titers for a panel of viruses, the method works by fitting a phylogenetic model that allows the amino acid frequencies at each site to depend on neutralization sensitivities. Sites at which viral evolution influences neutralization sensitivity were identified using Bayes factors (BFs) to compare the fit of this model to that of a null model in which sequences evolved independently of antibody sensitivity. Conformational epitopes were identified with a Metropolis algorithm that searched for a cluster of sites with large Bayes factors on the tertiary structure of the viral envelope. Results: We applied our method to ID50 neutralization data generated from seven HIV-1 subtype C serum samples with neutralization breadth that had been tested against a multi-clade panel of 225 pseudoviruses for which envelope sequences were also available. For each sample, between two and four sites were identified that were strongly associated with neutralization sensitivity (2ln(BF) > 6), a subset of which were experimentally confirmed using site-directed mutagenesis. Conclusions: Our results provide strong support for the use of evolutionary models applied to cross-sectional viral neutralization data to identify the epitopes of serum antibodies that confer neutralization breadth
Cerebral processing of voice gender studied using a continuous carryover fMRI design
Normal listeners effortlessly determine a person's gender by voice, but the cerebral mechanisms underlying this ability remain unclear. Here, we demonstrate 2 stages of cerebral processing during voice gender categorization. Using voice morphing along with an adaptation-optimized functional magnetic resonance imaging design, we found that secondary auditory cortex including the anterior part of the temporal voice areas in the right hemisphere responded primarily to acoustical distance with the previously heard stimulus. In contrast, a network of bilateral regions involving inferior prefrontal and anterior and posterior cingulate cortex reflected perceived stimulus ambiguity. These findings suggest that voice gender recognition involves neuronal populations along the auditory ventral stream responsible for auditory feature extraction, functioning in pair with the prefrontal cortex in voice gender perception
EPHA2 Polymorphisms and Age-Related Cataract in India
Objective: We investigated whether previously reported single nucleotide polymorphisms (SNPs) of EPHA2 in European studies are associated with cataract in India. Methods: We carried out a population-based genetic association study. We enumerated randomly sampled villages in two areas of north and south India to identify people aged 40 and over. Participants attended a clinical examination including lens photography and provided a blood sample for genotyping. Lens images were graded by the Lens Opacification Classification System (LOCS III). Cataract was defined as a LOCS III grade of nuclear >= 4, cortical >= 3, posterior sub-capsular (PSC) >= 2, or dense opacities or aphakia/pseudophakia in either eye. We genotyped SNPs rs3754334, rs7543472 and rs11260867 on genomic DNA extracted from peripheral blood leukocytes using TaqMan assays in an ABI 7900 real-time PCR. We used logistic regression with robust standard errors to examine the association between cataract and the EPHA2 SNPs, adjusting for age, sex and location. Results: 7418 participants had data on at least one of the SNPs investigated. Genotype frequencies of controls were in Hardy-Weinberg Equilibrium (p > 0.05). There was no association of rs3754334 with cataract or type of cataract. Minor allele homozygous genotypes of rs7543472 and rs11260867 compared to the major homozygote genotype were associated with cortical cataract, Odds ratio (OR) = 1.8, 95% Confidence Interval (CI) (1.1, 3.1) p = 0.03 and 2.9 (1.2, 7.1) p = 0.01 respectively, and with PSC cataract, OR = 1.5 (1.1, 2.2) p = 0.02 and 1.8 (0.9, 3.6) p = 0.07 respectively. There was no consistent association of SNPs with nuclear cataract or a combined variable of any type of cataract including operated cataract. Conclusions: Our results in the Indian population agree with previous studies of the association of EPHA2 variants with cortical cataracts. We report new findings for the association with PSC which is particularly prevalent in Indians
Neuromuscular disease genetics in under-represented populations: increasing data diversity
\ua9 The Author(s) 2023. Published by Oxford University Press on behalf of the Guarantors of Brain. Neuromuscular diseases (NMDs) affect ∼15 million people globally. In high income settings DNA-based diagnosis has transformed care pathways and led to gene-specific therapies. However, most affected families are in low-to-middle income countries (LMICs) with limited access to DNA-based diagnosis. Most (86%) published genetic data is derived from European ancestry. This marked genetic data inequality hampers understanding of genetic diversity and hinders accurate genetic diagnosis in all income settings. We developed a cloud-based transcontinental partnership to build diverse, deeply-phenotyped and genetically characterized cohorts to improve genetic architecture knowledge, and potentially advance diagnosis and clinical management. We connected 18 centres in Brazil, India, South Africa, Turkey, Zambia, Netherlands and the UK. We co-developed a cloud-based data solution and trained 17 international neurology fellows in clinical genomic data interpretation. Single gene and whole exome data were analysed via a bespoke bioinformatics pipeline and reviewed alongside clinical and phenotypic data in global webinars to inform genetic outcome decisions. We recruited 6001 participants in the first 43 months. Initial genetic analyses \u27solved\u27 or \u27possibly solved\u27 ∼56% probands overall. In-depth genetic data review of the four commonest clinical categories (limb girdle muscular dystrophy, inherited peripheral neuropathies, congenital myopathy/muscular dystrophies and Duchenne/Becker muscular dystrophy) delivered a ∼59% \u27solved\u27 and ∼13% \u27possibly solved\u27 outcome. Almost 29% of disease causing variants were novel, increasing diverse pathogenic variant knowledge. Unsolved participants represent a new discovery cohort. The dataset provides a large resource from under-represented populations for genetic and translational research. In conclusion, we established a remote transcontinental partnership to assess genetic architecture of NMDs across diverse populations. It supported DNA-based diagnosis, potentially enabling genetic counselling, care pathways and eligibility for gene-specific trials. Similar virtual partnerships could be adopted by other areas of global genomic neurological practice to reduce genetic data inequality and benefit patients globally
Neuromuscular disease genetics in under-represented populations: increasing data diversity
Neuromuscular diseases (NMDs) affect ∼15 million people globally. In high income settings DNA-based diagnosis has transformed care pathways and led to gene-specific therapies. However, most affected families are in low-to-middle income countries (LMICs) with limited access to DNA-based diagnosis. Most (86%) published genetic data is derived from European ancestry. This marked genetic data inequality hampers understanding of genetic diversity and hinders accurate genetic diagnosis in all income settings. We developed a cloud-based transcontinental partnership to build diverse, deeply-phenotyped and genetically characterized cohorts to improve genetic architecture knowledge, and potentially advance diagnosis and clinical management.
We connected 18 centres in Brazil, India, South Africa, Turkey, Zambia, Netherlands and the UK. We co-developed a cloud-based data solution and trained 17 international neurology fellows in clinical genomic data interpretation. Single gene and whole exome data were analysed via a bespoke bioinformatics pipeline and reviewed alongside clinical and phenotypic data in global webinars to inform genetic outcome decisions.
We recruited 6001 participants in the first 43 months. Initial genetic analyses ‘solved’ or ‘possibly solved’ ∼56% probands overall. In-depth genetic data review of the four commonest clinical categories (limb girdle muscular dystrophy, inherited peripheral neuropathies, congenital myopathy/muscular dystrophies and Duchenne/Becker muscular dystrophy) delivered a ∼59% ‘solved’ and ∼13% ‘possibly solved’ outcome. Almost 29% of disease causing variants were novel, increasing diverse pathogenic variant knowledge. Unsolved participants represent a new discovery cohort. The dataset provides a large resource from under-represented populations for genetic and translational research.
In conclusion, we established a remote transcontinental partnership to assess genetic architecture of NMDs across diverse populations. It supported DNA-based diagnosis, potentially enabling genetic counselling, care pathways and eligibility for gene-specific trials. Similar virtual partnerships could be adopted by other areas of global genomic neurological practice to reduce genetic data inequality and benefit patients globally
Impact of a multi-strategy community intervention to reduce maternal and child health inequalities in India : A qualitative study in Haryana
A multi-strategy community intervention, known as National Rural Health Mission (NRHM), was implemented in India from 2005 to 2012. By improving the availability of and access to better-quality healthcare, the aim was to reduce maternal and child health (MCH) inequalities. This study was planned to explore the perceptions and beliefs of stakeholders about extent of implementation and effectiveness of NRHM's health sector plans in improving MCH status and reducing inequalities. A total of 33 in-depth interviews (n = 33) with program managers, community representatives, mothers and 8 focus group discussions (n = 42) with health service providers were conducted from September to December 2013, in Haryana, post NRHM. Using NVivo software (version 9), an inductive applied thematic analysis was done based upon grounded theory, program theory of change and a framework approach. Almost all the participants reported that there was an improvement in overall health infrastructure through an increased availability of accredited social health activists, free ambulance services, and free treatment facilities in rural areas. This had increased the demand and utilization of MCH services, especially for those related to institutional delivery, even by the poor families. Service providers felt that acute shortage of human resources was a major health system level barrier. District-specific individual, community, and socio-political level barriers were also observed. Overall program managers, service providers and community representatives believed that NRHM had a role in improving MCH outcomes and in reduction of geographical and socioeconomic inequalities, through improvement in accessibility, availability and affordability of the MCH services in the rural areas and for the poor. Any reduction in gender-based inequalities, however, was linked to the adoption of small family sizes and an increase in educational levels
- …