45 research outputs found

    Searching a bitstream in linear time for the longest substring of any given density

    Full text link
    Given an arbitrary bitstream, we consider the problem of finding the longest substring whose ratio of ones to zeroes equals a given value. The central result of this paper is an algorithm that solves this problem in linear time. The method involves (i) reformulating the problem as a constrained walk through a sparse matrix, and then (ii) developing a data structure for this sparse matrix that allows us to perform each step of the walk in amortised constant time. We also give a linear time algorithm to find the longest substring whose ratio of ones to zeroes is bounded below by a given value. Both problems have practical relevance to cryptography and bioinformatics.Comment: 22 pages, 19 figures; v2: minor edits and enhancement

    Social and scientific motivations to move beyond groups in allele frequencies: The TOPMed experience

    Get PDF
    For the genomics community, allele frequencies within defined groups (or “strata”) are useful across multiple research and clinical contexts. Benefits include allowing researchers to identify populations for replication or “look up” studies, enabling researchers to compare population-specific frequencies to validate findings, and facilitating assessment of variant pathogenicity in clinical contexts. However, there are potential concerns with stratified allele frequencies. These include potential re-identification (determining whether or not an individual participated in a given research study based on allele frequencies and individual-level genetic data), harm from associating stigmatizing variants with specific groups, potential reification of race as a biological rather than a socio-political category, and whether presenting stratified frequencies—and the downstream applications that this presentation enables—is consistent with participants’ informed consents. The NHLBI Trans-Omics for Precision Medicine (TOPMed) program considered the scientific and social implications of different approaches for adding stratified frequencies to the TOPMed BRAVO (Browse All Variants Online) variant server. We recommend a novel approach of presenting ancestry-specific allele frequencies using a statistical method based upon local genetic ancestry inference. Notably, this approach does not require grouping individuals by either predominant global ancestry or race/ethnicity and, therefore, mitigates re-identification and other concerns as the mixture distribution of ancestral allele frequencies varies across the genome. Here we describe our considerations and approach, which can assist other genomics research programs facing similar issues of how to define and present stratified frequencies in publicly available variant databases

    Lessons learned and recommendations for data coordination in collaborative research: The CSER consortium experience

    Get PDF
    Integrating data across heterogeneous research environments is a key challenge in multi-site, collaborative research projects. While it is important to allow for natural variation in data collection protocols across research sites, it is also important to achieve interoperability between datasets in order to reap the full benefits of collaborative work. However, there are few standards to guide the data coordination process from project conception to completion. In this paper, we describe the experiences of the Clinical Sequence Evidence-Generating Research (CSER) consortium Data Coordinating Center (DCC), which coordinated harmonized survey and genomic sequencing data from seven clinical research sites from 2020 to 2022. Using input from multiple consortium working groups and from CSER leadership, we first identify 14 lessons learned from CSER in the categories of communication, harmonization, informatics, compliance, and analytics. We then distill these lessons learned into 11 recommendations for future research consortia in the areas of planning, communication, informatics, and analytics. We recommend that planning and budgeting for data coordination activities occur as early as possible during consortium conceptualization and development to minimize downstream complications. We also find that clear, reciprocal, and continuous communication between consortium stakeholders and the DCC is equally important to maintaining a secure and centralized informatics ecosystem for pooling data. Finally, we discuss the importance of actively interrogating current approaches to data governance, particularly for research studies that straddle the research-clinical divide

    Arteriopathy diagnosis in childhood arterial ischemic stroke: results of the vascular effects of infection in pediatric stroke study.

    Get PDF
    Background and purposeAlthough arteriopathies are the most common cause of childhood arterial ischemic stroke, and the strongest predictor of recurrent stroke, they are difficult to diagnose. We studied the role of clinical data and follow-up imaging in diagnosing cerebral and cervical arteriopathy in children with arterial ischemic stroke.MethodsVascular effects of infection in pediatric stroke, an international prospective study, enrolled 355 cases of arterial ischemic stroke (age, 29 days to 18 years) at 39 centers. A neuroradiologist and stroke neurologist independently reviewed vascular imaging of the brain (mandatory for inclusion) and neck to establish a diagnosis of arteriopathy (definite, possible, or absent) in 3 steps: (1) baseline imaging alone; (2) plus clinical data; (3) plus follow-up imaging. A 4-person committee, including a second neuroradiologist and stroke neurologist, adjudicated disagreements. Using the final diagnosis as the gold standard, we calculated the sensitivity and specificity of each step.ResultsCases were aged median 7.6 years (interquartile range, 2.8-14 years); 56% boys. The majority (52%) was previously healthy; 41% had follow-up vascular imaging. Only 56 (16%) required adjudication. The gold standard diagnosis was definite arteriopathy in 127 (36%), possible in 34 (9.6%), and absent in 194 (55%). Sensitivity was 79% at step 1, 90% at step 2, and 94% at step 3; specificity was high throughout (99%, 100%, and 100%), as was agreement between reviewers (Îș=0.77, 0.81, and 0.78).ConclusionsClinical data and follow-up imaging help, yet uncertainty in the diagnosis of childhood arteriopathy remains. This presents a challenge to better understanding the mechanisms underlying these arteriopathies and designing strategies for prevention of childhood arterial ischemic stroke

    Customer emotions in service failure and recovery encounters

    Get PDF
    Emotions play a significant role in the workplace, and considerable attention has been given to the study of employee emotions. Customers also play a central function in organizations, but much less is known about customer emotions. This chapter reviews the growing literature on customer emotions in employee–customer interfaces with a focus on service failure and recovery encounters, where emotions are heightened. It highlights emerging themes and key findings, addresses the measurement, modeling, and management of customer emotions, and identifies future research streams. Attention is given to emotional contagion, relationships between affective and cognitive processes, customer anger, customer rage, and individual differences

    The Clinical Sequencing Evidence-Generating Research Consortium: Integrating Genomic Sequencing in Diverse and Medically Underserved Populations

    Get PDF
    The Clinical Sequencing Evidence-Generating Research (CSER) consortium, now in its second funding cycle, is investigating the effectiveness of integrating genomic (exome or genome) sequencing into the clinical care of diverse and medically underserved individuals in a variety of healthcare settings and disease states. The consortium comprises a coordinating center, six funded extramural clinical projects, and an ongoing National Human Genome Research Institute (NHGRI) intramural project. Collectively, these projects aim to enroll and sequence over 6,100 participants in four years. At least 60% of participants will be of non-European ancestry or from underserved settings, with the goal of diversifying the populations that are providing an evidence base for genomic medicine. Five of the six clinical projects are enrolling pediatric patients with various phenotypes. One of these five projects is also enrolling couples whose fetus has a structural anomaly, and the sixth project is enrolling adults at risk for hereditary cancer. The ongoing NHGRI intramural project has enrolled primarily healthy adults. Goals of the consortium include assessing the clinical utility of genomic sequencing, exploring medical follow up and cascade testing of relatives, and evaluating patient-provider-laboratory level interactions that influence the use of this technology. The findings from the CSER consortium will offer patients, healthcare systems, and policymakers a clearer understanding of the opportunities and challenges of providing genomic medicine in diverse populations and settings, and contribute evidence toward developing best practices for the delivery of clinically useful and cost-effective genomic sequencing in diverse healthcare settings

    Genome-wide association study identifies 30 Loci Associated with Bipolar Disorder

    Get PDF
    This paper is dedicated to the memory of Psychiatric Genomics Consortium (PGC) founding member and Bipolar disorder working group co-chair Pamela Sklar. We thank the participants who donated their time, experiences and DNA to this research, and to the clinical and scientific teams that worked with them. We are deeply indebted to the investigators who comprise the PGC. The views expressed are those of the authors and not necessarily those of any funding or regulatory body. Analyses were carried out on the NL Genetic Cluster Computer (http://www.geneticcluster.org ) hosted by SURFsara, and the Mount Sinai high performance computing cluster (http://hpc.mssm.edu).Bipolar disorder is a highly heritable psychiatric disorder. We performed a genome-wide association study including 20,352 cases and 31,358 controls of European descent, with follow-up analysis of 822 variants with P<1x10-4 in an additional 9,412 cases and 137,760 controls. Eight of the 19 variants that were genome-wide significant (GWS, p < 5x10-8) in the discovery GWAS were not GWS in the combined analysis, consistent with small effect sizes and limited power but also with genetic heterogeneity. In the combined analysis 30 loci were GWS including 20 novel loci. The significant loci contain genes encoding ion channels, neurotransmitter transporters and synaptic components. Pathway analysis revealed nine significantly enriched gene-sets including regulation of insulin secretion and endocannabinoid signaling. BDI is strongly genetically correlated with schizophrenia, driven by psychosis, whereas BDII is more strongly correlated with major depressive disorder. These findings address key clinical questions and provide potential new biological mechanisms for BD.This work was funded in part by the Brain and Behavior Research Foundation, Stanley Medical Research Institute, University of Michigan, Pritzker Neuropsychiatric Disorders Research Fund L.L.C., Marriot Foundation and the Mayo Clinic Center for Individualized Medicine, the NIMH Intramural Research Program; Canadian Institutes of Health Research; the UK Maudsley NHS Foundation Trust, NIHR, NRS, MRC, Wellcome Trust; European Research Council; German Ministry for Education and Research, German Research Foundation IZKF of MĂŒnster, Deutsche Forschungsgemeinschaft, ImmunoSensation, the Dr. Lisa-Oehler Foundation, University of Bonn; the Swiss National Science Foundation; French Foundation FondaMental and ANR; Spanish Ministerio de EconomĂ­a, CIBERSAM, Industria y Competitividad, European Regional Development Fund (ERDF), Generalitat de Catalunya, EU Horizon 2020 Research and Innovation Programme; BBMRI-NL; South-East Norway Regional Health Authority and Mrs. Throne-Holst; Swedish Research Council, Stockholm County Council, Söderström Foundation; Lundbeck Foundation, Aarhus University; Australia NHMRC, NSW Ministry of Health, Janette M O'Neil and Betty C Lynch

    Reproducibility in the absence of selective reporting : An illustration from large-scale brain asymmetry research

    Get PDF
    Altres ajuts: Max Planck Society (Germany).The problem of poor reproducibility of scientific findings has received much attention over recent years, in a variety of fields including psychology and neuroscience. The problem has been partly attributed to publication bias and unwanted practices such as p-hacking. Low statistical power in individual studies is also understood to be an important factor. In a recent multisite collaborative study, we mapped brain anatomical left-right asymmetries for regional measures of surface area and cortical thickness, in 99 MRI datasets from around the world, for a total of over 17,000 participants. In the present study, we revisited these hemispheric effects from the perspective of reproducibility. Within each dataset, we considered that an effect had been reproduced when it matched the meta-analytic effect from the 98 other datasets, in terms of effect direction and significance threshold. In this sense, the results within each dataset were viewed as coming from separate studies in an "ideal publishing environment," that is, free from selective reporting and p hacking. We found an average reproducibility rate of 63.2% (SD = 22.9%, min = 22.2%, max = 97.0%). As expected, reproducibility was higher for larger effects and in larger datasets. Reproducibility was not obviously related to the age of participants, scanner field strength, FreeSurfer software version, cortical regional measurement reliability, or regional size. These findings constitute an empirical illustration of reproducibility in the absence of publication bias or p hacking, when assessing realistic biological effects in heterogeneous neuroscience data, and given typically-used sample sizes
    corecore