119 research outputs found

    Interpreting microarray experiments via co-expressed gene groups analysis

    Get PDF
    International audienceMicroarray technology produces vast amounts of data by measuring simultaneously the expression levels of thousands of genes under hundreds of biological conditions. Nowadays, one of the principal challenges in bioinformatics is the interpretation of huge data using different sources of information. We propose a novel data analysis method named CGGA (Co-expressed Gene Groups Analysis) that automatically finds groups of genes that are functionally enriched, i.e. have the same functional annotations, and are co- expressed. CGGA automatically integrates the information of microarrays, i.e. gene expression profiles, with the functional annotations of the genes obtained by the genome-wide information sources such as Gene Ontology (GO)1. By applying CGGA to well-known microarray experiments, we have identified the principal functionally enriched and co-expressed gene groups, and we have shown that this approach enhances and accelerates the interpretation of DNA microarray experiments

    Electronic health records in outpatient clinics: Perspectives of third year medical students

    Get PDF
    Abstract Background United States academic medical centers are increasingly incorporating electronic health records (EHR) into teaching settings. We report third year medical students' attitudes towards clinical learning using the electronic health record in ambulatory primary care clinics. Methods In academic year 2005–06, 60 third year students were invited to complete a questionnaire after finishing the required Ambulatory Medicine/Family Medicine clerkship. The authors elicited themes for the questionnaire by asking a focus group of third year students how using the EHR had impacted their learning. Five themes emerged: organization of information, access to online resources, prompts from the EHR, personal performance (charting and presenting), and communication with patients and preceptors. The authors added a sixth theme: impact on student and patient follow-up. The authors created a 21-item questionnaire, based on these themes that used a 5-point Likert scale from "Strongly Agree" to "Strongly Disagree". The authors emailed an electronic survey link to each consenting student immediately following their clerkship experience in Ambulatory Medicine/Family Medicine. Results 33 of 53 consenting students (62%) returned completed questionnaires. Most students liked the EHR's ability to organize information, with 70% of students responding that essential information was easier to find electronically. Only 36% and 33% of students reported accessing online patient information or clinical guidelines more often when using the EHR than when using paper charts. Most students (72%) reported asking more history questions due to EHR prompts, and 39% ordered more clinical preventive services. Most students (69%) reported that the EHR improved their documentation. 39% of students responded that they received more feedback on their EHR notes compared to paper chart notes. Only 64% of students were satisfied with the doctor-patient communication with the EHR, and 48% stated they spent less time looking at the patient. Conclusion Third year medical students reported generally positive attitudes towards using the EHR in the ambulatory setting. They reported receiving more feedback on their electronic charts than on paper charts. However, students reported significant concerns about the potential impact of the EHR on their ability to conduct the doctor-patient encounter.Peer Reviewe

    Identification and Analysis of Co-Occurrence Networks with NetCutter

    Get PDF
    BACKGROUND: Co-occurrence analysis is a technique often applied in text mining, comparative genomics, and promoter analysis. The methodologies and statistical models used to evaluate the significance of association between co-occurring entities are quite diverse, however. METHODOLOGY/PRINCIPAL FINDINGS: We present a general framework for co-occurrence analysis based on a bipartite graph representation of the data, a novel co-occurrence statistic, and software performing co-occurrence analysis as well as generation and analysis of co-occurrence networks. We show that the overall stringency of co-occurrence analysis depends critically on the choice of the null-model used to evaluate the significance of co-occurrence and find that random sampling from a complete permutation set of the bipartite graph permits co-occurrence analysis with optimal stringency. We show that the Poisson-binomial distribution is the most natural co-occurrence probability distribution when vertex degrees of the bipartite graph are variable, which is usually the case. Calculation of Poisson-binomial P-values is difficult, however. Therefore, we propose a fast bi-binomial approximation for calculation of P-values and show that this statistic is superior to other measures of association such as the Jaccard coefficient and the uncertainty coefficient. Furthermore, co-occurrence analysis of more than two entities can be performed using the same statistical model, which leads to increased signal-to-noise ratios, robustness towards noise, and the identification of implicit relationships between co-occurring entities. Using NetCutter, we identify a novel protein biosynthesis related set of genes that are frequently coordinately deregulated in human cancer related gene expression studies. NetCutter is available at http://bio.ifom-ieo-campus.it/NetCutter/). CONCLUSION: Our approach can be applied to any set of categorical data where co-occurrence analysis might reveal functional relationships such as clinical parameters associated with cancer subtypes or SNPs associated with disease phenotypes. The stringency of our approach is expected to offer an advantage in a variety of applications

    The eMERGE Network: A consortium of biorepositories linked to electronic medical records data for conducting genomic studies

    Get PDF
    <p>Abstract</p> <p>Introduction</p> <p>The eMERGE (electronic MEdical Records and GEnomics) Network is an NHGRI-supported consortium of five institutions to explore the utility of DNA repositories coupled to Electronic Medical Record (EMR) systems for advancing discovery in genome science. eMERGE also includes a special emphasis on the ethical, legal and social issues related to these endeavors.</p> <p>Organization</p> <p>The five sites are supported by an Administrative Coordinating Center. Setting of network goals is initiated by working groups: (1) Genomics, (2) Informatics, and (3) Consent & Community Consultation, which also includes active participation by investigators outside the eMERGE funded sites, and (4) Return of Results Oversight Committee. The Steering Committee, comprised of site PIs and representatives and NHGRI staff, meet three times per year, once per year with the External Scientific Panel.</p> <p>Current progress</p> <p>The primary site-specific phenotypes for which samples have undergone genome-wide association study (GWAS) genotyping are cataract and HDL, dementia, electrocardiographic QRS duration, peripheral arterial disease, and type 2 diabetes. A GWAS is also being undertaken for resistant hypertension in ≈2,000 additional samples identified across the network sites, to be added to data available for samples already genotyped. Funded by ARRA supplements, secondary phenotypes have been added at all sites to leverage the genotyping data, and hypothyroidism is being analyzed as a cross-network phenotype. Results are being posted in dbGaP. Other key eMERGE activities include evaluation of the issues associated with cross-site deployment of common algorithms to identify cases and controls in EMRs, data privacy of genomic and clinically-derived data, developing approaches for large-scale meta-analysis of GWAS data across five sites, and a community consultation and consent initiative at each site.</p> <p>Future activities</p> <p>Plans are underway to expand the network in diversity of populations and incorporation of GWAS findings into clinical care.</p> <p>Summary</p> <p>By combining advanced clinical informatics, genome science, and community consultation, eMERGE represents a first step in the development of data-driven approaches to incorporate genomic information into routine healthcare delivery.</p

    Cross-Sectional Analysis of Late HAART Initiation in Latin America and the Caribbean: Late Testers and Late Presenters

    Get PDF
    Background: Starting HAART in a very advanced stage of disease is assumed to be the most prevalent form of initiation in HIV-infected subjects in developing countries. Data from Latin America and the Caribbean is still lacking. Our main objective was to determine the frequency, risk factors and trends in time for being late HAART initiator (LHI) in this region. Methodology: Cross-sectional analysis from 9817 HIV-infected treatment-naive patients initiating HAART at 6 sites (Argentina, Chile, Haiti, Honduras, Peru and Mexico) from October 1999 to July 2010. LHI had CD4+^+ count \leq200cells/mm3^3 prior to HAART. Late testers (LT) were those LHI who initiated HAART within 6 months of HIV diagnosis. Late presenters (LP) initiated after 6 months of diagnosis. Prevalence, risk factors and trends over time were analyzed. Principal Findings: Among subjects starting HAART (n = 9817) who had baseline CD4+^+ available (n = 8515), 76% were LHI: Argentina (56%[95%CI:52–59]), Chile (80%[95%CI:77–82]), Haiti (76%[95%CI:74–77]), Honduras (91%[95%CI:87–94]), Mexico (79%[95%CI:75–83]), Peru (86%[95%CI:84–88]). The proportion of LHI statistically changed over time (except in Honduras) (p0.02p\leq0.02; Honduras p = 0.7), with a tendency towards lower rates in recent years. Males had increased risk of LHI in Chile, Haiti, Peru, and in the combined site analyses (CSA). Older patients were more likely LHI in Argentina and Peru (OR 1.21 per +10-year of age, 95%CI:1.02–1.45; OR 1.20, 95%CI:1.02–1.43; respectively), but not in CSA (OR 1.07, 95%CI:0.94–1.21). Higher education was associated with decreased risk for LHI in Chile (OR 0.92 per +1-year of education, 95%CI:0.87–0.98) (similar trends in Mexico, Peru, and CSA). LHI with date of HIV-diagnosis available, 55% were LT and 45% LP. Conclusion: LHI was highly prevalent in CCASAnet sites, mostly due to LT; the main risk factors associated were being male and older age. Earlier HIV-diagnosis and earlier treatment initiation are needed to maximize benefits from HAART in the region

    Building Disease-Specific Drug-Protein Connectivity Maps from Molecular Interaction Networks and PubMed Abstracts

    Get PDF
    The recently proposed concept of molecular connectivity maps enables researchers to integrate experimental measurements of genes, proteins, metabolites, and drug compounds under similar biological conditions. The study of these maps provides opportunities for future toxicogenomics and drug discovery applications. We developed a computational framework to build disease-specific drug-protein connectivity maps. We integrated gene/protein and drug connectivity information based on protein interaction networks and literature mining, without requiring gene expression profile information derived from drug perturbation experiments on disease samples. We described the development and application of this computational framework using Alzheimer's Disease (AD) as a primary example in three steps. First, molecular interaction networks were incorporated to reduce bias and improve relevance of AD seed proteins. Second, PubMed abstracts were used to retrieve enriched drug terms that are indirectly associated with AD through molecular mechanistic studies. Third and lastly, a comprehensive AD connectivity map was created by relating enriched drugs and related proteins in literature. We showed that this molecular connectivity map development approach outperformed both curated drug target databases and conventional information retrieval systems. Our initial explorations of the AD connectivity map yielded a new hypothesis that diltiazem and quinidine may be investigated as candidate drugs for AD treatment. Molecular connectivity maps derived computationally can help study molecular signature differences between different classes of drugs in specific disease contexts. To achieve overall good data coverage and quality, a series of statistical methods have been developed to overcome high levels of data noise in biological networks and literature mining results. Further development of computational molecular connectivity maps to cover major disease areas will likely set up a new model for drug development, in which therapeutic/toxicological profiles of candidate drugs can be checked computationally before costly clinical trials begin

    Integrated Personal Health Records: Transformative Tools for Consumer-Centric Care

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Integrated personal health records (PHRs) offer significant potential to stimulate transformational changes in health care delivery and self-care by patients. In 2006, an invitational roundtable sponsored by Kaiser Permanente Institute, the American Medical Informatics Association, and the Agency for Healthcare Research and Quality was held to identify the transformative potential of PHRs, as well as barriers to realizing this potential and a framework for action to move them closer to the health care mainstream. This paper highlights and builds on the insights shared during the roundtable.</p> <p>Discussion</p> <p>While there is a spectrum of dominant PHR models, (standalone, tethered, integrated), the authors state that only the integrated model has true transformative potential to strengthen consumers' ability to manage their own health care. Integrated PHRs improve the quality, completeness, depth, and accessibility of health information provided by patients; enable facile communication between patients and providers; provide access to health knowledge for patients; ensure portability of medical records and other personal health information; and incorporate auto-population of content. Numerous factors impede widespread adoption of integrated PHRs: obstacles in the health care system/culture; issues of consumer confidence and trust; lack of technical standards for interoperability; lack of HIT infrastructure; the digital divide; uncertain value realization/ROI; and uncertain market demand. Recent efforts have led to progress on standards for integrated PHRs, and government agencies and private companies are offering different models to consumers, but substantial obstacles remain to be addressed. Immediate steps to advance integrated PHRs should include sharing existing knowledge and expanding knowledge about them, building on existing efforts, and continuing dialogue among public and private sector stakeholders.</p> <p>Summary</p> <p>Integrated PHRs promote active, ongoing patient collaboration in care delivery and decision making. With some exceptions, however, the integrated PHR model is still a theoretical framework for consumer-centric health care. The authors pose questions that need to be answered so that the field can move forward to realize the potential of integrated PHRs. How can integrated PHRs be moved from concept to practical application? Would a coordinating body expedite this progress? How can existing initiatives and policy levers serve as catalysts to advance integrated PHRs?</p

    Regional labour market mobility. A network analysis of inter-firm relatedness

    Get PDF
    Labour market rigidity is known to hamper the proper adjustment of an economy, thus making it less resilient to shocks. This paper investigates the characteristics and resilience of the regional labour flow network in Veneto, a region famous for its industrial districts and the expertise of its workforce. A unique database of inter-firm worker mobility is used and the made-in-Italy relatedness to other industries is quantified. Descriptive results suggest that permanent-contract workers are more mobile within-sector than fixed-term contractors. The latter are more mobile across sectors. A finer disaggregation of the made-in-Italy industries shows that textile, food and woodwork are highly related to leisure-retail, logistics-wholesale and agriculture. These results can orient policy-making in getting faster labour reallocation. Network analysis establishes a number of stylised facts about labour flow networks, in particular, a hierarchical organisation of flows and a preference for workers to move from low-connected to high-connected firms and vice-versa, i.e. disassortativity. Unlike previous research, this paper identifies clusters of a non-spatial nature, that is, based on the intensity of labour flows. Regression analysis shows that labour mobility, both in and out, is beneficial for firms. However, being located inside labour clusters negatively affects firm performance. Interestingly, when these clusters include MNEs, the firm benefits. These results combined suggest that variety of connections prevails over standardisation

    Biomedical informatics and translational medicine

    Get PDF
    Biomedical informatics involves a core set of methodologies that can provide a foundation for crossing the "translational barriers" associated with translational medicine. To this end, the fundamental aspects of biomedical informatics (e.g., bioinformatics, imaging informatics, clinical informatics, and public health informatics) may be essential in helping improve the ability to bring basic research findings to the bedside, evaluate the efficacy of interventions across communities, and enable the assessment of the eventual impact of translational medicine innovations on health policies. Here, a brief description is provided for a selection of key biomedical informatics topics (Decision Support, Natural Language Processing, Standards, Information Retrieval, and Electronic Health Records) and their relevance to translational medicine. Based on contributions and advancements in each of these topic areas, the article proposes that biomedical informatics practitioners ("biomedical informaticians") can be essential members of translational medicine teams

    Meta-analysis of type 2 Diabetes in African Americans Consortium

    Get PDF
    Type 2 diabetes (T2D) is more prevalent in African Americans than in Europeans. However, little is known about the genetic risk in African Americans despite the recent identification of more than 70 T2D loci primarily by genome-wide association studies (GWAS) in individuals of European ancestry. In order to investigate the genetic architecture of T2D in African Americans, the MEta-analysis of type 2 DIabetes in African Americans (MEDIA) Consortium examined 17 GWAS on T2D comprising 8,284 cases and 15,543 controls in African Americans in stage 1 analysis. Single nucleotide polymorphisms (SNPs) association analysis was conducted in each study under the additive model after adjustment for age, sex, study site, and principal components. Meta-analysis of approximately 2.6 million genotyped and imputed SNPs in all studies was conducted using an inverse variance-weighted fixed effect model. Replications were performed to follow up 21 loci in up to 6,061 cases and 5,483 controls in African Americans, and 8,130 cases and 38,987 controls of European ancestry. We identified three known loci (TCF7L2, HMGA2 and KCNQ1) and two novel loci (HLA-B and INS-IGF2) at genome-wide significance (4.15 × 10(-94)<P<5 × 10(-8), odds ratio (OR)  = 1.09 to 1.36). Fine-mapping revealed that 88 of 158 previously identified T2D or glucose homeostasis loci demonstrated nominal to highly significant association (2.2 × 10(-23) < locus-wide P<0.05). These novel and previously identified loci yielded a sibling relative risk of 1.19, explaining 17.5% of the phenotypic variance of T2D on the liability scale in African Americans. Overall, this study identified two novel susceptibility loci for T2D in African Americans. A substantial number of previously reported loci are transferable to African Americans after accounting for linkage disequilibrium, enabling fine mapping of causal variants in trans-ethnic meta-analysis studies.Peer reviewe
    corecore