Search CORE

14 research outputs found

Developing Metadata Categories as a Strategy to Mobilize Computable Biomedical Knowledge

Author: Alper Brian S
Bray Bruce E
Conte Marisa L
Eldredge Christina
Flynn Allen
Gold Sigfried
Greenes Robert A
Haug Peter
Koru Gunes
McClay James
Sainvil Marc L
Sottara Davide
Tuttle Mark
Yurk Robin A
Publication venue
Publication date: 20/06/2020
Field of study

A work by a group of volunteer members drawn from the Mobilizing Computable Biomedical Knowledge community's Standards Workgroup. See mobilizecbk.org for more information about this community and workgroup.Computable biomedical knowledge artifacts (CBKs) are digital objects or entities representing biomedical knowledge as machine-independent data structures that can be parsed and processed by different information systems. The breadth of content represented in CBKs spans all biomedical knowledge related to human health and so it includes knowledge about molecules, cells, organs, individual people, human populations, and the environment. CBKs vary in their scope, purpose, and audience. Some CBKs support biomedical research. Other CBKs help improve health outcomes by enabling clinical decision support, health education, health promotion, and population health analytics. In some instances, CBKs have multiple uses that span research, education, clinical care, or population health. As the number of CBKs grows large, producers must describe them with structured, searchable metadata so that consumers can find, deploy, and use them properly. This report delineates categories of metadata for describing CBKs sufficiently to enable CBKs to be mobilized for various purposes.https://deepblue.lib.umich.edu/bitstream/2027.42/155655/1/MCBK.Metadata.Paper.June2020.f.pdfDescription of MCBK.Metadata.Paper.June2020.f.pdf : MCBK 2020 Virtual Meeting version of Standards Workgroup's Working Paper on CBK Metadat

Deep Blue Documents at the University of Michigan

Gene expression AffyProbeMiner: a web resource for computing or retrieving accurately redefined Affymetrix probe sets

Author: A Gunes Koru
Alessandro Ferrucci
Antej Nuhanovic
Ari Kahn
Barry R Zeeberg
David W Kane
Gang Qu
Hongfang Liu
John N Weinstein
Michael C Ryan
Peter J Munson
William C Reinhold
Publication venue
Publication date: 01/01/2007
Field of study

CiteSeerX

De-identifying a public use microdata file from the Canadian national discharge abstract database

Author: A Dale
A de Waal
A Gionis
A Hundepool
A Hundepool
A Machanavajjhala
A Machanavajjhala
A Meyerson
A Narayanan
Agency for Healthcare Research and Quality
B Hore
B Yolles
B-C Chen
BCM Fung
BCM Fung
BCM Fung
C Hogue
C Mackie
C Marsh
C Marsh
C Skinner
C Skinner
Canada Statistics
Canadian Institute for Health Information
Canadian Institute for Health Information
CE Shannon
CE Shannon
CK Liew
D Altman
D Defays
D Defays
D Hutchon
D Lafky
David Paton
DB Rubin
Department of Health and Human Services
Department of Health and Human Services
E Boyko
Federal Court (Canada)
Fida Dankar
G Aggarwal
G Duncan
G Loukides
G Sande
G Sullivan
G Sullivan
GD Smith
GR Heer
Gunes Koru
H Kargupta
J Castro
J Domingo-Ferrer
J Domingo-Ferrer
J Domingo-Ferrer
J Domingo-Ferrer
J Domingo-Ferrer
J Domingo-Ferrer
J Domingo-Ferrer
J Jimenez
J Schoenman
J Xu
JJ Kim
JP Gouweleeuw
K Abraham
K Benitez
K El Emam
K El Emam
K El Emam
K El Emam
K El Emam
K El Emam
K El Emam
K El Emam
K El Emam
K LeFevre
Khaled El Emam
L Alexander
L Sweeney
L Sweeney
L Sweeney
L Sweeney
L Sweeney
L Willenborg
L Willenborg
LA Alexander
LH Cox
M Barbaro
M Templ
ME Nergiz
National Committee on Vital and Health Statistics
P Doyle
P Kooiman
P Nanopoulos
P Samarati
P Samarati
P Samarati
R Bayardo
R Gopal
RA Dandekar
RA Dandekar
RJ Bayardo
RJA Little
S Fienberg
S Hansell
S Ochoa
Statistics Canada
Statistics Canada
Statistics Canada
T de Waal
T Delamothe
T Hedrick
T Zeller Jr
V Ciriani
V Iyengar
V Torra
V Torra
V Torra
VS Iyengar
W Lowrance
W Winkler
WE Winkler
X Xiao
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background The Canadian Institute for Health Information (CIHI) collects hospital discharge abstract data (DAD) from Canadian provinces and territories. There are many demands for the disclosure of this data for research and analysis to inform policy making. To expedite the disclosure of data for some of these purposes, the construction of a DAD public use microdata file (PUMF) was considered. Such purposes include: confirming some published results, providing broader feedback to CIHI to improve data quality, training students and fellows, providing an easily accessible data set for researchers to prepare for analyses on the full DAD data set, and serve as a large health data set for computer scientists and statisticians to evaluate analysis and data mining techniques. The objective of this study was to measure the probability of re-identification for records in a PUMF, and to de-identify a national DAD PUMF consisting of 10% of records. Methods Plausible attacks on a PUMF were evaluated. Based on these attacks, the 2008-2009 national DAD was de-identified. A new algorithm was developed to minimize the amount of suppression while maximizing the precision of the data. The acceptable threshold for the probability of correct re-identification of a record was set at between 0.04 and 0.05. Information loss was measured in terms of the extent of suppression and entropy. Results Two different PUMF files were produced, one with geographic information, and one with no geographic information but more clinical information. At a threshold of 0.05, the maximum proportion of records with the diagnosis code suppressed was 20%, but these suppressions represented only 8-9% of all values in the DAD. Our suppression algorithm has less information loss than a more traditional approach to suppression. Smaller regions, patients with longer stays, and age groups that are infrequently admitted to hospitals tend to be the ones with the highest rates of suppression. Conclusions The strategies we used to maximize data utility and minimize information loss can result in a PUMF that would be useful for the specific purposes noted earlier. However, to create a more detailed file with less information loss suitable for more complex health services research, the risk would need to be mitigated by requiring the data recipient to commit to a data sharing agreement.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

KC1

Author: A. Gunes Koru (5193775)
Publication venue
Publication date
Field of study

Overview of Data The data is a weka .arff file. It contains 94 independent variables and 1 dependent variable. Paper Abstract Modern requirements tracing tools employ information retrieval methods to automatically generate candidate links. Due to the inherent trade-off between recall and precision, such methods cannot achieve a high coverage without also retrieving a great number of false positives, causing a significant drop in result accuracy. In this paper, we propose an approach to improving the quality of candidate link generation for the requirements tracing process. We base our research on the cluster hypothesis which suggests that correct and incorrect links can be grouped in high-quality and low-quality clusters respectively. Result accuracy can thus be enhanced by identifying and filtering out low-quality clusters. We describe our approach by investigating three open-source datasets, and further evaluate our work through an industrial study. The results show that our approach outperforms a baseline pruning strategy and that improvements are still possible.</p

FigShare

Theory of Relative Dependency: Higher Coupling Concentration in Smaller Modules and its Implications for Software Refactoring and Quality

Author: A. Gunes Koru
Khaled El Emam
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2019
Field of study

Crossref

DETERMINATION OF THE BIOENERGY PRODUCTION CAPACITY FROM BIOCHEMICAL PROFILES OF SOME AQUATIC PHYTOREMEDIATION PLANTS. ENERGY WHILE CLEANING

Author: Akat O.
Cakar H.
Cirik S.
Firat K.
Gunes A.
Guney M. A.
Korkut A. Y.
Koru E.
Ozkul B.
Saka S.
Suzer C.
Publication venue: Scibulcom Ltd
Publication date: 01/01/2014
Field of study

WOS: 000342876200028This study aims to research the possibilities of converting some hydrophytes into energy by revaluating them after the harvesting process. These hydrophytes used in the phytoremediation studies disperse naturally in aquatic mediums, sometimes even revealing themselves as invasive species. Chosen hydrophytes samples (Eichorrzia crassipes, Cyperus alternifolius, Lemna minor, Pistia stratiotes, Typha latifolia, Nasturtium officinale,Houttonia cordata) are analysed in terms of oil rate, biochemical profiles which include elaeostearic compositions, COI/T.20/Doc No 17 (capillary column gas chromatography) and in-house methods. The obtained data are analysed in comparison to the elaeostearics rate and compositions of the plants used in biodiesel procurement (canola, soy, palm, sunflower, Botryococcus and Chlorella oils). As a result, it is found that linolenic acid and linoleic acid percentages especially stand forth in the plants Eichornia sp., Cyperus sp., Lemna sp., the stearic and oleic acid percentages are significantly high in Pistia sp., and palmitic elaeostearic percentage is higher in the plants of Houttonia sp. and Nasturtium sp. than the plants currently used in biodiesel procurement, yet the oil rate within their system is lower than these plants. Moreover, it is thought that the plant waste obtained after the harvest carried out in order to ensure the water quality of the systems may in the least meet this deficit.Ege UniversityEge University [11-Suf-030]; EBILTEM (Ege University Science and Technology Application and Research Centre)Ege University [2010-BAMYO-001]This research was supported by projects numbered Ege University11-Suf-030 and EBILTEM (Ege University Science and Technology Application and Research Centre) 2010-BAMYO-001. Special thanks to the Major of Bayindir Mehmet Kertis, Prof. Dr. Ozcan Secmen and botanist Aydin Dincaslan who never ceased to support the study materially as well as morally

Ege University Institutional Repository