Search CORE

1,216 research outputs found

AgBioData consortium recommendations for sustainable genomics and genetics databases for agriculture

Author: Andorf Carson
Birkett Clayton
Campbell Jacqueline
Cannon Ethalinda K. S.
Cannon Steve
et al.
Grant David
Harper Lisa
Hu Zhi-Liang
Lazo Gerard
Nelson Rex
Park Carissa
Poelchau Monica
Reecy James
Sen Taner Z.
Ware Doreen
Woodhouse Margaret
Publication venue: Iowa State University Digital Repository
Publication date: 01/01/2018
Field of study

The future of agricultural research depends on data. The sheer volume of agricultural biological data being produced today makes excellent data management essential. Governmental agencies, publishers and science funders require data management plans for publicly funded research. Furthermore, the value of data increases exponentially when they are properly stored, described, integrated and shared, so that they can be easily utilized in future analyses. AgBioData (https://www.agbiodata.org) is a consortium of people working at agricultural biological databases, data archives and knowledgbases who strive to identify common issues in database development, curation and management, with the goal of creating database products that are more Findable, Accessible, Interoperable and Reusable. We strive to promote authentic, detailed, accurate and explicit communication between all parties involved in scientific data. As a step toward this goal, we present the current state of biocuration, ontologies, metadata and persistence, database platforms, programmatic (machine) access to data, communication and sustainability with regard to data curation. Each section describes challenges and opportunities for these topics, along with recommendations and best practices

Digital Repository @ Iowa State University (ISU)

Darwin Core: An Evolving Community-Developed Biodiversity Data Standard

Author: A Alercia
AT Peterson
AT Peterson
AT Peterson
AW Hill
BR Stein
C Lynch
C Moritz
C Parmesan
C Parmesan
D Field
DA Vieglais
DA Vieglais
David Bloom
David Vieglais
DRB Stockwell
DTF Endresen
DW Inouye
E Pennisi
EH Fegraus
EO Wilson
FSI Chapin
G Walther
H Constable
H Steele
Indra Neil Sarkar
J Jehl Jr
J Wieczorek
JA Pounds
John Wieczorek
M Jenkins
M Loreau
Markus Döring
P Yilmaz
RA Morris
Renato Giovanni
Robert Guralnick
RP Guralnick
S Blum
S Weibel
SA McLeod
SB McLaren
SB McLaren
SL Pimm
Stan Blum
T van Hintum
Tim Robertson
TL Root
VH Heywood
VP Canhos
W Jetz
W Turner
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Biodiversity data derive from myriad sources stored in various formats on many distinct hardware and software platforms. An essential step towards understanding global patterns of biodiversity is to provide a standardized view of these heterogeneous data sources to improve interoperability. Fundamental to this advance are definitions of common terms. This paper describes the evolution and development of Darwin Core, a data standard for publishing and integrating biodiversity information. We focus on the categories of terms that define the standard, differences between simple and relational Darwin Core, how the standard has been implemented, and the community processes that are essential for maintenance and growth of the standard. We present case-study extensions of the Darwin Core into new research communities, including metagenomics and genetic resources. We close by showing how Darwin Core records are integrated to create new knowledge products documenting species distributions and changes due to environmental perturbations

Crossref

KU ScholarWorks

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

A reporting format for leaf-level gas exchange data and metadata

Author: Agarwal DA
Ainsworth EA
Albert LP
Ali A
Anderson J
Aspinwall MJ
Bellasio C
Bernacchi C
Bonnage S
Buckley TN
Bunce J
Burnett AC
Busch FA
Cavanagh A
Cernusak LA
Crystal-Ornelas R
Damerow J
Davidson KJ
De Kauwe MG
Dietze MC
Domingues TF
Dusenge ME
Ellsworth DS
Ely KS
Evans JR
Gauthier PPG
Gimenez BO
Gordon EP
Gough CM
Halbritter AH
Hanson DT
Heskel M
Hogan JA
Hupp JR
Jardine K
Kattge J
Keenan T
Kromdijk J
Kumarathunge DP
Lamour J
Leakey ADB
LeBauer DS
Li Q
Lundgren MR
McDowell N
Meacham-Hensold K
Medlyn BE
Moore DJP
Negrón-Juárez R
Niinemets Ü
Osborne CP
Pivovaroff AL
Poorter H
Reed SC
Rogers A
Ryu Y
Sanz-Saez A
Schmiege SC
Serbin SP
Sharkey TD
Slot M
Smith NG
Sonawane BV
South PF
Souza DC
Stinziano JR
Stuart-Haëntjens E
Taylor SH
Tejera MD
Uddling J
Vandvik V
Varadharajan C
Walker AP
Walker BJ
Warren JM
Way DA
Wolfe BT
Wu J
Wullschleger SD
Xu C
Yan Z
Yang D
Publication venue: Ecological Informatics
Publication date: 01/01/2021
Field of study

Leaf-level gas exchange data support the mechanistic understanding of plant fluxes of carbon and water. These fluxes inform our understanding of ecosystem function, are an important constraint on parameterization of terrestrial biosphere models, are necessary to understand the response of plants to global environmental change, and are integral to efforts to improve crop production. Collection of these data using gas analyzers can be both technically challenging and time consuming, and individual studies generally focus on a small range of species, restricted time periods, or limited geographic regions. The high value of these data is exemplified by the many publications that reuse and synthesize gas exchange data, however the lack of metadata and data reporting conventions make full and efficient use of these data difficult. Here we propose a reporting format for leaf-level gas exchange data and metadata to provide guidance to data contributors on how to store data in repositories to maximize their discoverability, facilitate their efficient reuse, and add value to individual datasets. For data users, the reporting format will better allow data repositories to optimize data search and extraction, and more readily integrate similar data into harmonized synthesis products. The reporting format specifies data table variable naming and unit conventions, as well as metadata characterizing experimental conditions and protocols. For common data types that were the focus of this initial version of the reporting format, i.e., survey measurements, dark respiration, carbon dioxide and light response curves, and parameters derived from those measurements, we took a further step of defining required additional data and metadata that would maximize the potential reuse of those data types. To aid data contributors and the development of data ingest tools by data repositories we provided a translation table comparing the outputs of common gas exchange instruments. Extensive consultation with data collectors, data users, instrument manufacturers, and data scientists was undertaken in order to ensure that the reporting format met community needs. The reporting format presented here is intended to form a foundation for future development that will incorporate additional data types and variables as gas exchange systems and measurement approaches advance in the future. The reporting format is published in the U.S. Department of Energy's ESS-DIVE data repository, with documentation and future development efforts being maintained in a version control system

Boston University Institutional Repository (OpenBU)

University of Birmingham Research Portal

ResearchOnline at James Cook University

The University of Arizona

Louisiana State University

Juelich Shared Electronic Resources

NORA - Norwegian Open Research Archives

White Rose Research Online

University of Essex Research Repository

University of Bergen

ResearchOnline@JCU

DigitalCommons@University of Nebraska

eScholarship - University of California

Western Sydney ResearchDirect

Apollo (Cambridge)

Lancaster E-Prints

Explore Bristol Research

Connecting a Digital Europe through Location and Place. Selected best short papers and posters of the AGILE 2014 Conference, 03-06 June 2014, Castellón, Spain

Author: Granell Carlos
Huerta Joaquin
Schade Sven
Publication venue: AGILE Digital Editions
Publication date: 01/01/2014
Field of study

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Repositori Institucional de la Universitat Jaume I

Obo foundry food ontology interconnectivity

Author: ANDRES-HERNANDEZ Liliana
BORDEA Georgeta
CARMODY Leigh
CASTELLANO-ESCUDER Pol
CAVALIERI Duccio
CHAN Lauren
DOOLEY Damion
LACHAT Carl
LANGE Matthew
MOUGIN Fleur
VITALI Francesco
WEBER Magalie
YANG Chen
Publication venue
Publication date: 15/09/2021
Field of study

Since its creation in 2016, the FoodOn ontology has become an interconnected partner in various academic and government inter-agency ontology work spanning agricultural and public health domains. This paper examines existing and potential data interoperability capabilities arising from FoodOn and partner food-related ontologies belonging to the encyclopedic Open Biological and Biomedical Ontology Foundry (OBO) vocabulary platform, and how research organizations and industry might utilize them for their own operations or for data exchange. Projects are seeking standardized vocabulary across all direct food supply activities ranging from agricultural production, harvesting, preparation, food processing, marketing, distribution and consumption, as well as indirectly, within health, economic, food security and sustainability analysis and reporting tools. To satisfy this demand and provide data requires establishing domain specific ontologies whose curators coordinate closely to produce recommended patterns for food system vocabulary

Oskar Bordeaux

AgBioData consortium recommendations for sustainable genomics and genetics databases for agriculture

Author: Andorf C.
Arnaud Elizabeth
Berardini T.Z.
Birkett C.
Campbell J.
Cannon Ethalinda K.S.
Cannon S.
Carson J.
Condon B.
Cooper L.
Dunn N.
Elsik C.G.
Farmer A
Ficklin S.P.
Grant D.
Grau E.
Harper L.
Herndon N.
Hu Z.L.
Humann J.
Jaiswal P.
Jonquet C.
Jung S.
Laporte M-A.
Larmande P.
Lazo G.
Main D.
McCarthy F.
Menda N.
Mungall C.J.
Muñoz Torres M.C.
Naithani S.
Nelson R.
Nesdill D.
Park C.
Poelchau M.
Reecy J.
Reiser L.
Sanderson Lacey-Anne
Sen T.Z.
Staton M.
Subramaniam S.
Tello-Ruiz M.K.
Unda V.
Unni D.
Walls R.
Wang L.
Ware D.
Wegrzyn J.
Williams J.
Woodhouse M.
Yu J.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 16/04/2019
Field of study

CGSpace

Complex adaptive systems based data integration : theory and applications

Author: Rohn Eliahu
Publication venue: Digital Commons @ NJIT
Publication date: 27/01/2008
Field of study

Data Definition Languages (DDLs) have been created and used to represent data in programming languages and in database dictionaries. This representation includes descriptions in the form of data fields and relations in the form of a hierarchy, with the common exception of relational databases where relations are flat. Network computing created an environment that enables relatively easy and inexpensive exchange of data. What followed was the creation of new DDLs claiming better support for automatic data integration. It is uncertain from the literature if any real progress has been made toward achieving an ideal state or limit condition of automatic data integration. This research asserts that difficulties in accomplishing integration are indicative of socio-cultural systems in general and are caused by some measurable attributes common in DDLs. This research’s main contributions are: (1) a theory of data integration requirements to fully support automatic data integration from autonomous heterogeneous data sources; (2) the identification of measurable related abstract attributes (Variety, Tension, and Entropy); (3) the development of tools to measure them. The research uses a multi-theoretic lens to define and articulate these attributes and their measurements. The proposed theory is founded on the Law of Requisite Variety, Information Theory, Complex Adaptive Systems (CAS) theory, Sowa’s Meaning Preservation framework and Zipf distributions of words and meanings. Using the theory, the attributes, and their measures, this research proposes a framework for objectively evaluating the suitability of any data definition language with respect to degrees of automatic data integration. This research uses thirteen data structures constructed with various DDLs from the 1960\u27s to date. No DDL examined (and therefore no DDL similar to those examined) is designed to satisfy the law of requisite variety. No DDL examined is designed to support CAS evolutionary processes that could result in fully automated integration of heterogeneous data sources. There is no significant difference in measures of Variety, Tension, and Entropy among DDLs investigated in this research. A direction to overcome the common limitations discovered in this research is suggested and tested by proposing GlossoMote, a theoretical mathematically sound description language that satisfies the data integration theory requirements. The DDL, named GlossoMote, is not merely a new syntax, it is a drastic departure from existing DDL constructs. The feasibility of the approach is demonstrated with a small scale experiment and evaluated using the proposed assessment framework and other means. The promising results require additional research to evaluate GlossoMote’s approach commercial use potential

Digital Commons @ New Jersey Institute of Technology (NJIT)

A survey of semantic web technology for agriculture.

Author: DRURY B.
FERNANDES R.
LOPES A. de A.
MOURA M. F.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2019
Field of study

ABSTRACT. Semantic web technologies have become a popular technique to apply meaning to unstructured data. They have been infrequently applied to problems within the agricultural domain when compared to complementary domains. Despite this lack of application, agriculture has a large number of semantic resources that have been developed by large NGOs such as the Food and Agriculture Organization (FAO). This survey is intended to motivate further research in the application of semantic web technologies for agricultural problems, by making available a self contained reference that provides: a comprehensive review of preexisting semantic resources and their construction methods, data interchange standards, as well as a survey of the current applications of semantic web technologies

Repository Open Access to Scientific Information from Embrapa

Xml Beyond The Tags

Author: Meloy Christopher Adam
Publication venue: 'Information Bulletin on Variable Stars (IBVS)'
Publication date: 01/01/2011
Field of study

XML is quickly being utilized in the field of technical communication to transfer information from database to person and company to company. Often communicators will structure information without a second thought of how or why certain tags are used to mark up the information. Because the company or a manual says to use those tags, the communicator does so. However, if professionals want to unlock the true potential of XML for better sharing of information across platforms, they need to understand the effects the technology using XML as well as political and cultural factors have on the tags being used. This thesis reviewed literature from multiple fields utilizing XML to find how tag choices can be influenced. XML allows for the sharing of information across multiple platforms and databases. Because of this efficiency, XML is utilized by many technologies. Often communicators must tag information so that the technologies can find the marked up information; therefore, technologies like single sourcing, data mining, and knowledge management influence the types of tags created. Additionally, cultural and political influences are analyzed to see how they play a role in determining what tags are used and created for specific documents. The thesis concludes with predictions on the future of XML and the technological, political, and cultural influences associated with XML tag sets based on information found within the thesis

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)