Search CORE

61 research outputs found

Deep learning classification of chest x-ray images

Author: Majdi Mohammad S.
Merchant Nirav C.
Morris Michael F.
Rodriguez Jeffrey J.
Salman Khalil N.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 18/05/2020
Field of study

We propose a deep learning based method for classification of commonly occurring pathologies in chest X-ray images. The vast number of publicly available chest X-ray images provides the data necessary for successfully employing deep learning methodologies to reduce the misdiagnosis of thoracic diseases. We applied our method to the classification of two example pathologies, pulmonary nodules and cardiomegaly, and we compared the performance of our method to three existing methods. The results show an improvement in AUC for detection of nodules and cardiomegaly compared to the existing methods.Comment: 4 pages, 4 figures, 2 tables, conference , SSIAI 202

arXiv.org e-Print Archive

Crossref

The University of Arizona

Virtualizing the Stampede2 Supercomputer with Applications to HPC in the Cloud

Author: Abraham M. J.
Amazon Inc.
DeHaan Michael
DeHaan Michael
Dooley Rion
Dooley Rion
McKenna Frank
Merchant Nirav
Rathje Ellen M
Schulz Karl W.
Sefraoui Omar
Yoo Andy B
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 12/07/2018
Field of study

Methods developed at the Texas Advanced Computing Center (TACC) are described and demonstrated for automating the construction of an elastic, virtual cluster emulating the Stampede2 high performance computing (HPC) system. The cluster can be built and/or scaled in a matter of minutes on the Jetstream self-service cloud system and shares many properties of the original Stampede2, including: i) common identity management, ii) access to the same file systems, iii) equivalent software application stack and module system, iv) similar job scheduling interface via Slurm. We measure time-to-solution for a number of common scientific applications on our virtual cluster against equivalent runs on Stampede2 and develop an application profile where performance is similar or otherwise acceptable. For such applications, the virtual cluster provides an effective form of "cloud bursting" with the potential to significantly improve overall turnaround time, particularly when Stampede2 is experiencing long queue wait times. In addition, the virtual cluster can be used for test and debug without directly impacting Stampede2. We conclude with a discussion of how science gateways can leverage the TACC Jobs API web service to incorporate this cloud bursting technique transparently to the end user.Comment: 6 pages, 0 figures, PEARC '18: Practice and Experience in Advanced Research Computing, July 22--26, 2018, Pittsburgh, PA, US

arXiv.org e-Print Archive

Crossref

Cyber-Agricultural Systems for Crop Breeding and Sustainable Production

Author: Chowdhary Girish
Das Sajal K.
Fotouhi Fateme
Ganapathysubramanian Baskar
Kantor George
Kar Soumyashree
Krishnamurthy Adarsh
Merchant Nirav
Nagasubramanian Koushik
Sarkar Soumik
Singh Arti
Singh Asheesh K.
Publication venue: Scholars\u27 Mine
Publication date: 01/01/2023
Field of study

The Cyber-Agricultural System (CAS) Represents an overarching Framework of Agriculture that Leverages Recent Advances in Ubiquitous Sensing, Artificial Intelligence, Smart Actuators, and Scalable Cyberinfrastructure (CI) in Both Breeding and Production Agriculture. We Discuss the Recent Progress and Perspective of the Three Fundamental Components of CAS – Sensing, Modeling, and Actuation – and the Emerging Concept of Agricultural Digital Twins (DTs). We Also Discuss How Scalable CI is Becoming a Key Enabler of Smart Agriculture. in This Review We Shed Light on the Significance of CAS in Revolutionizing Crop Breeding and Production by Enhancing Efficiency, Productivity, Sustainability, and Resilience to Changing Climate. Finally, We Identify Underexplored and Promising Future Directions for CAS Research and Development

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine

Jetstream: A self-provisoned, scalable science and engineering cloud environment

Author: Cockerill Timothy M.
Foster Ian
Gaffney Niall
Hancock David
Merchant Nirav
Skidmore Edwin
Stanzione Daniel
Stewart Craig A.
Taylor James
Tuecke Steven
Turner George
Vaughn Matthew
Publication venue
Publication date: 29/07/2015
Field of study

The paper describes the motivation behind Jetstream, its functions, hardware configuration, software environment, user interface, design, use cases, relationships with other projects such as Wrangler and iPlant, and challenges in implementation.Funded by the National Science Foundation Award #ACI - 144560

IUScholarWorks (University of Indiana)

From cheek swabs to consensus sequences : an A to Z protocol for high-throughput DNA sequencing of complete human mitochondrial genomes

Author: Adhikarla Syama
Adler Christina J.
Balanovska Elena
Balanovsky Oleg
Bertranpetit Jaume
Clarke Andrew C.
Comas David
Cooper Alan
Der Sarkissian Clio S.I.
Dulik Matthew C.
Gaieski Jill B.
GaneshPrasad Arun Kumar
Haak Wolfgang
Haber Marc
Hernanz Soria
Jin Li
Kaplan Matthew E.
Lacerda Daniela R.
Li Shilin
Martínez-Cruz Begoña
Matisoo-Smith Elizabeth A.
Merchant Nirav C.
Mitchell R. John
Owings Amanda C.
Parida Laxmi
Pitchappan Ramasamy
Platt Daniel E.
Prost Stefan
Quintana-Murci Lluis
Renfrew Colin
Royyuru Ajay K.
Santhakumari Arun Varatharajan
Santos Fabrício R.
Schurr Theodore G.
Soodyall Himla
Stanton Jo Ann L.
Swamikrishnan Pandikumar
Tyler-Smith Chris
Vieira Pedro Paulo
Vilar Miguel G.
Wells R. Spencer
White W. Timothy J.
Zalloua Pierre A.
Ziegle Janet S.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Background: Next-generation DNA sequencing (NGS) technologies have made huge impacts in many fields of biological research, but especially in evolutionary biology. One area where NGS has shown potential is for high-throughput sequencing of complete mtDNA genomes (of humans and other animals). Despite the increasing use of NGS technologies and a better appreciation of their importance in answering biological questions, there remain significant obstacles to the successful implementation of NGS-based projects, especially for new users. Results: Here we present an ‘A to Z’ protocol for obtaining complete human mitochondrial (mtDNA) genomes – from DNA extraction to consensus sequence. Although designed for use on humans, this protocol could also be used to sequence small, organellar genomes from other species, and also nuclear loci. This protocol includes DNA extraction, PCR amplification, fragmentation of PCR products, barcoding of fragments, sequencing using the 454 GS FLX platform, and a complete bioinformatics pipeline (primer removal, reference-based mapping, output of coverage plots and SNP calling). Conclusions: All steps in this protocol are designed to be straightforward to implement, especially for researchers who are undertaking next-generation sequencing for the first time. The molecular steps are scalable to large numbers (hundreds) of individuals and all steps post-DNA extraction can be carried out in 96-well plate format. Also, the protocol has been assembled so that individual ‘modules’ can be swapped out to suit available resources

Crossref

Repository@Nottingham

Adelaide Research & Scholarship

Springer - Publisher Connector

University of Birmingham Research Portal

PubMed Central

The University of Arizona

Warwick Research Archives Portal Repository

UPF Digital Repository

ScholarlyCommons@Penn

Neolithic Mitochondrial Haplogroup H Genomes and the Genetic Origins of Europeans

Author: Adhikarla Syama
Adler Christina J
Alt Kurt W
Balanovska Elena
Balanovska Oleg
Behar Doron M
Bertranpetit Jaume
Brandt Guido
Brotherton Paul
Clarke Andrew C
Comas David
Cooper Alan
Dresely Veit
Dulik Matthew C
Friederich Susanne
Gaieski Jill Bennett
Ganesh Prasad Arun Kumar
Ganslmeier Robert
Genographic Consortium
Haak Wolfgang
Ho Simon Y.W.
Hobbs Jennifer L
Hoobs Angela
Javed Asif
Jin Li
Kaplan Matthew E
Kenyon Rosalie
Korlach Jonas
Lacerda Daniela R
Li Shilin
Luong Khai
Martinez-Cruz Begoña
Matisoo-Smith Elizabeth
Meller Harald
Melé Marta
Merchant Nirav C
Mitchell R. John
Owings Amanda C
Parida Laxmi
Pitchappan Ramasamy
Platt Daniel E
Quintana-Murci Lluis
Renfrew Colin
Richards Stephen M
Royyuru Ajay K
Santos Fabrício R
Sarkissian Clio Der
Schurr Theodore G
Soodyall Himla
Soria-Hernanz David F
Soubrier Julien
Swamikrishnan Pandikumar
Templeton Jennifer E. L
Tyler-Smith Chris
Van der Hoek Mark B
van Oven Mannis
Varatharajan Santhakumari Arun
Vieira Pedro Paulo R
Vilar Miguel G
Wells R. Spencer
Ziegle Janet
Publication venue: ScholarlyCommons
Publication date: 23/04/2013
Field of study

Haplogroup H dominates present-day Western European mitochondrial DNA variability (\u3e40%), yet was less common (~19%) among Early Neolithic farmers (~5450 BC) and virtually absent in Mesolithic hunter-gatherers. Here we investigate this major component of the maternal population history of modern Europeans and sequence 39 complete haplogroup H mitochondrial genomes from ancient human remains. We then compare this ‘real-time’ genetic data with cultural changes taking place between the Early Neolithic (~5450 BC) and Bronze Age (~2200 BC) in Central Europe. Our results reveal that the current diversity and distribution of haplogroup H were largely established by the Mid Neolithic (~4000 BC), but with substantial genetic contributions from subsequent pan-European cultures such as the Bell Beakers expanding out of Iberia in the Late Neolithic (~2800 BC). Dated haplogroup H genomes allow us to reconstruct the recent evolutionary history of haplogroup H and reveal a mutation rate 45% higher than current estimates for human mitochondria

ScholarlyCommons@Penn

The iPlant Collaborative: Cyberinfrastructure for Plant Biology

The iPlant Collaborative (iPlant) is a United States National Science Foundation (NSF) funded project that aims to create an innovative, comprehensive, and foundational cyberinfrastructure in support of plant biology research (PSCIC, 2006). iPlant is developing cyberinfrastructure that uniquely enables scientists throughout the diverse fields that comprise plant biology to address Grand Challenges in new ways, to stimulate and facilitate cross-disciplinary research, to promote biology and computer science research interactions, and to train the next generation of scientists on the use of cyberinfrastructure in research and education. Meeting humanity's projected demands for agricultural and forest products and the expectation that natural ecosystems be managed sustainably will require synergies from the application of information technologies. The iPlant cyberinfrastructure design is based on an unprecedented period of research community input, and leverages developments in high-performance computing, data storage, and cyberinfrastructure for the physical sciences. iPlant is an open-source project with application programming interfaces that allow the community to extend the infrastructure to meet its needs. iPlant is sponsoring community-driven workshops addressing specific scientific questions via analysis tool integration and hypothesis testing. These workshops teach researchers how to add bioinformatics tools and/or datasets into the iPlant cyberinfrastructure enabling plant scientists to perform complex analyses on large datasets without the need to master the command-line or high-performance computational services

Carolina Digital Repository

PGen: large-scale genomic variations analysis workflow and browser in SoyKB

Author: A McKenna
B Valliyodan
Babu Valliyodan
DM Goodstein
Dong Xu
E Deelman
G Klambauer
H Li
Henry T. Nguyen
J Duitama
J Schmutz
J Towns
J Wang
Joao V. Maldonado dos Santos
Juexin Wang
JV Maldonado Dos Santos
Mats Rynge
Nirav Merchant
P Calyam
P Cingolani
Prasad P. Calyam
Q Song
SA Goff
Saad M. Khan
Shiyuan Chen
Shuai Zeng
T Joshi
T Langewisch
Trupti Joshi
Yang Liu
Yuanxun Zhang
Z Zhou
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 06/10/2016
Field of study

Background: With the advances in next-generation sequencing (NGS) technology and significant reductions in sequencing costs, it is now possible to sequence large collections of germplasm in crops for detecting genome-scale genetic variations and to apply the knowledge towards improvements in traits. To efficiently facilitate large-scale NGS resequencing data analysis of genomic variations, we have developed " PGen", an integrated and optimized workflow using the Extreme Science and Engineering Discovery Environment (XSEDE) high-performance computing (HPC) virtual system, iPlant cloud data storage resources and Pegasus workflow management system (Pegasus-WMS). The workflow allows users to identify single nucleotide polymorphisms (SNPs) and insertion-deletions (indels), perform SNP annotations and conduct copy number variation analyses on multiple resequencing datasets in a user-friendly and seamless way. Results: We have developed both a Linux version in GitHub (https:// github. com/ pegasus-isi/ PGen-GenomicVariationsWorkflow) and a web-based implementation of the PGen workflow integrated within the Soybean Knowledge Base (SoyKB), (http:// soykb. org/ Pegasus/ index. php). Using PGen, we identified 10,218,140 single-nucleotide polymorphisms (SNPs) and 1,398,982 indels from analysis of 106 soybean lines sequenced at 15X coverage. 297,245 non-synonymous SNPs and 3330 copy number variation (CNV) regions were identified from this analysis. SNPs identified using PGen from additional soybean resequencing projects adding to 500+ soybean germplasm lines in total have been integrated. These SNPs are being utilized for trait improvement using genotype to phenotype prediction approaches developed in-house. In order to browse and access NGS data easily, we have also developed an NGS resequencing data browser (http:// soykb. org/ NGS_ Resequence/ NGS_ index. php) within SoyKB to provide easy access to SNP and downstream analysis results for soybean researchers. Conclusion: PGen workflow has been optimized for the most efficient analysis of soybean data using thorough testing and validation. This research serves as an example of best practices for development of genomics data analysis workflows by integrating remote HPC resources and efficient data management with ease of use for biological users. PGen workflow can also be easily customized for analysis of data in other species.Missouri Soybean Merchandising Council [368]; United Soybean Board [1320-532-5615]This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at [email protected]

Crossref

Springer - Publisher Connector

PubMed Central

The University of Arizona

Bringing your tools to CyVerse Discovery Environment using Docker [version 3; referees: 3 approved]

Author: Eric Lyons
Kathleen Kennedy
Nirav Merchant
Paul Sarando
Upendra Kumar Devisetty
Publication venue: 'F1000 Research Ltd'
Publication date: 01/11/2016
Field of study

Docker has become a very popular container-based virtualization platform for software distribution that has revolutionized the way in which scientific software and software dependencies (software stacks) can be packaged, distributed, and deployed. Docker makes the complex and time-consuming installation procedures needed for scientific software a one-time process. Because it enables platform-independent installation, versioning of software environments, and easy redeployment and reproducibility, Docker is an ideal candidate for the deployment of identical software stacks on different compute environments such as XSEDE and Amazon AWS. Cyverse's Discovery Environment also uses Docker for integrating its powerful, community-recommended software tools into CyVerse's production environment for public use. This paper will help users bring their tools into CyVerse DE which will not only allows users to integrate their tools with relative ease compared to the earlier method of tool deployment in DE but also help users to share their apps with collaborators and also release them for public use

Directory of Open Access Journals