Search CORE

305 research outputs found

Exascale Deep Learning to Accelerate Cancer Research

Author: Abousamra Shahira
Chae Junghoon
Hou Le
Johnston J. Travis
Lim Seung-Hwan
Patton Robert M.
Potok Thomas E.
Rose Derek C.
Saltz Joel
Samaras Dimitris
Schuman Catherine D.
Young Steven R.
Publication venue
Publication date: 26/09/2019
Field of study

Deep learning, through the use of neural networks, has demonstrated remarkable ability to automate many routine tasks when presented with sufficient data for training. The neural network architecture (e.g. number of layers, types of layers, connections between layers, etc.) plays a critical role in determining what, if anything, the neural network is able to learn from the training data. The trend for neural network architectures, especially those trained on ImageNet, has been to grow ever deeper and more complex. The result has been ever increasing accuracy on benchmark datasets with the cost of increased computational demands. In this paper we demonstrate that neural network architectures can be automatically generated, tailored for a specific application, with dual objectives: accuracy of prediction and speed of prediction. Using MENNDL--an HPC-enabled software stack for neural architecture search--we generate a neural network with comparable accuracy to state-of-the-art networks on a cancer pathology dataset that is also

16\times

faster at inference. The speedup in inference is necessary because of the volume and velocity of cancer pathology data; specifically, the previous state-of-the-art networks are too slow for individual researchers without access to HPC systems to keep pace with the rate of data generation. Our new model enables researchers with modest computational resources to analyze newly generated data faster than it is collected.Comment: Submitted to IEEE Big Dat

arXiv.org e-Print Archive

Crossref

Plasma Dynamics

Author: Bekefi George
Bers Abraham
Bromberg Leslie
Chambers Frank W.
Cogswell Kurt D.
Cook Andrew G.
Coppi Bruno
Deutsch Claude
Dupree Thomas H.
Ehst David A.
Fisch Nathaniel J.
Forbes Alan R.
Herring James S.
Hershcovitch Ady
John Putheveetil K.
Johnston George L.
Jones Joseph L.
Kaplan David L.
Karney Charles F. F.
Kenyon Peter T.
Klinkowstein Robert E.
Krapchev Vladimir B.
Kulp John L., Jr.
Leclert Gérard
Lidsky Lawrence M.
Macrakis Stavros M.
McCune James E.
McKinstry Mark L.
Misaki Yo
Novich Neil S.
Otten Gillis R.
Politzer Peter A.
Potok Robert E.
Reiman Allan H.
Scaturro Louis S.
Sen Abhijit
Smullin Louis D.
Tekula Miloslav S.
Tetreault David J.
Theilhaber Kim S.
Thomas Clarence E.
Publication venue: Research Laboratory of Electronics (RLE) at the Massachusetts Institute of Technology (MIT)
Publication date: 01/01/1977
Field of study

Contains research objectives and summary of research on nineteen research projects split into five sections.National Science Foundation (Grant ENG75-06242-A01)U.S. Energy Research and Development Administration (Contract E(11-1)-2766)U.S. Air Force - Office of Scientific Research (Grant AFOSR-77-3143)U.S. Energy Research and Development Administration (Contract EY-76-C2-02-3070.*000

DSpace@MIT

Recommended from our members

Analysis and Classification of Mammography Reports Using Maximum Variation Sampling

Author: Beckerman Barbara G.
Patton Robert M.
Potok Thomas E.
Publication venue: Oak Ridge National Laboratory
Publication date: 01/01/2011
Field of study

Currently, no automated means of detecting abnormal mammograms exist. While knowledge discovery capabilities through data mining and data analytics tools are widespread in many industries, the healthcare industry as a whole lags far behind. Providers are only just beginning to recognize the value of data mining as a tool to analyze patient care and clinical outcomes. The research conducted by the authors investigates the use of genetic algorithms for classification of unstructured mammography reports, which can be later correlated to the images for extraction and testing. In mammography, much effort has been expended to characterize findings in the radiology reports. Various computer-assisted technologies have been developed to assist radiologists in detecting cancer; however, the algorithms still lack high degrees of sensitivity and specificity, and must undergo machine learning against a training set with known pathologies in order to further refine the algorithms with higher validity of truth. In a large database of reports and corresponding images, automated tools are needed just to determine which data to include in the training set. Validation of these data is another issue. Radiologists disagree with each other over the characteristics and features of what constitutes a normal mammogram and the terminology to use in the associated radiology report. Abnormal reports follow the lexicon established by the American College of radiology Breast Imaging Reporting and Data System (Bi-RADS), but even within these reports, there is a high degree of text variability and interpretation of semantics. The focus has been on classifying abnormal or suspicious reports, but even this process needs further layers of clustering and gradation, so that individual lesions can be more effectively classified. The tools that are needed will not only help further identify problem areas but also support risk assessment and other knowledge discovery applications. The knowledge to be gained by extracting and integrating meaningful information from radiology reports will have a far-reaching benefit, in terms of the refinement of the classifications of various findings within the reports. This will support validation, training and optimization of these and other machine learning and computer-aided diagnosis algorithms to work both in this environment and with other medical and imaging modalities. In the near-term, the objective of this work is to accurately identify abnormal radiology reports amid a massive collection of reports. The challenge in achieving this objective lies in the use of natural language to describe the patient's condition. The premise of this work is that abnormal radiology reports consist of words and phrases that are statistically rare or unusual. If this is true, then it is expected that abnormal reports will be significantly dissimilar in comparison to normal radiology reports. To achieve this objective, our approach employs maximum variation sampling (MVS), which is implemented as an adaptive sampling approach. Maximum variation sampling seeks to identify a particular sample of data that will represent the diverse data points in a data set. Adaptive sampling continues to draw samples from the population based on previous samples until some criteria have been met. Previous results from using MVS indicated that an ideal sample could be found very quickly using this approach

UNT Digital Library