87 research outputs found
AdvFoolGen: Creating Persistent Troubles for Deep Classifiers
Researches have shown that deep neural networks are vulnerable to malicious
attacks, where adversarial images are created to trick a network into
misclassification even if the images may give rise to totally different labels
by human eyes. To make deep networks more robust to such attacks, many defense
mechanisms have been proposed in the literature, some of which are quite
effective for guarding against typical attacks. In this paper, we present a new
black-box attack termed AdvFoolGen, which can generate attacking images from
the same feature space as that of the natural images, so as to keep baffling
the network even though state-of-the-art defense mechanisms have been applied.
We systematically evaluate our model by comparing with well-established attack
algorithms. Through experiments, we demonstrate the effectiveness and
robustness of our attack in the face of state-of-the-art defense techniques and
unveil the potential reasons for its effectiveness through principled analysis.
As such, AdvFoolGen contributes to understanding the vulnerability of deep
networks from a new perspective and may, in turn, help in developing and
evaluating new defense mechanisms.Comment: 11 pages, 5 figure
Benchmarking a foundation LLM on its ability to re-label structure names in accordance with the AAPM TG-263 report
Purpose: To introduce the concept of using large language models (LLMs) to
re-label structure names in accordance with the American Association of
Physicists in Medicine (AAPM) Task Group (TG)-263 standard, and to establish a
benchmark for future studies to reference.
Methods and Materials: The Generative Pre-trained Transformer (GPT)-4
application programming interface (API) was implemented as a Digital Imaging
and Communications in Medicine (DICOM) storage server, which upon receiving a
structure set DICOM file, prompts GPT-4 to re-label the structure names of both
target volumes and normal tissues according to the AAPM TG-263. Three disease
sites, prostate, head and neck, and thorax were selected for evaluation. For
each disease site category, 150 patients were randomly selected for manually
tuning the instructions prompt (in batches of 50) and 50 patients were randomly
selected for evaluation. Structure names that were considered were those that
were most likely to be relevant for studies utilizing structure contours for
many patients.
Results: The overall re-labeling accuracy of both target volumes and normal
tissues for prostate, head and neck, and thorax cases was 96.0%, 98.5%, and
96.9% respectively. Re-labeling of target volumes was less accurate on average
except for prostate - 100%, 93.1%, and 91.1% respectively.
Conclusions: Given the accuracy of GPT-4 in re-labeling structure names of
both target volumes and normal tissues as presented in this work, LLMs are
poised to be the preferred method for standardizing structure names in
radiation oncology, especially considering the rapid advancements in LLM
capabilities that are likely to continue.Comment: 20 pages, 5 figures, 1 tabl
Evaluating Large Language Models on a Highly-specialized Topic, Radiation Oncology Physics
We present the first study to investigate Large Language Models (LLMs) in
answering radiation oncology physics questions. Because popular exams like AP
Physics, LSAT, and GRE have large test-taker populations and ample test
preparation resources in circulation, they may not allow for accurately
assessing the true potential of LLMs. This paper proposes evaluating LLMs on a
highly-specialized topic, radiation oncology physics, which may be more
pertinent to scientific and medical communities in addition to being a valuable
benchmark of LLMs. We developed an exam consisting of 100 radiation oncology
physics questions based on our expertise at Mayo Clinic. Four LLMs, ChatGPT
(GPT-3.5), ChatGPT (GPT-4), Bard (LaMDA), and BLOOMZ, were evaluated against
medical physicists and non-experts. ChatGPT (GPT-4) outperformed all other LLMs
as well as medical physicists, on average. The performance of ChatGPT (GPT-4)
was further improved when prompted to explain first, then answer. ChatGPT
(GPT-3.5 and GPT-4) showed a high level of consistency in its answer choices
across a number of trials, whether correct or incorrect, a characteristic that
was not observed in the human test groups. In evaluating ChatGPTs (GPT-4)
deductive reasoning ability using a novel approach (substituting the correct
answer with "None of the above choices is the correct answer."), ChatGPT
(GPT-4) demonstrated surprising accuracy, suggesting the potential presence of
an emergent ability. Finally, although ChatGPT (GPT-4) performed well overall,
its intrinsic properties did not allow for further improvement when scoring
based on a majority vote across trials. In contrast, a team of medical
physicists were able to greatly outperform ChatGPT (GPT-4) using a majority
vote. This study suggests a great potential for LLMs to work alongside
radiation oncology experts as highly knowledgeable assistants
Deep-Learning-based Fast and Accurate 3D CT Deformable Image Registration in Lung Cancer
Purpose: In some proton therapy facilities, patient alignment relies on two
2D orthogonal kV images, taken at fixed, oblique angles, as no 3D on-the-bed
imaging is available. The visibility of the tumor in kV images is limited since
the patient's 3D anatomy is projected onto a 2D plane, especially when the
tumor is behind high-density structures such as bones. This can lead to large
patient setup errors. A solution is to reconstruct the 3D CT image from the kV
images obtained at the treatment isocenter in the treatment position.
Methods: An asymmetric autoencoder-like network built with vision-transformer
blocks was developed. The data was collected from 1 head and neck patient: 2
orthogonal kV images (1024x1024 voxels), 1 3D CT with padding (512x512x512)
acquired from the in-room CT-on-rails before kVs were taken and 2
digitally-reconstructed-radiograph (DRR) images (512x512) based on the CT. We
resampled kV images every 8 voxels and DRR and CT every 4 voxels, thus formed a
dataset consisting of 262,144 samples, in which the images have a dimension of
128 for each direction. In training, both kV and DRR images were utilized, and
the encoder was encouraged to learn the jointed feature map from both kV and
DRR images. In testing, only independent kV images were used. The full-size
synthetic CT (sCT) was achieved by concatenating the sCTs generated by the
model according to their spatial information. The image quality of the
synthetic CT (sCT) was evaluated using mean absolute error (MAE) and
per-voxel-absolute-CT-number-difference volume histogram (CDVH).
Results: The model achieved a speed of 2.1s and a MAE of <40HU. The CDVH
showed that <5% of the voxels had a per-voxel-absolute-CT-number-difference
larger than 185 HU.
Conclusion: A patient-specific vision-transformer-based network was developed
and shown to be accurate and efficient to reconstruct 3D CT images from kV
images.Comment: 9 figure
Artificial General Intelligence for Radiation Oncology
The emergence of artificial general intelligence (AGI) is transforming
radiation oncology. As prominent vanguards of AGI, large language models (LLMs)
such as GPT-4 and PaLM 2 can process extensive texts and large vision models
(LVMs) such as the Segment Anything Model (SAM) can process extensive imaging
data to enhance the efficiency and precision of radiation therapy. This paper
explores full-spectrum applications of AGI across radiation oncology including
initial consultation, simulation, treatment planning, treatment delivery,
treatment verification, and patient follow-up. The fusion of vision data with
LLMs also creates powerful multimodal models that elucidate nuanced clinical
patterns. Together, AGI promises to catalyze a shift towards data-driven,
personalized radiation therapy. However, these models should complement human
expertise and care. This paper provides an overview of how AGI can transform
radiation oncology to elevate the standard of patient care in radiation
oncology, with the key insight being AGI's ability to exploit multimodal
clinical data at scale
The succession of rhizosphere microbial community in the continuous cropping soil of tobacco
Introduction: Flue-cured tobacco is an important economic crop that is not tolerant of continuous cropping and can be influenced by planting soil conditions including rhizosphere microbial communities and soil physicochemical properties. The relationship between rhizosphere microbial communities and soil physicochemical properties under continuous cropping conditions is unclear.Methods: This study investigated the succession of rhizosphere microbial community in continuous tobacco cropping soil for 1, 3, 5, 8, 10, 15, and 30Â years. The physicochemical properties of the soil were measured, high-throughput sequencing was performed on the rhizosphere microbial community, and correlation analysis was conducted.Results: The results suggested that continuous cropping could significantly enrich soil available nitrogen, available phosphorus, available potassium, and organic matter. Meanwhile, the alpha diversity of the bacterial community was significantly reduced with continuous cropping, indicating significant changes in the structure of bacterial and fungal communities. Based on linear discriminant analysis effect size (LEfSe), 173 bacterial and 75 fungal genera were identified with significant differences. The bacterial genera, Sphingomonas, Streptomyces, and Microvirga, were significantly positively correlated with continuous cropping years. The fungal genera, Tausonia, Solicocozyma, Pseudomycohila, and Fusarium, also showed significant positive correlation with continuous cropping years. Meanwhile, the fungal genera, Olpidium, Cephaliophora, and Cercophora, presented an opposite correlation. However, there are differences in the correlation between these bacterial and fungal genera related to continuous cropping years and other different soil physicochemical properties.Discussion: In summary, this work could provide a reference for soil management and scientific fertilization of tobacco under continuous cropping conditions
Evolutionary Analysis of Structural Protein Gene VP1 of Foot-and-Mouth Disease Virus Serotype Asia 1
Foot-and-mouth disease virus (FMDV) serotype Asia 1 was mostly endemic in Asia and then was responsible for economically important viral disease of cloven-hoofed animals, but the study on its selection and evolutionary process is comparatively rare. In this study, we characterized 377 isolates from Asia collected up until 2012, including four vaccine strains. Maximum likelihood analysis suggested that the strains circulating in Asia were classified into 8 different groups (groups I–VIII) or were unclassified (viruses collected before 2000). On the basis of divergence time analyses, we infer that the TMRCA of Asia 1 virus existed approximately 86.29 years ago. The result suggested that the virus had a high mutation rate (5.745 × 10−3 substitutions/site/year) in comparison to the other serotypes of FMDV VP1 gene. Furthermore, the structural protein VP1 was under lower selection pressure and the positive selection occurred at many sites, and four codons (positions 141, 146, 151, and 169) were located in known critical antigenic residues. The remaining sites were not located in known functional regions and were moderately conserved, and the reason for supporting all sites under positive selection remains to be elucidated because the power of these analyses was largely unknown
- …