87 research outputs found

    AdvFoolGen: Creating Persistent Troubles for Deep Classifiers

    Full text link
    Researches have shown that deep neural networks are vulnerable to malicious attacks, where adversarial images are created to trick a network into misclassification even if the images may give rise to totally different labels by human eyes. To make deep networks more robust to such attacks, many defense mechanisms have been proposed in the literature, some of which are quite effective for guarding against typical attacks. In this paper, we present a new black-box attack termed AdvFoolGen, which can generate attacking images from the same feature space as that of the natural images, so as to keep baffling the network even though state-of-the-art defense mechanisms have been applied. We systematically evaluate our model by comparing with well-established attack algorithms. Through experiments, we demonstrate the effectiveness and robustness of our attack in the face of state-of-the-art defense techniques and unveil the potential reasons for its effectiveness through principled analysis. As such, AdvFoolGen contributes to understanding the vulnerability of deep networks from a new perspective and may, in turn, help in developing and evaluating new defense mechanisms.Comment: 11 pages, 5 figure

    Benchmarking a foundation LLM on its ability to re-label structure names in accordance with the AAPM TG-263 report

    Full text link
    Purpose: To introduce the concept of using large language models (LLMs) to re-label structure names in accordance with the American Association of Physicists in Medicine (AAPM) Task Group (TG)-263 standard, and to establish a benchmark for future studies to reference. Methods and Materials: The Generative Pre-trained Transformer (GPT)-4 application programming interface (API) was implemented as a Digital Imaging and Communications in Medicine (DICOM) storage server, which upon receiving a structure set DICOM file, prompts GPT-4 to re-label the structure names of both target volumes and normal tissues according to the AAPM TG-263. Three disease sites, prostate, head and neck, and thorax were selected for evaluation. For each disease site category, 150 patients were randomly selected for manually tuning the instructions prompt (in batches of 50) and 50 patients were randomly selected for evaluation. Structure names that were considered were those that were most likely to be relevant for studies utilizing structure contours for many patients. Results: The overall re-labeling accuracy of both target volumes and normal tissues for prostate, head and neck, and thorax cases was 96.0%, 98.5%, and 96.9% respectively. Re-labeling of target volumes was less accurate on average except for prostate - 100%, 93.1%, and 91.1% respectively. Conclusions: Given the accuracy of GPT-4 in re-labeling structure names of both target volumes and normal tissues as presented in this work, LLMs are poised to be the preferred method for standardizing structure names in radiation oncology, especially considering the rapid advancements in LLM capabilities that are likely to continue.Comment: 20 pages, 5 figures, 1 tabl

    Evaluating Large Language Models on a Highly-specialized Topic, Radiation Oncology Physics

    Full text link
    We present the first study to investigate Large Language Models (LLMs) in answering radiation oncology physics questions. Because popular exams like AP Physics, LSAT, and GRE have large test-taker populations and ample test preparation resources in circulation, they may not allow for accurately assessing the true potential of LLMs. This paper proposes evaluating LLMs on a highly-specialized topic, radiation oncology physics, which may be more pertinent to scientific and medical communities in addition to being a valuable benchmark of LLMs. We developed an exam consisting of 100 radiation oncology physics questions based on our expertise at Mayo Clinic. Four LLMs, ChatGPT (GPT-3.5), ChatGPT (GPT-4), Bard (LaMDA), and BLOOMZ, were evaluated against medical physicists and non-experts. ChatGPT (GPT-4) outperformed all other LLMs as well as medical physicists, on average. The performance of ChatGPT (GPT-4) was further improved when prompted to explain first, then answer. ChatGPT (GPT-3.5 and GPT-4) showed a high level of consistency in its answer choices across a number of trials, whether correct or incorrect, a characteristic that was not observed in the human test groups. In evaluating ChatGPTs (GPT-4) deductive reasoning ability using a novel approach (substituting the correct answer with "None of the above choices is the correct answer."), ChatGPT (GPT-4) demonstrated surprising accuracy, suggesting the potential presence of an emergent ability. Finally, although ChatGPT (GPT-4) performed well overall, its intrinsic properties did not allow for further improvement when scoring based on a majority vote across trials. In contrast, a team of medical physicists were able to greatly outperform ChatGPT (GPT-4) using a majority vote. This study suggests a great potential for LLMs to work alongside radiation oncology experts as highly knowledgeable assistants

    Deep-Learning-based Fast and Accurate 3D CT Deformable Image Registration in Lung Cancer

    Full text link
    Purpose: In some proton therapy facilities, patient alignment relies on two 2D orthogonal kV images, taken at fixed, oblique angles, as no 3D on-the-bed imaging is available. The visibility of the tumor in kV images is limited since the patient's 3D anatomy is projected onto a 2D plane, especially when the tumor is behind high-density structures such as bones. This can lead to large patient setup errors. A solution is to reconstruct the 3D CT image from the kV images obtained at the treatment isocenter in the treatment position. Methods: An asymmetric autoencoder-like network built with vision-transformer blocks was developed. The data was collected from 1 head and neck patient: 2 orthogonal kV images (1024x1024 voxels), 1 3D CT with padding (512x512x512) acquired from the in-room CT-on-rails before kVs were taken and 2 digitally-reconstructed-radiograph (DRR) images (512x512) based on the CT. We resampled kV images every 8 voxels and DRR and CT every 4 voxels, thus formed a dataset consisting of 262,144 samples, in which the images have a dimension of 128 for each direction. In training, both kV and DRR images were utilized, and the encoder was encouraged to learn the jointed feature map from both kV and DRR images. In testing, only independent kV images were used. The full-size synthetic CT (sCT) was achieved by concatenating the sCTs generated by the model according to their spatial information. The image quality of the synthetic CT (sCT) was evaluated using mean absolute error (MAE) and per-voxel-absolute-CT-number-difference volume histogram (CDVH). Results: The model achieved a speed of 2.1s and a MAE of <40HU. The CDVH showed that <5% of the voxels had a per-voxel-absolute-CT-number-difference larger than 185 HU. Conclusion: A patient-specific vision-transformer-based network was developed and shown to be accurate and efficient to reconstruct 3D CT images from kV images.Comment: 9 figure

    Artificial General Intelligence for Radiation Oncology

    Full text link
    The emergence of artificial general intelligence (AGI) is transforming radiation oncology. As prominent vanguards of AGI, large language models (LLMs) such as GPT-4 and PaLM 2 can process extensive texts and large vision models (LVMs) such as the Segment Anything Model (SAM) can process extensive imaging data to enhance the efficiency and precision of radiation therapy. This paper explores full-spectrum applications of AGI across radiation oncology including initial consultation, simulation, treatment planning, treatment delivery, treatment verification, and patient follow-up. The fusion of vision data with LLMs also creates powerful multimodal models that elucidate nuanced clinical patterns. Together, AGI promises to catalyze a shift towards data-driven, personalized radiation therapy. However, these models should complement human expertise and care. This paper provides an overview of how AGI can transform radiation oncology to elevate the standard of patient care in radiation oncology, with the key insight being AGI's ability to exploit multimodal clinical data at scale

    The succession of rhizosphere microbial community in the continuous cropping soil of tobacco

    Get PDF
    Introduction: Flue-cured tobacco is an important economic crop that is not tolerant of continuous cropping and can be influenced by planting soil conditions including rhizosphere microbial communities and soil physicochemical properties. The relationship between rhizosphere microbial communities and soil physicochemical properties under continuous cropping conditions is unclear.Methods: This study investigated the succession of rhizosphere microbial community in continuous tobacco cropping soil for 1, 3, 5, 8, 10, 15, and 30 years. The physicochemical properties of the soil were measured, high-throughput sequencing was performed on the rhizosphere microbial community, and correlation analysis was conducted.Results: The results suggested that continuous cropping could significantly enrich soil available nitrogen, available phosphorus, available potassium, and organic matter. Meanwhile, the alpha diversity of the bacterial community was significantly reduced with continuous cropping, indicating significant changes in the structure of bacterial and fungal communities. Based on linear discriminant analysis effect size (LEfSe), 173 bacterial and 75 fungal genera were identified with significant differences. The bacterial genera, Sphingomonas, Streptomyces, and Microvirga, were significantly positively correlated with continuous cropping years. The fungal genera, Tausonia, Solicocozyma, Pseudomycohila, and Fusarium, also showed significant positive correlation with continuous cropping years. Meanwhile, the fungal genera, Olpidium, Cephaliophora, and Cercophora, presented an opposite correlation. However, there are differences in the correlation between these bacterial and fungal genera related to continuous cropping years and other different soil physicochemical properties.Discussion: In summary, this work could provide a reference for soil management and scientific fertilization of tobacco under continuous cropping conditions

    Evolutionary Analysis of Structural Protein Gene VP1 of Foot-and-Mouth Disease Virus Serotype Asia 1

    Get PDF
    Foot-and-mouth disease virus (FMDV) serotype Asia 1 was mostly endemic in Asia and then was responsible for economically important viral disease of cloven-hoofed animals, but the study on its selection and evolutionary process is comparatively rare. In this study, we characterized 377 isolates from Asia collected up until 2012, including four vaccine strains. Maximum likelihood analysis suggested that the strains circulating in Asia were classified into 8 different groups (groups I–VIII) or were unclassified (viruses collected before 2000). On the basis of divergence time analyses, we infer that the TMRCA of Asia 1 virus existed approximately 86.29 years ago. The result suggested that the virus had a high mutation rate (5.745 × 10−3 substitutions/site/year) in comparison to the other serotypes of FMDV VP1 gene. Furthermore, the structural protein VP1 was under lower selection pressure and the positive selection occurred at many sites, and four codons (positions 141, 146, 151, and 169) were located in known critical antigenic residues. The remaining sites were not located in known functional regions and were moderately conserved, and the reason for supporting all sites under positive selection remains to be elucidated because the power of these analyses was largely unknown
    • …
    corecore