179 research outputs found

    Hippocampal sparing in whole-brain radiotherapy for brain metastases: controversy, technology and the future

    Get PDF
    Whole-brain radiotherapy (WBRT) plays an irreplaceable role in the treatment of brain metastases (BMs), but cognitive decline after WBRT seriously affects patients’ quality of life. The development of cognitive dysfunction is closely related to hippocampal injury, but standardized criteria for predicting hippocampal injury and dose limits for hippocampal protection have not yet been developed. This review systematically reviews the clinical efficacy of hippocampal avoidance - WBRT (HA-WBRT), the controversy over dose limits, common methods and characteristics of hippocampal imaging and segmentation, differences in hippocampal protection by common radiotherapy (RT) techniques, and the application of artificial intelligence (AI) and radiomic techniques for hippocampal protection. In the future, the application of new techniques and methods can improve the consistency of hippocampal dose limit determination and the prediction of the occurrence of cognitive dysfunction in WBRT patients, avoiding the occurrence of cognitive dysfunction in patients and thus benefiting more patients with BMs

    Are All Losses Created Equal: A Neural Collapse Perspective

    Full text link
    While cross entropy (CE) is the most commonly used loss to train deep neural networks for classification tasks, many alternative losses have been developed to obtain better empirical performance. Among them, which one is the best to use is still a mystery, because there seem to be multiple factors affecting the answer, such as properties of the dataset, the choice of network architecture, and so on. This paper studies the choice of loss function by examining the last-layer features of deep networks, drawing inspiration from a recent line work showing that the global optimal solution of CE and mean-square-error (MSE) losses exhibits a Neural Collapse phenomenon. That is, for sufficiently large networks trained until convergence, (i) all features of the same class collapse to the corresponding class mean and (ii) the means associated with different classes are in a configuration where their pairwise distances are all equal and maximized. We extend such results and show through global solution and landscape analyses that a broad family of loss functions including commonly used label smoothing (LS) and focal loss (FL) exhibits Neural Collapse. Hence, all relevant losses(i.e., CE, LS, FL, MSE) produce equivalent features on training data. Based on the unconstrained feature model assumption, we provide either the global landscape analysis for LS loss or the local landscape analysis for FL loss and show that the (only!) global minimizers are neural collapse solutions, while all other critical points are strict saddles whose Hessian exhibit negative curvature directions either in the global scope for LS loss or in the local scope for FL loss near the optimal solution. The experiments further show that Neural Collapse features obtained from all relevant losses lead to largely identical performance on test data as well, provided that the network is sufficiently large and trained until convergence.Comment: 32 page, 10 figures, NeurIPS 202

    Dynamic response analysis of intake tower in hydroelectric power station with high surrounding rock

    Get PDF
    This paper presents results of numerical analysis for the seismic response of hydropower station intake tower in step-like ground based on artificial boundary theory. A L topography finite element model was established to verify the correctness of the proposed method of viscous elasticity boundary by consider inconsistent reflective surface. The method was applied to an intake tower, and the acceleration of bedrock was obtained by seismic inversion method, then the equivalent load of each node was calculated. Five different models were established as follow: massless foundation, consistent input viscous elasticity boundary, inconsistent input viscous elasticity boundary and whether set contact, then displacement and stress were compared, the results show that the proposed method with contact was minimal. The base plate of intake tower and the foundation were in close adhesion state in the whole process of earthquake, both sides and rear side of intake tower without through disengagement phenomena from rock, it can conclude that the intake tower in the overall stability state

    Weakly-supervised High-resolution Segmentation of Mammography Images for Breast Cancer Diagnosis

    Full text link
    In the last few years, deep learning classifiers have shown promising results in image-based medical diagnosis. However, interpreting the outputs of these models remains a challenge. In cancer diagnosis, interpretability can be achieved by localizing the region of the input image responsible for the output, i.e. the location of a lesion. Alternatively, segmentation or detection models can be trained with pixel-wise annotations indicating the locations of malignant lesions. Unfortunately, acquiring such labels is labor-intensive and requires medical expertise. To overcome this difficulty, weakly-supervised localization can be utilized. These methods allow neural network classifiers to output saliency maps highlighting the regions of the input most relevant to the classification task (e.g. malignant lesions in mammograms) using only image-level labels (e.g. whether the patient has cancer or not) during training. When applied to high-resolution images, existing methods produce low-resolution saliency maps. This is problematic in applications in which suspicious lesions are small in relation to the image size. In this work, we introduce a novel neural network architecture to perform weakly-supervised segmentation of high-resolution images. The proposed model selects regions of interest via coarse-level localization, and then performs fine-grained segmentation of those regions. We apply this model to breast cancer diagnosis with screening mammography, and validate it on a large clinically-realistic dataset. Measured by Dice similarity score, our approach outperforms existing methods by a large margin in terms of localization performance of benign and malignant lesions, relatively improving the performance by 39.6% and 20.0%, respectively. Code and the weights of some of the models are available at https://github.com/nyukat/GLAMComment: The last two authors contributed equally. Accepted to Medical Imaging with Deep Learning (MIDL) 202

    Cram\'er-Rao bound-informed training of neural networks for quantitative MRI

    Full text link
    Neural networks are increasingly used to estimate parameters in quantitative MRI, in particular in magnetic resonance fingerprinting. Their advantages over the gold standard non-linear least square fitting are their superior speed and their immunity to the non-convexity of many fitting problems. We find, however, that in heterogeneous parameter spaces, i.e. in spaces in which the variance of the estimated parameters varies considerably, good performance is hard to achieve and requires arduous tweaking of the loss function, hyper parameters, and the distribution of the training data in parameter space. Here, we address these issues with a theoretically well-founded loss function: the Cram\'er-Rao bound (CRB) provides a theoretical lower bound for the variance of an unbiased estimator and we propose to normalize the squared error with respective CRB. With this normalization, we balance the contributions of hard-to-estimate and not-so-hard-to-estimate parameters and areas in parameter space, and avoid a dominance of the former in the overall training loss. Further, the CRB-based loss function equals one for a maximally-efficient unbiased estimator, which we consider the ideal estimator. Hence, the proposed CRB-based loss function provides an absolute evaluation metric. We compare a network trained with the CRB-based loss with a network trained with the commonly used means squared error loss and demonstrate the advantages of the former in numerical, phantom, and in vivo experiments.Comment: Xiaoxia Zhang, Quentin Duchemin, and Kangning Liu contributed equally to this wor

    The stability evaluation of clay tunnels via the non-linear deterioration of physical and mechanical properties of surrounding rocks

    Get PDF
    Simple, fast, and reliable methods for the stability evaluation of tunnels can facilitate the construction and development of tunneling projects. The problems related to tunnel stability at this stage can be well analyzed via theoretical analysis method, model test method, or numerical analysis method. On the other hand, those methods are hard to be effectively analyzed these projects with higher importance, shorter decision and design period, and more urgent construction period. This paper proposed research works on the stability evaluation of clay tunnels. Firstly, a state function with the variables of stress and strain state is presented to predict the stress and strain states of surrounding rocks caused by tunnel excavation, which characterize the physical-mechanical state of surrounding rocks (also called stability state). Secondly, the non-linear deterioration of the physical and mechanical properties of surrounding rocks will be simulated, and the expressions and calculation methods of the tunnel stability reserve factor will be yielded. Finally, the results of the proposed method were compared with the strength reduction method and the limit equilibrium method with a clay tunnel example. The comparison between the three feature points of the arch crown, sidewall, and arch bottom showed that the stability reserve factor of the clay tunnel was smaller than those of the strength reduction method and the limit equilibrium method. The values of limit displacement obtained by the proposed method were closer to the field monitoring data than that of the strength reduction method. Therefore, this study could be better applied to the stability evaluation of clay tunnels

    Rapid quantitative magnetization transfer imaging: utilizing the hybrid state and the generalized Bloch model

    Full text link
    Purpose: To improve spatial resolution and scan time of quantitative magnetization transfer (qMT) imaging without constraints on model parameters. Theory and Methods: We combine two recently-proposed models in a Bloch-McConnell equation: the dynamics of the free spin pool is confined to the hybrid state and the dynamics of the semi-solid spin pool is described by the generalized Bloch model. We numerically optimize the flip angles and durations of a train of radio frequency pulses to enhance the encoding of three marked qMT parameters while accounting for an 8-parameter model. We sparsely sample each time frame along this spin dynamics with a 3D radial koosh-ball trajectory, reconstruct the data with sub-space modeling, and fit the qMT model with a neural network for computational efficiency. Results: We were able to extract qMT parameter maps of the whole brain with a nominal resolution of 1mm isotropic and high SNR from a 12.6 minute scan. In lesions of multiple sclerosis subjects, we observe a decreased size of the semi-solid spin pool and slower relaxation, consistent with previous reports. Conclusion: The encoding power of the hybrid state, combined with regularized image reconstruction, and the accuracy of the generalized Bloch model provide an excellent basis for highly efficient quantitative magnetization transfer imaging
    • …
    corecore