692 research outputs found

    Rethinking Gauss-Newton for learning over-parameterized models

    Full text link
    This work studies the global convergence and generalization properties of Gauss Newton's (GN) when optimizing one-hidden layer networks in the over-parameterized regime. We first establish a global convergence result for GN in the continuous-time limit exhibiting a faster convergence rate compared to GD due to improved conditioning. We then perform an empirical study on a synthetic regression task to investigate the implicit bias of GN's method. We find that, while GN is consistently faster than GD in finding a global optimum, the performance of the learned model on a test dataset is heavily influenced by both the learning rate and the variance of the randomly initialized network's weights. Specifically, we find that initializing with a smaller variance results in a better generalization, a behavior also observed for GD. However, in contrast to GD where larger learning rates lead to the best generalization, we find that GN achieves an improved generalization when using smaller learning rates, albeit at the cost of slower convergence. This study emphasizes the significance of the learning rate in balancing the optimization speed of GN with the generalization ability of the learned solution

    The relevance of posterior thalamo-cortical connectivity for visual short-term memory capacity

    Get PDF
    Visual short-term memory (vSTM) capacity represents the maximum number of visual items that can be perceived and stored into vSTM. One way to measure it is by using simple psycho-physical experiments together with the theory of visual attention (TVA) computational framework in which visual processing is conceived as a race between objects to be consciously perceived and stored into vSTM. The neural theory of visual attention (NTVA), which gives an interpretation of the TVA at both the cellular and systemic level, suggests that recurrent loops between posterior thalamus and visual cortices are relevant for vSTM capacity. Nevertheless, no clear evidence for the role of posterior thalamus and its connection to visual cortices in vSTM capacity has been found thus far. This thesis investigated the role of posterior thalamo-cortical connectivity in vSTM capacity in healthy young individuals as well as in two populations that have shown to exhibit both vSTM capacity impairments and posterior cortical and subcortical white matter damages: healthy aging and premature birth. We found that vSTM capacity in healthy young adults was significantly associated with the tracts connecting posterior thalamus to occipital cortices and their microstructure. However, this association was modified in elderly individuals and in young adults born prematurely, in which the recruitment of additional, cortico-cortical, tracts, takes place. Together, these findings bring the first structural evidence for the NTVA model with respect to the relevance of posterior thalamo-cortical tracts for vSTM capacity and show how alterations of these tracts affect vSTM capacity

    Self-Attention in Colors: Another Take on Encoding Graph Structure in Transformers

    Full text link
    We introduce a novel self-attention mechanism, which we call CSA (Chromatic Self-Attention), which extends the notion of attention scores to attention _filters_, independently modulating the feature channels. We showcase CSA in a fully-attentional graph Transformer CGT (Chromatic Graph Transformer) which integrates both graph structural information and edge features, completely bypassing the need for local message-passing components. Our method flexibly encodes graph structure through node-node interactions, by enriching the original edge features with a relative positional encoding scheme. We propose a new scheme based on random walks that encodes both structural and positional information, and show how to incorporate higher-order topological information, such as rings in molecular graphs. Our approach achieves state-of-the-art results on the ZINC benchmark dataset, while providing a flexible framework for encoding graph structure and incorporating higher-order topology

    The relevance of posterior thalamo-cortical connectivity for visual short-term memory capacity

    Get PDF
    Visual short-term memory (vSTM) capacity represents the maximum number of visual items that can be perceived and stored into vSTM. One way to measure it is by using simple psycho-physical experiments together with the theory of visual attention (TVA) computational framework in which visual processing is conceived as a race between objects to be consciously perceived and stored into vSTM. The neural theory of visual attention (NTVA), which gives an interpretation of the TVA at both the cellular and systemic level, suggests that recurrent loops between posterior thalamus and visual cortices are relevant for vSTM capacity. Nevertheless, no clear evidence for the role of posterior thalamus and its connection to visual cortices in vSTM capacity has been found thus far. This thesis investigated the role of posterior thalamo-cortical connectivity in vSTM capacity in healthy young individuals as well as in two populations that have shown to exhibit both vSTM capacity impairments and posterior cortical and subcortical white matter damages: healthy aging and premature birth. We found that vSTM capacity in healthy young adults was significantly associated with the tracts connecting posterior thalamus to occipital cortices and their microstructure. However, this association was modified in elderly individuals and in young adults born prematurely, in which the recruitment of additional, cortico-cortical, tracts, takes place. Together, these findings bring the first structural evidence for the NTVA model with respect to the relevance of posterior thalamo-cortical tracts for vSTM capacity and show how alterations of these tracts affect vSTM capacity

    An analysis of MRI derived cortical complexity in premature-born adults : regional patterns, risk factors, and potential significance

    Get PDF
    Premature birth bears an increased risk for aberrant brain development concerning its structure and function. Cortical complexity (CC) expresses the fractal dimension of the brain surface and changes during neurodevelopment. We hypothesized that CC is altered after premature birth and associated with long-term cognitive development. One-hundred-and-one very premature-born adults (gestational age <32 weeks and/or birth weight <1500 ​g) and 111 term-born adults were assessed by structural MRI and cognitive testing at 26 years of age. CC was measured based on MRI by vertex-wise estimation of fractal dimension. Cognitive performance was measured based on Griffiths-Mental-Development-Scale (at 20 months) and Wechsler-Adult-Intelligence-Scales (at 26 years). In premature-born adults, CC was decreased bilaterally in large lateral temporal and medial parietal clusters. Decreased CC was associated with lower gestational age and birth weight. Furthermore, decreased CC in the medial parietal cortices was linked with reduced full-scale IQ of premature-born adults and mediated the association between cognitive development at 20 months and IQ in adulthood. Results demonstrate that CC is reduced in very premature-born adults in temporoparietal cortices, mediating the impact of prematurity on impaired cognitive development. These data indicate functionally relevant long-term alterations in the brain’s basic geometry of cortical organization in prematurity

    Sequelae of premature birth in young adults

    Get PDF
    Background and Purpose Qualitative studies about the abnormalities appreciated on routine magnetic resonance imaging (MRI) sequences in prematurely born adults are lacking. This article aimed at filling this knowledge gap by (1) qualitatively describing routine imaging findings in prematurely born adults, (2) evaluating measures for routine image interpretation and (3) investigating the impact of perinatal variables related to premature birth. Methods In this study two board-certified radiologists assessed T1-weighted and FLAIR-weighted images of 100 prematurely born adults born very preterm (VP <32 weeks) and/or at very low birth weight (VLBW <1500 g) and 106 controls born at full term (FT) (mean age 26.8 ± 0.7 years). The number of white matter lesions (WML) was counted according to localization. Lateral ventricle volume (LVV) was evaluated subjectively and by measurements of Evans’ index (EI) and frontal-occipital-horn ratio (FOHR). Freesurfer-based volumetry served as reference standard. Miscellaneous incidental findings were noted as free text. Results The LVV was increased in 24.7% of VP/VLBW individuals and significantly larger than in FT controls. This was best identified by measurement of FOHR (AUC = 0.928). Ventricular enlargement was predicted by low gestational age (odds ratio: 0.71, 95% CI 0.51–0.98) and presence of neonatal intracranial hemorrhage (odds ratio: 0.26, 95% CI 0.07–0.92). The numbers of deep and periventricular WML were increased while subcortical WMLs were not. Conclusion Enlargement of the LVV and deep and periventricular WMLs are typical sequelae of premature birth that can be appreciated on routine brain MRI. To increase sensitivity of abnormal LVV detection, measurement of FOHR seems feasible in clinical practice

    Complex Grey Matter Structure Segmentation in Brains via Deep Learning: Example of the Claustrum

    Full text link
    Segmentationand parcellation of the brain has been widely performed on brain MRI using atlas-based methods. However, segmentation of the claustrum, a thin and sheet-like structure between insular cortex and putamen has not been amenable to automatized segmentation, thus limiting its investigation in larger imaging cohorts. Recently, deep-learning based approaches have been introduced for automated segmentation of brain structures, yielding great potential to overcome preexisting limitations. In the following, we present a multi-view deep-learning based approach to segment the claustrum in T1-weighted MRI scans. We trained and evaluated the proposed method on 181 manual bilateral claustrum annotations by an expert neuroradiologist serving as reference standard. Cross-validation experiments yielded median volumetric similarity, robust Hausdor? distance and Dice score of 93.3%, 1.41mm and 71.8% respectively which represents equal or superior segmentation performance compared to human intra-rater reliability. Leave-one-scanner-out evaluation showed good transfer-ability of the algorithm to images from unseen scanners, however at slightly inferior performance. Furthermore, we found that AI-based claustrum segmentation benefits from multi-view information and requires sample sizes of around 75 MRI scans in the training set. In conclusion, the developed algorithm has large potential in independent study cohorts and to facilitate MRI-based research of the human claustrum through automated segmentation. The software and models of our method are made publicly available.Comment: submitted to a journa

    Theory of visual attention's thalamic model for visual short-term memory capacity and top-down control: evidence from a thalamo-cortical structural connectivity analysis

    Get PDF
    In the theory of visual attention (TVA), it is suggested that objects in a visual scene compete for representation in a visual short-term memory (vSTM) store. The race towards the store is assumed to be biased by top-down controlled weighting of the objects according to their task relevance. Only objects that reach the store before its capacity limitation is reached are represented consciously in a given instant. TVA-based computational modeling of participants' performance in whole- and partial-report tasks permits independent parameters of individual efficiency of top-down control α and vSTM storage capacity K to be extracted. The neural interpretation of the TVA proposes recurrent loops between the posterior thalamus and posterior visual cortices to be relevant for generating attentional weights for competing objects and for maintaining selected objects in vSTM. Accordingly, we tested whether structural connectivity between posterior thalamus and occipital cortices (PT-OC) is associated with estimates of top-down control and vSTM capacity. We applied whole- and partial-report tasks and probabilistic tractography in a sample of 37 healthy adults. We found vSTM capacity K to be associated with left PT-OC structural connectivity and a trend-wise relation between top-down control α and right PT-OC structural connectivity. These findings support the assumption of the relevance of thalamic structures and their connections to visual cortex for top-down control and vSTM capacity

    Automated claustrum segmentation in human brain MRI using deep learning

    Get PDF
    In the last two decades, neuroscience has produced intriguing evidence for a central role of the claustrum in mammalian forebrain structure and function. However, relatively few in vivo studies of the claustrum exist in humans. A reason for this may be the delicate and sheet-like structure of the claustrum lying between the insular cortex and the putamen, which makes it not amenable to conventional segmentation methods. Recently, Deep Learning (DL) based approaches have been successfully introduced for automated segmentation of complex, subcortical brain structures. In the following, we present a multi-view DL-based approach to segment the claustrum in T1-weighted MRI scans. We trained and evaluated the proposed method in 181 individuals, using bilateral manual claustrum annotations by an expert neuroradiologist as reference standard. Cross-validation experiments yielded median volumetric similarity, robust Hausdorff distance, and Dice score of 93.3%, 1.41 mm, and 71.8%, respectively, representing equal or superior segmentation performance compared to human intra-rater reliability. The leave-one-scanner-out evaluation showed good transferability of the algorithm to images from unseen scanners at slightly inferior performance. Furthermore, we found that DL-based claustrum segmentation benefits from multi-view information and requires a sample size of around 75 MRI scans in the training set. We conclude that the developed algorithm allows for robust automated claustrum segmentation and thus yields considerable potential for facilitating MRI-based research of the human claustrum. The software and models of our method are made publicly available
    • …
    corecore