23 research outputs found

    Digital Language Death

    Get PDF
    Of the approximately 7,000 languages spoken today, some 2,500 are generally considered endangered. Here we argue that this consensus figure vastly underestimates the danger of digital language death, in that less than 5% of all languages can still ascend to the digital realm. We present evidence of a massive die-off caused by the digital divide

    New Computational Tools for Sample Purification and Early-Stage Data Processing in High-Resolution Cryo-Electron Microscopy

    Get PDF
    Large macromolecular protein complexes play an important role in the quest of un- derstanding life at a molecular level. For this endeavor, the highly-resolved 3D structure of a large complex and its changes, which are called conformational dy- namics, are of high interest due to their close relation to biological function. A pop- ular method to gain access to this information is cryo-electron microscopy. Here, biological samples are vitrified in their native buffer conditions and then im- aged in a transmission electron microscope. The large amount of recorded noisy images, called micrographs, are then subjected to a series of sophisticated, compu- tationally expensive algorithmic steps that include particle extraction, averaging of similar particle images, angular assignment and 3D reconstruction. While a growing amount of 3D structures determined by cryo EM reach resolutions that allow atomic model building, a large share of structures never reach that level of possible biolog- ical detail. In this thesis, improvements to two of the problems that can limit the resolution of structures are presented: Sample quality and elimination of suboptimal micro- graphs. First, a new software for simulation of density gradient centrifugation ex- periments, which is a popular method for sample purification, is presented that can aid users in finding the right conditions to optimize their purification proto- cols through new algorithmic approaches. This can lead to purer and more stable samples that improve the recorded images. Then, a new software for live processing and machine learning based classification of TEM images whose quality is judged live by a user is introduced. This software, called the Micrograph Quality Checker, is part of the COW, which is a novel single particle cryo EM image processing framework that covers the whole data analysis workflow. Elimination of unwanted data and collection of metadata in parallel to image acquisition can reduce the data overhead, improve computational speed and can also lead to higher resolution in the obtained structures due to the self-referential image processing
    corecore