306 research outputs found

    Leveraging Automated Mixed-Low-Precision Quantization for Tiny Edge Microcontrollers

    Get PDF
    The severe on-chip memory limitations are currently preventing the deployment of the most accurate Deep Neural Network (DNN) models on tiny MicroController Units (MCUs), even if leveraging an effective 8-bit quantization scheme. To tackle this issue, in this paper we present an automated mixed-precision quantization flow based on the HAQ framework but tailored for the memory and computational characteristics of MCU devices. Specifically, a Reinforcement Learning agent searches for the best uniform quantization levels, among 2, 4, 8 bits, of individual weight and activation tensors, under the tight constraints on RAM and FLASH embedded memory sizes. We conduct an experimental analysis on MobileNetV1, MobileNetV2 and MNasNet models for Imagenet classification. Concerning the quantization policy search, the RL agent selects quantization policies that maximize the memory utilization. Given an MCU-class memory bound of 2 MB for weight-only quantization, the compressed models produced by the mixed-precision engine result as accurate as the state-of-the-art solutions quantized with a non-uniform function, which is not tailored for CPUs featuring integer-only arithmetic. This denotes the viability of uniform quantization, required for MCU deployments, for deep weights compression. When also limiting the activation memory budget to 512 kB, the best MobileNetV1 model scores up to 68.4% on Imagenet thanks to the found quantization policy, resulting to be 4% more accurate than the other 8-bit networks fitting the same memory constraints

    Large-scale annotation of proteins with labelling methods

    Get PDF
    We revise a major important problem in bioinformatics: how to annotate protein sequences in the genomic era and all the solutions that have been described by implementing tools based on labelling methods. In this paper we mainly focus on our own work and the theoretical methods that are popular in the field of biosequence analysis in modern molecular biology. We will also review a recent application from our group that largely improves on the topology prediction of disulfide bonds in proteins from Eukaryotic organisms

    SUS-BAR: a database of pig proteins with statistically validated structural and functional annotation

    Get PDF
    Given the relevance of the pig proteome in different studies, including human complex maladies, a statistical validation of the annotation is required for a better understanding of the role of specific genes and proteins in the complex networks underlying biological processes in the animal. Presently, approximately 80% of the pig proteome is still poorly annotated, and the existence of protein sequences is routinely inferred automatically by sequence alignment towards preexisting sequences. In this article, we introduce SUS-BAR, a database that derives information mainly from UniProt Knowledgebase and that includes 26 206 pig protein sequences. In SUS-BAR, 16 675 of the pig protein sequences are endowed with statistically validated functional and structural annotation. Our statistical validation is determined by adopting a cluster-centric annotation procedure that allows transfer of different types of annotation, including structure and function. Each sequence in the database can be associated with a set of statistically validated Gene Ontologies (GOs) of the three main sub-ontologies (Molecular Function, Biological Process and Cellular Component), with Pfam functional domains, and when possible, with a cluster Hidden Markov Model that allows modelling the 3D structure of the protein. A database search allows some statistics demonstrating the enrichment in both GO and Pfam annotations of the pig proteins as compared with UniProt Knowledgebase annotation. Searching in SUS-BAR allows retrieval of the pig protein annotation for further analysis. The search is also possible on the basis of specific GO terms and this allows retrieval of all the pig sequences participating into a given biological process, after annotation with our system. Alternatively, the search is possible on the basis of structural information, allowing retrieval of all the pig sequences with the same structural characteristics

    Reconstruction of protein structures from a vectorial representation

    Full text link
    We show that the contact map of the native structure of globular proteins can be reconstructed starting from the sole knowledge of the contact map's principal eigenvector, and present an exact algorithm for this purpose. Our algorithm yields a unique contact map for all 221 globular structures of PDBselect25 of length N120N \le 120. We also show that the reconstructed contact maps allow in turn for the accurate reconstruction of the three-dimensional structure. These results indicate that the reduced vectorial representation provided by the principal eigenvector of the contact map is equivalent to the protein structure itself. This representation is expected to provide a useful tool in bioinformatics algorithms for protein structure comparison and alignment, as well as a promising intermediate step towards protein structure prediction.Comment: 4 pages, 1 figur

    DDGun: An untrained method for the prediction of protein stability changes upon single and multiple point variations

    Get PDF
    Background: Predicting the effect of single point variations on protein stability constitutes a crucial step toward understanding the relationship between protein structure and function. To this end, several methods have been developed to predict changes in the Gibbs free energy of unfolding (\u3b4\u3b4G) between wild type and variant proteins, using sequence and structure information. Most of the available methods however do not exhibit the anti-symmetric prediction property, which guarantees that the predicted \u3b4\u3b4G value for a variation is the exact opposite of that predicted for the reverse variation, i.e., \u3b4\u3b4G(A \u2192 B) = -\u3b4\u3b4G(B \u2192 A), where A and B are amino acids. Results: Here we introduce simple anti-symmetric features, based on evolutionary information, which are combined to define an untrained method, DDGun (DDG untrained). DDGun is a simple approach based on evolutionary information that predicts the \u3b4\u3b4G for single and multiple variations from sequence and structure information (DDGun3D). Our method achieves remarkable performance without any training on the experimental datasets, reaching Pearson correlation coefficients between predicted and measured \u3b4\u3b4G values of ~ 0.5 and ~ 0.4 for single and multiple site variations, respectively. Surprisingly, DDGun performances are comparable with those of state of the art methods. DDGun also naturally predicts multiple site variations, thereby defining a benchmark method for both single site and multiple site predictors. DDGun is anti-symmetric by construction predicting the value of the \u3b4\u3b4G of a reciprocal variation as almost equal (depending on the sequence profile) to -\u3b4\u3b4G of the direct variation. This is a valuable property that is missing in the majority of the methods. Conclusions: Evolutionary information alone combined in an untrained method can achieve remarkably high performances in the prediction of \u3b4\u3b4G upon protein mutation. Non-trained approaches like DDGun represent a valid benchmark both for scoring the predictive power of the individual features and for assessing the learning capability of supervised methods

    Limitations and challenges in protein stability prediction upon genome variations: towards future applications in precision medicine

    Get PDF
    Protein stability predictions are becoming essential in medicine to develop novel immunotherapeutic agents and for drug discovery. Despite the large number of computational approaches for predicting the protein stability upon mutation, there are still critical unsolved problems: 1) the limited number of thermodynamic measurements for proteins provided by current databases; 2) the large intrinsic variability of \u394\u394G values due to different experimental conditions; 3) biases in the development of predictive methods caused by ignoring the anti-symmetry of \u394\u394G values between mutant and native protein forms; 4) over-optimistic prediction performance, due to sequence similarity between proteins used in training and test datasets. Here, we review these issues, highlighting new challenges required to improve current tools and to achieve more reliable predictions. In addition, we provide a perspective of how these methods will be beneficial for designing novel precision medicine approaches for several genetic disorders caused by mutations, such as cancer and neurodegenerative diseases

    Stereotactic Radiosurgery for Postoperative Spine Malignancy: A Systematic Review and International Stereotactic Radiosurgery Society Practice Guidelines.

    Get PDF
    To determine safety and efficacy of postoperative spine stereotactic body radiation therapy (SBRT) in the published literature, and to present practice recommendations on behalf of the International Stereotactic Radiosurgery Society. A systematic review of the literature was performed, specific to postoperative spine SBRT, using PubMed and Embase databases. A meta-analysis for 1-year local control (LC), overall survival (OS), and vertebral compression fracture probability was conducted. The literature search revealed 251 potentially relevant articles after duplicates were removed. Of these 56 were reviewed in-depth for eligibility and 12 met all the inclusion criteria for analysis. 7 studies were retrospective, 2 prospective observational and 3 were prospective phase 1 and 2 clinical trials. Outcomes for a total of 461 patients and 499 spinal segments were reported. Ten studies used a magnetic resonance imaging (MRI) scan fused to computed tomography (CT) simulation for treatment planning, and 2 investigations reported on all patients receiving a CT-myelogram at the time of planning. Meta-analysis for 1 year LC and OS was 88.9% and 57%, respectively. The crude reported vertebral compression fracture rate was 5.6%. One case of myelopathy was described in a patient with a previously irradiated spinal segment. One patient developed an esophageal fistula requiring surgical repair. Postoperative spine SBRT delivers a high 1-year LC with acceptably low toxicity. Patients who may benefit from this include those with oligometastatic disease, radioresistant histology, paraspinal masses, or those with a history of prior irradiation to the affected spinal segment. The International Stereotactic Radiosurgery Society recommends a minimum interval of 8 to 14 days after invasive surgery before simulation for SBRT, with initiation of radiation therapy within 4 weeks of surgery. An MRI fused to the planning CT, or the use of a CT-myelogram, are necessary for target and organ-at-risk delineation. A planning organ-at-risk volume (PRV) of 1.5 to 2 mm for the spinal cord is advised

    New decoding algorithms for Hidden Markov Models using distance measures on labellings

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Existing hidden Markov model decoding algorithms do not focus on approximately identifying the sequence feature boundaries.</p> <p>Results</p> <p>We give a set of algorithms to compute the conditional probability of all labellings "near" a reference labelling <it>λ </it>for a sequence <it>y </it>for a variety of definitions of "near". In addition, we give optimization algorithms to find the best labelling for a sequence in the robust sense of having all of its feature boundaries nearly correct. Natural problems in this domain are <it>NP</it>-hard to optimize. For membrane proteins, our algorithms find the approximate topology of such proteins with comparable success to existing programs, while being substantially more accurate in estimating the positions of transmembrane helix boundaries.</p> <p>Conclusion</p> <p>More robust HMM decoding may allow for better analysis of sequence features, in reasonable runtimes.</p
    corecore