58 research outputs found
AugNet: Dynamic Test-Time Augmentation via Differentiable Functions
Distribution shifts, which often occur in the real world, degrade the
accuracy of deep learning systems, and thus improving robustness is essential
for practical applications. To improve robustness, we study an image
enhancement method that generates recognition-friendly images without
retraining the recognition model. We propose a novel image enhancement method,
AugNet, which is based on differentiable data augmentation techniques and
generates a blended image from many augmented images to improve the recognition
accuracy under distribution shifts. In addition to standard data augmentations,
AugNet can also incorporate deep neural network-based image transformation,
which further improves the robustness. Because AugNet is composed of
differentiable functions, AugNet can be directly trained with the
classification loss of the recognition model. AugNet is evaluated on widely
used image recognition datasets using various classification models, including
Vision Transformer and MLP-Mixer. AugNet improves the robustness with almost no
reduction in classification accuracy for clean images, which is a better result
than the existing methods. Furthermore, we show that interpretation of
distribution shifts using AugNet and retraining based on that interpretation
can greatly improve robustness
A radioprotective agonist for p53 transactivation
Inhibiting p53-dependent apoptosis by inhibitors of p53 is an effective strategy for preventing radiation-induced damage in hematopoietic lineages, while p53 and p21 also play radioprotective roles in the gastrointestinal epithelium. We previously identified some zinc(II) chelators, including 8-quinolinol derivatives that suppress apoptosis in attempts to discover compounds that target the zinc-binding site in p53. We found that 5-chloro-8-quinolinol (5CHQ) has a unique p53-modulating activity that shifts its transactivation from proapoptotic to protective responses including enhancing p21 induction and suppressing PUMA induction. This p53-modulating activity also influenced p53 and p53-target gene expression in unirradiated cells without inducing DNA damage. The specificity of 5CHQ for p53 and p21 was demonstrated by silencing the expression of each protein. These effects seems to be attributable to the sequence-specific alteration of p53 DNA-binding, as evaluated by chromatin immunoprecipitation and electrophoretic mobility shift assays. In addition, 5-chloro-8-methoxyquinoline itself had no antiapoptotic activity, indicating that the hydroxyl group at the 8-position is required for its antiapoptotic activity. We applied this remarkable agonistic activity to protecting the hematopoietic and gastrointestinal system in mouse irradiation models. The dose-reduction factors of 5CHQ in total-body and abdominally irradiated mice were about 1.2 and 1.3, respectively. 5CHQ effectively protected mouse epithelial stem cells from a lethal dose of abdominal irradiation. Furthermore, the specificity of 5CHQ for p53 in reducing the lethality induced by abdominal irradiation was revealed in Trp53-KO mice. These results indicate that the pharmacological upregulation of radioprotective p53-target genes is an effective strategy for addressing the gastrointestinal syndrome
The whole blood transcriptional regulation landscape in 465 COVID-19 infected samples from Japan COVID-19 Task Force
「コロナ制圧タスクフォース」COVID-19患者由来の血液細胞における遺伝子発現の網羅的解析 --重症度に応じた遺伝子発現の変化には、ヒトゲノム配列の個人差が影響する--. 京都大学プレスリリース. 2022-08-23.Coronavirus disease 2019 (COVID-19) is a recently-emerged infectious disease that has caused millions of deaths, where comprehensive understanding of disease mechanisms is still unestablished. In particular, studies of gene expression dynamics and regulation landscape in COVID-19 infected individuals are limited. Here, we report on a thorough analysis of whole blood RNA-seq data from 465 genotyped samples from the Japan COVID-19 Task Force, including 359 severe and 106 non-severe COVID-19 cases. We discover 1169 putative causal expression quantitative trait loci (eQTLs) including 34 possible colocalizations with biobank fine-mapping results of hematopoietic traits in a Japanese population, 1549 putative causal splice QTLs (sQTLs; e.g. two independent sQTLs at TOR1AIP1), as well as biologically interpretable trans-eQTL examples (e.g., REST and STING1), all fine-mapped at single variant resolution. We perform differential gene expression analysis to elucidate 198 genes with increased expression in severe COVID-19 cases and enriched for innate immune-related functions. Finally, we evaluate the limited but non-zero effect of COVID-19 phenotype on eQTL discovery, and highlight the presence of COVID-19 severity-interaction eQTLs (ieQTLs; e.g., CLEC4C and MYBL2). Our study provides a comprehensive catalog of whole blood regulatory variants in Japanese, as well as a reference for transcriptional landscapes in response to COVID-19 infection
DOCK2 is involved in the host genetics and biology of severe COVID-19
「コロナ制圧タスクフォース」COVID-19疾患感受性遺伝子DOCK2の重症化機序を解明 --アジア最大のバイオレポジトリーでCOVID-19の治療標的を発見--. 京都大学プレスリリース. 2022-08-10.Identifying the host genetic factors underlying severe COVID-19 is an emerging challenge. Here we conducted a genome-wide association study (GWAS) involving 2, 393 cases of COVID-19 in a cohort of Japanese individuals collected during the initial waves of the pandemic, with 3, 289 unaffected controls. We identified a variant on chromosome 5 at 5q35 (rs60200309-A), close to the dedicator of cytokinesis 2 gene (DOCK2), which was associated with severe COVID-19 in patients less than 65 years of age. This risk allele was prevalent in East Asian individuals but rare in Europeans, highlighting the value of genome-wide association studies in non-European populations. RNA-sequencing analysis of 473 bulk peripheral blood samples identified decreased expression of DOCK2 associated with the risk allele in these younger patients. DOCK2 expression was suppressed in patients with severe cases of COVID-19. Single-cell RNA-sequencing analysis (n = 61 individuals) identified cell-type-specific downregulation of DOCK2 and a COVID-19-specific decreasing effect of the risk allele on DOCK2 expression in non-classical monocytes. Immunohistochemistry of lung specimens from patients with severe COVID-19 pneumonia showed suppressed DOCK2 expression. Moreover, inhibition of DOCK2 function with CPYPP increased the severity of pneumonia in a Syrian hamster model of SARS-CoV-2 infection, characterized by weight loss, lung oedema, enhanced viral loads, impaired macrophage recruitment and dysregulated type I interferon responses. We conclude that DOCK2 has an important role in the host immune response to SARS-CoV-2 infection and the development of severe COVID-19, and could be further explored as a potential biomarker and/or therapeutic target
Discovery of widespread transcription initiation at microsatellites predictable by sequence-based deep neural network
Using the Cap Analysis of Gene Expression (CAGE) technology, the FANTOM5 consortium provided one of the most comprehensive maps of transcription start sites (TSSs) in several species. Strikingly, ~72% of them could not be assigned to a specific gene and initiate at unconventional regions, outside promoters or enhancers. Here, we probe these unassigned TSSs and show that, in all species studied, a significant fraction of CAGE peaks initiate at microsatellites, also called short tandem repeats (STRs). To confirm this transcription, we develop Cap Trap RNA-seq, a technology which combines cap trapping and long read MinION sequencing. We train sequence-based deep learning models able to predict CAGE signal at STRs with high accuracy. These models unveil the importance of STR surrounding sequences not only to distinguish STR classes, but also to predict the level of transcription initiation. Importantly, genetic variants linked to human diseases are preferentially found at STRs with high transcription initiation level, supporting the biological and clinical relevance of transcription initiation at STRs. Together, our results extend the repertoire of non-coding transcription associated with DNA tandem repeats and complexify STR polymorphism
The Constrained Maximal Expression Level Owing to Haploidy Shapes Gene Content on the Mammalian X Chromosome.
X chromosomes are unusual in many regards, not least of which is their nonrandom gene content. The causes of this bias are commonly discussed in the context of sexual antagonism and the avoidance of activity in the male germline. Here, we examine the notion that, at least in some taxa, functionally biased gene content may more profoundly be shaped by limits imposed on gene expression owing to haploid expression of the X chromosome. Notably, if the X, as in primates, is transcribed at rates comparable to the ancestral rate (per promoter) prior to the X chromosome formation, then the X is not a tolerable environment for genes with very high maximal net levels of expression, owing to transcriptional traffic jams. We test this hypothesis using The Encyclopedia of DNA Elements (ENCODE) and data from the Functional Annotation of the Mammalian Genome (FANTOM5) project. As predicted, the maximal expression of human X-linked genes is much lower than that of genes on autosomes: on average, maximal expression is three times lower on the X chromosome than on autosomes. Similarly, autosome-to-X retroposition events are associated with lower maximal expression of retrogenes on the X than seen for X-to-autosome retrogenes on autosomes. Also as expected, X-linked genes have a lesser degree of increase in gene expression than autosomal ones (compared to the human/Chimpanzee common ancestor) if highly expressed, but not if lowly expressed. The traffic jam model also explains the known lower breadth of expression for genes on the X (and the Z of birds), as genes with broad expression are, on average, those with high maximal expression. As then further predicted, highly expressed tissue-specific genes are also rare on the X and broadly expressed genes on the X tend to be lowly expressed, both indicating that the trend is shaped by the maximal expression level not the breadth of expression per se. Importantly, a limit to the maximal expression level explains biased tissue of expression profiles of X-linked genes. Tissues whose tissue-specific genes are very highly expressed (e.g., secretory tissues, tissues abundant in structural proteins) are also tissues in which gene expression is relatively rare on the X chromosome. These trends cannot be fully accounted for in terms of alternative models of biased expression. In conclusion, the notion that it is hard for genes on the Therian X to be highly expressed, owing to transcriptional traffic jams, provides a simple yet robustly supported rationale of many peculiar features of X's gene content, gene expression, and evolution
Test-time Similarity Modification for Person Re-identification toward Temporal Distribution Shift
Person re-identification (re-id), which aims to retrieve images of the same
person in a given image from a database, is one of the most practical image
recognition applications. In the real world, however, the environments that the
images are taken from change over time. This causes a distribution shift
between training and testing and degrades the performance of re-id. To maintain
re-id performance, models should continue adapting to the test environment's
temporal changes. Test-time adaptation (TTA), which aims to adapt models to the
test environment with only unlabeled test data, is a promising way to handle
this problem because TTA can adapt models instantly in the test environment.
However, the previous TTA methods are designed for classification and cannot be
directly applied to re-id. This is because the set of people's identities in
the dataset differs between training and testing in re-id, whereas the set of
classes is fixed in the current TTA methods designed for classification. To
improve re-id performance in changing test environments, we propose TEst-time
similarity Modification for Person re-identification (TEMP), a novel TTA method
for re-id. TEMP is the first fully TTA method for re-id, which does not require
any modification to pre-training. Inspired by TTA methods that refine the
prediction uncertainty in classification, we aim to refine the uncertainty in
re-id. However, the uncertainty cannot be computed in the same way as
classification in re-id since it is an open-set task, which does not share
person labels between training and testing. Hence, we propose re-id entropy, an
alternative uncertainty measure for re-id computed based on the similarity
between the feature vectors. Experiments show that the re-id entropy can
measure the uncertainty on re-id and TEMP improves the performance of re-id in
online settings where the distribution changes over time.Comment: Accepted to IJCNN202
Improving Image Coding for Machines through Optimizing Encoder via Auxiliary Loss
Image coding for machines (ICM) aims to compress images for machine analysis
using recognition models rather than human vision. Hence, in ICM, it is
important for the encoder to recognize and compress the information necessary
for the machine recognition task. There are two main approaches in learned ICM;
optimization of the compression model based on task loss, and Region of
Interest (ROI) based bit allocation. These approaches provide the encoder with
the recognition capability. However, optimization with task loss becomes
difficult when the recognition model is deep, and ROI-based methods often
involve extra overhead during evaluation. In this study, we propose a novel
training method for learned ICM models that applies auxiliary loss to the
encoder to improve its recognition capability and rate-distortion performance.
Our method achieves Bjontegaard Delta rate improvements of 27.7% and 20.3% in
object detection and semantic segmentation tasks, compared to the conventional
training method.Comment: This version has been removed by arXiv administrators as the
submitter did not have the right to agree to the license at the time of
submissio
Test-time Adaptation Meets Image Enhancement: Improving Accuracy via Uncertainty-aware Logit Switching
Deep neural networks have achieved remarkable success in a variety of
computer vision applications. However, there is a problem of degrading accuracy
when the data distribution shifts between training and testing. As a solution
of this problem, Test-time Adaptation~(TTA) has been well studied because of
its practicality. Although TTA methods increase accuracy under distribution
shift by updating the model at test time, using high-uncertainty predictions is
known to degrade accuracy. Since the input image is the root of the
distribution shift, we incorporate a new perspective on enhancing the input
image into TTA methods to reduce the prediction's uncertainty. We hypothesize
that enhancing the input image reduces prediction's uncertainty and increase
the accuracy of TTA methods. On the basis of our hypothesis, we propose a novel
method: Test-time Enhancer and Classifier Adaptation~(TECA). In TECA, the
classification model is combined with the image enhancement model that
transforms input images into recognition-friendly ones, and these models are
updated by existing TTA methods. Furthermore, we found that the prediction from
the enhanced image does not always have lower uncertainty than the prediction
from the original image. Thus, we propose logit switching, which compares the
uncertainty measure of these predictions and outputs the lower one. In our
experiments, we evaluate TECA with various TTA methods and show that TECA
reduces prediction's uncertainty and increases accuracy of TTA methods despite
having no hyperparameters and little parameter overhead.Comment: Accepted to IJCNN202
- …