85 research outputs found

    A Free Lunch for Unsupervised Domain Adaptive Object Detection without Source Data

    Full text link
    Unsupervised domain adaptation (UDA) assumes that source and target domain data are freely available and usually trained together to reduce the domain gap. However, considering the data privacy and the inefficiency of data transmission, it is impractical in real scenarios. Hence, it draws our eyes to optimize the network in the target domain without accessing labeled source data. To explore this direction in object detection, for the first time, we propose a source data-free domain adaptive object detection (SFOD) framework via modeling it into a problem of learning with noisy labels. Generally, a straightforward method is to leverage the pre-trained network from the source domain to generate the pseudo labels for target domain optimization. However, it is difficult to evaluate the quality of pseudo labels since no labels are available in target domain. In this paper, self-entropy descent (SED) is a metric proposed to search an appropriate confidence threshold for reliable pseudo label generation without using any handcrafted labels. Nonetheless, completely clean labels are still unattainable. After a thorough experimental analysis, false negatives are found to dominate in the generated noisy labels. Undoubtedly, false negatives mining is helpful for performance improvement, and we ease it to false negatives simulation through data augmentation like Mosaic. Extensive experiments conducted in four representative adaptation tasks have demonstrated that the proposed framework can easily achieve state-of-the-art performance. From another view, it also reminds the UDA community that the labeled source data are not fully exploited in the existing methods.Comment: accepted by AAAI202

    Adapt Anything: Tailor Any Image Classifiers across Domains And Categories Using Text-to-Image Diffusion Models

    Full text link
    We do not pursue a novel method in this paper, but aim to study if a modern text-to-image diffusion model can tailor any task-adaptive image classifier across domains and categories. Existing domain adaptive image classification works exploit both source and target data for domain alignment so as to transfer the knowledge learned from the labeled source data to the unlabeled target data. However, as the development of the text-to-image diffusion model, we wonder if the high-fidelity synthetic data from the text-to-image generator can serve as a surrogate of the source data in real world. In this way, we do not need to collect and annotate the source data for each domain adaptation task in a one-for-one manner. Instead, we utilize only one off-the-shelf text-to-image model to synthesize images with category labels derived from the corresponding text prompts, and then leverage the surrogate data as a bridge to transfer the knowledge embedded in the task-agnostic text-to-image generator to the task-oriented image classifier via domain adaptation. Such a one-for-all adaptation paradigm allows us to adapt anything in the world using only one text-to-image generator as well as the corresponding unlabeled target data. Extensive experiments validate the feasibility of the proposed idea, which even surpasses the state-of-the-art domain adaptation works using the source data collected and annotated in real world.Comment: 11 pages, 6 figure

    Attention Diversification for Domain Generalization

    Full text link
    Convolutional neural networks (CNNs) have demonstrated gratifying results at learning discriminative features. However, when applied to unseen domains, state-of-the-art models are usually prone to errors due to domain shift. After investigating this issue from the perspective of shortcut learning, we find the devils lie in the fact that models trained on different domains merely bias to different domain-specific features yet overlook diverse task-related features. Under this guidance, a novel Attention Diversification framework is proposed, in which Intra-Model and Inter-Model Attention Diversification Regularization are collaborated to reassign appropriate attention to diverse task-related features. Briefly, Intra-Model Attention Diversification Regularization is equipped on the high-level feature maps to achieve in-channel discrimination and cross-channel diversification via forcing different channels to pay their most salient attention to different spatial locations. Besides, Inter-Model Attention Diversification Regularization is proposed to further provide task-related attention diversification and domain-related attention suppression, which is a paradigm of "simulate, divide and assemble": simulate domain shift via exploiting multiple domain-specific models, divide attention maps into task-related and domain-related groups, and assemble them within each group respectively to execute regularization. Extensive experiments and analyses are conducted on various benchmarks to demonstrate that our method achieves state-of-the-art performance over other competing methods. Code is available at https://github.com/hikvision-research/DomainGeneralization.Comment: ECCV 2022. Code available at https://github.com/hikvision-research/DomainGeneralizatio

    A Prospective Study of the Surgical Outcome of Simple Uvulopalatopharyngoplasty (UPPP), UPPP Combined With Genioglossus Advancement or Tongue Base Advancement for Obstructive Sleep Apnea Hypopnea Syndrome Patients With Multilevel Obstruction

    Get PDF
    ObjectivesTo investigate the surgical outcomes of different uvulopalatopharyngoplasty (UPPP).Methods All subjects underwent overnight polysomnography and were evaluated using the Epworth sleepiness scale (ESS), the Quebec sleep questionnaire and the snoring scale at the baseline and 3 and 12 months following operation. The primary endpoint was the overall effective rate representing the sum of the surgical success rate and effective rate.ResultsThe overall effective rate at 12 months post surgery was 55.6% for simple UPPP, 95.8% for UPPP+GA, and 92.3% for UPPP+TBA. The surgical success rate at 3 and 12 months postoperation for UPPP+GA or UPPP+TBA was significantly higher than simple UPPP (P<0.05). Marked improvement was observed in all patients in the snoring scale score and the ESS score 3 and 12 months following surgery compared to the baseline (P<0.05 in all).ConclusionUPPP, UPPP+GA, and UPPP+TBA are all effective in improving the surgical outcome of obstructive sleep apnea hypopnea syndrome (OSAHS) patients with multilevel obstruction. UPPP+TBA appears to be the most effective in treating OSAHS patients

    Laryngeal Reinnervation Using Ansa Cervicalis for Thyroid Surgery-Related Unilateral Vocal Fold Paralysis: A Long-Term Outcome Analysis of 237 Cases

    Get PDF
    To evaluate the long-term efficacy of delayed laryngeal reinnervation using the main branch of the ansa cervicalis in treatment of unilateral vocal fold paralysis (UVFP) caused by thyroid surgery.UVFP remains a serious complication of thyroid surgery. Up to now, a completely satisfactory surgical treatment of UVFP has been elusive.From Jan. 1996 to Jan. 2008, a total of 237 UVFP patients who underwent ansa cervicalis main branch-to-recurrent laryngeal nerve (RLN) anastomosis were enrolled as UVFP group; another 237 age- and gender-matched normal subjects served as control group. Videostroboscopy, vocal function assessment (acoustic analysis, perceptual evaluation and maximum phonation time), and electromyography were performed preoperatively and postoperatively. The mean follow-up period was 5.2±2.7 years, ranging from 2 to 12 years.>0.05, respectively). Postoperative laryngeal electromyography confirmed successful reinnervation of laryngeal muscle.Delayed laryngeal reinnervation with the main branch of ansa cervicalis is a feasible and effective approach for treatment of thyroid surgery-related UVFP; it can restore the physiological laryngeal phonatory function to the normal or a nearly normal voice quality

    Large expert-curated database for benchmarking document similarity detection in biomedical literature search

    Get PDF
    Document recommendation systems for locating relevant literature have mostly relied on methods developed a decade ago. This is largely due to the lack of a large offline gold-standard benchmark of relevant documents that cover a variety of research fields such that newly developed literature search techniques can be compared, improved and translated into practice. To overcome this bottleneck, we have established the RElevant LIterature SearcH consortium consisting of more than 1500 scientists from 84 countries, who have collectively annotated the relevance of over 180 000 PubMed-listed articles with regard to their respective seed (input) article/s. The majority of annotations were contributed by highly experienced, original authors of the seed articles. The collected data cover 76% of all unique PubMed Medical Subject Headings descriptors. No systematic biases were observed across different experience levels, research fields or time spent on annotations. More importantly, annotations of the same document pairs contributed by different scientists were highly concordant. We further show that the three representative baseline methods used to generate recommended articles for evaluation (Okapi Best Matching 25, Term Frequency-Inverse Document Frequency and PubMed Related Articles) had similar overall performances. Additionally, we found that these methods each tend to produce distinct collections of recommended articles, suggesting that a hybrid method may be required to completely capture all relevant articles. The established database server located at https://relishdb.ict.griffith.edu.au is freely available for the downloading of annotation data and the blind testing of new methods. We expect that this benchmark will be useful for stimulating the development of new powerful techniques for title and title/abstract-based search engines for relevant articles in biomedical research.Peer reviewe

    Generation of Uniform Hazard Spectrum Based on the Stochastic Method of Simulating Ground Motion and Its Use in Nuclear Power Plants

    No full text
    To obtain an accurate uniform hazard spectrum (UHS), this paper proposes combining a stochastic simulation with probabilistic seismic hazard analysis. The stochastic method fully accounts for the effect of the source mechanism, path, and site effect. Historical ground motions in the site specific to the nuclear power plant (NPP) are simulated, and a UHS with an equal exceeding probability is proposed. To compare the seismic performance of the NPP under different ground motions generated by the existing site spectrum (SL-2), the UHS generated by the safety evaluation report, and the US RG1.60 spectrum, respectively, a three-dimensional finite element model is established, and dynamic analysis is performed. Results show that the structural responses to different spectra varied; the UHS response was slightly larger than that of RG1.60. This finding is relatively more reasonable than prior research results. The UHS generated using the stochastic simulation method can provide a reference for the seismic design of NPPs
    corecore