85 research outputs found
A Free Lunch for Unsupervised Domain Adaptive Object Detection without Source Data
Unsupervised domain adaptation (UDA) assumes that source and target domain
data are freely available and usually trained together to reduce the domain
gap. However, considering the data privacy and the inefficiency of data
transmission, it is impractical in real scenarios. Hence, it draws our eyes to
optimize the network in the target domain without accessing labeled source
data. To explore this direction in object detection, for the first time, we
propose a source data-free domain adaptive object detection (SFOD) framework
via modeling it into a problem of learning with noisy labels. Generally, a
straightforward method is to leverage the pre-trained network from the source
domain to generate the pseudo labels for target domain optimization. However,
it is difficult to evaluate the quality of pseudo labels since no labels are
available in target domain. In this paper, self-entropy descent (SED) is a
metric proposed to search an appropriate confidence threshold for reliable
pseudo label generation without using any handcrafted labels. Nonetheless,
completely clean labels are still unattainable. After a thorough experimental
analysis, false negatives are found to dominate in the generated noisy labels.
Undoubtedly, false negatives mining is helpful for performance improvement, and
we ease it to false negatives simulation through data augmentation like Mosaic.
Extensive experiments conducted in four representative adaptation tasks have
demonstrated that the proposed framework can easily achieve state-of-the-art
performance. From another view, it also reminds the UDA community that the
labeled source data are not fully exploited in the existing methods.Comment: accepted by AAAI202
Adapt Anything: Tailor Any Image Classifiers across Domains And Categories Using Text-to-Image Diffusion Models
We do not pursue a novel method in this paper, but aim to study if a modern
text-to-image diffusion model can tailor any task-adaptive image classifier
across domains and categories. Existing domain adaptive image classification
works exploit both source and target data for domain alignment so as to
transfer the knowledge learned from the labeled source data to the unlabeled
target data. However, as the development of the text-to-image diffusion model,
we wonder if the high-fidelity synthetic data from the text-to-image generator
can serve as a surrogate of the source data in real world. In this way, we do
not need to collect and annotate the source data for each domain adaptation
task in a one-for-one manner. Instead, we utilize only one off-the-shelf
text-to-image model to synthesize images with category labels derived from the
corresponding text prompts, and then leverage the surrogate data as a bridge to
transfer the knowledge embedded in the task-agnostic text-to-image generator to
the task-oriented image classifier via domain adaptation. Such a one-for-all
adaptation paradigm allows us to adapt anything in the world using only one
text-to-image generator as well as the corresponding unlabeled target data.
Extensive experiments validate the feasibility of the proposed idea, which even
surpasses the state-of-the-art domain adaptation works using the source data
collected and annotated in real world.Comment: 11 pages, 6 figure
Attention Diversification for Domain Generalization
Convolutional neural networks (CNNs) have demonstrated gratifying results at
learning discriminative features. However, when applied to unseen domains,
state-of-the-art models are usually prone to errors due to domain shift. After
investigating this issue from the perspective of shortcut learning, we find the
devils lie in the fact that models trained on different domains merely bias to
different domain-specific features yet overlook diverse task-related features.
Under this guidance, a novel Attention Diversification framework is proposed,
in which Intra-Model and Inter-Model Attention Diversification Regularization
are collaborated to reassign appropriate attention to diverse task-related
features. Briefly, Intra-Model Attention Diversification Regularization is
equipped on the high-level feature maps to achieve in-channel discrimination
and cross-channel diversification via forcing different channels to pay their
most salient attention to different spatial locations. Besides, Inter-Model
Attention Diversification Regularization is proposed to further provide
task-related attention diversification and domain-related attention
suppression, which is a paradigm of "simulate, divide and assemble": simulate
domain shift via exploiting multiple domain-specific models, divide attention
maps into task-related and domain-related groups, and assemble them within each
group respectively to execute regularization. Extensive experiments and
analyses are conducted on various benchmarks to demonstrate that our method
achieves state-of-the-art performance over other competing methods. Code is
available at https://github.com/hikvision-research/DomainGeneralization.Comment: ECCV 2022. Code available at
https://github.com/hikvision-research/DomainGeneralizatio
A Prospective Study of the Surgical Outcome of Simple Uvulopalatopharyngoplasty (UPPP), UPPP Combined With Genioglossus Advancement or Tongue Base Advancement for Obstructive Sleep Apnea Hypopnea Syndrome Patients With Multilevel Obstruction
ObjectivesTo investigate the surgical outcomes of different uvulopalatopharyngoplasty (UPPP).Methods All subjects underwent overnight polysomnography and were evaluated using the Epworth sleepiness scale (ESS), the Quebec sleep questionnaire and the snoring scale at the baseline and 3 and 12 months following operation. The primary endpoint was the overall effective rate representing the sum of the surgical success rate and effective rate.ResultsThe overall effective rate at 12 months post surgery was 55.6% for simple UPPP, 95.8% for UPPP+GA, and 92.3% for UPPP+TBA. The surgical success rate at 3 and 12 months postoperation for UPPP+GA or UPPP+TBA was significantly higher than simple UPPP (P<0.05). Marked improvement was observed in all patients in the snoring scale score and the ESS score 3 and 12 months following surgery compared to the baseline (P<0.05 in all).ConclusionUPPP, UPPP+GA, and UPPP+TBA are all effective in improving the surgical outcome of obstructive sleep apnea hypopnea syndrome (OSAHS) patients with multilevel obstruction. UPPP+TBA appears to be the most effective in treating OSAHS patients
Laryngeal Reinnervation Using Ansa Cervicalis for Thyroid Surgery-Related Unilateral Vocal Fold Paralysis: A Long-Term Outcome Analysis of 237 Cases
To evaluate the long-term efficacy of delayed laryngeal reinnervation using the main branch of the ansa cervicalis in treatment of unilateral vocal fold paralysis (UVFP) caused by thyroid surgery.UVFP remains a serious complication of thyroid surgery. Up to now, a completely satisfactory surgical treatment of UVFP has been elusive.From Jan. 1996 to Jan. 2008, a total of 237 UVFP patients who underwent ansa cervicalis main branch-to-recurrent laryngeal nerve (RLN) anastomosis were enrolled as UVFP group; another 237 age- and gender-matched normal subjects served as control group. Videostroboscopy, vocal function assessment (acoustic analysis, perceptual evaluation and maximum phonation time), and electromyography were performed preoperatively and postoperatively. The mean follow-up period was 5.2±2.7 years, ranging from 2 to 12 years.>0.05, respectively). Postoperative laryngeal electromyography confirmed successful reinnervation of laryngeal muscle.Delayed laryngeal reinnervation with the main branch of ansa cervicalis is a feasible and effective approach for treatment of thyroid surgery-related UVFP; it can restore the physiological laryngeal phonatory function to the normal or a nearly normal voice quality
Large expert-curated database for benchmarking document similarity detection in biomedical literature search
Document recommendation systems for locating relevant literature have mostly relied on methods developed a decade ago. This is largely due to the lack of a large offline gold-standard benchmark of relevant documents that cover a variety of research fields such that newly developed literature search techniques can be compared, improved and translated into practice. To overcome this bottleneck, we have established the RElevant LIterature SearcH consortium consisting of more than 1500 scientists from 84 countries, who have collectively annotated the relevance of over 180 000 PubMed-listed articles with regard to their respective seed (input) article/s. The majority of annotations were contributed by highly experienced, original authors of the seed articles. The collected data cover 76% of all unique PubMed Medical Subject Headings descriptors. No systematic biases were observed across different experience levels, research fields or time spent on annotations. More importantly, annotations of the same document pairs contributed by different scientists were highly concordant. We further show that the three representative baseline methods used to generate recommended articles for evaluation (Okapi Best Matching 25, Term Frequency-Inverse Document Frequency and PubMed Related Articles) had similar overall performances. Additionally, we found that these methods each tend to produce distinct collections of recommended articles, suggesting that a hybrid method may be required to completely capture all relevant articles. The established database server located at https://relishdb.ict.griffith.edu.au is freely available for the downloading of annotation data and the blind testing of new methods. We expect that this benchmark will be useful for stimulating the development of new powerful techniques for title and title/abstract-based search engines for relevant articles in biomedical research.Peer reviewe
Generation of Uniform Hazard Spectrum Based on the Stochastic Method of Simulating Ground Motion and Its Use in Nuclear Power Plants
To obtain an accurate uniform hazard spectrum (UHS), this paper proposes combining a stochastic simulation with probabilistic seismic hazard analysis. The stochastic method fully accounts for the effect of the source mechanism, path, and site effect. Historical ground motions in the site specific to the nuclear power plant (NPP) are simulated, and a UHS with an equal exceeding probability is proposed. To compare the seismic performance of the NPP under different ground motions generated by the existing site spectrum (SL-2), the UHS generated by the safety evaluation report, and the US RG1.60 spectrum, respectively, a three-dimensional finite element model is established, and dynamic analysis is performed. Results show that the structural responses to different spectra varied; the UHS response was slightly larger than that of RG1.60. This finding is relatively more reasonable than prior research results. The UHS generated using the stochastic simulation method can provide a reference for the seismic design of NPPs
- …