28 research outputs found
Collaborative Sampling in Generative Adversarial Networks
The standard practice in Generative Adversarial Networks (GANs) discards the
discriminator during sampling. However, this sampling method loses valuable
information learned by the discriminator regarding the data distribution. In
this work, we propose a collaborative sampling scheme between the generator and
the discriminator for improved data generation. Guided by the discriminator,
our approach refines the generated samples through gradient-based updates at a
particular layer of the generator, shifting the generator distribution closer
to the real data distribution. Additionally, we present a practical
discriminator shaping method that can smoothen the loss landscape provided by
the discriminator for effective sample refinement. Through extensive
experiments on synthetic and image datasets, we demonstrate that our proposed
method can improve generated samples both quantitatively and qualitatively,
offering a new degree of freedom in GAN sampling.Comment: Accepted to AAAI 202
Motion Style Transfer: Modular Low-Rank Adaptation for Deep Motion Forecasting
Deep motion forecasting models have achieved great success when trained on a
massive amount of data. Yet, they often perform poorly when training data is
limited. To address this challenge, we propose a transfer learning approach for
efficiently adapting pre-trained forecasting models to new domains, such as
unseen agent types and scene contexts. Unlike the conventional fine-tuning
approach that updates the whole encoder, our main idea is to reduce the amount
of tunable parameters that can precisely account for the target domain-specific
motion style. To this end, we introduce two components that exploit our prior
knowledge of motion style shifts: (i) a low-rank motion style adapter that
projects and adjusts the style features at a low-dimensional bottleneck; and
(ii) a modular adapter strategy that disentangles the features of scene context
and motion history to facilitate a fine-grained choice of adaptation layers.
Through extensive experimentation, we show that our proposed adapter design,
coined MoSA, outperforms prior methods on several forecasting benchmarks.Comment: CoRL 202
Collaborative Sampling in Generative Adversarial Networks
The standard practice in Generative Adversarial Networks (GANs) discards the discriminator during sampling. However, this sampling method loses valuable information learned by the discriminator regarding the data distribution. In this work, we propose a collaborative sampling scheme between the generator and the discriminator for improved data generation. Guided by the discriminator, our approach refines the generated samples through gradient-based updates at a particular layer of the generator, shifting the generator distribution closer to the real data distribution. Additionally, we present a practical discriminator shaping method that can smoothen the loss landscape provided by the discriminator for effective sample refinement. Through extensive experiments on synthetic and image datasets, we demonstrate that our proposed method can improve generated samples both quantitatively and qualitatively, offering a new degree of freedom in GAN sampling
Effect of Drain Duration and Output on Perioperative Outcomes and Readmissions after Lumbar Spine Surgery
Study design: Single-center retrospective cohort.
Purpose: To compare surgical outcomes of patients based on lumbar drain variables relating to output and duration.
Overview of literature: The use of drains following lumbar spine surgery, specifically with respect to hospital readmission, postoperative hematoma, postoperative anemia, and surgical site infections, has been controversial.
Methods: Patients aged ≥18 years who underwent lumbar fusion with a postoperative drain between 2017 and 2020 were included and grouped based on hospital readmission status, last 8-hour drain output (\u3c40 mL cutoff), or drain duration (2 days cutoff). Total output of all drains, total output of the primary drain, drain duration in days, drain output per day, last 8-hour output, penultimate 8-hour output, and last 8-hour delta (last 8-hour output subtracted by penultimate 8-hour output) were collected. Continuous and categorical data were compared between groups. Multivariate logistic regression analysis and receiver operating characteristic (ROC) analysis were performed to determine whether drain variables can predict hospital readmission, postoperative blood transfusions, and postoperative anemia. Alpha was 0.05.
Results: Our cohort consisted of 1,166 patients with 111 (9.5%) hospital readmissions. Results of regression analysis did not identify any of the drain variables as independent predictors of hospital readmission, postoperative blood transfusion, or postoperative anemia. ROC analysis demonstrated the drain variables to be poor predictors of hospital readmission, with the highest area under curve of 0.524 (drain duration), corresponding to a sensitivity of 61.3% and specificity of 49.9%.
Conclusions: Drain output or duration did not affect readmission rates following lumbar spine surgery
Reducing the environmental impact of surgery on a global scale: systematic review and co-prioritization with healthcare workers in 132 countries
Abstract
Background
Healthcare cannot achieve net-zero carbon without addressing operating theatres. The aim of this study was to prioritize feasible interventions to reduce the environmental impact of operating theatres.
Methods
This study adopted a four-phase Delphi consensus co-prioritization methodology. In phase 1, a systematic review of published interventions and global consultation of perioperative healthcare professionals were used to longlist interventions. In phase 2, iterative thematic analysis consolidated comparable interventions into a shortlist. In phase 3, the shortlist was co-prioritized based on patient and clinician views on acceptability, feasibility, and safety. In phase 4, ranked lists of interventions were presented by their relevance to high-income countries and low–middle-income countries.
Results
In phase 1, 43 interventions were identified, which had low uptake in practice according to 3042 professionals globally. In phase 2, a shortlist of 15 intervention domains was generated. In phase 3, interventions were deemed acceptable for more than 90 per cent of patients except for reducing general anaesthesia (84 per cent) and re-sterilization of ‘single-use’ consumables (86 per cent). In phase 4, the top three shortlisted interventions for high-income countries were: introducing recycling; reducing use of anaesthetic gases; and appropriate clinical waste processing. In phase 4, the top three shortlisted interventions for low–middle-income countries were: introducing reusable surgical devices; reducing use of consumables; and reducing the use of general anaesthesia.
Conclusion
This is a step toward environmentally sustainable operating environments with actionable interventions applicable to both high– and low–middle–income countries
Deep Learning Methods for Socially-Aware Human Trajectory Forecasting
The ability to forecast human motion, called ``human trajectory forecasting", is a critical requirement for mobility applications such as autonomous driving and robot navigation. Humans plan their path taking into account what might happen in the future. Similarly, the decision-making algorithm of autonomous systems should predict how their environment will evolve in the future. This thesis focuses on developing deep learning methods for forecasting human motion.
In the first part of this thesis, we tackle the fundamental challenges of social interaction modelling and multimodality. Social interactions dictate how the motion of a human is affected by others. Current deep learning methods often struggle to model these interactions between trajectory sequences. To promote interaction-awareness in forecasting models, we develop a training paradigm that explicitly focuses on samples that undergo interactions and incorporates model uncertainty. Furthermore, we build a taxonomy of existing interaction encoders and propose an optimal design that is robust to the real-world noise. In addition to modelling interactions, a good trajectory forecasting model must account for the multimodal nature of the prediction, i.e., the possibility of having multiple plausible futures given the past observations. To tackle multimodality, we present a socially-aware generative adversarial network that leverages recent advances in sequence modelling, and has the ability to model the temporal evolution of social interactions. Furthermore, we develop a collaborative sampling technique that refines the bad generated predictions at test time.
In the second part of this thesis, we focus on two challenges specific to the real-world deployment of forecasting models: interpretability and adaptability. While neural networks have the capacity to learn complex interactions, it is difficult to understand the reason behind their predictions. Thus, we develop a framework that combines the interpretability of the classical models with the predictive power of neural networks. With regards to adaptability, existing deep forecasting models suffer from inferior performance when they encounter novel scenarios. We develop a strategy to adapt a pre-trained forecasting model to a target domain using limited samples. In particular, we introduce motion style adapters that identify and adjust the target domain-specific features. Throughout this thesis, experiments on synthetic and real-world forecasting datasets validate the effectiveness of our proposed methods.VIT
Safety-Compliant Generative Adversarial Networks for Human Trajectory Forecasting
Human trajectory forecasting in crowds presents the challenges of modelling social interactions and outputting collision-free multimodal distribution. Following the success of Social Generative Adversarial Networks (SGAN), recent works propose various GAN-based designs to better model human motion in crowds. Despite superior performance in reducing distance-based metrics, current networks fail to output socially acceptable trajectories, as evidenced by high collisions in model predictions. To counter this, we introduce SGANv2: an improved safety-compliant SGAN architecture equipped with spatio-temporal interaction modelling and a transformer-based discriminator. The spatio-temporal modelling ability helps to learn the human social interactions better while the transformer-based discriminator design improves temporal sequence modelling. Additionally, SGANv2 utilizes the learned discriminator even at test-time via a collaborative sampling strategy that not only refines the colliding trajectories but also prevents mode collapse, a common phenomenon in GAN training. Through extensive experimentation on multiple real-world and synthetic datasets, we demonstrate the efficacy of SGANv2 to provide socially-compliant multimodal trajectories.VIT