179 research outputs found
WarpedGANSpace: Finding non-linear RBF paths in GAN latent space
This work addresses the problem of discovering, in an unsupervised manner, interpretable paths in the latent space of pretrained GANs, so as to provide an intuitive and easy way of controlling the underlying generative factors. In doing so, it addresses some of the limitations of the state-of-the-art works, namely, a) that they discover directions that are independent of the latent code, i.e., paths that are linear, and b) that their evaluation relies either on visual inspection or on laborious human labeling. More specifically, we propose to learn non-linear warpings on the latent space, each one parametrized by a set of RBF-based latent space warping functions, and where each warping gives rise to a family of non-linear paths via the gradient of the function. Building on the work of Voynov and Babenko that discovers linear paths, we optimize the trainable parameters of the set of RBFs, so as that images that are generated by codes along different paths, are easily distinguishable by a discriminator network. This leads to easily distinguishable image transformations, such as pose and facial expressions in facial images. We show that linear paths can be derived as a special case of our method, and show experimentally that non-linear paths in the latent space lead to steeper, more disentangled and interpretable changes in the image space than in state-of-the art methods, both qualitatively and quantitatively. We make the code and the pretrained models publicly available at: https://github.com/chi0tzp/WarpedGANSpace
WarpedGANSpace: Finding non-linear RBF paths in GAN latent space
This work addresses the problem of discovering, in an unsupervised manner, interpretable paths in the latent space of pretrained GANs, so as to provide an intuitive and easy way of controlling the underlying generative factors. In doing so, it addresses some of the limitations of the state-of-the-art works, namely, a) that they discover directions that are independent of the latent code, i.e., paths that are linear, and b) that their evaluation relies either on visual inspection or on laborious human labeling. More specifically, we propose to learn non-linear warpings on the latent space, each one parametrized by a set of RBF-based latent space warping functions, and where each warping gives rise to a family of non-linear paths via the gradient of the function. Building on the work of [34], that discovers linear paths, we optimize the trainable parameters of the set of RBFs, so as that images that are generated by codes along different paths, are easily distinguishable by a discriminator network. This leads to easily distinguishable image transformations, such as pose and facial expressions in facial images. We show that linear paths can be derived as a special case of our method, and show experimentally that non-linear paths in the latent space lead to steeper, more disentangled and interpretable changes in the image space than in state-of-the art methods, both qualitatively and quantitatively. We make the code and the pretrained models publicly available at: https://github.com/chi0tzp/WarpedGANSpace
HyperReenact: one-shot reenactment via jointly learning to refine and retarget faces
In this paper, we present our method for neural face
reenactment, called HyperReenact, that aims to generate
realistic talking head images of a source identity, driven
by a target facial pose. Existing state-of-the-art face reenactment methods train controllable generative models that
learn to synthesize realistic facial images, yet producing
reenacted faces that are prone to significant visual artifacts,
especially under the challenging condition of extreme head
pose changes, or requiring expensive few-shot fine-tuning
to better preserve the source identity characteristics. We
propose to address these limitations by leveraging the photorealistic generation ability and the disentangled properties of a pretrained StyleGAN2 generator, by first inverting
the real images into its latent space and then using a hypernetwork to perform: (i) refinement of the source identity characteristics and (ii) facial pose re-targeting, eliminating this way the dependence on external editing methods that typically produce artifacts. Our method operates under the one-shot setting (i.e., using a single source
frame) and allows for cross-subject reenactment, without
requiring any subject-specific fine-tuning. We compare
our method both quantitatively and qualitatively against
several state-of-the-art techniques on the standard benchmarks of VoxCeleb1 and VoxCeleb2, demonstrating the superiority of our approach in producing artifact-free images, exhibiting remarkable robustness even under extreme
head pose changes. We make the code and the pretrained
models publicly available at: https://github.com/
StelaBou/HyperReenact
A new fault-ride-through strategy for MTDC networks incorporating wind farms and modular multi-level converters
This paper presents a DC voltage control strategy for enhancing the fault-ride-through (FRT) capability of wind farms comprising of fully rated converter permanent magnet synchronous generators (FRC-PMSGs) connected to multi-terminal high voltage direct current (MT-HVDC) grids through modular multi-level converters (MMCs). The proposed FRT strategy is implemented on a master controller located in the offshore AC substation of each wind farm. The underlying issue addressed via the scheme relates to overvoltages in the HVDC links when the power transfer is disrupted due to faults occurring in the AC onshore grid. The corresponding Matlab/Simulinkr model has been validated using transient simulation, while the practical feasibility of the controller is demonstrated utilising Opal-RT© real-time hardware platform
Design of a Hybrid AC/DC Microgrid Using HOMER Pro: Case Study on an Islanded Residential Application
This paper is concerned with the design of an autonomous hybrid alternating current/direct current (AC/DC) microgrid for a community system, located on an island without the possibility of grid connection. It is comprised of photovoltaic (PV) arrays and a diesel generator, AC loads, and battery energy storage devices for ensuring uninterruptible power supply during prolonged periods of low sunshine. A multi-objective, non-derivative optimisation is considered in this residential application; the primary objective is the system cost minimisation, while it is also required that no load shedding is allowed. Additionally, the CO2 emissions are calculated to demonstrate the environmental benefit the proposed system offers. The commercial software, HOMER Pro, is utilised to identify the least-cost design among hundreds of options and simultaneously satisfy the secondary objective. A sensitivity analysis is also performed to evaluate design robustness against the uncertainty pertaining to fuel prices and PV generation. Finally, an assessment of the capabilities of the utilised optimisation platform is conducted, and a theoretical discussion sheds some light on the proposal for an enhanced design tool addressing the identified issues
Finding Semantically Related Videos in Closed Collections
Modern newsroom tools offer advanced functionality for automatic and semi-automatic content collection from the web and social media sources to accompany news stories. However, the content collected in this way often tends to be unstructured and may include irrelevant items. An important step in the verification process is to organize this content, both with respect to what it shows, and with respect to its origin. This chapter presents our efforts in this direction, which resulted in two components. One aims to detect semantic concepts in video shots, to help annotation and organization of content collections. We implement a system based on deep learning, featuring a number of advances and adaptations of existing algorithms to increase performance for the task. The other component aims to detect logos in videos in order to identify their provenance. We present our progress from a keypoint-based detection system to a system based on deep learning
Advanced fault location in MTDC networks utilising optically-multiplexed current measurements and machine learning approach
This paper presents a method for accurate fault localisation of DC-side faults in Voltage Source Converter (VSC) based Multi-Terminal Direct Current (MTDC) networks utilising optically-multiplexed DC current measurements sampled at 5 kHz, off-line continuous wavelet transform and machine learning approach. The technical feasibility of optically-based DC current measurements is evaluated through laboratory experiments using commercially available equipment. Simulation-based analysis through Matlab/Simulink® has been adopted to test the proposed fault location algorithm under different fault types and locations along a DC grid. Results revealed that the proposed fault location scheme can accurately calculate the location of a fault and successfully identify its type. The scheme has been also found to be effective for highly resistive fault with resistances of up to 500 Ω. Further sensitivity analysis revealed that the proposed scheme is relatively robust to additive noise and synchronisation errors
Recommended from our members
DISYRE: Diffusion-Inspired SYnthetic REstoration for Unsupervised Anomaly Detection
Unsupervised Anomaly Detection (UAD) techniques aim to identify and localize anomalies without relying on annotations, only leveraging a model trained on a dataset known to be free of anomalies. Diffusion models learn to modify inputs x to increase the probability of it belonging to a desired distribution, i.e., they model the score function ∇x log p(x). Such a score function is potentially relevant for UAD, since ∇x log p(x) is itself a pixel-wise anomaly score. However, diffusion models are trained to invert a corruption process based on Gaussian noise and the learned score function is unlikely to generalize to medical anomalies. This work addresses the problem of how to learn a score function relevant for UAD and proposes DISYRE: Diffusion-Inspired SYnthetic REstoration. We retain the diffusion-like pipeline but replace the Gaussian noise corruption with a gradual, synthetic anomaly corruption so the learned score function generalizes to medical, naturally occurring anomalies. We evaluate DISYRE on three common Brain MRI UAD benchmarks and substantially outperform other methods in two out of the three tasks
Small-molecule inhibition of METTL3 as a strategy against myeloid leukaemia.
N6-methyladenosine (m6A) is an abundant internal RNA modification1,2 that is catalysed predominantly by the METTL3-METTL14 methyltransferase complex3,4. The m6A methyltransferase METTL3 has been linked to the initiation and maintenance of acute myeloid leukaemia (AML), but the potential of therapeutic applications targeting this enzyme remains unknown5-7. Here we present the identification and characterization of STM2457, a highly potent and selective first-in-class catalytic inhibitor of METTL3, and a crystal structure of STM2457 in complex with METTL3-METTL14. Treatment of tumours with STM2457 leads to reduced AML growth and an increase in differentiation and apoptosis. These cellular effects are accompanied by selective reduction of m6A levels on known leukaemogenic mRNAs and a decrease in their expression consistent with a translational defect. We demonstrate that pharmacological inhibition of METTL3 in vivo leads to impaired engraftment and prolonged survival in various mouse models of AML, specifically targeting key stem cell subpopulations of AML. Collectively, these results reveal the inhibition of METTL3 as a potential therapeutic strategy against AML, and provide proof of concept that the targeting of RNA-modifying enzymes represents a promising avenue for anticancer therapy
Small-molecule inhibition of METTL3 as a strategy against myeloid leukaemia.
N6-methyladenosine (m6A) is an abundant internal RNA modification1,2 that is catalysed predominantly by the METTL3-METTL14 methyltransferase complex3,4. The m6A methyltransferase METTL3 has been linked to the initiation and maintenance of acute myeloid leukaemia (AML), but the potential of therapeutic applications targeting this enzyme remains unknown5-7. Here we present the identification and characterization of STM2457, a highly potent and selective first-in-class catalytic inhibitor of METTL3, and a crystal structure of STM2457 in complex with METTL3-METTL14. Treatment of tumours with STM2457 leads to reduced AML growth and an increase in differentiation and apoptosis. These cellular effects are accompanied by selective reduction of m6A levels on known leukaemogenic mRNAs and a decrease in their expression consistent with a translational defect. We demonstrate that pharmacological inhibition of METTL3 in vivo leads to impaired engraftment and prolonged survival in various mouse models of AML, specifically targeting key stem cell subpopulations of AML. Collectively, these results reveal the inhibition of METTL3 as a potential therapeutic strategy against AML, and provide proof of concept that the targeting of RNA-modifying enzymes represents a promising avenue for anticancer therapy
- …