75 research outputs found
ORB5: a global electromagnetic gyrokinetic code using the PIC approach in toroidal geometry
This paper presents the current state of the global gyrokinetic code ORB5 as
an update of the previous reference [Jolliet et al., Comp. Phys. Commun. 177
409 (2007)]. The ORB5 code solves the electromagnetic Vlasov-Maxwell system of
equations using a PIC scheme and also includes collisions and strong flows. The
code assumes multiple gyrokinetic ion species at all wavelengths for the
polarization density and drift-kinetic electrons. Variants of the physical
model can be selected for electrons such as assuming an adiabatic response or a
``hybrid'' model in which passing electrons are assumed adiabatic and trapped
electrons are drift-kinetic. A Fourier filter as well as various control
variates and noise reduction techniques enable simulations with good
signal-to-noise ratios at a limited numerical cost. They are completed with
different momentum and zonal flow-conserving heat sources allowing for
temperature-gradient and flux-driven simulations. The code, which runs on both
CPUs and GPUs, is well benchmarked against other similar codes and analytical
predictions, and shows good scalability up to thousands of nodes
GPAW: open Python package for electronic-structure calculations
We review the GPAW open-source Python package for electronic structure
calculations. GPAW is based on the projector-augmented wave method and can
solve the self-consistent density functional theory (DFT) equations using three
different wave-function representations, namely real-space grids, plane waves,
and numerical atomic orbitals. The three representations are complementary and
mutually independent and can be connected by transformations via the real-space
grid. This multi-basis feature renders GPAW highly versatile and unique among
similar codes. By virtue of its modular structure, the GPAW code constitutes an
ideal platform for implementation of new features and methodologies. Moreover,
it is well integrated with the Atomic Simulation Environment (ASE) providing a
flexible and dynamic user interface. In addition to ground-state DFT
calculations, GPAW supports many-body GW band structures, optical excitations
from the Bethe-Salpeter Equation (BSE), variational calculations of excited
states in molecules and solids via direct optimization, and real-time
propagation of the Kohn-Sham equations within time-dependent DFT. A range of
more advanced methods to describe magnetic excitations and non-collinear
magnetism in solids are also now available. In addition, GPAW can calculate
non-linear optical tensors of solids, charged crystal point defects, and much
more. Recently, support of GPU acceleration has been achieved with minor
modifications of the GPAW code thanks to the CuPy library. We end the review
with an outlook describing some future plans for GPAW
Balanced Local Data Assimilation with a Blended Numerical Model for Geophysical Flows
Local sequential Bayesian data assimilation introduces physical imbalances that pose a challenge for geophysical flows with implications for robust numerical weather prediction. The presence of fast-mode imbalances of the order of the relevant slower dynamics deteriorates solution quality. To negate this effect, dynamics-driven methods that suppress imbalances arising from data assimilation are introduced in this thesis. Specifically, a blended numerical model for the rotating compressible fluid flow equations under gravity is employed and equipped with access to soundproof and hydrostatic dynamics. The blended numerical model is formally extended to support seamless transition between the shallow water equations and lake equations. Through careful numerical and asymptotic analysis, one-step blending strategies that enable seamless switching between model regimes within a simulation run are developed. Upon assimilation of data, the model configuration is switched for one time step to the limit regime. After that, the model configuration is switched back to the compressible or shallow water regime for the duration of the assimilation window. This switching between the model regimes is repeated for each subsequent assimilation window to eliminate the imbalances arising from the assimilation of data. Idealised experiments involving the travelling vortex, buoyancy-driven rising thermals, and inertia gravity waves demonstrate that the blending strategies successfully eliminate unphysical imbalances, yielding up to two orders-of-magnitude improvements in the error scores. This novel dynamics-conforming method of achieving balanced data assimilation can be extended to eliminate other forms of imbalances, and it has the potential to reduce data assimilation-generated spurious signals in numerical weather prediction simulations.Die lokale sequentielle Bayes'sche Datenassimilation verursacht unphysikalische Imbalancen. Dies stellt eine Herausforderung für die Modellierung geophysikalischer Strömungen dar und hat Auswirkungen auf robuste numerische Wettervorhersagen. Das Auftreten sich schnell ausbreitender Imbalancen von der Größenordnung der betreffenden langsameren Dynamik vermindert die Lösungsqualität. Diese Arbeit führt dynamikgesteuerte Methoden ein, die Imbalancen aus der Datenassimilation unterdrücken und so dem oben beschriebenen Effekt entgegenwirken. Konkret wird für die kompressiblen Strömungsgleichungen mit Schwer- und Corioliskraft ein numerisches Modell in gemischter Form verwendet, welches auf schalldichte und hydrostatische Dynamik zurückgreifen kann. Dieses Modell wird dann formal erweitert, um einen nahtlosen Übergang zwischen den Flachwassergleichungen und deren inkompressiblem Analogon, den sogenannten "lake-equations'', zu ermöglichen. Durch sorgfältige numerische und asymptotische Analysen werden ferner einstufige Blending-Strategien entwickelt, die einen nahtlosen Wechsel zwischen den Modellregimen während des Simulationslaufs ermöglichen. Nach der Assimilation der Daten wird die Modellkonfiguration für einen Zeitschritt in das Grenzregime überführt. Anschließend wird die Modellkonfiguration für die Dauer des Assimilationsfensters wieder auf das Flachwasser- oder das kompressible Modell eingestellt. Dieser Wechsel zwischen den Modellregimen wird für jedes weitere Assimilationsfenster wiederholt, um die durch die Assimilation der Daten entstehenden Imbalancen zu beseitigen. Idealisierte Experimente mit dem wandernden Wirbel, der auftriebsgetriebenen aufsteigenden atmosphärischen Blase und den Trägheitsschwerewellen zeigen, dass die Blending-Strategien unphysikalische Imbalancen erfolgreich beseitigen und Abweichungen um bis zu zwei Größenordnungen verkleinern. Diese neuartige dynamikkonforme Methode zur Erlangung einer balancierten Datenassimilation kann erweitert werden, um andere Formen von Imbalancen zu eliminieren. Sie hat das Potenzial, durch Datenassimilation erzeugte Störsignale in numerischen Wettervorhersagesimulationen zu reduzieren
Advances in Computer Science and Engineering
The book Advances in Computer Science and Engineering constitutes the revised selection of 23 chapters written by scientists and researchers from all over the world. The chapters cover topics in the scientific fields of Applied Computing Techniques, Innovations in Mechanical Engineering, Electrical Engineering and Applications and Advances in Applied Modeling
Monte Carlo simulations of two-dimensional electron gasses in gallium nitride high electron mobility transistors via general-purpose computing on graphics processing units
The work in this thesis covers two main topics: successfully porting an Ensemble Monte Carlo (EMC) focused on bulk III-V semiconductors on to the graphics processing unit (GPU) and investigating carrier transport in a two-dimensional electron gas (2DEG) created at an Aluminium Gallium Nitride (AlGaN) and Gallium Nit ride (GaN) heterojunction, specifically the effect of introducing non-equilibrium phonons.The programming language used to be able to run on the GPU, NVIDIA CUDA, is introduced. The concept of highly parallel programming is explored, along with the challenges this poses to an EMC simulating semiconductor materials and devices. The changes made to the bulk EMC algorithm are explained, including architectural, memory strategies and execution optimisations. The performance increase related to each change is given, and it is found that the GPU algorithm has a run time that is approximately 30% of the original EMC algorithm. This is the first example of an EMC simulating electron transport in semiconductors on a GPU.A two-dimensional EMC is created to simulate the behaviour of electrons confined in the 2DEG created at an AlGaN/GaN heterojunction. Results are presented for the electron velocity, momentum and energy relaxation times and mobility, which are compared to experimental results from AlGaN/GaN High Electron Mobility Transistors (HEMTs), and agreement is good. No velocity overshoot is observed, in agreement with experiments.Finally, non-equilibrium phonons are introduced to the 2DEG simulation to study their effect on the electron transport. Non-equilibrium phonons are found to reduce the electron velocity due to diffusive heating. However, due to the confinement of electrons, the phonon distribution is only increased in a small volume of reciprocal space and the effects are shown to be weaker than in bulk. The consideration of electron confinement and a non-equilibrium phonon population has not been seen in the current literature
Efficient, end-to-end and self-supervised methods for speech processing and generation
Deep learning has affected the speech processing and generation fields in many directions. First, end-to-end architectures allow the direct injection and synthesis of waveform samples. Secondly, the exploration of efficient solutions allow to implement these systems in computationally restricted environments, like smartphones. Finally, the latest trends exploit audio-visual data with least supervision. In this thesis these three directions are explored.
Firstly, we propose the use of recent pseudo-recurrent structures, like self-attention models and quasi-recurrent networks, to build acoustic models for text-to-speech. The proposed system, QLAD, turns out to synthesize faster on CPU and GPU than its recurrent counterpart whilst preserving the good synthesis quality level, which is competitive with state of the art vocoder-based models.
Then, a generative adversarial network is proposed for speech enhancement, named SEGAN. This model works as a speech-to-speech conversion system in time-domain, where a single inference operation is needed for all samples to operate through a fully convolutional structure. This implies an increment in modeling efficiency with respect to other existing models, which are auto-regressive and also work in time-domain. SEGAN achieves prominent results in noise supression and preservation of speech naturalness and intelligibility when compared to the other classic and deep regression based systems. We also show that SEGAN is efficient in transferring its operations to new languages and noises. A SEGAN trained for English performs similarly to this language on Catalan and Korean with only 24 seconds of adaptation data. Finally, we unveil the generative capacity of the model to recover signals from several distortions. We hence propose the concept of generalized speech enhancement. First, the model proofs to be effective to recover voiced speech from whispered one. Then the model is scaled up to solve other distortions that require a recomposition of damaged parts of the signal, like extending the bandwidth or recovering lost temporal sections, among others. The model improves by including additional acoustic losses in a multi-task setup to impose a relevant perceptual weighting on the generated result. Moreover, a two-step training schedule is also proposed to stabilize the adversarial training after the addition of such losses, and both components boost SEGAN's performance across distortions.Finally, we propose a problem-agnostic speech encoder, named PASE, together with the framework to train it. PASE is a fully convolutional network that yields compact representations from speech waveforms. These representations contain abstract information like the speaker identity, the prosodic features or the spoken contents. A self-supervised framework is also proposed to train this encoder, which suposes a new step towards unsupervised learning for speech processing. Once the encoder is trained, it can be exported to solve different tasks that require speech as input. We first explore the performance of PASE codes to solve speaker recognition, emotion recognition and speech recognition. PASE works competitively well compared to well-designed classic features in these tasks, specially after some supervised adaptation. Finally, PASE also provides good descriptors of identity for multi-speaker modeling in text-to-speech, which is advantageous to model novel identities without retraining the model.L'aprenentatge profund ha afectat els camps de processament i generació de la parla en và ries direccions. Primer, les arquitectures fi-a-fi permeten la injecció i sÃntesi de mostres temporals directament. D'altra banda, amb l'exploració de solucions eficients permet l'aplicació d'aquests sistemes en entorns de computació restringida, com els telèfons intel·ligents. Finalment, les darreres tendències exploren les dades d'à udio i veu per derivar-ne representacions amb la mÃnima supervisió. En aquesta tesi precisament s'exploren aquestes tres direccions. Primer de tot, es proposa l'ús d'estructures pseudo-recurrents recents, com els models d’auto atenció i les xarxes quasi-recurrents, per a construir models acústics text-a-veu. AixÃ, el sistema QLAD proposat en aquest treball sintetitza més rà pid en CPU i GPU que el seu homòleg recurrent, preservant el mateix nivell de qualitat de sÃntesi, competitiu amb l'estat de l'art en models basats en vocoder. A continuació es proposa un model de xarxa adversà ria generativa per a millora de veu, anomenat SEGAN. Aquest model fa conversions de veu-a-veu en temps amb una sola operació d'inferència sobre una estructura purament convolucional. Això implica un increment en l'eficiència respecte altres models existents auto regressius i que també treballen en el domini temporal. La SEGAN aconsegueix resultats prominents d'extracció de soroll i preservació de la naturalitat i la intel·ligibilitat de la veu comparat amb altres sistemes clà ssics i models regressius basats en xarxes neuronals profundes en espectre. També es demostra que la SEGAN és eficient transferint les seves operacions a nous llenguatges i sorolls. AixÃ, un model SEGAN entrenat en Anglès aconsegueix un rendiment comparable a aquesta llengua quan el transferim al català o al coreà amb només 24 segons de dades d'adaptació. Finalment, explorem l'ús de tota la capacitat generativa del model i l’apliquem a recuperació de senyals de veu malmeses per và ries distorsions severes. Això ho anomenem millora de la parla generalitzada. Primer, el model demostra ser efectiu per a la tasca de recuperació de senyal sonoritzat a partir de senyal xiuxiuejat. Posteriorment, el model escala a poder resoldre altres distorsions que requereixen una reconstrucció de parts del senyal que s’han malmès, com extensió d’ample de banda i recuperació de seccions temporals perdudes, entre d’altres. En aquesta última aplicació del model, el fet d’incloure funcions de pèrdua acústicament rellevants incrementa la naturalitat del resultat final, en una estructura multi-tasca que prediu caracterÃstiques acústiques a la sortida de la xarxa discriminadora de la nostra GAN. També es proposa fer un entrenament en dues etapes del sistema SEGAN, el qual mostra un increment significatiu de l’equilibri en la sinèrgia adversà ria i la qualitat generada finalment després d’afegir les funcions acústiques. Finalment, proposem un codificador de veu agnòstic al problema, anomenat PASE, juntament amb el conjunt d’eines per entrenar-lo. El PASE és un sistema purament convolucional que crea representacions compactes de trames de veu. Aquestes representacions contenen informació abstracta com identitat del parlant, les caracterÃstiques prosòdiques i els continguts lingüÃstics. També es proposa un entorn auto-supervisat multi-tasca per tal d’entrenar aquest sistema, el qual suposa un avenç en el terreny de l’aprenentatge no supervisat en l’à mbit del processament de la parla. Una vegada el codificador esta entrenat, es pot exportar per a solventar diferents tasques que requereixin tenir senyals de veu a l’entrada. Primer explorem el rendiment d’aquest codificador per a solventar tasques de reconeixement del parlant, de l’emoció i de la parla, mostrant-se efectiu especialment si s’ajusta la representació de manera supervisada amb un conjunt de dades d’adaptació.Postprint (published version
Efficient, end-to-end and self-supervised methods for speech processing and generation
Deep learning has affected the speech processing and generation fields in many directions. First, end-to-end architectures allow the direct injection and synthesis of waveform samples. Secondly, the exploration of efficient solutions allow to implement these systems in computationally restricted environments, like smartphones. Finally, the latest trends exploit audio-visual data with least supervision. In this thesis these three directions are explored.
Firstly, we propose the use of recent pseudo-recurrent structures, like self-attention models and quasi-recurrent networks, to build acoustic models for text-to-speech. The proposed system, QLAD, turns out to synthesize faster on CPU and GPU than its recurrent counterpart whilst preserving the good synthesis quality level, which is competitive with state of the art vocoder-based models.
Then, a generative adversarial network is proposed for speech enhancement, named SEGAN. This model works as a speech-to-speech conversion system in time-domain, where a single inference operation is needed for all samples to operate through a fully convolutional structure. This implies an increment in modeling efficiency with respect to other existing models, which are auto-regressive and also work in time-domain. SEGAN achieves prominent results in noise supression and preservation of speech naturalness and intelligibility when compared to the other classic and deep regression based systems. We also show that SEGAN is efficient in transferring its operations to new languages and noises. A SEGAN trained for English performs similarly to this language on Catalan and Korean with only 24 seconds of adaptation data. Finally, we unveil the generative capacity of the model to recover signals from several distortions. We hence propose the concept of generalized speech enhancement. First, the model proofs to be effective to recover voiced speech from whispered one. Then the model is scaled up to solve other distortions that require a recomposition of damaged parts of the signal, like extending the bandwidth or recovering lost temporal sections, among others. The model improves by including additional acoustic losses in a multi-task setup to impose a relevant perceptual weighting on the generated result. Moreover, a two-step training schedule is also proposed to stabilize the adversarial training after the addition of such losses, and both components boost SEGAN's performance across distortions.Finally, we propose a problem-agnostic speech encoder, named PASE, together with the framework to train it. PASE is a fully convolutional network that yields compact representations from speech waveforms. These representations contain abstract information like the speaker identity, the prosodic features or the spoken contents. A self-supervised framework is also proposed to train this encoder, which suposes a new step towards unsupervised learning for speech processing. Once the encoder is trained, it can be exported to solve different tasks that require speech as input. We first explore the performance of PASE codes to solve speaker recognition, emotion recognition and speech recognition. PASE works competitively well compared to well-designed classic features in these tasks, specially after some supervised adaptation. Finally, PASE also provides good descriptors of identity for multi-speaker modeling in text-to-speech, which is advantageous to model novel identities without retraining the model.L'aprenentatge profund ha afectat els camps de processament i generació de la parla en và ries direccions. Primer, les arquitectures fi-a-fi permeten la injecció i sÃntesi de mostres temporals directament. D'altra banda, amb l'exploració de solucions eficients permet l'aplicació d'aquests sistemes en entorns de computació restringida, com els telèfons intel·ligents. Finalment, les darreres tendències exploren les dades d'à udio i veu per derivar-ne representacions amb la mÃnima supervisió. En aquesta tesi precisament s'exploren aquestes tres direccions. Primer de tot, es proposa l'ús d'estructures pseudo-recurrents recents, com els models d’auto atenció i les xarxes quasi-recurrents, per a construir models acústics text-a-veu. AixÃ, el sistema QLAD proposat en aquest treball sintetitza més rà pid en CPU i GPU que el seu homòleg recurrent, preservant el mateix nivell de qualitat de sÃntesi, competitiu amb l'estat de l'art en models basats en vocoder. A continuació es proposa un model de xarxa adversà ria generativa per a millora de veu, anomenat SEGAN. Aquest model fa conversions de veu-a-veu en temps amb una sola operació d'inferència sobre una estructura purament convolucional. Això implica un increment en l'eficiència respecte altres models existents auto regressius i que també treballen en el domini temporal. La SEGAN aconsegueix resultats prominents d'extracció de soroll i preservació de la naturalitat i la intel·ligibilitat de la veu comparat amb altres sistemes clà ssics i models regressius basats en xarxes neuronals profundes en espectre. També es demostra que la SEGAN és eficient transferint les seves operacions a nous llenguatges i sorolls. AixÃ, un model SEGAN entrenat en Anglès aconsegueix un rendiment comparable a aquesta llengua quan el transferim al català o al coreà amb només 24 segons de dades d'adaptació. Finalment, explorem l'ús de tota la capacitat generativa del model i l’apliquem a recuperació de senyals de veu malmeses per và ries distorsions severes. Això ho anomenem millora de la parla generalitzada. Primer, el model demostra ser efectiu per a la tasca de recuperació de senyal sonoritzat a partir de senyal xiuxiuejat. Posteriorment, el model escala a poder resoldre altres distorsions que requereixen una reconstrucció de parts del senyal que s’han malmès, com extensió d’ample de banda i recuperació de seccions temporals perdudes, entre d’altres. En aquesta última aplicació del model, el fet d’incloure funcions de pèrdua acústicament rellevants incrementa la naturalitat del resultat final, en una estructura multi-tasca que prediu caracterÃstiques acústiques a la sortida de la xarxa discriminadora de la nostra GAN. També es proposa fer un entrenament en dues etapes del sistema SEGAN, el qual mostra un increment significatiu de l’equilibri en la sinèrgia adversà ria i la qualitat generada finalment després d’afegir les funcions acústiques. Finalment, proposem un codificador de veu agnòstic al problema, anomenat PASE, juntament amb el conjunt d’eines per entrenar-lo. El PASE és un sistema purament convolucional que crea representacions compactes de trames de veu. Aquestes representacions contenen informació abstracta com identitat del parlant, les caracterÃstiques prosòdiques i els continguts lingüÃstics. També es proposa un entorn auto-supervisat multi-tasca per tal d’entrenar aquest sistema, el qual suposa un avenç en el terreny de l’aprenentatge no supervisat en l’à mbit del processament de la parla. Una vegada el codificador esta entrenat, es pot exportar per a solventar diferents tasques que requereixin tenir senyals de veu a l’entrada. Primer explorem el rendiment d’aquest codificador per a solventar tasques de reconeixement del parlant, de l’emoció i de la parla, mostrant-se efectiu especialment si s’ajusta la representació de manera supervisada amb un conjunt de dades d’adaptació
On coupling resolved and unresolved physical processes in finite element discretisations of geophysical fluids
At the heart of modern numerical weather forecasting and climate modelling lie simulations of two geophysical fluids: the atmosphere and the ocean. These endeavours rely on numerically solving the equations that describe these fluids. A key challenge is that the fluids contain motions spanning a range of scales. As the small-scale processes (unresolved by the numerical model) affect the resolved motions, they need to be described in the model, which is known as parametrisation. One major class of methods for numerically solving such partial differential equations is the finite element method. This thesis focuses on the coupling of such parametrised processes to the resolved flow within finite element discretisations. Four sets of research are presented, falling under two main categories.
The first is the development of a compatible finite element discretisation for use in numerical weather prediction models, so as to avoid the bottleneck in computational scalability associated with the convergence at the poles of latitude-longitude grids. We present a transport scheme for use with the lowest-order function spaces in such a compatible finite element method, which is motivated by the coupling of the resolved and unresolved processes within the model. This then facilitates the use of the lower-order spaces within Gusto, a toolkit for studying such compatible finite element discretisations. Then, we present a compatible finite element discretisation of the moist compressible Euler equations, parametrising the unresolved moist processes. This is a major step in the development of Gusto, extending it to describe its first unresolved processes.
The second category with which this thesis is concerned is the stochastic variational framework presented by Holm [Variational principles for stochastic fluid dynamics, P. Roy. Soc. A-Math. Phy. 471 (2176), (2015)]. In this framework, the effect of the unresolved processes and their uncertainty is expressed through a stochastic component to the advecting velocity. This framework ensures the circulation theorem is preserved by the stochastic equations. We consider the application of this formulation to two simple geophysical fluid models. First, we discuss the statistical properties of an enstrophy-preserving finite element discretisation of the stochastic quasi-geostrophic equation. We find that the choice of discretisation and the properties that it preserves affects the statistics of the solution. The final research presented is a finite element discretisation of the stochastic Camassa-Holm equation, which is used to numerically investigate the formation of ‘peakons’ within this set-up, finding that they do still always form despite the noise’s presence.Open Acces
- …