Search CORE

161 research outputs found

Conformation Prediction of Synthetic Ion Channels:From structure prediction to structure classification

Author: Aguilar Sanjuan Broncio
Publication venue
Publication date: 28/11/2019
Field of study

Explore Bristol Research

Reverse-time inference of biological dynamics

Author: Lenner Nicolas
Publication venue
Publication date: 13/11/2019
Field of study

Georg-August-University Göttingen

Investigation of exciton properties in organic materials via many-body perturbation theory

Author: Wehner J.
Publication venue: Technische Universiteit Eindhoven
Publication date: 01/01/2019
Field of study

MPG.PuRe

Investigation of exciton properties in organic materials via many-body perturbation theory

Author: Wehner Jens
Publication venue: Technische Universiteit Eindhoven
Publication date: 07/05/2019
Field of study

Modeling energy transport in an organic solar cel

Pure OAI Repository

Development and Application of Machine Learning Methods to Selected Problems of Theoretical Solid State Physics

Author: Hoock Benedikt Andreas
Publication venue: Humboldt-Universität zu Berlin
Publication date: 16/08/2022
Field of study

In den letzten Jahren hat sich maschinelles Lernen als hilfreiches Werkzeug zur Vorhersage von simulierten Materialeigenschaften erwiesen. Somit können aufwendige Berechnungen mittels Dichtefunktionaltheorie umgangen werden und bereits bekannte Materialien besser verstanden oder sogar neuartige entdeckt werden. Eine zentrale Rolle spielt dabei der Deskriptor, ein möglichst interpretierbarer Satz von Materialkenngrößen. Diese Arbeit präsentiert einen Ansatz zur Auffindung von Deskriptoren für periodische Multikomponentensysteme, deren Eigenschaften durch die genaue atomare Anordnung mitbeinflusst wird. Primäre Features von Einzel-, Paar- und Tetraederclustern werden über die Superzelle gemittelt und weiter algebraisch kombiniert. Aus den so erzeugten Kandidaten wird mittels Dimensionalitätsreduktion ein geeigneter Deskriptor identifiziert. Zudem stellt diese Arbeit Strategien vor bei der Modellfindung Kreuzvalidierung einzusetzen, sodass stabilere und idealerweise besser generalisierbare Deskriptoren gefunden werden. Es werden außerdem mehrere Fehlermaße untersucht, die die Qualität der Deskriptoren bezüglich Genauigkeit, Komplexität der Formeln und Berücksichtung der atomaren Anordnung charakterisieren. Die allgemeine Methodik wurde in einer teilweise parallelisierten Python-Software implementiert. Als konkrete Problemstellungen werden Modelle für die Gitterkonstante und die Mischenergie von ternären Gruppe-IV Zinkblende-Legierungen "gelernt", mit einer Genauigkeit von 0.02 Å bzw. 0.02 eV. Datenbeschaffung, -analyse, und -bereinigung werden im Hinblick auf die Zielgrößen als auch auf die primären Features erläutert, sodass umfassende Analysen und die Parametrisierung der Methodik an diesem Testdatensatz durchgeführt werden können. Als weitere Anwendung werden Gitterkonstante und Bandlücken von binären Oktett-Verbindungen vorhergesagt. Die präsentierten Deskriptoren werden mit den Fehlermaßen evaluiert und ihre physikalische Relevanz wird abschließend disktutiert.In the last years, machine learning methods have proven as a useful tool for the prediction of simulated material properties. They may replace effortful calculations based on density functional theory, provide a better understanding of known materials or even help to discover new materials. Here, an essential role is played by the descriptor, a desirably interpretable set of material parameters. This PhD thesis presents an approach to find descriptors for periodic multi-component systems where also the exact atomic configuration influences the physical characteristics. We process primary features of one-atom, two-atom and tetrahedron clusters by an averaging scheme and combine them further by simple algebraic operations. Compressed sensing is used to identify an appropriate descriptor out from all candidate features. Furthermore, we develop elaborate cross-validation based model selection strategies that may lead to more robust and ideally better generalizing descriptors. Additionally, we study several error measures which estimate the quality of the descriptors with respect to accuracy, complexity of their formulas and the capturing of configuration effects. These generally formulated methods were implemented in a partially parallelized Python program. Actual learning tasks were studied on the problem of finding models for the lattice constant and the energy of mixing of group-IV ternary compounds in zincblende structure where an accuracy of 0.02 Å and 0.02 eV is reached, respectively. We explain the practical preparation steps of data acquisition, analysis and cleaning for the target properties and the primary features, and continue with extensive analyses and the parametrization of the developed methodology on this test case. As an additional application we predict lattice constants and band gaps of octet binary compounds. The presented descriptors are assessed quantitatively by the error measures and, finally, their physical meaning is discussed

Dokumenten-Publikationsserver der Humboldt-Universität zu Berlin

Discovering robust dependencies from data

Author: Mandros Panagiotis
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 01/01/2020
Field of study

Science revolves around forming hypotheses, designing experiments, collecting data, and tests. It was not until recently, with the advent of modern hardware and data analytics, that science shifted towards a big-data-driven paradigm that led to an unprecedented success across various fields. What is perhaps the most astounding feature of this new era, is that interesting hypotheses can now be automatically discovered from observational data. This dissertation investigates knowledge discovery procedures that do exactly this. In particular, we seek algorithms that discover the most informative models able to compactly “describe” aspects of the phenomena under investigation, in both supervised and unsupervised settings. We consider interpretable models in the form of subsets of the original variable set. We want the models to capture all possible interactions, e.g., linear, non-linear, between all types of variables, e.g., discrete, continuous, and lastly, we want their quality to be meaningfully assessed. For this, we employ information-theoretic measures, and particularly, the fraction of information for the supervised setting, and the normalized total correlation for the unsupervised. The former measures the uncertainty reduction of the target variable conditioned on a model, and the latter measures the information overlap of the variables included in a model. Without access to the true underlying data generating process, we estimate the aforementioned measures from observational data. This process is prone to statistical errors, and in our case, the errors manifest as biases towards larger models. This can lead to situations where the results are utterly random, hindering therefore further analysis. We correct this behavior with notions from statistical learning theory. In particular, we propose regularized estimators that are unbiased under the hypothesis of independence, leading to robust estimation from limited data samples and arbitrary dimensionalities. Moreover, we do this for models consisting of both discrete and continuous variables. Lastly, to discover the top scoring models, we derive effective optimization algorithms for exact, approximate, and heuristic search. These algorithms are powered by admissible, tight, and efficient-to-compute bounding functions for our proposed estimators that can be used to greatly prune the search space. Overall, the products of this dissertation can successfully assist data analysts with data exploration, discovering powerful description models, or concluding that no satisfactory models exist, implying therefore new experiments and data are required for the phenomena under investigation. This statement is supported by Materials Science researchers who corroborated our discoveries.In der Wissenschaft geht es um Hypothesenbildung, Entwerfen von Experimenten, Sammeln von Daten und Tests. Jüngst hat sich die Wissenschaft, durch das Aufkommen moderner Hardware und Datenanalyse, zu einem Big-Data-basierten Paradigma hin entwickelt, das zu einem beispiellosen Erfolg in verschiedenen Bereichen geführt hat. Ein erstaunliches Merkmal dieser neuen ra ist, dass interessante Hypothesen jetzt automatisch aus Beobachtungsdaten entdeckt werden k nnen. In dieser Dissertation werden Verfahren zur Wissensentdeckung untersucht, die genau dies tun. Insbesondere suchen wir nach Algorithmen, die Modelle identifizieren, die in der Lage sind, Aspekte der untersuchten Ph nomene sowohl in beaufsichtigten als auch in unbeaufsichtigten Szenarien kompakt zu “beschreiben”. Hierzu betrachten wir interpretierbare Modelle in Form von Untermengen der ursprünglichen Variablenmenge. Ziel ist es, dass diese Modelle alle m glichen Interaktionen erfassen (z.B. linear, nicht-lineare), zwischen allen Arten von Variablen unterscheiden (z.B. diskrete, kontinuierliche) und dass schlussendlich ihre Qualit t sinnvoll bewertet wird. Dazu setzen wir informationstheoretische Ma e ein, insbesondere den Informationsanteil für das überwachte und die normalisierte Gesamtkorrelation für das unüberwachte Szenario. Ersteres misst die Unsicherheitsreduktion der Zielvariablen, die durch ein Modell bedingt ist, und letztere misst die Informationsüberlappung der enthaltenen Variablen. Ohne Kontrolle des Datengenerierungsprozesses werden die oben genannten Ma e aus Beobachtungsdaten gesch tzt. Dies ist anf llig für statistische Fehler, die zu Verzerrungen in gr eren Modellen führen. So entstehen Situationen, wobei die Ergebnisse v llig zuf llig sind und somit weitere Analysen st ren. Wir korrigieren dieses Verhalten mit Methoden aus der statistischen Lerntheorie. Insbesondere schlagen wir regularisierte Sch tzer vor, die unter der Hypothese der Unabh ngigkeit nicht verzerrt sind und somit zu einer robusten Sch tzung aus begrenzten Datenstichproben und willkürlichen-Dimensionalit ten führen. Darüber hinaus wenden wir dies für Modelle an, die sowohl aus diskreten als auch aus kontinuierlichen Variablen bestehen. Um die besten Modelle zu entdecken, leiten wir effektive Optimierungsalgorithmen mit verschiedenen Garantien ab. Diese Algorithmen basieren auf speziellen Begrenzungsfunktionen der vorgeschlagenen Sch tzer und erlauben es den Suchraum stark einzuschr nken. Insgesamt sind die Produkte dieser Arbeit sehr effektiv für die Wissensentdeckung. Letztere Aussage wurde von Materialwissenschaftlern best tigt

Universaar

Acronym

MPG.PuRe

Spéciation guidée par l'environnement‎ : interactions sur des périodes évolutionnaires de communautés de plantes artificielles

Author: Godin-Dubois Kevin
Publication venue
Publication date: 15/07/2020
Field of study

Depuis des décades, les chercheurs en Vie Artificielle on créé une pléthore de créatures en utilisant de multiples schémas d’encodage, capacités motrices et aptitudes cognitives. Un motif récurrent, cependant, est que la focalisation est centrée sur les individus à évoluer, ne laissant que peu de place aux variations environnementales. Dans ce travail, nous argumentons que des contraintes abiotiques plus complexes pourraient diriger un processus évolutionnaire vers des régions de l’espace génétique plus robustes and diverses. Nous avons conçu un modèle morphologique complexe, basé sur les graphes orientés de K. Sims, qui repose sur le moteur physique Bullet pour la précision et utilise des contraintes à 6 Degrés de Liberté pour connecter les paires d’organes. Nous avons ainsi évolué un panel de plantes à l’aspect naturel qui devaient survivre malgré des niveaux de ressources variables induits par une source de lumière mobile et des motifs de pluies saisonnières. En plus de cette expérience, nous avons aussi obtenu une meilleure croissance verticale en ajoutant une contrainte biotique artificielle sous la forme de brins d’herbe statiques. La complexité de ce modèle, cependant, ne permettait pas la mise a l’échelle d’une évolution de populations et a donc été réduit dans l’expérience suivante, notamment en supprimant le moteur physique. Cela nous a amené à l’exploration de la co-évolution de populations composées d’une unique espèce et ayant la capacité de se reproduire de manière autonome grâce à notre Bail-Out Crossover (Croisement avec Désistement). Bien que les populations résultantes n’ont pas démontré un grand intérêt pour cette aptitude, elles ont néanmoins fourni d’importantes informations sur les mécanismes d’auto-reproduction. Ceux-ci ont été mis en action dans un second modèle inspiré des travaux de Bornhofen. Grâce à sa légèreté, cela nous a permis de traiter non seulement de plus grandes populations (de l’ordre de milliers d’individus) mais aussi de plus longues périodes évolutionnaires (100 années, approximativement 5000 générations). Notre première expérience avec ce modèle s’est concentrée sur la possibilité de reproduire des cas d’école de spéciation (allopatrique, parapatrique, péripatrique) sur cette plate-forme. Grâce à APOGet, une nouvelle procédure de regroupement pour l’extraction en parallèle d’espèces à partir d’un arbre généalogique, nous avons pu affirmer que le système était effectivement capable de spéciation spontanée. Cela nous a conduit à une dernière expérience dans laquelle l’environnement était contrôlé par de la Programmation Génétique Cartésienne (CGP), permettant ainsi une évolution automatique d’une population et des contraintes abiotiques auxquelles elle était confrontée. Par une variation du traditionnel algorithme 1 + λ nous avons obtenu 10 populations finales qui ont survécu à de brutales et imprévisibles variations environnementales. En les comparant à un groupe contrôle c pour lequel les contraintes ont été maintenues faibles et constantes, le groupe évolué e a montré des performances mitigées: dans les deux types de tests, une moitié de e surpassait c qui, à son tour, surpassait la moitié restante de e. Nous avons aussi trouvé une très forte corrélation entre les chutes catastrophiques de population et la performance des évolutions correspondantes. Il en résulte que l’évolution de population dans des environnements hostiles et dynamiques n’est pas une panacée bien que ces expériences en démontrent le potentiel et souligne le besoin d’études ultérieures plus approfondies.Artificial Life researchers have, for decades, created a plethora of creatures using numerous encoding schemes, motile capabilities and cognitive capacities. One recurring pattern, however, is that focus is solely put on the evolved individuals, with very limited environmental variations. In this work, we argue that more complex abiotic constraints could drive an evolutionary process towards more robust and diverse regions of the genetic space. We started with a complex morphogenetic model, based on K. Sims’ directed graphs, which relied on the Bullet physics engine for accuracy and used 6Degrees of Freedom constraints to connect pairs of organs. We evolved a panel of natural-looking plants which had to cope with varying resource levels thanks to a mobile light source and seasonal rain patterns. In addition to this experiment, we also obtained improved vertical growth by adding an artificialbiotic constraint in the form of static grass blades. However, the computational cost of this model precluded scaling to a population-level evolution and was reduced in the successive experiment, notably by removing the physical engine. This led to the exploration of co-evolution on single-species populations which, thanks to our Bail-Out Crossover (BOC) algorithm, were able to self-reproduce. The resulting populations provided valuable insight into the mechanisms of self-sustainability. These were put to action in an even more straightforward morphogenetic model inspired by the work of Bornhofen. Due to its light weightness, this allowed for both larger populations (up to thousands of individuals) and longer evolutionary periods (100 years, roughly 5K generations). Our first experiment on this model tested whether text-book cases of speciation could be reproduced in our framework. Such positive results were observed thanks to the species monitoring capacities of APOGeT, a novel clustering procedure we designed for online extraction of species from a genealogic tree. This drove us to a final experiment in which the environment was controlled through Cartesian Genetic Programming thus allowing the automated evolution of both the population and abiotic constraints it is subjected to. Through a variation of the traditional1 + λ algorithm, we obtained 10 populations (evolved group e) which had endured in harsh and unpredictable environments. These were confronted to a control group c, in which the constraints were kept mild and constant, on two types of colonization evaluation. Results showed that the evolved group was heterogeneous with half of e consistently outperforming members of c and the other half exhibiting worse performances than the baseline. We also found a very strong positive correlation between catastrophic drops in population level during evolution with the robustness of their final representatives. From this work, two conclusions can be drawn. First, though the need to fight on both the abiotic and biotic fronts can lead to worse performances, more robust individuals can be found in reasonable time-frames. Second, the automated co-evolution of populations and their environments is essential in exploring counter-intuitive, yet fundamental, dynamics both in biological and artificial life

Toulouse Capitole Publications

Winds observed in the Northern European seas with wind lidars, meteorological masts and satellite

Author: Astrup Poul
Badger Merete
Bingöl Ferhat
Courtney Michael
Driesenaar M.L.
Hagemann S.
Hasager Charlotte Bay
Karagali Ioanna
Mikkelsen Torben
Mouche A.
Oldroyd A.
Peña Alfredo
Stein D.
Stoffelsen A.
Publication venue: Technical University of Denmark
Publication date: 01/01/2013
Field of study

Online Research Database In Technology

The nature of wind turbine fatigue loads in wind farms

Author: Aagaard Madsen Helge
Larsen Gunner Chr.
Larsen Torben J.
Publication venue: Technical University of Denmark
Publication date: 01/01/2013
Field of study

Online Research Database In Technology

Wind-Farm Parametrisations in Mesoscale Models

Author: Badger Jake
Hahmann Andrea N.
Hansen Kurt Schaldemose
Volker Patrick
Publication venue: Technical University of Denmark
Publication date: 01/01/2013
Field of study

Online Research Database In Technology