Dataset for "Reformulating Reactivity Design for Data-Efficient Machine Learning"

Abstract

This dataset contains the Gaussian 16 output files for the dataset of aza-Michael addition reactions used in the publication "Fast Identification of Reactions with Desired Barriers by Reformulating Machine Learning Activation Energies". The structures of the methylamine nucleophile, the 1000 Michael acceptor electrophiles and their 1000 transition states were all optimised at the wB97X-D/def2-TZVP level of theory with the IEFPCM(water) implicit solvent model. Before optimisation all Michael acceptors and transition states were conformationally searched using the MMFF force field in Schrödinger's MacroModel software and the lowest energy conformer was selected for DFT calculation. This dataset also contains the Gaussian 16 output files for the SVWN/def2-SVP single-point energy calculations on the dihydrogen activation catalyst and transition state structures.1000 Michael acceptor structures and their transition states for their reactions with methylamine were generated according the the scheme shown in the image "michael_structures.png" using the “R-Group Creator” and “Custom R-Group Enumeration” tools from Schrödinger's Maestro. The resulting Michael acceptors and transition states were conformationally searched using Schrödinger's MacroModel with the MMFF force field and the lowest energy electrophile and transition state conformers were selected for DFT optimisation. Gaussian 16 was used to perform geometry optimisation of the selected conformers as well as the methylamine nucleophile at the wB97X-D/def-TZVP level of theory with the IEFPCM(water) solvent model. Gaussian 16 was also used to perform single-point energy calculations on the Michael acceptor and transition state structures using the PM6 semi-empirical method with the IEFPCM(water) solvent model. Gaussian 16 was used to perform single-point energy calculations at the SVWN/def2-SVP level of theory on all of the transition state and catalyst structures available from the "Vaska's space" dataset (https://doi.org/10.5683/SP2/CJS7QA).“R-Group Creator” and “Custom R-Group Enumeration” tools from Schrödinger Maestro v12.5. “Conformational Search” tool from Schrödinger MacroModel v12.9. Gaussian 16, Revision A.03 and Revision C.01.The "electrophiles.zip" file contains the Gaussian output files for the optimised Michael acceptor structures. The "transitionstates.zip" file contains the Gaussian output files for the optimised aza-Michael addition transition state structures. The "methylamine.out" file is the Gaussian output file for the optimised methylamine nucleophile structure. The "electrophiles_pm6.zip" file contains the Gaussian output files for the PM6 single-point energies for the Michael acceptors. The "transitionstates_pm6.zip" file contains the Gaussian output files for the PM6 single-point energies for aza-Michael addtion transition states. The "methylamine_pm6.out" file is the Gaussian output file for the PM6-optimised methylamine nucleophile structure. The "catalysts_lda.zip" file contains the Gaussian output files for the single-point LDA iridium catalyst energies. The "dihydrogen_lda.zip" file contains the Gaussian output files for the single-point LDA dihydrogen activation transition state energies. The "h2.out" file is the Gaussian output file for the LDA-optimised dihydrogen molecule

    Similar works