28 research outputs found

    Improving the Improved Training of Wasserstein GANs: A Consistency Term and Its Dual Effect

    Full text link
    Despite being impactful on a variety of problems and applications, the generative adversarial nets (GANs) are remarkably difficult to train. This issue is formally analyzed by \cite{arjovsky2017towards}, who also propose an alternative direction to avoid the caveats in the minmax two-player training of GANs. The corresponding algorithm, called Wasserstein GAN (WGAN), hinges on the 1-Lipschitz continuity of the discriminator. In this paper, we propose a novel approach to enforcing the Lipschitz continuity in the training procedure of WGANs. Our approach seamlessly connects WGAN with one of the recent semi-supervised learning methods. As a result, it gives rise to not only better photo-realistic samples than the previous methods but also state-of-the-art semi-supervised learning results. In particular, our approach gives rise to the inception score of more than 5.0 with only 1,000 CIFAR-10 images and is the first that exceeds the accuracy of 90% on the CIFAR-10 dataset using only 4,000 labeled images, to the best of our knowledge.Comment: Accepted as a conference paper in International Conference on Learning Representation(ICLR). Xiang Wei and Boqing Gong contributed equally in this wor

    Improving the Improved Training of Wasserstein GANs: A Consistency Term and Its Dual Effect

    Full text link
    Despite being impactful on a variety of problems and applications, the generative adversarial nets (GANs) are remarkably difficult to train. This issue is formally analyzed by \cite{arjovsky2017towards}, who also propose an alternative direction to avoid the caveats in the minmax two-player training of GANs. The corresponding algorithm, called Wasserstein GAN (WGAN), hinges on the 1-Lipschitz continuity of the discriminator. In this paper, we propose a novel approach to enforcing the Lipschitz continuity in the training procedure of WGANs. Our approach seamlessly connects WGAN with one of the recent semi-supervised learning methods. As a result, it gives rise to not only better photo-realistic samples than the previous methods but also state-of-the-art semi-supervised learning results. In particular, our approach gives rise to the inception score of more than 5.0 with only 1,000 CIFAR-10 images and is the first that exceeds the accuracy of 90% on the CIFAR-10 dataset using only 4,000 labeled images, to the best of our knowledge.Comment: Accepted as a conference paper in International Conference on Learning Representation(ICLR). Xiang Wei and Boqing Gong contributed equally in this wor

    Twisted van der Waals Quantum Materials: Fundamentals, Tunability and Applications

    Full text link
    Twisted vdW quantum materials have emerged as a rapidly developing field of 2D semiconductors. These materials establish a new central research area and provide a promising platform for studying quantum phenomena and investigating the engineering of novel optoelectronic properties such as single-photon emission, non-linear optical response, magnon physics, and topological superconductivity. These captivating electronic and optical properties result from, and can be tailored by, the interlayer coupling using moir\'e patterns formed by vertically stacking atomic layers with controlled angle misorientation or lattice mismatch. Their outstanding properties and the high degree of tunability position them as compelling building blocks for both compact quantum-enabled devices and classical optoelectronics. This article offers a comprehensive review of recent advancements in the understanding and manipulation of twisted van der Waals structures and presents a survey of the state-of-the-art research on moir\'e superlattices, encompassing interdisciplinary interests. It delves into fundamental theories, synthesis and fabrication, and visualization techniques, and the wide range of novel physical phenomena exhibited by these structures, with a focus on their potential for practical device integration in applications ranging from quantum information to biosensors, and including classical optoelectronics such as modulators, light emitting diodes (LEDs), lasers, and photodetectors. It highlights the unique ability of moir\'e superlattices to connect multiple disciplines, covering chemistry, electronics, optics, photonics, magnetism, topological and quantum physics. This comprehensive review provides a valuable resource for researchers interested in moir\'e superlattices, shedding light on their fundamental characteristics and their potential for transformative applications in various fields.Comment: 179 pages, 42 figures, Chemical Review

    Extraordinary second harmonic generation modulated by divergent strain field in pressurized monolayer domes

    Get PDF
    The most prominent form of nonlinear optical (NLO) frequency conversion is second harmonic generation (SHG), where incident light interacts with a nonlinear medium producing photons at double the input frequency, which has vast applications in material and biomedical science. Emerging two-dimensional nonlinear optical materials led by transition metal dichalcogenides (TMDs) have fascinating optical and mechanical properties and are highly anticipated to overcome the technical limitations imposed by traditional bulky NLO materials. However, the atomic scale interaction length and low conversion efficiency in TMD materials prevent their further implementation in NLO applications. While some uniaxial strain-engineering studies intensively investigated the anisotropic SHG response in TMDs, they did not realize giant SHG enhancement by exploiting the opto-mechanical characteristics. Herein, we employ proton (H+) irradiation to successfully fabricate large pressurized monolayer TMD domes (d ≥ 10 μm) and conduct a comprehensive investigation and characterization of their SHG performance enhancement. We show that the intensity of SHG is effectively enhanced by around two orders of magnitude at room temperature. Such giant enhancement arises from the distinct separation distance induced by capped pressurized gas and the hemi-spherical morphology, enabling constructive optical interference. Moreover, the unique divergent strain field in TMD domes promotes the first experimental study on the anisotropic nonlinear optical behavior based on biaxial strain conditions in terms of varying strain orientation and relative weights. Our work demonstrates a promising system with enhanced NLO performance and well-preserved biocompatibility, paving a way toward the future nano-scaled quantum optics design and biomedical applications

    VideoGLUE: Video General Understanding Evaluation of Foundation Models

    Full text link
    We evaluate existing foundation models video understanding capabilities using a carefully designed experiment protocol consisting of three hallmark tasks (action recognition, temporal localization, and spatiotemporal localization), eight datasets well received by the community, and four adaptation methods tailoring a foundation model (FM) for a downstream task. Moreover, we propose a scalar VideoGLUE score (VGS) to measure an FMs efficacy and efficiency when adapting to general video understanding tasks. Our main findings are as follows. First, task-specialized models significantly outperform the six FMs studied in this work, in sharp contrast to what FMs have achieved in natural language and image understanding. Second,video-native FMs, whose pretraining data contains the video modality, are generally better than image-native FMs in classifying motion-rich videos, localizing actions in time, and understanding a video of more than one action. Third, the video-native FMs can perform well on video tasks under light adaptations to downstream tasks(e.g., freezing the FM backbones), while image-native FMs win in full end-to-end finetuning. The first two observations reveal the need and tremendous opportunities to conduct research on video-focused FMs, and the last confirms that both tasks and adaptation methods matter when it comes to the evaluation of FMs

    Efficient and Layer-Dependent Exciton Pumping across Atomically Thin Organic–Inorganic Type-I Heterostructures

    Get PDF
    The fundamental light–matter interactions in monolayer transition metal dichalcogenides might be significantly engineered by hybridization with their organic counterparts, enabling intriguing optoelectronic applications. Here, atomically thin organic–inorganic (O–I) heterostructures, comprising monolayer MoSe2 and mono‐/few‐layer single‐crystal pentacene samples, are fabricated. These heterostructures show type‐I band alignments, allowing efficient and layer‐dependent exciton pumping across the O–I interfaces. The interfacial exciton pumping has much higher efficiency (>86 times) than the photoexcitation process in MoSe2, although the pentacene layer has much lower optical absorption than MoSe2. This highly enhanced pumping efficiency is attributed to the high quantum yield in pentacene and the ultrafast energy transfer between the O–I interface. Furthermore, those organic counterparts significantly modulate the bindings of charged excitons in monolayer MoSe2 via their precise dielectric environment engineering. The results open new avenues for exploring fundamental phenomena and novel optoelectronic applications using atomically thin O–I heterostructures.The authors also acknowledge financial support from ANU Ph.D. student scholarship, China Scholarship Council, ANU Major Equipment Committee fund (No. 14MEC34), and Australian Research Council (ARC) Discovery Early Career Researcher Award (DECRA) (DE140100805). ARC Centre of Excellence in Future Low-Energy Electronics Technologies (FLEET), ANU node; this work is also supported by NSFC 61734003, 61521001 and National Key Basic Research Program of China 2015CB921600

    Psychometric Evaluation of the Affective Reactivity Index Among Children and Adolescents in China: A Multi-Method Assessment Approach

    No full text
    The Affective Reactivity Index (ARI) is one of the most studied scales for assessing youth irritability, but little is known about its measurement performance in community populations. This study applied item response theory (IRT), network analysis, and classical test theory (CTT) to examine the psychometric properties of the ARI in a sample of n = 395 community-based children (M-age = 13.44, SD = 2.51) and n = 403 parents. In this sample, the ARI demonstrated good reliability, as well as convergent and concurrent validity. The one-factor structure was supported by both confirmatory factor analysis (CFA) and network analysis. IRT analysis revealed that the ARI effectively distinguished between various levels of irritability within the community population. Network analysis identified "Loses temper easily,""Gets angry frequently," and "Often loses temper" are central aspects of irritability. The findings support the ARI as a brief, reliable, and valid instrument to assess irritability in community children and adolescents

    Improving The Improved Training Of Wasserstein Gans: A Consistency Term And Its Dual Effect

    No full text
    Despite being impactful on a variety of problems and applications, the generative adversarial nets (GANs) are remarkably difficult to train. This issue is formally analyzed by Arjovsky & Bottou (2017), who also propose an alternative direction to avoid the caveats in the minmax two-player training of GANs. The corresponding algorithm, called Wasserstein GAN (WGAN), hinges on the 1-Lipschitz continuity of the discriminator. In this paper, we propose a novel approach to enforcing the Lipschitz continuity in the training procedure of WGANs. Our approach seamlessly connects WGAN with one of the recent semi-supervised learning methods. As a result, it gives rise to not only better photo-realistic samples than the previous methods but also state-of-the-art semi-supervised learning results. In particular, our approach gives rise to the inception score of more than 5.0 with only 1,000 CIFAR-10 images and is the first that exceeds the accuracy of 90% on the CIFAR-10 dataset using only 4,000 labeled images, to the best of our knowledge
    corecore