39 research outputs found
Deep Learning for Free-Hand Sketch: A Survey
Free-hand sketches are highly illustrative, and have been widely used by
humans to depict objects or stories from ancient times to the present. The
recent prevalence of touchscreen devices has made sketch creation a much easier
task than ever and consequently made sketch-oriented applications increasingly
popular. The progress of deep learning has immensely benefited free-hand sketch
research and applications. This paper presents a comprehensive survey of the
deep learning techniques oriented at free-hand sketch data, and the
applications that they enable. The main contents of this survey include: (i) A
discussion of the intrinsic traits and unique challenges of free-hand sketch,
to highlight the essential differences between sketch data and other data
modalities, e.g., natural photos. (ii) A review of the developments of
free-hand sketch research in the deep learning era, by surveying existing
datasets, research topics, and the state-of-the-art methods through a detailed
taxonomy and experimental evaluation. (iii) Promotion of future work via a
discussion of bottlenecks, open problems, and potential research directions for
the community.Comment: This paper is accepted by IEEE TPAM
SUPPLEMENTARY INFORMATION Dislocation nucleation facilitated by atomic segregation DOI: 10.1038/NMAT5034
This is a set of supplementary data and information supporting the Journal Publication 'Dislocation nucleation facilitated by atomic segregation', DOI: 10.1038/NMAT5034, and available at Journal article in Nature Materials
More Like Real World Game Challenge for Partially Observable Multi-Agent Cooperation
Some standardized environments have been designed for partially observable
multi-agent cooperation, but we find most current environments are synchronous,
whereas real-world agents often have their own action spaces leading to
asynchrony. Furthermore, fixed agents number limits the scalability of action
space, whereas in reality agents number can change resulting in a flexible
action space. In addition, current environments are balanced, which is not
always the case in the real world where there may be an ability gap between
different parties leading to asymmetry. Finally, current environments tend to
have less stochasticity with simple state transitions, whereas real-world
environments can be highly stochastic and result in extremely risky. To address
this gap, we propose WarGame Challenge (WGC) inspired by the Wargame. WGC is a
lightweight, flexible, and easy-to-use environment with a clear framework that
can be easily configured by users. Along with the benchmark, we provide MARL
baseline algorithms such as QMIX and a toolkit to help algorithms complete
performance tests on WGC. Finally, we present baseline experiment results,
which demonstrate the challenges of WGC. We think WGC enrichs the partially
observable multi-agent cooperation domain and introduces more challenges that
better reflect the real-world characteristics. Code is release in
http://turingai.ia.ac.cn/data\_center/show/10
Adaptive Prior-Dependent Correction Enhanced Reinforcement Learning for Natural Language Generation
Natural language generation (NLG) is an important task with various applications like neural machine translation (NMT) and image captioning. Since deep-learning-based methods have issues of exposure bias and loss inconsistency, reinforcement learning (RL) is widely adopted in NLG tasks recently. But most RL-based methods ignore the deviation ignorance issue, which means the model fails to understand the extent of token-level deviation well. It leads to semantic incorrectness and hampers the agent to perform well. To address the issue, we propose a technique called adaptive prior-dependent correction (APDC) to enhance RL. It leverages the distribution generated by computing the distances between the ground truth and all other words to correct the agent's stochastic policy. Additionally, some techniques on RL are explored to coordinate RL with APDC, which requires a reward estimation at every time step. We find that the RL-based NLG tasks are a special case in RL, where the state transition is deterministic and the afterstate value equals the Q-value at every time step. To utilize such prior knowledge, we estimate the advantage function with the difference of the Q-values which can be estimated by Monte Carlo rollouts. Experiments show that, on three tasks of NLG (NMT, image captioning, abstractive text summarization), our method consistently outperforms the state-of-the-art RL-based approaches on different frequently-used metrics
Learning to Reweight Imaginary Transitions for Model-Based Reinforcement Learning
Model-based reinforcement learning (RL) is more sample efficient than model-free RL by using imaginary trajectories generated by the learned dynamics model. When the model is inaccurate or biased, imaginary trajectories may be deleterious for training the action-value and policy functions. To alleviate such problem, this paper proposes to adaptively reweight the imaginary transitions, so as to reduce the negative effects of poorly generated trajectories. More specifically, we evaluate the effect of an imaginary transition by calculating the change of the loss computed on the real samples when we use the transition to train the action-value and policy functions. Based on this evaluation criterion, we construct the idea of reweighting each imaginary transition by a well-designed meta-gradient algorithm. Extensive experimental results demonstrate that our method outperforms state-of-the-art model-based and model-free RL algorithms on multiple tasks. Visualization of our changing weights further validates the necessity of utilizing reweight scheme
The Influence of Top Managers on Environmental Information Disclosure: The Moderating Effect of Company’s Environmental Performance
Abundant extant literature emphasizes the impact of board members attributes’ influence on environmental information disclosure. Considering the voluntary nature of environmental information disclosure, executives have strong managerial discretion when they make such decisions, so this article focuses on top managers’ influence on environmental information disclosure. We hypothesize that top managers’ educational background and age will affect companies’ environmental decision. The hypotheses are verified with the data from Chinese listed manufacturing companies. As the results show, a Master of Business Administration (MBA) educational background and average age of top managers positively affect environmental information disclosure, while the impact of legal educational background is negative. The company’s environmental performance plays a U-shaped moderating effect on the relationship between MBA educational background and environmental information disclosure