Improved behavior equivalence algorithm of multi-agent interactive dynamic influence diagrams

Abstract

结合前瞻搜索思想提出了一种判断模型近似行为等价的方法,首先通过比较候选模型的部分解(即策略树)判断模型近似行为等价,然后自上而下对近似行为等价模型进行快速聚类和修剪,利用代表模型将交互式动态影响图扩展成为平铺动态影响图,最后求解平铺动态影响图.算法减少了候选模型的存储空间和运行时间,提高了算法的效率.最后通过多AgEnT老虎问题及音乐会问题的实验验证了该方法的有效性.The look-ahead search method was used to give a new method for determining approximate behavior equivalence.The method first determined whether the models were approximately behavior equivalent by comparing part of the solution(i.e.policy tree),then quickly clustered top-down and pruned the models that were approximately behavior equivalent.Next,the method used representative model to expand the interactive dynamic influence diagrams into flat dynamic influence diagrams.Finally,the flat dynamic influence diagrams were solved.The method reduces the storage space and the running time,thus improves the efficiency of the algorithm.The effectiveness of the proposed method was verified through experiments on multi-agent tiger and multi-agent concert problems.国家自然科学基金资助项目(61375070

    Similar works