The paper describes implementation of multi-agent
reinforcement learning tool based on temporal differences. The
possibilities of combining learning methods with statistical and
expert methods of forecasting for subsequent integration into the
forecasting subsystem for use in long-term intelligent decision
support system of real-time were considered. The work is supported by RFBR and BRFBR