Multiple asynchronous agent training