This work studies how an AI-controlled dog-fighting agent with tunable decision-
making parameters can learn to optimize performance against an intelligent adversary,as measured by a stochastic objective function evaluated on simulated combat engage-