Previously, a transition reward with an empty action label ([]) would also match the self-loop transition that the explicit engine adds to fix deadlocks. Fixes #29 and adds a test case.
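
For reviewers, the mismatch can be illustrated with a small toy model. This is only a sketch and not the engine's code: `Transition`, `deadlock_fix`, `reward_matches`, and `skip_deadlock_fix` are hypothetical names, and the assumption that the fix works by recognising the added self-loop and skipping it is mine.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Transition:
    source: int
    target: int
    action: Optional[str]        # None stands for an unlabelled transition
    deadlock_fix: bool = False   # hypothetical flag: self-loop added to repair a deadlock


def reward_matches(reward_action: Optional[str], t: Transition,
                   skip_deadlock_fix: bool) -> bool:
    """Decide whether a transition reward applies to transition t."""
    # Assumed fix: the artificial self-loop is never eligible for a reward.
    if skip_deadlock_fix and t.deadlock_fix:
        return False
    # Matching is by action label; an empty label only matches unlabelled transitions.
    return reward_action == t.action


real = Transition(0, 1, None)                     # genuine unlabelled transition
loop = Transition(1, 1, None, deadlock_fix=True)  # self-loop added in deadlock state 1
transitions = [real, loop]

# Old behaviour: the [] reward also picks up the artificial self-loop.
old_matches = [t for t in transitions
               if reward_matches(None, t, skip_deadlock_fix=False)]
# New behaviour: only the genuine transition is matched.
new_matches = [t for t in transitions
               if reward_matches(None, t, skip_deadlock_fix=True)]
```

In this toy model the old matching returns both transitions, while the corrected matching returns only the genuine one.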