You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
24 lines
349 B
24 lines
349 B
// MDP illustrating non-memoryless optimal rewards (see Figure 6)
|
|
|
|
mdp
|
|
|
|
module M
|
|
|
|
// s=i for state s_i
|
|
s : [0..2];
|
|
|
|
[b0] s=0 -> (s'=0);
|
|
[a0] s=0 -> (s'=1);
|
|
[a1] s=1 -> (s'=2);
|
|
[a2] s=2 -> (s'=2);
|
|
|
|
endmodule
|
|
|
|
rewards
|
|
// state rewards
|
|
s=0 : 1;
|
|
s=1 : 2;
|
|
// transition rewards
|
|
[b0] s=0 : 1;
|
|
[a2] s=2 : 3;
|
|
endrewards
|