Friday, October 14, 2005

Final QL MDP Results


Here are the final results for the QL MDP. I have also plotted the results from the Exact MDP on the same graph for comparison. As you can see, the QL MDP gives up some performance: it tends to result in a slightly higher total cost than the Exact MDP.

One peculiar behavior that was observed was that the QL MDP always exhibited compute-move-compute-move-compute-move policies. I wonder if moving helps it eliminate some of the aliasing of the states. Is it intelligently deciding to move to a less ambiguous state?
