What is your observation abstraction? And what algorithm exactly? Tabular Q-Learning? Some function approximation? Do you have any document with details of your implementation?
I'm starting to work with RL but the results aren't that good until now!
@nickabourisk Hey there I know this is about a year old now but if you still have it and are still following this video can you forward those details on to me aswell.
I'm looking at something similar for my honours project, the original plan was to make an agent that can play mario using reinforcement learning until my professor burst my bubble by telling me it was already done (damn it) so now I'm looking at working on and improving past work in the field.
@RyanfaeScotland no problem. Let me know how it turns out! I've edited the video description with a link to my project partner's website (it contains more details). Good luck!
constant jumping, slow forward moveing at any place even at the places where there are obvius ways to avoid enemys whiout even getting on same path as enemys, or skip them purely just by running, then falling above or even past enemys..
The game and environment (stage/levels) are all set up already. We just get a set of observations about the world and get to choose actions for Mario (move right, run, and jump). Because the observations are so huge, we have to abstract it to something workable.
What is your observation abstraction? And what algorithm exactly? Tabular Q-Learning? Some function approximation? Do you have any document with details of your implementation?
I'm starting to work with RL but the results aren't that good until now!
Thanks in advance!
rcparts 2 years ago
Hi rcparts, I sent you a PM about a document with details.
nickabourisk 2 years ago
@nickabourisk Hey there I know this is about a year old now but if you still have it and are still following this video can you forward those details on to me aswell.
I'm looking at something similar for my honours project, the original plan was to make an agent that can play mario using reinforcement learning until my professor burst my bubble by telling me it was already done (damn it) so now I'm looking at working on and improving past work in the field.
RyanfaeScotland 1 year ago
@RyanfaeScotland no problem. Let me know how it turns out! I've edited the video description with a link to my project partner's website (it contains more details). Good luck!
nickabourisk 1 year ago
Thanks, not anywhere near as impressive as your's. We placed 2nd in the RL competition (my partner made it better by adding options and other things).
I believe that this required on the order of several hundred to a thousand iterations.
Good luck with your competition as well! Excited to see the results.
nickabourisk 2 years ago
This is a really nice effort, well done! How many iterationts were required before you arrived at that state?
Good luck with the competition!
robinba2342364 2 years ago
constant jumping, slow forward moveing at any place even at the places where there are obvius ways to avoid enemys whiout even getting on same path as enemys, or skip them purely just by running, then falling above or even past enemys..
FusionNinjin 2 years ago
The game and environment (stage/levels) are all set up already. We just get a set of observations about the world and get to choose actions for Mario (move right, run, and jump). Because the observations are so huge, we have to abstract it to something workable.
nickabourisk 2 years ago
Whoaaa thats awsum.
Did you make the stage layout, or are they made already, and you just put your Mario through it?
billyjoeboucher 2 years ago