Added: 2 years ago
From: nickabourisk
Views: 6,674
Sort by time | Sort by thread (beta)

Link to this comment:

Share to:
see all

All Comments (6)

Sign In or Sign Up now to post a comment!
  • What is your observation abstraction? And what algorithm exactly? Tabular Q-Learning? Some function approximation? Do you have any document with details of your implementation?

    I'm starting to work with RL but the results aren't that good until now!

    Thanks in advance!

  • Hi rcparts, I sent you a PM about a document with details.

  • @nickabourisk Hey there I know this is about a year old now but if you still have it and are still following this video can you forward those details on to me aswell.

    I'm looking at something similar for my honours project, the original plan was to make an agent that can play mario using reinforcement learning until my professor burst my bubble by telling me it was already done (damn it) so now I'm looking at working on and improving past work in the field.

  • @RyanfaeScotland no problem. Let me know how it turns out! I've edited the video description with a link to my project partner's website (it contains more details). Good luck!

  • Thanks, not anywhere near as impressive as your's. We placed 2nd in the RL competition (my partner made it better by adding options and other things).

    I believe that this required on the order of several hundred to a thousand iterations.

    Good luck with your competition as well! Excited to see the results.

  • This is a really nice effort, well done! How many iterationts were required before you arrived at that state?

    Good luck with the competition!

  • constant jumping, slow forward moveing at any place even at the places where there are obvius ways to avoid enemys whiout even getting on same path as enemys, or skip them purely just by running, then falling above or even past enemys..

  • The game and environment (stage/levels) are all set up already. We just get a set of observations about the world and get to choose actions for Mario (move right, run, and jump). Because the observations are so huge, we have to abstract it to something workable.

  • Whoaaa thats awsum.

    Did you make the stage layout, or are they made already, and you just put your Mario through it?

Loading...
0 / 00Unsaved Playlist Return to active list
    1. Your queue is empty. Add videos to your queue using this button:
      or sign in to load a different list.
    Loading...Loading...Saving...
    • Clear all videos from this list
    • Learn more