Is it a good idea to apply reinforcement learning to dots and boxes?


I am currently in college, and trying to learn reinforcement learning by myself. My primary goal is building an agent that play games such as dots and boxes.

I have sufficient highschool maths knowledge and I started studying Sutton for RL. Along with that I also studied probability from bertsekas.I also did a course on neural networks from Coursera, though very basic.

Currently while reading Sutton I have many questions, so please tell me if this a good approach to study RL. My main aim is to create a dots and boxes agent and also learn fundamental maths involved with RL on the way.


Posted 2020-01-06T14:05:37.677

Reputation: 111

Question was closed 2020-01-10T20:16:33.303

I taught myself basics of RL mainly by reading Sutton & Barto, so it is possible. Whether or not is a good approach for you, we cannot really answer, too much depends on what you already know, how you learn best etc. Once that is removed (as mainly opinion/too difficult), it is hard to find an answerable question here. Perhaps instead you could ask one - please just one, you can always ask more later - of your "many questions" which I assume are going to be more practical such as how you would set up state, actions and rewards for your game. – Neil Slater – 2020-01-06T17:27:07.427

I would rather say, Approaches like Hopfield Networks are good for those kinds of game. – None – 2020-01-06T18:09:39.593

Welcome to SE:AI! This post hasn't received closevotes, so I'm leaving open for now (but be advised it may get closed at some point, not being the type of query ideally suited for this stack.) Noticed you posted on chat, so thanks for heeding that advice. – DukeZhou – 2020-01-06T21:58:03.600

I think this question includes multiple questions, with OP asking whether he or she should apply RL to dots and boxes, but also asking is Sutton a good approach of studying AI, and both of these are opinion based.. @DukeZhou – Clement Hui – 2020-01-07T00:13:32.517

From RL point of view it's the same as connect-4 and gomoku. Both have open sourced implementation based on AlphaZero on Github:

– mirror2image – 2020-01-07T06:20:12.600

@DukeZhou ok please close the question I was not aware! – jeea – 2020-01-07T16:04:26.583

No answers