It should be clear that the reward matrix for player 2 is: It turns out that this same approach can be used for several other games. Matching pennies: In this game the two players hold a penny each.
3. `move_x`: Simulates gravity in a column. 4. `move_y`: Reorganizes the matrix by shifting columns leftward when a column becomes empty. 5. `play`: Executes a single move, updating the matrix and ...