Problem
Implement a strategy that records for each transition (s1, a, s2) an abstract transition (g(s1), g(a), g(s2)), where g is a grouping function; use Get Hash Code for g. If in a given state, there are enabled actions that transition to states in new groups, select one of those actions randomly, otherwise choose any action randomly.