Question:
Consider the power of two policies for a single product model. Describe how you could generate a power of three policy (a policy where each $T_i = 3^k T_B$ for some integer $k > 0$). What is the effectiveness (in terms of worst-case performance) of the best power of three policy?
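
One way to explore the question numerically is sketched below. It assumes the standard single-product EOQ cost f(T) = K/T + gT, for which the cost of any interval T relative to the optimum T* is (1/2)(T/T* + T*/T). That cost model is an assumption here, since the question does not restate it. Under that assumption, rounding T* to the nearest power of a base b (in log space) keeps T within [T*/sqrt(b), sqrt(b) T*], so the worst-case ratio would be (1/2)(sqrt(b) + 1/sqrt(b)). The function names are hypothetical, and the sketch lets k be any integer; if k is restricted to k > 0, the bound only holds when T* is large enough relative to T_B.

```python
import math


def eoq_cost_ratio(t, t_star):
    """Cost of reorder interval t relative to the optimal interval t_star,
    assuming the EOQ cost f(T) = K/T + g*T (an assumption of this sketch)."""
    return 0.5 * (t / t_star + t_star / t)


def best_power_policy(t_star, t_base, base):
    """Return (k, T) with T = base**k * t_base minimizing the EOQ cost ratio.

    Only the two exponents bracketing log_base(t_star / t_base) can be
    optimal, so both are compared. k may be any integer in this sketch.
    """
    x = math.log(t_star / t_base, base)
    k_low = math.floor(x)
    candidates = [k_low, k_low + 1]
    best_k = min(
        candidates,
        key=lambda k: eoq_cost_ratio(t_base * base ** k, t_star),
    )
    return best_k, t_base * base ** best_k


def worst_case_ratio(base):
    """Worst-case cost ratio of the best power-of-base policy,
    attained when t_star sits at the geometric midpoint of two powers."""
    return 0.5 * (math.sqrt(base) + 1.0 / math.sqrt(base))


if __name__ == "__main__":
    for b in (2, 3):
        print(f"base {b}: worst-case ratio = {worst_case_ratio(b):.4f}")
```

Under these assumptions the bound for base 3 works out to 2/sqrt(3), roughly 1.155, compared with roughly 1.061 for base 2, so a power of three policy would be within about 15.5% of optimal rather than about 6%.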