Abstract: This article presents a novel scheme, namely, an intermittent learning scheme based on Skinner’s operant conditioning techniques that approximates the optimal policy while decreasing the ...