Abstract: This article presents a novel scheme, namely, an intermittent learning scheme based on Skinner’s operant conditioning techniques that approximates the optimal policy while decreasing the ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results