Punish/Reward: Learning with a critic in adaptive threshold systems (1973)

by B Widrow, N K Gupta, S Maitra
Venue: IEEE Trans. Systems, Man, and Cybernetics