Temporal-Difference Learning Using Distributed Error Signals