Learning from delayed reward und punishment in a spiking neural network model of basal ganglia with opposing D1/D2 plasticity

Jenia Jitsev; Nobi Abraham; Abigail Morrison; Marc Tittgemeyer

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7552 LNCS(PART 1) 459-466

DOI: 10.1007/978-3-642-33269-2_58

Abstract

Extending previous work, we introduce a spiking actor-critic network model of learning from reward and punishment in the basal ganglia. In the model, the striatum is segregated into populations of medium spiny neurons (MSNs) that express either the D1 or the D2 dopamine receptor type. This segregation allows positive and negative expected outcomes to be represented explicitly within the respective populations. In line with recent experiments, we further assume that the D1 and D2 MSN populations have opposing dopamine-modulated bidirectional synaptic plasticity. Experiments were conducted in a grid world in which a moving agent had to reach a remote rewarded goal state. Unlike the previous model, the network learned not only to approach the rewarded goal but also to consistently avoid punishments. The spiking network model explains the functional role of D1/D2 MSN segregation within the striatum, specifically the reversed direction of dopamine-dependent plasticity found at synapses converging on different MSNs. © 2012 Springer-Verlag.
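
The opposing-plasticity idea in the abstract can be illustrated with a deliberately simplified sketch. This is not the authors' spiking network: it is a tabular, non-spiking temporal-difference learner on a one-dimensional chain of states instead of a grid world, and every name in it (`td_error`, `opposing_update`, `train_chain`, the learning rate, the discount factor) is a hypothetical choice made for illustration. It assumes only what the abstract states: one population (D1) represents positive expected outcome, another (D2) represents negative expected outcome, and the same dopamine-like error signal changes their weights in opposite directions.

```python
def td_error(reward, v_next, v_now, gamma=0.9):
    """Dopamine-like temporal-difference error (standard TD(0) form)."""
    return reward + gamma * v_next - v_now


def opposing_update(d1, d2, delta, lr=0.1):
    """Apply one error signal with opposite sign to the two populations.

    Assumption: weights are non-negative, so each population can only
    encode the magnitude of 'its' kind of expected outcome.
    """
    new_d1 = max(0.0, d1 + lr * delta)  # D1: strengthened by positive error
    new_d2 = max(0.0, d2 - lr * delta)  # D2: strengthened by negative error
    return new_d1, new_d2


def net_value(d1, d2):
    """Net expected outcome read out from both populations."""
    return d1 - d2


def train_chain(rewards, episodes=200, gamma=0.9, lr=0.1):
    """Walk left to right along a chain of states; the last one is terminal.

    rewards[i] is the outcome received on entering state i.
    Returns the learned D1 and D2 weights per state.
    """
    n = len(rewards)
    d1 = [0.0] * n
    d2 = [0.0] * n
    for _ in range(episodes):
        for s in range(n - 1):
            s_next = s + 1
            # The terminal state has no future value.
            if s_next == n - 1:
                v_next = 0.0
            else:
                v_next = net_value(d1[s_next], d2[s_next])
            delta = td_error(rewards[s_next], v_next,
                             net_value(d1[s], d2[s]), gamma)
            d1[s], d2[s] = opposing_update(d1[s], d2[s], delta, lr)
    return d1, d2


if __name__ == "__main__":
    print(train_chain([0.0, 0.0, 0.0, 1.0]))   # delayed reward at the end
    print(train_chain([0.0, 0.0, 0.0, -1.0]))  # delayed punishment at the end
```

In this toy setting a delayed reward builds up only the D1 weights along the path, while a delayed punishment builds up only the D2 weights, which is the kind of explicit separate representation the abstract attributes to D1/D2 segregation. How the actual model implements this with spiking neurons and synaptic plasticity is not recoverable from the abstract.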

Citation (APA)

Jitsev, J., Abraham, N., Morrison, A., & Tittgemeyer, M. (2012). Learning from delayed reward und punishment in a spiking neural network model of basal ganglia with opposing D1/D2 plasticity. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7552 LNCS, pp. 459–466). https://doi.org/10.1007/978-3-642-33269-2_58
