Search
Advanced Search
Average Rating (0 User Ratings)
    • Currently 0/5 Stars.
    See all categories
      • Currently 0/5 Stars.
      • Currently 0/5 Stars.
      • Currently 0/5 Stars.
    Rate This Article

Open Access

Research Article

Nitric Oxide Regulates Input Specificity of Long-Term Depression and Context Dependence of Cerebellar Learning

Hideaki Ogasawara1,2*, Tomokazu Doi1,3¤, Kenji Doya1,4, Mitsuo Kawato1

1 ATR Computational Neuroscience Laboratories, Seika, Kyoto, Japan, 2 National Institute of Information and Communications Technology, Seika, Kyoto, Japan, 3 Graduate School of Information Science, Nara Institute of Science and Technology, Takayama, Ikoma, Nara, Japan, 4 Initial Research Project, Okinawa Institute of Science and Technology, Uruma, Okinawa, Japan

Abstract

Recent studies have shown that multiple internal models are acquired in the cerebellum and that these can be switched under a given context of behavior. It has been proposed that long-term depression (LTD) of parallel fiber (PF)–Purkinje cell (PC) synapses forms the cellular basis of cerebellar learning, and that the presynaptically synthesized messenger nitric oxide (NO) is a crucial “gatekeeper” for LTD. Because NO diffuses freely to neighboring synapses, this volume learning is not input-specific and brings into question the biological significance of LTD as the basic mechanism for efficient supervised learning. To better characterize the role of NO in cerebellar learning, we simulated the sequence of electrophysiological and biochemical events in PF–PC LTD by combining established simulation models of the electrophysiology, calcium dynamics, and signaling pathways of the PC. The results demonstrate that the local NO concentration is critical for induction of LTD and for its input specificity. Pre- and postsynaptic coincident firing is not sufficient for a PF–PC synapse to undergo LTD, and LTD is induced only when a sufficient amount of NO is provided by activation of the surrounding PFs. On the other hand, above-adequate levels of activity in nearby PFs cause accumulation of NO, which also allows LTD in neighboring synapses that were not directly stimulated, ruining input specificity. These findings lead us to propose the hypothesis that NO represents the relevance of a given context and enables context-dependent selection of internal models to be updated. We also predict sparse PF activity in vivo because, otherwise, input specificity would be lost.

Author Summary

The cerebellum is essential for coordinated movements. The skills for executing such movements are acquired in modules of the cerebellum, and the appropriate modules in which to store the skill for a certain movement are selected according to the environment, or the context, where the movement is made. We are interested in the molecular mechanisms that enable context-dependent cerebellar learning. In search of the key molecules, we combined established simulation models of Purkinje cells, the only output neurons in the cerebellar cortex, and constructed a new model. Using computer simulation, we found that nitric oxide is likely to have a pivotal role in context-dependent learning. Our simulation also provides insights into how sparse sensory information is coded in the cerebellar cortex. These findings have led us to propose the experimentally testable hypothesis that the relevance of a given context to learning modules is represented by the concentration of nitric oxide.

Introduction

Internal models—that is, neural mechanisms for motor planning and control—mimic the input/output characteristics of the motor apparatus or their inverses [1]. Recent studies have shown that multiple internal models are acquired in the cerebellum [2], and that these can be switched under a given context of behavior [3,4], so that, for example, we can walk on stairs or on escalators without losing balance. Contextual information, which consists of various modalities of afferent signals from the entire body and efferent signals from the cerebrum, is transmitted to the cerebellar cortex via mossy fibers (MFs) and parallel fibers (PFs) [57]. Contexts may also be processed in the upstream cerebral regions, such as the superior parietal lobe, the occipital lobe, and the middle temporal lobe [4]. The combination of contexts and tasks is thought to enable the cerebellum to acquire and switch between multiple internal models in a context-dependent manner [1,6,7]. Indeed, adaptation to multiple motor tasks is possible when they are presented together with adequate sensory or cognitive cues [811]. On the other hand, without these contexts, the motor memory of a previously learned task would be erased by the experience of an opposing task [12]. However, the biological mechanisms of context-dependent learning have yet to be explored.

Long-term depression (LTD) of PF–Purkinje cell (PC) synapses is widely thought to form the cellular basis of cerebellar learning [13], with some controversies regarding its computational roles and the participation of other types of plasticity [1417]. The presynaptically synthesized messenger nitric oxide (NO) is a “gatekeeper” to plasticity at these synapses; LTD and long-term potentiation are induced only in its presence [13,1821]. Because NO diffuses freely to neighboring synapses and affects them [22,23], this volume learning, unlike classical associative learning, is not input-specific and raises questions about the biological significance of LTD as the basic mechanism for efficient supervised learning.

To better characterize the role of NO in cerebellar learning, we constructed a new comprehensive model based on established simulation models of the electrophysiology, calcium dynamics, and signaling pathways of the PC [2426]. It is basically a passive electrical cable representing a PC dendrite (Figure 1A) whose dendritic spines contain ion channels, calcium buffers, calcium pumps (Figure 1B), and a biochemical reaction network (Figure 1C). For our purposes, it was necessary to simulate the whole sequence of events in PF–PC LTD, from stimulation of PF–PC synapses coupled with depolarization, diffusion of NO, and an increase of calcium concentration in the spine ([Ca2+]spine), to activation of the intracellular signaling cascade and phosphorylation of alpha-amino-3-hydroxy-5-methyl-4-isoxazole propionate receptors (AMPARs), using a rather complicated model, because cerebellar LTD is a very nonlinear phenomenon [25,26].

thumbnail

Figure 1. Overview of the Model

(A) The electrophysiological structure of the model. Spine necks connect spine heads (arrowhead) to an unbranched dendrite. The right-hand end of the dendrite corresponds to the soma, which is voltage-clamped. Vertical lines indicate compartmentalization of the dendrite.

(B) Block diagram of signaling pathways for calcium mobilization in each spine. Thick arrows and thin arrows indicate mobilization of calcium and activation of targets, respectively.

(C) Block diagram of AMPAR phosphorylation. Raf is a kinase that phosphorylates and activates MEK. Gq protein is a heterotrimeric guanine nucleotide binding protein that activates PLC. AA, arachidonic acid; DAG, diacylglycerol; PIP2, phosphatidylinositol bisphosphate; PKG, cGMP-dependent protein kinase; PLC, phospholipase C; PP2A, protein phosphatase 2A; sGC, soluble guanylyl cyclase.

doi:10.1371/journal.pcbi.0020179.g001

Our simulation revealed that a PF–PC synapse undergoes LTD only when more than several other PFs in the vicinity are activated concomitantly. This finding has led us to propose a novel hypothesis of cerebellar learning in which NO enables context-dependent acquisition and updating of internal models; in addition, we suggest an animal experiment to critically test this hypothesis. We also predict that the number of PFs responsible for a certain movement in a certain context needs to be strictly limited, because, otherwise, input specificity would be lost.

Results

Outline of the Model

We combined several established models of the PC [2426] and constructed a new comprehensive model to simulate the sequence of electrophysiological and biochemical events in cerebellar LTD. The electrophysiological part of the model (Figure 1A) is based on a realistic PC model proposed by De Schutter et al. (De Schutter's model) [24] but is greatly simplified. It consists of a 90-μm dendritic cable that is as long electrically as the average PC dendrite [27] and is composed of 30 compartments. The peripheral ten compartments (Figure 1A, left side) are 1.3 μm in diameter and 2 μm in length; the middle ten are 2.6 μm in diameter and 3 μm in length; and the proximal ten (Figure 1A, right side) are 3.9 μm in diameter and 4 μm in length. A total of 1,350 single-compartment spines are connected to the dendrite (15 spines per micrometer of the dendrite length [28,29]) by a spine neck. The electrical potential of each spine and dendritic compartment is expressed in cable equations [30]. Despite considerable alterations to De Schutter's model, our model retains important characteristics of the PC, as expected from the robustness of the original model to any changes made in its parameter values [31]. For example, the input resistance measured at the tip of the dendrite is 129 MΩ, which is on the same order as the values estimated from real PCs (132–511 MΩ) [27].

Stimuli to PF–PC synapses are represented as AMPAR currents into the spines (Figure 1B), metabotropic glutamate receptor (mGluR) activation (Figure 1C), and diffusion of presynaptically synthesized NO. We assumed a constant release probability and did not model presynaptic plasticity. The kinetics of NO and calcium are introduced in later subsections. Spillover of glutamate was not considered, because granular layer stimulation and molecular layer stimulation that involve approximately 60 PF–PC synapses each have been reported to produce excitatory postsynaptic currents with almost equal decay time constants [32], suggesting that glutamate spillover has only minimal effects, if any, on the extent of stimulation in our study. Inhibitory interneurons were not simulated.

The biochemical reaction network for LTD (Figure 1C) was derived from a model proposed by Kuroda et al. (Kuroda's model) [26]. According to their study, AMPAR phosphorylation, the key step in expression of cerebellar LTD [13,26], is regulated in the initial phase by Ca2+- and diacylglycerol-mediated linear pathways, and is maintained in the intermediate phase by a positive feedback loop pathway that is mediated by mitogen-activated protein kinase (MAPK), cytosolic phospholipase A2 (cPLA2), and protein kinase C (PKC). Indeed, the essential role of this feedback loop in cerebellar LTD has been virtually proved by a series of in vitro pharmacological experiments demonstrating the following: PKC activation results in MAPK activation; MAPK activation results in PKC activation; cPLA2 is activated by MAPK and activates PKC; and prolonged activation of PKC is necessary for induction of LTD [33]. Similar to Kuroda et al., we measured the concentration of phosphorylated AMPARs ([P-AMPAR]) as the output of the simulation.

Production, Diffusion, and Decay of Nitric Oxide

Firing of PFs activates NO synthase (NOS) in their presynaptic terminals [13,18,19,21]. The NO produced by NOS then freely diffuses into the intracellular and extracellular spaces and decays, affecting postsynaptic signaling pathways en route [13,20,21,23]. We modeled the decay and diffusion of NO synthesized at a PF bouton (Figure 2A) and calculated the NO concentration ([NO]) at various distances from the bouton and at various times. Figure 2B is a plot of the time course of NO synthesized in a bouton at various distances from the observation point, and Figure 2C shows its spatial distribution at various time points. [NO] rapidly peaked after activation of NOS, then gradually fell (Figure 2B). The distribution of NO was restricted to within 10 μm or 20 μm of its source (Figure 2C). In reality, however, when a PF fires, NO is not synthesized at a single bouton but at multiple neighboring boutons, which occur every 5.2 μm along the PF [18,19,21,34] (Figure 2D). These boutons also contribute to [NO] at the observation point on the PC dendritic plane, according to their distance away from the observation point; that is, μm (i = 0, ±1, ±2,...), where R is the distance between the PF and the observation point, and i is the number assigned to a given bouton, with i = 0 for the one on the dendritic plane. Figure 2E and 2F are plots of the time course of PF-derived NO measured at various distances, and of its spatial distribution at various time points, respectively. NO derived from a PF persisted slightly longer than that derived from a bouton. The concentrations of NO derived from a PF at R = 1 μm, 5 μm, and 10 μm returned to 36.8% of their peak values in 64 ms, 72 ms, and 79 ms, respectively (Figure 2E), whereas the concentrations of NO derived from a single bouton at the same distances returned to 36.8% of their peak values in 59 ms, 67 ms, and 75 ms, respectively (Figure 2B). In addition, NO derived from a PF was spatially less restricted than that derived from a single bouton. At 25 ms, 50 ms, and 100 ms after activation of NOS, the ratios of the [NO] derived from a PF at R = 10 μm to that at R = 5 μm were 0.33, 0.35, and 0.35, respectively (Figure 2F), whereas the ratios of the [NO] derived from a bouton at R = 10 μm to that at R = 5 μm were 0.23, 0.24, and 0.24, respectively (Figure 2C). The spatial distribution of NO synthesized in a PF (Figure 2F) quantitatively agrees with the results of a previous slice experiment [35].

thumbnail

Figure 2. Simulated Concentration of NO Derived from a Single Bouton or from Multiple Boutons on a PF

(A–C) NO derived from a single bouton.

(A) Activated NOS in a bouton (open circle) produces NO, which diffuses three dimensionally and decays. Only some of the synthesized NO reaches the observation point (closed circle). r, distance between the observation point and the NO-producing bouton.

(B) [NO] is plotted against time. The blue solid line, red dashed line, and black dotted line indicate [NO] measured at 1 μm, 5 μm, and 10 μm, respectively, from the bouton.

(C) [NO] is plotted against distance. The blue solid line, red dashed line, and black dotted line indicate the concentration at 25 ms, 50 ms, and 100 ms, respectively, from stimulation of the bouton.

(D–F) NO derived from boutons along a PF.

(D) When a PF fires, NO is released from its boutons (open circles) located at 5.2-μm intervals along the PF and diffuses [23,34]. Thus, [NO] measured at the observation point (closed circle) on the dendritic plane at distance R from the PF is the sum of [NO] from each bouton at distance μm (i = 0, ±1, ±2,...). Of course, NO diffuses from each bouton in all directions, but only diffusion towards the observation point is indicated (arrows) for clarity.

(E) [NO] is plotted against time. The blue solid line, red dashed line, and black dotted line represent [NO] at 1 μm, 5 μm, and 10 μm, respectively, from the PF.

(F) [NO] is plotted against distance from the PF. The blue solid line, red dashed line, and black dotted line indicate the concentration at 25 ms, 50 ms, and 100 ms, respectively, after stimulation of the PF. [NO] at distances smaller than 0.5 μm are not shown in (C) and (F), because the delta function in Equation 1 (Materials and Methods) made the NO concentration near the site of synthesis dependent on the discretization size of r in its numerical solution.

doi:10.1371/journal.pcbi.0020179.g002

Calcium Kinetics

The calcium kinetics of our model (Figure 1B) are based on De Schutter's model [24] and on a calcium dynamics model of a PF–PC synaptic spine proposed by Doi et al. (Doi's model) [25]. Briefly, calcium ions enter the cytosol of the spine through voltage-gated calcium channels (VGCCs) in the plasma membrane and through inositol 1,4,5-triphosphate (IP3) receptors (IP3Rs) in the endoplasmic reticulum (ER) [13,25,36], and are sequestered by calcium buffers, taken in by the ER, or pumped out by calcium pumps. Depolarization of the dendrite, whether by AMPAR currents or by propagation from other parts of the dendrite, opens VGCCs. The mGluR–IP3R pathway detects the coincidence of PF inputs and depolarization, and activates a large release of calcium from the ER [25]. For simplicity, our model postulates that during the 2 s after each stimulus, a fixed number (5.3 × 104) of calcium ions flow into the stimulated spines (Figure S1), instead of actually simulating the mGluR–IP3R pathway [25].

To understand the calcium kinetics in our model, we stimulated PF–PC synapses conjunctively with somatic depolarization (100 ms in duration) and monitored [Ca2+]spine (Figure 3). [Ca2+] reached approximately 9 μM in 200 ms in the spines that were directly stimulated (Figure 3A); [Ca2+] increased by a much smaller extent in the neighboring spines that were not directly stimulated (Figure 3B). These findings are quantitatively consistent with the results of previous experiments [36,37]. Although we had expected that stimuli to neighboring synapses would locally depolarize the dendrite, opening VGCCs in the spine and allowing more calcium to enter, the size and waveform of [Ca2+]spine, either in stimulated synapses or in synapses not directly stimulated, were practically identical and independent of the number of stimulated synapses within the range tested (one to 30).

thumbnail

Figure 3. Time Course of [Ca2+]spine

[Ca2+]spine was monitored during and after 100 ms of somatic depolarization (−70 mV to 0 mV, bold bars) in the presence (A) and absence (B) of a conjunctive stimulus to the synapse at time = 0 ms.

doi:10.1371/journal.pcbi.0020179.g003

Time Course of AMPAR Phosphorylation

We then observed the time course of AMPAR phosphorylation after synaptic stimulation paired with somatic depolarization. A synapse 9 μm proximal from the tip of the dendrite (the synapse of interest) was repetitively stimulated at 1 Hz for 5 min together with zero to 29 neighboring synapses (0–20 μm from the tip of the dendrite; Table S1), and the soma was depolarized from −70 mV to 0 mV for 100 ms simultaneously with each synaptic stimulus. NO synthesized not only in the stimulated synapses but also in the other boutons along the same PFs (Figure 2D–2F) was considered. [P-AMPAR] was monitored in the synapse of interest (the synapse 9 μm from the tip) as a measure of LTD induction. Figure 4A is a 3-D graph of [P-AMPAR] plotted against the number of stimulated synapses (N) and time, and Figure 4B shows sections from Figure 4A when N = 2, 11, 20, and 29. When two PF–PC synapses were stimulated, only a small proportion of the AMPARs were phosphorylated, and these were soon dephosphorylated (Figure 4B). This result is in accordance with a slice experiment in which stimulation of a small number of PFs, coupled with depolarization, failed to induce LTD [38]. Application of stimuli to a larger number of synapses led to stronger and more prolonged phosphorylation of AMPARs (11 or more synapses; Figure 4B). At any strength of stimulus, [P-AMPAR] returned to baseline within an hour (Figure 4A and 4B). In reality, however, LTD, which also involves receptor internalization, gene expression, and protein synthesis, is more persistent than the AMPAR phosphorylation modeled here [13,39,40].

thumbnail

Figure 4. Concentration of Phosphorylated AMPARs at Various Time Points and at Various Numbers of Stimulated Synapses

(A) [P-AMPAR] is three-dimensionally plotted against time and N.

(B) [P-AMPAR] plotted against time at N = 2 (blue solid line), 11 (red dashed line), 20 (black dotted line), and 29 (green dash-dotted line).

(C) [P-AMPAR] plotted against N at time = 5 min (blue solid line), 19 min (red dashed line), and 30 min (black dotted line), and at the 40-min average (green dash-dotted line).

(D) [P-AMPAR] measured at 19 min plotted against N in a stimulated synapse (the synapse of interest, red dashed line) or a neighboring unstimulated synapse (black solid line). [P-AMPAR] of the stimulated synapse is superimposed on a sigmoid curve (light blue solid line) at an SF of 9.9 and N50% of 9.7 (see Results). The thin dotted line indicates [P-AMPAR]50%,19min.

doi:10.1371/journal.pcbi.0020179.g004

The All-or-None Principle in the Intermediate Phase

Kuroda et al. [26] have pointed out that AMPAR phosphorylation in the intermediate phase, which corresponds to the success of induction of LTD, follows the all-or-none principle. Figure 4C is a graph of simulated [P-AMPAR] plotted against N at 5 min, 19 min, and 30 min, and the 40-min average of [P-AMPAR]. In our model, AMPAR phosphorylation initially occurred in a graded fashion according to N (Figure 4A, ~5 min; Figure 4C, 5 min), but persisted in an all-or-none fashion in the intermediate phase (Figure 4A, 10–30 min; Figure 4C, 19 min and 30 min). To evaluate the nonlinearity of AMPAR phosphorylation in the intermediate phase, the [P-AMPAR] values at various time points versus N were fitted to a sigmoid curve, [P − AMPAR] = [P − AMPAR] N = 0 + ([P − AMPAR] N = ∞ − [P − AMPAR] N = 0), where [P-AMPAR] N = 0, [P-AMPAR] N = ∞, and N50% are [P-AMPAR] at N = 0, [P-AMPAR] estimated at N = ∞, and N at which [P-AMPAR]50% of AMPARs arephosphorylated, respectively. SF stands for the sloping factor, which indicates the nonlinearity of the response. Goodness of fit was quantified by the ratio of the confidence interval of the SF to the SF. [P-AMPAR] best fitted a sigmoid curve at 19 min with a SF of 9.9 (95% confidence interval, 8.7–11.1), indicating strong nonlinearity of AMPAR phosphorylation in the intermediate phase (Figure 4C; see also Figure 4D for [P-AMPAR] at 19 min with the sigmoid curve superimposed). The curve fitting estimated a [P-AMPAR]N=∞ of 0.19 μM and an N50% of 9.7. The [P-AMPAR] at other time points, and the 40-min average, fitted only poorly to sigmoid curves. We therefore concluded that the [P-AMPAR] at 19 min best reflects successful induction of LTD.

thumbnail

Figure 5. Importance of the Time-Averaged Concentration of NO

(A) Time course of AMPAR phosphorylation induced by continuous exposure to NO (red dotted line) or by NO pulses (blue solid line) equivalent in time-averaged concentrations (0.23 nM, 0.69 nM, and 1.1 nM from bottom to top).

(B) [P-AMPAR] at 19 min is plotted against the frequencies of NO pulses whose 1-s–averaged concentration was 0.23 nM, 0.69 nM, or 1.1 nM (from bottom to top).

doi:10.1371/journal.pcbi.0020179.g005

Synaptic Specificity

LTD has been reported to spread to neighboring PF–PC synapses in vitro when dozens of PFs are stimulated in synchrony with somatic depolarization [38,41,42]. In other words, under such conditions, a submicromolar increase in [Ca2+]spine is sufficient for induction of LTD. However, stimulation of a smaller number of PFs, coupled with depolarization, fails to induce LTD, even in stimulated synapses [38]. Furthermore, when the stimulation parameters are varied, stimulated synapses and their neighboring synapses often undergo modification in the same direction—that is, either long-term potentiation or LTD [38]. These findings conflict with the proposed role of cerebellar LTD in supervised learning [13], in which input specificity is taken for granted. We therefore compared AMPAR phosphorylation in a stimulated spine and an adjacent spine to determine whether there is any opportunity for synaptic specificity. Figure 4D plots [P-AMPAR] at 19 min after stimulation in a stimulated synapse and a neighboring synapse 9 μm from the tip of the dendrite against the number of stimulated synapses, N. The slope of [P-AMPAR] in the neighboring synapse was less steep than that in the stimulated synapse, probably because the MAPK-mediated positive feedback loop, which is responsible for the bistability of AMPAR phosphorylation [26], is not fully activated by submicromolar [Ca2+]spine. At large Ns, however, the slope eventually reached values comparable with those in the stimulated synapse. The slope for the stimulated synapse crossed [P-AMPAR]50% at 19 min ([P-AMPAR]50%,19min) when N = 10, whereas that of the neighboring synapse did not cross this value until N reached 19. We therefore concluded that synapse-specific LTD is theoretically possible within a narrow range of N (ten to 18), although experiments have failed to clearly demonstrate input specificity of LTD [38]. Just by adjusting parameters so that LTD would begin to spread through the neighborhood at an N of about 20 (Materials and Methods), our simulation succeeded in demonstrating the failure of LTD at smaller Ns (Figure 4B), which has also been shown experimentally [38].

The Time-Averaged Concentration of NO, Not Its Actual Waveforms, Is Critical for AMPAR Phosphorylation

Although only a fraction of PFs form electrically connected synapses with a PC [43], other PFs can also affect these synapses by producing NO. PFs convey diverse modalities of signals, and the set of PFs responsible for a certain motion in a certain context does not always fire in exact synchrony. The more out of synchrony they fire, the more blunt the NO waveforms become. Taking the slow time constant of protein phosphatase 2A inactivation [26] (Figure 1C) into account, we hypothesized that [P-AMPAR] would depend not on NO waveforms, but rather on its time-averaged concentration. To test this hypothesis, we compared receptor phosphorylation induced by pulses of NO and that induced by continuous exposure to NO. A spine was stimulated for 5 min, either repetitively at 1 Hz with realistic waveforms of NO (identical in shape to the solid line in Figure 2E and variable in amplitude), or constantly with the equivalent time-averaged concentration of NO. Calcium surges (identical in shape and amplitude to the curve in Figure 3A) were applied concomitantly at 1 Hz. Figure 5A shows the time course of AMPAR phosphorylation induced by constant NO and that induced by equivalent pulses. Their time courses almost coincide, suggesting that constant NO can substitute for equivalent pulses in AMPAR phosphorylation. We then tested NO waves at various frequencies to further confirm the independency of AMPAR phosphorylation from the waveforms of NO. It should be noted that what we examined here is not the frequency of stimuli to PFs, upon which presynaptic NO synthesis is dependent [18,35]. A spine was stimulated for 5 min with NO pluses at a frequency of 0.25 Hz, 0.5 Hz, 1 Hz, 2 Hz, or 4 Hz, and with calcium surges at 1 Hz. The NO pulses were rescaled so that their time-averaged concentrations would be the same. Figure 5B shows [P-AMPAR] at 19 min, the time point when the all-or-none principle of AMPAR phosphorylation is most obvious (Figure 4C), plotted against the frequencies of NO pulses. This figure indicates that the effect of NO on AMPAR phosphorylation is independent of its frequency within the range shown. Because of these findings, we were able to define the concentration of constant NO that would result in [P-AMPAR]50%,19min of phosphorylated AMPARs at 19 min (median effective concentration, or EC50). The EC50 of NO for a stimulated spine and that for a neighboring spine are 0.41 nM and 0.69 nM, respectively. Introduction of the EC50 of NO enabled us to predict PF activity in vivo in a very simple scheme, as described below.

Distribution of PFs Assigned to a Certain Movement in a Certain Context

A fraction of PFs fire to achieve a certain movement in a certain context [6,7], but their distribution is totally unknown. At one extreme, they might all be concentrated in a small area of a PC dendrite; at the other extreme, they might be evenly distributed in the molecular layer, with only one of them forming an electrically active synapse with the PC (if none of them formed an electrically active synapse, there would be no room for synapse-specific modification linked to that movement). We were able to examine the relationship between PF distribution and AMPAR phosphorylation in a simple scheme in which [NO] is constant throughout stimulation, because AMPAR phosphorylation depends on the time-averaged concentration of NO, and not on its actual waveforms (Figure 5). Synaptic stimulation was represented as NO synthesis, AMPAR current, and mGluR activation, but we were also able to regard neighboring PFs merely as additional sources of NO, because stimuli to them had hardly any effect on [Ca2+]spine in the synapse of interest, and because the downstream signaling cascade of mGluR activation was synapse-specific, at least in our model. In other words, it was unnecessary to distinguish the minority of PFs that formed electrical connections with the PC [43] from the other PFs that were not connected to that PC [43]; the only thing that mattered was the number of firing PFs in the neighborhood of the synapse of interest. To make this point obvious, in this section we use “n” instead of “N” to indicate the number of activated PFs.

Suppose that the set of PFs responsible for a certain movement in a given context projects onto the PC dendritic plane according to a Gaussian distribution (Figure 6A), with the synapse of interest at its center. The 1-s–averaged concentration of NO will be given as , where n and <f> are the total number of PFs that fire to generate the movement in that context, and their average firing rate, respectively. The product of n and <f> represents the activity of these PFs, and σ is the standard deviation of their spatial distribution. [NO(R)] is the 1-s–averaged concentration of NO that is synthesized in a single PF and measured at a distance of R μm (Figure 2D–2F). The function inside the integral sign, , represents the probability-weighed time-averaged concentration of NO derived from a PF responsible for the movement, because denotes the probability density of such a PF occurring at distance R μm. This function, plotted against R in Figure 6B, clearly indicates that regardless of the size of σ, NO cannot be provided by PFs more than several dozens of micrometers away because of the restricted NO distribution (Figure 2F). Figure 6C plots σ against the n<f> at which [NO] reaches the EC50 (that is to say, the [NO] to phosphorylate [P-AMPAR]50%,19min of AMPARs at 19 min) either for a stimulated synapse or for an adjacent unstimulated synapse. The interpretation is as follows: below the lower curve, no substantial LTD occurs in any synapse. Above the upper curve, LTD occurs not only in stimulated synapses, but also in neighboring synapses that are not directly stimulated, and synapse specificity is lost. Between the two curves, LTD is synapse-specific.

thumbnail

Figure 6. Prediction of PF Activity In Vivo

The spatial distribution of the PFs responsible for a certain movement in a certain context was assumed to follow a Gaussian distribution with a standard deviation of σ.

(A) The occurrence of the PFs responsible for a particular motion in a particular context was assumed to follow a Gaussian distribution, . σ is either 10 μm (solid line), 30 μm (dashed line), or 90 μm (dotted line).

(B) The probability-weighed time-averaged NO concentration, , is plotted against R. σ is either 10 μm (solid line), 30 μm (dashed line), or 90 μm (dotted line).

(C) Against σ, this panel plots the n<f> at which [P-AMPAR]50%,19min of AMPARs in a stimulated synapse (solid line) or in a neighboring unstimulated synapse (dashed line) was phosphorylated at 19 min.

(D) R90%, which satisfies , is plotted against σ.

(E) n90%<f> (see Results for explanation) is plotted against σ. Solid line, stimulated synapse; dashed line, neighboring unstimulated synapse.

(F) The ratio of n90%<f> to the total number of the PFs projecting within R90% μm of the dendritic plate is plotted against σ. Solid line, stimulated synapse; dashed line, neighboring unstimulated synapse.

doi:10.1371/journal.pcbi.0020179.g006

In reality, however, NO does not travel very far, and only PFs in the vicinity contribute to the local concentration of NO, regardless of the size of σ (Figures 2F and 6B). Thus, it must be much more practical to estimate the number and distribution of PFs that actually contribute to [NO] at the synapse of interest. Suppose that among the n PFs responsible for a given movement, n90% within R90% μm produce 90% of the EC50 of NO. R90% and n90% appear to represent the actual distribution and number of responsible PFs. Figure 6D plots σ against R90%, which was obtained by numerically solving . R90% reached a plateau at approximately 7 μm, suggesting that regardless of the size of σ, the state of a PF–PC synapse can be influenced only by PFs in the immediate vicinity. In Figure 6E, σ is plotted against n90%<f> for a stimulated synapse and a neighboring synapse not directly stimulated. In contrast to n<f>, as σ rose, n90%<f> plateaued at approximately 8 for the stimulated synapse and at 14 for the neighboring synapse. This implies that for any movement in any given context, the number of firing PFs in the vicinity needs to be strictly limited; otherwise, input specificity would be lost. Furthermore, regardless of the size of σ, AMPAR phosphorylation in a stimulated synapse in the intermediate phase was not achieved by the firing of a single PF, but by the firing of multiple PFs (Figure 6E). Thus, it can be concluded that plasticity of a synapse is regulated by the surrounding PF activity.

The ratio of n90%<f> to the total number of the PFs projecting within R90% μm of the dendritic plate (Figure 6F) can be used as an index of the sparseness of predicted PF activity, and it will enable an easy comparison with our prediction with future experiments. The ratio is obtained from the following reckoning: the size of the PC dendritic plate is 250 μm × 300 μm, and 400,000 PFs traverse the dendritic arborization of a PC in cats [5]. Thus, the density of PFs projecting on the PC dendritic plate is estimated as 5.3/μm2, and the total number of PFs traversing a circle (of radius R90% μm) on the dendritic plate is estimated as . Figure 6F shows the ratio for the stimulated synapse, and that for the neighboring synapse, plotted against σ. Except at very small σ (≤2.3 μm), the ratios were less than 0.1 and approached 0.017 (at the stimulated synapse) and 0.0099 (at the neighboring synapse) as σ rose, suggesting that if LTD is input-specific in vivo, only several percent of PFs in the vicinity fire to encode a particular motion in any given context.

Discussion

We simulated the sequence of events in cerebellar LTD from receptor activation to AMPAR phosphorylation in search of the cellular substrates of context-dependent motor learning [1,9,10,12]. The cellular events modeled here are not, of course, the whole picture of cerebellar LTD. For example, our model does not consider δ2 receptors [13], endocannabinoid signaling [44], dependence of NO release on the frequency of PF activity [35], mGluR-linked slow excitatory postsynaptic potentials [45], desensitization of soluble guanylyl cyclase [46], cyclic guanosine monophosphate (cGMP)–mediated activation of phosphodiesterase [47], gene expression, or protein synthesis [39,40]. However, this rather simple model was able to demonstrate the key features of PF–PC LTD (Figure 4) and shed light on some important characteristics of cerebellar LTD and learning (Figures 5 and 6).

The simulation results revealed that synapse-specific modification is theoretically possible despite the difficulty in demonstrating it experimentally [38]. Furthermore, [P-AMPAR] in the intermediate phase was found to be dependent on NO derived from surrounding PFs. During low PF activity in the vicinity, stimulated PF–PC synapses were incapable of undergoing prolonged AMPAR phosphorylation because of an insufficient NO concentration; during excessive PF activity, even neighboring synapses underwent AMPAR phosphorylation, and synapse specificity was lost. Only at a moderate level of PF activity in the vicinity did AMPAR phosphorylation occur in a synapse-specific manner. We predict that any movement in any context is encoded by a small number of PFs, because otherwise synapse specificity would be lost.

Although LTD can be induced by a large increase in calcium alone [48], in vivo studies have shown the essential role of NO in cerebellar learning and adaptation [13,4951], suggesting that LTD in such a condition is unphysiological. Similarly, LTD induced by strong activation of PFs alone [42] can be regarded as another unphysiological extreme, because the predicted sparse activity of PFs in vivo (Figure 6) will never be sufficient for supralinear increase in calcium levels if it is not paired with CF firing. Between these two extremes lies our simulation setting, where LTD is induced by the increase of NO and calcium levels due to conjunctive activation of PFs and the CF, which corresponds better to the physiology of behaving animals.

Synaptic Specificity

Our simulation shows that synapse-specific AMPAR phosphorylation is theoretically possible (Figure 4). Some slice experiments have demonstrated that, in most cases, standard procedures to induce homosynaptic PF–PC LTD also induce heterosynaptic LTD in neighboring synapses, bringing into question the input specificity of cerebellar LTD [38,41,42]. There are, however, strong reasons to expect input specificity in vivo, if not in vitro. Cerebellar LTD needs to be input-specific if it is the cellular substrate of supervised learning, as suggested experimentally and theoretically [13]. In addition, synapse-specific modification is more advantageous by orders of magnitude than the nonspecific form in terms of metabolic costs and memory capacity [13,41]. Moreover, neurons have acquired through evolution the apparatus to localize modification in individual spines, such as the intraspinal calcium store [13] and the calcium-compartmentalizing spine neck [52]. Thus, our demonstration of room for synapse-specific AMPAR phosphorylation is not at all trivial. The range of the number of stimulated synapses for input-specific LTD was found to be narrow (Figures 4D and S3), which accounts for the difficulty in demonstrating this form of LTD experimentally [38].

In contrast with the previous report that multiple PFs need to be stimulated in order to induce LTD [38], Casado et al. [19] demonstrated LTD at the synapse between a single granule cell (GC) and a PC. However, as is suggested in their paper [19], not only the PF–PC synapse, but also multiple synapses at the ascending part of the GC axon, could be involved. Further in vitro and in vivo studies are necessary to confirm the possibility that stimuli to a single synapse is sufficient for induction of LTD.

NO Represents the Relevance of Contexts

The IP3R, which is sensitive to both calcium and IP3, appears to be sufficient for the coincidence detection mechanism triggered by conjunctive stimulation of the PF and the climbing fiber (CF) [25,36]. So, what can NO add in terms of computation? Considering that only a fraction of PFs traversing a PC dendritic plate make electrical connections with the PC [43], this diffusing messenger seems to have a specific role in cerebellar learning. According to the results of our simulation, input-specific LTD necessitates that NO be provided by a modest number of surrounding PFs. This is a very important finding, because it might underlie context-dependent learning in the cerebellum [1,3,4,9,10], as we propose here. In our hypothesis, NO represents the relevance of a given context and enables context-dependent selection of internal models to be updated. Imagine that a PF–PC synapse responsible for a certain movement in a certain context is surrounded by thousands of PFs coding vast modalities of information, such as joint angles, audiovisual cues, verbal instructions, etc. The synapse will be modified only when it and a few of its surrounding PFs fire together and produce a sufficient concentration of NO (Figure 7A, left). However, if a small fraction of them happen to fire in an inappropriate situation, the synapse will remain unmodified, because [NO] does not reach the level necessary for prolonged AMPAR phosphorylation (Figure 7A, right). As a result, the internal models for a movement are acquired or updated only when they are responsible in the given context. In other contexts, where these internal models are not responsible, they are left unmodified. This corresponds to the system-level findings that the cerebellum can form modules of internal models according to the context [3,4], while adaptation to multiple tasks is interfered with in the absence of adequate contexts [8,12]. In a later subsection, we will propose an animal experiment to critically test our hypothesis (Figure 7B).

thumbnail

Figure 7. A Novel Hypothesis of the Cellular Mechanism for Context-Dependent Cerebellar Learning and an Experiment to Test It

(A) Schematically shown are a PF–PC synapse and its surrounding PFs in a parasagittal section of the cerebellar cortex. Firing PFs are indicated by red solid circles, and those at rest are indicated by gray dotted circles (note that their beams are perpendicular to the sagittal plane.) Some neighboring PFs might fire together with the PF making the synapse (arrow) and produce NO (orange semitransparent circles); others might not. Left: when a few PFs in the neighborhood fire together, that will result in a sufficient concentration of NO for the PF–PC synapse to be modified. Right: when only a small number of PFs fire together, [NO] will not increase to the level necessary for the synapse to be modified.

(B) The experimental protocol.

doi:10.1371/journal.pcbi.0020179.g007

According to the Marr–Albus model [53,54], divergence of MF inputs to an enormous number of GCs enables pattern separation, so that after learning, similar sensory inputs are associated with different motor outputs in different contexts. However, a recent in vivo study revealed that a burst of activity in an MF is not filtered at the MF–GC synapse, but rather directly transmitted to the downstream GC [55], implying that input patterns are not likely to be separated there. Our hypothesis might provide a resolution by suggesting that pattern separation, or assignment of adequate internal models for a movement to be learned in a given context, takes place in the molecular layer instead, according to the [NO] determined by the surrounding PF activity in that context.

Because bursts of PF activity are more efficient than single pulses in induction of LTD in cerebellar slices [52,56], one may argue the possibility that repetitive activation of a single PF is sufficient for induction of LTD. However, it is very difficult to experimentally test this, because even the weakest stimuli might activate several PFs [36]. Bursts of PF activation might be too strong and less appropriate than single stimuli as an analog for physiological activation of PFs in vivo for the following reason: single activation of PFs in slices elicits repetitive firing through the antidromic activation of GCs [57], just like physiological stimuli in vivo typically evoke triplets of action potentials in GCs [55], presumably firing their PFs as many times. On the other hand, bursts of activation applied in slice experiments usually consist of five pulses, which are much greater stimuli than triplets in vivo, considering the multiplication of pulses in vitro [57].

A recent study implicated the presynaptic terminals of interneurons rather than PFs as the origin of NO [58]. If so, our simulation model and hypothesis would still apply, because the activity of interneurons reflects that of connected PFs, and their connection with multiple PFs might serve to integrate PF activity in the neighborhood. Namiki et al. [35] estimated [NO] after stimulation of PFs to be at micromolar levels, whereas Hall and Garthwaite estimated the peak concentration of NO produced by a single bouton to be 15 pM and the tissue concentration of NO to be 50 nM at most following full activation of NOS [59]. This severe discrepancy might reflect a lack of knowledge and understanding of NO kinetics in the brain. The peak concentrations of NO in our simulation lie between Hall's and Namiki's estimates, at a nanomolar level (Figure 2). Because we estimated a production rate of NO (kf[NOS]total) at which NO would mediate the spread of LTD at a realistic size of PF stimulation, our model as a whole yields quantitatively compatible outcomes with experimental findings in terms of AMPAR phosphorylation and LTD.

Sparse PF Activity Codes a Movement in a Context

According to the results of our simulation, synapse specificity requires that every movement in every context be coded by a small number of PFs (Figure 6). This is in agreement with Marr and Albus' postulation that only a small fraction of PFs is active at any given time [53,54], and might at least partly explain the extreme difficulty in demonstrating in vivo PF activity induced by natural stimuli, in spite of recent technical developments in the bioimaging of and electrophysiology of the cerebellar cortex [55,60]. Sparse coding is also suggested in the hippocampus and cerebral cortices [6163] and might be a general principle of information processing in the central nervous system.

The low PF activity estimated in this study does not contradict the high frequency of simple spikes (SSs) at approximately 50–100 Hz [5], because PCs intrinsically generate spontaneous spiking activity even in the