(C) PLOS One
This story was originally published by PLOS One and is unaltered.
. . . . . . . . . .



A unitary mechanism underlies adaptation to both local and global environmental statistics in time perception [1]

['Tianhe Wang', 'School Of Psychological', 'Cognitive Sciences', 'Peking University', 'Beijing', 'Department Of Psychology', 'Helen Wills Neuroscience Institute', 'University Of California Berkeley', 'Berkeley', 'California']

Date: 2023-05

Our duration estimation flexibly adapts to the statistical properties of the temporal context. Humans and non-human species exhibit a perceptual bias towards the mean of durations previously observed as well as serial dependence, a perceptual bias towards the duration of recently processed events. Here we asked whether those two phenomena arise from a unitary mechanism or reflect the operation of two distinct systems that adapt separately to the global and local statistics of the environment. We employed a set of duration reproduction tasks in which the target duration was sampled from distributions with different variances and means. The central tendency and serial dependence biases were jointly modulated by the range and the variance of the prior, and these effects were well-captured by a unitary mechanism model in which temporal expectancies are updated after each trial based on perceptual observations. Alternative models that assume separate mechanisms for global and local contextual effects failed to capture the empirical results.

Our perceptual system can actively adapt to the statistical properties of the environment in multiple time scales. For example, the perceived duration of an event is biased by the mean duration of events observed in a relatively long period and also by the durations of recently processed events. Here we ask whether these two effects reflect the operation of separate mechanisms or a unitary mechanism. We develop a series of computational models of independent and unitary mechanisms, and use experimental manipulations that generate predictions which allow us to evaluate the models. We show that serial dependence, the signature of short-term adaptation, is modulated by the long-term context. The results are consistent with the predictions of a unitary mechanism model which assumes the global prior is updated in a trial-by-trial manner. The alternative models that assume separate mechanisms fail to capture the empirical results. These results provide a comprehensive picture of how the timing system jointly adapts to short- and long-term environmental statistics.

To arbitrate between these hypotheses, we used a temporal reproduction task [ 6 , 28 – 30 ] in which participants reproduce an interval specified by a visual stimulus. Across conditions, we manipulated the global distribution by sampling the target durations from different temporal distributions. If serial dependence and central tendency arise from a shared mechanism, the magnitude of the serial dependence effect would be impacted by the global temporal distribution. Alternatively, if serial dependence and central tendency arise from distinct mechanisms, the magnitude of the serial dependence effect would remain invariant across a wide range of global temporal distributions. We formalized these hypotheses and compared the predictions of these computational models with the empirical results. By combining our behavioral experiments and model-based analyses, we sought to unravel the computational mechanisms underlying the influence of context on temporal perception.

Although the central tendency and serial dependence effects describe two ways in which context can influence behavior, it remains unclear whether they reflect the operation of a unitary process or two separate, adaptive processes. From a unitary view, the perceptual system continuously updates the global prior based on new observations, and the trial-by-trial updating of the global prior could influence the subsequent perception, giving rise to serial dependence [ 10 , 11 , 27 ]. Alternatively, there may be two adaptive systems that operate on different timescales in response to environmental statistics, building up global and local priors that give rise to central tendency and serial dependence, respectively.

Central tendency and serial dependence effects have been observed across a wide range of perceptual tasks [ 17 – 22 ], indicating that they reflect general principles of how the perception system adapts to the statistics of the environment. Both phenomena can be explained under a Bayesian framework [ 23 ]. On the one hand, the observers appear to construct a relatively stable global prior that reflects the distribution of the stimulus set [ 6 , 24 , 25 ]. Following Bayesian integration, the current perception is biased toward the mean of the global prior, the central tendency effect. On the other hand, the observer also appears to build a temporal expectancy based on the most recent stimulus, inducing a bias in judging the duration of the current stimulus towards recently experienced stimuli, the serial dependence effect [ 26 ].

It has also been shown that temporal perception can adapt on a rapid timescale. Participants’ perception of the current duration is attracted toward the duration of the previous stimulus [ 10 – 12 ]. That is, the duration is perceived to be longer after a relatively long duration stimulus compared to a relatively short duration stimulus [ 13 – 16 ]. This phenomenon, known as serial dependence, suggests that the perceptual system is also sensitive to the local statistics of the environment [ 17 , 18 ].

The internal representation of temporal information is essential for a wide range of cognitive functions, from anticipating future events to controlling movements [ 1 – 3 ]. To improve the precision of temporal perception, the timing system flexibly adapts to the statistical properties of the current context [ 4 , 5 ]. For example, when presented with a set of durations in a perception task, participants have a strong tendency to overestimate relatively short durations and underestimate relatively long durations. Thus, the perceived duration is biased toward the mean of the set [ 4 , 6 – 9 ], a phenomenon known as the “central tendency effect.” This effect indicates that temporal perception is sensitive to the global temporal context.

Taken together, the results from Experiments 3 and 4 are consistent with the predictions of the MGU model, and at odds with the predictions of the DP model. Manipulations of the distribution properties of the test set indicate that a unitary process gives rise to both the central tendency and serial dependence effects.

Consistent with the prediction of the MGU, the serial dependence effect was modulated by the variance of the test set. For the trial n-1 data, the best-fitted DoG in the bimodal condition (ΔAIC n = -78.6 ± 5.6; ΔAIC l = -18.3 ± 3.6) yielded the highest and broadest serial dependence function of the three conditions ( Fig 7E ); the short-900 condition (short-900: ΔAIC n = -48.3 ± 7.8) showed the lowest and narrowest function ( Fig 7D ). Statistically, the half-amplitudes of the functions increased as the variance of the prior increased (short-900: 8.7 ± 0.6 ms; medium: 11.8 ± 0.9 ms; bimodal: 18.7 ± 0.8ms, Zs>4.2, ps<0.001, Fig 7F ). In the analysis of the individual functions, the serial dependence effect was found in both trial n-1 and trial n-2 in the bimodal condition, similar to what was found in the medium condition. In the short-900 condition, serial dependence effect was only significant from trial n-1 ( S6 Fig ). In sum, the results provide additional evidence that serial dependence is enhanced in higher variance test sets.

(a) Illustration of the distributions of target durations in experiment 4. The short-900 condition (purple) has the same mean as the medium condition (turquoise). The bimodal (orange) has the same range as the medium condition but has larger variability. (b-c) Predicted half-amplitudes of the DoGs for the different conditions by the MGU and the DP models. (d-e) Serial dependence effect for trial n-1 of the short-900 condition (d) and bimodal condition (e). Note the scale for the x-axis is different in d and e (and thus, the function for the medium condition looks different). Each filled dot represents a participant. The open circles indicate the average across participants. The purple, orange, and turquoise lines represent the best-fitted DoG for the short-900, the bimodal-medium, and the medium conditions, respectively. Each filled dot represents one participant. Error bars represent standard error. (f) Half-amplitude of the best-fitted DoG for the trial n-1 data in Experiments 1, 2, and 4. Each filled dot represents an estimate from jackknife resampling. Error bars represent standard error.

The MGU and DP models also make differential predictions when the variance of the target distribution is manipulated ( Fig 7A ). In Experiment 4, we set the mean of the target set distribution to 900 ms (the medium condition of Experiment 3). In one condition, we created a set with low variance (720-1080ms, short-900) compared to that used in experiment 3, where the variance was larger (540-1260ms). In the second condition, we used a bimodal distribution to increase variance. The GMU predicts that the serial dependence effect will be enhanced as the variance of the target distribution is increased ( Fig 7B ). The logic here is similar to that described above in terms of the range of the distribution: Because a low variance test set, by definition, is more concentrated, the effect of trial-by-trial updating of the prior will be smaller relative to when the variance is high. In contrast, the DP predicts that the variance of the test set will have little effect on serial dependence ( Fig 7C ).

We note that in the original version of the DP model, the local prior is decided by the posterior of the 1-back trial. Here we implemented a more general version of the DP model, the Dual-prior short-term memory (DPSM) model, in which the updating of the local prior after each trial incorporates information from previous trials with diminishing weight. The DPSM model generates a serial dependence from the 2-back trial ( S5 Fig ). However, the model predicts a near-linear serial dependence function, a prediction at odds with the data. Moreover, for the 1-back serial dependence function, the DPSM model predicts the direction of the serial dependence will flip when the current and previous stimuli are distinct from each other; this prediction is also not consistent with the data. In sum, adding a more graded working memory component to the dual prior model cannot account for the results of our experiments.

The key question in Experiment 3 centers on the trial n-2 data: Will the increase in the magnitude of the serial dependence effect from trial n-1 be accompanied by a stronger serial dependence effect from non-adjacent trials? In the analysis of the individual functions, there was a significant positive instantaneous slope for the trial n-2 data in both the medium and long conditions ( Fig 6B ). Trial n-3 also showed a tendency for a positive bias, although this did not reach significance. Moreover, fitting the DoG at the group level showed a significant positive half-amplitude for the serial dependence function from trial n-2 ( Fig 6C ). Thus, the results are consistent with the prediction of the MGU model and fail to support the DP model ( Fig 6A ). Moreover, the MGU is consistent with the central tendency effect in all conditions (short, medium, long), further supporting the idea that the process that produces the serial dependence effect uses local information to update the prior ( S1F Fig ).

Given the predictions of the two models, we extended the range of the target durations in experiment 3. We applied two test sets, one ranging from 520-1260ms (Medium range) and the other from 560-1860ms (Large range, Fig 5A ). We compared the results with those obtained in Experiment 1, in which the target durations ranged from 520-880ms (Short range). Extending the range of target duration successfully enhanced the serial dependence effect from trial n-1. The best-fitted DoG of the medium condition (ΔAIC n = -27.5 ± 4.2; ΔAIC l = -10.8 ± 2.2) and long condition (ΔAIC n = -20.7 ± 4.4; ΔAIC l = -10.0 ± 2.1) had a higher peak and was broader than that for the short condition ( Fig 5H ). Correspondingly, the half-amplitude increased as the distribution became wider ( Fig 5I ; medium: 11.8 ± 0.9 ms; long: 22.6 ± 2.0 ms; Zs > 4.9, ps < 0.001). Similarly, the half-widths also increased as the distribution became wider (Zs > 10.5, ps < 0.001), indicating an influence from more distant productions on the previous trial when the test set range increased.

(a) Simulation of the half-amplitude with the MGU model (left) and the DP model(right). The upper two panels depict simulations for the long-range condition, and the lower two panels depict simulations for the medium-range condition. Simulations of the MGU model can produce robust serial dependence effects from trial n-2, whereas the DP model fails to predict this effect. (b) Instantaneous slope of the DoG-fitting curve for trials n-1, n-2, n-3, and n+1 (control condition) in the medium and long conditions. The p-values are with respect to the difference from zero. (c) Half-amplitude of the best-fitted DoG for the n-1 and n-2 trials in the medium (left) and long (right) conditions.

(a) Distributions of target durations in different contexts. (b-c) Simulations of the MGU model predict that the half-amplitude of the serial dependence effect will increase as the range of target duration is increased. (d) In the MGU model, changing the range of the target durations will impact the width of the prior and, therefore, the serial dependence effect. (e) In the PD model, since the scalar property in time perception such that the ratio between the SD and mean is a constant, the likelihood will become flatter as the target duration increases. (f-g) Simulations of the DP model also predict that the half-amplitude of serial dependence increases substantially as the range of target duration is increased. (h) Serial dependence effects for trial n-1 in the medium (left) and long condition (right). Filled dots represent individual participants. Blank circles represent the average of all participants. The turquoise and blue lines represent the best-fitted DoG in the medium and long conditions, respectively. The red and green dash lines represent the best-fitted DoG in Experiments 1 and 2, respectively. (i) Half-amplitude of the best-fitted DoG for serial dependence effect from trial n-1 in Experiments 1, 2, and 3 (medium and long conditions). Each filled dot represents an estimate from jackknife resampling. Error bars represent standard error.

To address this discrepancy, we sought to enhance the serial dependence effect and examine whether it will be manifest beyond trial n-1. The MGU and DP models both predict that extending the range of the durations in the test set will enhance the trial n-1 serial dependence effect ( Fig 5 ). In the DP, sensory noise scales with duration, a form of Weber’s law. Because of this, the likelihood becomes relatively flat when the range is increased ( Fig 5E ). This will result in a greater influence of the previous production and, thus, a strengthening of the serial dependence effect ( Fig 5F–5G ). Sensory noise also influences the serial dependence effect in a similar way in the MGU. In addition, the MGU model postulates that the observer builds a concentrated prior when the range is limited ( Fig 5D right). As such, the prior is resistant to updating, yielding a weak serial dependence effect. When the range is expanded, the prior becomes more distributed ( Fig 5D left), resulting in a more pronounced local change after each update, and thus, a stronger serial dependence effect ( Fig 5B and 5C ). Importantly, as noted above, the MGU and DP models generate very different predictions regarding concerning a trial n-2 serial dependence effect ( Fig 6A ): The MGU predicts that the serial dependence effect should be observed from trial n-2, with a half-amplitude slightly attenuated relative to trial n-1, whereas the DP predicts almost no serial dependence effect from trial n-2.

A key difference between the MGU and DP models is that the latter assumes the local prior is mainly determined by the duration of trial n-1; as such, a serial dependence effect from the n-2 trial will be very weak. In contrast, the MGU predicts a robust serial dependence effect from trial n-2, given that all (recent) observations are integrated into the prior. In experiment 1, the group level analysis indicated a positive serial dependence effect from the n-2 trial. However, this effect was not observed in the individual analyses of the instantaneous slope values.

In sum, our computational analyses point to two candidate models that can account for the observed central tendency and serial dependence effects. These two models diverge in that the MGU model postulates that central tendency and serial dependence effects arise from a unitary mechanism (non-linear updates to a global prior). In contrast, the DP model postulates that these two effects arise from separate mechanisms (a fixed global prior paired with updates to a local prior). In the following sections, we sought to arbitrate between the MGU and the DP models.

In contrast to the unitary mechanism models described above, an alternative way to produce both serial dependence and central tendency effect is to consider a hybrid model in which the two biases arise from two distinct processes (Dual Priors model, DP). Specifically, the DP model assumes that the current stimulus (likelihood) is integrated with a static global prior ( Fig 3A ), generating central tendency. The posterior is then integrated with a local prior ( Fig 3F ), inducing non-linear serial dependence. As such, the Dual Priors model can predict both the central tendency and non-linear serial dependence effects observed in the data ( Fig 4A–4C ). Note that these two effects are generated by two mechanisms that adapt separately to the global and local temporal context.

To get a non-linear serial dependence function, the prior can be updated in a non-Bayesian manner. We assume that the brain creates the prior distribution by summing multiple Gaussian distributions with different weights (Mixed Gaussian Updating model, MGU). When perceiving a new duration, the prior is updated by adding a Gaussian centered at the reproduced duration to the old prior, followed by normalization ( Fig 3C ). This kind of computation has previously been suggested to account for how Purkinje cells in the cerebellum adapt to the prior for representing temporal information [ 30 ]. Since the global prior is updated locally, the attraction effect from trial n-1 decreases when the duration of the current stimulus is far from the duration of the n-1 reproduction ( Fig 4D ). Thus, this model predicts a non-linear serial dependence function and serial dependence in accordance with the empirical results in experiments 1–2 ( Fig 4A–4C ).

We then considered models capable of simultaneously generating serial dependence and central tendency effects. First, we considered a unitary mechanism model in which the global prior is updated across trials (Bayes-Updating model, BU). Given that serial dependence is driven by temporal reproduction (Exp 2), we assume that, following Bayes rule, the prior is updated by integrating a Gaussian centered at the reproduced duration. Since the priors and posteriors are both Gaussians, the Bayesian integration can be simplified into a linear weighted sum of the means (i.e., a Kalman filter, Fig 3D ). This model is mathematically similar to what has been previously described as an internal reference model [ 10 , 11 , 26 , 34 , 35 ]. While this model predicts a central tendency effect and an attractive serial dependence effect, the predicted serial dependence function is near-linear ( Fig 4B ), inconsistent with the empirical results of Exps 1–2.

We next considered a unitary mechanism Bayesian Interference model that assumes that the observer integrates the likelihood and a local prior based solely on the local context (Local-Only model, Fig 3E and 3F ). The weights of the local prior and likelihood are determined by the difference between distributions associated with the current sample and recently experienced samples. Specifically, the observer will rely less on the local prior when it is far away from the likelihood and more on the local prior when it is close to the likelihood. The Local-Only model can produce a non-linear serial dependence effect ( Fig 4B ). However, this model cannot simultaneously capture the observed serial dependence and central tendency effects (Figs 4A–4C and S4 ). Parameter values that generate the observed serial dependence effect produce a smaller than observed central tendency effect. Parameter values that generate the observed central tendency effect produce an unreasonably large serial dependence effect. Thus, a local prior by itself is not sufficient to explain adaptative behavior in the current experiments, indicating that the central tendency effect is not a by-product of serial dependence.

Predicted central tendency (a) and serial dependence (b) effects for the five models. (c) Predicted relationship between central tendency and serial dependence for the models. Each line is the prediction of one model, and each dot is the prediction with a specific learning rate parameter. The half-amplitude increases with the learning rate. The blue “+” represents the empirical data. The length of the bars indicates SE. (d) Illustration of how the MGU model generates a non-linear serial dependence. For illustration purposes, we depict the n-1 prior as a broad, uniform distribution ranging from 500-1900ms, the range used for the long condition in Exp 3. When the current and previous durations are close (left), the previous duration can influence the shape of the prior in the range around the current duration and make the current posterior shift more relative to the likelihood. In this region, the bias of the posterior increases with the distance between two successive stimuli. However, when the current and previous duration are distinct (Right), the previous duration cannot influence the shape of the prior around the likelihood, and thus, the attraction effect decreases.

(a) In all of the models, the target duration (green) is represented as a normal distribution (likelihood) centered at the target duration with perceptual noise. The global prior (blue) is based on the distribution of the target stimuli. The posterior (yellow) is obtained by multiplying the global prior by the likelihood. (b) The mean of the posterior is computed as a single-value estimate. The reproduced duration is sampled from a normal distribution centered on the estimate with motor noise. (c-d) In the BU and MGU models, the observer updates the prior based on the reproduced duration after the motor response. (c) For the MGU model, a scaled normal distribution centered at the reproduced duration is added to the old prior, and the new prior is normalized. (d) For the BU model, the prior is updated with a Kalman filter. (e) For the Local-Only model, the likelihood is integrated with the local prior in a non-linear manner, with the weight on the local prior increasing when the likelihood and local prior are similar and decreasing when they are dissimilar. In the DP model, the observer is assumed to hold two priors, one global and one local. After the posterior is computed based on the global prior and likelihood, the posterior is integrated with the local prior to generate a second posterior (posterior’). (f) In the second step of the DP and Local-Only models, the new local prior is fully determined by the motor response of the current trial. As with the other three models, the reproduced duration is drawn from a normal distribution centered at the mean of the posterior with motor noise.

To explain the central tendency effect, we begin with a Bayesian-least-square model [ 6 ] that assumes the observer’s global temporal expectancy is static (i.e., global prior), with bias constrained only by the global distribution of temporal stimuli (Global-Only model, Fig 3A and 3B ). The observer’s temporal estimation on a given trial is the Bayesian integration of the global prior with the actual target duration (corresponding to the likelihood in Bayesian theory). As such, the estimated duration of each trial is biased towards the global prior, giving rise to the central tendency effect ( Fig 4A ). However, given that the Global-Only model uses a fixed prior, the observer’s estimate will not be influenced by the local context (e.g., recently experienced stimuli). That is, the basic Bayesian model cannot account for the serial dependence effect ( Fig 4B ).

Having established robust signatures of the central tendency and serial dependence effects, we now ask whether these two effects are generated by a unitary mechanism or reflect separate mechanisms that are shaped by global and local statistics, respectively. To test this, we formalized our intuitions into a series of computational models.

In trials immediately following Go trials, we replicated the serial dependence effect observed in experiment 1 (instantaneous slope: 0.114 ± 0.042; Wilcoxon test: z = 2.03, p = 0.021; Fig 2E ) with a half-amplitude of 7.4 ± 0.9ms ( Fig 2F ) and half-width of 111.0 ± 15.6ms ( S3A Fig ). In contrast, we did not find a serial dependence effect from the previous stimulus following a No-Go trial (instantaneous slope: 0.006 ± 0.049; Wilcoxon test: z = 0.01, p = 0.97; Figs 2E and S3B ). This dissociation supports the idea that serial dependence in timing largely arises from factors associated with temporal reproduction. We recognize that this may reflect how reproduction is influenced by attention, decision making, memory, or the response itself. Consistent with this notion, when we examined trial triplets composed of Go-NoGo-Go trials, we found a serial dependence effect on the second Go trial towards the first Go trial (rather than the intervening NoGo trial; instantaneous slope, 0.080 ± 0.034, Wilcoxon test: z = 2.25, p = 0.012; Fig 2E–2F ; S3C Fig ).

In Experiment 2, we asked whether the serial dependence effect originates from temporal perception or temporal reproduction. To test this, we included No-Go trials in which participants were instructed not to respond. Temporal reproductions following Go trials could reflect biases arising from processes associated only with perception, only with motor production, or both. In contrast, temporal reproductions following No-Go trials should only be influenced by a perceptual bias. Importantly, the Go/No-Go signal was only presented after the target duration, ensuring that participants encoded the target duration on both Go and No-Go trials ( Fig 1B ).

This non-linear function is well captured by a derivative of Gaussian (DoG) curve [ 17 , 32 ]. To quantify how previous stimuli influence the current perception in experiment 1, we fitted a DoG to the function of the difference between the target duration in trial n and trial 1-back, 2-back, or 3-back, respectively. We used the instantaneous slope at zero to measure the sign of serial dependence (see Fig 2B ). Positive instantaneous slopes indicate that the current perception is attracted towards the previous stimulus duration; negative slopes indicate that the current perception is repelled from the previous stimulus duration. We found a positive slope when calculating serial dependence based on trial n-1 (0.092 ± 0.094, t(11) = 3.39, p = 0.006; Fig 2D ), but not when the calculation was based on trial n-2 (0.009 ± 0.079, t(11) = 0.39, p = 0.70) or n-3 (0.028 ± 0.095, t(11) = 1.00, p = 0.34). To better estimate the magnitude of the serial dependence effect in response to the previous stimuli, we fit a DoG at the group level. The DoG provided a good fit ( Fig 2B ), outperforming the null-model (ΔAIC n = -46.3 ± 7.7) and linear-model (ΔAIC l = -18.6 ± 4.6) for the trial n-1 function. The half-amplitude of this function is 7.6 ± 0.6ms, and the half-width is 95.8 ± 10.1ms ( Fig 2C ). When analyzed at the group level, we also get a significant serial dependence effect from trial n-2 (2.4 ± 0.8 ms, z = 3.00, p = 0.001), but not from trial n-3 ( Fig 2C ), indicating there might be a weak attractive effect that is not evident in the instantaneous slopes calculated at the individual level. We return to this issue below.

We found that the deviation is biased towards the previous stimulus ( Fig 2B ): When the target duration on trial n-1 was longer than the stimulus on trial n, the reproduced duration tended to be longer than average, and vice-versa. This indicates that the reproduction on the current trial is attracted towards the stimulus duration (or reproduced duration) of the previous trial. Notably, the shape of serial dependence is non-linear: The attraction effect peaks when the current stimulus differs from the previous stimulus by approximately 100ms and then falls off when the difference grows larger.

Experiment 1: (a) Reproduced durations of a representative participant. Grey dots represent the reproduced duration of each trial. The blue circle represents the participant’s mean reproduced duration for each target duration. (b) Serial dependence is evident in the non-linear relationship between the deviation index, the current reproduced duration minus the mean reproduced duration, as a function of the temporal difference between the previous target duration and the current target duration (positive values indicate the previous target was longer). Filled dots denote individual participants. Empty circles denote the mean of all 12 participants. The red solid line represents the best-fitted Derivative of Gaussian (DoG) model at the group level. (c) Half-amplitude of the best-fitted DoG in Experiment 1. Error bars represent an estimate of the standard error obtained from jackknife resampling. (d) Instantaneous slope obtained from the fitted DoG curve for trials n-1, n-2, n-3, and n+1 trials. Experiment 2: (e) Instantaneous slope obtained from the fitted DoG curve with respect to n-1 Go, n-1 no-Go trials, n-2 Go trial (when n-1 is no-Go trials), and n+1 (control condition). (f) Half-amplitude of the best-fitted DoG for trials n-1 and n-2.

The reproduced durations exhibited robust regression towards the mean ( Fig 2A ), replicating the central tendency effect seen in previous studies [ 10 , 29 , 31 ]. Quantitatively, the slope of the reproduced duration to the target duration was significantly smaller than one (0.65 ± 0.20; t(11) = -6.08; p<0.001, S1A Fig ). To examine serial dependence, we first calculated a “deviation” index, the difference between the reproduction of a target duration on a given trial and the individual’s mean reproduction to that target duration across all trials (see Fig 2A ). The serial dependence effect is calculated by the change in the deviation index as a function of the difference between the previous and the current target durations. This method minimizes potential artifacts in the serial dependence function that may be induced by regression to the mean and reproduction biases [ 32 , 33 ].

(a) The stimulus sequence used in Experiment 1 (“ready-set-go” task). Two 100-ms stimuli flashed in sequence, signifying first a "Ready" signal and then a "Set" signal; the target duration was the interval between the onset times of the "Ready" and "Set" signals. Participants were instructed to press the space bar to reproduce the temporal interval after the “Set” signal. Performance feedback was conveyed for 50 ms via the color of the fixation cross (green = correct; red = incorrect). (b) The stimulus sequence used in Experiments 2–4. We used a duration reproduction task including both go and no-Go trials. A ripple-shaped stimulus was presented for a fixed duration denoting the temporal interval. After a 300 ms interval, the fixation point became either a "+" sign or “x” sign, signaling either a “go” or “no-go” trial, respectively. In Go trials, participants reproduced the temporal interval by holding down a key. When the key was released, the fixation point turned to a grey circle signaling the end of the trial. In the no-Go trial, participants were asked to withhold any movement and fixate until the "x" switched to a grey circle after 700ms.

We begin this project by examining serial dependence and central tendency simultaneously in a widely employed temporal reproduction task, the “ready-set-go” task [ 6 ]. Participants observed a pair of visual events that defined a target interval (“Ready” and “Set”) and then made a single button press (“Go”), attempting to produce an interval between the Set and Go signals that reproduced the target interval ( Fig 1A ). The target durations were randomly sampled from a uniform distribution that ranged from 500 to 900ms.

Discussion

The internal representation of duration is context-dependent. Previous work has identified two sources of contextual bias, a central tendency bias in which the temporal representation is attracted towards the mean of a global prior, and a serial dependence effect in which recently experienced durations serve as attractors on the current representation of a stimulus duration [6, 19, 24, 27, 31]. In the current study, we asked whether the central tendency and serial dependence effects arise from a common mechanism or separate mechanisms.

We performed a series of temporal reproduction tasks in which we manipulated the distribution of the target duration. We observed central tendency and serial dependence effects in all of the experiments. Although many studies have shown a central tendency in duration perception[6, 24], the properties of a short-term contextual effect, serial dependency, is less clear. The designs and analyses used in previous work have precluded the analysis of the whole serial dependence function[11, 27]. In the current study, we consistently observed a non-linear serial dependence effect in temporal reproduction. By using a Go/No-Go task[32], we found that the effect was dependent on movement reproduction: There was an attraction effect from the previous target duration after a Go trial, but no observable effect after a No-Go trial.

To determine if long- and short-term contextual effects arise from a single or distinct Bayesian processes, we compared five computational models. We first examined two unitary models, one with a static global prior established during the initial phase of the experiment[6], and one with a local prior dictated by recent experience. Each unitary model could not account for both the observed central tendency and serial dependence effects. To generate both effects, we considered three more complex models. In one of these, we assumed that there were two separate mechanisms, one based on a global prior and one based on a local prior. For the other two models, a unitary mechanism included a process by which the global prior was continuously updated by recent experience. We rejected the unitary model in which the prior is updated in a Bayesian optimal manner since it generated a near-linear serial dependence function. In contrast, simulations of the MGU unitary and DP models yield central tendency and non-linear serial dependence effects.

To further explore the viability of the MGU and DP models, we considered predictions of the two models when we manipulated either the range or variance of the test stimuli. While both models predict that the amplitude of the serial dependence effect from the preceding trial will increase with the range of the test stimuli, only the MGU model predicts that this manipulation will also enhance the serial dependence effect from earlier trials (e.g., n-2). Furthermore, the MGU model predicts that the serial dependence effect will be enhanced when the variance of the test set is increased, even when the mean duration is fixed, whereas the DP model predicts that the variance of the prior should have negligible influence on serial dependence. The results showed that, when extending the range of the test set, the serial dependence effect was enhanced and evident in terms of the context established from the n-2 as well as the n-1 stimulus. Moreover, even we included a short-term memory component to the DP model which allows the local prior to be decided by multiple recent trials, it still fail to generate a DoG-shaped serial dependence function for the 2-back trial as what has been observed in the data. Similarly, the serial dependence effect was enhanced when increasing the variance of the test set. Together, these results provide strong support for the MGU model, indicating that the global bias and serial dependence effects arise from a unitary mechanism in which a single global prior is continuously updated after each trial by the reproduced stimulus duration.

We have presented evidence that a single prior which is dynamically updated across trials is sufficient to explain both the serial dependence and central tendency effects, as well as how they are jointly modulated by the distribution of the test set. Simple dual-prior models with a fixed global prior accompanied by a Gaussian-shaped or single-value prior determined by the most recent experience (1-back trial or short-term memory) cannot account for the behavioral results. Nonetheless, we cannot rule out that the pattern of results reported across the experiments can be captured by alternative multiple process models. One such possibility is a model that incorporates multiple processes, each adhering to the MGU model, but exhibiting distinct learning rates. However, on the grounds of parsimony, the single process MGU model provides a comprehensive account of the data, setting a point of comparison for future research.

The nonlinearity of the serial dependence function sheds light on how the prior is updated based on recent experience. Updating could follow a simple Bayesian optimal integration rule with the prior represented as a Gaussian. While this model generates a serial dependence effect, the shape of the function is approximately linear (BU, see Fig 4B). To capture a non-linear serial dependence function, we assumed that the prior is updated by increasing the weight of the Gaussian centered at the recently reproduced duration. One benefit of representing the prior with a Gaussian mixture model is that it provides a way to represent the effects of any test set, regardless of its distributional shape. Indeed, previous studies have also shown that a mixture of Gaussians provides a good fit of the internal prior in duration perception experiments [30, 36]. Here we show that an updating rule based on this assumption can account for the non-linear serial dependence function as well as how the function will shift when the variance of the test set is manipulated.

As noted above, the serial dependence effect was contingent on the participant having produced a response; it was markedly attenuated after No-Go trials. In theory, it is possible that this local effect is reflective of some sort of motor memory, where the current motor reproduction is directly biased by the previous motor reproduction, with the perceived duration unbiased. However, we think it is more likely that the serial dependence effect is mediated by perception: A prior of temporal expectation is constructed from the reproduced duration, and this prior biases the perception of subsequent target stimuli, which in turn, will be reflected in the next reproduced duration. In support of this idea, several studies have shown that making a movement of a variable duration influences performance on a subsequent duration comparison task in which no motor reproduction was made [12, 37, 38].

We note that serial dependence effects have been observed in previous studies that did not involve motor responses, including judgments of duration [13–16] or visual orientation or position [32, 33, 39].Thus, we do not claim that the absence of a serial dependence effect in the No-Go condition of Exp 2 should be taken to mean that movement is a necessary prerequisite. As with all null results, caution is warranted. It is possible that the design used in Exp 2 may have been insensitive to capture a perceptual contribution to the serial dependence effect. Alternatively, the absence of a perceptual may be specific to the duration reproduction task used in our experiments. Indeed, there is evidence from duration comparison tasks without reproduction of an attractive sequential effect [11, 40]. Follow-up experiments are needed to understand the factors that determine the weight given to perception and production in forming the prior.

We recognize that, in evaluating the five models, we assumed that a global prior was established during the initial phase of the experiment. Obviously, establishing a global prior requires some integration of the local context as the participant becomes familiar with the stimulus set. We set our initial “training” phase based on previous studies, which have shown that participants are able to generate a relatively accurate global prior of the temporal context after about 100 trials [21]. One open question is how the prior updating rate changes as a function of training. In our dynamic models, we assumed that the rate remained constant over the course of the experiment. However, it is possible that the rate of updating weakens as the prior becomes more established. Correspondingly, the serial dependence effect might become weaker over time. However, this temporal expectation system might be a rigid system that recalibrates the environment with an invariant learning rate. This characterization conforms with a conceptualization of the operation of the cerebellum in sensorimotor adaptation, and the same principles might apply to duration representation, another function associated with the cerebellum [41, 42]. The data set in the current experiments are insufficient to examine the dynamics of updating, and we see this as an important issue to be addressed in future studies.

In summary, the current study provides new insights into how context influences our sense of duration. The perception of the duration of a stimulus is sensitive to both the global distribution of the stimulus set as well as recent experience. Importantly, by examining the central tendency and serial dependence effects in a joint manner, we observed that these two forms of bias are best explained by a unitary mechanism in which a global prior is updated in an iterative manner based on each observation. Given that central tendency and serial dependence effects are ubiquitous in perception tasks [6, 18, 19, 31, 39, 43], the common Bayesian framework developed here may provide a general account of how perceptual systems adapt to environmental statistics.

[END]
---
[1] Url: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1011116

Published and (C) by PLOS One
Content appears here under this condition or license: Creative Commons - Attribution BY 4.0.

via Magical.Fish Gopher News Feeds:
gopher://magical.fish/1/feeds/news/plosone/