Rescorla-Wagner

Fitting simulated data using a grid search

So - we have a model. What does it mean to 'fit' the model to data?

We need to determine the values of the parameters, α and β, that make the model behave as similarly to the real participant as possible.

As a 'sanity check', we will first fit the model to the simulated data,i.e. estimate the parameter values from the observed choices, to see how well we can 'recover' the parameters we put in.

For the simulated data, we know the values of the parameters α and β

In the simulated data, α and β are parameters we set ourselves
In real subjects' data, α and β are unknown

Grid search

There are various ways you could determine the best-fitting values of α and β.

We are going to use a grid search, which means:

We run the model with a whole range of values of α and β

For each pair α,β we work out how well the model predicts the data

This is defined as the likelihood of the parameters given the observed data

We can plot the outcome as a colourful grid!

OPTIONAL: MECHANICS of the GRID SEARCH

Note - this section is conceptually related to the material covered in the first section of the Bayes' tutorial

The aim of the grid search is to find the values of the parameters that maximise the probability of the observed data, given the model that we are fitting.

Let's make that a little bit less abstract. Suppose we have 3 trials. On every trial, the subjects can pick orange or blue.

That means we have the following potential choice sequences:

[BBB]; [BBO]; [BOB]; [BOO]; [OBB];[OBO]; [OOB];[OOO]

For each trial there are 2 options, so there are 2³ = 8 possible sequences of three outcomes.

This means that for chance behaviour each combination has a probability of occurring of 1/8.

This is the probability for a 'chance' model.

What we want of course is that our model that predicts the sequence of choices in our data at above chance level.

Using the update and observation equations, we can predict the probability that the subject chooses Orange (or Blue) given the Value of Orange and Blue, and the parameters alpha and beta, on each trial.

For a sequence of choices, we get a sequence of single-trial probabilities for choosing Orange (or Blue), which are multiplied together to give a probability for the whole sequence:

This quantity p(data|model) is also known as the likelihood of the data.

Now we need to determine the parameter values that maximise the likelihood of the data.

We will use a very simple approach, called a grid search. The advantage of a grid search is that it's conceptually easy and easy to implement, but it's computationally expensive, particularly if you have many parameters. Luckily for us, we only have 2.

To do a grid search of the parameter space, we define a range of parameter values for each parameter, and compute for values within this range the probability of the observed responses. Then we check which set of values explains the data best, and declare that the winner.

So first we need to decide what range of parameter values we want to use. Can you think of something sensible, based on the playing around with the simulations that you've done above?

?

α, the learning rate

lower bound of [0], which means that there is always some learning

upper bound of [1], which means that the maximum it can learn is the value of the reward itself.

β, the softmax temperature

lower bound of [0], so that the bias is always in favour of the option with the higher value

upper bound of [15], which is fairly liberal and would make for quite deterministic choices (you can simulate this!).

Note about using the grid search approach

Running a grid search
So we will leave these complexities for now and set up the grid search:
go into the Matlab file RLtutorial_main.m, and set:

subjects = 1; simulate = true; fitData = true; plotIndividual = true; if simulate simpars.alpha = .25; simpars.beta = 4; end if fitData bounds = [0 1; % alpha 0 15]; % beta nBin = [20 25]; end

... save, and run!

This gives you a plot of the likelihood for all of the bins in a 2D plot.