Measuring Visual Attention

This project is involved with estimating visual attention and its derivatives to better understand consumer behavior.


For a static scene the only type of visual processes taking place in observer can be described as a sequence of fixations and saccades. These can be computed from an array of gazepoints collected by the eye-tracking hardware. (Salvucci and Goldberg) To detect fixation points we used a robust and accurate algorithm based on dispersion threshold identification - I-DT. (see Widdel, Wedel and Pieters). Our I-DT fixation detection method described above yeilds mean fixation durations of ~307 ms which is consistent with the previous research (Hooge and Erkelens). Typical estimates of attentional shift time in visual search experiments are on the order of 50 ms (Wolfe, 1998).

In our visual attention study we use eye-tracking results as an objective measure of visual attention. These are further used as a training set in a supervised learning process that improved the precision of our predictive models.

Our predictive model uses low level saliency indicators (color, texture) with a combination of hi-level saliency driving factors for targeted search tasks. These features are contracted based on the shape and appearnce of the target search object or a group of objects. Combining top-down and bottom-up saliency can predict the location of up to 70-80 percent of the fixations correctly.

Related work:

My Visual Attention Bibliography (in separate window)