« But what does it all mean? Understanding eye-tracking results (Part 4) | Main | Common email layouts: B2C electronics template »

September 10, 2007

TrackBack

TrackBack URL for this entry:
http://www.typepad.com/services/trackback/6a00d834202a2053ef00e54ed8a3268833

Listed below are links to weblogs that reference But what does it all mean? Understanding eye-tracking results (Part 5):

Comments

Very good initiative to educate people on what information can be obtained through eye tracking - and what cannot be obtained.

I only have one objection: A heat map actually can tell you how long people have been looking at a part of the page - if it is generated using the length of the fixations instead of the number. A heat map should always be accompanied by a legend describing how it was generated, otherwise you can't really draw any conclusions based on it.

Thanks so much for the feedback.

To clarify, a temperature map can be created based on any criteria, and it's imperative that the analyst have a concrete understanding of what the exact criteria is. However, in a "classic" heatmap (for lack of a better word), the aggregation is performed using fixation location and not time. So when most people refer to a "heatmap" within the niche field of eye-tracking in usability and market research, they are usually referring to a result which has no time dimension.

In the past, we created heatmaps based on fixation duration, but didn't find this to be the best method for analyzing time-based data (I'd be very interested in understanding how you've used this kind of graph). We commonly look at average fixation durations and the number of return saccades to page elements with mean and standard deviation calculations. A large number of studies have investigated fixation duration as a measure of interest and ease of comprehension. However, because a heatmap doesn't have an associated error bar, we usually visualize such data in a different way (in a bar or line graph for example). This analysis allows us to test for statistical significance, and avoids confounds which might be present depending on exactly how the time-based heatmap is produced.

I don't really see why fixation duration and count should be analyzed in different ways. You say that fixation duration is best analyzed using for example graphs, but this is also true for fixation count. In my opinion, heat maps should be used as a starting point for analysis and as a way to visualize and motivate conclusions to clients rather than being the foundation of the entire analysis.

Why would the time element require error estimations that the count does not? Doesn't the number of fixations need to be parsed through statistical analysis just like their duration? It is still the same fixations we are talking about, just adding an attribute. Isn't it more relevant to also look at the length of the fixations rather than just giving all of them the same weight in the heat map?

I forgot, I have actually never done any sharp analysis of eye tracking data but I work at Tobii and feel that I have a good understanding of the metrics but want to learn more. All opinions I express are my own, not Tobii's.

Thanks for the ongoing dialog. (I hoped there would be good discussions on analysis when writing these posts.) There are a few different points you've made that I'd like to address, so here we go:

> I don't really see why fixation duration and count should be analyzed in different ways.

I may be misunderstanding (and I apologize if I am), but I think you may be confusing a heatmap with a direct representation of "fixation count". In a "classic" heatmap, there isn't really a direct relationship between how many times an item is fixated by each individual and the temperature read. In this kind of a heatmap, individual user fixations to the same locations are counted only once. In other words, if a user fixates the exact location 1,5, or even 20 times, a standard heatmap calculation will only register that the user fixated the spot once. Then when averaging for the group is done, your result is a plot of where users looked, and not how long and or how often. This is done to avoid the possibility that 1 user could bias the entire heatmap by continually fixating the same page location.

> You say that fixation duration is best analyzed using [...] graphs, but this is also true for fixation count.

Yes, it's true for "number of fixations". (I only used duration as an example in my previous response). For both of these measurements, the most important properties are mean and standard deviation. This analysis allows comparison of different page elements, texts, and the ability to run statistical analysis. Without some measure of variability, it's difficult to draw useful conclusions from this data -- qualitative or quantitative. If you have mean and variability measurements, the plot you choose is really only important to the extent that it helps you clearly interpret and share the results. I suppose that there's a way to add error bars to a heatmap... but I'm sure you can imagine that a surface plot with +/- error bars would be exceptionally difficult to interpret accurately, and even harder to explain.

> In my opinion, heat maps should be used as a starting point for analysis and as a way to visualize and motivate conclusions to clients rather than being the foundation of the entire analysis.

I completely agree, and that's been one of the main points of this series. Analysts should always look beyond the heatmap.

> [....] It is still the same fixations we are talking about, just adding an attribute.

I'm not entirely sure I understand this part of your question. Please elaborate if my comments above haven't addressed this.

> Isn't it more relevant to also look at the length of the fixations rather than just giving all of them the same weight in the heat map?

This really depends on the research question you're asking. Examining fixation length and location are both valid and useful ways of analyzing eye-tracking data. As for heatmap representations... heatmaps which include a time component risk heavy biases if one person decides to stare at the center of the page for a minute or two (as an extreme example), or simply has a harder time understanding or reading something (as a less extreme example). The effect of this kind of outlier is limited in a calculation which gives all users equal weight. There is also a big question about how one would represent peripheral vision in a time-weighted heatmap. Using other analysis tools avoids these problems and will probably get you a much faster answer.

>"In a "classic" heatmap, there isn't really a direct relationship between >how many times an item is fixated by each individual and the temperature >read. In this kind of a heatmap, individual user fixations to the same >locations are counted only once. In other words, if a user fixates the >exact location 1,5, or even 20 times, a standard heatmap calculation will >only register that the user fixated the spot once. Then when averaging for >the group is done, your result is a plot of where users looked, and not how >long and or how often"

Are you referring to the 'count' option in Tobii Studio here?
If so I'm a little confused, as with an average heatmap from 6 participants I still get red heat areas when the number of counts is set to 11 or 12 - If each users fixation was counted only once there would be no red areas for a count higher than 6?

Back to the original discussion...if you set the heatmap style to absolute or relative duration (again, in Tobii Studio), I beleive this does introduce an element of time, because by adjustign the time which the heatmap represents you can change the heat areas dramatically. So by altering the time variable you alter the whole heatmap, therefore the heatmap is dependant on time.

Alex, Tobii Studio does not have what Teresa refers to as a "classic" heat map. The "Count" heat map in Studio is calculated based on the total number of fixations, while both duration settings are based on the duration (relative being normalized and absolute not).

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Working...
Your comment could not be posted. Error type:
Your comment has been saved. Comments are moderated and will not appear until approved by the author. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.

Working...

Post a comment

Comments are moderated, and will not appear until the author has approved them.