Why is interpolation adding too much heat

Question

I am interpolating geo data using qgis. The data is in the form of lat,lon,count. The path of the collected data is shown as a mostly transparent line. The resulting heatmap it produces is shown. The map at label 1 is green, but at label 2 has started to shift to yellow (the colors go from white->green->yellow->orange->red). I dont understand why, it should be all green up there as either no data was collected, or at best it was low measurements. A similar color shift is seen at label 3 and 4. Most of the data collected at the west edge is "low" measurements as seen in the green wedge at label 4. But at label 3, it again is shifting towards yellow. I dont understand why.

The interpolation was created with the following settings:

Interpolation attribute:value, type:points
Method:IDW coefficient:2.0
cols:300, rows:300
(resulting cell size x:0.07934, y:0.01678)

Colors:

0:white
13:Green
25:Yellow
37:Orange
50:Red

I have placed the data file here:datafile

Peter enter image description here

score 2 · Answer 1 · edited Apr 13 '17 at 12:33

I don't know exactly how the QGIS interpolation plugin works. Someone else may be able to give more specifics on the tool itself, but with a little testing and observation I think I know what is causing the result you're seeing. Based on the screenshot of the dialog box I see at the documentation, there's no user input parameter for a search radius with the interpolation tool.

I downloaded your data and tried running it through the ArcGIS IDW tool to see if I got similar results, and I did. I note that I have some additional/different input options available - specifically I can use Power while you have Coefficient, and I have a the ability to put in a search radius. More on Power vs Coeffcient can be found at this other GIS.SE question: How to pick the coefficient in Inverse Distance Weighted method? And I believe this is partially related to what you're seeing.

Then there's the search radius. My default setting is a variable radius with a point count and an option for a maximum distance. If I don't specify one, that means for every output raster cell it's going to go find the closest n (point count) points to get the cell value. My default was 12. To help see what was going on, I symbolized the points on a graduated color scale by value.

overall map

Zooming in to the area of your 1/2 example, you can see there are some higher values on the data points at those edge curves. If QGIS interpolation is working the same way, some of those cells in the upper right are pulling their values from the n nearest points, which are going to be the points on those curves, causing their values to be higher. The close you get to the curve, however, the more zero value points available to interpolate from, which drags the cell value down. In this case having no data points out there is actually increasing the values, because there are no zeros to counteract the higher values that are in range.

upper right

A similar thing occurs are your 3/4 example. Symbolizing your points by value range classification might help you see what's going on. It's a good idea try a couple of different values for things like search radius or coefficient when interpolating and examine the resulting surface. In your case (according to the question linked above), you might try a smaller coefficient, but I don't know that will completely eliminate the effect and it's a bit of a cartographic balancing act.

Also, I note you've used the tag heatmap. There is a heatmap plugin for QGIS (which allows for a search radius and a weight by field), but it's not the same as interpolation. The heatmap plugin looks at a density of points, not interpolating the values of points. You may already understand the distinction, but for completeness and future viewers, I wanted to make sure and point this out. See this related question: How to build effective heat-maps?

Why is interpolation adding too much heat

1 Answers1