Wednesday, August 2, 2017

Clean, but shorter, Dexcom G4 (505) Freestyle Libre comparison

Since my previous post has triggered a few private reactions. Here’s another comparison on a fairly standard situation, with clean data: clocks are in perfect synchronisation, there are climbs (pre-game carb loading) and falls, including a severe low (delayed hypo).

On the left, the data as downloaded. On the right, the data shifted for the best correlation (which basically means that the Dexcom data is rolled back in time to erase the delay). That post-mortem analysis is both realistic and a bit unfair to the Dexcom. Realistic because the Libre raw data matches historical data quite well. A bit unfair because the Libre only provides delayed and adjusted historical data. Adjusted relative to what? The spot checks. As I have shown many times on this blog, spot checks are typically even faster than the Dexcom in practice, with the drawback that they are really inaccurate at times, especially on the high side.


In this case, the best correlation is found with a shift of 5-6 minutes (Libre ahead of the Dexcom by 5-6 minutes). This is fairly typical of what we see with the Libre vs the 505, when everything works well for both sensors. That’s the tricky part in practice of course: adhesion issues, desynchronisation between insertions (ie comparing a fresh Dexcom to a Libre in its second week) all play a role.

Broadly speaking, the sensors see the same thing. The 505 data is a bit more bumpy: that is a consequence of the adaptive 505 algorithm and, of course, of the smoothing introduced by the Libre historical data.

One important point: as you can see in the left Bland Altman plot, two well working sensors can show very significant differences based on timing and rate of change.

Regardless of the absolute magnitude of the differences, a consistent behavior emerges: the Libre overshoots highs compared to the Dexcom and undershoots lows to a lesser (absolute) extent. This type of behavior could be the consequence of the calibration slope of the BGM used to calibrate the Dexcom, but we have observed the same behaviors with different BGMs (Menarini Glucomen LX, Roche Accucheck Mobile, Abbott’s Libre BGM). If you are interested in that behavior, the 2014 and 2015 posts on this blog provide additional insight.

The third screen is a log/log plot privately suggested by L. and is basically a Bland Altman on steroids that amplifies the visualization of the differences in behavior in a way that is less dependent on absolute differences. (I am sure I will be corrected if I didn’t get that right).

Beautifying the data

Now, let’s look at the old Clarke plot of the Dexcom vs the Libre. (yes, I know, Clarke plots are out of fashion, but I have had the function for ages, so why not…

First the un-shifted data plot.


Quite decent match, you would not have killed yourself by relying on either device.

Now, the delay corrected data plot.

Isn’t that something? We have gained almost 8% in the A zone.

Now, this doesn’t mean anything in absolute terms. For all we know, the Dexcom could have been right and the Libre could have been overshooting. Only one thing is certain: the delay.

But this tells us something else: it is extremely easy to tweek test results to your liking. Something as simple as asking patients to tests 2 hours after a meal vs asking them to test 1.5 hours after a meal, something seemingly as innocuous as using standard meals or standard sport sessions can have a drastic impact on the numbers. In a market where T1D fanboys love to argue about the 1% MARD advantage of their sensor (while at the same time losing 10% MARD or more through home made hacks), a couple of percent of differences can mean a huge amount of good publicity…

No comments:

Post a Comment