Documentation
Ebbing and collaborators (10) have criticised the Norwegian Directorate of Health for its recommendation to apply eSnurra nationwide. The Directorate’s recommendations are based on comprehensive scientific documentation, assessments by the Norwegian Knowledge Centre for the Health Services and expertise in academic communities in Norway and abroad, including with regard to the statistical methodology.
The Norwegian Knowledge Centre for the Health Services pointed out that one possible validation strategy to assess different term prediction models consists in comparing the characteristics of the methods in question through measurements within one and the same population. This was done in a PhD thesis in 2012 (2), and the work has been presented in three international publications (7) – (9). The PhD thesis undertook validation studies of three different term prediction models: Snurra (phased out in 2007), Terminhjulet and eSnurra. The methods were validated, and the consequences of their differences were shown with data from a total of 73 409 pregnancies from three different Norwegian regions.
The old Snurra model and Terminhjulet both showed a varying systematic bias in the prediction of EDD over the range of inclusion – pregnancy weeks 14 – 23. The bias amounted to a maximum of five days, equivalent in all three populations. When using eSnurra, the EDD corresponded to the actual median date of delivery, also equivalent in all three populations and without any systematic bias. The methods were validated on their precision in predicting fetal age, and eSnurra scored highest (7).
The commentary by Ebbing and collaborators in the Journal of the Norwegian Medical Association (10) contains a number of fallacies: The developers of eSnurra are criticised for having used the population from which the method was developed in the evaluation of how well the EDD is predicted, and for using the same population to evaluate Terminhjulet. This is a fallacy. It is correct to evaluate the results in the same population from which the model was developed, as long as it is population-based.
It is also pointed out that a systematic review article gave a positive assessment of the quality of the studies of age determination and growth on which Terminhjulet is based (11). However, the presentation by Ebbing and collaborators is incorrect. The systematic review article is not a study of pregnancy dating models, but of fetal growth curves for use in later pregnancy.
According to Ebbing and collaborators, it must be expected that the EDD that was predicted with eSnurra would influence the health services to converge on this prediction. This is a fallacy. The pregnancies were dated from the old Snurra method, and the women were delivered long before the evaluation of eSnurra was initiated. They also refer to a critical comment on the statistical population-based approach that underlies eSnurra (12), but fail to mention the refutation of this criticism (13). This is quite remarkable.
It is wrong to claim that the due date is of no clinical importance. The fact that few births occur exactly on the EDD is not crucial. What is important is that the model has no systematic bias that causes the ultrasound-based EDD of the entire population to be shifted, as it is with Terminhjulet.