Common statistical errors in logarithmic data of viral load
Background: The mathematical operation of logarithm data follows the rules of logarithm as the numbers relate to specific meanings. The numbers prior to the decimal are the mantissa or the transformed power of 10 and the numbers after the decimal are the significant numbers of a given numeric data....
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Chaing Mai University
2018-01-01
|
Series: | Journal of Associated Medical Sciences |
Subjects: | |
Online Access: | https://www.tci-thaijo.org/index.php/bulletinAMS/article/view/91320 |
Summary: | Background: The mathematical operation of logarithm data follows the rules of logarithm as the numbers relate to specific meanings. The numbers prior to the decimal are the mantissa or the transformed power of 10 and the numbers after the decimal are the significant numbers of a given numeric data. Such operations result in errors in statistical calculations and lead to erroneous conclusions and eventual miss-interpretation of data.
Objectives: The principal aims of this study were to demonstrate and discuss several commonly made mistakes in calculation of logarithm data of viral load assays, and the subsequent errors in calculation of precision that affects the estimation of measurement uncertainty of the test.
Materials and methods: The study reviewed scholarly articles in 2017, using’ viral load’ as a keyword. Several sets of log data from inter-laboratory comparison of human immunodeficiency virus (HIV) load and multi-center evaluation of cytomegalovirus (CMV) load assays were used for this study to demonstrate the errors in calculation when the statistical calculations were performed using logarithm data.
Results: It was observed that presentation of average viral load data as arithmetic mean and calculation of other statistical measures directly from log viral load data was quite common, in spite of that fact that such average was log geometric mean of the viral load. Standard deviations (SD) calculated directly from log viral loads gave a new-undefined values that were irrelevant to the deviation of viral load from its mean. Such errors in SD calculation lead to extraordinarily low coefficient of variance and very low measurement uncertainty.
Conclusion: The SD, calculated from the log, and those SD calculated from the viral load per milliliter are different. Mathematically, such statistics should be calculated from the number of viruses per milliliter and could then be converted to a log scale for downstream use. Traditionally, the SD were calculated from the log which was very low. This study recommends that for all statistical measures, the absolute value is important, should use the viral load per milliliter in the calculations and then convert it into a log scale for correct usage. |
---|---|
ISSN: | 2539-6056 2539-6056 |