Anscombe Quartet

    Statistics is such a tool ... Very scary in inept hands. The skilled ones are worse than that, capable of tearing the brain to pieces.

    There are sequences A , B , C and D , about which the following is known:
     ABCD
    Average x9.009.009.009.00
    Variance x10.0010.0010.0010.00
    Average y7.507.507.507.50
    Variance y3.753.753.753.75
    Correlation between x and y0.820.820.820.82
    Direct linear regressiony  = 3 + 0.5  xy  = 3 + 0.5  xy  = 3 + 0.5  xy  = 3 + 0.5  x
    That is, all the indicated values ​​for them coincide. At least until the second decimal place. And now we look with our eyes:
    Anscombe's quartet


    Such is the oil painting. You can download .XLS with data for self-study.

    The British statistician F.J. came up with this thing. Anscombe, and it’s called the Anscombe Quartet. Everyone has heard the saying about the average temperature in the hospital, and now you have a good illustration for it.

    About the Anscombe quartet on the English Wikipedia.

    UPD: the porting of this article to the Russian Wikipedia has begun , and they correctly notice that the author should be called Francis Enscomb .

    Also popular now: