Simpson's paradox: a trend appears in each of several groups of data but disappears or reverses when the groups are combined, e.g. a is bigger than b in every individual year, but over all years combined b is bigger. This is usually due to the groups having different sample sizes.
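
To make the reversal concrete, here is a minimal sketch with two years of success counts for a and b. The numbers are invented purely for illustration (they are not from any real data set), and the helper names `rate` and `pooled_rate` are hypothetical. The point is only that a can win in every year while b wins once the years are pooled, because each of them has most of their trials in a different year.

```python
from fractions import Fraction

# Hypothetical (successes, trials) counts, invented for illustration only.
data = {
    "a": {"year 1": (9, 10), "year 2": (30, 100)},
    "b": {"year 1": (80, 100), "year 2": (2, 10)},
}

def rate(successes, trials):
    """Success rate within a single group, as an exact fraction."""
    return Fraction(successes, trials)

def pooled_rate(groups):
    """Success rate after adding up successes and trials across all groups."""
    total_successes = sum(s for s, _ in groups.values())
    total_trials = sum(t for _, t in groups.values())
    return Fraction(total_successes, total_trials)

# Does a beat b within every single year?
a_wins_each_year = all(
    rate(*data["a"][year]) > rate(*data["b"][year]) for year in data["a"]
)

# Does b beat a once the years are combined?
b_wins_pooled = pooled_rate(data["b"]) > pooled_rate(data["a"])

print(a_wins_each_year, b_wins_pooled)
```

With these made-up counts, a has most of its trials in the year where both do badly, and b has most of its trials in the year where both do well, so the pooled rates are weighted towards different years.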

This is my first attempt at an elementary statistics post, which I hope is suitable for Less Wrong. I am going to present a discussion of a statistical phenomen