posted on 2011-10-28, 08:45authored byBrendan Halpin
Simple modelling of categorical data is not as simple as it seems. Standard formulas taught to generations of undergraduates are shown to be sub-optimal, simple models are widely misunderstood, and high levels of controversy surround the suitability and interpretation of relatively standard models such as logistic regression. In this research note I discuss a number of these issues, using simple simulations in Stata and R to illuminate them.