My example tends to be lottery tickets. You minimize MAD by saying they are all worth zero (which is pretty much my opinion). But then you don't get the value of the lottery by summing up all the ticket values. You do/should get get with mean/expectation based estimates.
More of my writing on this: http://www.win-vector.com/blog/2014/01/use-standard-deviatio... . Though I am also a fan of quantile regression (it just solves different problems).
This Mean Absolute Deviation is a very useful error estimator. Unfortunately it, unlike the standard deviation, requires two passes over a dataset to compute.
My 6th grader took home a practice standardized test today. Last question on the test required a MAD computation.
He didn't know what MAD was and I had to look it up myself.
I think understanding the idea behind MAD is great and potentially useful and interesting.
However, I'm not sure what 6th graders are going to be able to grind through two iterations over a data set to get the correct answer.
1. The SD does have an interpretation: it is the (rescaled) Euclidean distance between your observed data set and a data set in which all points are replaced by the sample mean. Not terribly useful, but arguably not useless.
2. Are there models for which MD is in fact a sufficient statistic?