The best benchmark is always your application. All benchmarks are flawed, use your judgement and determine how flawed a benchmark is; Any flaws are relative to your application similarity to what the benchmark tests. An imperfect tool is not a useless tool, so long as you are smart about how you use it.
This is probably relevant too: http://benchmarksgame.alioth.debian.org/dont-jump-to-conclus...