I don't know how sophisticated the streaming/prefetch/access-pattern prediction in 2002-era CPUs was.
I'm speculating, but if that's not modeled, cachegrind may be pessimistic about predictable but less simple access patterns and report a lot of misses that the CPU would actually have avoided by prefetching.
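
To make the speculation concrete, here's a toy sketch of the effect I mean. It is not cachegrind's model or any real CPU's prefetcher: the cache geometry (`LINE`, `NUM_LINES`), the `count_misses` helper, and the "same stride twice in a row, so fetch the next one" rule are all assumptions made up for illustration.

```python
LINE = 64         # bytes per cache line (assumed)
NUM_LINES = 512   # direct-mapped cache with 512 lines, i.e. 32 KiB (assumed)


def count_misses(addresses, stride_prefetch=False):
    """Count misses in a toy direct-mapped cache.

    With stride_prefetch=True, a hypothetical prefetcher pulls in the next
    line whenever the same non-zero line stride is seen twice in a row.
    """
    cache = [None] * NUM_LINES  # slot -> line number currently held
    misses = 0
    last_line = None
    last_delta = None
    for addr in addresses:
        line = addr // LINE
        slot = line % NUM_LINES
        if cache[slot] != line:
            misses += 1
            cache[slot] = line
        if stride_prefetch and last_line is not None:
            delta = line - last_line
            if delta != 0 and delta == last_delta:
                nxt = line + delta
                cache[nxt % NUM_LINES] = nxt  # prefetched before it is needed
            last_delta = delta
        last_line = line
    return misses


# A 256-byte stride touches a new cache line on every access.
addrs = list(range(0, 256 * 10_000, 256))
print(count_misses(addrs))                        # no prefetch model
print(count_misses(addrs, stride_prefetch=True))  # with the toy prefetcher
```

In this toy setup, the model without a prefetcher counts every access as a miss, while the one with the stride rule only misses on the first few accesses before the pattern is detected. That's the kind of gap I'd expect between a simulator that doesn't model prefetch and hardware that does, assuming the hardware really does catch the pattern.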