mattnewport 8y ago

I think you can make a good case that Itanium (and similar attempts to un-hide some of this stuff, like the IBM/Sony Cell used in the PlayStation 3) failed precisely because they tried to shift optimization work from the CPU to the compiler / programmer.

greglindahl 8y ago

Funny, the compiler people I've worked with complain that Itanium tried to do too much in hardware, like the hardware support for loop unrolling, which made superpipelining optimizations in existing compilers much more complicated.

mattnewport (OP) 8y ago

Loop unrolling is one of those optimizations that actually highlights the need for dynamic CPU optimizations like out-of-order execution and speculative execution. It's very difficult to statically make a good decision about the optimal amount of loop unrolling to do, especially if you want to generate code that will continue to perform well on future CPUs using the same ISA. Even when targeting a specific CPU model it's difficult, however, since you don't know statically how many iterations of the loop to expect, what's currently in cache, what other code might be running immediately before or after the loop, what's running at the same time on other threads, etc.
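
To make the static-decision problem concrete, here is a rough sketch in C of the same reduction written rolled and unrolled by hand. The function names and the unroll factor of 4 are made up for illustration; nothing here is specific to Itanium or to any particular compiler. The point is only that the factor is baked in at compile time, with no knowledge of the trip count or the machine the code will eventually run on.

```c
#include <stddef.h>

/* Baseline: one add and one loop branch per element. */
long sum_rolled(const int *a, size_t n) {
    long s = 0;
    for (size_t i = 0; i < n; i++)
        s += a[i];
    return s;
}

/* Unrolled by a fixed factor of 4 (an arbitrary choice for this sketch).
   Four independent accumulators shorten the dependency chain; the
   trailing loop handles the n % 4 leftover elements. */
long sum_unrolled4(const int *a, size_t n) {
    long s0 = 0, s1 = 0, s2 = 0, s3 = 0;
    size_t i = 0;
    for (; i + 4 <= n; i += 4) {
        s0 += a[i];
        s1 += a[i + 1];
        s2 += a[i + 2];
        s3 += a[i + 3];
    }
    long s = s0 + s1 + s2 + s3;
    for (; i < n; i++)  /* remainder iterations */
        s += a[i];
    return s;
}
```

If n is usually 2 or 3, the unrolled body never runs and you have only paid for the extra code size. If n is large, whether 4 is too small or too big depends on the core's execution width and cache state, which is exactly the information the compiler doesn't have.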

greglindahl 8y ago

Itanium's hardware did not make any of these things easier.
