The software decoder is not fast enough if you need to use the CPUs in parallel. So removing the hardware acceleration is a net loss of compute power.