8- and 16-bit CPUs are dying off pretty rapidly. You can make some pretty tiny 32-bit CPUs (e.g.
https://github.com/YosysHQ/picorv32 is an RV32 core that, per its README, fits in roughly 750 to 2000 LUTs on a Xilinx 7-series FPGA). On a budget-optimized process node (e.g. 28nm), the core is absolutely tiny and nearly all of the cost comes from the (S)RAM.