thx phndrenad2. funny i just searched "chisel gpu" and found two:
https://github.com/jbush001/ChiselGPU
https://github.com/Chlorophytus/broccoli
half-float we'd like to do by using a dynamic SIMD-aware 64-bit ALU that has auto-partitioning. we do however already have an actual FP16 implementation
https://git.libre-soc.org/?p=ieee754fpu.git;a=tree;f=src/iee...
or more to the point, one that is compile-time configurable with one parameter (bit-width), so the same HDL does FP16, FP32 and FP64. i'd like to make that dynamically-SIMD-configurable but it'll take some base work in nmigen to do without massive code-explosions.
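
to illustrate the one-parameter idea, here's a rough plain-python sketch (not the ieee754fpu code and not nmigen HDL; FPFormat and its methods are made-up names for illustration). it assumes the standard IEEE 754 exponent widths for 16/32/64 bits and derives everything else from the bit-width, so one decode routine covers all three formats:

```python
class FPFormat:
    """Hypothetical helper: an IEEE 754 binary format derived from one width."""

    # Exponent field widths for the standard binary16/32/64 formats.
    EXP_BITS = {16: 5, 32: 8, 64: 11}

    def __init__(self, width):
        self.width = width
        self.e_bits = self.EXP_BITS[width]
        # Whatever is left after the sign bit and exponent is the mantissa.
        self.m_bits = width - 1 - self.e_bits
        self.bias = (1 << (self.e_bits - 1)) - 1

    def split(self, bits):
        """Return the (sign, exponent, mantissa) fields of a raw bit pattern."""
        sign = bits >> (self.width - 1)
        exp = (bits >> self.m_bits) & ((1 << self.e_bits) - 1)
        man = bits & ((1 << self.m_bits) - 1)
        return sign, exp, man

    def decode(self, bits):
        """Convert a raw bit pattern to a Python float."""
        sign, exp, man = self.split(bits)
        s = -1.0 if sign else 1.0
        e_max = (1 << self.e_bits) - 1
        if exp == e_max:
            # All-ones exponent: infinity or NaN.
            return s * float("inf") if man == 0 else float("nan")
        if exp == 0:
            # Zero or subnormal: no implicit leading 1.
            return s * man * 2.0 ** (1 - self.bias - self.m_bits)
        # Normal number: restore the implicit leading 1.
        return s * ((1 << self.m_bits) + man) * 2.0 ** (exp - self.bias - self.m_bits)


fp16, fp32, fp64 = FPFormat(16), FPFormat(32), FPFormat(64)
print(fp16.decode(0x3C00), fp32.decode(0x3F800000), fp64.decode(0x3FF8000000000000))
```

the same shape is what you'd want in HDL: the field widths are constants computed at elaboration time from the width parameter, and the datapath is written once against those constants.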