> Sohu is built for one thing and one thing only: transformers
Thanks for clarifying this. Could you clarify whether your chip supports the transformer architecture in general, or only specific models for e.g. Llama 70B? In case of the latter, would your ASIC have to be reprogrammed for each model?
Transformers in general. There’s no reprogramming of the ASIC needed, just applying a different sequence of layers, and that’s exactly what our software stack is meant to support.