The first convolutional neural network, the Neocognitron, was AFAIK implemented on a PDP-11 as well:
https://www.semanticscholar.org/paper/Neocognitron%3A-A-neur...No backpropagation back then, this only appeared around 1986 with Rumelhart, probably on VAX machines by that time.
The 11/34 was hardly a powerhouse (roughly a turbo XT) but it was sturdy, could handle sustained load and its FPU made the whole difference.