undefined | Better HN

0 pointsnoobermin3y ago0 comments

To be fair, likely none of the readers here have designed a single CPU either :)

0 comments

ruslan3y ago

I toyed with my own implementation of one-stage RV32I for FPGAs, does this count ? ;)

kragen3y ago

I think that's a lot more than Rob has done. What did you learn?

ruslan3y ago

Well, at least I learnt that pipelining is for good, it speed-ups execution quite a lot. :)

snvzz3y ago

>one-stage

as in not pipelined?

ruslan3y ago

Yes, there was no pipeline. :)

formerly_proven3y ago

Some variation of designing, building or extending a CPU or ISA is standard in comp-sci curricula.

kragen3y ago

It's a standard thing to do in EE curricula; you normally do it in a one-semester class, and there are literally thousands of open-source synthesizable CPU cores on GitHub now. Some one-semester classes go so far as to design ASICs and, if they pass DRCs, get them fabbed through something like MOSIS or CMP.

To take three examples to show that designing a CPU is less work than writing a novel:

- Chuck Thacker's "A Tiny Computer", fairly similar to the Nova, is a page and a half of synthesizable Verilog; it runs at 66 MHz in 200 (6-input) LUTs of a Virtex-5: https://www.cl.cam.ac.uk/~swm11/examples/bluespec/Tiny3/Thac...

- James Bowman's J1A is more like Chuck Moore's MuP21 and is about three pages of synthesizable Verilog: https://github.com/jamesbowman/swapforth/blob/master/j1a/ver... and https://github.com/jamesbowman/swapforth/blob/master/j1a/ver.... You can build it with Claire Wolf's iCEStorm (yosys, etc.) and run it on any but Lattice's tiniest FPGAs; it takes up 1162 4-input LUTs.

- Ultraembedded's uriscv is about 11 pages of Verilog and implements the full RV32IMZicsr instruction set, including interrupt handling (but not virtual memory or supervisor mode): https://github.com/rolandbernard/kleine-riscv/tree/master/sr...

In all three cases, this doesn't include testbenches and other verification work, but as I understand it, that's usually only two or three times as much work as the logic design itself.

Maybe we should have a NaCpuDeMo, National CPU Design Month, like NaNoWriMo.

I haven't quite done it myself. Last time I played https://nandgame.com/ it took me a couple of hours to play through the hardware design levels. But that's not really "design" in the sense of defining the instruction set (which is, like Thacker's design, kind of Nova-like), thinking through state machine design, and trying different pipeline depths; you're mostly just doing the kind of logic minimization exercises you'd normally delegate to yosys.

In https://github.com/kragen/calculusvaporis I designed a CPU instruction set, wrote a simulator for it, wrote and tested some simple programs, designed a CPU at the RTL level, and sketched out gate-level logic designs to get an estimate of how big it would be. But I haven't simulated the RTL to verify it, written it down in an HDL, or breadboarded the circuit, so I'm reluctant to say that this qualifies as "designing a single CPU" either. (Since it's not 01982 anymore maybe you should also include a simple compiler backend before you say a new ISA is really designed?)

But I also wouldn't say I'm "well versed in the topic". I can say things about what makes CPUs fast or slow, but I don't know them from my own experience; I'm mostly just repeating things I've heard from people I judge as credible on CPU design. But what is that credibility judgment based on? How would I know if I was just believing a smooth charlatan who doesn't really know any more than I do? And I think Rob is in the same situation as I am, just worse, because he has even less experience.

sponaugle3y ago

Indeed, I suspect most computer engineering students have done some level of CPU design in their coursework. I rather enjoy the design process, and have done many different designs over the years in an attempt to learn about different optimization and design decisions. I typically do something on paper first, then in some simulation, and sometimes into HDL an on an FPGA, or in some cases discrete logic.

I recently did a very simple 16 instruction/16 register RISC-like design (no microcode) built using just 74xx series logic which was successful at over 10MHz, and I then took that design and implemented it in CPLDs to see how it would compress. It really is an enjoyable process and a nice change from the daily software engineering tasks.

Napkin CPU design should be table topic at your next dinner!

kragen3y ago

That's great! Presumably you mean 74HC or 74HCT or at least 74ALS, not really 74? How much trouble did you get with noise as you pushed it to 10 MHz with SSI chips? How many CPLDs did you end up using?

Thank you for comprehensively rebutting "To be fair, likely none of the readers here have designed a single CPU either :)"

1 more reply

j / k navigate · click thread line to collapse

0 comments

ruslan3y ago

I toyed with my own implementation of one-stage RV32I for FPGAs, does this count ? ;)

kragen3y ago

I think that's a lot more than Rob has done. What did you learn?

ruslan3y ago

Well, at least I learnt that pipelining is for good, it speed-ups execution quite a lot. :)

snvzz3y ago

>one-stage

as in not pipelined?

ruslan3y ago

Yes, there was no pipeline. :)

formerly_proven3y ago

Some variation of designing, building or extending a CPU or ISA is standard in comp-sci curricula.

kragen3y ago

To take three examples to show that designing a CPU is less work than writing a novel:

In all three cases, this doesn't include testbenches and other verification work, but as I understand it, that's usually only two or three times as much work as the logic design itself.

Maybe we should have a NaCpuDeMo, National CPU Design Month, like NaNoWriMo.

sponaugle3y ago

Napkin CPU design should be table topic at your next dinner!

kragen3y ago

Thank you for comprehensively rebutting "To be fair, likely none of the readers here have designed a single CPU either :)"

1 more reply

j / k navigate · click thread line to collapse