Yes, but keep in mind that I'm purposfully comparing development RTL on the open-source side with finished silicon product on the proprietary side.
Many companies have production ready/fully verified IP that are much faster than XiangShan, but they haven't taped out and productized it yet.
E.g. Tenstorrent Ascalon IP is available and at 21 SPECint2006/GHz. They want to release a lower clocked devboard on TSMC 12nm in 2026 Q2.
XiangShan is currently at 15 SPECint2006/GHz, but are targeting 22 SPECint2006/GHz in the next iteration, that is currently in the works.
Well have to see who gets there first.
The inherent gap between RTL design and final product may be large enough that open-source RTL can keep up in terms of IPC, if XiangShan continues their momentum. But I think open-source trailing 1-3 years is a more realistic long term outcome.
That's actually the most NDA thing, since companies don't want to be held to a release date. However the fact they have fully functional engineering samples running in their offices now is very encouraging.
If I want to do systems research that needs a simple and synthesizable RISC-V design, what are some good options? BOOM seems to be pretty complicated and I don't really need the out-of-order execution.
I also want to be able to run it on a cheap FPGA, something like Artix A7.
This is what the OpenHW Foundaiton is for. Providing well verified cores for people who just need a core. They are based on the original Pulp Cores from the ETH Zürich and Unviersity of Bolongia. See:
Claire Wolf's PicoRV32 seems to get a lot of "simple and synthesizable RISC-V design to run on a cheap FPGA" use, although that's more typically for much cheaper FPGA's than an Artix A7, like an iCE40. SeRV is even cheaper and simpler but much slower. Graeme Smecher’s RVC-first "Minimax" is almost as simple as SeRV but not nearly as slow.
I haven't tried any of them, though, so take this with a Himalayan salt lamp.
I've been getting tremendous use out of PicoRV32- it works, it's tiny, and for many use cases ("management plane") you just don't need much speed. I work around its slowness by providing things like relatively large communication buffers in the FPGA. I use it in "execute in place" mode from external SPI-flash (the FPGA's config flash), but with an instruction cache. It can do floating point via emulation which is handy for printf.
I've been meaning to update my toolbox to at least a pipelined processor of some sort (to up the IPC rate to at least 1), but so far just had no strong need. For applications that really need CPU power, I use SoC FPGAs like Zynq.
XiangShan is currently the fastest open-source CPU implementation: https://github.com/OpenXiangShan/XiangShan
uarch slides: https://tutorial.xiangshan.cc/hpca25/slides/20250302-HPCA25-...
uarch slides with WIP features: https://tutorial.xiangshan.cc/micro25/slides/Microarchitectu...
Many companies have production ready/fully verified IP that are much faster than XiangShan, but they haven't taped out and productized it yet.
E.g. Tenstorrent Ascalon IP is available and at 21 SPECint2006/GHz. They want to release a lower clocked devboard on TSMC 12nm in 2026 Q2.
XiangShan is currently at 15 SPECint2006/GHz, but are targeting 22 SPECint2006/GHz in the next iteration, that is currently in the works. Well have to see who gets there first.
The inherent gap between RTL design and final product may be large enough that open-source RTL can keep up in terms of IPC, if XiangShan continues their momentum. But I think open-source trailing 1-3 years is a more realistic long term outcome.
Actually available for pre-order, take a look at the UltraRISC DP-1000 (unrelated to the chip above).
I also want to be able to run it on a cheap FPGA, something like Artix A7.
If you want something very tiny and completely understandable, and don't mind that it's 32 bit, then PicoRV32 (https://github.com/YosysHQ/picorv32).
https://openhwfoundation.org/
Or because its part of OpenTitan, Ibex sees a lot of development: https://github.com/lowRISC/ibex
I haven't tried any of them, though, so take this with a Himalayan salt lamp.
I've been meaning to update my toolbox to at least a pipelined processor of some sort (to up the IPC rate to at least 1), but so far just had no strong need. For applications that really need CPU power, I use SoC FPGAs like Zynq.
this one is well tested in real life and works well (it is the RV core in RP2350)
https://github.com/riscv-boom/riscv-boom
But this project seems to have stagnated with one significant commit in the last year.