
✨[Zxcfu ISA ext.] add option to implement custom RISC-V instructions #264

Merged: 34 commits merged into master from zxcfu_isa_extension on Jan 31, 2022

Conversation

@stnolting (Owner) commented Jan 29, 2022

With this PR the NEORV32 now provides an option to add custom RISC-V instructions. 🚀

This PR adds a Custom Functions Unit (CFU) wrapped in the Zxcfu ISA extension, which is a NEORV32-specific custom ISA extension. The extension's name follows the RISC-V naming scheme:

  • Z = this is a sub-extension
  • x = the second letter behind the Z defines the "parent extension" to which this sub-extension belongs: in this case it belongs to the X "custom extensions" extension (a platform-specific extension that is not defined by the RISC-V spec.)
  • cfu = name of the extension (Custom Functions Unit)

[figure: neorv32_processor block diagram]

The CFU is implemented as a new hardware module (rtl/core/neorv32_cpu_cp_cfu.vhd) that is integrated right into the CPU's ALU. Thus, the CFU has direct access to the core's register file, which provides minimal data transfer latency. A special OPCODE, which has been officially reserved for custom extensions by the RISC-V spec, is used to build the custom instructions. The custom instructions supported by the CFU use the R2-type format, which provides two source registers, one destination register and a 10-bit immediate (split into two bit-fields):

[figure: cfu_r2type_instruction format]

The funct7 and funct3 bit-fields can be used to pass immediates to the CFU for certain computations (for example offsets, addresses, shift-amounts, ...) or they can be used to select the actual custom instruction to be executed (allowing up to 1024 different instructions).
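
To make the bit-field layout more concrete, the following sketch shows how such an R2-type instruction word could be assembled. It assumes the standard RISC-V R-type field ordering and the custom-0 opcode (0b0001011); the helper name is made up, and the data sheet defines the encoding actually used by the CFU.

#include <stdint.h>

// Illustration only: pack an R2-type CFU instruction word.
// Assumes the standard R-type field layout and the custom-0 opcode;
// the NEORV32 data sheet defines the encoding that is actually used.
static inline uint32_t cfu_encode_r2type(uint32_t funct7, uint32_t rs2,
                                         uint32_t rs1, uint32_t funct3,
                                         uint32_t rd) {
  return ((funct7 & 0x7f) << 25) | // funct7: immediate or instruction-select bits
         ((rs2    & 0x1f) << 20) | // source register 2
         ((rs1    & 0x1f) << 15) | // source register 1
         ((funct3 & 0x07) << 12) | // funct3: selects intrinsic cmd0..cmd7
         ((rd     & 0x1f) <<  7) | // destination register
         0x0b;                     // custom-0 opcode (assumption)
}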

Software can utilize the custom instructions by using the provided intrinsics (defined in sw/lib/include/neorv32_cpu_cfu.h). These pre-defined functions implicitly set the funct3 bit-field. Each intrinsic can be treated as a "normal C function" (see #263). A simple demo program using the default CFU hardware is available in sw/example/demo_cfu/main.c.

// custom instruction prototypes
neorv32_cfu_cmd0(funct7, rs1, rs2); // funct3 = 000
neorv32_cfu_cmd1(funct7, rs1, rs2); // funct3 = 001
neorv32_cfu_cmd2(funct7, rs1, rs2); // funct3 = 010
neorv32_cfu_cmd3(funct7, rs1, rs2); // funct3 = 011
neorv32_cfu_cmd4(funct7, rs1, rs2); // funct3 = 100
neorv32_cfu_cmd5(funct7, rs1, rs2); // funct3 = 101
neorv32_cfu_cmd6(funct7, rs1, rs2); // funct3 = 110
neorv32_cfu_cmd7(funct7, rs1, rs2); // funct3 = 111
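
As an illustration of how these intrinsics are used (a sketch loosely based on the demo program; the funct7 value and variable names are made up, and neorv32.h is assumed to pull in the CFU intrinsics):

#include <stdint.h>
#include <neorv32.h> // assumption: main runtime header, provides the CFU intrinsics

void cfu_demo(void) {
  uint32_t a = 0xCAFE1234;
  uint32_t b = 0x00000007;

  // issue the custom instruction with funct3 = 000;
  // the first argument is the 7-bit funct7 immediate (0 here, arbitrary),
  // rs1/rs2 are loaded from a/b and the result is returned like a normal C function
  uint32_t result = neorv32_cfu_cmd0(0, a, b);
  (void)result; // the result would normally be used by the application
}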

This new feature was highly inspired by @google's CFU-Playground - thanks again to @umarcor for showing me that framework. With some logic plumbing it should be possible to install the CFUs from the CFU-Playground into the NEORV32.

📚 Documentation

The documentation of the CFU module is available in the online processor data sheet, section "Custom Functions Unit (CFU)". A comparison of the different processor extension options is available in the user guide, section "Adding Custom Hardware Modules".


CFU vs. CFS

There are two processor-internal options for custom hardware now: the Custom Functions Subsystem (CFS) and the Custom Functions Unit (CFU).

  • Custom Functions Subsystem (CFS): The CFS is a memory-mapped peripheral that is accessed using load/store instructions. It is intended for complex accelerators that - once triggered - perform some "long" processing in a CPU-independent manner (like a complete AES encryption). The CFS also provides the option to implement custom interfaces as it has direct access to special top entity signals.
  • Custom Functions Unit (CFU): The CFU is located right inside the CPU's pipeline. It is intended for custom instructions that implement functionality not covered by the official RISC-V ISA extensions. These instructions should be rather simple data transformations (like bit reversal, summing the elements of a vector, elementary AES operations, ...) rather than complete algorithms (even though that is also possible), since CFU instructions are entirely CPU-bound and stall the core until they complete (see the sketch below).
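
From the software point of view the difference boils down to load/store accesses versus a single custom instruction. Below is a minimal sketch contrasting the two; the CFS base address and register layout are made up for illustration (the real, user-defined register map is described in the data sheet), and neorv32.h is assumed to provide the CFU intrinsics.

#include <stdint.h>
#include <neorv32.h> // assumption: provides the CFU intrinsics used below

void accel_demo(uint32_t x, uint32_t y) {
  // CFS: memory-mapped peripheral, accessed with plain load/store instructions;
  // base address and register indices are placeholders for illustration only
  volatile uint32_t *cfs = (volatile uint32_t *)0xFFFFEB00UL; // made-up address
  cfs[0] = x;                // write operands, trigger the (long-running) operation
  cfs[1] = y;
  uint32_t cfs_res = cfs[2]; // read back the result once the CFS is done

  // CFU: a single custom instruction executed inside the CPU pipeline;
  // operands come from the register file, the core stalls until it completes
  uint32_t cfu_res = neorv32_cfu_cmd0(0, x, y); // funct7 = 0 (arbitrary)

  (void)cfs_res;
  (void)cfu_res;
}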

@stnolting added the labels enhancement (New feature or request), HW (hardware-related) and SW (software-related) on Jan 29, 2022
@stnolting self-assigned this on Jan 29, 2022
@stnolting changed the title from "✨[Zxcfu ISA extensions] add option to implement custom RISC-V instructions" to "✨[Zxcfu ISA ext.] add option to implement custom RISC-V instructions" on Jan 29, 2022
@stnolting (Owner, Author)

I have added a summarized comparison of the four most obvious (IMHO) options for adding custom hardware modules to the processor (user guide, section "Adding Custom Hardware Modules"). These options are:

  • attach custom hardware via the external memory interface (WISHBONE)
  • attach custom hardware via the stream link interface (SLINK)
  • implement custom hardware via the custom functions subsystem (CFS)
  • implement custom hardware via the custom functions unit (CFU)

@umarcor

We recently had a short discussion about this topic. Could you have a look at the comparison table (-> https://github.com/stnolting/neorv32/blob/zxcfu_isa_extension/docs/userguide/adding_custom_hw_modules.adoc#16-comparative-summary)? Maybe you have some ideas for additional (or better) comparison "metrics". 😉

@stnolting marked this pull request as ready for review on January 31, 2022, 04:32
@stnolting merged commit 3ac6303 into master on Jan 31, 2022
@stnolting deleted the zxcfu_isa_extension branch on January 31, 2022, 07:10
@umarcor (Collaborator) commented Feb 1, 2022

@stnolting you are so fast! I commented it mostly for gathering some knowledge and you implemented all of it in 1-2 days! That's impressive! Thank you so much!

/cc @tcal-x @mithro @kgugala: you might be interested in knowing that NEORV32 now supports a CFU and that it might be combined with the content from google/CFU-Playground.

@tcal-x commented Feb 3, 2022

Hi! That's great news! Sorry I forgot to follow up on this. It would be super awesome to have an alternative CPU that would connect to CFUs similar to VexRiscv, even if an adapter is needed (I haven't yet checked out the CFU interface on NEORV32 to see how similar it is). Connecting VexRiscv to CFU is actually done in LiteX: https://github.com/enjoy-digital/litex/blob/master/litex/soc/cores/cpu/vexriscv/core.py#L275-L328 .

Does NEORV32 currently plug into LiteX?

@stnolting (Owner, Author)

> Does NEORV32 currently plug into LiteX?

Not yet, but that is already on the to-do stack (#115) 😉

> haven't yet checked out the CFU interface on NEORV32 to see how similar it is

According to this CFU-Playground template the interface seems to be quite similar. However, I need to find some real specification for that and test some existing CFU setups (see #269).

@umarcor (Collaborator) commented Feb 5, 2022

FTR, there is https://github.com/umarcor/neorv32-setups/commits/umarcor/edaa, which is a work-in-progress prj.py to declare the NEORV32 sources through pyEDAA.ProjectModel. Having them defined in Python should make them easier to reuse in LiteX. However, I'm not sure about the steps required to support a new CPU in LiteX (most of the uses I see are packaging SoCs with already supported cores/modules). I've seen Migen and SpinalHDL designs being available in the ecosystem, but not VHDL. Is there any LiteX example using VHDL or do we need to convert NEORV32 to Verilog (#266)?

On the other hand, it might be interesting to add a .core file to stnolting/neorv32-setups. Can LiteX read FuseSoC's .core files or does it need some specific declarative format?

/cc @enjoy-digital @olofk

@enjoy-digital (Contributor)

Hi @umarcor,

we currently have one VHDL CPU integrated in LiteX: Microwatt. It can be used and integrated as VHDL or pre-converted to Verilog through GHDL/Yosys.

To add a CPU, you first need to create the LiteX wrapper around it. You can use these PRs as a reference: CV32E40P or FemtoRV, initially still with local sources. Once working, we could package NEORV32 in a pythondata-xxyy package as we are doing for other CPUs.

@stnolting's NEORV32 looks awesome and I would be really happy to help with the integration into LiteX. This would also probably be a good stress test for the GHDL/Yosys plugin :) (proprietary tools will accept VHDL, but conversion to Verilog is useful for simulating with litex_sim (through Verilator) or for implementing the design on hardware with open-source tools).

@stnolting (Owner, Author)

@enjoy-digital

Thank you very much! I will take a closer look at the Microwatt integration and try to find out how things work 😉

@enjoy-digital (Contributor)

@stnolting: In fact, since NeoRV32 seems to be a lot better documented than LiteX and I'm also a VHDL developer, it would probably be more efficient if I at least create the skeleton for the integration to initiate the work and allow us to work on this together. Several issues are related to this or to derived aspects (Verilog generation, CFU, etc...), so this would let you go further on those aspects, and doing the NeoRV32 integration could also be a good occasion for me to write a CPU integration tutorial for LiteX :) I'll have a look at integrating neorv32_cpu.vhd as a LiteX CPU in the next days and will share progress here.

@umarcor (Collaborator) commented Feb 13, 2022

@enjoy-digital, please, don't do it alone. I mean, @stnolting is the author of NEORV32 as a whole, and I wrote most of the Makefiles in neorv32-setups. See the diagram in hdl.github.io/constraints/Usage. So, please, ask as soon as anything about the structure is unclear.

I suggest you take a look at the processor_templates and system_integration subdirs in this repo. Rather than integrating neorv32_cpu, you might want to start with one of those. With regard to the board_tops in neorv32-setups, maybe you don't want to use them in LiteX at all (because that's the core functionality of LiteX and you already have litex-boards); however, they can serve as inspiration.

@enjoy-digital (Contributor)

> @enjoy-digital, please, don't do it alone. I mean, @stnolting is the author of NEORV32 as a whole, and I wrote most of the Makefiles in neorv32-setups. See the diagram in hdl.github.io/constraints/Usage. So, please, ask as soon as anything about the structure is unclear.

Sure, that's what I meant with the integration skeleton. It's easier for me to put things in place for LiteX; once that is done, and if there are issues, I'll be able to share a simulation environment we can use to continue the bring-up.

> I suggest you take a look at the processor_templates and system_integration subdirs in this repo. Rather than integrating neorv32_cpu, you might want to start with one of those. With regard to the board_tops in neorv32-setups, maybe you don't want to use them in LiteX at all (because that's the core functionality of LiteX and you already have litex-boards); however, they can serve as inspiration.

In fact the NeoRV32 CPU seems to be the equivalent of the other CPUs integrated in LiteX, so it's easier to start with this. If LiteX can also be useful for running the NeoRV32 Processor on different hardware or for providing peripherals not present in the NeoRV32 Processor, I'll also be happy to provide help/directions.

@enjoy-digital (Contributor)

@umarcor: This is working and I suggest moving the LiteX specific discussion to #115 instead of this closed PR :)

@umarcor (Collaborator) commented Feb 14, 2022

@enjoy-digital ack and agree. Thanks for your awesomely fast and effective response!

@stnolting (Owner, Author)

I would recommend using the processor top entity (rtl/core/neorv32_top.vhd) for the integration (or maybe one of the template processor wrappers). The bus interfaces of the stand-alone CPU are somewhat proprietary, even though they were inspired by Wishbone.

You can disable all processor-internal modules (even the memories) via generics. With this configuration you get a processor setup that provides just a Wishbone-compatible bus interface.

> This is working and I suggest moving the LiteX specific discussion to #115 instead of this closed PR :)

I agree 😉
