Register VM #3798

HalidOdat · 2024-04-09T00:10:26Z

This PR transitions the current stack based VM to a register based VM.

There are a many changes in this, but I will try to highlight the most important ones.

The register allocation machanism in the bytecompiler has already been implemented in Implement register allocation #3942
I moved the register storage at runtime out of the Vm struct into a dedicated Registers struct that is passed to the vm execution functions together with the Context. The reason for this is mostly to be able to get references to register values without borrowing the Context. In some opcodes, the Clone rate of JsObjects significantly increased with the move to registers, because I had to clone all register values before operating on them, because the operation required a &mut Context.
Most logic in the bytecompiler is (IMO) way easier to understand now. I could eliminate 13 opcodes, and most importantly Dup, Swap, RotateLeft and RotateRight are gone. Working with registers turns out to be way more intuitive than working on a stack.
There is a lot of boilerplate in the opcode implementation now, since many more opcodes now read the register locations and need implementations for u8, u16 and u32 sizes. I think we should probably look into reworking the opcode generation macros, to possibly add both emit and read_operands functions. This should also make the code a lot safer, since right now we have to manually implement all operand reading and writing.
There is still some usage of the stack left. In particular all Call / New opcodes use the stack for all arguments and the return values. Also the return value and returned resume kind for yielding opcodes (Await, etc.) still use the stack. Currently I'm not sure how / if we want to change this. There are some significant challanges here, especially with how our function calls currently work.
I tried to unify most opcode descriptions with their relevant input / output values to make those more clear. I also adjusted the instruction_operands print function for all opcodes. A significant amout of lines changed is just that.
Performance had initially regressed notably until I moved the registers to their dedicated struct. As far as I can tell, this was mainly from cloning JsObjects and the associated gc. Currently the object clone rate is still higher than with a the stack vm. I measured this on the EarleyBoyer benchmark and last time I checked, the clone rate is ~1,35B vs ~1,42B. In my local benchmark runs the performance regression is between ~1% and ~3%.

github-actions · 2024-04-09T00:25:02Z

Test262 conformance changes

Test result	main count	PR count	difference
Total	50,254	50,254	0
Passed	45,007	45,012	+5
Ignored	1,728	1,728	0
Failed	3,519	3,514	-5
Panics	0	0	0
Conformance	89.56%	89.57%	+0.01%

Fixed tests (5):

test/language/expressions/super/prop-expr-getsuperbase-before-topropertykey-putvalue-increment.js (previously Failed)
test/language/expressions/super/prop-expr-getsuperbase-before-topropertykey-putvalue-compound-assign.js (previously Failed)
test/staging/sm/class/superPropIncDecElem.js (previously Failed)
test/staging/sm/class/superPropFor.js (previously Failed)
test/staging/sm/class/superPropDerivedCalls.js (previously Failed)

HalidOdat · 2024-07-09T17:28:15Z

The failing test is due to #3907 , in any case I'll work on always returning i32 for increment.

raskad · 2024-10-10T20:05:10Z

@HalidOdat Do you have the time to work on this at the moment? If not, I can see if I can make some progress on the PR.

HalidOdat · 2024-10-11T13:04:49Z

I'm quite busy at the moment, but feel free to work on it! 😊

raskad · 2024-12-26T21:27:06Z

@jedel1043 @HalidOdat Could you give this a look? I was looking into making some some changes to some opcodes, but didn't want to rebase this all the time, so it would be nice if we could merge it soon.

- Add todo for storing fp in CallFrame - Start on moving to register VM - Add more binary ops - Add helpter transition methods - Register based logical ops - Register InPrivate - Register GetFunction - Some class opcodes - Register post and pre increment and decrement

jedel1043

Hmm, doing a first pass on this and architecturally I'm wondering why we need to pass the registers as if they were "external state" from the VM.

I would think of registers as being "global memory addresses" that you can use while running in a VM, but this treats them as "function memory addresses", with the disadvantage of having to reserve a new set of registers each time you make a call, even if you don't use all of them.

Can we treat them as "proper" registers, pushing and popping from the stack when we need to avoid clobbering, or are there some limitations with this approach that I'm not seeing right now?

raskad · 2025-01-11T23:46:07Z

Can we treat them as "proper" registers, pushing and popping from the stack when we need to avoid clobbering, or are there some limitations with this approach that I'm not seeing right now?

As far as I can see that approach would be a lot more complex, both from the runtime and probably especially in the bytecompiler. With the current approach, we have the benefit of having a pretty simple mapping of ByteCompiler registers and the runtime register storage.

Also, I have not tested this, but I would guess that having a fixed number of registers that spill over to the stack would cause more copys than this approach when copying from and to the stack. Or at least the performance with this approach is probably more consistent, since we have the max number of used registers and can allocate the required registers in one allocation (if alloc is needed).

[...] with the disadvantage of having to reserve a new set of registers each time you make a call, even if you don't use all of them.

Since the register storage is essentially a sliding window, note that after we hit the maxium call depth for a program, there will be no allocations.
But the question of avoiding register reservation if we do not use them, because of branching, etc. in the code is definitley interesting. As far as I can see, you are right, solving this would require a more dynamic register storage at runtime. But I'm not sure if that is worth the additional complexity.

[...] I'm wondering why we need to pass the registers as if they were "external state" from the VM.

Do you mean why the extra Registers type exists? I added a section in the PR description. Basically I did this just to avoid having to clone register values when passing them to functions that require &JsValue but also &mut Context.

HalidOdat

Sorry for the delay! Amazing work @raskad! :)

I agree with @jedel1043 that registers shouldn't ideally be treated as external state (like V8 does it all in one stack). However, @raskad's approach simplifies the runtime. It’s a big step forward, and we can refactor it in future PRs. Besides that it looks great to me!

Can't approve "my" PR, so just leaving leaving this PR comment 😄

HalidOdat added execution Issues or PRs related to code execution Internal Category for changelog labels Apr 9, 2024

HalidOdat force-pushed the refactor/register-vm branch from 7351c87 to 039bd2b Compare June 27, 2024 20:05

HalidOdat force-pushed the refactor/register-vm branch 2 times, most recently from f98cd87 to 1e0bf01 Compare July 5, 2024 15:26

raskad mentioned this pull request Jul 8, 2024

Fix HomeObject for private class methods #3897

Merged

raskad mentioned this pull request Aug 3, 2024

Optimization fast local variables #3194

Closed

HalidOdat force-pushed the refactor/register-vm branch 2 times, most recently from 5fd2167 to 50e8339 Compare August 3, 2024 11:07

HalidOdat mentioned this pull request Aug 3, 2024

Implement register allocation #3942

Merged

raskad force-pushed the refactor/register-vm branch from 50e8339 to 2bab5ac Compare December 10, 2024 00:22

raskad added this to the next-release milestone Dec 11, 2024

raskad marked this pull request as ready for review December 11, 2024 19:07

raskad requested a review from a team December 11, 2024 19:07

raskad force-pushed the refactor/register-vm branch 2 times, most recently from 0cecbb6 to aeb2e2f Compare December 20, 2024 04:24

HalidOdat and others added 5 commits January 11, 2025 01:07

WIP Register VM

f477b1e

- Add todo for storing fp in CallFrame - Start on moving to register VM - Add more binary ops - Add helpter transition methods - Register based logical ops - Register InPrivate - Register GetFunction - Some class opcodes - Register post and pre increment and decrement

Migrate to register vm

e0edab9

fix doc comment

1ca2d25

fix default register value error

cc1a4a5

fix rebase errors

e83eb8c

raskad force-pushed the refactor/register-vm branch from aeb2e2f to e83eb8c Compare January 11, 2025 01:11

jedel1043 reviewed Jan 11, 2025

View reviewed changes

HalidOdat commented Jan 18, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Register VM #3798

Register VM #3798

HalidOdat commented Apr 9, 2024 •

edited by raskad

Loading

github-actions bot commented Apr 9, 2024 •

edited

Loading

HalidOdat commented Jul 9, 2024

raskad commented Oct 10, 2024

HalidOdat commented Oct 11, 2024

raskad commented Dec 26, 2024

jedel1043 left a comment

raskad commented Jan 11, 2025

HalidOdat left a comment •

edited

Loading

Register VM #3798

Are you sure you want to change the base?

Register VM #3798

Conversation

HalidOdat commented Apr 9, 2024 • edited by raskad Loading

github-actions bot commented Apr 9, 2024 • edited Loading

Test262 conformance changes

HalidOdat commented Jul 9, 2024

raskad commented Oct 10, 2024

HalidOdat commented Oct 11, 2024

raskad commented Dec 26, 2024

jedel1043 left a comment

Choose a reason for hiding this comment

raskad commented Jan 11, 2025

HalidOdat left a comment • edited Loading

Choose a reason for hiding this comment

HalidOdat commented Apr 9, 2024 •

edited by raskad

Loading

github-actions bot commented Apr 9, 2024 •

edited

Loading

HalidOdat left a comment •

edited

Loading