-
Notifications
You must be signed in to change notification settings - Fork 757
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improving Precompute performance #6931
Comments
The lock shouldn't be related, no. That lock is only taken when creating a new HeapType (edit: or Type). The inlining hypothesis seems likely. Perhaps you could look for places that handle the |
I experimented with moving all the That does not help, though. Runtime is the same, and the profile shows So this still seems like a surprising amount of overhead. @tlively I wonder if we can at least make |
I once tried to change the representation of reference types to be pointers to HeapType infos with their nullability encoded in their low bits, but didn't quite get it working. It would be good to try again. This would also remove the need to take the lock when creating a |
…*() methods (#6936) This just moves code around. As a result, isRef() vanishes entirely from the profiling traces in #6931, since now the core isRef/Tuple/etc. methods are all inlineable. This also required some reordering of wasm-type.h, namely to move HeapType up front. No changes to that class otherwise. TypeInfo is now in the header. getTypeInfo is now a static method on Type. This has the downside of moving internal details into the header, and it may increase compile time a little. The upside is making the --precompute benchmark from #6931 significantly faster, 33%, and it will also help the many Type::isNonNullable() etc. calls we have scattered around the codebase in other passes too.
This avoids creating a large Literals (SmallVector of Literal) and then copying it. All the places that construct GCData do not need the Literals afterwards. This gives a 7% speedup on the --precompute benchmark from #6931
This makes Precompute about 5% faster on a WasmGC binary. Inspired by #6931.
I checked the profile here again today and
|
This makes Precompute about 5% faster on a WasmGC binary. Inspired by #6931.
Running on the binary from Google Sheets,
(1 core to avoid noise from multithreading; validation disabled to focus on the optimization)
The top item from
perf report
isReading the code, I can't see an obvious reason for that slowness.
@tlively You mentioned some lock you might look into with
wasm::Type
- is this related? I don't seem to see a lock taken here though.My only other guess is that this may just be called many times, and by not being in a header, it isn't getting inlined.
The text was updated successfully, but these errors were encountered: