Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow dynamic types in configurables #6760

Open
wants to merge 14 commits into
base: master
Choose a base branch
from

Conversation

xunilrj
Copy link
Contributor

@xunilrj xunilrj commented Nov 26, 2024

Description

This PR implements "dynamic types" for configurables. This means those types which the encoded bytes buffer does not have a know size at compilation time. For example: Vec, Dictionaries, string slices etc...

The major problem these pose is that the ABI json has the offset to their encoded bytes; and if their buffer size changes, that would invalidate the offset of configurables coming after them inside the json.

To solve this issue, these types will have their buffer read indirectly. This means that, at the predefined offset will be another offset, this last one from the start of the data section that will point to their encoded bytes, which can be shared if needed as it is read-only.

If the buffer is not reused, it is expected that it will live at the end of the binary, where it will not invalidate any of the ABI json offsets.

Example:

Each config at the IR has a new field for some "flags". Currently we only have one flag, which is if the decode is direct (0) or indirect(1). Direct means that the encoded bytes are inside the configurable section. Indirect means that the compiler puts an offset inside the configurable section, but the encoded bytes will be append at the end of the binary.

 // IR: Final
     script {
//                                                              \/ ---------- This is new
         ANOTHER_U8 = config u8, abi_decode_in_place_27, 0x03, 0x00, !1
         ...
         VEC_U8 = config { { u64, u64 }, u64 }, abi_decode_in_place_53, 0x0000000000000000, 0x01, !18

The generated asm now have a new ".offset of", which will be materialized into the offset from the start of .data section (NOT the configurable section).

.data:
     ...
     data_Configurable_STRING_SLICE (9) .offset_of data_Dynamic_STRING_SLICE
     ...
     data_Configurable_VEC_U8 (17) .offset_of data_Dynamic_VEC_U8
     ...
     data_Dynamic_STRING_SLICE (0) .bytes[19] 00 00 00 00 00 00 00 0b 48 65 6c 6c 6f 2c 20 53 77 61 79  ........Hello, Sway
     data_Dynamic_VEC_U8 (1) .bytes[8] 00 00 00 00 00 00 00 00  ........

and the generated bytecode will be:

.data_section:
        ...
         0x00002178 .offset of 0x000021e1, as word 465, as hex be bytes ([00, 00, 00, 00, 00, 00, 01, D1])
        ...
         0x000021d8 .offset of 0x000021f4, as word 484, as hex be bytes ([00, 00, 00, 00, 00, 00, 01, E4])
         ...
         0x000021e1 .bytes as hex ([00, 00, 00, 00, 00, 00, 00, 0B, 48, 65, 6C, 6C, 6F, 2C, 20, 53, 77, 61, 79]), len i19, as ascii "........Hello, Sway"
         0x000021f4 .bytes as hex ([00, 00, 00, 00, 00, 00, 00, 00]), len i8, as ascii "........"

The .data section offset can be read directly from the binary as it lives at the start of the binary.

;; ASM: Final program
;; Program kind: Script
.program:
     move $$tmp $pc
     jmpf $zero i4
     DATA_SECTION_OFFSET[0..32]
     DATA_SECTION_OFFSET[32..64]
     CONFIGURABLES_OFFSET[0..32]
     CONFIGURABLES_OFFSET[32..64]

more precisely at:

;; --- START OF TARGET BYTECODE ---

         0x00000000 MOVE R60 $pc                                    ;; [26, 240, 48, 0]
         0x00000004 JMPF $zero 0x4                                  ;; [116, 0, 0, 4]
         0x00000008                                                 ;; [0, 0, 0, 0, 0, 0, 32, 16]   <- DATA SECTION OFFSET
         0x00000010                                                 ;; [0, 0, 0, 0, 0, 0, 0, 0]
         0x00000018 LW R63 R60 0x1                                  ;; [93, 255, 192, 1]
         0x0000001c ADD R63 R63 R60                                 ;; [16, 255, 255, 0]

Checklist

  • I have linked to any relevant issues.
  • I have commented my code, particularly in hard-to-understand areas.
  • I have updated the documentation where relevant (API docs, the reference, and the Sway book).
  • I have added tests that prove my fix is effective or that my feature works.
  • I have added (or requested a maintainer to add) the necessary Breaking* or New Feature labels where relevant.
  • I have done my best to ensure that my PR adheres to the Fuel Labs Code Review Standards.
  • I have requested a review from the relevant team or maintainers.

@xunilrj xunilrj force-pushed the xunilrj/dynamic-types-configurables branch from 0383319 to e4a24e9 Compare November 26, 2024 22:15
Copy link

codspeed-hq bot commented Nov 26, 2024

CodSpeed Performance Report

Merging #6760 will not alter performance

Comparing xunilrj/dynamic-types-configurables (5ee6a00) with master (1192b3f)

Summary

✅ 22 untouched benchmarks

@xunilrj xunilrj force-pushed the xunilrj/dynamic-types-configurables branch 2 times, most recently from 869de4b to 6f3f3be Compare December 2, 2024 11:38
@xunilrj xunilrj self-assigned this Dec 4, 2024
@xunilrj xunilrj force-pushed the xunilrj/dynamic-types-configurables branch from f6ae306 to 4a625ca Compare December 10, 2024 18:05
@xunilrj xunilrj marked this pull request as ready for review December 13, 2024 10:16
@xunilrj xunilrj requested review from a team as code owners December 13, 2024 10:16

assert_eq!(generated_loader_abi, used_loader_abi);
dbg!(&generated_loader_abi_path, &used_loader_abi_path);
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
dbg!(&generated_loader_abi_path, &used_loader_abi_path);


[patch.crates-io]
fuel-abi-types = { git = "https://github.com/FuelLabs/fuel-abi-types", branch = "xunilrj/indirect-configurables" }
fuels = { git = "https://github.com/FuelLabs/fuels-rs", branch = "hal3e/dynamic-configurables" }
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is the plan to wait for the fuels branches to be merged this PR goes out?

xunilrj added a commit to FuelLabs/fuel-abi-types that referenced this pull request Jan 14, 2025
This PR is part of FuelLabs/sway#6760 and it
allows the ABI json to inform if a configurable encoded bytes will be
load directly (at the predefined offset) or indirectly, there will
another offset at the predefined offset.

---------

Co-authored-by: hal3e <[email protected]>
Copy link
Contributor

@jjcnn jjcnn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Only a few minor issues, but I'd like to hear your comments on them before I approve.

Comment on lines +537 to +540
255, 255, 255, 255, 255, 255, 255, 255, // NonConfigurable
1, 0, 0, 0, 0, 0, 0, 0, // U8
0, 0, 0, 0, 0, 0, 0, 24, // VEC_U8_OFFSET
0, 1, 2, 3, 4, 5 // VEC_U8_BYTES
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we have a test for non word-aligned sections as well, or is that not necessary?

metadata:comma_metadata_idx()? {
assert!(flags.len() == 1); // TODO improve error message
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If you're not planning to fix this TODO in this PR, could you create an issue for it, and link to the issue in the comment? Otherwise we'll lose track of TODOs in the code.

VEC_U8: Vec<u8> = Vec::new(),
STRING_SLICE: str = "Hello, Sway",

// The test runner does not have a way to calculate the dynamic part of the data section
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we want to add that to the test runner as a separate issue?

@@ -43,9 +43,31 @@ pub enum ConfigContent {
encoded_bytes: Vec<u8>,
decode_fn: Cell<Function>,
opt_metadata: Option<MetadataIndex>,
indirect: bool,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nit: Using the name indirect creates a double negative - if indirect = false then you're saying "This buffer is not read indirectly", when it would be simpler to say "This buffer is read directly".

I would prefer if it was named direct_read or something similar, with true meaning "This buffer is read directly" and false meaning "This buffer is read indirectly".

Or maybe even scrap the direct/indirect terminology altogether and call it read_from_offset instead?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants