FederatedRuntime Workflow for CI Pipeline - 301 Watermarking notebook run #1267

noopurintel · 2025-01-13T12:27:16Z

Added GitHub workflow "Federated Runtime 301 MNIST Watermarking" to run the notebook with same name.
Enabled it to run for PR pipelines - with 3 rounds. # 5 rounds will take more time leading to longer duration of CI pipeline run, thus used 3.
Corrected Watermaking -> Watermarking at all relevant places.
Added pytest file wf_federated_runtime_tests.py with a test test_federated_runtime_301_watermarking
Multiple changes across end_to_end helper files.

Successful run as part of PR pipeline itself - https://github.com/securefederatedai/openfl/actions/runs/12802027154?pr=1267

Successful run with display of output - https://github.com/noopurintel/openfl/actions/runs/12791676959

Successful display of error, if any - https://github.com/noopurintel/openfl/actions/runs/12784011165/job/35636177976 (induced explicitly for the testing purpose)

… run Signed-off-by: noopur <[email protected]>

Signed-off-by: noopur <[email protected]>

scngupta-dsp

Overall, this is a promising starting point for integrating FederatedRuntime test cases into the CI test pipeline, as it effectively demonstrates multiple functionalities of the Workflow Interface.

During an internal discussion with @noopurintel, @payalcha, and @ishant162, a concern was raised regarding the challenges in automating FederatedRuntime test cases. Specifically, many parameters required for executing the Workflow Interface are currently hardcoded in the Jupyter notebook, limiting flexibility and scalability.

Proposed Solution
To address this issue partially, I developed a Python script that extracts the execution logic from the Jupyter notebook. This script provides a way to run the FederatedRuntime notebook outside of the Jupyter environment, making it easier to automate and integrate into CI pipelines.

Below is the experimental Python script:

#######################################################################
# Test script (experimental basis) to execute FederatedRuntime 
# notebook from outside of Jupyter notebook 
#######################################################################

# Instantiate FederatedRuntime
from openfl.experimental.workflow.runtime import FederatedRuntime

director_info = {
    'director_node_fqdn':'localhost',
    'director_port':50050,
}

collaborator_names = ["Portland", "Seattle"]

federated_runtime = FederatedRuntime(
    collaborators=collaborator_names,
    director=director_info, 
    notebook_path='/home/scngupta/openfl_latest/openfl/openfl-tutorials/experimental/workflow/FederatedRuntime/101_MNIST/workspace/MNIST_FederatedRuntime_reduced.ipynb'
)

# Check to make sure that envoys are connected 
federated_runtime.get_envoys()

# Define a dummy flow to enable runtime execution 
from openfl.experimental.workflow.interface import FLSpec
from openfl.experimental.workflow.placement import aggregator, collaborator

class AutomationFlow(FLSpec):
    """
    Flow to automate the FederatedRuntime execution
    NOTE: This flow will not execute on the Federation  
    """

    def __init__(self, **kwargs):
        super().__init__(**kwargs)


# Run the flow (in reality the flow defined in jupyter notebook will run)
flflow = AutomationFlow()
flflow.runtime = federated_runtime
flflow.run()

Modified jupyter notebook that works with above can be referenced at
MNIST_FederatedRuntime.txt

Next Steps
This script is provided as an experimental enhancement and may not address certain corner cases. I would recommend to try this approach and if it is successful, we can refine and formally adopt it in subsequent PRs. WDYT ?

.github/workflows/federated_runtime.yml

...rimental/workflow/FederatedRuntime/301_MNIST_Watermarking/workspace/MNIST_Watermarking.ipynb

Signed-off-by: noopur <[email protected]>

openfl/experimental/workflow/runtime/federated_runtime.py

scngupta-dsp

Thanks Noopur. Looks good !

Signed-off-by: noopur <[email protected]>

teoparvanov

LGTM, thanks @noopurintel ! It's great to have an E2E test with the FederatedRuntime.

MasterSkepticista

@noopurintel Please hold off on doubling the timeout.

I am testing a couple of alternatives to reduce calls on downloading MNIST. Average CI time is 6 mins, and 15 mins is a safeguard on anomalies. Setting it to 30 mins causes CI to be blocked for long on other PRs.

We recently had a situation where CI compute was blocked for >4 hours solely for this reason. With current setting, failed actions can deallocate resources for "working" PRs faster.

noopurintel · 2025-01-16T10:13:42Z

@MasterSkepticista - increased timeout is anyways a temporary fix to unblock the PRs and testing process. Once the download issue/slowness is resolved, will revert the timeout to its previous value (i.e. 15m).

noopurintel added 3 commits January 13, 2025 12:26

FederatedRuntime Workflow for CI Pipeline - 301 Watermarking notebook…

fbf358a

… run Signed-off-by: noopur <[email protected]>

Display output on screen

d1a9744

Signed-off-by: noopur <[email protected]>

5 Rounds

e7512e0

Signed-off-by: noopur <[email protected]>

noopurintel marked this pull request as ready for review January 13, 2025 12:54

rahulga1 approved these changes Jan 13, 2025

View reviewed changes

noopurintel added 3 commits January 13, 2025 13:02

Removed extra ) bracket

cb4dcbe

Signed-off-by: noopur <[email protected]>

Timeout of 30 min due to 5 rounds

903ca3e

Signed-off-by: noopur <[email protected]>

End the loop after all rounds

d1f42e8

Signed-off-by: noopur <[email protected]>

ishant162 approved these changes Jan 14, 2025

View reviewed changes

scngupta-dsp suggested changes Jan 14, 2025

View reviewed changes

noopurintel marked this pull request as draft January 15, 2025 06:16

noopurintel added 10 commits January 15, 2025 06:16

Review comments incor

024220c

Signed-off-by: noopur <[email protected]>

Merge branch 'develop' into develop

49bfe5d

Retry the envoy fetch

2865d67

Signed-off-by: noopur <[email protected]>

Added invalid code just to verify negative scenario

4293de0

Signed-off-by: noopur <[email protected]>

Added invalid code just to verify negative scenario

60d4ff4

Signed-off-by: noopur <[email protected]>

20m job timeout

c1d8106

Signed-off-by: noopur <[email protected]>

Revert invalid code and stdout notebook run

6b64ff4

Signed-off-by: noopur <[email protected]>

Use markdown with stdout

9edbb2f

Signed-off-by: noopur <[email protected]>

Induced error for testing

eda05cf

Signed-off-by: noopur <[email protected]>

Reverted error, added 10s sleep in github fetch logic

e9a2b37

Signed-off-by: noopur <[email protected]>

noopurintel marked this pull request as ready for review January 15, 2025 08:20

noopurintel added 2 commits January 15, 2025 08:23

Code format check

cb14071

Signed-off-by: noopur <[email protected]>

pytest for the notebook

5a164da

Signed-off-by: noopur <[email protected]>

noopurintel marked this pull request as draft January 15, 2025 15:13

noopurintel added 2 commits January 15, 2025 15:32

Pip install ipython ipykernel

487d6e5

Signed-off-by: noopur <[email protected]>

Minor changes

dbe1d7b

Signed-off-by: noopur <[email protected]>

noopurintel marked this pull request as ready for review January 15, 2025 15:59

noopurintel added 2 commits January 15, 2025 16:06

Test summary step corrected for wf_functional_e2e workflow

edd6c75

Signed-off-by: noopur <[email protected]>

3 rounds instead of 5

3e3a52b

Signed-off-by: noopur <[email protected]>

noopurintel added 3 commits January 15, 2025 22:31

Merge branch 'develop' into develop

0eeb3e9

Job name change

299628f

Signed-off-by: noopur <[email protected]>

Merge branch 'develop' into develop

ebe0a7b

ishant162 reviewed Jan 16, 2025

View reviewed changes

openfl/experimental/workflow/runtime/federated_runtime.py Outdated Show resolved Hide resolved

ishant162 reviewed Jan 16, 2025

View reviewed changes

openfl/experimental/workflow/runtime/federated_runtime.py Outdated Show resolved Hide resolved

scngupta-dsp approved these changes Jan 16, 2025

View reviewed changes

noopurintel added 4 commits January 16, 2025 07:09

Review comments incorp

69b1d1a

Signed-off-by: noopur <[email protected]>

Increased timeout to 30m for CI pipeline jobs

55eaa1c

Signed-off-by: noopur <[email protected]>

Increased timeout to 30m for CI pipeline jobs

c4529c8

Signed-off-by: noopur <[email protected]>

Merge branch 'develop' into develop

ff862bf

teoparvanov approved these changes Jan 16, 2025

View reviewed changes

teoparvanov merged commit b8e2c70 into securefederatedai:develop Jan 16, 2025
21 checks passed

MasterSkepticista reviewed Jan 16, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FederatedRuntime Workflow for CI Pipeline - 301 Watermarking notebook run #1267

FederatedRuntime Workflow for CI Pipeline - 301 Watermarking notebook run #1267

noopurintel commented Jan 13, 2025 •

edited

Loading

scngupta-dsp left a comment

scngupta-dsp left a comment

teoparvanov left a comment

MasterSkepticista left a comment •

edited

Loading

noopurintel commented Jan 16, 2025

FederatedRuntime Workflow for CI Pipeline - 301 Watermarking notebook run #1267

FederatedRuntime Workflow for CI Pipeline - 301 Watermarking notebook run #1267

Conversation

noopurintel commented Jan 13, 2025 • edited Loading

scngupta-dsp left a comment

Choose a reason for hiding this comment

scngupta-dsp left a comment

Choose a reason for hiding this comment

teoparvanov left a comment

Choose a reason for hiding this comment

MasterSkepticista left a comment • edited Loading

Choose a reason for hiding this comment

noopurintel commented Jan 16, 2025

noopurintel commented Jan 13, 2025 •

edited

Loading

MasterSkepticista left a comment •

edited

Loading