Enable run cross-validation without training workflow and examples #2035

yhwen · 2023-09-26T20:34:07Z

…xample for demonstration.

Fixes # .

Description

Enables re-running cross-site validation without the training workflow. Also includes a cifar10_fedavg example to demonstrate how to configure and run cross-site validation independently.

Types of changes

Non-breaking change (fix or new feature that would not break existing functionality).
Breaking change (fix or new feature that would cause existing functionality to change).
New tests added to cover the changes.
Quick tests passed locally by running ./runtest.sh.
In-line docstrings updated.
Documentation updated.

...ples/advanced/cross-validation-without-training/cifar10_fedavg/config/config_fed_server.json

...anced/cross-validation-without-training/cifar10_fedavg/custom/pt/learners/cifar10_learner.py

...cross-validation-without-training/cifar10_fedavg/custom/pt/learners/cifar10_model_learner.py

...ed/cross-validation-without-training/cifar10_fedavg/custom/pt/utils/cifar10_data_splitter.py

nvflare/app_opt/pt/file_model_locator.py

chesterxgchen

The example seems to duplicate the CIFAR10 examples, with a lot of details that are not needed.

Can we simplify this?

yhwen · 2023-09-27T01:20:37Z

The example seems to duplicate the CIFAR10 examples, with a lot of details that are not needed.

Can we simplify this?

I will explain in the meeting the point of making the cross-site validation re-run work without the training workflow, and how this example is built (exactly the same as cifar10_fedavg).

…el_file_name and best_global_model_file_name, setting to the absolute model paths.

examples/advanced/cross-validation-without-training/models/server/best_FL_global_model.pt

examples/advanced/cross-validation-without-training/models/server/FL_global_model.pt

examples/advanced/cross-validation-without-training/models/client/local_model.pt

examples/advanced/cross-validation-without-training/models/client/best_local_model.pt

examples/advanced/cross-validation-without-training/README.md

...anced/cross-validation-without-training/cifar10_fedavg/custom/pt/learners/cifar10_learner.py

...ss-validation-without-training/cifar10_fedavg/custom/pt/learners/cifar10_scaffold_learner.py

...cross-validation-without-training/cifar10_fedavg/custom/pt/learners/cifar10_model_learner.py

yanchengnv

I unresolved some issues raised before.

nvflare/app_opt/pt/file_list_model_locator.py

nvflare/app_common/np/np_model_locator.py

nvflare/app_opt/pt/file_list_model_locator.py

yhwen · 2023-11-29T16:44:44Z

I unresolved some issues raised before.

yanchengnv

Some additional changes needed in np_model_locator. See comments.

nvflare/app_common/np/np_model_locator.py

yanchengnv

LGTM

YuanTingHsieh

Overall LGTM.
My only concern is those binary files.

Would it be possible to have a script that generates them?

For example:

mock_train.py

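import numpy as np

# Stand-in for a trained model: a small fixed array.
result = np.array([1, 2, 3])

# np.save needs a target file; "mock_model.npy" is a placeholder name.
np.save("mock_model.npy", result)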

This way it might be clearer for the users.
We can do this in another PR as well.

examples/hello-world/hello-numpy-cross-val/README.md

YuanTingHsieh

We can address the data preparation in the next PR.

…only examples.

yhwen · 2023-12-06T16:33:18Z

I unresolved some issues raised before.

We can address the data preparation in the next PR.

Changed to use a script to generate the pre-trained models. Removed the binary model files.
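
A generation script along these lines could look like the following. This is a minimal sketch, not the script added in this PR: the `generate_mock_models` helper, the site names, the `mock_model.npy` file name, and the array shape are all assumptions made for illustration.

```python
import os
import tempfile

import numpy as np


def generate_mock_models(root_dir, sites=("server", "site-1", "site-2"), shape=(3, 3), seed=0):
    """Write one small random array per site as a stand-in for a pre-trained model.

    Returns a dict mapping each site name to the absolute path of its .npy file.
    """
    rng = np.random.default_rng(seed)
    paths = {}
    for site in sites:
        site_dir = os.path.join(root_dir, site)
        os.makedirs(site_dir, exist_ok=True)
        # Hypothetical file name; the real example may use a different one.
        path = os.path.abspath(os.path.join(site_dir, "mock_model.npy"))
        np.save(path, rng.random(shape).astype(np.float32))
        paths[site] = path
    return paths


if __name__ == "__main__":
    # Generate into a temporary directory and load each file back as a check.
    with tempfile.TemporaryDirectory() as tmp:
        for site, path in generate_mock_models(tmp).items():
            print(site, np.load(path).shape)
```

Generating the files at setup time keeps binary artifacts out of the repository and makes it visible to users what the "pre-trained" models contain.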

examples/hello-world/hello-numpy-cross-val/README.md

chesterxgchen

LGTM with minor readme wording changes

yhwen · 2023-12-06T18:40:49Z

/build

YuanTingHsieh · 2023-12-07T00:05:43Z

/build

Enable re-run cross-validation without training workflow, added the e…

3e6967e

…xample for demonstration.

yhwen requested review from chesterxgchen, IsaacYangSLA, YuanTingHsieh and yanchengnv September 26, 2023 20:34

codestyle fix.

70a88e4

chesterxgchen reviewed Sep 26, 2023

...ples/advanced/cross-validation-without-training/cifar10_fedavg/config/config_fed_server.json

chesterxgchen reviewed Sep 26, 2023

...anced/cross-validation-without-training/cifar10_fedavg/custom/pt/learners/cifar10_learner.py

chesterxgchen reviewed Sep 26, 2023

...cross-validation-without-training/cifar10_fedavg/custom/pt/learners/cifar10_model_learner.py

chesterxgchen reviewed Sep 26, 2023

...ed/cross-validation-without-training/cifar10_fedavg/custom/pt/utils/cifar10_data_splitter.py

chesterxgchen reviewed Sep 26, 2023

nvflare/app_opt/pt/file_model_locator.py

chesterxgchen requested changes Sep 26, 2023

yhwen and others added 5 commits September 27, 2023 09:33

Added README.md to explain how the example was built.

6f4eb66

updated Readme.md

cdc3c34

Merge branch 'main' into rerun_cross_validation

c4e1226

re-engineer the re-run cross-validation, making use of the global_mod…

cc19063

…el_file_name and best_global_model_file_name, setting to the absolute model paths.

updated the README.

a2417e9
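
The commit above that sets `global_model_file_name` and `best_global_model_file_name` to absolute model paths can be illustrated roughly as follows. This is only a sketch under stated assumptions: the `absolute_model_args` helper and the surrounding component structure (including the `"persistor"` id) are hypothetical, and how NVFlare consumes these arguments is not shown in this thread. The two `.pt` file names are taken from the file list in this PR.

```python
import json
import os


def absolute_model_args(model_dir):
    """Build the two file-name arguments as absolute paths under model_dir."""
    model_dir = os.path.abspath(model_dir)
    return {
        "global_model_file_name": os.path.join(model_dir, "FL_global_model.pt"),
        "best_global_model_file_name": os.path.join(model_dir, "best_FL_global_model.pt"),
    }


if __name__ == "__main__":
    # "models/server" mirrors a directory listed in this PR; adjust to your layout.
    args = absolute_model_args("models/server")
    # Hypothetical component entry: the "id" value and the surrounding
    # structure are placeholders, not copied from the PR's config file.
    component = {"id": "persistor", "args": args}
    print(json.dumps(component, indent=2))
```

Resolving the paths up front means the validation run does not depend on the working directory of a previous training job.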