Add FedMLB baseline #2340

alessiomora · 2023-09-11T16:19:06Z

Issue

Implementation of FedMLB for the SoR inititative.

Description

Implementation of FedMLB for the SoR inititative.

Related issues/PRs

Issue #2048

…ading state. Implemented a custom server with MyServer in server.py. Changed main.py accordingly. Updated README.me.

jafermarq

with previous review.

baselines/fedmlb/pyproject.toml

Co-authored-by: Javier <jafermarq@users.noreply.github.com>

jafermarq · 2023-09-28T21:10:12Z

baselines/fedmlb/README.md

+python -m fedmlb.dataset_preparation dataset_config.alpha_dirichlet=0.6 total_clients=500
+```
+Note that, to reproduce those settings, we leverage the `.txt` files
+contained in the `client_data` folder in this project. Such files store


Any chance we could remove the client_data files from the directory ? (they are ~7MB in total). Is there an obvious way of constructing those files via a not-too-complex script? -- we can naturally request people to git-clone them from the original repo you mention below, but that might not be always reliable.

Those files could be constructed via a script by using a Dirichlet distribution just as the original paper. The files you mention, in fact, contain the IDs of the images assigned to that client, leveraging a Dirichlet distribution with a specific concentration parameters to select a certain number of images for a certain label (and then randomly selecting that number of images in the pool of images with that label). Obviously, if you re-run such a script, you could not be able to reproduce that specific per-client dataset compositions unless you know the seed used to set the pseudo-random generation of numbers (and probably running in the same machine).

For this reason, for reproducibility puproses, I decided to exactly compose the clients' dataset as they were crafted in the original paper.

So, in principle, I can produce a script that generates the composition of datasets (basically the .txt files) that follows a Dirichlet distribution of labels among clients with a certain concentration parameter (but datasets would be different from the original code), or I can find a better way of storing the data contained in the files under client_data.

In the original code, you can find those generation scripts here.
For now, I've deleted some unused .txt files from the folder.

Ok. I'll think about this and discuss with the others in the team. Now the files are 4MB (down from 7MB) so that's nice to see. Th
ere are other baselines that also have some not-so-small files as part of their proposed PR, so I'll update this thread once i figure out what's the best way to deal with these. Maybe keeping them is fine. Let's see...

jafermarq

Hi @alessiomora ,

Just a small comment for the pyproject.toml. I also enabled the tests but a small formatting issue was flagged.

baselines/fedmlb/pyproject.toml

jafermarq

Looks great!

alessiomora and others added 24 commits August 30, 2023 12:13

First FedMLB working with partial results.

d0debbf

Flwr 1.5

bb8f908

Update main.py

0e121ee

Changed requirement to tf==2.12.0

20b3e3d

Implmented stop and restart functionality for the training. Saving/lo…

8f2c2d7

…ading state. Implemented a custom server with MyServer in server.py. Changed main.py accordingly. Updated README.me.

An exemplary bash script to break a simulation in parts.

6c698a2

Added configurations to manage start and stop from config.

a9e32ba

Added utility to load and save a dictionary to file with pickle.

13debb1

Added results for table 1b.

cb3f47e

Added formulation

c3c1261

Added formulation corrected

380de5c

Added formulation corrected

0058413

Added formulation corrected

06b77e7

Added formulation corrected

49e9884

Added formulation corrected

adc394e

Added formulation corrected

fec5cf4

Added description of special configs.

00e6dfb

Added charts

67b7c1b

Added charts README.md

da60365

Added charts README.md

c22a9e5

Added charts README.md

5fe7550

Added charts png

e4bf5f3

Added charts png

113239c

Updated charts png

a7e2c38

jafermarq added the summer-of-reproducibility About a baseline for Summer of Reproducibility label Sep 11, 2023

alessiomora added 5 commits September 12, 2023 09:27

Added tiny-imagenet charts

bfd072d

Added tiny-imagenet charts

1765913

Add a simple bash script to divide the simulations in batch of rounds.

7e29f25

Added tensorboard dev reference for results.

0e9b0b5

Added tensorboard dev reference for results.

fc6be17

jafermarq reviewed Sep 26, 2023

View reviewed changes

baselines/fedmlb/pyproject.toml Outdated Show resolved Hide resolved

alessiomora and others added 10 commits September 27, 2023 11:56

Updated python version in README.md

4285099

Co-authored-by: Javier <jafermarq@users.noreply.github.com>

Updated dataset_preparation.py

83f475a

Co-authored-by: Javier <jafermarq@users.noreply.github.com>

Updated README.md

a0d91e2

Co-authored-by: Javier <jafermarq@users.noreply.github.com>

Update README.md

3823d73

Co-authored-by: Javier <jafermarq@users.noreply.github.com>

Updated README.md

495e06f

Co-authored-by: Javier <jafermarq@users.noreply.github.com>

Updated python version range in pyproject.toml

d03a5e8

Co-authored-by: Javier <jafermarq@users.noreply.github.com>

Removed warning in README for TinyImagenet.

d73ac66

Updated README.md

1ef0fa9

Updated tables format in README.md

7c5ed3a

Updated img command in README.md

2509794

jafermarq reviewed Sep 28, 2023

View reviewed changes

Removed unused files.

013ac39

jafermarq reviewed Oct 3, 2023

View reviewed changes

baselines/fedmlb/pyproject.toml Outdated Show resolved Hide resolved

alessiomora and others added 9 commits October 3, 2023 10:29

Added description and authors.

ac09c7f

Fixed formatting.

c8df402

Slightly fixing formatting

2803642

Fixed default config

2058c36

Merge remote-tracking branch 'origin/main' into fedmlb_gpu

91189e5

Merge branch 'main' into fedmlb_gpu

7e02c24

Merge branch 'main' into fedmlb_gpu

c1c0654

Merge branch 'main' into fedmlb_gpu

f3f5b5e

Merge branch 'main' into fedmlb_gpu

ab5c780

jafermarq previously approved these changes Oct 10, 2023

View reviewed changes

jafermarq changed the title ~~FedMLB~~ Add FedMLB baseline Oct 10, 2023

Changed reference to results from tensorboard.dev to google drive

957cf40

alessiomora dismissed jafermarq’s stale review via 957cf40 October 10, 2023 13:22

jafermarq approved these changes Oct 10, 2023

View reviewed changes

Merge branch 'main' into fedmlb_gpu

f9bc37f

jafermarq merged commit 4f9ce5c into adap:main Oct 10, 2023
26 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add FedMLB baseline #2340

Add FedMLB baseline #2340

alessiomora commented Sep 11, 2023 •

edited

Loading

jafermarq left a comment

jafermarq Sep 28, 2023

alessiomora Sep 29, 2023 •

edited

Loading

jafermarq Sep 29, 2023

jafermarq left a comment

jafermarq left a comment

Add FedMLB baseline #2340

Add FedMLB baseline #2340

Conversation

alessiomora commented Sep 11, 2023 • edited Loading

Issue

Description

Related issues/PRs

jafermarq left a comment

Choose a reason for hiding this comment

jafermarq Sep 28, 2023

Choose a reason for hiding this comment

alessiomora Sep 29, 2023 • edited Loading

Choose a reason for hiding this comment

jafermarq Sep 29, 2023

Choose a reason for hiding this comment

jafermarq left a comment

Choose a reason for hiding this comment

jafermarq left a comment

Choose a reason for hiding this comment

alessiomora commented Sep 11, 2023 •

edited

Loading

alessiomora Sep 29, 2023 •

edited

Loading