Add FL+XGBoost Baseline #2226

Merged · 197 commits into adap:main · Dec 7, 2023

Conversation

@Aml-Hassan-Abd-El-hamid (Contributor) commented on Aug 20, 2023

Closes Issue #2220

Checklist

(I'll be adding more details later as I go)

  • Centralized model:

    • Prepare Datasets
    • Config the model
    • Evaluate on:
      • a9a
      • cod-rna (differs from the paper's result by around +3% accuracy)
      • ijcnn1
      • abalone (differs from the paper's result by around +3.5 MSE)
      • cpusmall
      • space_ga
  • FedXGBllr:

    • Divide the dataset between clients (see the partitioning sketch after this checklist)
    • Evaluate individual clients' performance on their local datasets for comparison purposes
    • Implement the FedXGBllr strategy following Algorithm 1 from the paper.
    • Evaluate on:
      • 2 clients:

        • a9a
        • cod-rna
        • ijcnn1
        • abalone
        • cpusmall
        • space_ga
      • 5 clients:

        • a9a
        • cod-rna
        • ijcnn1
        • abalone
        • cpusmall
        • space_ga
      • 10 clients:

        • a9a
        • cod-rna
        • ijcnn1
        • abalone
        • cpusmall
        • space_ga
  • Testing

  • Documentation
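
For the dataset-partitioning item above, here is a minimal sketch of how the rows of one of these datasets could be split IID across the 2-, 5-, and 10-client settings. The function name `partition_dataset` and the uniform shuffled split are illustrative assumptions, not necessarily the scheme used in this baseline:

```python
import numpy as np


def partition_dataset(X: np.ndarray, y: np.ndarray, num_clients: int, seed: int = 0):
    """Shuffle the rows and split them into `num_clients` roughly equal IID shards."""
    rng = np.random.default_rng(seed)
    indices = rng.permutation(len(X))
    shards = np.array_split(indices, num_clients)
    return [(X[idx], y[idx]) for idx in shards]


# toy data standing in for one of the LIBSVM datasets listed above
X = np.random.rand(1000, 123)
y = np.random.randint(0, 2, size=1000)

# e.g. the 5-client setting from the checklist
client_partitions = partition_dataset(X, y, num_clients=5)
print([len(part[0]) for part in client_partitions])  # five shards of 200 rows each
```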

@Aml-Hassan-Abd-El-hamid marked this pull request as a draft on August 20, 2023 19:03
@jafermarq added the summer-of-reproducibility label (About a baseline for Summer of Reproducibility) on Aug 20, 2023
@Aml-Hassan-Abd-El-hamid (Contributor, Author) commented on Aug 27, 2023

Hi @jafermarq

This is the first working version of the centralized baseline; these are the results I have so far:

Results for a9a , Task: BINARY , Train: 0.8558051923234419 , Test: 0.8548849398083695
Results for cod-rna , Task: BINARY , Train: 0.9796843727856992 , Test: 0.96986704954409
Results for ijcnn1 , Task: BINARY , Train: 0.9964114706656143 , Test: 0.9611693395630762

Results for abalone , Task: REG , Train: 1.1268618022311907 , Test: 4.806583543721308
Results for cpusmall , Task: REG , Train: 0.7895664538911239 , Test: 7.022483517505609
Results for space_ga , Task: REG , Train: 0.024788769022399276 , Test: 0.02961448763369315

Results for the centralized model after shuffling the dataset instead of taking the first 75% of rows:

Results for a9a , Task: BINARY , Train: 0.8539488411454779 , Test: 0.8491524035705511
Results for cod-rna , Task: BINARY , Train: 0.9767785885658168 , Test: 0.9733166756792034
Results for ijcnn1 , Task: BINARY , Train: 0.990865561694291 , Test: 0.9874352822326051

Results for abalone , Task: REG , Train: 2.12119987844593 , Test: 4.6848017634240415
Results for cpusmall , Task: REG , Train: 1.7059859732195364 , Test: 9.064894281750984
Results for space_ga , Task: REG , Train: 0.024452750881655515 , Test: 0.03206095305593539

Results for YearPredictionMSD , Task: REG , Train: 40.99316293430138 , Test: 76.41236208673367
# shuffled dataset, not the original paper's hyperparameters
Results for cpusmall , Task: REG , Train: 3.4195770468748443 , Test: 7.261266237603397

I use the same metrics as the paper (accuracy for binary classification and mean squared error for regression). I tried different numbers of estimators (100 and 500); the results shown are for n_estimators = 500, since they are slightly better, though not by much, than those with n_estimators = 100.
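
For context, here is a minimal sketch of the kind of centralized run described above (shuffled 75/25 split, n_estimators = 500, accuracy for binary tasks and MSE for regression). The file paths are placeholders, and everything apart from the details mentioned in this thread is an assumption:

```python
from sklearn.datasets import load_svmlight_file
from sklearn.metrics import accuracy_score, mean_squared_error
from sklearn.model_selection import train_test_split
from xgboost import XGBClassifier, XGBRegressor


def run_centralized(svm_file: str, task: str) -> float:
    """Train a centralized XGBoost model on one LIBSVM file and report the paper's metric."""
    X, y = load_svmlight_file(svm_file)
    # shuffled 75/25 split instead of taking the first 75% of rows
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, train_size=0.75, shuffle=True, random_state=0
    )
    if task == "BINARY":
        # LIBSVM binary labels are -1/+1, so map them to 0/1 for XGBoost
        model = XGBClassifier(n_estimators=500)
        model.fit(X_train, (y_train > 0).astype(int))
        return accuracy_score((y_test > 0).astype(int), model.predict(X_test))
    model = XGBRegressor(n_estimators=500)
    model.fit(X_train, y_train)
    return mean_squared_error(y_test, model.predict(X_test))


# placeholder paths to the downloaded LIBSVM files
print("a9a accuracy:", run_centralized("a9a.libsvm", "BINARY"))
print("abalone MSE:", run_centralized("abalone.libsvm", "REG"))
```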

The results differ somewhat from the ones reported in the paper:

        a9a, Test: 0.849
        cod-rna, Test: 0.939
        ijcnn1, Test: 0.963

        abalone, Test: 1.3
        cpusmall, Test: 6.7
        space_ga, Test: 0.024

        YearPredictionMSD, Test: 80.5

As you can see, the main differences are in the cod-rna and abalone datasets.

If you can give me any tips that can help me with that, please do.

If you have any notes on the implementation style so far, please let me know so I can address them.

I'll go through the paper and the notebook you provided again to see if I missed any details, and I'll also try to reach out to one of the authors to see if they can help with that.

@Aml-Hassan-Abd-El-hamid deleted the Aml-SoR-FL+XGBoost branch on November 6, 2023 20:23
@Aml-Hassan-Abd-El-hamid restored the Aml-SoR-FL+XGBoost branch on November 6, 2023 20:23
@jafermarq (Contributor) left a comment

Hi @Aml-Hassan-Abd-El-hamid,

I made some changes to the code:

  • removed top-level .gitignore changes and moved them to a .gitignore local to your baseline.
  • minor change to __init__.py
  • moved print in main.py after load_single_dataset() into that function. (<-- could you double check if this looks good to you?)
  • Specified types in missing places (e.g. in client.py)

Just a couple of small requests:

  • could you describe either in the README.md or at the top of your server.py why you need a custom server class? (I know why, but mentioning this would be useful for others out there.) Keeping it brief (~100 words) is sufficient.
  • Would it make sense to move your sweep.yaml inside conf/?

@Aml-Hassan-Abd-El-hamid (Contributor, Author) commented

Hi @jafermarq,
Thank you very much for your help.
All the changes are good with me.
Moving the print statement from main.py into load_single_dataset() in dataset.py is totally fine; it's effectively the same thing, since both are called once at the start of the program. A rough sketch of the relocated call is shown below.

  • For the custom server class description, I'll work on adding that ASAP.
  • I tried to move sweep.yaml inside conf/ before, but ended up with an error because sweep.yaml couldn't find main.py.
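
A rough, hypothetical sketch of what the relocated print might look like inside load_single_dataset(); the signature and body here are illustrative assumptions, not the actual code in dataset.py:

```python
from typing import Tuple

import numpy as np
from sklearn.datasets import load_svmlight_file


def load_single_dataset(task_type: str, dataset_path: str) -> Tuple[np.ndarray, np.ndarray]:
    """Load one LIBSVM file and log its shape once, at load time."""
    X, y = load_svmlight_file(dataset_path)
    X = X.toarray()
    # the print that previously lived in main.py now sits next to the loading logic,
    # so it still runs exactly once per dataset load
    print(f"Loaded {dataset_path} ({task_type}): {X.shape[0]} samples, {X.shape[1]} features")
    return X, y
```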

@jafermarq previously approved these changes on Nov 14, 2023

@jafermarq (Contributor) left a comment

@Aml-Hassan-Abd-El-hamid, this looks good! We'll merge it very soon. 💯

@jafermarq enabled auto-merge (squash) on November 14, 2023 19:48
@jafermarq merged commit 306e9f6 into adap:main on Dec 7, 2023
27 checks passed