Skip to content

Debugging Federated Learning Pipeline #281

Closed Answered by yanchengnv
LSnyd asked this question in Q&A
Discussion options

You must be logged in to vote

Thanks for the question.

Debugging a distributed and componentized system is quite challenging in general. By "componentized", I mean that the system function is achieved by a collection of configured components (Python objects).

That said, we usually follow these steps:

  • The key is to determine which component(s) caused the error. From the error message (or exception traceback), try to determine which component is complaining.
  • Components are configured in various config JSON files (config_fed_server.json, config_fed_client.json in your app definition). These objects are created at the time of system sttartup.
  • For errors on Server, check config_fed_server.json; for errors on Client, check…

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@LSnyd
Comment options

Answer selected by yanchengnv
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants