AggregaThor is a distributed SGD framework layered over TensorFlow that integrates pluggable Byzantine-resilient gradient aggregation rules (e.g. Krum, Bulyan, coordinate-wise median) and supports asynchronous parameter-server topologies. It tolerates both Byzantine worker failures and unreliable network links, and its modular design allows aggregation rules and communication backends to be swapped independently.
This page was last edited on 2024-03-22.
This page was last edited on 2024-03-22.