[Enhancement Request] Integrate Plato into Sedna as a backend for supporting federated learning - Phase one #116

XinYao1994 · 2021-07-07T06:12:15Z

This is a PR for integrating Plato into Sedna to support federated learning (#50). @li-ch @baochunli @jaypume

We currently support Fedavg and Mistnet in Sedna via Plato.
We add one example for each federated learning job.

Finished

A tool to automatically generate Plato configuration file based on the CRD.
~~Replace the communication libs with Sedna build-in libs.~~ S3 and asyncio are used.
Examples and demo presentation
- CV: Yolo-v5 demo
Trainer and Estimator
datasource is from sedna dataset

kubeedge-bot · 2021-07-07T06:12:24Z

Welcome @XinYao1994! It looks like this is your first PR to kubeedge/sedna 🎉

JoeyHwong-gk · 2021-07-07T11:23:25Z

examples/federated_learning/mistnet/agg_worker.py

@@ -0,0 +1,8 @@
+import sedna.service.server


useless import

JoeyHwong-gk · 2021-07-07T11:26:00Z

examples/federated_learning/mistnet/client.yml

@@ -0,0 +1,68 @@
+clients:


We may need to discuss whether this yaml file needs to be generated by the developer or just for plato compatibility.

We have a discussion with @jaypume, this Plato configuration file will be automatically generated based on the CRD.

JoeyHwong-gk · 2021-07-07T11:28:40Z

lib/requirements.txt

this seems scary, since here add all dependences for all cases.

JoeyHwong-gk · 2021-07-07T11:33:55Z

lib/sedna/service/server/aggregation.py

+
+from plato.servers import mistnet
+
+class MistnetServer(mistnet.Server):


This seems to work directly with import plato

Yes, but we want to keep transparency when the users are using Sedna. So we package the Plato server within the Sedna lib. Sedna's communication libs (transmitter) are under development @jaypume. After that, we will replace the communication libs with Sedna build-in libs.

JoeyHwong-gk · 2021-07-07T11:36:09Z

lib/sedna/datasources/__init__.py

@@ -130,3 +130,23 @@ def parse(self, *args, **kwargs):
            return
        self.x = pd.concat(x_data)
        self.y = pd.concat(y_data)
+
+# import os
+# os.environ['config_file'] = '/home/work/client.yml'


it is recommended to clean up unwanted codes.

JoeyHwong-gk · 2021-07-07T11:37:48Z

lib/sedna/core/federated_learning/federated_learning.py

@@ -43,6 +42,7 @@ def __init__(self, estimator, aggregation="FedAvg"):
        super(FederatedLearning, self).__init__(
            estimator=estimator, config=config)
        self.aggregation = ClassFactory.get_cls(ClassType.FL_AGG, aggregation)
+        self.transmitter = ClassFactory.get_cls(ClassType.TRANSMITTER, transmitter)


test_transmitter doesn't seem to be registered in sedna.

This function is under development. @jaypume
In the current stage, we enable Plato in Sedna via directly call its server and client.

JoeyHwong-gk · 2021-07-07T11:38:49Z

lib/sedna/algorithms/transmitter/base.py

+
+    @abstractmethod
+    def compress(self):  # 传输的内容可能有：weights，压缩后的weights， 特征向量，蒸馏后的数据
+        pass


Code comments are best described in English.

Jie Pu has updated it.

JoeyHwong-gk · 2021-07-07T11:42:46Z

examples/federated_learning/surface_defect_detection/training_worker/interface.py

@@ -21,6 +21,58 @@
 os.environ['BACKEND_TYPE'] = 'KERAS'


+import torch


This looks like the Pytorch framework used, but keras is defined in the Env variable.

surface_defect_detection is using Keras, and we will perform it in a similar way.

JoeyHwong-gk · 2021-07-07T11:44:33Z

lib/sedna/algorithms/aggregation/__init__.py

+        # self.fedavg_server = fedavg.Server(model=model)
+        pass
+
+    def aggregate0(self, weights, size=0):


Why is naming aggregate0 more recommended than aggregate?

JoeyHwong-gk · 2021-07-07T11:46:06Z

lib/sedna/algorithms/aggregation/mistnet.py

+from .base import BaseAggregation
+
+
+class MistNet(BaseAggregation):


If it needs to be called by the registration factory, maybe we can add the registration wrap with ClassFactory.

Jie Pu has updated it.

JoeyHwong-gk · 2021-07-07T11:48:34Z

lib/sedna/service/server/aggregation.py

@@ -241,3 +268,17 @@ async def client_info(self, request: Request):
        if client_id:
            return server.get_client(client_id)
        return WSClientInfoList(clients=server.client_list)
+
+import os
+os.environ['config_file'] = '/home/work/server.yml'


The import module can be add on the top.

jaypume · 2021-07-27T02:32:47Z

/reopen

kubeedge-bot · 2021-07-27T02:32:49Z

@jaypume: Failed to re-open PR: state cannot be changed. There are no new commits on the XinYao1994:main branch.

In response to this:

/reopen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

jaypume · 2021-07-27T02:42:21Z

/reopen

kubeedge-bot · 2021-07-27T02:42:24Z

@jaypume: Reopened this PR.

In response to this:

/reopen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

JimmyYang20 · 2021-09-08T07:11:57Z