New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

AutoNLP TrainerBase and Text Classification #3728

Merged

sijunhe merged 76 commits into develop from autonlp_trainer

Jan 5, 2023

Collaborator

sijunhe commented Nov 11, 2022 •

edited

Loading

PR types

New features

PR changes

APIs

Description

AutoNLP TrainerBase and Text Classification

AiStudio例子 https://aistudio.baidu.com/aistudio/projectdetail/4994688?contributionType=1

sijunhe added 7 commits

November 11, 2022 00:11


          init commit; unit test pass

303fa26


          ready for quick review

a2d0461


          add types

f6cad1c


          isort,black,flake8

fc75035


          mypy passes, other than paddle APIs

ed39de8


          yapf training_args

57bd571


          yapf

7cf2bdc

sijunhe requested review from ZeyuChen and wawltor

November 11, 2022 06:24

sijunhe self-assigned this

sijunhe added the autonlp label

sijunhe added 2 commits

November 11, 2022 14:25


          remove afqmc

a758ae8


          import error

0727aac

ZeyuChen reviewed

View reviewed changes

paddlenlp/experimental/autonlp/auto_trainer_base.py Show resolved Hide resolved

tests/experimental/autonlp/test_text_classification.py Outdated

+                      auto_trainer.train(
+                          train_ds,
+                          dev_ds,
+                          num_cpus=1,

Member

ZeyuChen Nov 11, 2022

这里的计算会存在异构设备同时计算吗？
因为同时暴露num_cpus和num_gpus.
从飞桨自身发展角度，未来还有更多硬件，如昆仑xpu等，这一命名设计会限制未来新AI硬件接入的扩展

Collaborator Author

sijunhe Nov 12, 2022

这里的cpu就是正常的cpu core, gpu特指cuda devices. 至于未来的npu, xpu之类的，也可以通过类似的方法来限制https://docs.ray.io/en/latest/tune/faq.html#how-do-i-set-resources

这块有什么建议吗？

Collaborator Author

sijunhe Nov 12, 2022

因为ray也是apache 2.0协议开源的，有定制的需要我们也可以向upstream推。之前tpu，他们就做了类似的工作https://github.com/ray-project/ray/blob/master/python/ray/autoscaler/gcp/tpu.yaml

paddlenlp/experimental/autonlp/auto_trainer_base.py Outdated

+                          time_budget_s=time_budget_s,
+                          max_concurrent_trials=max_concurrent_trials,
+                      )
+                      tuner = tune.Tuner(

Member

ZeyuChen Nov 11, 2022

得看ray的依赖是否很重，如果很重的话是否可以在外围，当用户需要这个功能时再安装，而不宜作为默认的依赖

Collaborator Author

sijunhe Nov 12, 2022

当前就是放在外围的。见setup.py

Member

ZeyuChen commented Nov 11, 2022

#3724
GitHub有DraftPR的功能，后续可以不需要通过PR Title这么区分

sijunhe marked this pull request as draft

November 12, 2022 03:15

sijunhe changed the title ~~[Draft, Do Not Review] AutoNLP TrainerBase and Text Classification~~ AutoNLP TrainerBase and Text Classification

sijunhe requested a review from ZeyuChen

November 12, 2022 03:37

sijunhe added 5 commits

November 14, 2022 17:36

wip

4af58cb


          ready for review

eb99889


          ready for revie

c36732d


          yapf

d07ccf1


          Merge branch 'develop' into autonlp_trainer

c8e15ba

sijunhe marked this pull request as ready for review

November 14, 2022 12:11

sijunhe marked this pull request as draft

November 15, 2022 02:15

sijunhe added 5 commits

November 15, 2022 15:22


          implement predict, export, show_training_results API

e8b8a06


          Merge branch 'autonlp_trainer' of https://github.com/PaddlePaddle/Pad…

53e44de

…dleNLP into autonlp_trainer


          styles

cdec5f3


          add classification metrics

3a335da


          styles and docstring

13d27ab

sijunhe added 13 commits

December 27, 2022 16:24

wip

beb41e6


          changes

dbe0e80


          remove missing fn

518e0c7


          redesigned overrides and custom model candidates

4b19cdc


          Merge remote-tracking branch 'origin/develop' into autonlp_trainer

36461c7


          test

e49a048


          Merge remote-tracking branch 'origin/develop' into autonlp_trainer

a8beae3


          update api

a35405d


          evaluate works

97d7aae


          readme

9ad445b


          add chinese readme

8778d1b


          Merge remote-tracking branch 'origin/develop' into autonlp_trainer

65710e9


          error type

8b9f0e6

sijunhe requested review from guoshengCS and wj-Mcat

January 3, 2023 11:57

sijunhe marked this pull request as ready for review

January 3, 2023 11:58

sijunhe commented

View reviewed changes

requirements-dev.txt

		@@ -1,4 +1,4 @@
		paddlepaddle>=2.3.0,<2.4.0
		paddlepaddle==2.4.0rc0

Collaborator Author

sijunhe Jan 3, 2023

目前prompt model的动转静需要2.4.0以后才能跑通

sijunhe commented

View reviewed changes

Makefile

@@ @@ -45,6 +45,7 @@ unit-test: @@
               install:
               	pip install -r requirements-dev.txt
               	pip install -r requirements.txt
+              	pip install -r paddlenlp/experimental/autonlp/requirements.txt

Collaborator Author

sijunhe Jan 3, 2023

这里按道理不应该加入make install, 因为大部分开发都不需要autonlp的依赖，现在暂时为了跑单测加入

sijunhe added 3 commits

January 3, 2023 12:11


          add verbosity

9fba845


          add verbosity


          verbosity fix

54a83af

ZeyuChen reviewed

View reviewed changes

Member

ZeyuChen left a comment

文档增加下跳转，同意一下AI Studio品牌名撰写，其他没问题

paddlenlp/experimental/autonlp/README.md Show resolved Hide resolved

paddlenlp/experimental/autonlp/README.md Outdated Show resolved Hide resolved

paddlenlp/experimental/autonlp/README_en.md Show resolved Hide resolved

sijunhe added 4 commits

January 3, 2023 12:50


          set log level

a619f3d


          address Zeyu's comment

dc26101


          address Zeyu's comment

a6ea660


          Merge remote-tracking branch 'origin/develop' into autonlp_trainer

48d5763

wawltor reviewed

View reviewed changes

paddlenlp/experimental/autonlp/README.md

+                  train_dataset=train_ds,
+                  eval_dataset=dev_ds,
+                  label_column="labels",
+                  text_column="sentence",

Collaborator

wawltor Jan 4, 2023

对于文本分类任务中输入有两个columns输入，这里是否兼容了？

Collaborator Author

sijunhe Jan 4, 2023

刻意不兼容，后续会有专门的AutoTrainerForSemanticSearch

paddlenlp/experimental/autonlp/README.md

+              ```python
+              auto_trainer = AutoTrainerForTextClassification(
+                  train_dataset=train_ds,

Collaborator

wawltor Jan 4, 2023

这里的输入这一侧，是否要把datasets概念传递给用户了？

Collaborator Author

sijunhe Jan 4, 2023 •

edited

Loading

对，这里暂时是这样设计的，用户需要自己把数据集转成datasets格式。一个是trainer本来就需要Datasets, 所以这里正好衔接上。还有就是MVP版本尽量轻量化，减少非核心功能，如果后续有需求，可以支持pd.DataFrame之类的转化

paddlenlp/experimental/autonlp/README.md

+              - num_models (int, required): 模型试验数量
+              - num_gpus (str, optional): 实验使用的 GPU 数量。默认情况下，这是根据检测到的 GPU 设置的。
+              - num_cpus (str, optional): 实验使用的 CPU 数量。默认情况下，这是根据检测到的 vCPU 设置的。

Collaborator

wawltor Jan 4, 2023

vCPU -> CPU

Collaborator Author

sijunhe Jan 4, 2023 •

edited

Loading

这里确实是vCPU, 英语原文是virtual core, 来自底层调用的ray

paddlenlp/experimental/autonlp/README.md

+              - num_gpus (str, optional): 实验使用的 GPU 数量。默认情况下，这是根据检测到的 GPU 设置的。
+              - num_cpus (str, optional): 实验使用的 CPU 数量。默认情况下，这是根据检测到的 vCPU 设置的。
+              - max_concurrent_trials (int, optional): 同时运行的最大试验数。必须是非负数。如果为 None 或 0，则不应用任何限制。默认为None。
+              - time_budget_s: (int|float|datetime.timedelta, optional) 以秒为单位的全局时间预算，超过时间后停止所有模型试验。

Collaborator

wawltor Jan 4, 2023

这里的时间限制倒是没有必要提供这么多类型，可以直接int、float类型

Collaborator Author

sijunhe Jan 4, 2023

这个来自于底层ray的配置，既然人家配置好了，我也乐于放出来

paddlenlp/experimental/autonlp/README.md

		- hp_overrides: (dict[str, Any], optional): （仅限高级用户）。覆盖每个候选模型的超参数。例如，`{"TrainingArguments.max_steps"：5}`。
		- custom_model_candiates: (dict[str, Any], optional): （仅限高级用户）。运行用户提供的候选模型而不 PaddleNLP 的默认候选模型。可以参考 `._model_candidates` 属性

Collaborator

wawltor Jan 4, 2023

这里还是有疑问？AutoNLP看起来没有统一对外接口，而是不同的任务统一接口，这里的考虑是什么了？

Collaborator Author

sijunhe Jan 4, 2023 •

edited

Loading

以后会有，当前MVP因为只有一个分类的class, 暂时还没有配置。后续会支持类似Taskflow这种，加上一个task_type

paddlenlp/experimental/autonlp/auto_trainer_base.py

+                      """
+                      model_result = self._get_model_result(trial_id=trial_id)
+                      exported_model_path = os.path.join(model_result.log_dir, self.export_path)
+                      shutil.copytree(exported_model_path, export_path, dirs_exist_ok=True)

Collaborator

wawltor Jan 4, 2023

导出模型这里，可以把静态图的模型的导出放入进去

Collaborator Author

sijunhe Jan 4, 2023

这一块暂时还是输出动态图，保持一个灵活性。后续版本会统一增加to_static的stage。

paddlenlp/experimental/autonlp/text_classification.py

+              from hyperopt import hp
+              from paddle.io import Dataset
+              from scipy.special import expit as sigmoid
+              from sklearn.metrics import accuracy_score, precision_recall_fscore_support

Collaborator

wawltor Jan 4, 2023

sklearn看起来没有放入到requirements

Collaborator Author

sijunhe Jan 4, 2023

paddlenlp本来就带了sklearn, seqeval带进来的

paddlenlp/experimental/autonlp/text_classification.py

+                      )
+                  @property
+                  def _model_candidates(self) -> List[Dict[str, Any]]:

Collaborator

wawltor Jan 4, 2023

目前咱们只考虑预训练模型是吗？

Collaborator Author

sijunhe Jan 4, 2023

对

wawltor approved these changes

View reviewed changes

Collaborator

wawltor left a comment

LGTM

sijunhe merged commit 4afe186 into develop

sijunhe deleted the autonlp_trainer branch

January 5, 2023 04:27

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels