Merge branch 'PaddlePaddle:develop' into develop
mmglove committed Jun 20, 2024
2 parents f75436d + 5619cc3 commit 26fa8bf
Showing 3,329 changed files with 44,233 additions and 215,423 deletions.
The diff you're trying to view is too large. We only load the first 3000 changed files.
48 changes: 0 additions & 48 deletions .github/workflows/fast_tokenizer.yml

This file was deleted.

8 changes: 4 additions & 4 deletions .github/workflows/pipelines.yml
@@ -3,10 +3,10 @@ name: Pipelines-Test
 on:
   push:
     paths:
-      - 'pipelines/*'
+      - 'legacy/pipelines/*'
   pull_request:
     paths:
-      - 'pipelines/*'
+      - 'legacy/pipelines/*'


 jobs:
@@ -20,11 +20,11 @@ jobs:
           python-version: '3.10'
           cache: 'pip' # caching pip dependencies
       - name: Install dependencies
-        working-directory: ./pipelines
+        working-directory: ./legacy/pipelines
         run: |
           python -m pip install --upgrade pip
           make install
           pip install -r tests/requirements.txt
       - name: run the command
-        working-directory: ./pipelines
+        working-directory: ./legacy/pipelines
         run: make test
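
A note on the updated filter: in GitHub Actions filter patterns, `*` does not match `/`, so `'legacy/pipelines/*'` covers only files directly inside `legacy/pipelines/`, while a `**` pattern would also cover nested paths. The sketch below approximates that rule in Python. `glob_to_regex` and `path_matches` are hypothetical helpers written for illustration; they are not part of GitHub Actions or of this repository, and they handle only `*` and `**`.

```python
import re


def glob_to_regex(pattern: str) -> re.Pattern:
    """Translate a simplified path filter into a compiled regex.

    Assumption: only `*` and `**` are handled, which approximates (and does
    not fully reproduce) GitHub Actions filter-pattern matching.
    """
    parts = []
    i = 0
    while i < len(pattern):
        if pattern.startswith("**", i):
            parts.append(".*")  # `**` may cross directory separators
            i += 2
        elif pattern[i] == "*":
            parts.append("[^/]*")  # `*` stops at the next `/`
            i += 1
        else:
            parts.append(re.escape(pattern[i]))
            i += 1
    return re.compile("".join(parts))


def path_matches(pattern: str, path: str) -> bool:
    """Return True when the whole path matches the simplified pattern."""
    return glob_to_regex(pattern).fullmatch(path) is not None


if __name__ == "__main__":
    print(path_matches("legacy/pipelines/*", "legacy/pipelines/Makefile"))
    print(path_matches("legacy/pipelines/*", "legacy/pipelines/tests/requirements.txt"))
    print(path_matches("legacy/pipelines/**", "legacy/pipelines/tests/requirements.txt"))
```

Under this reading, a change to `legacy/pipelines/tests/requirements.txt` alone would not trigger the workflow, because the single `*` does not descend into `tests/`.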
1 change: 1 addition & 0 deletions Makefile
@@ -37,6 +37,7 @@ test: unit-test
 unit-test:
 	PYTHONPATH=$(shell pwd) pytest -v \
 		-n auto \
+		--retries 1 --retry-delay 1 \
 		--durations 20 \
 		--cov paddlenlp \
 		--cov-report xml:coverage.xml
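
The `--retries 1 --retry-delay 1` flags ask for one extra attempt, after a one-second pause, when a test fails. Assuming they come from a retry plugin such as pytest-retry (the diff itself does not name the plugin), the behaviour is roughly the loop below. `run_with_retries` is a hypothetical helper for illustration only, not the plugin's implementation.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def run_with_retries(test: Callable[[], T], retries: int = 1, retry_delay: float = 1.0) -> T:
    """Call `test`; on failure, wait `retry_delay` seconds and try again.

    `retries` counts the extra attempts, so retries=1 allows two runs in total.
    """
    attempt = 0
    while True:
        try:
            return test()
        except Exception:
            if attempt >= retries:
                raise  # out of retries: surface the last failure
            attempt += 1
            time.sleep(retry_delay)
```

A test that fails once and then passes is reported as passing under this scheme, which hides intermittent failures in CI at the cost of masking genuinely flaky tests.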
12 changes: 0 additions & 12 deletions README.md
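@@ -279,18 +279,6 @@ PaddleNLP targets information extraction, semantic search, intelligent question answering, sentiment analysis, and other high

 ### High-Performance Distributed Training and Inference

-#### ⚡ FastTokenizer: High-Performance Text Processing Library
-
-<div align="center">
-<img src="https://user-images.githubusercontent.com/11793384/168407921-b4395b1d-44bd-41a0-8c58-923ba2b703ef.png" width="400">
-</div>
-
-```python
-AutoTokenizer.from_pretrained("ernie-3.0-medium-zh", use_fast=True)
-```
-
-For the best possible model deployment performance, install FastTokenizer and enable the `use_fast=True` option on the `AutoTokenizer` API to call the high-performance tokenization operator implemented in C++, which speeds up text processing by more than 100 times over Python. For more usage instructions, see the [FastTokenizer documentation](./fast_tokenizer)
-
 #### ⚡️ FastGeneration: High-Performance Generation Acceleration Library

 <div align="center">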
12 changes: 0 additions & 12 deletions README_en.md
@@ -224,18 +224,6 @@ For more details please refer to [Speech Command Analysis](./applications/speech

 ### High Performance Distributed Training and Inference

-#### ⚡ FastTokenizer: High Performance Text Preprocessing Library
-
-<div align="center">
-<img src="https://user-images.githubusercontent.com/11793384/168407921-b4395b1d-44bd-41a0-8c58-923ba2b703ef.png" width="400">
-</div>
-
-```python
-AutoTokenizer.from_pretrained("ernie-3.0-medium-zh", use_fast=True)
-```
-
-Set `use_fast=True` to use C++ Tokenizer kernel to achieve 100x faster on text pre-processing. For more usage please refer to [FastTokenizer](./fast_tokenizer).
-
 #### ⚡ FastGeneration: High Performance Generation Library

 <div align="center">
188 changes: 0 additions & 188 deletions applications/document_intelligence/README.md

This file was deleted.

17 changes: 0 additions & 17 deletions applications/document_intelligence/doc_vqa/.gitignore

This file was deleted.
