[LLM][NPU] fix on readme #8659

Merged
merged 1 commit on Jun 26, 2024
2 changes: 1 addition & 1 deletion llm/npu/llama/README.md
@@ -1,7 +1,7 @@
## 🚣‍♂️ Running the llama2-13b model on NPU with PaddleNLP 🚣
PaddleNLP has deeply adapted and optimized the llama2-13B model for Ascend NPU ([learn about Ascend](https://www.hiascend.com/zh/ecosystem/industry)). The toolkit largely unifies the training and inference entry points across Ascend NPU and GPU, achieving a "seamless switch" between the two.
Technical highlights:
- **Full training-strategy support** Supports 3D hybrid parallelism and flexibly adapts to a variety of training strategies.
- **Full training-strategy support** Supports 4D hybrid parallelism and flexibly adapts to a variety of training strategies.
- **Extreme training-performance optimization** 95% of communication is hidden behind computation; hardware-software co-design delivers peak performance.
- **Low-barrier performance tuning** Automatic distributed-strategy search works across multiple hardware back ends, fully hiding hardware complexity while letting users easily tap the limits of their compute.
- **Extreme inference-cost compression** Inference supports Layer-level operator fusion, and fused operators already support dynamic insertion.
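For readers unfamiliar with the terminology in the changed bullet, here is a minimal, hypothetical sketch (illustration only, not PaddleNLP API) of what the four dimensions in "4D hybrid parallelism" typically factor a device cluster into:

```python
# Hypothetical illustration only -- not PaddleNLP code. "4D hybrid parallelism"
# commonly combines four orthogonal partitioning dimensions whose degrees
# multiply up to the total number of devices.
total_devices = 32

data_parallel_degree = 2      # full-model replicas, each consuming different batches
tensor_parallel_degree = 4    # each layer's weight matrices split across devices
pipeline_parallel_degree = 2  # consecutive layer groups placed on different devices
sharding_parallel_degree = 2  # optimizer states/gradients sharded across replicas

assert (data_parallel_degree
        * tensor_parallel_degree
        * pipeline_parallel_degree
        * sharding_parallel_degree) == total_devices
```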
2 changes: 1 addition & 1 deletion paddlenlp/transformers/mc2_parallel_linear.py
@@ -34,7 +34,7 @@
from paddlenlp.utils.tools import get_env_device

__all_gather_recomputation__ = False
if int(os.getenv("MC2_Recompute", 0)):
if int(os.getenv("FLAGS_NPU_MC2_Recompute", 0)):
__all_gather_recomputation__ = True


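As a usage note, a minimal sketch of enabling all-gather recomputation through the renamed flag, assuming PaddleNLP and its NPU dependencies are importable in your environment (the module path follows the file shown above; the flag can equally be exported in the shell before launching training):

```python
import os

# The flag is read once at import time (see the diff above), so it must be set
# before paddlenlp.transformers.mc2_parallel_linear is first imported.
os.environ["FLAGS_NPU_MC2_Recompute"] = "1"

from paddlenlp.transformers import mc2_parallel_linear

# With the flag set to a non-zero value, the module turns on all-gather recomputation.
print(mc2_parallel_linear.__all_gather_recomputation__)  # True
```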
2 changes: 1 addition & 1 deletion requirements.txt
@@ -24,4 +24,4 @@ tool_helpers ; platform_system == "Linux"
aistudio-sdk>=0.1.3
jinja2
regex
numpy==1.26.4
numpy<=1.26.4