Skip to content

Commit

Permalink
update
Browse files Browse the repository at this point in the history
更新文档
  • Loading branch information
Huanshere committed Sep 9, 2024
1 parent f3f8286 commit 3c725ba
Show file tree
Hide file tree
Showing 2 changed files with 30 additions and 21 deletions.
35 changes: 20 additions & 15 deletions docs/install_locally_en.md
Original file line number Diff line number Diff line change
@@ -1,26 +1,26 @@
# 🏠 VideoLingo Local Deployment Guide (Windows)

VideoLingo offers a choice of multiple Whisper solutions (as there is no single perfect option so far):
VideoLingo offers multiple Whisper solutions for speech recognition text steps (as there's no single perfect choice currently). Choose one based on your personal configuration and needs.

| Solution | Advantages | Disadvantages |
|:---------|:-----------|:--------------|
| **whisper_timestamped** | • Runs locally<br>• Easy to install<br>• Uses native Whisper model | • Ideal for English only |
| **whisperX_api** |Uses Replicate API, no local computing power needed |Replicate service may be unstable<br>• Occasional CUDA errors |
| **whisperX** (🌟Recommended) |Runs locally<br>• Based on faster-whisper, excellent performance |Requires CUDA and cuDNN configuration<br>• Separate wav2vec model download for each language |
| **whisper_timestamped** | • Runs locally<br>• Easy installation<br>• Uses native Whisper model | • Ideal for English only<br>• Requires GPU with 8GB+ VRAM |
| **whisperX** (🌟Recommended) |Runs locally<br>• Based on faster-whisper, excellent performance<br>• Good multi-language support |Requires CUDA and cuDNN installation<br>• Separate wav2vec model download for each language<br>• Requires GPU with 8GB+ VRAM |
| **whisperX_api** |Uses Replicate API, no local computing power needed |Replicate service may be unstable, occasional CUDA errors<br>• Uses large-v3, punctuation not as good as local v2 |

## 📋 Preparation

1. Register an account on [Cloud Fog API](https://api.wlai.vip/register?aff=TXMB) and recharge to get a token
1. Register an account on [Yunwu API](https://api.wlai.vip/register?aff=TXMB) and recharge to get a token (or use any claude-3.5-sonnet provider)

![Cloud Fog API Registration Process](https://github.com/user-attachments/assets/762520c6-1283-4ba9-8676-16869fb94700)
![Yunwu API Registration Process](https://github.com/user-attachments/assets/762520c6-1283-4ba9-8676-16869fb94700)

2. If using `whisperX_api`, please register a Replicate account and bind a payment method
2. If using `whisperX_api`, register a Replicate account, link a payment method, and get your token

## 🛠️ Installation Process

### Prerequisites

Before installing VideoLingo, please ensure you complete the following steps (most of which are for GPU acceleration):
Before installing VideoLingo, ensure you complete the following steps:

1. Install [Visual Studio 2022](https://visualstudio.microsoft.com/thank-you-downloading-visual-studio/?sku=Community&channel=Release&version=VS2022&source=VSLandingPage&cid=2030&passive=false)
- Select and install the "Desktop development with C++" component package
Expand All @@ -38,26 +38,31 @@ Before installing VideoLingo, please ensure you complete the following steps (mo

### Installation Steps

1. Clone the project:
1. Open Anaconda Prompt and switch to the desktop directory:
```bash
cd desktop
```

2. Clone the project:
```bash
git clone https://github.com/Huanshere/VideoLingo.git
cd VideoLingo
```

2. Configure virtual environment:
3. Configure virtual environment:
```bash
conda create -n videolingo python=3.12.0
conda create -n videolingo python=3.10.0
conda activate videolingo
```

3. Run the installation script:
4. Run the installation script:
```bash
python install.py
```
Follow the prompts to select the desired Whisper project, and the script will automatically install the corresponding torch and whisper versions.
Choose the desired Whisper project when prompted, and the script will automatically install the corresponding torch and whisper versions.

4. 🎉 Launch the Streamlit application:
5. 🎉 Launch the Streamlit application: Double-click `一键启动.bat` or enter
```bash
streamlit run st.py
```
Open the Web interface in your browser, select the corresponding Whisper method through the sidebar and configure it.
Open the Web interface in your browser, select the corresponding Whisper method in the sidebar, and configure it.
16 changes: 10 additions & 6 deletions docs/install_locally_zh.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,6 @@ VideoLingo 语音识别文本步骤提供多种 Whisper 方案的选择(因为
## 📋 前期准备

1.[云雾 API](https://api.wlai.vip/register?aff=TXMB) 注册账号并充值以获取令牌(或者换任意的claude-3.5-sonnet提供商)
> 云雾 api 的成本是官方的 1/10,但近期不稳定,也可以考虑 deepseek-coder 尝试效果。

![云雾 API 注册流程](https://github.com/user-attachments/assets/762520c6-1283-4ba9-8676-16869fb94700)

Expand Down Expand Up @@ -39,25 +38,30 @@ VideoLingo 语音识别文本步骤提供多种 Whisper 方案的选择(因为

### 安装步骤

1. 克隆项目:
1. 打开 Anaconda Prompt 并切换到桌面目录:
```bash
cd desktop
```

2. 克隆项目:
```bash
git clone https://github.com/Huanshere/VideoLingo.git
cd VideoLingo
```

2. 配置虚拟环境:
3. 配置虚拟环境:
```bash
conda create -n videolingo python=3.12.0
conda create -n videolingo python=3.10.0
conda activate videolingo
```

3. 运行安装脚本:
4. 运行安装脚本:
```bash
python install.py
```
根据提示选择所需的 Whisper 项目,脚本将自动安装相应的 torch 和 whisper 版本。

4. 🎉 启动 Streamlit 应用:
5. 🎉 启动 Streamlit 应用:双击 `一键启动.bat` 或输入
```bash
streamlit run st.py
```
Expand Down

0 comments on commit 3c725ba

Please sign in to comment.