posts: 添加关于使用RVC进行AI翻唱的详细教程文章,包含训练音色模型和使用Replay进行翻唱的完整流程说明

This commit is contained in:
二叉树树
2025-10-13 14:58:41 +08:00
parent 88f5a19d57
commit 5d138f4502
21 changed files with 129 additions and 0 deletions

Binary file not shown.

After

Width:  |  Height:  |  Size: 64 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 91 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 59 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 144 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 28 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 144 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 20 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 68 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 298 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 114 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 346 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 73 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 298 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 6.8 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 57 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 6.8 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 10 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 288 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 68 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 250 KiB

129
src/content/posts/rvc.md Normal file
View File

@@ -0,0 +1,129 @@
---
title: 手把手教你AI翻唱
published: 2025-10-13T14:23:12
description: 利用RVC训练音色模型然后使用Replay直出AI翻唱
image: ../assets/images/rvc-19.png
tags:
- AI
- RVC
- Replay
- UVR
draft: false
lang: ""
---
# 视频教程
https://www.bilibili.com/video/BV19F41zPEnM/
# 流程
RVC训练角色音色模型
Replay利用音色模型+原曲进行AI翻唱
UVR&MSST进行人声伴奏分离
# 准备音源
至少10分钟推荐1小时。音频内仅允许有一种音色可以有停顿如果想要更高质量可以自己裁剪停顿处
# 利用RVC训练模型
进入 [RVC-Project/Retrieval-based-Voice-Conversion-WebUI: Easily train a good VC model with voice data <= 10 mins!](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI) 根据你的系统和显卡来进行下载,或者使用该链接下载(国内高速) [语音克隆&变声器 整合包下载](https://www.yuque.com/flowercry/hxf0ds) 注意不要下错了
![](../assets/images/rvc-1.png)
直接运行 `go-web.bat`
![](../assets/images/rvc-2.png)
进入 WebUI 并切换到训练一栏
![](../assets/images/rvc-3.png)
首先写模型名称
![](../assets/images/rvc-4.png)
然后将你的音源放到一个空文件夹
![](../assets/images/rvc-5.png)
然后填进去
![](../assets/images/rvc-6.png)
总训练轮数推荐50 ~ 200
![](../assets/images/rvc-7.png)
然后点击一键训练(需要很久,建议晚上睡觉前训练)
![](../assets/images/rvc-8.png)
训练结束后可以在 `assets/weights` 找到模型文件, `.pth` 结尾的
![](../assets/images/rvc-9.png)
# 利用Replay做AI翻唱
下载 [Replay](https://www.weights.com/replay)
首先 **Select Audio** 选择你的原歌曲
**Model** 选择刚刚训练出的模型
然后点击 **Convert Audio**
![](../assets/images/rvc-10.png)
在输出的文件的 **View in Folder** 可以找到 **干净的AI人声**
![](../assets/images/rvc-11.png)
# 伴奏和人声分离
### UVR
> 如果你是50系显卡请前往[GPU Acceleration Hangs on RTX 5070Ti (Driver 576.80, CUDA 12.9) · Issue #1889 · Anjok07/ultimatevocalremovergui](https://github.com/Anjok07/ultimatevocalremovergui/issues/1889)通过[UVR_Patch_4_24_25_20_11_BETA_full_cuda_12.8](https://www.mediafire.com/file_premium/4jg10r9wa3tujav/UVR_Patch_4_24_25_20_11_BETA_full_cuda_12.8.zip/file)下载适用于50系显卡的UVR
下载 [Anjok07/ultimatevocalremovergui: GUI for a Vocal Remover that uses Deep Neural Networks.](https://github.com/Anjok07/ultimatevocalremovergui)
首先下载模型,选择设置
![](../assets/images/rvc-12.png)
选择 **Download Center** 下载 **VR Arch****5_HP-Karaoke-UVR** 模型。然后回到首页
![](../assets/images/rvc-13.png)
首先通过 **Select Input** 选择原音频
然后通过 **Select Output** 选择输出的文件夹
**CHOOSE PROCESS METHOD** 选择 **VR Architecture**
**CHOOSE VR MODEL** 选择我们刚刚下载的 **5_HP-Karaoke-UVR** 模型
勾选 **GPU Conversion**
然后点击 **Start Processing**
![](../assets/images/rvc-14.png)
输出文件夹中 **Instrumental** 为伴奏, **Vocals** 为人声
![](../assets/images/rvc-15.png)
### MSST
下载 [SUC-DriverOld/MSST-WebUI: A WebUI app for Music-Source-Separation-Training and we packed UVR together!](https://github.com/SUC-DriverOld/MSST-WebUI)
双击 `go-webui.bat` 运行
![](../assets/images/rvc-16.png)
首先去安装模型。每个模型的最终输出文件可能不一样
![](../assets/images/rvc-17.png)
然后都是字面意思了,随后点击 **输入音频分离** 开始转换
![](../assets/images/rvc-18.png)