From 2192f8e0a215899b66de427bb17051338d629afc Mon Sep 17 00:00:00 2001 From: MaxFish <16432329+MaxMax2016@users.noreply.github.com> Date: Wed, 20 Sep 2023 11:34:57 +0800 Subject: [PATCH] release --- README.md | 10 +++------- 1 file changed, 3 insertions(+), 7 deletions(-) diff --git a/README.md b/README.md index 8f02ac9..0ec35e9 100644 --- a/README.md +++ b/README.md @@ -13,7 +13,7 @@ The framework of grad-svc-v1 ![grad_svc_v2](./assets/grad_svc_v2.jpg) -The framework of grad-svc-v2, encoder:768->512, diffusion:64->96, next ver for whisper +The framework of grad-svc-v2, encoder:768->512, diffusion:64->96 https://github.com/PlayVoice/Grad-SVC/assets/16432329/f9b66af7-b5b5-4efb-b73d-adb0dc84a0ae @@ -34,10 +34,6 @@ https://github.com/PlayVoice/Grad-SVC/assets/16432329/f9b66af7-b5b5-4efb-b73d-ad 6. Integrated Fast Maximum Likelihood Sampling Scheme -7. Low GPU memery required for train - - `batch_size: 8, occupy 3.1GB GPU memory when fast epochs, and 5.8G when last epochs` - ## Setup Environment 1. Install project dependencies @@ -175,9 +171,9 @@ data_gvc/ tensorboard --logdir logs/ ``` -## Finetune Loss +## Train Loss -![grad_svc_loss](./assets/grad_svc_loss_fit.jpg) +![loss_96_v2](./assets/loss_96_v2.jpg) ![grad_svc_mel](./assets/grad_svc_mel.jpg)