Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CMP源码申请下载为什么只有large参数模型,为什么没有CPM-small&CPM-medium,你用的small参数模型在哪下载的 #6

Open
MaNing1924382115 opened this issue Jul 19, 2021 · 4 comments

Comments

@MaNing1924382115
Copy link

1、CMP源码申请下载为什么只有large参数模型,为什么没有CPM-small&CPM-medium,你用的small参数模型在哪下载的吗?
2、你代码中config文件下的CPM-small&CPM-medium json文件是你自己写的还是在哪下载的?
谢谢

@yangjianxin1
Copy link
Owner

分享的模型参数不是CPM官方的,是自己使用作文数据集从头开始训练的。CPM本质上是中文版的GPT2模型,config文件夹下的模型配置文件是使用huaggingface

@xiewenqian
Copy link

“本项目是基于CPM模型的中文文本生成项目”?

@rattlesnakey
Copy link

请问这个 zuowen_epoch40 的模型,是官方放出来的CPM 模型还是你自己从头训练的呀?

@liangzz1991
Copy link

liangzz1991 commented Nov 25, 2022

我理解就是用了CPM的config和tokenier(其实也是gpt2),作者利用中文数据从头训练作文生成模型,所以也就跟cpm提供的模型关系不大,我刚开始也认为作者是finetune的cpm....一直在cpm官网找对应的pretrain

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants