Dataset format question #15

Open
SXUleiyang opened this issue Jan 11, 2024 · 3 comments

Comments

@SXUleiyang

Hello, how is data in the high_rouge_indices_and_scores.jsonl format produced? I could not find where the indices and score values come from.

@nianlonggu
Owner

Hello, please see the section on creating a custom dataset in the Training Pipeline document: https://github.com/nianlonggu/MemSum/blob/main/Training_Pipeline.md#preprocessing-custom-data

@SXUleiyang
Author

> Hello, please see the section on creating a custom dataset in the Training Pipeline document: https://github.com/nianlonggu/MemSum/blob/main/Training_Pipeline.md#preprocessing-custom-data

Thank you very much for your reply. I am unsure about the step "create high-ROUGE episodes for the training set". Is this step mandatory?

@nianlonggu
Owner

Yes, this step is required to create the training data.
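
For readers wondering where indices and score values could come from, here is a minimal sketch of the general idea behind a high-ROUGE episode: greedily pick the document sentences whose concatenation best matches the reference summary, then record the chosen sentence indices and the resulting score. This is not the repository's preprocessing script. The function names, the simplified unigram-overlap F1 (a stand-in for a real ROUGE scorer), and the output record layout are all assumptions made for illustration; the linked Training Pipeline document is the authority on the actual procedure and file schema.

```python
import json
import re
from collections import Counter


def tokenize(text):
    """Lowercase and split into word tokens (illustrative tokenizer)."""
    return re.findall(r"\w+", text.lower())


def unigram_f1(candidate_tokens, reference_tokens):
    """Simplified unigram-overlap F1, used here as a stand-in for ROUGE-1."""
    if not candidate_tokens or not reference_tokens:
        return 0.0
    overlap = sum((Counter(candidate_tokens) & Counter(reference_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(candidate_tokens)
    recall = overlap / len(reference_tokens)
    return 2 * precision * recall / (precision + recall)


def greedy_high_rouge_episode(sentences, summary, max_sentences=3):
    """Hypothetical helper: greedily add the sentence that most improves the score."""
    reference = tokenize(" ".join(summary))
    selected, best_score = [], 0.0
    while len(selected) < max_sentences:
        best_idx = None
        for i in range(len(sentences)):
            if i in selected:
                continue
            candidate = tokenize(" ".join(sentences[j] for j in selected + [i]))
            score = unigram_f1(candidate, reference)
            if score > best_score:
                best_idx, best_score = i, score
        if best_idx is None:  # no remaining sentence improves the score
            break
        selected.append(best_idx)
    return selected, best_score


document = [
    "The cat sat.",
    "Stock prices rose sharply today.",
    "The mat was red.",
]
summary = ["The cat sat on the red mat."]

indices, score = greedy_high_rouge_episode(document, summary)

# Assumed record layout: one list of indices and one score per episode.
record = {"indices": [indices], "score": [score]}
print(json.dumps(record))
```

On this toy input the greedy search selects sentences 0 and 2 and skips the unrelated sentence 1. A real pipeline would typically use a proper ROUGE implementation and may keep several high-scoring episodes per document rather than one.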
