Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

请问下载的resources数据集目录里面merged、all_excluded这两个目录名称代表什么意思? #8

Open
tjhwk opened this issue Aug 2, 2024 · 1 comment
Labels
documentation Improvements or additions to documentation

Comments

@tjhwk
Copy link

tjhwk commented Aug 2, 2024

No description provided.

@Spico197
Copy link
Owner

嗨,感谢您对我们项目的关注,抱歉回复晚了。

  • merged 是Mirror使用的预训练数据
  • all_excluded 表示预训练数据集中不包含下游benchmark的数据(避免数据泄露)

@Spico197 Spico197 added the documentation Improvements or additions to documentation label Aug 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
None yet
Development

No branches or pull requests

2 participants