How many tokens were used for training?

Curious to know how many tokens the models have seen. The repo mentions the dataset, but not the totals.

This checkpoint is trained on the stricter permissive subset of the deduplicated version of the Stack dataset (v1.1). Supported languages (and frameworks) are as follows: c, c++, c-sharp, dart, go, java, javascript, kotlin, lua, php, python, ruby, rust, scala, shell, sql, swift, typescript, vue.

Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How many tokens were used for training? #4

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

How many tokens were used for training? #4

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions