To evaluate on bgGLUE, collect your system's predictions on the nine tasks.
The expected format for the predictions from each dataset is a jsonl file (a valid json on each row) with two fields: `{"id": 123, "label": "label_name"}`.
You may upload at most two submissions a day. A sample submission with the necessary formatting is available
in our GitHub repository.
You can use the
training code for the baselines as a starting point. See below if you are running into issues submitting.