Prompt Detail:
could you create a table for the size of data you are trained , sorted by the size of their training corpuses descending order along with the percentage of the total training data each language
Add a comment