
Is it a common practice in LLMs to give different weights to different training data sources?

For instance, I might want to say that all training data that comes from my in-house emails takes precedence over anything that comes from the internet?



Yes, it is. IIRC, back when OpenAI was open and published the breakdown of their training data, they were significantly overweighting Wikipedia.


Yes, though it's not about taking precedence, it's about sampling frequency. For example, if you have 1 GB of emails and 10 GB of external data, you can sample your emails twice as often and effectively change the ratio the model is trained on from 1:10 to 2:10.
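A minimal sketch of what that looks like in Python (source names, sizes, and weights here are made up for illustration; real training pipelines do this over streaming shards, not in-memory lists):

    import random

    # Hypothetical corpora. Each source gets a weight that scales how
    # often it is sampled relative to its raw size.
    sources = {
        "emails":   {"docs": ["email_1", "email_2"], "size_gb": 1,  "weight": 2.0},
        "internet": {"docs": ["web_1", "web_2"],     "size_gb": 10, "weight": 1.0},
    }

    # Sampling probability is proportional to size * weight, so 1 GB of
    # emails at weight 2 competes like 2 GB of unweighted data: the
    # effective mix becomes emails:internet = 2:10.
    totals = {name: s["size_gb"] * s["weight"] for name, s in sources.items()}
    norm = sum(totals.values())
    probs = {name: t / norm for name, t in totals.items()}

    def sample_document():
        # Pick a source by its weighted share, then a document within it.
        name = random.choices(list(probs), weights=list(probs.values()), k=1)[0]
        return random.choice(sources[name]["docs"])

The trade-off is that upweighting a small source means repeating its documents across more epochs, which can cause memorization if pushed too far.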



