AI models now perform strongly in obscure languages with minimal training data Cross-lingual transfer allows shared patterns to boost rare language performance Tokenizer efficiency improvements ...