Regional Tiny Stories: Using Small Models to Compare Language Learning and Tokenizer Performance