Generate text based on user prompts
This is a first implementation on transformer
Tokenizer specific to odia language with 5000 tokens
Trained on ImageNet1k