Showing posts from December, 2024

The Power of Large Language Models for Translations

One of the earliest use cases of large language models, i.e. text-based generative AI, was translating text from one language into another - a literal transformation of source to target, if you will. Greetings, almighty transformer model. Before that, translation mostly relied on rule-based and/or statistical models such as hidden Markov models, and later on variants of recurrent neural networks such as the plain RNN and the LSTM. But then the transformer came and blew everything that came before out of the water. While many more powerful, novel and interesting applications of generative AI for text have taken center stage in recent months and years, translating text from one language into another still feels magical. Just a few years back we would have looked up individual words in our offline or online dictionaries and translated between German and English, or any other pair of languages, word by word. We would have depended on our or someone else's experti...