LLMs have two main training stages:
Pre-training is about feeding the model with lots of text data with the goal of getting the model to 'understand' human language. The 'understanding' that the model gains after pre-training consists of a probability distribut…



