ELI5: How does ChatGPT do its shit?

Acamon@lemmy.world · 10 months ago

Dran@lemmy.world · 10 months ago

The magic sauce is context length within reasonable compute constraints. Phone predictive text has a context length of like 2-3 words; ChatGPT (and other LLMs) can make predictions using thousands or tens of thousands of words of context at a time.
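
To make the context-length point concrete, here is a toy sketch in Python. It is not how ChatGPT works internally (LLMs use neural networks, not lookup tables); it only shows how the same counting predictor gives a better guess when it sees more preceding words. The corpus and the `train`/`predict` helpers are made up for illustration.

```python
from collections import Counter, defaultdict

def train(words, context_len):
    """Count which word follows each run of `context_len` words."""
    table = defaultdict(Counter)
    for i in range(context_len, len(words)):
        context = tuple(words[i - context_len:i])
        table[context][words[i]] += 1
    return table

def predict(table, context_len, recent_words):
    """Return the most frequent follower of the last `context_len` words, or None."""
    context = tuple(recent_words[-context_len:])
    followers = table.get(context)
    if not followers:
        return None
    return followers.most_common(1)[0][0]

# Tiny made-up corpus: "on the" is usually followed by "mat",
# but after "dog sat on the" the next word is "rug".
corpus = (
    "the cat sat on the mat . "
    "the cat sat on the mat . "
    "the dog sat on the rug ."
).split()

prompt = "the dog sat on the".split()

short_table = train(corpus, 2)  # phone-keyboard-style context
long_table = train(corpus, 4)   # slightly longer context

short_guess = predict(short_table, 2, prompt)
long_guess = predict(long_table, 4, prompt)
print(short_guess, long_guess)
```

With only two words of context the predictor cannot tell the cat sentence from the dog sentence and falls back on the most common follower. With four words it can. The catch is that a table like this grows enormously as the context gets longer, which is roughly why handling long contexts needed a different approach and a lot of compute.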

doublejay1999@lemmy.world · 10 months ago

Is that why it's compute-heavy?

Dran@lemmy.world · 10 months ago

Correct, and the massive sets of learned weights that encode those long-context associations are why you need tens to hundreds of gigabytes of RAM/VRAM. Reading them from disk would be too slow.
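
As a rough back-of-the-envelope sketch of where "tens to hundreds of gigabytes" comes from: memory for the weights alone is about the number of parameters times the bytes used per parameter. The parameter counts below are illustrative assumptions, not the size of ChatGPT (which isn't public), and the estimate ignores everything else that also needs memory at runtime.

```python
def weight_memory_gb(num_params, bytes_per_param):
    """Approximate memory for the weights alone, in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9

# Hypothetical model sizes, chosen only to illustrate the arithmetic.
model_sizes = {"7B params": 7_000_000_000, "70B params": 70_000_000_000}

# Common storage precisions: 16-bit = 2 bytes, 4-bit = 0.5 bytes per parameter.
precisions = {"16-bit": 2, "4-bit": 0.5}

for model_name, num_params in model_sizes.items():
    for precision_name, nbytes in precisions.items():
        gb = weight_memory_gb(num_params, nbytes)
        print(f"{model_name} at {precision_name}: about {gb:.1f} GB")
```

So a hypothetical 70-billion-parameter model stored at 16 bits per weight needs about 140 GB just to hold the weights, and every one of those weights gets used for each word generated, which is why it all has to sit in fast memory.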