Skip to main content Link Menu Expand (external link) Document Search Copy Copied

Large Language Models

Timeline

Model Date Company Cite
Transformer 06/2017 Google 78k
GPT 06/2018 OpenAI 5k
BERT 10/2018 Google 68k
GPT-2 02/2019 OpenAI 6k
GPT-3 05/2020 OpenAI 11k

References

  • CS324 - Large Language Models. The field of natural language processing (NLP) has been transformed by massive pre-trained language models. They form the basis of all state-of-the-art systems across a wide range of tasks and have shown an impressive ability to generate fluent text and perform few-shot learning. At the same time, these models are hard to understand and give rise to new ethical and scalability challenges. In this course, students will learn the fundamentals about the modeling, theory, ethics, and systems aspects of large language models, as well as gain hands-on experience working with them.

Table of contents