Large Language Models#

Amid the hype and fear around artificial intelligence, we have developed these resources to allow those interested to pursue rigorous exploration of the technology underlying systems like ChatGPT, Claude and Deepseek. These notebooks focus on small models available on HuggingFace, but the concepts can be applied to models of all sizes.

Note on hardware: Unlike the other notebooks in the NLP sequence, these notebooks require special hardware, graphical processing units or GPUs, to work. To that end, instead of displaying the notebook itself, there will be links to Google Colab notebooks, which allow users to run this code with a small GPU (Nvidia T4). As a result, you must go to File > Save a copy in Drive to run the cells.