Burkov Andriy - The Hundred-Page Language Models Book + Code - 2025
File List
- Burkov Andriy - The Hundred-Page Language Models Book - 2025.pdf 24.3 MB
- Code/news_decoder_language_model.ipynb 425.2 KB
- Code/news_RNN_language_model.ipynb 397.7 KB
- Code/emotion_GPT2_as_text_generator.ipynb 115.4 KB
- Code/emotion_GPT2_as_text_generator_LoRA.ipynb 29.8 KB
- Code/emotion_GPT2_as_classifier.ipynb 24.5 KB
- Code/byte_pair_encoding.ipynb 23.9 KB
- Code/instruct_GPT2.ipynb 23.1 KB
- Code/count_language_model.ipynb 20.9 KB
- Code/sampling_method.ipynb 18.0 KB
- Code/emotion_classifier_LR.ipynb 8.9 KB
- Code/wiki/inference.md 3.1 KB
- Code/wiki/evaluation.md 2.0 KB
- Code/embedding_vs_linear.py 1.8 KB
- Code/wiki/index.md 1.5 KB
- Code/quadratic_loss.py 1.3 KB
- Code/wiki/math.md 1.3 KB
- Code/wiki/non-transformer.md 1.2 KB
- Code/wiki/compression.md 1.2 KB
- Code/wiki/colabs.md 1.1 KB
- Code/wiki/encoder-decoder.md 1.1 KB
- Code/wiki/embeddings.md 991 bytes
- Code/wiki/prompting.md 883 bytes
- Code/wiki/encoder.md 707 bytes
- Code/wiki/deployment.md 681 bytes
- Code/wiki/function-calling.md 669 bytes
- Code/wiki/alignment.md 636 bytes
- Code/wiki/VLM.md 611 bytes
- Code/wiki/security.md 601 bytes
- Code/wiki/overfitting.md 553 bytes
- Code/wiki/MoE.md 409 bytes
- Code/wiki/PyTorch.md 388 bytes
- Code/wiki/scaling.md 372 bytes
- Code/wiki/tokenization.md 352 bytes
- Code/wiki/notebook-services.md 337 bytes
- Code/wiki/distributed.md 331 bytes
- Code/wiki/GPU-rental.md 325 bytes
- Code/wiki/merging.md 323 bytes
- Code/wiki/test.md 271 bytes
- Code/wiki/online-finetuning.md 151 bytes
- Code/README.md 130 bytes
- Code/wiki/scripts.md 115 bytes
Download Torrent
Related Resources
Copyright Infringement
If the content above is not authorized, please contact us via activebusinesscommunication[AT]gmail.com. Remember to include the full url in your complaint.