SMOL-GPT

2 likes

A minimal PyTorch implementation for training your own small LLM from scratch. Designed for educational purposes and simplicity, featuring efficient training, flash attention, and modern sampling techniques.

Cost / License

Free
Open Source (MIT)

Platforms

Python
Windows
Mac
Linux
BSD
PyTorch

SMOL-GPT alternatives

2likes

0comments

8alternatives

0articles

Features

Python-based

SMOL-GPT News & Activities

Highlights All activities

Recent activities

POX added SMOL-GPT as alternative to Plexe AI
10 months ago

SMOL-GPT information

Developed by
Om Alve
Licensing
Open Source (MIT) and Free product.
Written in
Python
Alternatives
8 alternatives listed
Supported Languages
- English

AlternativeTo Category

AI Tools & Services

GitHub repository

1,465 Stars
122 Forks
9 Open Issues
Updated Feb 15, 2025

View on GitHub

Popular alternatives

View all

SMOL-GPT was added to AlternativeTo by Paul on Feb 3, 2025 and this page was last updated Feb 3, 2025.

No comments or reviews, maybe you want to be first?

What is SMOL-GPT?

Features:

Minimal Codebase: Pure PyTorch implementation with no abstraction overhead
Modern Architecture: GPT model with:
Flash Attention (when available)
RMSNorm and SwiGLU
Efficient top-k/p/min-p sampling
Rotary embeddings - RoPE (Optional)
Training Features:
Mixed precision (bfloat16/float16)
Gradient accumulation
Learning rate decay with warmup
Weight decay & gradient clipping
Dataset Support: Built-in TinyStories dataset processing
Custom Tokenizer: SentencePiece tokenizer training integration

SMOL-GPT

Cost / License

Platforms

SMOL-GPT

Features

Tags

SMOL-GPT News & Activities

Recent activities

SMOL-GPT information

Developed by

Licensing

Written in

Alternatives

Supported Languages

AlternativeTo Category

GitHub repository

Popular alternatives

What is SMOL-GPT?

Official Links

AppStores & Other Links