This repository is a derivative project based on cpa-warden. Upstream project: fantasticjoe/cpa-warden Derivative baseline in this repository: commit ...
MicroGPT is a custom-built, 30.5 million parameter Large Language Model (LLM) trained entirely from scratch. The mathematical architecture, tokenization pipeline, and training loop were written in ...