Gradient-Descent Markov Chain.
The grid represents $W$. Rows = Input Char, Columns = Predicted Char. Brighter = Higher Probability.
Training String:
1. Init Model/Vocab
2. Start Training
Inference Prompt:
3. Run Inference
Ready...
Inference results appear here...