H-Net
Summary
H-Net is an end-to-end hierarchical sequence model that learns dynamic byte chunking jointly with language modeling.
Role In The Wiki
H-Net is the clearest example of replacing tokenizer preprocessing with learned hierarchical chunking.