• Stars
    star
    34
  • Rank 762,943 (Top 16 %)
  • Language
    Python
  • License
    MIT License
  • Created 7 months ago
  • Updated 7 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, with PyTorch/CUDA