A newer version of the Gradio SDK is available:
5.49.1
metadata
title: B2NL v6.2.1 - Byte-to-Natural Language Tokenizer π
emoji: π
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.19.2
app_file: app.py
pinned: true
license: apache-2.0
models:
- ggunio/B2NL-IntelligentTokenizer-v6.2.1
B2NL v6.2.1 - Byte-to-Natural Language Tokenizer π
Compress and reconstruct text with token boundaries
β οΈ IMPORTANT: Currently in AUTOREGRESSIVE MODE
- Current: ~500ms inference (Teacher Forcing training)
- Coming Soon (November 2025): Non-autoregressive training (<50ms)
π What's New in v6.2.1
- 204 languages support (up from 6)
- 16:1 fixed compression ratio
- Multi-Query Attention (8x memory reduction)
- Model: ggunio/B2NL-IntelligentTokenizer-v6.2.1
Author
Jinhyun Woo
- GitHub: Woojiggun/intelligent-tokenizer
- Paper: Zenodo