Spaces: ggunio/intelligent-tokenizer-v6-demo
0250815
intelligent-tokenizer-v6-demo
425 MB
2 contributors
History: 31 commits
Latest commit: ggunio · Fix UTF-8 safe chunking, token boundary visualization, and embedding display · 0250815 · 2 months ago
| Name | Scan | Size | Last commit | Updated |
|---|---|---|---|---|
| core | | | Fix import error by adding core module files | 2 months ago |
| src | | | Upload src/core/byte_tokenizer_v6.py with huggingface_hub | 3 months ago |
| .gitattributes | Safe | 1.52 kB | initial commit | 3 months ago |
| README.md | Safe | 8.11 kB | Trigger rebuild after model upload | 2 months ago |
| VERSION_COMPARISON.md | Safe | 8.54 kB | Update to B2NL v6.1.2 POC - 18.6:1 compression with 6 languages (Korean, English, Chinese, Japanese, Spanish, Arabic) | 2 months ago |
| app.py | Safe | 20.5 kB | Fix UTF-8 safe chunking, token boundary visualization, and embedding display | 2 months ago |
| config.json | Safe | 394 Bytes | Upload config.json with huggingface_hub | 3 months ago |
| demo_poc.py | Safe | 9.88 kB | Upload demo_poc.py with huggingface_hub | 3 months ago |
| inference.py | Safe | 9.67 kB | Upload inference.py with huggingface_hub | 3 months ago |
| pytorch_model.bin | Safe (pickle) | 425 MB (xet) | Upload pytorch_model.bin with huggingface_hub | 3 months ago |
| requirements.txt | Safe | 65 Bytes | Fix reconstruction issue by downloading checkpoint from HF model repo | 2 months ago |
| test_app.py | Safe | 6.05 kB | Update to B2NL v6.1.2 POC - 18.6:1 compression with 6 languages (Korean, English, Chinese, Japanese, Spanish, Arabic) | 2 months ago |

Detected Pickle imports (3) in pytorch_model.bin: "torch.FloatStorage", "torch._utils._rebuild_tensor_v2", "collections.OrderedDict"
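
The latest commit message mentions "UTF-8 safe chunking", but this listing does not show how `app.py` implements it. As a rough illustration only, the sketch below shows one common way to split a UTF-8 byte string into size-limited chunks without cutting a multi-byte character in half. The function name `utf8_safe_chunks` and the `max_bytes` parameter are hypothetical and are not taken from the repository; the input is assumed to be valid UTF-8.

```python
def utf8_safe_chunks(data: bytes, max_bytes: int) -> list[bytes]:
    """Split UTF-8 bytes into chunks of at most max_bytes each,
    never ending a chunk in the middle of a multi-byte character.

    Hypothetical helper for illustration; assumes `data` is valid UTF-8.
    """
    if max_bytes < 4:
        # A single UTF-8 code point can take up to 4 bytes.
        raise ValueError("max_bytes must be at least 4")

    chunks = []
    start = 0
    while start < len(data):
        end = min(start + max_bytes, len(data))
        # Continuation bytes have the bit pattern 10xxxxxx. If the byte at
        # the cut position is a continuation byte, the cut would split a
        # character, so move the cut back to that character's lead byte.
        while end < len(data) and end > start and (data[end] & 0xC0) == 0x80:
            end -= 1
        if end == start:
            # Only reachable with malformed input; force progress.
            end = min(start + max_bytes, len(data))
        chunks.append(data[start:end])
        start = end
    return chunks


if __name__ == "__main__":
    sample = "안녕하세요 hello 你好".encode("utf-8")
    for chunk in utf8_safe_chunks(sample, 8):
        # Each chunk decodes on its own because no character was split.
        print(len(chunk), chunk.decode("utf-8"))
```

Because every chunk boundary falls on a character boundary, each chunk can be decoded independently and the chunks concatenate back to the original bytes.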