Tools/AI Frameworks & Libraries/Hugging Face Tokenizers

Hugging Face Tokenizers

Ultra-fast text tokenization library in Rust with Python bindings.

Open SourceSelf HostedOffline Capable
0.0 (0)

About

Hugging Face Tokenizers is a tokenization library with a Rust core and bindings for Python and other languages, built for speed and versatility. It implements Byte-Pair Encoding, WordPiece, and Unigram models, trains custom tokenizers in a couple of lines, and handles configurable pre- and post-processing, tokenizing about a gigabyte of text in under twenty seconds on a server CPU. Released under the Apache 2.0 license.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Price
Free
Platform
Local/Desktop
Difficulty
Easy (2/5)
License
Apache-2.0
Added
Apr 3, 2026

Related Tools

Tensor library for machine learning on commodity hardware

Open SourceSelf HostedOffline
Expert
0.0 (0)

Structured output extraction from LLMs with Pydantic

Open SourceSelf Hosted
Easy
0.0 (0)

Deploy LangChain runnables as REST APIs

Open SourceSelf Hosted
Easy
0.0 (0)

Unified system for large-scale distributed training and inference.

Open SourceSelf HostedOfflineGPU 8GB+
Advanced
0.0 (0)

High-level deep learning library making neural nets accessible with best practices.

Open SourceSelf HostedOfflineGPU 4GB+
Easy
0.0 (0)
Featured

Open-source machine learning framework by Meta with dynamic computation graphs.

Open SourceSelf HostedOffline
Intermediate
0.0 (0)
Browse all AI Frameworks & Libraries tools