Up to 54% fewer tokens than JSON ・ 11% more efficient than TOON

Stop Wasting Tokens.
Start Using Tauq.

The most token-efficient data notation for LLMs. Schema-driven, streaming-ready, and state-of-the-art.

44%
vs JSON
11%
vs TOON
5x
Faster Parsing
100%
Streaming Ready
JSON 17 tokens
[{"id": 1, "name": "Alice"}, {"id": 2, "name": "Bob"}]
Tauq 11 tokens (-35%)
!def User id name
1 Alice
2 Bob

* 1,000 records: JSON 24,005 → TOON 12,002 → Tauq 11,012

Built for the AI Era

Every feature designed to minimize tokens and maximize developer productivity.

Token-Optimal

44% fewer tokens than JSON overall, 54% on flat data. Space delimiters tokenize better than commas.

🌊

True Streaming

StreamingParser iterator API processes records one at a time. No count required unlike other formats.

📐

Schema-Driven

Define data shapes with !def. Switch schemas with !use. Nested types and typed arrays for complex data.

🔧

Programmable

Tauq Query for data transformations. Unix pipe model with polyglot support (Python, Rhai, JavaScript).

🛠️

Production CLI

Full-featured CLI with build, format, minify, exec, and validate commands. Ready for production use.

🌍

Language Bindings

Native support for Python, JavaScript, Go, and Rust. Use Tauq in your favorite language.

🚀

LSP Support

Language Server Protocol implementation for VS Code, Neovim, and other editors. IntelliSense, syntax highlighting, and more.

Battle-Tested

85+ passing tests. Benchmarked with tiktoken. Production-ready with comprehensive error handling.

Verified Performance

Primary benchmark: 1,000 user records with tiktoken cl100k_base (GPT-4/Claude tokenizer).

JSON (minified)
24,005 tokens baseline
TOON
12,002 tokens -50.0%
Tauq SOTA
11,012 tokens -54.1%
12,993
Tokens Saved vs JSON
10.8%
Better than TOON (avg)
$0.04
Saved per 1K Records

* All measurements verified with tiktoken cl100k_base tokenizer
** Cost based on GPT-4o/Claude 3.5 Sonnet pricing ($3 per 1M input tokens)
*** TOON encoded via toon-python library for fair comparison. Tauq v0.1.0

Get Started in Minutes

Install Tauq and start saving tokens immediately. Available for all major languages.

1. Install

# Install via cargo
cargo install tauq

# Or via package managers
pip install tauq         # Python
npm install tauq         # JavaScript
go get github.com/epistates/tauq  # Go

2. Use

# Create a config file
cat > config.tqn <<EOF
app_name "MyService"
version "1.0.0"
port 8080
debug true
features [api websockets metrics]
EOF

# Parse to JSON
tauq build config.tqn --pretty

Perfect for Every Use Case

🤖

AI/LLM Applications

Reduce token costs by up to 54% vs JSON, 11% vs TOON on average. Perfect for prompt engineering, RAG systems, and LLM training data.

Learn more →
📊

Data Processing

Stream large datasets efficiently. ETL pipelines, data transformations, and batch processing.

Learn more →
⚙️

Configuration Management

Human-readable configs with schema validation. Replace YAML, JSON, or TOML with Tauq.

Learn more →

Multi-Language Support

Use Tauq in your favorite programming language.

🐍

Python

Native bindings

📜

JavaScript

Native bindings

🦀

Rust

Native bindings

🐹

Go

Native bindings

Example: Tabular Data

users.tqn tauq
!def User id name email role

1 Alice "alice@example.com" admin
2 Bob "bob@example.com" user
3 Carol "carol@example.com" user

Ready to Save Tokens?

Join developers who are already using Tauq to reduce AI costs and improve performance.

MIT License
85+ Tests
Production Ready