AI cost gateway · 40–70% lower token spend · Local-first & private · Effectively pays for itself

Smart AI tooling that respects your wallet—and your privacy.

TokenBeaver sits between your agent and the model, trimming wasted tokens before they ever hit your bill. Local-first, so your prompts go straight to your model provider and never pass through our servers.

Add to VS Code — free→ See pricing

Lower AI spend · Free to start · Fail-open architecture

Works with

Claude Code · Cursor · Cline · VS Code · GitHub Copilot · Kilo · Roo Code · + many more

Two ways to get TokenBeaver.

Install straight from the VS Code Marketplace, or download the .vsix and add it by hand — your call.

Add to VS Code→

Download .vsix

How it works

A gate between your editor and the bill.

Three steps. One line of config. Your keys, your machine — TokenBeaver never sees your code or your account.

Intercept

Point your editor at TokenBeaver's local endpoint. One line of config — no SDK, no rewrite.

Optimize

We optimize both input and output, along with context and cache use. All of it happens on your machine.

Forward

A leaner request reaches the model with your own key. Same answer, fewer tokens, smaller bill.
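The three steps above, together with the fail-open behavior mentioned elsewhere on this page, can be pictured with a short Python sketch. It is an illustration under stated assumptions, not TokenBeaver's actual code: the `handle` function, the `Request` shape, and the two callbacks are hypothetical names chosen for the example.

```python
from typing import Callable

# Hypothetical request shape for illustration: {"messages": [...]}
Request = dict


def handle(request: Request,
           optimize: Callable[[Request], Request],
           forward: Callable[[Request], str]) -> str:
    """Intercept a request, try to optimize it, then forward it.

    Fail-open: if the optimizer raises, the original request is
    forwarded untouched instead of failing the call.
    """
    try:
        leaner = optimize(request)
    except Exception:
        leaner = request  # fall back to the unmodified request
    return forward(leaner)
```

The point of the design is that a bug in the optimizer costs you some savings on that request, not the request itself.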

Why TokenBeaver

Built to save money without holding your data hostage.

Local-first by design

Prompts, code, and keys are never sent to TokenBeaver. Optimization runs in-process on your machine.

Real cost reduction

40–70% lower API spend, depending on session length, model, and workload.

Editor-native

Drops into Claude Code, VS Code, and others with a single endpoint change.

Your keys, your account

Bring your own provider keys. We never proxy billing or touch your model account.

Stretch your quota

15–25% more usable Claude Code quota in a 5-hour rolling window, depending on session length and number of sub-agents.

Fully transparent

See every optimization applied to every request. Nothing hidden, nothing silently dropped.

40–70% lower API cost

15–25% more quota · 5-hr window

0 bytes of your code sent to TokenBeaver's servers

Figures from internal testing across models. Live telemetry from real users is coming soon.
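To see what "pays for itself" could mean in practice, here is a rough break-even calculation in Python. It assumes the Pro monthly price of $15 and the 40–70% savings range quoted above, and it reads "pays for itself" as "monthly savings are at least the plan price". The function name `break_even_spend` is made up for this example.

```python
def break_even_spend(plan_price: float, savings_rate: float) -> float:
    """Monthly API spend at which the savings equal the plan price."""
    if not 0 < savings_rate <= 1:
        raise ValueError("savings_rate must be in (0, 1]")
    return plan_price / savings_rate


for rate in (0.40, 0.70):
    spend = break_even_spend(15, rate)
    print(f"{rate:.0%} savings: break-even at ${spend:.2f}/month")
# 40% savings: break-even at $37.50/month
# 70% savings: break-even at $21.43/month
```

Under those assumptions, anyone spending more than roughly $21 to $38 a month on API calls would recover the monthly Pro price. Your own savings rate may fall outside the quoted range.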

Pricing

Start free. Upgrade when it pays for itself.

Free

$0 / forever

For trying it out on a real project.

Add to VS Code

✓ 20 optimizations / month
✓ Full feature access
✓ Local-first, your keys

14-day trial

Pro

$15 / month

Or $100 / year, about $8.33 a month.

Start 14-day trial

✓ Unlimited optimizations
✓ Full feature access
✓ Priority optimization engine
✓ Email support

Compare full plans & FAQ →

Stop paying for tokens you don't need.

Twenty free optimizations per month, no card required. See the savings on your own project today.

Add to VS Code — free→ Set up in 2 minutes

Pricing

Pricing that pays for itself.

One Pro plan if you want it. Free forever if you don't. No seats, no usage surprises.

Free

$0 / forever

For trying TokenBeaver on a real project, no card needed.

Add to VS Code

✓ 20 optimizations / month
✓ Full feature access
✓ Local-first: your keys, your machine
✓ Works with Claude Code, Cursor, Cline
✓ Community support

14-day free trial

Pro

$100 / year

Just $8.33 / month, billed annually. Saves $80 a year (44%) versus paying monthly.

$15 / month

Billed monthly. Switch to annual any time to save 44%.
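The annual figures follow directly from the two list prices. A quick Python check, using only the $15 and $100 prices shown here (the constant names are ours):

```python
MONTHLY_PRICE = 15.00   # Pro, billed monthly
ANNUAL_PRICE = 100.00   # Pro, billed annually

monthly_equivalent = ANNUAL_PRICE / 12
yearly_on_monthly = MONTHLY_PRICE * 12
savings = yearly_on_monthly - ANNUAL_PRICE
savings_pct = savings / yearly_on_monthly * 100

print(f"${monthly_equivalent:.2f}/month equivalent")
print(f"${savings:.0f} saved per year ({savings_pct:.0f}%)")
# $8.33/month equivalent
# $80 saved per year (44%)
```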

Start 14-day trial

✓ Unlimited optimizations
✓ Everything in Free
✓ Priority optimization engine
✓ Advanced caching & dedupe
✓ Email support

Cancel anytime from the self-serve portal · No charge during the trial

FAQ

Questions, answered.

How does TokenBeaver actually cut my bill?

It sits between your editor and the model, stripping redundant context, deduping repeated content, and caching locally — so each request carries fewer tokens while returning the same answer.

Do my prompts or code ever leave my machine?

No. TokenBeaver is local-first — optimization happens in-process on your machine, and requests go directly to the model with your own key.

What's an "optimization"?

One optimized request passing through the gate. Free gives you 20 per month; Pro is unlimited.

Will it work with my setup?

If you use Claude Code, Cursor, Cline, or any tool that lets you set a base URL, yes. Setup is a single endpoint change.

Can I cancel during the trial?

Yes — cancel any time in the 14 days and you won't be charged. Self-serve, no email required.
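One of the techniques named in the first answer, deduping repeated content, can be pictured with a short Python sketch. It is an illustration only, not TokenBeaver's actual engine: the function name `dedupe_blocks` and the idea of treating context as a list of text blocks are assumptions made for the example.

```python
import hashlib


def dedupe_blocks(blocks: list[str]) -> list[str]:
    """Keep the first copy of each context block, drop exact repeats.

    Blocks are compared by a hash of their whitespace-normalized text,
    so order is preserved and only later duplicates are removed.
    """
    seen: set[str] = set()
    kept: list[str] = []
    for block in blocks:
        normalized = " ".join(block.split())
        key = hashlib.sha256(normalized.encode("utf-8")).hexdigest()
        if key not in seen:
            seen.add(key)
            kept.append(block)
    return kept
```

For example, `dedupe_blocks(["a  b", "c", "a b", "c"])` returns `["a  b", "c"]`. A real optimizer would need more care than this to make sure dropping a repeat never changes the model's answer.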

Start your Pro trial

Pro unlocks unlimited optimizations right inside the extension you've installed. 14 days free — we'll only charge you when the trial ends, and you can cancel any time before then.

🔒 Secured by Stripe · You won't be charged until the trial ends

Order summary

TokenBeaver Pro

Unlimited optimizations

Pro · Annual: $100.00 / yr
Monthly equivalent: $8.33 / mo
Renews after your trial: $100.00/yr on {{ trialEnd }}

Pro · Monthly: $15.00 / mo
Renews after your trial: $15.00/mo on {{ trialEnd }}

14-day free trial: −100%
Due today: $0.00

✓ Cancel anytime, self-serve
✓ Local-first: your keys, your machine
✓ Email support included

Integrations

Set up in two minutes.

Install the extension, then point your editor at the local gate. No SDK, no code changes, no account hand-off — your keys stay yours.

Claude Code · VS Code · Cursor · Cline · Roo Code · GitHub Copilot

Install the extension

Search TokenBeaver in the VS Code Marketplace, or install directly.

Install from VS Code Marketplace→

Download from tokenbeaver.ai

Start the gateway

Click Start Gateway in the TokenBeaver panel, or open the Command Palette (Ctrl+Shift+P / Cmd+Shift+P) and run TokenBeaver: Start Gateway. Either way, the extension starts a local server and shows you a Base URL.

> TokenBeaver: Start Gateway

Base URL http://127.0.0.1:4505/v1
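If you want to confirm the gateway is up before changing any tool settings, a small Python check like the one below can help. It is a sketch, not part of TokenBeaver: the function name `gateway_is_listening` is hypothetical, and the default port is simply the one from the example Base URL above, so use whatever the extension shows you.

```python
import socket


def gateway_is_listening(host: str = "127.0.0.1", port: int = 4505,
                         timeout: float = 1.0) -> bool:
    """Return True if something accepts TCP connections on host:port."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False
```

This only tells you that some process is listening on the port, not that it is TokenBeaver.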

Point your tool at it

Paste the Base URL into your coding tool's API settings. Keep using your own API key — TokenBeaver forwards it unchanged.

Tool | Where to paste

Claude Code (API key) | Set ANTHROPIC_BASE_URL=http://127.0.0.1:4505 (no /v1 at the end)

Cursor | Settings → Models → Override OpenAI Base URL

Cline / Roo Code | Settings → API Provider → OpenAI Compatible → Base URL

GitHub Copilot | TokenBeaver Settings → Provider → GitHub Copilot (beta)
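The one detail that is easy to get wrong in the table above is the trailing /v1: Claude Code wants the address without it, while the OpenAI-compatible tools take the Base URL as shown. The Python sketch below captures that rule. The helper `base_url_for` and the tool labels are made up for illustration, and the default URL is the example one from the previous step.

```python
GATEWAY_BASE_URL = "http://127.0.0.1:4505/v1"  # as shown by Start Gateway


def base_url_for(tool: str, gateway_url: str = GATEWAY_BASE_URL) -> str:
    """Return the value to paste for a given tool (hypothetical labels).

    Claude Code's ANTHROPIC_BASE_URL takes the gateway address without
    the trailing /v1; the OpenAI-compatible tools take it as shown.
    """
    url = gateway_url.rstrip("/")
    if tool == "claude-code":
        return url[: -len("/v1")] if url.endswith("/v1") else url
    if tool in {"cursor", "cline", "roo-code"}:
        return url
    # GitHub Copilot is selected inside TokenBeaver Settings instead.
    raise ValueError(f"no base-URL setting known for {tool!r}")


print(f"export ANTHROPIC_BASE_URL={base_url_for('claude-code')}")
# export ANTHROPIC_BASE_URL=http://127.0.0.1:4505
```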

Same task. Same answer. Far fewer tokens.

Without TokenBeaver: 18,400 tokens / request

With TokenBeaver: 7,600 tokens / request · −59%

Illustrative figure from internal testing. Your results vary by model and workload.
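The −59% follows from the two token counts. A short Python check you can reuse with your own before-and-after numbers (the function name `percent_reduction` is ours):

```python
def percent_reduction(before: int, after: int) -> float:
    """Percentage by which `after` is smaller than `before`."""
    if before <= 0:
        raise ValueError("before must be positive")
    return (before - after) / before * 100


print(f"{percent_reduction(18_400, 7_600):.0f}% fewer tokens per request")
# 59% fewer tokens per request
```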

Start free — 20 optimizations/month→

About

We built TokenBeaver because our own AI bill got silly.

We're developers who lean on AI coding tools every day. The output was worth it — the invoices, less so. Most of what we were paying for was redundant context shipped to the model over and over again.

So we built a small gate that sits between the editor and the model, trims the waste, and forwards a leaner request. It runs locally, uses your own keys, and never sees your code. In about a month of internal testing it's cut our API spend by 40–70% depending on the model — without changing a single answer.

TokenBeaver is independent and intentionally simple. No data harvesting, no lock-in, no surprise pricing. If it saves you money, keep it. If it doesn't, the free tier costs you nothing.

What we stand on

Local-first, always

Your prompts, code, and keys stay on your machine. Privacy isn't a setting — it's the architecture.

Honest pricing

One Pro plan, a real free tier, and no usage traps. You always know what you'll pay.

Show the work

Every optimization is visible. We'd rather under-claim and show you the numbers.

Small and independent

No investors to please, no growth-at-all-costs. Just a tool that earns its keep on your machine.

Try it free→

Legal

Privacy Policy

Last updated: June 28, 2026

TokenBeaver is built local-first. The short version: your prompts, your code, and your API keys stay on your machine. We designed the product so that we never see them in the first place.

1. What TokenBeaver is

TokenBeaver is a VS Code extension and local gateway that sits between your AI coding agent and your model provider, trimming wasted tokens before requests are sent. Optimization happens in-process on your own device.

2. What we do not collect

We never receive, store, or transmit any of the following:

—Your source code

—Your prompts

—Your API keys

—AI responses returned by the model

TokenBeaver optimizes in-process on your own device. Nothing from your editor or model interactions is sent to TokenBeaver's servers — requests go directly from your machine to your chosen model provider using your own key, so we are never in the path of your billing relationship with that provider.

3. What we do collect

We don't ask you to create an account, and we don't collect your email or personal profile. To handle billing and basic product analytics, we collect only a limited set of data:

—Billing data: handled entirely by our payment processor (Stripe) when you upgrade to Pro. We never store full card numbers or payment details on our servers.

—Usage metadata: aggregate counts such as the number of optimizations performed, used to enforce plan limits and improve the product. This is a count — not the content of your requests.

What | Why

Device fingerprint (hashed) | License verification

Install mode (cost / capacity / both) | Aggregate product analytics

AI provider in use | Aggregate product analytics

Anonymous token-saving stats (no device ID) | Public usage totals

4. How we use it

We use the data above to provide and secure the service, enforce free and Pro plan limits, process payments, respond to support requests, and understand aggregate product usage. We do not sell your data, and we do not use your code or prompts to train models.

5. Third parties

We rely on a small number of processors to run the business — for example, Stripe for payments and standard infrastructure providers for hosting our website and account systems. Your model provider (such as Anthropic) receives your requests directly from your machine under your own agreement with them.

6. Data retention & your rights

We keep billing records only as long as required for legal and accounting purposes. Since there's no account or email on file, the only data tied to you is your hashed device record. You can request deletion of your device record at any time by emailing [email protected]. Aggregate stats are anonymous and cannot be linked to a device.

7. Changes & contact

We may update this policy as the product evolves; material changes will be reflected by the date above. Questions about privacy? Email [email protected].

Legal

Terms of Service

Last updated: June 28, 2026

These terms govern your use of TokenBeaver — the VS Code extension, local gateway, and website. By installing or using TokenBeaver, you agree to them. If you're using it for an organization, you accept on its behalf.

1. What TokenBeaver provides

TokenBeaver is a tool that optimizes AI coding requests locally on your device to reduce token usage. You bring your own model-provider API key; TokenBeaver forwards your requests to that provider unchanged except for the optimizations it applies. We don't resell model access and we aren't a party to your agreement with your model provider.

2. Accounts & plans

The Free plan includes 20 optimizations per month with full feature access. Pro is $15/month or $100/year and includes unlimited optimizations, billed through our payment processor. Pro starts with a 14-day free trial; if you don't cancel before it ends, the plan converts to paid at the price shown at signup.

3. Acceptable use

Use TokenBeaver lawfully and in line with your model provider's terms. Don't attempt to resell, reverse-engineer for circumvention, abuse the optimization service, or use it to violate others' rights. We may suspend access for activity that threatens the service or other users.

4. Your responsibilities

You're responsible for your own API keys, your model-provider charges, and the code and prompts you run through your editor. TokenBeaver runs locally and uses a fail-open design, but you should keep your own backups and review outputs before relying on them. Don't use TokenBeaver to circumvent model-provider rate limits in ways that violate those providers' terms of service.

5. Disclaimers & liability

TokenBeaver is provided as-is. We're not liable for API costs, model behavior, or data loss. Token savings are not guaranteed — results vary by session length, model, workload, and number of sub-agents. To the extent permitted by law, our liability is limited to the amount you paid us in the 6 months before a claim.

6. Changes & contact

We may update these terms as the product evolves; material changes are reflected by the date above and continued use means acceptance. Questions? Email [email protected].

Legal

Refund Policy

Last updated: June 28, 2026

We want TokenBeaver to pay for itself. Between the free tier and the 14-day Pro trial, you can confirm it saves you money before you're ever charged — but if something's wrong, here's how refunds work.

1. Try before you pay

The Free plan (20 optimizations per month) and the 14-day Pro trial let you evaluate TokenBeaver at no cost. No card is charged until a trial ends, and you can cancel any time before then to avoid being billed.

2. 7-day money-back guarantee

If you're charged for Pro and aren't happy, email us within 7 days of that charge and we'll refund it in full — no interrogation. This applies to your first paid term (monthly or annual).

3. Annual plans & renewals

For annual plans, the 7-day window applies from the date of each charge, including renewals. After the window, we don't provide prorated refunds for the remainder of a term, but you can cancel to stop the next renewal and keep access until the period ends.

4. How to request one

Email [email protected] from the address on your subscription. Approved refunds are issued to your original payment method through Stripe and typically appear within 5–10 business days, depending on your bank.

404

This page slipped through the gate.

The beaver couldn't find what you're looking for — but your tokens are still safe.

Back home→ See pricing