What is Lakon?
Lakon is an AI Optimization & Continuity Engine. It consists of two tools:
Prompt compression: removes everything that wastes tokens (polite phrasing, redundant context, scaffolding) while preserving every piece of signal the AI needs to answer correctly. The output is shorter, and the response is meant to stay equivalent.
Continuity Engine: converts long, messy chat logs into a clean, structured "Continuation Prompt", so you can resume a workflow in a fresh chat exactly where you left off instead of running out of context window.
Quick Start
Option 1 — Browser Extension (Recommended)
Install the extension and compress directly inside Claude, ChatGPT, or Gemini. No copy-paste. No tab switching.
Option 2 — Web App
No installation needed. Paste your prompt at lakonai.vercel.app/app and compress it there.
How It Works
Lakon sends your prompt to a compression backend powered by Groq. A specialized system prompt instructs the model to:
1. Remove polite phrasing, hedging words, and redundant restatements of the request.
2. Move signal to the beginning and end of the prompt, where LLMs pay the most attention (the primacy/recency effect).
3. Preserve frameworks, formats, word counts, and tone instructions exactly as specified.
The backend returns compressed text, token counts, and preserved signals — never commentary.
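The token counts and the reduction percentage in a response are related by simple arithmetic. The sketch below assumes that reduction_pct is the share of tokens removed, rounded to the nearest whole percent. The source does not state the rounding rule; this assumption is merely consistent with the sample response in this section (76 tokens down to 17).

```python
def reduction_pct(tokens_before: int, tokens_after: int) -> int:
    """Percentage of tokens removed, rounded to the nearest whole number.

    Assumption: this mirrors how the backend derives reduction_pct;
    the documentation does not spell out the exact formula.
    """
    if tokens_before <= 0:
        raise ValueError("tokens_before must be positive")
    return round((tokens_before - tokens_after) / tokens_before * 100)


# Token counts taken from the sample response in this section.
print(reduction_pct(76, 17))  # 78
```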
What it returns
{
  "compressed": "Compare PostgreSQL vs MongoDB: when to use each. Skip basics. Include decision table.",
  "tokens_before": 76,
  "tokens_after": 17,
  "reduction_pct": 78,
  "signal_preserved": ["technical comparison", "decision table format"],
  "warning": null
}
Extension Guide
Installing in Developer Mode (Chrome / Brave / Opera / Edge)
Until the extension is live on the Chrome Web Store, install it manually:
Using the Extension
After installation, open Claude, ChatGPT, or Gemini. Type your prompt as usual. You'll see a Lakon button appear next to the send button. Click it — your prompt is replaced with the compressed version instantly.
Supported Platforms
Playground (Web App)
The web app at lakonai.vercel.app/app lets you compress prompts without installing the extension. It uses the same backend as the extension, so compression quality is identical.
When to use the web app
Keyboard Shortcut
Press ⌘ + Enter (Mac) or Ctrl + Enter (Windows) to compress.
Context Snapshots
The Continuity Engine analyzes your massive chat logs and generates a clean "Context Snapshot" using a specialized Map-Reduce pipeline running on Llama 3.3 70B.
How it works
1. Map Phase: Lakon chunks your chat log and extracts the ultimate goal, key decisions, hard constraints, and open tasks.
2. Reduce Phase: It merges these intermediate snapshots into a final, unified JSON structure.
3. Continuation Prompt: It then generates a concise first-person briefing paragraph. Paste this into a new chat to bring a fresh AI up to speed without carrying over the raw, cluttered history.
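The Map and Reduce phases above can be sketched in a few lines of Python. This is an illustration of the general pattern, not Lakon's actual pipeline: the field names (goal, decisions, constraints, open_tasks), the chunk size, and the extract callback standing in for the model call are all assumptions.

```python
from typing import Callable, Dict, List

Snapshot = Dict[str, List[str]]
# Hypothetical field names, loosely following the items listed in the Map Phase.
FIELDS = ("goal", "decisions", "constraints", "open_tasks")


def chunk_log(log: str, max_chars: int = 4000) -> List[str]:
    """Split a chat log on line boundaries into chunks of roughly max_chars.

    A single line longer than max_chars becomes its own oversized chunk.
    """
    chunks: List[str] = []
    current: List[str] = []
    size = 0
    for line in log.splitlines():
        if current and size + len(line) + 1 > max_chars:
            chunks.append("\n".join(current))
            current, size = [], 0
        current.append(line)
        size += len(line) + 1
    if current:
        chunks.append("\n".join(current))
    return chunks


def merge_snapshots(partials: List[Snapshot]) -> Snapshot:
    """Reduce step: combine each field across chunks, dropping duplicates in order."""
    merged: Snapshot = {field: [] for field in FIELDS}
    for partial in partials:
        for field in FIELDS:
            for item in partial.get(field, []):
                if item not in merged[field]:
                    merged[field].append(item)
    return merged


def build_snapshot(log: str, extract: Callable[[str], Snapshot],
                   max_chars: int = 4000) -> Snapshot:
    """Map each chunk through `extract` (a model call in practice), then merge."""
    return merge_snapshots([extract(chunk) for chunk in chunk_log(log, max_chars)])


def toy_extract(chunk: str) -> Snapshot:
    """Stand-in for the model: picks up lines tagged DECISION: or TODO:."""
    snap: Snapshot = {field: [] for field in FIELDS}
    for line in chunk.splitlines():
        if line.startswith("DECISION:"):
            snap["decisions"].append(line[len("DECISION:"):].strip())
        elif line.startswith("TODO:"):
            snap["open_tasks"].append(line[len("TODO:"):].strip())
    return snap
```

Calling build_snapshot(log, toy_extract) on a log with tagged lines returns one merged dictionary. In a real pipeline the extract callback would be the per-chunk model call, and the merged result would feed the Continuation Prompt step.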
API Reference
The Lakon backend exposes a simple REST API. You can call it directly from your own code.
Base URL
https://lakon-api.onrender.com
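With that base URL, a call from Python might look like the sketch below. It uses only the standard library and targets the /compress endpoint documented in this reference. The function names, the default compression_mode of "balanced", and the 60-second timeout are assumptions made for illustration, not documented behavior.

```python
import json
import urllib.request

BASE_URL = "https://lakon-api.onrender.com"  # base URL from this reference


def build_compress_request(prompt: str, task_type: str = "auto",
                           compression_mode: str = "balanced") -> urllib.request.Request:
    """Build (but do not send) a POST /compress request.

    The "balanced" default is an assumption; the reference lists the
    allowed values but not which one the server uses by default.
    """
    body = json.dumps({
        "prompt": prompt,
        "task_type": task_type,
        "compression_mode": compression_mode,
    }).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/compress",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def compress(prompt: str, timeout: float = 60.0) -> dict:
    """Send the request and decode the JSON response body."""
    request = build_compress_request(prompt)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Keeping request construction separate from sending makes the payload easy to inspect or log before anything goes over the network.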
POST /compress
Compresses a prompt. Returns the compressed text and token statistics.
Request body
{
  "prompt": "string (required)",
  "task_type": "auto | coding | writing | analysis | creative | data",
  "compression_mode": "strict | balanced | creative"
}
Response
{
  "compressed": "string",
  "tokens_before": number,
  "tokens_after": number,
  "reduction_pct": number,
  "signal_preserved": string[],
  "warning": string | null
}
Example
curl -X POST https://lakon-api.onrender.com/compress \
  -H "Content-Type: application/json" \
  -d '{"prompt": "Help me understand React hooks.", "task_type": "auto"}'
GET /health
Returns server status. Use this to wake the server on cold start.
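A server coming out of a cold start may not answer the first request. The sketch below polls /health until it reports status "ok". The retry count, delay, and function names are assumptions chosen for illustration; the fetch and sleep parameters are injectable so the loop can be exercised without a network.

```python
import json
import time
import urllib.request
from typing import Callable

HEALTH_URL = "https://lakon-api.onrender.com/health"


def fetch_health(url: str = HEALTH_URL, timeout: float = 10.0) -> dict:
    """GET /health and decode the JSON body."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


def wait_until_awake(fetch: Callable[[], dict] = fetch_health,
                     attempts: int = 5, delay: float = 3.0,
                     sleep: Callable[[float], None] = time.sleep) -> bool:
    """Poll until the server reports status "ok" or the attempts run out."""
    for attempt in range(attempts):
        try:
            if fetch().get("status") == "ok":
                return True
        except (OSError, ValueError):
            # OSError covers refused connections and timeouts while the
            # server spins up; ValueError covers a non-JSON response body.
            pass
        if attempt < attempts - 1:
            sleep(delay)
    return False
```

Calling wait_until_awake() once before the first /compress or /snapshot request keeps the cold-start delay out of the interactive path.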
curl https://lakon-api.onrender.com/health
# → {"status": "ok", "message": "Server is awake"}
POST /snapshot
Generates a Context Snapshot and Continuation Prompt from a raw conversation log.
Request body
{
  "conversation": "string (required, the raw chat log)"
}
Rate Limits
3 requests per minute per IP address. Designed for interactive use, not batch processing.
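A client that makes several calls in a row can stay under this limit by throttling itself. The sketch below is a client-side sliding-window limiter. It assumes the server counts requests over any 60-second span; the source does not say how the window is measured, so treat this as a conservative guess. The class and method names are hypothetical.

```python
import time
from collections import deque
from typing import Callable, Deque


class SlidingWindowLimiter:
    """Allow at most `limit` calls in any `window` seconds (client side only)."""

    def __init__(self, limit: int = 3, window: float = 60.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.limit = limit
        self.window = window
        self.clock = clock
        self.sent: Deque[float] = deque()  # timestamps of recent requests

    def wait_time(self) -> float:
        """Seconds to wait before the next call is allowed (0.0 if allowed now)."""
        now = self.clock()
        # Forget requests that have aged out of the window.
        while self.sent and now - self.sent[0] >= self.window:
            self.sent.popleft()
        if len(self.sent) < self.limit:
            return 0.0
        return self.window - (now - self.sent[0])

    def record(self) -> None:
        """Note that a request was just sent."""
        self.sent.append(self.clock())
```

A caller would sleep for limiter.wait_time() seconds, send the request, and then call limiter.record(). The server remains the authority on the limit, so a client should still handle a rejected request gracefully.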