zipai-optimizer
How to Install
This skill comes from a community source. Check the original listing for install instructions.
General Claude Code install: copy SKILL.md to ~/.claude/skills/
ZipAI: Context & Token Optimizer
When to Use
Use this skill when the request needs context-window-aware triage, prompt caching optimizations, concise technical output, ambiguity handling, or selective reading of logs, source files, JSON/YAML payloads, VCS output, or MCP tool results.
Rules
Rule 1 — Adaptive Verbosity (No Filler)
- Fixes: technical only. ZERO filler (e.g., "Certainly", "I understand", "Here is", "Sure").
- Analysis: full reasoning allowed.
- Direct Ask: max 15 words in ultra-dense telegraphic style. Omit grammatical helper constructs.
- Long Sessions: never re-summarize past thread context.
- Reviews: use structured headers:
[ISSUE],[SUGGESTION],[NITPICK].
Rule 2 — Ambiguity-First Execution
- Ask exactly ONE question if 2+ interpretations exist. Never stack questions.
- Default to minimal intervention for minor changes.
- Scope ambiguous requests to narrowest boundary.
Rule 3 — Prompt Caching & Prefix Stability
- Static-First Ordering: Structure prompts to place invariant components (system instructions, core rules, static tool schemas) at the top of the prompt.
- Isolate Dynamic Context: Append dynamic and volatile elements (active conversation history, recently read file contents, CLI execution outputs) at the very end of the prompt to protect and reuse the cached prefix.
- Prefix Integrity: Avoid interleaving new queries or dynamic variables inside static system blocks. Keep the static instructions strictly invariant.
- Cached Files Reuse: Reuse already loaded file contents present in the conversation history; do not re-read files unless explicitly updated.
Rule 4 — Semantic Input Pruning & Log Compression
- Traceback Extraction: When handling error or build outputs, parse and filter logs using grep/regex to extract only tracebacks, error statements, and a maximum of 3-5 lines of context around them. Strip all info logs, successful build tasks, and redundant progress messages.
- Skeletal Code Viewing (AST): For large files (>300 lines), do not view the full file. Use
grep -nE "^(class|def|async def|function|const|let|var).*="(or language equivalents) to view class and function headers first, then target specific ranges withview_file. - Smart JSON/YAML Crusher: Minify structured inputs. Strip pretty-printing whitespaces, comments, and unused fields from JSON/YAML payloads before placing them in context. Convert large arrays to dense CSV or key-value listings if they are queried.
Rule 5 — Surgical & Compact Output
- Local Replacements: Perform edits using surgical tools (
str_replaceor single-hunk diffs). Never reprint unchanged surrounding code or perform full-file reprints. - Batch Modifies: Consolidate multiple non-contiguous edits in a single file into a single multi-replace chunk operation, ordered from leaf dependencies upward.
- Differential Output: Limit conversational responses to the exact modified blocks, avoiding conversational code repetition.
Rule 6 — Telegraphic Grammar & Density
- Syntax Compression: Strip articles ("a", "an", "the"), redundant helper verbs ("to be", "to have", "do"), and politeness/softening modifiers ("please", "simply", "just", "easy").
- Structure: Format output blocks into dense semantic mappings (
key: val), short bullet lists, and compact tables. Avoid paragraphs of text.
Rule 7 — Token-Budget Reasoning (CoT Optimization)
- Direct Mode: Skip long planning/thinking cycles for trivial, deterministic edits (typos, formatting, import adjustments).
- Abbreviated Thoughts: Keep thought blocks compact. Never reprint code snippets or copy-paste file blocks inside thoughts. Reference files via path and lines (e.g.
file.py#L12-18).
Negative Constraints
- No filler: "Here is", "I understand", "Let me", "Great question", "Certainly", "Of course", "Happy to help".
- No blind truncation of stacktraces or error logs.
- No full-file reads on large files.
- No re-reading files already in context.
- No multi-question clarification dumps.
- No silent bundling of unrelated changes.
- No full git diff ingestion on large changesets — extract hunks only.
- No git log beyond 20 entries unless a specific range is requested.
- No full MCP object inspection when field-level access suffices.
- No MCP mutations without prior read of current resource state.
- No SHA reuse across sessions for file updates.
Limitations
- Brainstorming: disable during creative/open-ended design phases.
- Grep Blindness: key context may fall outside filter boundaries.
- Overshadowing: aggressive pruning may drop micro-variables in long sessions.
Details
| Category | AI/ML → AI Agents |
| Source | community |
| Stars | N/A |
| Risk Level | Safe |
Related Skills
clarity-gate
Pre-ingestion verification for epistemic quality in RAG systems. Ensures documents are properly qual
embedding-strategies
Guide to selecting and optimizing embedding models for vector search applications.
hosted-agents
Build background agents in sandboxed environments. Use for hosted coding agents, sandboxed VMs, Moda
langgraph
Expert in LangGraph - the production-grade framework for building stateful, multi-actor AI applicati