# TokenSaver
A token cost optimization skill that helps you save 50-80% on AI token usage without sacrificing response quality.
## When to Use
Use TokenSaver when:
- You have long conversations that consume many tokens
- You want to reduce AI API costs
- You're working with technical discussions that accumulate context
- You notice token usage growing rapidly in long sessions
## Core Capabilities
### 1. Smart Context Compression
Automatically compresses conversation history based on message importance.
How it works:
- Recent messages (last 3-5) kept fully intact
- Older messages summarized based on importance score
- Code blocks and critical decisions never compressed
Savings: 50-70% reduction in context tokens
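The flow above can be sketched roughly as follows. This is a minimal illustration, not TokenSaver's actual internals: the message fields, the importance heuristic, and the truncating summarizer are all assumptions (a real summarizer would call an LLM).

```python
def compress_history(messages, keep_recent=4):
    """Keep recent messages intact; summarize older, low-importance ones."""
    recent = messages[-keep_recent:]
    compressed = []
    for msg in messages[:-keep_recent]:
        # Code blocks and critical decisions are never compressed.
        if "```" in msg["text"] or msg.get("critical"):
            compressed.append(msg)
        elif importance(msg) >= 0.7:
            compressed.append(msg)
        else:
            compressed.append({"role": msg["role"],
                               "text": summarize(msg["text"])})
    return compressed + recent

def importance(msg):
    # Toy heuristic: longer messages and questions score higher.
    score = min(len(msg["text"]) / 500, 1.0)
    if "?" in msg["text"]:
        score = min(score + 0.3, 1.0)
    return score

def summarize(text, max_chars=80):
    # Placeholder summarizer: truncate. A real one would call an LLM.
    return text[:max_chars] + ("…" if len(text) > max_chars else "")
```

The key invariant is that the most recent messages and anything containing code pass through untouched; only old, low-scoring prose is replaced with a summary.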
### 2. Semantic Cache
Caches responses to similar queries to avoid reprocessing.
How it works:
- L1: Exact query match → 100% savings
- L2: Semantic similarity > 85% → 80% savings
- L3: Pattern match → 50% savings
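A minimal sketch of the tiered lookup. The similarity measure here is plain word overlap standing in for real semantic similarity (embeddings), and the L3 pattern-match tier is omitted for brevity:

```python
class SemanticCache:
    """Two of the three tiers: L1 exact match and L2 semantic similarity."""

    def __init__(self, sim_threshold=0.85):
        self.store = {}              # query -> cached response
        self.sim_threshold = sim_threshold

    def put(self, query, response):
        self.store[query] = response

    def get(self, query):
        # L1: exact query match -> 100% savings
        if query in self.store:
            return self.store[query], "L1"
        # L2: similarity above threshold -> reuse the cached response
        for cached, response in self.store.items():
            if word_overlap(query, cached) >= self.sim_threshold:
                return response, "L2"
        return None, None

def word_overlap(a, b):
    # Jaccard overlap of word sets; a stand-in for embedding similarity.
    wa, wb = set(a.lower().split()), set(b.lower().split())
    return len(wa & wb) / len(wa | wb) if wa | wb else 0.0
```

With this shape, rephrased questions ("Python file write method?") land in L2 and reuse the earlier answer instead of spending new tokens.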
### 3. Adaptive Optimization
Automatically adjusts compression based on token pressure.
Stages:
- < 3K tokens: No compression
- 3-6K tokens: Light compression
- 6-10K tokens: Medium compression
- > 10K tokens: Heavy compression + suggest new chat
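The staging logic amounts to a threshold table. A sketch (boundary handling at exactly 3K/6K/10K is an assumption, since the ranges above overlap at the edges):

```python
def compression_stage(token_count):
    """Map the current context size to a compression stage."""
    if token_count < 3_000:
        return "none"
    if token_count < 6_000:
        return "light"
    if token_count <= 10_000:
        return "medium"
    # Heavy compression; this is also where a new chat gets suggested.
    return "heavy"
```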
## Natural Language Commands
When user asks about TokenSaver in natural language, interpret and execute:
### Settings & Configuration
User says: "Configure TokenSaver" / "TokenSaver settings" / "Setup TokenSaver"
Action: Show current configuration and available options
```
Current TokenSaver Settings:
- Mode: Adaptive (auto-adjust based on token pressure)
- Compression: Balanced
- Cache: Enabled
- Quality Threshold: 85%
You can change mode:
- /tokensave - Aggressive mode (max savings)
- /tokenbalance - Balanced mode (default)
- /tokenquality - Quality priority (min compression)
```
User says: "Use aggressive mode" / "Maximize savings" / "Set to save mode"
Action: Execute /tokensave command
Response: "✅ TokenSaver switched to aggressive save mode. This provides maximum token savings (up to 80%) with slight quality trade-off."
User says: "Use balanced mode" / "Default settings" / "Set to balanced"
Action: Execute /tokenbalance command
Response: "✅ TokenSaver switched to balanced mode. Good savings (50-70%) with quality preserved."
User says: "Prioritize quality" / "Keep full context" / "Set to quality mode"
Action: Execute /tokenquality command
Response: "✅ TokenSaver switched to quality priority mode. Minimal compression, maximum context retention for precision-critical tasks."
User says: "Disable TokenSaver" / "Turn off token saving" / "Stop TokenSaver"
Action: Execute /tokenoff command
Response: "⏸️ TokenSaver temporarily disabled. Re-enable anytime with /tokens or by asking me to turn it back on."
### Usage Reports & Analytics
User says: "Show me token usage report" / "TokenSaver statistics" / "Token report"
Action: Execute /tokenreport command
Response format:
```
📈 TokenSaver Usage Report
━━━━━━━━━━━━━━━━━━━━━━━━━━━━
This Session:
• Total Original Tokens: 12,450
• Total Optimized Tokens: 4,358
• Tokens Saved: 8,092 (65%)
• Estimated Cost Saved: $0.016
• Cache Hits: 5
• Compressions Applied: 12
Top Savings Sources:
- Context Compression: -6,200 tokens
- Semantic Cache: -1,500 tokens
- Response Optimization: -392 tokens
```
User says: "TokenSaver status" / "How much am I saving?" / "Token status"
Action: Execute /tokens command
Response format:
```
💰 TokenSaver Status
━━━━━━━━━━━━━━━━━━━━
Current Session:
• Savings: 65%
• Original Tokens: 12,450
• Optimized Tokens: 4,358
• Cache Hits: 5
Current Mode: Balanced (Adaptive)
• Stage 3 compression active
• Quality Score: 95%
```
User says: "Clear token cache" / "Reset cache" / "Clear TokenSaver cache"
Action: Execute /tokencache clear command
Response: "✅ TokenSaver cache cleared. Fresh responses will be generated for subsequent queries."
### Explanation & Help
User says: "What is TokenSaver" / "How does token saving work?" / "Explain TokenSaver"
Response:
```
TokenSaver is an intelligent token cost optimization skill that helps you save 50-80% on AI token usage.
Key Features:
- Smart Context Compression - Summarizes old conversation while keeping recent messages intact
- Semantic Cache - Reuses responses for similar questions
- Adaptive Optimization - Automatically adjusts based on conversation length
- Quality Guard - Never compresses code, errors, or important messages
Benefits:
• Save 50-80% on token costs
• Faster responses with caching
• No quality loss for critical content
• Automatic - works in background
```
User says: "TokenSaver not working" / "Why no savings?" / "TokenSaver not saving"
Response:
```
TokenSaver may not show savings if:
- Conversation is too short (< 3000 tokens) - compression starts at 3K
- Currently in quality priority mode - switch with /tokensave
- First-time queries - cache builds up over time
- Code-heavy conversation - code blocks are never compressed
Try starting a longer technical discussion to see significant savings!
```
### Best Practices
User says: "TokenSaver best practices" / "How to use TokenSaver effectively?" / "TokenSaver tips"
Response:
```
TokenSaver Best Practices:
- For Debugging/Code Review:
  Use /tokenquality mode to keep full context
- For Research/Exploration:
  Use /tokensave mode for maximum savings (up to 80%)
- For General Use:
  Balanced mode (default) works best
- When switching major topics, start a fresh chat for better compression
Quick commands:
- /tokens - Quick status check
- /tokenreport - Detailed analytics
```
## Slash Commands
For direct command access:
### /tokens
Show current status and statistics
```
Session Savings: 65%
Original Tokens: 12,450
Optimized Tokens: 4,358
Cache Hits: 3
```
### /tokensave
Enable aggressive save mode
- Maximum compression
- Best for very long technical discussions
- Slight quality trade-off possible
### /tokenbalance
Balanced mode (default)
- Good savings with quality preserved
- Recommended for most use cases
### /tokenquality
Quality priority mode
- Minimal compression
- Maximum context retention
- Use when precision is critical
### /tokenreport
Generate detailed usage report
```
Total Tokens Saved: 8,092
Estimated Cost Saved: $0.016
Compressions Applied: 12
Cache Hits: 5
```
### /tokencache clear
Clear all cached responses
### /tokenoff
Temporarily disable optimization
## Usage Examples
### Example 1: Long coding session
```
User: [20 rounds of Python discussion]
TokenSaver: Optimized 15K → 4.5K tokens (70% saved)
```
### Example 2: Repeated questions
```
User: "How do I write to a file in Python?"
User: "Python file write method?"
TokenSaver: L2 cache hit - instant response, 0 tokens used
```
### Example 3: Topic switching
```
User: Switching from discussing Python to JavaScript...
TokenSaver: "Detected topic change. Start new chat to keep context clean?"
[Yes] [No]
```
## Safety Features
TokenSaver never compresses:
- Code blocks (always kept intact)
- Error messages and stack traces
- User-marked important messages
- Heavily cross-referenced messages
Quality Guard:
- Auto-rollback if quality drops > 15%
- One-click restore to uncompressed version
- Snapshots for every compression
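The Quality Guard behavior above (snapshot, score, rollback) can be sketched as follows. The quality scorer itself is an assumption; any metric returning a value in [0, 1] would fit:

```python
def guarded_compress(context, compress, score, max_drop=0.15):
    """Compress `context`, restoring the snapshot if quality drops > max_drop."""
    snapshot = list(context)        # a snapshot is taken for every compression
    before = score(context)
    compressed = compress(context)
    if before - score(compressed) > max_drop:
        return snapshot             # auto-rollback: quality dropped too far
    return compressed
```

Because the snapshot is taken unconditionally, a one-click restore is always possible regardless of whether the rollback fired automatically.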
## Configuration
Default configuration:
```json
{
"mode": "adaptive",
"compression": "balanced",
"cache": true,
"qualityThreshold": 0.85
}
```
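Loading and sanity-checking this configuration might look like the following sketch. The set of accepted mode values is inferred from the modes described in this document, not from a published schema:

```python
import json

DEFAULTS = {"mode": "adaptive", "compression": "balanced",
            "cache": True, "qualityThreshold": 0.85}

def load_config(text):
    """Merge user-supplied JSON over the defaults and sanity-check it."""
    cfg = {**DEFAULTS, **json.loads(text)}
    if cfg["mode"] not in {"adaptive", "aggressive", "balanced", "quality"}:
        raise ValueError(f"unknown mode: {cfg['mode']}")
    if not 0.0 <= cfg["qualityThreshold"] <= 1.0:
        raise ValueError("qualityThreshold must be between 0 and 1")
    return cfg
```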
## Expected Results
| Conversation Type | Tokens Saved | Quality Impact |
|-------------------|--------------|----------------|
| Technical discussion (50 rounds) | 70% | Minimal |
| Code review | 80% | None |
| Casual chat | 75% | None |
| Quick Q&A | 30-50% | None |
## Limitations
- Requires conversation to exceed 3K tokens before compression starts
- First-time queries cannot be cached
- Very short conversations (< 10 messages) see minimal benefit
- Code-heavy conversations see smaller direct savings, since code blocks are never compressed; smart referencing helps most there
## Related Skills
- shieldclaw: For security scanning
- browser_visible: For web browsing
- file_reader: For reading local files