Characteristic Voice: Skill Tool

v0.1.1

Use this skill whenever the user wants speech to sound more human, companion-like, or emotionally expressive. Triggers include: any mention of 'say li...

by @ksuriuri (kusuriuri) · MIT-0
Last updated: 2026/3/18
Security scan
VirusTotal: benign
OpenClaw: safe (high confidence)
The skill's code, instructions, and required credentials are consistent with its stated purpose (expressive TTS); nothing indicates hidden or unrelated behavior, but note the skill will upload text and any chosen reference audio to noiz.ai when using the Noiz backend and saves an API key to ~/.noiz_api_key.
Assessment advice
This skill appears to do what it says: expressive TTS with optional voice cloning. Before installing, decide whether you want any audio/text to leave your machine. If you use the Noiz backend, the script will send text and any reference audio to https://noiz.ai/v1 and will save a normalized API key to ~/.noiz_api_key (file mode 600). If you must keep everything local, use the Kokoro backend. Be careful about sourcing reference audio from copyrighted or private material — the skill documents this...
Detailed analysis
Purpose and capability
The skill implements expressive TTS and optionally voice cloning via the Noiz API or a local Kokoro backend. Required tools (curl, python3) and optional tooling (yt-dlp, ffmpeg) match the documented features. Minor metadata mismatch: the registry lists no required env vars, but SKILL.md and the script require a NOIZ_API_KEY when using the Noiz backend.
Instruction scope
SKILL.md and the script limit actions to generating TTS, optionally uploading user-provided or downloaded reference audio to Noiz, and using local Kokoro when requested. The SKILL.md explicitly warns about copyright/privacy when sourcing reference audio. There are no instructions to read arbitrary system files or to exfiltrate unrelated data.
Install mechanism
No install spec is provided (instruction-only with an included script). The script itself uses standard system tools only; nothing is downloaded or executed from unknown URLs by the skill itself.
Credential requirements
The only credential used is NOIZ_API_KEY (optional if you use the Noiz backend), which is proportional to the skill's external API use. The script will normalize and save the API key to ~/.noiz_api_key (mode 600) for convenience — this persistent storage is reasonable but should be understood by the user. Registry metadata not listing this env var is an inconsistency to be aware of.
Persistence and privileges
The skill does not request elevated privileges nor set always:true. Its only persistent action is writing the API key file in the user's home directory; it does not modify other skills or system-wide agent settings.
Security is layered; review the code before running it.

License

MIT-0

Free to use, modify, and redistribute; no attribution required.

Runtime dependencies

No special dependencies

Versions

latest · v0.1.1 · 2026/3/10

characteristic-voice 0.1.1 changelog:

  • Added detailed credential and API key setup instructions for the Noiz backend, including local key storage.
  • Listed all runtime prerequisites and external tools required, clarifying provisioning responsibility.
  • Introduced a privacy and data transmission policy section, distinguishing between the Noiz and Kokoro backends for online/offline use.
  • Expanded and clarified instructions for preparing and using reference audio for voice cloning, including legal and privacy considerations.
  • No code changes; documentation greatly expanded for clarity and responsible use.


Install command

Official: npx clawhub@latest install noizai-characteristic-voice
Mirror: npx clawhub@latest install noizai-characteristic-voice --registry https://cn.longxiaskill.com

Skill documentation

Make your AI agent sound like a real companion — one who sighs, laughs, hesitates, and speaks with genuine feeling.

Credentials

| Variable | Required | Description |
| --- | --- | --- |
| NOIZ_API_KEY | Yes, if using the Noiz backend | API key from developers.noiz.ai. Not needed if using the local Kokoro backend. |

The script saves a normalised copy of the key to ~/.noiz_api_key (mode 600) for convenience. To set it:

bash skills/characteristic-voice/scripts/speak.sh config --set-api-key YOUR_KEY
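To confirm that the saved key file is readable only by you, a check along these lines may help. This is a minimal sketch: `key_file_is_private` is a hypothetical helper, not part of speak.sh, and it only assumes the file path and mode 600 stated above.

```shell
# Hypothetical helper (not part of speak.sh): succeed only if the given
# file exists and its permission string is exactly rw------- (mode 600).
key_file_is_private() {
  [ -f "$1" ] && [ "$(ls -ld "$1" | cut -c1-10)" = "-rw-------" ]
}

# Example use after setting the key:
#   key_file_is_private "$HOME/.noiz_api_key" || chmod 600 "$HOME/.noiz_api_key"
```

The helper reads only the permission string, never the key itself, so it is safe to run in shared logs.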

Prerequisites

The included speak.sh script requires curl and python3 at runtime. Depending on which backend and features you use, you may also need:

| Tool | When needed | Install hint |
| --- | --- | --- |
| curl, python3 | Always (core script) | Usually pre-installed |
| kokoro-tts | Kokoro (local/offline) backend | uv tool install kokoro-tts |
| yt-dlp | Downloading reference audio for voice cloning | github.com/yt-dlp/yt-dlp |
| ffmpeg | Trimming reference audio clips | ffmpeg.org |
| rg (ripgrep) | Searching subtitle files | github.com/BurntSushi/ripgrep |

None of these are installed by the skill itself; provision them manually in your environment.
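Since nothing is installed for you, it can be useful to check which tools are present before the first run. The sketch below uses the standard `command -v` lookup; `missing_tools` is a hypothetical helper name, not something shipped with the skill.

```shell
# Hypothetical helper: print the name of each listed tool that is
# not found on PATH, one per line. Prints nothing if all are present.
missing_tools() {
  for mt_tool in "$@"; do
    command -v "$mt_tool" >/dev/null 2>&1 || printf '%s\n' "$mt_tool"
  done
}

# Example: check the core tools plus the optional ones you plan to use.
#   missing_tools curl python3 kokoro-tts yt-dlp ffmpeg rg
```

Pass only the tools for the features you intend to use; for example, yt-dlp, ffmpeg, and rg matter only when preparing reference audio.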

Privacy & Data Transmission

  • Noiz backend: When using the Noiz backend, the text you speak and any reference audio you provide are sent to https://noiz.ai/v1. If you supply --ref-audio, that audio file is uploaded for voice cloning.
  • Kokoro backend: Runs entirely locally — no data leaves your machine.
  • Choose the Kokoro backend (--backend kokoro) if you want fully offline processing.
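The backend choice above could be scripted roughly as follows. This is a sketch under stated assumptions: `pick_backend` is a hypothetical helper, and it treats a key as available when either the NOIZ_API_KEY variable is non-empty or the saved key file exists and is non-empty. It always prefers Kokoro when offline processing is requested.

```shell
# Hypothetical helper: print the value to pass to --backend.
#   $1  "yes" to force fully local processing (default: "no")
#   $2  path of the saved key file (default: ~/.noiz_api_key)
pick_backend() {
  pb_offline="${1:-no}"
  pb_key_file="${2:-$HOME/.noiz_api_key}"
  if [ "$pb_offline" = "yes" ]; then
    echo kokoro          # nothing leaves the machine
  elif [ -n "${NOIZ_API_KEY:-}" ] || [ -s "$pb_key_file" ]; then
    echo noiz            # text and any reference audio go to noiz.ai
  else
    echo kokoro          # no key available, fall back to local
  fi
}

# Example:
#   bash skills/characteristic-voice/scripts/speak.sh \
#     --backend "$(pick_backend yes)" -t "Good morning~" -o morning.wav
```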

Triggers

  • say like
  • talk like
  • speak like
  • companion voice
  • comfort me
  • cheer me up
  • sound more human

The Two Tricks

  • Non-lexical fillers — sprinkle in little human noises (hmm, haha, aww, heh) at natural pause points to make speech feel alive
  • Emotion tuning — adjust warmth, joy, sadness, tenderness to match the moment

Filler Sounds Palette

| Sound | Feeling | Use for |
| --- | --- | --- |
| hmm... | Thinking, gentle acknowledgment | Comfort, pondering |
| ah... | Realization, soft surprise | Discoveries, transitions |
| uh... | Hesitation, empathy | Careful moments |
| heh / hehe | Playful, mischievous | Teasing, light moments |
| haha | Laughter | Joy, humor |
| aww | Tenderness, sympathy | Deep comfort |
| oh? / oh! | Surprise, attention | Reacting to news |
| pfft | Stifled laugh | Playful disbelief |
| whew | Relief | After tension |
| ~ (tilde) | Drawn out, melodic ending | Warmth, playfulness |

Rules: use at most 2–4 fillers per short message. Place them at natural pauses, such as sentence starts and thought shifts. Use ... after a filler for a beat of silence, and ~ at word endings for warmth.
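A quick way to sanity-check a draft against the filler rule is to count the palette words it contains. The sketch below is illustrative only: `count_fillers` and `fillers_ok` are hypothetical helpers, the word list is copied from the palette (the tilde is not counted), and reading the rule as "no more than 4" is an assumption.

```shell
# Hypothetical helper: count whole-word, case-insensitive matches of the
# palette fillers in the text passed as $1.
count_fillers() {
  printf '%s\n' "$1" \
    | { grep -oiwE 'hmm|ah|uh|hehe|heh|haha|aww|oh|pfft|whew' || true; } \
    | wc -l | tr -d ' '
}

# Hypothetical helper: succeed when the text has at most 4 fillers
# (assumed upper bound of the "2–4 per short message" rule).
fillers_ok() {
  [ "$(count_fillers "$1")" -le 4 ]
}

# Example:
#   fillers_ok "Hmm... rest well~ Sweet dreams." && echo "within the limit"
```

A word count cannot tell whether a filler sits at a natural pause, so this only catches overuse, not poor placement.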

Presets

Good Night

Gentle, warm, slightly sleepy. Slow pace.

Good Morning

Warm, cheerful but not overwhelming.

Comfort

Soft, understanding, unhurried. Give space. Don't rush to "fix" things.

Celebration

Excited, proud, genuinely happy.

Just Chatting

Relaxed, playful, natural.

Using a Character's Voice

When a user says something like "speak in Hermione's voice" or "sound like Tony Stark", first check whether a reference audio file already exists in skills/characteristic-voice/. If one does, use it directly with --ref-audio.

If no reference audio exists, you can create one — but read the warnings below first.

Preparing reference audio (one-time setup)

You need a short (10–30 s) WAV clip of the target voice. Possible sources:

  • User-provided audio — the safest option. Ask the user to supply their own recording.
  • Public-domain / CC-licensed clips — search for freely licensed material.
  • Extracting from online video — tools like yt-dlp and ffmpeg can download and trim audio. Example workflow:
yt-dlp "URL" --write-auto-sub --sub-lang en --skip-download -o tmp/clip
rg -n "target line" tmp/clip.en.vtt
yt-dlp "URL" -x --audio-format wav --download-sections "*00:00:00-00:00:25" -o tmp/clip
ffmpeg -i tmp/clip.wav -ss 00:00:02 -to 00:00:20 skills/characteristic-voice/character.wav
Copyright & privacy warning: Downloading and re-using someone's voice from copyrighted media (movies, TV, YouTube) may violate copyright or personality-rights laws depending on your jurisdiction. Do not upload private voice recordings or material you don't have permission to use. The reference audio is sent to https://noiz.ai/v1 for voice cloning when using the Noiz backend. If this is a concern, consider using the local Kokoro backend instead.
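Before using a clip, you may want to confirm it falls in the 10–30 s range mentioned above. The sketch below checks a duration value with awk; `clip_length_ok` is a hypothetical helper, and the commented ffprobe call (ffprobe ships with ffmpeg) is one way to obtain the duration in seconds.

```shell
# Hypothetical helper: succeed when the duration in seconds ($1, may be
# fractional) is between 10 and 30 inclusive.
clip_length_ok() {
  awk -v d="$1" 'BEGIN { exit !(d >= 10 && d <= 30) }'
}

# Example: read the duration of the trimmed clip with ffprobe, then check it.
#   dur="$(ffprobe -v error -show_entries format=duration \
#            -of default=noprint_wrappers=1:nokey=1 \
#            skills/characteristic-voice/character.wav)"
#   clip_length_ok "$dur" || echo "clip is outside the 10-30 s range"
```

The ffmpeg trim in the example workflow (from 00:00:02 to 00:00:20) yields an 18-second clip, which passes this check.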

Using reference audio

bash skills/characteristic-voice/scripts/speak.sh \
  --preset goodnight -t "Hmm... rest well~ Sweet dreams." \
  --ref-audio skills/characteristic-voice/character.wav -o night.wav

The --ref-audio flag uploads the file to the Noiz backend for voice cloning (requires NOIZ_API_KEY).


Usage

This skill provides speak.sh, a wrapper around the tts skill with companion-friendly presets.

# Use a preset (auto-sets emotion + speed)
bash skills/characteristic-voice/scripts/speak.sh \
  --preset goodnight -t "Hmm... rest well~ Sweet dreams." -o night.wav

# Custom emotion override
bash skills/characteristic-voice/scripts/speak.sh \
  -t "Aww... I'm right here." --emo '{"Tenderness":0.9}' --speed 0.75 -o comfort.wav

# With specific backend and voice
bash skills/characteristic-voice/scripts/speak.sh \
  --preset morning -t "Good morning~" --voice-id voice_abc --backend noiz -o morning.mp3 --format mp3

Run bash skills/characteristic-voice/scripts/speak.sh --help for all options.

Writing Guide for the Agent

  • Start soft — lead with a filler ("hmm...", "oh~"), not content
  • Mirror energy — gentle when they're low, match when they're high
  • Keep it brief — 1–3 sentences, like a voice message from a friend
  • End warmly — close with connection ("I'm here", "see you tomorrow~")
  • Don't lecture — listen and stay present; no unsolicited advice
Data source: ClawHub · Chinese localization: 龙虾技能库