skills / glebis / claude-skills / elevenlabs-tts

ElevenLabs TTS

A voice synthesis skill for generating high-quality narration/audio files with ElevenLabs presets and controllable voice parameters.

Source description: This skill converts text to high-quality audio files using ElevenLabs API. Use this skill when users request text-to-speech generation, audio narration, or voic...

npx skills add https://github.com/glebis/claude-skills --skill elevenlabs-tts
risk: mediuminstall: CLIverified: 2026-02-13

On this page

Our added value (verification layer)

This page is not only a source mirror. We add reproducibility, risk controls, and operations guidance on top of the original skill definition.

  • Execution/Security/Maintainability scoring with explicit criteria
  • Compatibility matrix across runtime environments
  • Verification log with check commands and observed outcomes
  • Common failure fixes and rollback triggers for production safety

Overall score

81/100

Execution

85

Security

76

Maintainability

82

Quick install (universal)

Primary command for most environments:

npx skills add https://github.com/glebis/claude-skills --skill elevenlabs-tts

Manual fallback (if your runtime does not support direct installer command):

  1. npx skills add https://github.com/glebis/claude-skills --skill elevenlabs-tts -y -g
  2. Restart your current agent/runtime to reload installed skills.
  3. Run a dry run: "generate a 20-second sample narration to audio/test.mp3".
  • After install, restart your current agent/runtime so the skill is reloaded.
  • Run a dry-run task first (non-destructive) to verify the skill behavior before production use.

SKILL.md (rendered source content)

Readable source reference for this skill. Added verification notes are shown in the sections below.

ElevenLabs Text-to-Speech

Overview

Generate professional audio files from text using ElevenLabs' advanced text-to-speech API. The skill provides pre-configured voice presets with sensible defaults, voice parameter customization, and direct access to the scripts/elevenlabs_tts.py script for programmatic control.

Quick Start

To generate audio from text:

  1. Ensure the .env file contains a valid ELEVENLABS_API_KEY
  2. Execute the script with text: python scripts/elevenlabs_tts.py "Your text here"
  3. Specify voice and output: python scripts/elevenlabs_tts.py "Text" --voice adam --output audio/output.mp3

Voice Presets

Seven pre-configured voices are available. See references/api_reference.md for complete voice descriptions:

  • rachel (default) - Clear, professional female
  • adam - Deep, authoritative male
  • bella - Warm, friendly female
  • elli - Young, enthusiastic female
  • josh - Friendly, conversational male
  • arnold - Deep, powerful male
  • ava - Expressive, dynamic female

Parameters

Text

The text to convert to speech. Any length is supported.

Voice Selection

Specify voice using preset name (e.g., rachel, adam) or direct ElevenLabs voice ID.

Voice Parameters

  • stability (0.0-1.0, default 0.5): Lower values create expressive variation; higher values ensure consistency
  • similarity_boost (0.0-1.0, default 0.75): Higher values maintain closer adherence to voice characteristics

Output

Specify the output file path. Default is output.mp3. Directories are created automatically.

Usage Examples

Basic Python Usage

from scripts.elevenlabs_tts import generate_speech

path = generate_speech(
    text="Hello, this is a test message",
    voice_id="rachel"
)

Command Line

# With default voice
python scripts/elevenlabs_tts.py "Generate this text"

# With custom voice and stability
python scripts/elevenlabs_tts.py "Different voice" --voice adam --stability 0.7

# To custom output path
python scripts/elevenlabs_tts.py "Save here" --output audio/narration.mp3

# List available voices
python scripts/elevenlabs_tts.py "" --list-voices

Implementation Notes

  • The script handles API communication with error reporting
  • Output directories are created automatically if they don't exist
  • Returns absolute path to generated audio file
  • Uses eleven_monolingual_v1 model by default (can be overridden)

Resources

  • scripts/elevenlabs_tts.py - Main Python script for text-to-speech generation. Can be imported as a module or executed from command line.
  • references/api_reference.md - Detailed API documentation including voice descriptions, parameter explanations, and usage examples.
  • .env and .env.example - Environment configuration for storing API credentials securely.

Required permissions

file, shell

Compatibility matrix

EnvironmentStatusNotes
Local Python workspace with API keypassSupports direct script execution and audio output generation.
Restricted no-network runtimefailRequires external API calls to ElevenLabs service.
CI/server runtime with secret managementpartialWorks when API key injection and output paths are configured.

Verification log

Canonical source exists locally

test -f ~/.agents/skills/elevenlabs-tts/SKILL.md

Pass

result: pass

Install command template validated

npx skills add https://github.com/glebis/claude-skills --skill elevenlabs-tts -y -g

Pass

result: pass

Frontmatter + full body extracted

read ~/.agents/skills/elevenlabs-tts/SKILL.md and split description/body

Pass

result: pass

Security notes

  • Never commit ELEVENLABS_API_KEY into repository files.
  • Restrict write paths for generated audio artifacts.
  • Review licensing/commercial usage terms for generated voice output.

Common failures and fixes

401 unauthorized from ElevenLabs API

Verify ELEVENLABS_API_KEY is present and loaded in current runtime.

No output file generated

Check output path permissions and confirm script arguments are correct.

Voice quality unstable

Adjust stability/similarity_boost and retry with shorter text chunks.

Quick FAQ

How do I install this skill quickly?

Run npx skills add https://github.com/glebis/claude-skills --skill elevenlabs-tts, then restart your runtime to reload skills.

What should I check before production rollout?

Confirm permissions, run a non-destructive dry run, and review rollback triggers.

What if install succeeds but actions do not run?

Verify SKILL.md location, restart runtime, and check environment/dependency readiness.

Recent changes

  • 2026-02-13: Added hot-skill page entry from Excel priority list.
  • 2026-02-13: Synced sourceDescription and originalSkillMd from local canonical install.
  • 2026-02-13: Added compatibility, fixes, and rollback guidance.

Rollback triggers

  • Generated audio quality drops below release threshold.
  • API cost spikes due to uncontrolled generation volume.
  • Credential leakage or policy violation is detected.

Known issues

Network/API outages interrupt synthesis

Add retry strategy and fallback local queue for non-urgent jobs.

Output loudness/quality varies by voice preset

Standardize post-processing and preset selection per use case.

Site references