Audio Control

TTS Wrapper provides comprehensive audio playback control features, allowing you to manage speech synthesis playback in real-time.

Basic Playback Controls

Starting Playback

# Basic playback
tts.speak("Hello, world!")

# Playback with streaming
tts.speak_streamed("Hello, world!")

Pause and Resume

# Start speaking
tts.speak("This is a long sentence that we will pause in the middle.")

# Pause after 1 second
time.sleep(1)
tts.pause()

# Resume after another second
time.sleep(1)
tts.resume()

Stop Playback

# Stop playback immediately
tts.stop()

Audio Properties

Volume Control

# Set volume (0-100)
tts.set_property("volume", "75")

# Get current volume
current_volume = tts.get_property("volume")

Speech Rate

# Set speech rate
tts.set_property("rate", "fast")  # x-slow, slow, medium, fast, x-fast
tts.set_property("rate", "150")   # Percentage (100 is normal)

Pitch Control

# Set pitch
tts.set_property("pitch", "high")  # x-low, low, medium, high, x-high

Audio Device Selection

# List available audio devices
import sounddevice as sd
devices = sd.query_devices()
print(devices)

# Set output device by ID
tts.set_output_device(device_id=1)

File Output

Saving to File

# Save as WAV file
tts.synth_to_file("Hello world", "output.wav")

# Save as MP3
tts.synth_to_file("Hello world", "output.mp3", "mp3")

Streaming with File Output

# Stream audio and save to file simultaneously
tts.speak_streamed("Hello world", save_to_file_path="output.wav")

Advanced Audio Control

Timed Playback

# Play for specific duration
tts.play(duration=2.0)  # Play for 2 seconds

# Pause for specific duration
tts.pause(duration=1.0)  # Pause for 1 second

Audio Buffer Management

# Load audio into buffer without playing
audio_bytes = tts.synth_to_bytes("Hello world")
tts.load_audio(audio_bytes)

# Play when ready
tts.play()

Stream Setup

# Configure audio stream
tts.setup_stream(
    samplerate=44100,
    channels=1,
    dtype="int16"
)

Event Handling

Playback Events

def on_start():
    print("Audio started playing")

def on_end():
    print("Audio finished playing")

# Connect event handlers
tts.connect("onStart", on_start)
tts.connect("onEnd", on_end)

Cleanup

# Clean up resources when done
tts.cleanup()

Best Practices

Resource Management
- Always call cleanup() when done
- Use context managers where possible
- Release audio resources explicitly

Error Handling

try:
    tts.speak("Hello world")
except Exception as e:
    print(f"Playback error: {e}")
finally:
    tts.cleanup()

Performance
- Use streaming for long text
- Pre-synthesize repeated phrases
- Monitor memory usage with large audio files
Device Compatibility
- Check device availability before setting
- Handle device errors gracefully
- Test across different platforms

Example: Complete Audio Control

Here's a comprehensive example combining various audio control features:

from tts_wrapper import PollyClient, PollyTTS
import time

# Initialize TTS
client = PollyClient(credentials=('region', 'key_id', 'access_key'))
tts = PollyTTS(client)

# Set up event handlers
def on_start():
    print("Started speaking")

def on_end():
    print("Finished speaking")

tts.connect("onStart", on_start)
tts.connect("onEnd", on_end)

try:
    # Configure audio properties
    tts.set_property("volume", "80")
    tts.set_property("rate", "medium")
    tts.set_property("pitch", "high")

    # Start speaking
    text = "This is a demonstration of audio control features."
    tts.speak_streamed(text)

    # Wait a moment
    time.sleep(1)

    # Pause playback
    tts.pause()
    print("Paused for 2 seconds...")
    time.sleep(2)

    # Resume playback
    tts.resume()
    print("Resumed playback")

    # Wait for completion
    while tts.isplaying:
        time.sleep(0.1)

except Exception as e:
    print(f"Error during playback: {e}")
finally:
    tts.cleanup()

Next Steps

Learn about streaming capabilities for real-time synthesis
Explore SSML support for fine-grained speech control
Check out callback functionality for speech events

Basic Playback Controls​

Starting Playback​

Pause and Resume​

Stop Playback​

Audio Properties​

Volume Control​

Speech Rate​

Pitch Control​

Audio Device Selection​

File Output​

Saving to File​

Streaming with File Output​

Advanced Audio Control​

Timed Playback​

Audio Buffer Management​

Stream Setup​

Event Handling​

Playback Events​

Cleanup​

Best Practices​

Example: Complete Audio Control​

Next Steps​