What is text to speech?

Text to speech (TTS) is technology that converts written text into natural-sounding spoken audio. It’s also known as computer-generated speech, speech synthesis, or “read aloud” technology. AI voice generation can be used to enhance accessibility, engagement and efficiency in a wide range of applications, from educational tools to virtual assistants.

Generate Speech is an AI text to speech feature in Adobe Firefly that lets you create human-sounding voiceovers in 20+ languages. You can use the tool anywhere to elevate your assets, with control over pacing, tone and emotion.

Firefly features you may also like.

Generate Video

Generate video clips just from an idea. Choose from a range of resolutions and aspect ratios to meet your creative needs.

Generate Soundtrack

Generate Soundtrack analyses your video to match your story and compose customised, emotionally rich music for every platform. Go from ideas to tracks instantly with Firefly’s AI music generator — licensed to use anywhere.

Generate Sound Effects

Imagine any sound effect and create it with Generate Sound Effects. Describe the effect, upload reference audio or act it out into your mic — then easily add your high-quality effect to any video.

Avatar Generator

Create a studio-grade video featuring an engaging, life-like avatar with Text to Avatar. It’s fast, easy and always safe for commercial use. Perfect for business, education or social media content.
