Text-to-speech generation
Generate speech from text with a large voice library. The home and voices pages describe 2,000+ AI voices across 130 languages, with plan pages showing default and Pro voice libraries.
Voicemaker is a browser-based text-to-speech platform that converts written text into synthetic speech. The site presents it as a tool for generating audio in multiple formats and for adjusting how the output sounds before download.
Across the home and pricing pages, Voicemaker positions itself around a large voice library, multilingual output, and controls for timing and delivery. The product also extends beyond basic TTS with tools for pronunciation editing, speech-to-speech, voice cloning, studio-style project work, and plan-based options for teams and higher-volume users.
Generate speech from text with a large voice library. The home and voices pages describe 2,000+ AI voices across 130 languages, with plan pages showing default and Pro voice libraries.
Adjust delivery with pause, speed, pitch, volume, emphasis, and voice-effect controls. The site also exposes pronunciation editing and metadata tag options in the web app.
Export audio in common file formats. The product pages mention MP3 and WAV downloads, and the API page also lists OGG, AAC, and OPUS output options.
Work with longer or more structured projects using the studio tools. The pricing page describes VoxStudio, Projects, background music mixing, subtitle export, file history, and cloud storage.
Use advanced voice workflows such as speech-to-speech, voice cloning, and subtitle generation. These capabilities are shown on the pricing and home pages, though availability varies by plan and voice type.
Support team and organization use with Business features such as Enterprise SSO, Team Workspace, seat management, and usage monitoring.
Turn scripts, notes, or short-form copy into spoken audio for social clips, presentations, or other publishable assets. The home page explicitly mentions YouTube Shorts, videos, and presentations as target outputs.
Use pronunciation, pause, pitch, speed, and voice-effect controls to shape delivery for narration, customer-facing audio, or branded voice work. This is useful when natural pacing matters more than raw conversion.
Build longer audio projects with VoxStudio, projects, background music, subtitle export, and cloud storage. These tools are aimed at users managing multiple audio elements in one workflow.
Use the Business plan for multi-seat workflows that need SSO, role-based access, file sharing, and usage monitoring. The pricing page frames this tier for teams and businesses scaling content production.
Convert text to speech through the API when the goal is to automate generation inside another product or pipeline. The pricing page describes a developer API with customizable speech controls and RESTful access.
You can start with the free plan by registering an account. The pricing page also shows paid Starter, Premium, and Business plans for users who need higher limits and additional features.
The source shows downloadable audio in MP3 and WAV formats on the home page, and the API/pricing pages also mention OGG, AAC, and OPUS for supported workflows.
Voicemaker includes controls for pauses, pronunciation, speed, pitch, volume, and voice effects in the web app and API. Some controls are limited to certain voice types or paid plans.
The pricing page shows support for individual, premium, and business usage, including cloud storage, file history, team workspace, SSO, and broadcasting rights on higher plans. It also states that commercial rights are included on paid plans, while broadcast rights are separate.
The refund policy says subscriptions are managed through auto-renewal and that cancellations or refunds are generally not offered, except in cases approved by support.
Vocloner is a web-based AI voice cloning tool that lets users create a custom voice from an audio sample and generate speech with it. The site highlights multilingual output, inline emotion tags, and a free tier with usage limits.
Voice Generator is a free browser-based text-to-speech web app that turns typed text into spoken audio. It lets you play or download the result, with pitch and speed controls and offline use when compatible voices are installed.
Free Text to Speech Online is a web-based AI voice generator that converts text into speech, supports many languages, and lets users preview and download audio. It offers a free plan, paid tiers for heavier use, and email support.
PixMagic is a browser-based AI studio for image generation, editing, upscaling, background removal, image-to-video, and text-to-speech. It uses one-time credit recharges instead of a subscription and offers 10 trial credits for new users.
TTSReader is a browser-based text-to-speech tool for reading text, documents, PDFs, ebooks, and webpages aloud. It also supports MP3 export, multilingual voices, and pronunciation controls for offline listening, sharing, and publishing workflows.
NaturalReader is an AI text-to-speech product for reading documents, PDFs, websites, and books aloud with natural-sounding voices. It also offers school management features, mobile apps, a Chrome extension, and a separate commercial voice generator.