AudioCraft: AI-Powered Audio Generation

Transform text and documents into natural-sounding audio with customizable AI voices and styles!

Overview

AudioCraft is a versatile AI tool that allows you to generate audio content in two primary ways: creating dialogue-based audio from text input (Studio mode) or narrating content from PDFs or entered text (Narration mode). With a selection of AI voices and synthesis styles, AudioCraft is ideal for creating podcasts, voiceovers, audiobooks, and more.

Studio (dialogue to audio)

Generate audio featuring dialogue between two speakers using distinct AI voices.

Narration (PDF/Data to audio)

Convert PDF documents or text data into narrated audio.

Multiple AI Voices

Choose from a variety of voices with different accents, genders, and ages.

Synthesis Styles

Apply different narration styles like ‘podcast’, ‘executive-briefing’, etc.

How to Use AudioCraft

AudioCraft offers two main modes, accessible via the tabs at the top of the interface.

Studio (dialogue to audio) Mode

This mode is designed for creating audio that simulates a conversation between two speakers.

Select Studio Tab

Click on the Studio (dialogue to audio) tab.

Enter Your Dialogue

Input the text for your dialogue in the main text area.

Optimize Text (Optional)

Use the Optimize, News Caster, Podcast, or Analyst buttons to automatically format or stylize your input text for different purposes.

Set Speaker Names

Enter names for Speaker 1 Name and Speaker 2 Name. These names are used to identify which speaker says which lines in your input text (e.g., “Speaker 1: Hello”, “Speaker 2: Hi”).

Choose Voices

Select a Voice 1 and Voice 2 from the dropdown menus. Choose voices that fit your desired speakers.

Enter Voice Prompts (Optional)

Provide specific instructions or prompts for Prompt for Voice 1 and Prompt for Voice 2 to influence the tone or delivery style of each voice.

Convert to Speech

Click the Convert to Speech button. The AI will process your text and generate audio featuring the dialogue between the two selected voices.

Narration (PDF/Data to audio) Mode

This mode is used to convert longer text content from a PDF file or directly entered text into narrated audio.

Select Narration Tab

Click on the Narration (PDF/Data to audio) tab.

Choose Input Type

Select your Input Type:

  • Upload PDF: Click to upload a PDF file.
  • Enter Data: Switch to this tab and enter text directly into the provided text area.

Choose Voices

Select a Voice 1 and Voice 2 from the dropdown menus. These voices will be used for the narration.

Select Synthesis Style

Choose a Synthesis Style from the dropdown menu. Options include:

  • podcast
  • executive-briefing
  • childrens-story
  • debate

Upload and Convert

Click the Upload and Convert button. The AI will process your PDF or text data and generate narrated audio using the selected voices and style.

Input Parameters and Options

AudioCraft provides various inputs and options depending on the selected mode:

Common Parameters:

Text Input
string
required

(Studio & Narration - Enter Data) The text content to be converted to audio.

Voice 1 / Voice 2
Enum
required

Selectable AI voices with different characteristics (e.g., Angelo (US, male, Young), Arsenio (US African American, male, Middle)).

Studio Mode Specific:

Speaker 1 Name / Speaker 2 Name
string
required

Names used to identify speaker turns in the input text.

Prompt for Voice 1 / Prompt for Voice 2
string

Text prompts to influence the delivery style of each voice.

Optimize / News Caster / Podcast / Analyst Buttons
button

Buttons to automatically format or optimize the input text.

Narration Mode Specific:

Input Type
Enum
required

Choose between Upload PDF and Enter Data.

Upload PDF
file

(Input Type: Upload PDF) Upload a PDF file for narration.

Synthesis Style
Enum
required

Select the overall style for the narration (e.g., podcast, executive-briefing).

Generated Audio History / Conversion History

AudioCraft keeps a history of your generated audio clips or conversions.

Credits

Generating audio with AudioCraft has a credit requirement.

Your current credit balance is displayed at the top left of the interface. Click the Buy More button to purchase additional credits if needed.

Tips for Best Results

Clear Text Input

Provide clear, well-formatted text. For Studio mode, ensure speaker turns are clearly marked (e.g., “Speaker 1: …”).

Choose Appropriate Voices

Select voices that match the tone and character of your content. Experiment with different voice combinations.

Utilize Synthesis Styles (Narration)

Choose a synthesis style that fits the type of content you are narrating (e.g., ‘podcast’ for conversational content, ‘executive-briefing’ for formal reports).

Optimize Text (Studio)

Use the text optimization buttons to refine your dialogue input for better AI interpretation.

Troubleshooting

If you encounter issues with AudioCraft, consider these solutions:

Conclusion

AudioCraft provides a powerful and flexible platform for converting text and documents into high-quality audio using AI voices. With its distinct modes and customization options, you can easily create dialogue or narration for a wide range of applications.