AudioCraft: AI-Powered Audio Generation

Transform text and documents into natural-sounding audio with customizable AI voices and styles!

Overview

AudioCraft is a versatile AI tool that allows you to generate audio content in two primary ways: creating dialogue-based audio from text input (Studio mode) or narrating content from PDFs or entered text (Narration mode). With a selection of AI voices and synthesis styles, AudioCraft is ideal for creating podcasts, voiceovers, audiobooks, and more.

Studio (dialogue to audio)

Generate audio featuring dialogue between two speakers using distinct AI voices.

Narration (PDF/Data to audio)

Convert PDF documents or text data into narrated audio.

Multiple AI Voices

Choose from a variety of voices with different accents, genders, and ages.

Synthesis Styles

Apply different narration styles like ‘podcast’, ‘executive-briefing’, etc.

How to Use AudioCraft

AudioCraft offers two main modes, accessible via the tabs at the top of the interface.

Studio (dialogue to audio) Mode

This mode is designed for creating audio that simulates a conversation between two speakers.

Select Studio Tab

Click on the Studio (dialogue to audio) tab.

Enter Your Dialogue

Input the text for your dialogue in the main text area.

Optimize Text (Optional)

Use the Optimize, News Caster, Podcast, or Analyst buttons to automatically format or stylize your input text for different purposes.

Set Speaker Names

Enter names for Speaker 1 Name and Speaker 2 Name. These names are used to identify which speaker says which lines in your input text (e.g., “Speaker 1: Hello”, “Speaker 2: Hi”).

Choose Voices

Select a Voice 1 and Voice 2 from the dropdown menus. Choose voices that fit your desired speakers.

Enter Voice Prompts (Optional)

Provide specific instructions or prompts for Prompt for Voice 1 and Prompt for Voice 2 to influence the tone or delivery style of each voice.

Convert to Speech

Click the Convert to Speech button. The AI will process your text and generate audio featuring the dialogue between the two selected voices.

Narration (PDF/Data to audio) Mode

This mode is used to convert longer text content from a PDF file or directly entered text into narrated audio.

Select Narration Tab

Click on the Narration (PDF/Data to audio) tab.

Choose Input Type

Select your Input Type:

Upload PDF: Click to upload a PDF file.
Enter Data: Switch to this tab and enter text directly into the provided text area.

Choose Voices

Select a Voice 1 and Voice 2 from the dropdown menus. These voices will be used for the narration.

Select Synthesis Style

Choose a Synthesis Style from the dropdown menu. Options include:

podcast
executive-briefing
childrens-story
debate

Upload and Convert

Click the Upload and Convert button. The AI will process your PDF or text data and generate narrated audio using the selected voices and style.

Input Parameters and Options

AudioCraft provides various inputs and options depending on the selected mode:

Common Parameters:

Text Input

string

required

(Studio & Narration - Enter Data) The text content to be converted to audio.

Voice 1 / Voice 2

Enum

required

Selectable AI voices with different characteristics (e.g., Angelo (US, male, Young), Arsenio (US African American, male, Middle)).

Studio Mode Specific:

Speaker 1 Name / Speaker 2 Name

string

required

Names used to identify speaker turns in the input text.

Prompt for Voice 1 / Prompt for Voice 2

string

Text prompts to influence the delivery style of each voice.

Optimize / News Caster / Podcast / Analyst Buttons

button

Buttons to automatically format or optimize the input text.

Narration Mode Specific:

Input Type

Enum

required

Choose between Upload PDF and Enter Data.

Upload PDF

file

(Input Type: Upload PDF) Upload a PDF file for narration.

Synthesis Style

Enum

required

Select the overall style for the narration (e.g., podcast, executive-briefing).

Generated Audio History / Conversion History

AudioCraft keeps a history of your generated audio clips or conversions.

View History

Playback

Download

Clear All

Credits

Generating audio with AudioCraft has a credit requirement.

Credit Requirement

Final Cost

Your current credit balance is displayed at the top left of the interface. Click the Buy More button to purchase additional credits if needed.

Tips for Best Results

Clear Text Input

Provide clear, well-formatted text. For Studio mode, ensure speaker turns are clearly marked (e.g., “Speaker 1: …”).

Choose Appropriate Voices

Select voices that match the tone and character of your content. Experiment with different voice combinations.

Utilize Synthesis Styles (Narration)

Choose a synthesis style that fits the type of content you are narrating (e.g., ‘podcast’ for conversational content, ‘executive-briefing’ for formal reports).

Optimize Text (Studio)

Use the text optimization buttons to refine your dialogue input for better AI interpretation.

Troubleshooting

If you encounter issues with AudioCraft, consider these solutions:

Audio Doesn't Sound Natural

Conversion Fails

Incorrect Speaker Turns (Studio)

Credit Issues

Conclusion

AudioCraft provides a powerful and flexible platform for converting text and documents into high-quality audio using AI voices. With its distinct modes and customization options, you can easily create dialogue or narration for a wide range of applications.

Get Started with AI Tutor

AI Tutor

Pixio

Account

Machine

Pixio Model Info

Pixio API Endpoint

AI Tutor RAG API Endpoint

AI Tutor API Endpoint

AudioCraft

AudioCraft: AI-Powered Audio Generation

Overview

Studio (dialogue to audio)

Narration (PDF/Data to audio)

Multiple AI Voices

Synthesis Styles

How to Use AudioCraft

Studio (dialogue to audio) Mode

Narration (PDF/Data to audio) Mode

Input Parameters and Options

Common Parameters:

Studio Mode Specific:

Narration Mode Specific:

Generated Audio History / Conversion History

Credits

Tips for Best Results

Clear Text Input

Choose Appropriate Voices

Utilize Synthesis Styles (Narration)

Optimize Text (Studio)

Troubleshooting

Conclusion

Get Started with AI Tutor

AI Tutor

Pixio

Account

Machine

Pixio Model Info

Pixio API Endpoint

AI Tutor RAG API Endpoint

AI Tutor API Endpoint

​AudioCraft: AI-Powered Audio Generation

​Overview

Studio (dialogue to audio)

Narration (PDF/Data to audio)

Multiple AI Voices

Synthesis Styles

​How to Use AudioCraft

​Studio (dialogue to audio) Mode

​Narration (PDF/Data to audio) Mode

​Input Parameters and Options

​Common Parameters:

​Studio Mode Specific:

​Narration Mode Specific:

​Generated Audio History / Conversion History

​Credits

​Tips for Best Results

Clear Text Input

Choose Appropriate Voices

Utilize Synthesis Styles (Narration)

Optimize Text (Studio)

​Troubleshooting

​Conclusion

AudioCraft: AI-Powered Audio Generation

Overview

How to Use AudioCraft

Studio (dialogue to audio) Mode

Narration (PDF/Data to audio) Mode

Input Parameters and Options

Common Parameters:

Studio Mode Specific:

Narration Mode Specific:

Generated Audio History / Conversion History

Credits

Tips for Best Results

Troubleshooting

Conclusion