Character faces look distorted or unclear in Leonardo AI-generated images

Use the Alchemy Upscaler to enhance facial details and correct distortions. If problems persist, generate multiple variations and select the best result. You can also adjust your prompt to specifically request "front-facing view with clear facial features" to reduce angle-related issues.

Animated clips from Runway Gen 2 have excessive or chaotic motion that doesn't match the story

Always reduce the Motion slider to 1 (the lowest setting) for narrative animations. Use Camera Motion controls to add intentional, purposeful movement instead. Generate multiple versions with different settings to find the right balance between static and dynamic.

Voiceover audio in CapCut doesn't sync properly with video clips

Place your voiceover audio on the timeline first as your anchor track. Use the playhead to identify exact breaking points between story segments, then trim video clips to match. Enable zoom controls for more precise frame-by-frame adjustments.

Running out of daily tokens or credits across multiple free accounts

Plan multi-day production: generate images on day one (Leonardo gives 150 daily tokens), animations on day two (Runway credits), and voiceovers across the month (ElevenLabs monthly limit). Alternatively, upgrade to paid tiers for specific tools if you exceed daily limits.

Auto-generated captions in CapCut contain inaccuracies or misheard words

Review captions immediately after generation. Click on individual captions to edit text directly. Clear audio in your voiceover recording produces better caption accuracy—minimize background noise when possible.

Video exports from CapCut are too large or have poor quality

Export at 1080p resolution and 30fps for optimal YouTube balance. If file size is excessive, reduce bitrate settings in the export menu. For lower quality outputs, check that your source images and video clips are sufficiently high resolution.

Published May 7, 2026

How to Create 3D Animated Videos with Free AI Tools in 2026

Name: How to Create 3D Animated Videos with Free AI Tools in 2026
Uploaded: 2026-05-06T21:53:23.392359+00:00
Duration: 180 min
Description: Learn how to create professional 3D animated cartoon videos using free AI tools including ChatGPT, ElevenLabs, Leonardo AI, Runway, and CapCut.

Joshua Kishaba·AI Mastery·Subscribe

180 minintermediatefree

Learn how to create professional 3D animated cartoon videos using free AI tools including ChatGPT, ElevenLabs, Leonardo AI, Runway, and CapCut.

This page may contain affiliate links. We may earn a commission at no extra cost to you. Full disclosure.

Introduction

This tutorial walks you through the complete process of creating professional-quality 3D animated cartoon videos using entirely free AI tools. You'll cover story creation, voiceover generation, image creation, animation, and video editing. By the end, you'll have a finished animated video ready to share on YouTube or other platforms.

The workflow consists of six distinct steps that transform a simple concept into a fully animated story. You'll use ChatGPT for scriptwriting, ElevenLabs for voiceovers, Leonardo AI for image generation, Runway Gen 2 for animation, and CapCut for final editing. Each tool plays a specific role in the production pipeline, and this guide shows you exactly how to configure and use them.

Core Actions

01Generate story script using ChatGPT with specific narrative prompts
02Create voiceover audio with ElevenLabs from your completed story
03Break down story into scene descriptions and generate image prompts in ChatGPT
04Generate and upscale images using Leonardo AI with 16:9 aspect ratio and 3D Animation Style
05Animate each image using Runway Gen 2 with motion slider set to 1 and camera motion controls
06Assemble all clips and audio in CapCut, sync to narration, add transitions and effects, export at 1080p 30fps

Step 01

Generate Your Story Script with ChatGPT

The foundation of any animated video is a compelling narrative.

Navigate to ChatGPT and craft a specific prompt that tells the AI exactly what kind of story you need.

Review the generated story to ensure it fits your vision and has a clear beginning, middle, and end.

The foundation of any animated video is a compelling narrative. Start by creating a short story using ChatGPT that serves as the basis for your entire animation project.

Navigate to ChatGPT and craft a specific prompt that tells the AI exactly what kind of story you need. For this tutorial, we asked ChatGPT to "write a child's short story about a young boy and his bird." The more specific your prompt, the better the results will be.

Review the generated story to ensure it fits your vision and has a clear beginning, middle, and end. The story should be concise enough to translate into a short video, typically 1-3 minutes when narrated. Copy the entire story text once you're satisfied with the output.

Open Google Docs or any note-taking application and paste your story there. This document will serve as your central reference throughout the production process. You'll return to this document multiple times to copy sections for voiceover generation and image prompt creation.

Step 02

Create Professional Voiceovers with ElevenLabs

Transform your written story into spoken narration using ElevenLabs to generate a high-quality AI voiceover that brings emotional depth to your animated story.

Visit ElevenLabs by clicking the link in the video description or navigating directly to their website.

Browse through the available voice options and select one that matches the tone and style of your story.

Transform your written story into spoken narration using ElevenLabs to generate a high-quality AI voiceover that brings emotional depth to your animated story.

Visit ElevenLabs by clicking the link in the video description or navigating directly to their website. Sign up for a free account if you don't already have one. The free tier provides sufficient credits for creating voiceovers for multiple short videos.

Browse through the available voice options and select one that matches the tone and style of your story. For a children's story, choose a warm, gentle voice with clear articulation. Click on different voices to preview them before making your selection.

Copy the full story text from your Google Doc and paste it into the text input field in ElevenLabs. Ensure you've pasted the complete story without any formatting issues. Click the Generate button and wait for the AI to process your text into speech.

Listen to the generated voiceover to ensure quality and pacing. Download the audio file to your computer once satisfied. Name the file clearly (such as "story_voiceover.mp3") so you can easily locate it during the editing phase.

Generate Image Prompts for Each Scene

With your story and narration complete, plan the visual elements by returning to ChatGPT to create detailed image prompts for each paragraph or scene in your story.

Go back to ChatGPT and enter a new prompt: "I want to create a YouTube video from this story so give me image ideas for each paragraph.

Review the image prompts that ChatGPT generates for each paragraph.

With your story and narration complete, plan the visual elements by returning to ChatGPT to create detailed image prompts for each paragraph or scene in your story.

Go back to ChatGPT and enter a new prompt: "I want to create a YouTube video from this story so give me image ideas for each paragraph." Paste your story text after this instruction. This tells ChatGPT to break down your narrative into distinct visual scenes.

Review the image prompts that ChatGPT generates for each paragraph. Each prompt should describe a specific scene, including characters, setting, mood, and action. These prompts will guide the AI image generator to create consistent visuals.

Copy all the image prompts and paste them into your Google Doc below your story text. This keeps everything organized in one place. You'll need to modify these prompts slightly in the next step to work optimally with Leonardo AI.

Before moving forward, replace any character names with generic descriptors. In the example, "Oliver" becomes "boy" and "Skyler" becomes "bird." AI image generators work better with clear, descriptive terms rather than proper names, which they cannot visualize consistently.

Step 04

Generate Images with Leonardo AI

Transform your text prompts into actual images using Leonardo AI, a powerful free image generation platform.

Navigate to Leonardo AI and create a free account using your email address.

Configure your generation settings before creating any images.

Transform your text prompts into actual images using Leonardo AI, a powerful free image generation platform. This step creates all the visual assets you need for your animation.

Navigate to Leonardo AI and create a free account using your email address. The free tier provides daily token credits that are sufficient for generating multiple high-quality images. Locate the Image Generation section from the main dashboard once logged in.

Configure your generation settings before creating any images. From the left sidebar, set the number of images to generate per prompt to four. Enable Prompt Magic V2 for enhanced prompt interpretation.

Open the Advanced Control section and change the aspect ratio to 16:9, which is the standard YouTube video format. Lock this ratio by clicking the lock icon. Increase the guidance scale slider to ensure the AI follows your prompts more closely.

Select the 3D Animation Style model from the model dropdown menu. This ensures all generated images have a consistent cartoon aesthetic suitable for animated storytelling. This setting is crucial for maintaining visual coherence across all scenes.

Copy the first image prompt from your Google Doc and paste it into the main prompt field. Enable Enhance Prompt if available. Paste the same prompt into the negative prompt field prefaced with terms describing what you don't want (like "blurry, distorted, ugly").

Click the Generate button and wait for Leonardo AI to create four variations of your scene. This typically takes 10-30 seconds depending on server load. Review all four generated images carefully.

Select the image that best matches your story's narrative and aesthetic. Look for proper composition, clear character depiction, and appropriate mood. If the character's face appears distorted or unclear, fix this in the next step.

Click on the Alchemy Upscaler option to enhance image quality and fix facial features. Wait a few seconds for the upscaling process to complete. This dramatically improves detail and corrects common AI artifacts around faces and hands.

Download the upscaled image to your computer. Repeat this entire process for each scene prompt in your story. Maintain consistent naming conventions for your files (scene_01.png, scene_02.png, etc.) to stay organized.

Step 05

Animate Your Images with Runway Gen 2

Bring static images to life through motion by using Runway Gen 2 to transform each still image into a short animated video clip.

Visit Runway ML and sign up for a free account.

Upload your first scene image to Runway Gen 2.

Bring static images to life through motion by using Runway Gen 2 to transform each still image into a short animated video clip.

Visit Runway ML and sign up for a free account. Locate and click on Gen 2 from the available tools menu. This tool specializes in converting static images into dynamic video content.

Upload your first scene image to Runway Gen 2. Leave the text prompt field blank at this stage. Click the Generate button to create your first animation attempt.

Review the initial 4-second generated video. You may notice excessive or unrealistic movement that doesn't match your story's tone. Fine-tune the animation settings to achieve better results.

Click on the Motion slider and decrease it to 1 (the lowest setting). This creates subtle, natural movement rather than dramatic scene changes. Lower motion values work better for narrative storytelling where you want controlled, gentle animation.

Click on Camera Motion to add intentional camera movement to your scene. Add slight zoom, pan, or tilt movements that enhance the storytelling. For example, a slow zoom on a character's face emphasizes emotion.

Click Generate again with these new settings. The result should be much more polished and natural-looking. The animation will feel purposeful rather than chaotic.

Download the generated video clip once you're satisfied with the result. Repeat this process for every image you created in Leonardo AI. Maintain consistent settings across similar scenes for visual continuity.

Step 06

Edit Your Final Video in CapCut

Bring together all your assets into a cohesive animated video using CapCut, a free and powerful video editing application, to assemble, sync, and enhance your animation.

Download and install CapCut on your computer if you haven't already.

Import all your animated video clips by clicking the Import button or dragging files into the media library.

Bring together all your assets into a cohesive animated video using CapCut, a free and powerful video editing application, to assemble, sync, and enhance your animation.

Download and install CapCut on your computer if you haven't already. Open the application and click Create New Project to begin. This opens the main editing interface with a timeline at the bottom.

Import all your animated video clips by clicking the Import button or dragging files into the media library. Also import the voiceover audio file you created with ElevenLabs. Organize your media by creating folders if needed.

Drag your voiceover audio file onto the timeline first. This serves as the backbone of your edit since all visuals must sync to the narration. Position it at the very beginning of the timeline.

Listen to your voiceover and identify natural breaking points between story segments. These breaks indicate where you should transition from one animated clip to the next. Mark these points mentally or with timeline markers.

Drag your first animated video clip onto the video track above the audio. Trim the clip length to match the corresponding section of narration by dragging the edges of the clip. Repeat this process for each subsequent video clip.

Sync each video clip carefully to its matching narration segment. The visual action should align with what the voiceover describes. Use the playhead and zoom controls to make precise adjustments.

Add transitions between your video clips to smooth the visual flow. Click on the Transitions tab in CapCut and browse available options. Drag transitions between clips on your timeline.

Select transitions that match your story's mood—gentle dissolves work well for calm narratives, while faster transitions suit action sequences. Avoid overusing flashy transitions that distract from your story.

Enhance your video with effects by clicking the Effects tab. CapCut provides numerous options including light leaks, particles, and color grading filters. Apply effects sparingly to enhance mood without overwhelming the visuals.

Add automatic captions to improve accessibility and engagement. Click on Text and select Auto Captions. CapCut will analyze your audio and generate synchronized subtitles automatically.

Review the auto-generated captions for accuracy and make corrections as needed. Customize the caption style, font, size, and position to match your video's aesthetic. Ensure captions are readable against your video backgrounds.

Preview your complete video from start to finish. Watch for timing issues, audio sync problems, or jarring transitions. Make final adjustments to create a smooth viewing experience.

Click Export when you're satisfied with your video. Select appropriate export settings—1080p resolution at 30fps is standard for YouTube content. Choose a destination folder and export your final animated video.

Prompt Library

Copy-paste prompts that work

Each prompt has been tested and optimized for this workflow. Customize the bracketed sections.

Story Generation

Write a short children's story (300-400 words) about [main character] who [main conflict or adventure]. The story should have a clear beginning, middle, and end with an emotional resolution. Keep the language simple and suitable for ages 5-8.

Scene Planning

I want to create a YouTube video from this story so give me detailed image ideas for each paragraph. For each scene, describe: the main subject, the setting/background, the mood/lighting, and the action taking place. Format as a numbered list.

Image Generation

3D animated style, [scene description], [character description], [setting details], vibrant colors, soft lighting, Pixar quality, detailed, high quality

Image Quality Control

blurry, distorted, ugly, deformed, bad anatomy, disfigured, poorly drawn face, mutation, extra limbs, low quality, watermark

Story Generation - Adventure

Write a short adventure story about a young explorer discovering a hidden jungle temple filled with mysterious artifacts. The story should be 250-350 words, suitable for animation, with clear visual moments.

Advanced Scene Planning

Based on this story paragraph, create a cinematic image prompt that emphasizes dramatic lighting, dynamic composition, and emotional storytelling. Include specific details about colors, atmosphere, and character positioning.

Troubleshooting

Common issues

Expert Tips

Go further

Use Leonardo AI's Prompt Magic strength slider at different values for each scene type—set it higher (around 0.7-0.8) for complex scenes with multiple elements, and lower (0.3-0.5) for simple character portraits to prevent over-interpretation that can distort faces.

This prevents the AI from adding unwanted creative elements to simple scenes while still helping with complex compositions, resulting in more predictable and consistent outputs across your video.

In Runway Gen 2, generate 2-3 variations of critical emotional scenes by clicking generate multiple times with the same settings—the AI produces different results each time, and you can select the one that best captures the intended emotion.

Since Runway's AI interprets motion differently with each generation, creating multiple versions ensures you have options for key moments and don't settle for suboptimal animation in important story beats.

In CapCut, use the 'Speed Curve' feature (accessed by right-clicking a video clip) to create dynamic slow-motion effects on emotional moments or action sequences, making your 4-second Runway clips feel more cinematic and filling more timeline space.

This technique helps you match longer narration segments without generating additional expensive Runway clips, and adds professional polish by emphasizing important story moments through timing control.

Continue Learning

Works well with this

Chatgpt

Core tool for generating story scripts and breaking down narratives into visual scene descriptions

Elevenlabs

Produces the voiceover narration that serves as the timing backbone for the entire animated video project

Capcut

Final assembly and editing tool that syncs all animation clips, audio, transitions, and effects into the finished video

This tutorial was created by Joshua Kishaba and produced using AI-assisted editorial tools. All recommendations reflect genuine editorial opinion based on hands-on testing. This page may contain affiliate links — see our full disclosure.