How to Create 3D Animated Videos with Free AI Tools in 2026

YouTube channel logo
Joshua Kishaba·AI Mastery·Subscribe
Published May 7, 2026
180 minIntermediateFree

Learn how to create professional 3D animated cartoon videos using free AI tools including ChatGPT, ElevenLabs, Leonardo AI, Runway, and CapCut.

This page may contain affiliate links. If you purchase through these links, we may earn a commission at no extra cost to you. Full disclosure.

Prerequisites

  • Free accounts at ChatGPT, ElevenLabs, Leonardo AI, Runway ML, and CapCut
  • A story concept or inspiration for your animated video
  • A text editor or Google Docs for organizing scripts and prompts
  • Approximately 2-4 hours of uninterrupted production time

Core Actions

  1. Generate story script using ChatGPT with specific narrative prompts
  2. Create voiceover audio with ElevenLabs from your completed story
  3. Break down story into scene descriptions and generate image prompts in ChatGPT
  4. Generate and upscale images using Leonardo AI with 16:9 aspect ratio and 3D Animation Style
  5. Animate each image using Runway Gen 2 with motion slider set to 1 and camera motion controls
  6. Assemble all clips and audio in CapCut, sync to narration, add transitions and effects, export at 1080p 30fps

Expected Outcome

A complete, YouTube-ready 1-3 minute animated video with synchronized voiceover, consistent visual style, and professional editing using entirely free tools.

Introduction

This tutorial walks you through the complete process of creating professional-quality 3D animated cartoon videos using entirely free AI tools. You'll cover story creation, voiceover generation, image creation, animation, and video editing. By the end, you'll have a finished animated video ready to share on YouTube or other platforms.

The workflow consists of six distinct steps that transform a simple concept into a fully animated story. You'll use ChatGPT for scriptwriting, ElevenLabs for voiceovers, Leonardo AI for image generation, Runway Gen 2 for animation, and CapCut for final editing. Each tool plays a specific role in the production pipeline, and this guide shows you exactly how to configure and use them.

In This Video

This tutorial demonstrates how to create professional-quality 3D animated videos using free AI tools in a six-step workflow. Users generate story scripts with ChatGPT, create voiceovers with ElevenLabs, produce images with Leonardo AI, animate scenes with Runway Gen 2, and assemble everything in CapCut. By following this process, creators can produce YouTube-ready animated videos from concept to export without paid software.

1

Generate Your Story Script with ChatGPT

Watch from 0:37
  • The foundation of any animated video is a compelling narrative.
  • Navigate to ChatGPT and craft a specific prompt that tells the AI exactly what kind of story you need.
  • Review the generated story to ensure it fits your vision and has a clear beginning, middle, and end.

The foundation of any animated video is a compelling narrative. Start by creating a short story using ChatGPT that serves as the basis for your entire animation project.

Navigate to ChatGPT and craft a specific prompt that tells the AI exactly what kind of story you need. For this tutorial, we asked ChatGPT to "write a child's short story about a young boy and his bird." The more specific your prompt, the better the results will be.

Review the generated story to ensure it fits your vision and has a clear beginning, middle, and end. The story should be concise enough to translate into a short video, typically 1-3 minutes when narrated. Copy the entire story text once you're satisfied with the output.

Open Google Docs or any note-taking application and paste your story there. This document will serve as your central reference throughout the production process. You'll return to this document multiple times to copy sections for voiceover generation and image prompt creation.

2

Create Professional Voiceovers with ElevenLabs

Watch from 1:00
  • Transform your written story into spoken narration using ElevenLabs to generate a high-quality AI voiceover that brings emotional depth to your animated story.
  • Visit ElevenLabs by clicking the link in the video description or navigating directly to their website.
  • Browse through the available voice options and select one that matches the tone and style of your story.

Transform your written story into spoken narration using ElevenLabs to generate a high-quality AI voiceover that brings emotional depth to your animated story.

Visit ElevenLabs by clicking the link in the video description or navigating directly to their website. Sign up for a free account if you don't already have one. The free tier provides sufficient credits for creating voiceovers for multiple short videos.

Browse through the available voice options and select one that matches the tone and style of your story. For a children's story, choose a warm, gentle voice with clear articulation. Click on different voices to preview them before making your selection.

Copy the full story text from your Google Doc and paste it into the text input field in ElevenLabs. Ensure you've pasted the complete story without any formatting issues. Click the Generate button and wait for the AI to process your text into speech.

Listen to the generated voiceover to ensure quality and pacing. Download the audio file to your computer once satisfied. Name the file clearly (such as "story_voiceover.mp3") so you can easily locate it during the editing phase.

3

Generate Image Prompts for Each Scene

Watch from 1:51
  • With your story and narration complete, plan the visual elements by returning to ChatGPT to create detailed image prompts for each paragraph or scene in your story.
  • Go back to ChatGPT and enter a new prompt: "I want to create a YouTube video from this story so give me image ideas for each paragraph.
  • Review the image prompts that ChatGPT generates for each paragraph.

With your story and narration complete, plan the visual elements by returning to ChatGPT to create detailed image prompts for each paragraph or scene in your story.

Go back to ChatGPT and enter a new prompt: "I want to create a YouTube video from this story so give me image ideas for each paragraph." Paste your story text after this instruction. This tells ChatGPT to break down your narrative into distinct visual scenes.

Review the image prompts that ChatGPT generates for each paragraph. Each prompt should describe a specific scene, including characters, setting, mood, and action. These prompts will guide the AI image generator to create consistent visuals.

Copy all the image prompts and paste them into your Google Doc below your story text. This keeps everything organized in one place. You'll need to modify these prompts slightly in the next step to work optimally with Leonardo AI.

Before moving forward, replace any character names with generic descriptors. In the example, "Oliver" becomes "boy" and "Skyler" becomes "bird." AI image generators work better with clear, descriptive terms rather than proper names, which they cannot visualize consistently.

4

Generate Images with Leonardo AI

Watch from 2:08
  • Transform your text prompts into actual images using Leonardo AI, a powerful free image generation platform.
  • Navigate to Leonardo AI and create a free account using your email address.
  • Configure your generation settings before creating any images.

Transform your text prompts into actual images using Leonardo AI, a powerful free image generation platform. This step creates all the visual assets you need for your animation.

Navigate to Leonardo AI and create a free account using your email address. The free tier provides daily token credits that are sufficient for generating multiple high-quality images. Locate the Image Generation section from the main dashboard once logged in.

Configure your generation settings before creating any images. From the left sidebar, set the number of images to generate per prompt to four. Enable Prompt Magic V2 for enhanced prompt interpretation.

Open the Advanced Control section and change the aspect ratio to 16:9, which is the standard YouTube video format. Lock this ratio by clicking the lock icon. Increase the guidance scale slider to ensure the AI follows your prompts more closely.

Select the 3D Animation Style model from the model dropdown menu. This ensures all generated images have a consistent cartoon aesthetic suitable for animated storytelling. This setting is crucial for maintaining visual coherence across all scenes.

Copy the first image prompt from your Google Doc and paste it into the main prompt field. Enable Enhance Prompt if available. Paste the same prompt into the negative prompt field prefaced with terms describing what you don't want (like "blurry, distorted, ugly").

Click the Generate button and wait for Leonardo AI to create four variations of your scene. This typically takes 10-30 seconds depending on server load. Review all four generated images carefully.

Select the image that best matches your story's narrative and aesthetic. Look for proper composition, clear character depiction, and appropriate mood. If the character's face appears distorted or unclear, fix this in the next step.

Click on the Alchemy Upscaler option to enhance image quality and fix facial features. Wait a few seconds for the upscaling process to complete. This dramatically improves detail and corrects common AI artifacts around faces and hands.

Download the upscaled image to your computer. Repeat this entire process for each scene prompt in your story. Maintain consistent naming conventions for your files (scene_01.png, scene_02.png, etc.) to stay organized.

5

Animate Your Images with Runway Gen 2

Watch from 4:11
  • Bring static images to life through motion by using Runway Gen 2 to transform each still image into a short animated video clip.
  • Visit Runway ML and sign up for a free account.
  • Upload your first scene image to Runway Gen 2.

Bring static images to life through motion by using Runway Gen 2 to transform each still image into a short animated video clip.

Visit Runway ML and sign up for a free account. Locate and click on Gen 2 from the available tools menu. This tool specializes in converting static images into dynamic video content.

Upload your first scene image to Runway Gen 2. Leave the text prompt field blank at this stage. Click the Generate button to create your first animation attempt.

Review the initial 4-second generated video. You may notice excessive or unrealistic movement that doesn't match your story's tone. Fine-tune the animation settings to achieve better results.

Click on the Motion slider and decrease it to 1 (the lowest setting). This creates subtle, natural movement rather than dramatic scene changes. Lower motion values work better for narrative storytelling where you want controlled, gentle animation.

Click on Camera Motion to add intentional camera movement to your scene. Add slight zoom, pan, or tilt movements that enhance the storytelling. For example, a slow zoom on a character's face emphasizes emotion.

Click Generate again with these new settings. The result should be much more polished and natural-looking. The animation will feel purposeful rather than chaotic.

Download the generated video clip once you're satisfied with the result. Repeat this process for every image you created in Leonardo AI. Maintain consistent settings across similar scenes for visual continuity.

6

Edit Your Final Video in CapCut

Watch from 5:03
  • Bring together all your assets into a cohesive animated video using CapCut, a free and powerful video editing application, to assemble, sync, and enhance your animation.
  • Download and install CapCut on your computer if you haven't already.
  • Import all your animated video clips by clicking the Import button or dragging files into the media library.

Bring together all your assets into a cohesive animated video using CapCut, a free and powerful video editing application, to assemble, sync, and enhance your animation.

Download and install CapCut on your computer if you haven't already. Open the application and click Create New Project to begin. This opens the main editing interface with a timeline at the bottom.

Import all your animated video clips by clicking the Import button or dragging files into the media library. Also import the voiceover audio file you created with ElevenLabs. Organize your media by creating folders if needed.

Drag your voiceover audio file onto the timeline first. This serves as the backbone of your edit since all visuals must sync to the narration. Position it at the very beginning of the timeline.

Listen to your voiceover and identify natural breaking points between story segments. These breaks indicate where you should transition from one animated clip to the next. Mark these points mentally or with timeline markers.

Drag your first animated video clip onto the video track above the audio. Trim the clip length to match the corresponding section of narration by dragging the edges of the clip. Repeat this process for each subsequent video clip.

Sync each video clip carefully to its matching narration segment. The visual action should align with what the voiceover describes. Use the playhead and zoom controls to make precise adjustments.

Add transitions between your video clips to smooth the visual flow. Click on the Transitions tab in CapCut and browse available options. Drag transitions between clips on your timeline.

Select transitions that match your story's mood—gentle dissolves work well for calm narratives, while faster transitions suit action sequences. Avoid overusing flashy transitions that distract from your story.

Enhance your video with effects by clicking the Effects tab. CapCut provides numerous options including light leaks, particles, and color grading filters. Apply effects sparingly to enhance mood without overwhelming the visuals.

Add automatic captions to improve accessibility and engagement. Click on Text and select Auto Captions. CapCut will analyze your audio and generate synchronized subtitles automatically.

Review the auto-generated captions for accuracy and make corrections as needed. Customize the caption style, font, size, and position to match your video's aesthetic. Ensure captions are readable against your video backgrounds.

Preview your complete video from start to finish. Watch for timing issues, audio sync problems, or jarring transitions. Make final adjustments to create a smooth viewing experience.

Click Export when you're satisfied with your video. Select appropriate export settings—1080p resolution at 30fps is standard for YouTube content. Choose a destination folder and export your final animated video.

Prompt Library

Copy-paste these prompts directly into the chatbot of your choice for best results. Each prompt has been tested and optimized for this workflow.

Story Generation

Use this prompt in ChatGPT to generate the narrative foundation for your animated video. It ensures the story is appropriately scoped and structured for short-form animation.

Write a short children's story (300-400 words) about [main character] who [main conflict or adventure]. The story should have a clear beginning, middle, and end with an emotional resolution. Keep the language simple and suitable for ages 5-8.
Scene Planning

Paste your completed story after this prompt in ChatGPT to break down your narrative into specific visual scenes. This creates the image briefs you'll use with Leonardo AI.

I want to create a YouTube video from this story so give me detailed image ideas for each paragraph. For each scene, describe: the main subject, the setting/background, the mood/lighting, and the action taking place. Format as a numbered list.
Image Generation

Use this template when pasting prompts into Leonardo AI's image generation field. Replace bracketed sections with your specific scene requirements to achieve consistent Pixar-style visuals.

3D animated style, [scene description], [character description], [setting details], vibrant colors, soft lighting, Pixar quality, detailed, high quality
Image Quality Control

Paste this into Leonardo AI's negative prompt field to exclude common AI artifacts and ensure cleaner, more usable generated images.

blurry, distorted, ugly, deformed, bad anatomy, disfigured, poorly drawn face, mutation, extra limbs, low quality, watermark
Story Generation - Adventure

Alternative story prompt for users wanting to create different content themes beyond the tutorial's original example.

Write a short adventure story about a young explorer discovering a hidden jungle temple filled with mysterious artifacts. The story should be 250-350 words, suitable for animation, with clear visual moments.
Advanced Scene Planning

Use this meta-prompt in ChatGPT when you want more detailed, cinematically-focused image descriptions than standard scene breakdowns.

Based on this story paragraph, create a cinematic image prompt that emphasizes dramatic lighting, dynamic composition, and emotional storytelling. Include specific details about colors, atmosphere, and character positioning.

Troubleshooting & Common Errors

Running into issues? Here are the most common problems and how to fix them.

Expert Tips

💡 Use Leonardo AI's Prompt Magic strength slider at different values for each scene type—set it higher (around 0.7-0.8) for complex scenes with multiple elements, and lower (0.3-0.5) for simple character portraits to prevent over-interpretation that can distort faces.

This prevents the AI from adding unwanted creative elements to simple scenes while still helping with complex compositions, resulting in more predictable and consistent outputs across your video.

💡 In Runway Gen 2, generate 2-3 variations of critical emotional scenes by clicking generate multiple times with the same settings—the AI produces different results each time, and you can select the one that best captures the intended emotion.

Since Runway's AI interprets motion differently with each generation, creating multiple versions ensures you have options for key moments and don't settle for suboptimal animation in important story beats.

💡 In CapCut, use the 'Speed Curve' feature (accessed by right-clicking a video clip) to create dynamic slow-motion effects on emotional moments or action sequences, making your 4-second Runway clips feel more cinematic and filling more timeline space.

This technique helps you match longer narration segments without generating additional expensive Runway clips, and adds professional polish by emphasizing important story moments through timing control.

Continue Learning

Explore More Tools

This tutorial was created by Joshua Kishaba and produced using AI-assisted editorial tools. All recommendations reflect genuine editorial opinion based on hands-on testing. This page may contain affiliate links — see our full disclosure.

Tools Required
  • ChatGPT
  • ElevenLabs
  • Leonardo AI
  • Runway Gen 2
  • CapCut