June 16, 2026 Linh Nguyen

CUDA Out of Memory in Blender: How to Render Heavy Scenes Without Crashing

“I have a 24GB card and Blender still throws CUDA error: out of memory halfway through the render. Why, and what do I turn off first?”

I get a version of that question all the time, and the frustrating part is that the viewport runs perfectly right up until you hit F12. Cycles needs to load the whole scene into the card’s memory to render it, and the viewport does not, so a scene can feel fine to work in and still blow past 24GB the moment you ask for a full render. The good news is that Blender gives you a clear order of things to cut, and you rarely have to gut your scene to get under the limit.

Order What to change in Blender VRAM freed (sample) Quality cost
1 Cap texture size under Simplify (Render) ~4 to 8 GB None if objects are off camera
2 Lower subdivision render levels ~2 to 5 GB Usually invisible at level 3 to 4
3 Turn off Persistent Data ~1 to 3 GB Slower per frame in animation
4 Reduce adaptive displacement Varies, can be large Softer micro detail
5 Render at lower resolution for tests ~1 to 2 GB Test only, restore for final

Why a 24GB card runs out on a scene that looks small

The number that matters is not how complex the scene feels to work in, it is how much data Cycles has to hold at render time. Four things usually push it over. Texture sets are the heaviest, because every map at full resolution sits in memory whether the object is close or a speck in the distance. Subdivision and adaptive displacement come next, since they generate geometry the viewport often shows you at a lower level than the render uses. Persistent Data, which speeds up animation by keeping the scene loaded between frames, also keeps that data in VRAM the whole time. And the framebuffers for your output resolution and any compositing passes take their share on top.

On one product scene I had pinned at about 23.6 of 24GB, capping textures to 2K for everything outside the hero group brought it down to roughly 15GB, and the render went from a stuttering out of core crawl of around 16 minutes to a clean 6 on the GPU. Nothing in the final frame changed, because none of those textures were near the camera.

Get it back on the GPU instead of dropping to CPU

When Cycles cannot fit the scene, some people just switch the device back to CPU so it finishes at all. That works, but you give up most of your speed, and a frame that would take six minutes on the card can crawl for forty on the processor. Before you accept that, walk down the table above. Capping texture size alone solves a large share of CUDA out of memory cases, and it takes about two minutes.

If you have trimmed everything sensible and you are still a gigabyte or two over, turn on out of core so Cycles streams the overflow from system RAM rather than failing outright. It keeps you on the GPU, though the parts that live in system memory read slower, so expect the frame to take longer than a scene that fits entirely on the card. That is usually a fair price to stay off the CPU.

When the scene is just heavy and your card is the limit

There is a point where the scene genuinely needs more memory than 24GB can give, and no amount of capping textures will change that without wrecking the look. When you reach it, your choices are out of core backed by a lot of system RAM, a card with more memory, or splitting the job so each frame runs on a machine with room to spare.

Renting a GPU machine is one way through this, and iRender fits because you keep your own Blender. You log into a full RTX 4090 workstation with 256GB of system RAM, install your exact Blender build and add-ons, and set out of core the way your scene needs, so the render matches what you get at home because you configured it. That is what “your renders, your rules” means in practice, and the large system memory pool is what makes out of core scenes sit comfortably. Worth knowing before you start: you pay from the moment the machine boots, not from the first frame, so prep and pack your file locally and upload it ready to go. The classic money waster is an instance left running after an overnight render finishes, which auto-shutdown handles for you, and the first session needs fifteen to thirty minutes to get Blender installed before later launches are quick. If you would rather just send a file and forget it, a SaaS render farm covers that case more simply.

Heavy Blender scene that will not fit, even after trimming?: Render Blender on RTX 4090 with 256GB RAM

Want to try your scene on a bigger machine first?: Try iRender now and get a 100% bonus on your first deposit

Frequently Asked Questions

1. Why does Blender say CUDA out of memory when I have a 24GB GPU?

Because Cycles loads the full scene into VRAM to render, while the viewport does not, so a scene that feels light can still exceed 24GB at render time. The biggest causes are full resolution textures on every object, high subdivision and adaptive displacement, Persistent Data holding the scene in memory, and output framebuffers. Cap texture size first, then lower subdivision, and check VRAM with nvidia-smi as you go.

2. How do I reduce VRAM usage in Blender without losing quality?

Start with the Texture Limit under the Simplify panel, since maps on distant objects can be capped with no visible change. Drop subdivision render levels on anything not close to camera, watch adaptive displacement, and turn off Persistent Data if you are at the edge during animation. These steps usually recover several gigabytes while the final frame looks the same.

3. Should I switch Blender to CPU if I run out of VRAM?

It will finish the render, but you lose most of your speed, since a card that renders a frame in minutes can be many times slower on the CPU. A better route is to trim VRAM usage and turn on out of core so Cycles streams overflow from system RAM while staying on the GPU. Out of core is slower than a scene that fits, but usually much faster than falling back to CPU.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Linh Nguyen

Hi everyone. I work as an Assistant Customer at iRender. I always hope to know more 3D artists, data scientists from all over the world.
Contact

INTEGRATIONS

Autodesk Maya
Autodesk 3DS Max
Blender
Cinema 4D
Houdini
Daz Studio
Maxwell
Omniverse
Nvidia Iray
Lumion
KeyShot
Unreal Engine
Twinmotion
Redshift
Octane
V-Ray
And many more…

iRENDER TEAM

MONDAY – FRIDAY: 24/7 Support
SATURDAY – SUNDAY: 6:00 AM – 11:59 PM
(UTC+7)
Hotline: (+84) 912-785-500
Skype: iRender Support
Email: [email protected]
Address 1: 68 Circular Road #02-01, 049422, Singapore.
Address 2: No.22 Thanh Cong Street, Hanoi, Vietnam.

Contact
[email protected]