May Wrapped

ComfyUI May Wrapped: Everything we Integrated

Josiah Villegas

Jun 01, 2026

If you blinked, here’s the catch-up.

In May we integrated 11 new models spanning video, image, 3D, audio, and multimodal.

Here’s what’s new.

Krea 2 — Krea AI · Image & Style Transfer

Krea’s first foundation model, live as a ComfyUI Partner Node on day one. Most image models compete on what’s in the frame. Krea 2 competes on how it looks — style references, moodboards, and aesthetic range across illustration, anime, photorealism, and beyond.

Void — Netflix · Video Object Removal

Netflix open-sourced VOID (Video Object and Interaction Deletion). Most inpainting tools erase pixels. VOID removes the subject and everything it caused — shadows, reflections, physical interactions. Apache 2.0, natively supported.

Tripo 3.1 — Tripo AI · 3D Generation

Text-to-model, image-to-model, and multiview-to-model in one node. Pair it with TripoSplat and you have a full image-to-3D-Gaussian pipeline, end to end.

Luma UNI-1 — Luma AI · Image Editing

Not a diffusion model. Luma’s Uni-1 is a decoder-only autoregressive transformer — it reasons through your prompt before generating. Stronger reference handling, more precise editing. Create and Modify modes, up to 9 reference images. One of the most architecturally interesting image models of the year

Claude — Anthropic · Multimodal

Claude is now accessible inside ComfyUI. Prompt writing, workflow reasoning, multimodal understanding — language intelligence anywhere in your pipeline.

OpenRouter · Text

Access to 20+ LLM models through a single node. Route to the right model for the right task without leaving your workflow.

Gemma 4 — Google DeepMind · Multimodal

Google’s best open model family yet. Text, image, audio, and video — runs on a phone or a single GPU. The 31B model sits at #3 on the open Arena leaderboard. Apache 2.0, 256K context window.

HidDream-O1-Image — Hidream.ai · Image

Reasoning-guided image generation. Open source model built on a Pixel-level Unified Transformer (UiT) without external VAEs or disjoint text encoders. Strong on complex compositional prompts where most models fall apart.

Stable Audio 3 — Stability AI · Audio & SFX

Text-to-audio covering music, sound effects, and production-ready audio. Your sound design toolkit, now in ComfyUI.

BiRefNet — CAAI AIR · Background Removal

High-resolution background segmentation. Pairs naturally with VOID — BiRefNet handles stills, VOID handles motion.

MoGe — Microsoft · 3D Geometry & Depth

Full 3D geometry from a single image — point maps, depth, normals, and camera FOV in one forward pass. A CVPR ‘25 Oral, now in your toolkit.

ComfyHub is gaining momentum

Crossed 500+ workflows on the Hub
If you haven’t browsed lately, there’s a good chance someone already built the workflow you were about to make.

Stay tuned for June!

Discussion about this post

No posts

Ready for more?

#nojs-banner { position: fixed; bottom: 0; left: 0; padding: 16px 16px 16px 32px; width: 100%; box-sizing: border-box; background: red; color: white; font-family: -apple-system, "Segoe UI", Roboto, Helvetica, Arial, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol"; font-size: 13px; line-height: 13px; } #nojs-banner a { color: inherit; text-decoration: underline; } This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts