The SD3 checkpoints that contain text encoders: sd3_medium_incl_clips.safetensors (5.5GB) and sd3_medium_incl_clips_t5xxlfp8.safetensors (10.1GB) can be used like any regular checkpoint in ComfyUI. The difference between both these checkpoints is that the first contains only 2 text encoders: CLIP-L and CLIP-G while the other one contains 3: CLIP-L, CLIP-G and T5XXL. Make sure to put either sd3_medium_incl_clips.safetensors or sd3_medium_incl_clips_t5xxlfp8.safetensors in your ComfyUI/models/checkpoints/ directory.
Here is a very basic example how to use it:
The sd3_medium.safetensors file does not contain text encoder/CLIP weights so you must load them separately to use that file. Download the text encoder weights from the text_encoders directory and put them in your ComfyUI/models/clip/ directory. sd3_medium.safetensors should be put in your ComfyUI/models/checkpoints/ directory.
Here is a basic example how to use it:
As a reminder you can save these image files and drag or load them into ComfyUI to get the workflow.
SD3 performs very well with the negative conditioning zeroed out like in the following example:
SD3 Controlnets by InstantX are also supported. Download the canny controlnet model here, and put it in your ComfyUI/models/controlnet directory. Be sure to rename it to something clear like sd3_controlnet_canny.safetensors.
Here is an example of how to use it: