I developed an LTX 2.3 program based on the desktop version of LTX, with optimizations that bypass the 32GB VRAM limitation. It integrates features such as start/end frames, text-to-video, image-to-video, lip-sync, and video enhancement. The links are in the comments.
https://redd.it/1s7g50w
@stablediffusion_r
https://redd.it/1s7g50w
@stablediffusion_r
Z-image character lora great success with onetrainer with these settings.
For z-image base.
Onetrainer github: https://github.com/Nerogar/OneTrainer
Go here https://civitai.com/articles/25701 and grab the file named z-image-base-onetrainer.json from the resources section. I can't share the results because reasons but give it a try, it blew my mind. Made it from random tips i also read on multiple subs so I thought I'd share it back.
I used around 50 images captioned briefly ( trigger. expression. Pose. Angle. Clothes. Background - 2-3 words each ) ex: "Natasha. Neutral expression. Reclined on sofa. Low angle handheld selfie. Wearing blue dress. Living room background."
Poses, long shots, low angles, high angles, selfies, positions, expressions, everything works like a charm (provided you captioned for them in your dataset).
Would be great if I found something similar for Chroma next.
My contribution is configured it so it works with 1024 res images since most of the guides I see are for 512.
Works incredible with generating at FHD; i use the distill lora with 8 steps so its reasonably fast: workflow: https://pastebin.com/UacpHZUG
I found that euler_cfg_pp with beta33 works really well if you want the instagram aesthetic; you can get the beta33 scheduler with this node: https://github.com/silveroxides/ComfyUI\_PowerShiftScheduler
What other sampler / schedulers have you found works well for realism?
https://redd.it/1s7fr2b
@stablediffusion_r
For z-image base.
Onetrainer github: https://github.com/Nerogar/OneTrainer
Go here https://civitai.com/articles/25701 and grab the file named z-image-base-onetrainer.json from the resources section. I can't share the results because reasons but give it a try, it blew my mind. Made it from random tips i also read on multiple subs so I thought I'd share it back.
I used around 50 images captioned briefly ( trigger. expression. Pose. Angle. Clothes. Background - 2-3 words each ) ex: "Natasha. Neutral expression. Reclined on sofa. Low angle handheld selfie. Wearing blue dress. Living room background."
Poses, long shots, low angles, high angles, selfies, positions, expressions, everything works like a charm (provided you captioned for them in your dataset).
Would be great if I found something similar for Chroma next.
My contribution is configured it so it works with 1024 res images since most of the guides I see are for 512.
Works incredible with generating at FHD; i use the distill lora with 8 steps so its reasonably fast: workflow: https://pastebin.com/UacpHZUG
I found that euler_cfg_pp with beta33 works really well if you want the instagram aesthetic; you can get the beta33 scheduler with this node: https://github.com/silveroxides/ComfyUI\_PowerShiftScheduler
What other sampler / schedulers have you found works well for realism?
https://redd.it/1s7fr2b
@stablediffusion_r
GitHub
GitHub - Nerogar/OneTrainer: OneTrainer is a one-stop solution for all your Diffusion training needs.
OneTrainer is a one-stop solution for all your Diffusion training needs. - Nerogar/OneTrainer
Inspired by u/goddesspeeler's work, I created a "VACE Transition Builder" node.
u/goddesspeeler shared a great workflow he did yesterday.
It allows entering the path to a folder and having all the clips stitched together using VACE.
This works amazingly well and thought of converting it into a node instead.
https://preview.redd.it/hbth1oy1f4sg1.png?width=1891&format=png&auto=webp&s=7c1b496afabd1947dcb1e0bcccd8fb2b9812d802
For those that haven't seen his post. It basically allow creating automatic transitions between clips and then stitching them all together. Making long video generation a breeze. This node aims to replicate his workflow, but with the added bonus of being more streamlined and allowing for easy clip selection or re-ordering. Mousing over a clip shows a preview if it.
The option node is only needed if you want to tweak the defaults. When not added it uses the same defaults found in the workflow. I plan on exposing some of these to the comfy preferences, so we could make changes to what the defaults are.
You can find this node here
Hats off again to goddess_peeler for a great solution!
I'm still unsure about the name though..
I hesitated between this or VACE Stitcher... any preference? 😅
https://redd.it/1s7ilwe
@stablediffusion_r
u/goddesspeeler shared a great workflow he did yesterday.
It allows entering the path to a folder and having all the clips stitched together using VACE.
This works amazingly well and thought of converting it into a node instead.
https://preview.redd.it/hbth1oy1f4sg1.png?width=1891&format=png&auto=webp&s=7c1b496afabd1947dcb1e0bcccd8fb2b9812d802
For those that haven't seen his post. It basically allow creating automatic transitions between clips and then stitching them all together. Making long video generation a breeze. This node aims to replicate his workflow, but with the added bonus of being more streamlined and allowing for easy clip selection or re-ordering. Mousing over a clip shows a preview if it.
The option node is only needed if you want to tweak the defaults. When not added it uses the same defaults found in the workflow. I plan on exposing some of these to the comfy preferences, so we could make changes to what the defaults are.
You can find this node here
Hats off again to goddess_peeler for a great solution!
I'm still unsure about the name though..
I hesitated between this or VACE Stitcher... any preference? 😅
https://redd.it/1s7ilwe
@stablediffusion_r
Can LTX-2.3 do video to video, like LTX-2?
A great feature of LTX-2 is that it can take a video sequence as input, and use the voices and motions in it as seed for generating a new video starting with the last frame.
Can LTX-2.3 do that too? I haven't seen a workflow yet that does this.
https://redd.it/1s7ixma
@stablediffusion_r
A great feature of LTX-2 is that it can take a video sequence as input, and use the voices and motions in it as seed for generating a new video starting with the last frame.
Can LTX-2.3 do that too? I haven't seen a workflow yet that does this.
https://redd.it/1s7ixma
@stablediffusion_r
Reddit
From the StableDiffusion community on Reddit
Explore this post and more from the StableDiffusion community