A 4B-parameter open-source music model that generates full songs in seconds — locally on consumer hardware
I downloaded the XL-base model. No satisfaction. Singer sounds like she's in a tin shower. Instruments seem like they have no coherence, compared to the same song on Suno.
even your provided samples don't sound very good to me.
It’s like when I first used v3.5
Now we just need LoRa’s for it.
How about a prefab Acestep v1.5 LoRa training workflow?
Nice! Would it be possible to output stems someday?
I downloaded the XL-base model. No satisfaction. Singer sounds like she's in a tin shower. Instruments seem like they have no coherence, compared to the same song on Suno.
even your provided samples don't sound very good to me.
It’s like when I first used v3.5
Now we just need LoRa’s for it.
How about a prefab Acestep v1.5 LoRa training workflow?
Nice! Would it be possible to output stems someday?