Image-to-Song AI Workflow: From Visual to Fully Sung Music
Turn any image into a fully sung original song, automatically.
This workflow transforms a single image into lyrics, musical style, and a complete vocal track, using state-of-the-art multimodal AI. No music theory, no songwriting skills, no manual prompting required.
How it works
- Image Understanding (QWEN-VL): the first QWEN-VL model analyzes the input image in depth:
  - mood and atmosphere
  - characters, emotions, and environment
  - implicit story and visual themes
- Lyrics & Style Generation (QWEN-VL): a second QWEN-VL model converts that visual interpretation into a structured music JSON (a sketch of this JSON follows the list):
  - complete song lyrics (intro, verse, chorus, bridge, outro)
  - musical tags (genre, instruments, mood, tempo, energy)
  - optimized for a target duration (e.g. 60 seconds, 90 seconds, etc.)
- Sung Music Generation (HeartMuLa): the HeartMuLa node consumes the generated lyrics and tags to produce:
  - a fully sung vocal track
  - coherent melody, rhythm, and structure
  - automatic duration management with smart extension
  - a clean musical ending with fade-out
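To make the hand-off between the two QWEN-VL stages and HeartMuLa concrete, here is a minimal sketch of what the stage-2 music JSON could look like, plus a simple validation pass before it reaches the music node. The key names (`duration_s`, `tags`, `lyrics`), the tag vocabulary, and the example lyrics are illustrative assumptions, not the workflow's actual schema.

```python
# Illustrative only: the JSON keys and sample content below are assumptions,
# not the exact schema produced by the QWEN-VL lyrics/style node.
import json

EXAMPLE_SONG_JSON = """
{
  "duration_s": 60,
  "tags": ["dream pop", "soft piano", "female vocals", "melancholic", "80 bpm"],
  "lyrics": {
    "intro":  "City lights dissolve into the rain...",
    "verse":  "I kept your photograph beneath the glass...",
    "chorus": "We were brighter than the neon sky...",
    "bridge": "Say my name before the colours fade...",
    "outro":  "City lights dissolve... dissolve..."
  }
}
"""

REQUIRED_SECTIONS = ("intro", "verse", "chorus", "bridge", "outro")

def validate_song_spec(raw: str) -> dict:
    """Sanity-check the structured stage-2 output before music generation."""
    spec = json.loads(raw)
    missing = [s for s in REQUIRED_SECTIONS if s not in spec.get("lyrics", {})]
    if missing:
        raise ValueError(f"lyrics missing sections: {missing}")
    if not spec.get("tags"):
        raise ValueError("at least one musical tag is required")
    if spec.get("duration_s", 0) <= 0:
        raise ValueError("duration_s must be a positive number of seconds")
    return spec

if __name__ == "__main__":
    song = validate_song_spec(EXAMPLE_SONG_JSON)
    print(f"{song['duration_s']}s song, tags: {', '.join(song['tags'])}")
```

In the actual workflow this structure is produced and consumed internally by the nodes; the sketch only shows the kind of contract the lyrics stage and the music stage share.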
What makes this workflow special
- Image → Song, end-to-end: one image becomes a complete musical piece, vocals included.
- Duration-aware generation: the workflow adapts the song length to match your target time, avoiding abrupt cutoffs.
- Creative control when you want it: override musical tags manually or let the AI handle everything (see the sketch after this list).
- Perfect for creators. Ideal for:
  - video soundtracks
  - storytelling and cinematic content
  - concept art music
  - social media, reels, and shorts
  - experimental and generative art
- MimicPC-optimized: designed to run smoothly in MimicPC environments with no manual setup.
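Two of the bullets above, duration-aware generation and manual tag override, map onto small pieces of logic that are easy to picture. The sketch below is a hedged illustration under assumed function names and a simple chorus-repetition strategy; the real nodes handle this internally and may extend material differently.

```python
# Hedged sketch of the two "creative control" knobs described above:
# manual tag override and duration-aware planning. Function names, the
# override policy, and the section-repetition strategy are illustrative
# assumptions, not the workflow's actual implementation.

def resolve_tags(ai_tags: list[str], user_tags: list[str] | None = None) -> list[str]:
    """Manual tags replace the AI's choices; otherwise keep what QWEN-VL produced."""
    return list(user_tags) if user_tags else list(ai_tags)

def plan_sections(section_lengths: dict[str, float], target_s: float,
                  fade_out_s: float = 3.0) -> list[str]:
    """Repeat the chorus until the song approaches the target length, then fade out.

    section_lengths maps section name -> estimated sung duration in seconds.
    """
    plan = ["intro", "verse", "chorus", "bridge", "chorus", "outro"]
    total = sum(section_lengths[s] for s in plan)
    # "Smart extension": pad with extra choruses instead of cutting off abruptly.
    while total + section_lengths["chorus"] <= target_s - fade_out_s:
        plan.insert(-1, "chorus")  # keep the outro (and its fade-out) at the end
        total += section_lengths["chorus"]
    return plan

lengths = {"intro": 6, "verse": 14, "chorus": 12, "bridge": 10, "outro": 8}
print(resolve_tags(["dream pop", "soft piano"], ["synthwave", "120 bpm"]))
print(plan_sections(lengths, target_s=90))
```

Keeping the outro, and therefore the fade-out, as the final section is what avoids the abrupt cutoff mentioned above: extension always happens before the ending, never after it.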
The result
A unique, AI-generated song inspired directly by your image, with lyrics, emotion, and vocals that feel intentional and alive.
Upload an image.
Set a duration.
Get a song.
This workflow turns visuals into music, automatically.
