Past work demonstrated that using neural networks, we can extend unfinished music pieces while maintaining the music style of the musician. With recent advancements in large language models and diffusion models, we are now capable of generating comics with an interesting storyline while maintaining the art style of the artist. In this paper, we used ChatGPT to generate storylines and dialogue and then generated the comic using stable diffusion. We introduced a novel way to evaluate AI-generated stories, and we achieved SOTA performance on character fidelity and art style by fine-tuning stable diffusion using LoRA, ControlNet, etc.
翻译:过去的研究表明,利用神经网络可以在保持音乐家风格的同时,对未完成的音乐片段进行扩展。随着大语言模型和扩散模型的最新进展,我们现在能够在保持艺术家绘画风格的同时,生成具有精彩故事情节的漫画。本文使用ChatGPT生成故事情节和对话,然后通过Stable Diffusion生成漫画。我们提出了一种评估AI生成故事的新方法,并通过LoRA、ControlNet等微调Stable Diffusion,在角色一致性和艺术风格上达到了最先进的性能。