我要投稿

使用Llama3辅助生成StableDiffusion提示词，AI绘画从未如此简单

发布日期：2024-05-09 12:10:40 浏览次数： 3253

作者：AI工程师笔记

微信搜一搜，关注“AI工程师笔记”

微信公众号：AI工程师笔记
关注可了解更多的内容。问题或建议，请公众号留言;
如果你觉得对你有帮助，欢迎赞赏

Llama3简介

Llama 3是由Meta公司推出的一款大型语言模型（LLM），其版本规模从8B到400B参数不等，这超越了包括谷歌的Gemma/Gemini、Mistral和Claude 3 Sonnet在内的众多竞争对手，且能在仅4GB的GPU上实现高效运行，为商业应用提供了更多可能性。

HuggingFace上有大神发布了基于Llama3的stable diffusion的prompt提示词微调模型——llama3_ifai_sd_prompt_mkr_q4km。我们可以向模型输入简单的提示，Llama3将会给我们返回详细的提示描述。

Llama3部署

llama3_ifai_sd_prompt_mkr_q4km模型已上架Ollama仓库。

Ollama部署可参考：Windows本机运行7B大模型

执行ollama run impactframes/llama3_ifai_sd_prompt_mkr_q4km即可完成部署。

C:\Users\Zz>ollama run impactframes/llama3_ifai_sd_prompt_mkr_q4km
pulling manifest
pulling 8c307d788852... 100% ▕████████████████████████████████████████████████████████▏ 4.9 GB
pulling 2190828de961... 100% ▕████████████████████████████████████████████████████████▏  260 B
pulling 0661d86cbb71... 100% ▕████████████████████████████████████████████████████████▏  575 B
pulling 563e392d0f49... 100% ▕████████████████████████████████████████████████████████▏  12 KB
pulling 3ac6276a4a91... 100% ▕████████████████████████████████████████████████████████▏  415 B
pulling 98f4faea31b4... 100% ▕████████████████████████████████████████████████████████▏   73 B
pulling d074ab4deb16... 100% ▕████████████████████████████████████████████████████████▏  633 B
verifying sha256 digest
writing manifest
removing any unused layers
success

测试使用

以往我们在使用Stable Diffusion时，自己思考提示词是一件很消耗脑细胞的事情。通过大模型生成的提示词往往更加精准、丰富多样，能够引导Stable Diffusion产出更高分辨率、细节更丰富的图像。大型语言模型能够理解和生成高度复杂、抽象的概念描述，这为图像生成提供了无限的创意空间。用户可以使用自然语言描述极其详细或新颖的场景，即使是那些现实中不存在或难以想象的构想，大模型都能辅助生成精确的提示词，进而创造出独特且富有创意的图像作品。并且对于非专业设计人员来说，直接操作复杂的图像生成参数可能较为困难。而利用大模型生成的提示词作为桥梁，用户只需简单描述心中所想，即可得到满意的作品，极大地降低了艺术创作和技术使用的门槛。

提示词1：一个人站在山顶

>>> A man standing atop of a mountain
Philosophical, introspective, contemplative, solitary figure, dressed in simple attire, standing at the summit of
a majestic mountain, gazing out at the breathtaking panorama, lost in thought, pondering the mysteries of
existence, the universe, and human nature, as the wind whispers secrets in his ear, the sun

提示词2：拟人的猫猫坐在电脑前工作

>>> 拟人的猫猫坐在电脑前工作
Fictional character, anthropomorphic cat named 'Whiskers', sitting comfortably at a desk in front of a computer
screen, wearing glasses, typing away with paws, intense focus on the task at hand, surrounded by scattered papers,
pencils, and catnip toys, whimsical atmosphere, soft lighting, cozy setting.

提示词3：即将降落月球的宇宙飞船

>>> 即将降落月球的宇宙飞船
Soaring through space, the sleek, silver spacecraft, 'Lunar Quest', prepares to land on the moon's surface, vast
craters and rugged terrain visible beneath, solar panels extended, engines firing, smoke trails forming,
astronauts in suits, suited up for lunar exploration, lunar module separating from main spacecraft, majestic Earth
rising above horizon, dramatic sky gradient.

提示词4：机器人战争

>>> 机器人战争 Advanced, mechanized armies clashing on a desolate battlefield, steel behemoths with glowing blue eyes, towering robots marching across the terrain, smoke and flames engulfing the landscape, laser beams slicing through the air, explosions rocking the ground, metal shards flying everywhere, apocalyptic devastation, twisted ruins in the distance.

提示词5：现代化的熊猫战士

>>> 现代化的熊猫战士
Panda warrior, dressed in futuristic armor, helmet adorned with panda emblem, standing confidently on a futuristic
battlefield, holding energy sword, shield emblazoned with panda pattern, advanced jetpack strapped to back,
airbrushed camouflage design, bold lines, metallic sheen, modern technology merged with ancient symbolism.

插件

如果感觉每次手动复制比较麻烦，可以在SableDiffusion下载IF_prompt_MKR插件。

插件安装好，重启SableDiffusion遇到报错：

我们需要在这块安装openai框架：

安装好openai框架后，再次启动SableDiffusion就不会报错了。这时候，我们打开Web页面，在设置—>未分类里找到iF_prompt_MKR，设置Ollama。

设置完成后，在文生图页面的脚本处设置：

此外，我们还可以根据自己的需要设置系统提示词和反向提示词。
所有设置完毕后，在Input Prompt处输入自己想要绘画的概要内容，点击生成即
可。

关注我，定期分享前沿AI应用。

53AI，企业落地大模型首选服务商

产品：场景落地咨询+大模型应用平台+行业解决方案

承诺：免费POC验证，效果达标后再合作。零风险落地应用大模型，已交付160+中大型企业

相关资讯

2024-07-10

科研助力神器：Scholar GPT，百倍提升你的研究效率！

2024-07-09

Doc2X：一款功能超级强大的文档解析与转换工具

2024-07-06

我对多智能体协作过程自动演化架构设计

2024-07-06

可穿戴AI，底层逻辑的变化

2024-07-06

一文彻底搞懂Transformer - Word Embedding（词嵌入）

2024-07-06

AI动态 | 腾讯元宝AI搜索能力升级：深度搜索模式上线

2024-07-06

智能手表 + AI ，都已经这么智能了？？

2024-07-06

死磕10万卡GPU算力集群，腾讯星脉网络2.0有什么秘密武器？

了解更多

160+中大型企业正在使用53AI

立即咨询预约演示

把握AI发展的机遇，共同探索、共同进步

2025-01-22

如何打造基于GenAI的员工服务机器人

2025-01-22

热点资讯

实测Qwen3-Coder，这就是目前最强的开源编程模型

2025-07-23

看大厂PM，如何玩转多个智能体开发平台

2025-06-17

53AI Hub重磅开源！让99%的智能体开发者赚到钱！

2025-06-17

DeepSeek R1-0528 小版本升级

2025-05-29

高效 Agents 构建指南

2025-05-23

Qwen3-Coder开源：面向世界的智能编程引擎

2025-07-23

SpringAI Alibaba实战文生图、聊天记忆功能

2025-06-01

忽视小模型和知识库，企业AI应用必将是死路一条

2025-05-07

从RAG到CoT再到MCP，一文读懂AI Agent落地难题｜大模型研究

2025-05-07

CAG 与 RAG：哪种方法能带来性能更好的人工智能

2025-05-07

大家都在问

扣子（Coze）开源了！你发现了哪些商业机会？

2025-07-30

GLM-4.5 发布，六大主流模型混战测评，谁能一键生成“ 真·可用 ”的应用？

2025-07-29

AI 应用开发，还需要意图识别吗？

2025-07-29

Coze既可开源也能本地部署，n8n和coze哪家强？

2025-07-29

AI还有哪些机会？你是否适合切入？

2025-07-29

文档知识图谱构建：AI代理如何简化复杂流程？

2025-07-29

AI Agent 新选择：Coze Studio 开源上手实录，能替代 Dify 吗？

2025-07-28

Cursor Meetup 杭州站分享实录：小团队如何用 AI 撑起万级日活产品？

2025-07-28

热门标签

内容创作大模型技术个人提效 langchain llamaindex 多模态技术 RAG技术智能客服知识图谱模型微调 RAGFlow coze Dify Fastgpt Bisheng Qanything AI+汽车 AI+金融 AI+工业 AI+培训 AI+SaaS 提示词框架提示词技巧 AI+电商 AI面试数字员工 ChatBI 知识管理开源大模型智能营销智能硬件智能化改造 AI+医疗 MaxKB