TL;DR
一条好用的 GPT Image 2 提示词不是一句话,而是一叠决策:主体、场景、风格、镜头、光线、情绪。本指南给你 50+ 条可直接复制的提示词模板,覆盖电影感、人像、动作、自然与奇幻五大类,并附上失败场景的修复清单与高效的迭代工作流。文中所有样图都用的是同一套 KIE gpt-image-2-text-to-image 模型,每张 12 credits、提示词上限 20,000 字符。免费试用 GPT Image 2 →
一条好提示词的解剖结构
大多数人上来就直接写"我想要什么"。而真正出好图的人,写的是"镜头看到了什么"。这就是全部秘密。
我们在 KIE gpt-image-2-text-to-image 接口上跑了几千次测试后,沉淀出一套八槽位公式,几乎覆盖所有场景。八个槽位填六个就已经在平均线之上,全部填满就能达到商业片水准。
公式:
[主体] + [动作/姿态] + [场景] + [风格/参考] + [镜头/构图] + [光线] + [情绪/色调] + [画质修饰]
每一个槽位,都是在替模型把一个它本来要"猜"的问题钉死:
- 主体——画面里是谁或是什么。"红发图书管理员"比"女人"强十倍。
- 动作/姿态——此刻正在做什么。动词决定构图。
- 场景——周围的世界。说清国家、年代、时辰。
- 风格/参考——"film noir"、"Ufotable 制作级动画"、"Wes Anderson 对称构图"、"Fenty Beauty 广告风"。调用已知的视觉语言,而不是堆无意义的形容词。
- 镜头/构图——"极近景"、"低角度广角"、"85mm 人像镜头,f/1.4"、"变形宽银幕镜头"。这是把快照变成电影帧的关键。
- 光线——"黄金时刻边缘光"、"单一 Rembrandt 光"、"湿地面霓虹反光"。光线占一张图 60% 的感觉。
- 情绪/色调——"冷青与暖橙对撞"、"暖琥珀加深阴影"、"去饱和的忧郁调"。
- 画质修饰——"超真实 4K"、"胶片颗粒"、"时尚大片"。保持简短,前面已经做了真正的重活。
基础 vs 优化——同一个主体的两轮对比

上面这张图对应的原始提示词是:
A woman standing in a room.现在用八槽位公式重写"同一个概念":
A breathtaking young woman with flowing auburn hair stands in a luxurious Art Deco penthouse at golden hour. She wears a champagne-colored satin slip dress that catches the warm light. Floor-to-ceiling windows behind her show a panoramic city sunset. Dramatic side lighting creates deep shadows and golden highlights on her face and bare arms. The composition follows the rule of thirds. Cinematic depth of field with gorgeous city bokeh. Fashion editorial quality. Ultra-realistic 4K.中文注释:Art Deco 风格顶层公寓里,一位长发女子站在落地窗前,黄金时刻的侧光雕出面部与手臂轮廓。

注意:优化版并不是堆了更多形容词,而是留给模型去猜的部分变少了。GPT Image 2 底层是一个由 transformer 引导的扩散模型(参见 Wikipedia 关于扩散模型的解释),每一个你没写的细节,模型都会用它的"先验平均值"去补。你不说"黄金时刻",它就默认给你一个多云周二下午两点的光。
最后补一条冷知识:GPT Image 2 的提示词上限是 20,000 字符——大约 3,000 英文单词。普通场景远远用不到,但对复杂的多人物场景或详细概念图来说,这个天花板意味着你的构图决策可以做得很细。第 11 节会演示长提示词怎么用。
提示词库:电影感场景
电影感场景是最容易拿捏的类别,因为电影史已经沉淀了一百年的视觉词汇。说出类型、年代、镜头,模型就能还你一帧像样的画面。

1. 新黑色香港后巷
Film noir cinematic shot. A dangerously beautiful femme fatale in a curve-hugging red silk dress with a thigh-high slit, walking through a rain-soaked Hong Kong back alley at night. Neon signs in Chinese characters reflect red and blue on the wet cobblestones. She carries a black umbrella over one shoulder, her red-painted lips the only warm color against the cold teal lighting. Smoke wisps from a nearby vent. Anamorphic lens, shallow depth of field, cinematic grain. Ultra-realistic 4K noir film frame.中文注释:雨夜香港后巷,红衣女子撑伞穿过霓虹反光的石板路。
2. 爵士酒吧 Rembrandt 光
Moody jazz bar interior. A mysterious woman in a sheer black lace dress sits on a velvet barstool, one leg crossed showing stiletto heels. Cigarette smoke curls around her silhouette. Warm amber spotlight from above illuminates her face and exposed collarbones while the rest fades into deep shadow. A saxophone player is a blurred silhouette in the background. Film noir meets modern luxury aesthetic. Dramatic Rembrandt lighting, 35mm film look. Ultra-realistic 4K.中文注释:爵士酒吧里的黑蕾丝女子,单一顶光+烟雾形成 Rembrandt 式戏剧光。
3. 银翼杀手屋顶
Cyberpunk cinematic wide shot. A lone detective in a wet black trench coat stands on a neon-drenched Tokyo rooftop at 3am. Giant holographic advertisements of a geisha float across the skyline behind him, casting shifting pink and cyan light on his face. Light rain catches the glow. Flying cars streak past as horizontal light trails. Shot on anamorphic lens, 2.39:1 aspect, shallow depth of field. Blade Runner 2049 color grade — teal shadows, orange highlights. Ultra-realistic 4K cinematic frame.中文注释:赛博朋克东京屋顶,全息艺伎广告投射的青粉双色包裹着侦探。
4. 韦斯·安德森对称大堂
Wes Anderson style cinematic composition. A 1960s hotel concierge in a burgundy uniform stands dead-center in a pastel-pink Art Deco lobby, flanked by perfectly symmetrical potted palms and brass sconces. Flat front-on framing, everything on center axis. Soft fluorescent overhead lighting. Pastel pink and mint green color palette. 35mm film look. Ultra-detailed 4K.中文注释:粉色 Art Deco 酒店大堂,正面对称构图,礼宾站在画面正中。
5. 韩式犯罪片厨房对峙
Cinematic still from a modern Korean crime thriller. Two men face each other across a small Seoul apartment kitchen at 2am, both holding knives but frozen in a tense moment. Single fluorescent tube overhead casts hard green-tinted light and harsh shadows. Steam rises from an abandoned pot on the stove. Tight composition, 40mm lens, handheld feel. Bong Joon-ho style. Ultra-realistic 4K.中文注释:凌晨两点首尔厨房里的两人对峙,日光灯偏绿的硬光+手持感镜头。

6. 维伦纽夫沙漠史诗
Epic cinematic wide shot in Denis Villeneuve style. A lone hooded figure in flowing desert robes walks across a vast orange sand dune at sunset. The sun is enormous on the horizon, casting elongated shadows. Scale is extreme — the figure is tiny, the landscape overwhelming. Dust kicks up in the wind. Warm amber palette with deep violet shadows. Shot on 65mm, ultra-wide aspect. Ultra-realistic 4K cinematic quality.中文注释:维伦纽夫式沙漠广角,人物渺小、景观压倒性。
7. 法国新浪潮咖啡馆
Black and white French New Wave cinematic still. A young woman in a striped Breton shirt and dark bob haircut smokes at a Paris cafe table in 1962. She looks off-camera with soft intensity. Natural window light, high contrast, slightly overexposed highlights. Film grain visible. Godard aesthetic. 35mm monochrome, 50mm lens. Ultra-detailed.中文注释:1962 年巴黎咖啡馆里的短发女子,法国新浪潮风格黑白。
8. 意大利 giallo 恐怖走廊
Cinematic horror frame in the style of a 1970s Italian giallo. A woman in a white nightgown stands at the end of a long Victorian hallway lit only by flickering red lamplight. Her back is turned. Shadow stretches toward the camera. Wallpaper is blood-red damask. Shallow depth of field, 28mm lens slightly distorted. Grainy film look. Deep red and black color story. Ultra-detailed 4K.中文注释:1970 年代意大利 giallo 风格,血红走廊尽头的白衣女子背影。
9. 迈阿密风云霓虹夜
1980s Miami Vice cinematic shot. A woman in a white linen blazer drives a red convertible at night through downtown Miami. Palm trees and neon motel signs blur past. She looks at the camera with sunglasses reflecting the pink and turquoise glow of the city. Lens flare, soft film grain. Teal and magenta color grade. Ultra-realistic 4K.中文注释:80 年代迈阿密夜景,红色敞篷车+墨镜反射霓虹。
10. 吉卜力真人化
Cinematic still styled as a live-action Studio Ghibli adaptation. A young woman in a simple blue linen dress stands in a vast green hillside field, wind blowing her hair and skirt. Fluffy white clouds race overhead. Soft golden hour light. Warm, painterly color grading with gentle film grain. Wide lens, low-angle composition making her heroic against the sky. Ultra-detailed 4K.中文注释:吉卜力风格的真人化山坡画面,低角度仰拍衬出天际线。
提示词库:人像与美妆
人像的成败只看三件事:镜头、光线方向、皮肤质感。写明"85mm f/1.4"或"环形灯"或"相机左前方柔光箱",能帮你直接跳过三轮无效迭代。

11. Fenty Beauty 级微距
Extreme close-up beauty portrait. A stunning model with wet dewy skin and tousled damp hair, bare shoulders glistening. Water droplets on her face and neck catch the light of a ring light. Flawless skin texture in macro detail — every pore, every water droplet razor sharp. Smoky eye makeup with subtle gold shimmer. Lips slightly parted, intense gaze at camera. Dark background. Fenty Beauty campaign aesthetic. 85mm macro lens, f/1.4, ultra-shallow depth of field. Ultra-realistic 4K.中文注释:湿润皮肤美妆特写,环形灯加持,每一颗水珠都锐利。
12. 巴洛克长椅人像
Luxury editorial portrait. A gorgeous model wearing an elegant black velvet off-shoulder gown reclines on a dark velvet chaise longue in a dimly lit Baroque-style room. One arm draped elegantly above her head. Rich warm Rembrandt lighting from a single window highlights the fabric draping against her glowing skin. Oil painting-like quality with deep shadows and warm highlights. High-end fashion editorial photography. 85mm lens, creamy bokeh. Ultra-realistic 4K.中文注释:天鹅绒长椅上的黑裙女子,Rembrandt 单窗光打出油画质感。
13. 干净商务证件照
Professional corporate headshot. A confident woman in her early 30s wearing a tailored navy blazer over a crisp white shirt. Neutral gray seamless studio background. Three-point lighting — soft key from camera left, subtle fill from right, rim light from behind. Genuine warm smile, direct eye contact. 85mm lens, f/2.8. Skin tone natural and healthy. LinkedIn executive headshot quality. Ultra-realistic 4K.中文注释:灰色背景+标准三点布光,LinkedIn 级别高管头像。
14. 东京街拍人像
Environmental street portrait. A 20-something Tokyo local with bleached blonde hair and oversized vintage streetwear stands in Shibuya on a weekday afternoon. Shallow depth of field with crowd of pedestrians soft-blurred behind her. Natural overcast daylight. She looks slightly off-camera, lost in thought. Shot on Fujifilm X100 aesthetic, 35mm lens, f/2. Ultra-realistic 4K.中文注释:涉谷工作日午后,漂染金发的女孩与人群虚化背景。
15. Vogue 级封面
High-end fashion portrait in the style of a Vogue Italia cover. A striking model with razor-sharp cheekbones wears an oversized metallic silver couture gown with architectural shoulders. She stares directly into camera with a cold, commanding expression. Hair pulled back tight. Studio lighting is a single hard light from 45 degrees creating sculptural shadows. Gray backdrop. 85mm portrait lens, f/5.6 for crisp detail. Ultra-detailed 4K.中文注释:Vogue Italia 封面质感,银色立体礼服+单一硬光雕塑式人像。
16. 自然光厨房人像
Soft natural light portrait. A woman with wavy chestnut hair sits by a large north-facing window in a quiet morning kitchen. She holds a ceramic mug of coffee in both hands, looking out the window thoughtfully. Warm cream sweater, no makeup, freckles visible. Shot in Rembrandt light with window as the only source. 50mm lens, f/1.8, shallow depth of field. Soft, honest, lived-in feel. Ultra-realistic 4K.中文注释:晨光厨房窗边的素颜女子,仅用单侧窗户自然光。
17. 单色戏剧光
Dramatic black and white portrait. A man with a short salt-and-pepper beard and intense dark eyes stares into the lens. Only half his face is lit — hard side light from camera right, pure black shadow on the other side. Textured gray background fades to black. Shot on medium format film aesthetic, 80mm lens. Film grain. Peter Lindbergh style monochrome. Ultra-detailed.中文注释:Peter Lindbergh 风格黑白人像,半脸硬光、半脸纯黑。
18. 粉色美妆大片
Dreamy pastel beauty portrait. A model with soft pink lips, dewy skin, and flushed cheeks against a blush pink seamless backdrop. She wears a sheer white off-shoulder top. Soft diffused lighting from a large softbox creates flattering even illumination. Hair in loose tousled waves. 85mm lens, f/2. Cotton candy color palette — pink, peach, cream. Ultra-realistic 4K beauty editorial.中文注释:粉色背景+大柔光箱,糖果色调美妆大片。
19. 黄金时刻浪漫
Sun-drenched golden hour portrait. A woman in a flowing cream linen dress stands in a wheat field at 7pm on a summer evening. The sun is low behind her, creating a halo of golden backlight through her hair and the sheer fabric. Lens flare across the frame. Her eyes are closed, face tilted up to the warmth. 135mm telephoto lens, f/2, compressed background. Warm honey color grade. Ultra-realistic 4K.中文注释:夏日黄昏麦田,逆光+135mm 长焦压缩空间。
20. 暗学院派图书馆
Dark academia editorial portrait. A young woman with auburn hair in a loose braid wears a wool cardigan over a white collared shirt in an old university library. She holds an open leather-bound book, reading by the light of a green banker's lamp. Towering bookshelves around her fade into shadow. Warm tungsten light, deep navy and olive color palette. 50mm lens, f/2.8. Ultra-realistic 4K.中文注释:老图书馆+绿色银行家灯,dark academia 氛围。
提示词库:动作与动态
动作场景需要两样东西:冻结时刻的词("frozen mid-air"、"high-speed capture")以及边缘光,用来把主体从混乱背景中剥离出来。

21. Nike 训练冻结帧
Dynamic action freeze-frame. An athletic woman in a fitted sports bra and high-waisted compression shorts executes a powerful spinning roundhouse kick. Water splashes frozen in mid-air around her legs and feet in a dramatic spray pattern. Her toned abs and defined muscles visible. Dramatic single-source rim lighting from behind creates a glowing silhouette edge. Dark studio background. Nike Training campaign energy. High-speed photography feel — ultra-sharp subject, motion blur on water droplets. Ultra-realistic 4K.中文注释:Nike 广告级高速摄影,水花冻结在空中。
22. 冲浪者管浪内景
Epic wide-angle shot of a female surfer riding inside a massive crystal-clear barrel wave at golden hour. Her silhouette and athletic body visible through the translucent turquoise water of the wave tube. Golden sunlight creates an explosion of light and water mist behind her. Dramatic backlit composition. The wave is enormous and perfectly formed. GoPro-style immersive perspective. Ultra-realistic 4K cinematic quality.中文注释:黄金时刻巨浪管内的女冲浪者,逆光剪影。
23. 跑酷屋顶腾跃
High-speed action shot of a parkour athlete mid-leap between two Brooklyn rooftops at sunset. Frozen at the apex of the jump, arms and legs extended, silhouetted against a burning orange sky. The gap below him is dizzying — city streets far below. Motion blur on the trailing edge of his hoodie. Shot from a drone at his height, 35mm lens. Ultra-realistic 4K cinematic action.中文注释:夕阳中的布鲁克林屋顶跑酷,跳跃最高点冻结。
24. 综合格斗擂台聚光
Dramatic fight night action. A female MMA fighter mid-spinning back elbow, sweat flying from her hair in a visible arc of droplets. Single harsh overhead ring spotlight isolates her from pure black background — classic boxing photography look. Her opponent is a blurred silhouette out of focus. 70-200mm lens at 200mm, f/2.8, 1/2000 shutter frozen motion. High contrast, desaturated. Ultra-detailed 4K.中文注释:MMA 擂台单一顶光,汗珠飞起的弧线清晰可见。
25. 越野摩托扬尘
Low-angle action shot of a motocross rider airborne over a dirt jump, red desert dust exploding behind the rear tire. Late afternoon sun casts long shadows. The bike is tilted aggressively mid-trick. Camera is just above ground level looking up, making the jump look monumental. Anamorphic lens flare from the sun. Orange and teal color grade. Ultra-realistic 4K action.中文注释:越野摩托腾空而起,后轮扬起红色尘土。
26. 芭蕾舞室跃起
Contemporary ballet dancer mid-grand jete frozen in the air, arms extended, body perfectly horizontal. She wears a simple nude leotard. Plain gray cyclorama studio background. Strong side-light from camera left creates a sculptural chiaroscuro on her musculature. Powder disturbed from the floor traces her leap in a soft cloud. 1/4000 shutter speed feel. Ultra-detailed 4K.中文注释:芭蕾 grand jete 最高点,身体与地面平行。
27. 篮球扣篮仰拍
Low-angle hero shot of a male basketball player mid-slam dunk, one hand gripping the rim, body extended diagonally across the frame. Arena lights streak as lens flares. Crowd is a soft blurred wall of phone flashes behind him. Frozen sweat and net motion. Shot on 24mm wide from directly below the hoop. NBA official photography energy. Ultra-realistic 4K.中文注释:球篮正下方 24mm 广角仰拍扣篮瞬间。
28. 骏马冲浪奔跑
A rider on a powerful black horse gallops through knee-deep shallow ocean water at sunrise. Water explodes from each hoofstrike, frozen in a dramatic spray. The rider is leaned low, hair streaming behind. Warm golden backlight from the rising sun. Mist rising off the water. Shot at 1/4000 shutter, 200mm telephoto compression. Ultra-realistic 4K equine photography.中文注释:日出时浅海,黑马奔腾水花炸开、200mm 长焦压缩。
提示词库:自然与风光
风光类的关键词是时辰、天气、垂直尺度。模型对"一般的漂亮自然"有非常强的先验,你必须用具体的词把它推离那个均值。

29. 瀑布雾气仙境
Ethereal fantasy nature scene. A graceful young woman in a flowing sheer gossamer dress stands at the edge of a towering waterfall cliff. Dense tropical mist swirls around her legs and the translucent fabric. She extends one arm toward the cascade, water droplets catching golden light. Aerial perspective slightly from above showing the dramatic cliff drop. Lush green ferns frame the composition. Golden hour light filtering through the mist. Ultra-realistic 4K cinematic quality.中文注释:悬崖瀑布边的白纱女子,航拍视角+雾气。
30. 马尔代夫航拍漂浮
Overhead drone shot of a beautiful woman in a minimal white bikini floating on her back in crystal-clear turquoise shallow water over white sand in the Maldives. Her long dark hair fans out in the water like a halo. The water is so clear her full body is visible through the translucent surface. Tiny fish swim nearby. Travel photography editorial style. Ultra-realistic 4K aerial quality.中文注释:马尔代夫正上方俯拍,清澈海水中漂浮的女子。
31. 冰岛黑沙海岸
Dramatic wide landscape of Iceland's Reynisfjara black sand beach at dawn. Massive basalt sea stacks rise from the churning North Atlantic. Low fog drifts across the black sand. A single figure in a red rain jacket walks along the shoreline for scale. Moody desaturated color grade — almost monochrome with just the red jacket as accent. 24mm wide lens, f/11 for deep focus. Ultra-detailed 4K.中文注释:冰岛黑沙滩+红色雨衣作为色彩锚点。
32. 红杉林教堂光
Vertical composition looking up through towering California redwood trees. Shafts of golden morning sunlight cut through the fog between the trunks like cathedral light rays. Ferns carpet the forest floor. A tiny hiker in the distance gives scale. Ultra-wide 14mm lens distorting the trunks into a radial pattern toward the sky. Warm green and gold palette. Ultra-realistic 4K nature photography.中文注释:14mm 广角仰拍红杉林,雾中教堂光束。
33. 巴塔哥尼亚镜面湖
Perfect mirror reflection of the jagged Torres del Paine peaks in a glass-still Patagonian alpine lake at blue hour. Pink and purple alpenglow on the snow-capped summits. A single orange tent on the near shore as human scale. Complete symmetry — upper and lower half of frame are near-mirror images. 35mm lens, f/11. Ultra-realistic 4K landscape.中文注释:蓝色时刻的完美镜面湖,上下对称。
34. 撒哈拉沙尘暴
Vast Sahara desert at the start of a sandstorm. Rolling orange dunes extend to the horizon, with a towering wall of sand approaching from the left. A lone nomadic figure on camelback is silhouetted against the dust cloud. Sun struggles through the haze as a dim orange disc. Cinematic wide-angle, heavy atmospheric haze. Monochromatic warm orange palette. Ultra-detailed 4K.中文注释:撒哈拉沙尘暴边缘,骆驼骑手剪影对抗尘墙。
35. 极光小屋
Wide landscape of a tiny warm-lit wooden cabin in a Norwegian fjord valley at 1am. A spectacular green and purple aurora borealis dances overhead, reflecting in the still black fjord water. Snow-dusted pine trees and mountains frame the scene. The cabin glow is the only warm color in an otherwise cold composition. 20-second long exposure feel. Ultra-realistic 4K astrophotography.中文注释:挪威峡湾凌晨一点的极光与温暖小屋。
36. 非洲草原日落
Cinematic wide shot of a family of elephants crossing a golden savanna at sunset in Kenya. The sun is a huge orange disc on the horizon, silhouetting the herd. Long grass ripples in the warm wind. Dust kicked up by the herd diffuses the backlight into warm beams. 200mm telephoto compression. National Geographic editorial style. Ultra-realistic 4K wildlife photography.中文注释:肯尼亚日落草原,象群剪影+200mm 长焦压缩。
37. 京都樱花河
Serene wide landscape of the Philosopher's Path in Kyoto at peak cherry blossom season. Pink petals float on the narrow canal, with more drifting down from the trees above. Traditional wooden bridges arch over the water. Early morning mist softens the light into diffused pink. A solo figure in a dark kimono walks along the stone path for scale. 50mm lens, f/4, gentle pastel color grade. Ultra-realistic 4K.中文注释:京都哲学之道樱花盛开季节,粉色花瓣漂在运河水面。
38. 苏格兰高地风暴光
Dramatic landscape of the Scottish Highlands during a clearing thunderstorm. Dark churning clouds above a lone glen, with a single shaft of golden sunlight breaking through and lighting one patch of heather-covered hillside. Rainbow arc barely visible at the edge. Ancient standing stones in the foreground. Moody cinematic color grade — steel blue shadows, warm sunlit highlight. 24mm wide, f/11. Ultra-realistic 4K landscape photography.中文注释:苏格兰高地雷暴将散时,唯一一束金色阳光从乌云中穿下。
提示词库:奇幻与风格化
一旦你在奇幻题材里具体点名一个艺术参考(Ufotable、Arcane、Studio Trigger、Magic: The Gathering 插画),提示词就会变得锋利得多。泛泛的"fantasy art"只会还你泛泛的奇幻画。

39. Ufotable 动漫战姬
Epic anime-inspired fantasy warrior princess with flowing silver-white hair that reaches her waist, wearing ornate golden battle armor that hugs her figure with intricate engravings. She holds a glowing magical sword aloft, emitting bright blue energy. Cherry blossom petals and magical sparkles swirl in a violent storm around her. Her expression is fierce and determined. Dynamic action pose mid-battle leap. Ultra-detailed anime with CGI-quality lighting — Ufotable production quality. Rich colors, dramatic volumetric lighting. 4K quality.中文注释:Ufotable 级动漫战姬,蓝色魔剑+樱花风暴。
40. 黑暗精灵女法师
Dark fantasy dark elf sorceress with long flowing midnight-purple hair, pointed ears, and luminous violet eyes. She wears an elegant off-shoulder dark robe with intricate silver embroidery that reveals her collarbones and shoulders. Purple arcane energy spirals from her outstretched hands, illuminating her face from below. A vast star field and nebula visible in the background through a shattered stone archway. Semi-realistic fantasy illustration style with cinematic lighting. Ultra-detailed 4K.中文注释:黑暗精灵女法师,紫色奥术能量从手中盘旋而出。
41. 吉卜力森林精灵
Studio Ghibli style painterly scene. A small forest spirit that looks like a glowing white fox with three tails walks through a mossy enchanted forest at dusk. Fireflies dance around it. Soft painterly brushstrokes, warm honey-gold light filtering through massive ancient trees. Hayao Miyazaki watercolor aesthetic. Ultra-detailed animation cel quality.中文注释:吉卜力风格三尾白狐在黄昏苔藓林中漫步。
42. Arcane 双城之战风
Arcane Netflix animated series style illustration. A young woman with blue-tipped braided hair and steampunk goggles leans against a graffitied alley wall in the undercity of Piltover. Neon magical rune-signs glow behind her. Textured painterly brushstrokes visible, 2D illustration with 3D depth, saturated purple and teal color story. Fortiche animation studio aesthetic. Ultra-detailed 4K.中文注释:Arcane Fortiche 风格下城小巷少女。
43. 万智牌巨龙
Fantasy illustration in the style of a Magic The Gathering card. A colossal red dragon emerges from molten lava in an underground cavern, wings half-spread, mouth roaring with fire breath forming. A tiny knight in silver armor stands at the cavern's edge for scale, raising a shield. Dramatic low-angle hero composition. Rich oil-painting texture, Greg Rutkowski influence. Ultra-detailed 4K fantasy art.中文注释:万智牌插画风格的熔岩红龙与渺小骑士。
44. 赛博武士
Cyberpunk fantasy fusion. A female samurai with a chrome katana stands on the rain-slicked rooftop of a neo-Tokyo megacorp tower at night. She wears a fusion of traditional kimono and carbon-fiber combat armor. Holographic cherry blossoms drift around her. Neon reflections on the wet rooftop, flying ad-drones in the background. Illustrated in the style of Katsuhiro Otomo meets modern 3D concept art. Ultra-detailed 4K.中文注释:赛博东京屋顶上的女武士+全息樱花。
45. 水下美人鱼
Ethereal underwater fantasy. A graceful mermaid with iridescent teal and violet scales swims through a coral reef illuminated by shafts of sunlight piercing the water surface above. Her long turquoise hair flows weightlessly. Bubbles trail from her fingertips. School of small silver fish swim past. Dreamlike painterly quality, Lisa Frank meets National Geographic. Ultra-detailed 4K fantasy art.中文注释:珊瑚礁中的虹彩美人鱼,光束自水面穿下。
46. 蒸汽朋克飞艇船长
Illustrated steampunk fantasy portrait. A young female airship captain in a brass-buttoned red military coat, goggles pushed up on her forehead, stands at the wheel of a wooden airship. Visible brass gears and copper pipes. Behind her, clouds and other distant airships. Warm golden hour lighting. Illustration style inspired by Nausicaa and Howl's Moving Castle. Ultra-detailed 4K.中文注释:宫崎骏风格女飞艇船长与黄铜齿轮。
多风格迭代:同一个主体,不同的世界
GPT Image 2 里一个被低估的工作流:锁定主体,只改风格槽位。你会很清楚地看到每种风格对同一张脸、同一套衣服、同一个姿态做了什么——下次选风格就不再靠猜。

基础提示词——主体在四次生成中保持完全一致:
A beautiful young woman with shoulder-length brown hair stands in a sunlit garden, wearing a simple white sundress, one hand lightly touching a rose bush. Soft golden afternoon light. Three-quarter body framing, slightly tilted head, warm smile.中文注释:阳光花园里触摸玫瑰的简裙女子,黄金下午光。
然后只切换风格槽位,每条跑一次:
47. 写实摄影
[Base] — Hyperreal fashion photography aesthetic. 85mm lens at f/1.8, soft natural light, editorial sharpness. Ultra-realistic 4K.48. 日式动漫
[Base] — Japanese anime style with cel shading, bold line art, vibrant saturated colors, large expressive eyes. Kyoto Animation production quality. Ultra-detailed.49. 古典油画
[Base] — Classical oil painting style with visible thick brushstrokes, warm Renaissance lighting, chiaroscuro shadow, Vermeer-like color palette. Museum-quality.50. 赛博朋克
[Base] — Neon-drenched cyberpunk futurism. Holographic overlays, circuit-pattern light tattoos on skin, magenta and cyan rim lighting. Ghost in the Shell art direction. Ultra-detailed.我们在内部测试号上跑这套序列,第一张大约 18 秒,后面几张风格切换耗时差不多。总共不到两分钟、48 credits,就得到一套完整的风格 moodboard。放在客户提案里,这相当于把原本半天的素材搜索压缩成一杯咖啡的时间。
常见失败案例与修复
诚实章节:GPT Image 2 很好用,但它不是魔法。以下是我们记录到频率最高的几类失败,以及对应的修复模板。把这一节当作排错清单来用——下次出图翻车时按顺序检查一遍,大多数问题都能在第一次修改之后解决。
失败 1:输出平淡无奇
Before:
A beautiful woman in a city.After:
A 28-year-old woman with auburn hair pulled into a low ponytail, wearing a camel trench coat, crossing a Manhattan crosswalk at 6pm on a rainy Thursday. Yellow taxis blur past in motion-blurred streaks. 50mm lens, f/2, cinematic grain. Ultra-realistic 4K.第一条提示词没给模型任何抓手。修复办法永远是具体的名词和具体的地点。
失败 2:手指数量错误
GPT Image 2 在手部表现上已经远好于第一代扩散模型,但手的特写仍然可能翻车。两种可靠的规避方式:
- 别让手成为主体,直接裁掉:"framing is shoulders up only"(只拍肩膀以上)。
- 让手里握东西:"hands gently holding a ceramic coffee cup"。有物体约束姿势,手指数量就稳了。
失败 3:图中文字乱码
模型不是排版软件。要在图中放 Logo、路牌、海报上的可读文字——要么极短("a sign reads OPEN"),要么在提示词里直接加一句:"no text, no letters, no words anywhere in the image",然后到 Figma/Photoshop 里再单独排版。
失败 4:光线方向被忽略
Before:
A portrait of a woman with dramatic lighting.After:
A portrait of a woman lit by a single hard spotlight from 45 degrees camera-left, with deep black shadow filling the right side of her face. Rembrandt lighting with a small triangle of light on the shadowed cheek."Dramatic lighting"什么都没说。说清方向、硬度、阴影覆盖范围才是真正的提示词。
失败 5:主体出现在错误的场景里
如果模型反复把人物放进通用摄影棚而不是图书馆——把场景挪到提示词最前面,并写得更具体:
In a candle-lit 17th-century English library with floor-to-ceiling oak shelves, leather-bound books, and a stone fireplace, a woman in…把场景放在主体之前,等于在引入人物之前就框定了整个构图。
失败 6:提示词过载
超过 1,200 词左右,单个形容词的影响力就开始被稀释。如果你的提示词是 40 个风格标签的流水账,模型会"取平均"。保留一个主风格锚(比如"film noir"),其他都当成辅助。
用满 20,000 字符:结构化长提示词
GPT Image 2 一个被低估的优势是提示词上限高达 20,000 字符。大多数竞品都卡在 1,000–2,000 字符左右。人像用不到,但对于多人物复杂场景、概念图 brief、或品牌一致性很强的系列图,结构化的长提示词非常值得用。
生产 brief 里我们常用的模板:
# SCENE
[场景:地点、时刻、天气、历史时期,2–3 句话]
# CHARACTERS
- Character A: [外貌、服装、当前姿势、表情]
- Character B: [同上]
- Background extras: [简短描述]
# COMPOSITION
[构图:广角/中景/特写;机位角度;镜头;景深;每个角色在画面中的位置 — 三分法/黄金分割/中心]
# LIGHTING
[光源、方向、硬度、色温、阴影行为]
# COLOR
[用 3–4 个色彩术语描述调色板。调色方向 — 暖/冷/分离调色]
# STYLE
[一个主风格参考。如"Roger Deakins 在《银翼杀手 2049》里的摄影风格"]
# TECHNICAL
[分辨率修饰、胶片颗粒、画幅、画质标签。保持简短]
# EXCLUSIONS
[避免的东西:"No text, no logos, no watermarks, no extra limbs"]示例——完整结构化提示词(约 500 词)用于一张广告主图:
# SCENE
A restored 1930s Art Deco ballroom on a rainy Tuesday evening in Paris, set during a private jazz performance. Tall arched windows on the left show wet boulevards and soft yellow streetlamp glow. Interior is lit warm and amber.
# CHARACTERS
- Lead: A striking 32-year-old woman with dark auburn hair in a low chignon, wearing a deep emerald-green silk bias-cut gown with a low back. She stands near a grand piano, one hand resting on its polished black lid, gazing thoughtfully toward the windows. Faint melancholy in her expression.
- Pianist: A middle-aged man in a black tuxedo, seated at the piano mid-performance, profile view, fingers on keys. He is a secondary figure — should not pull focus from the lead.
- Background: Three or four well-dressed patrons at candlelit round tables in soft bokeh, unidentifiable faces.
# COMPOSITION
Medium-wide shot. Lead character is on the right third of the frame, piano extending diagonally across the center toward the left. Rule of thirds. 50mm lens, f/2.2, shallow depth of field — lead and piano sharp, background patrons and windows softly blurred. Eye-level camera height.
# LIGHTING
Warm tungsten chandelier overhead providing ambient glow on the room. Key light on the lead is a single practical wall sconce camera-right at 45 degrees, modeling her face in gentle Rembrandt pattern. Rim from the windows behind her (cool blue rainy light) separates her hair and shoulder edge from the warm interior. Overall contrast: high but soft.
# COLOR
Deep emerald green (dress) and warm amber (interior) as hero colors, with cool blue window light as counter-accent. Warm gold dominant, with selective teal shadow detail. Film-look color grade reminiscent of early Wong Kar-wai.
# STYLE
Cinematic still in the visual language of In the Mood for Love meets a modern luxury cognac commercial. Anamorphic lens quality (slight horizontal flare on the candles). Painterly softness, 35mm film grain.
# TECHNICAL
Ultra-realistic 4K, 16:9 aspect, cinematic frame.
# EXCLUSIONS
No text, no signage, no logos, no watermarks, no visible phones or modern electronics, no extra limbs, no warped fingers on the pianist.分段结构有两层好处:一是让你自己不漏填任何槽位;二是给模型一个结构化的解析入口,而不是一口气 500 词的散文。整个系列只需要改 CHARACTERS 和 SCENE 两段,就能批量产出同一支广告的不同镜头。
一条实战建议:当一张图渲染到 80% 对了、但某一个元素不对(比如女主穿错颜色),不要重写整条提示词。复制成功的那条,只改对应的槽位,再跑一次。我们内部迭代日志显示:结构化提示词平均 2.8 次就能得到主图级别的一帧;而自由散文提示词常常超过 6 次。按 12 credits 一张算,这就是每张主图 $2 和 $5 的差别。
想把结构化提示词工作流交给同事?先让他们看上手教程,再回来看这篇。
常见问题
GPT Image 2 提示词里最重要的是什么?
光线和镜头——顺序就是这个。主体和场景写得模糊一点还能救,但光线方向和镜头选择一旦含糊,出来永远像库存图。如果你只有时间精修两个槽位,精修这两个。写清"光从哪个方向来、多硬、阴影落在哪一侧",再写清"多少毫米的镜头、多大光圈、多近多远",一张图的基本盘就稳了。
GPT Image 2 提示词应该写多长?
人像和简单场景,80–150 词是甜区。带年代与风格锚点的电影感广角,150–250 词。多人物场景或广告 brief,用结构化模板 400–800 词。20,000 字符的上限是留给极端情况的——日常使用很少会超过 500 词。
可以在提示词里写真实艺术家的名字吗?
你可以引用一种风格或年代——"film noir"、"1970s giallo"、"Studio Ghibli painterly"——模型会识别这些视觉语言。但直接用在世艺术家名字做风格标签,伦理上灰色、在模型侧也越来越多被过滤。更好做法是描述风格、媒介、年代,而不是点名个人。
为什么同一条提示词每次结果都不一样?
扩散模型本质上是随机的——它从一张噪声图开始去噪成成图。同一条提示词跑两次,必然得到相近但不同的结果。这是特性不是 bug,也是"多样性"的来源。想要复现,大多数生成系统支持 seed 参数。技术背景可以参考 OpenAI 的图像生成博客。
提示词的长度影响价格吗?
不影响。GPT Image 2 使用扁平定价:每张 12 credits,无论你写 20 词还是 2,000 词。影响成本的只有生成图片的数量。
一个概念应该试几次再放弃?
经验法则:同一条提示词跑 3 次感受自然方差,还不对就只改一个槽位,不要推倒重来。大多数时候要修的就是光线或机位。如果跑到第 8 次还没进展,就是结构出问题了——回到八槽位公式检查你到底填了几个。我们内部还有一个习惯:把每次生成的提示词和对应种子记在一个表格里,复盘时很容易看出哪一个词是真正起作用的。
GPT Image 2 生成的图能商用吗?
可以。按产品的标准条款,你生成的图归你所有、可商用。具体授权条款以站点页脚为准,涉及高风险场景(品牌广告、出版物封面等)建议咨询律师。另外,别把提示词里提到的真实人物或品牌商标当成免责金牌——那属于肖像权和商标权问题,不归 AI 产品条款管。
text-to-image 和 image-to-image 的提示词有什么不同?
text-to-image 从噪声起步,提示词是唯一指引。image-to-image 从你上传的参考图起步,提示词只是在修改它。image-to-image 的提示词应更短,聚焦"改什么"("改成油画风,保持主体姿势和服装不变"),而不是把整个场景再描述一遍——参考图已经提供了大部分槽位。
准备好开工了吗?
你手上现在有 50+ 条提示词、一套八槽位公式、一份失败案例修复清单,以及一个结构化长提示词模板。下一步就是打开工具真的跑一条。随便挑一条粘上去,看看输出离你脑海里的画面差多少——然后只修那个跑偏的槽位,再跑一次。两三轮之内,你就能稳定产出"可以直接交付"的图。
把这篇文章收藏到浏览器书签里,或者把八槽位公式贴在你的第二屏显示器边上。真正的提升不是记住这些提示词,而是把公式内化成肌肉记忆——之后你看到任何参考图都会自然地拆出它的主体、光线、镜头与风格。
继续阅读:
- 什么是 GPT Image 2?完整介绍与首次上手
- GPT Image 2 使用教程:一步一步带你上手
- GPT Image 2 vs Sora:诚实对比
- GPT Image 2 vs Kling:到底选哪个?
对某一条提示词有疑问?在站内给我们留言——我们会看每一条,提问频率最高的那几条,往往会出现在下一版指南里。理论背景可以配合 Wikipedia 关于文本生成图像模型的词条 一起读,10 分钟左右。想要进一步提升,下一步可以看同系列的使用教程,把工作流从"生成一张好图"升级到"稳定产出一组风格统一的图"。

