探索AI在地质科研绘图中的应用:ChatGPT与Midjourney绘图流程与效果对比
文章目录
- 个人感受
- 一、AI绘图流程
- 1.1 Midjourney
- (1)环境配置
- (2)生成prompt
- (3)完善prompt
- (4)开始绘图
- (5)后处理
- 1.2 ChatGPT
- 不合理的出图结果
- 解决方案
- 二、主题绘图结果展示
- 地球内部圈层
- 史前时期地貌演化模式
- 不同时期化石演化
- 板块运动
- 地质活动
- 地层褶皱
- 地震
- 海啸
- 雪崩
- 火山喷发
- 岩浆流动
- 冰川地貌
- 河流地貌-细小河流汇聚
- 河流地貌-河流穿过树林
- 河流地貌-上游至下游
- 河流地貌-河间地块
- 喀斯特地貌
- 风化侵蚀
- 水循环
- 土壤剖面1
- 土壤剖面2
- 土壤质地方块
- 孔隙含水介质
- 油气开采
- 卫星遥感
- 水库
- 地面沉降
- 三、出图效果对比
个人感受
AI擅长的主题 + 好的prompt + 局部重绘 + 后处理 = 好的出图效果
AI出图效果的好坏强依赖于prompt(提示词),直接根据某个宽泛的主题出图的效果通常很差,部分公众号等媒体在宣传中往往夸大了AI在科研绘图中的作用。
AI技术能够生成具有氛围感、真实感和艺术性的插图,作为科普插图是足够的。
然而,目前AI生成的插图在精确性、可控性和科学规范性方面仍存在一定局限,因此难以直接应用于严谨的科研论文插图中。
一些比较满意的出图
【风化侵蚀】
【冻土退化】
一些效果差的出图,缺乏逻辑
一、AI绘图流程
1.1 Midjourney
(1)环境配置
环境配置流程
- 开通会员账号/租共享账号
- 安装discord
- 创建服务器
- 添加Midjourney Bot至群组
- 用/imagine命令开始绘图
(2)生成prompt
方式1:用Midjourney的/describe
命令
方式2:上传图片至chatgpt生成绘图所需prompt
假如我想生成几乎一模一样的图片,请你给我这副图片的prompt
请详细的描述一下这张图片,生成prompt。以便我重新绘画,(可以忽略文
字)
(3)完善prompt
Midjourney的prompt分为三部分:
- 图片URL
- 文字
- 参数
图片URL通过上传图片到当前服务器,复制链接
获得,添加图片URL生成的图片在风格上会更贴近参考图。
文字prompt的获得见前文。
一些常用的参数有,其余参数在本文绘图中保持默认,见下图
--ar: 改变纵横比
--no:设置否定词,排除要素
(4)开始绘图
用空格作为分隔符,将上一步骤中三部分的prompt,进行拼接如:
运行命令后,会一次性生成四张图
(5)后处理
Midjourney会一次性生成四张图,四张图的数字编号排列为
1 | 2 |
---|---|
3 | 4 |
如果对生成的图不满意,可以
- 调节prompt(这个最直接)
- 点击刷新,重新生成4张图
- 选择
V
开头的选项,选择四张图中你偏爱的风格,生成相近的四张图
- 局部重绘
Midjourney有局部重绘工具vary (Region)
开启局部重绘窗口后,可选择右下角的套索工具
或矩形工具
选择要进行局部重绘的区域,基于prompt进行重新绘制。
可以看到局部重绘后的图片,仅在被选中的修改区域产生了变化,非选中区域则基本不变
- 修改结果图比例
点击custom zoom
工具
在原有prompt的基础上,修改参数项,为
# 命令范式
# --ar 目标图片比例 --zoom 1# 将图片输出尺寸修改为1:1,整体缩放不变
--ar 1:1 --zoom 1
修改尺寸后的绘图结果为
单击图片后,右键下载图片到本地
1.2 ChatGPT
ChatGPT的生图功能基于OpenAI的DALL·E模型,普通用户使用该功能限制为2张/24h,升级Plus账号后限制有放宽。
不合理的出图结果
本人使用体验ChatGPT生图功能遇到的错误有
(1)文字、箭头的错误
(2)无关的装饰要素过多
(3)对地下部分的描述脱离现实
地下生长的植物
轮船航行在地下
奇怪的地下结构
植物茎干部分和地下根系的错位
2D profile showing a mixture of vegetation with different root depths. Grass with shallow roots about 10 cm deep, shrub with medium roots about 50 cm deep, and tree with deep roots about 3 meters deep. Multiple soil layers visible beneath the vegetation.
解决方案
- 降低文字生成的可能,凸显主题
ChatGPT的绘图不支持排除项的指定,如指定出图不包含某个元素。因此直接设置出图结果不包含文字、箭头等,效果并不好。但可以通过输入肯定的prompt来凸显出主体,prompt如下
[global option] Focus on specific, visually representable elements. Describe actions and scenarios rather than abstract concepts. Avoid ambiguous language that could be interpreted as including text.
- 出图后用局部重绘功能,
remove
去除不想要的元素
二、主题绘图结果展示
地球内部圈层
An artistic cross-sectional diagram of the Earth showing its internal layers, including the crust, mantle, outer core, and inner core. Each layer is vividly colored, with distinct textures and gradients to represent the density and composition changes. The background is white, and the Earth is partially transparent to reveal the layers within. The inner core glows with a bright yellowish light, representing its heat and solid state.
【ChatGPT】
【Midjourney】
史前时期地貌演化模式
A detailed artistic diagram illustrating the evolution of prehistoric landscapes. The image is a spiraling timeline showing changes in terrain over geological periods, including mountains, rivers, forests, and deserts. Each segment of the spiral represents a distinct geological era, with vivid details like volcanic eruptions, glacial formations, and vegetation development. The background is white to highlight the colorful and intricate layers of terrain evolution.
【ChatGPT】
【Midjourney】
不同时期化石演化
A detailed artistic illustration showcasing fossil evolution across different geological periods. The image consists of a series of layered blocks, each representing a distinct time period, with stratified earth layers and corresponding surface ecosystems. Fossilized remains of plants and animals are depicted in each layer, showing gradual changes over time, such as dinosaurs, mammoths, and early human activity. The background is white, emphasizing the colorful and detailed transitions of geological and biological history.
【ChatGPT】
【Midjourney】
板块运动
Continent movement
A detailed cross-sectional diagram of Earth’s lithosphere showcasing plate tectonics. The image includes mountain formations, subduction zones, mid-ocean ridges, and volcanic activity. The layers of the Earth’s crust and mantle are clearly depicted with distinct textures and colors. The background is white, emphasizing the dynamic processes of plate movement, such as divergence, convergence, and magma rising at the mid-ocean ridge.
【ChatGPT】
【Midjourney】
地质活动
2D profile, geological activity at a mid-ocean ridge. The diagram depicts two tectonic plates moving apart due to magma rising from the mantle, forming new oceanic crust. The Earth’s layers are represented with distinct textures and colors, showing the crust, mantle, and magma. The background is white, emphasizing the dynamic process of seafloor spreading.
【ChatGPT】
【Midjourney】
地层褶皱
Stratigraphic folding in the Earth’s crust. The illustration features layered sedimentary rocks bent into an anticline and syncline structure. The layers are shown in different colors to represent their composition and depth. The surface is green, symbolizing vegetation, and the background is white to emphasize the geological deformation.
【ChatGPT】
【Midjourney】
地震
The aftermath of an earthquake in an urban setting. Collapsed buildings, tilted structures, cracked streets, and a derailed tram. Smoke and fire rise from destroyed buildings in the background, while people are depicted in chaos and rescue efforts. The destructive power of earthquakes with intricate details and a white background.
【ChatGPT】
【Midjourney】
海啸
a tsunami caused by an underwater earthquake. The image shows the ocean floor with a fault line, the displacement of water due to seismic activity, and the resulting waves propagating toward the coastline. Palm trees and small huts on the shore highlight the vulnerability of coastal areas. The background is white, emphasizing the dynamic process of wave formation and energy transfer.
【ChatGPT】
【Midjourney】
雪崩
A dramatic scene of a massive avalanche descending from a towering, snow-covered mountain. The avalanche rushes down the slope with immense force, its dense snow cloud and debris cascading toward the valley below. At the foot of the mountain, a pine forest surrounds a few houses, with people running in panic to escape the oncoming disaster. Animals, including deer and birds, are seen fleeing the area. The snow is already beginning to engulf parts of the landscape, creating a sense of chaos and urgency. The lighting is natural but slightly overcast, with a cold, white-dominated palette emphasizing the snow and tension in the atmosphere. Created using: cinematic composition, dynamic motion effects, realistic textures, vivid environmental details, high-definition quality, dramatic lighting, and an intense, natural disaster theme.
【ChatGPT】
【Midjourney】
火山喷发
A simplified illustration of a volcanic eruption showing a cross-sectional view of a volcano. The diagram features a cone-shaped volcanic mountain with lava flowing down its slopes and thick smoke and ash rising into the air. Surrounding the volcano are small patches of greenery and a water body at the base. The background is white, emphasizing the eruption process and the volcano’s structure.
【ChatGPT】
【Midjourney】
岩浆流动
A detailed cross-sectional illustration of magma flow, showing molten lava moving through volcanic channels and erupting on the surface. The diagram highlights the underground magma chamber feeding the lava flow, with bright orange and red tones representing heat and molten rock. The surface features fiery explosions and glowing lava spreading across rugged terrain. The background is white, emphasizing the dynamics of magma movement.
【ChatGPT】
【Midjourney】
冰川地貌
Glacial landforms in a mountainous region. The image features U-shaped valleys, cirques, tarns, and rivers flowing through the valleys. The terrain is rugged, with steep mountain peaks and lush green vegetation on the slopes. Small lakes are scattered in the valleys, connected by streams. The background is neutral, emphasizing the geological features shaped by glacial activity.
【ChatGPT】
【Midjourney】
河流地貌-细小河流汇聚
A detailed isometric illustration showing a river system in a mountainous region. The image features snow-capped peaks, dense vegetation, and multiple streams converging into a main river channel. The terrain is rugged with steep slopes and carved valleys. The river is depicted in blue, flowing dynamically through the landscape. The background is neutral, emphasizing the natural river system.
【ChatGPT】
【Midjourney】
河流地貌-河流穿过树林
A detailed isometric illustration of a meandering river in a forested landscape. The river is shown in blue, curving gently between lush green forests on both sides. The terrain features a cross-section of soil layers, emphasizing the riverbank’s structure. The trees are dense, creating a natural, serene environment. The background is neutral, focusing on the river’s flow and surrounding vegetation.
【ChatGPT】
【Midjourney】
河流地貌-上游至下游
a river’s journey from upstream to downstream. The image features snow-capped mountains at the source, a dam controlling water flow, and the river winding through various landscapes. Surrounding elements include dense forests, agricultural fields, orchards, bridges, and a city at the downstream end. The terrain showcases cross-sections of soil and rock layers, emphasizing the connection between natural and human-made features. The background is white, highlighting the progression of the river through the environment.
【ChatGPT】
【Midjourney】
河流地貌-河间地块
A simplified cross-sectional diagram illustrating fluvial landforms. The image shows valleys carved by river erosion, with U-shaped and V-shaped channels on the surface. Thin blue rivers flow through the valleys, highlighting the process of erosion and sediment transport. The terrain is composed of sandy or soil-like material, and the background is white to emphasize the geological features.
【ChatGPT】
【Midjourney】
喀斯特地貌
A detailed cross-sectional illustration of karst landforms, showcasing a landscape shaped by water erosion and dissolution of limestone. The diagram features sinkholes, underground rivers, caves, and cracks in the rock layers. Water flows through the system, creating interconnected channels and reservoirs. The surface includes grasslands and small streams. The background is white, emphasizing the internal and external features of the karst system.
【ChatGPT】
【Midjourney】
风化侵蚀
Weathering and Erosion
A detailed isometric illustration depicting a landscape shaped by weathering and erosion. The image features eroded rock formations, layered sedimentary structures, and a desert-like terrain. The terrain includes canyons, mesas, and a small basin filled with water. Soil and rock layers are exposed, highlighting the effects of natural forces over time. The background is white, emphasizing the geological processes that shaped the land.
【ChatGPT】
【Midjourney】
水循环
A detailed isometric illustration of the terrestrial hydrological cycle, featuring precipitation, surface water evaporation, groundwater flow, vegetation transpiration, and solar radiation. The diagram includes mountains, rivers, forests, and clouds. The sun is depicted as the primary energy source driving evaporation and transpiration. The background is white, emphasizing the interconnected processes of the water cycle.
【ChatGPT】
【Midjourney】
土壤剖面1
A detailed illustration of a soil profile featuring three distinct layers. The top layer shows plants with green leaves and roots extending into the soil. The soil layers are depicted with different textures and colors, ranging from dark, organic-rich topsoil to lighter subsoil and coarse, rocky material at the bottom. The roots penetrate through all three layers, connecting the vegetation to the soil. The background is white, emphasizing the soil structure and plant interaction.
【ChatGPT】
【Midjourney】
土壤剖面2
A detailed isometric illustration of soil layers from top to bottom, including humus, topsoil, subsoil, weathered rock fragments, and bedrock. The surface features a tree with deep roots extending into the subsoil and grass with shallow roots confined to the humus layer. Each soil layer is distinctly colored and textured to show its composition. The background is white, emphasizing the stratification and root interactions.
【ChatGPT】
【Midjourney】
土壤质地方块
An educational illustration of three vertical rectangular prisms placed side by side, representing different soil textures. The left prism shows salinized soil with a pale crust and dry, dead grass on top. The middle prism depicts fertile brown-yellow soil with green herbaceous vegetation on the surface. The right prism represents arid, cracked soil with visible fractures and no vegetation. The design is minimalistic, focusing on the textures and colors of the soil, with a clean white background.
孔隙含水介质
Porous Aquifer Medium
A simplified 3D illustration of a porous aquifer medium. The diagram shows a cube filled with interconnected pores and solid grains, representing the spaces where water can flow and be stored. The background is neutral blue to emphasize the porous structure and the contrast between the solid material and the voids. Each pore is highlighted to show water retention and movement potential.
【ChatGPT】
【Midjourney】
油气开采
A detailed cross-sectional illustration of an oil and gas extraction site. The surface includes a drilling rig, storage tanks, and infrastructure set on a desert landscape. Below the surface, multiple geological layers are shown, with a drilling well extending through the layers to reach the oil and gas reservoir. The reservoir is depicted as a black layer trapped between impermeable rock layers. The background is white, emphasizing the subsurface and drilling process.
【ChatGPT】
【Midjourney】
卫星遥感
Satellite remote sensing,real photo
【ChatGPT】
【Midjourney】
水库
an illustration showcasing watershed water resource management. A dam intercepts the river, storing water in an upstream reservoir. The reservoir’s bottom consists of bedrock, and water is regulated through the dam’s gates before being discharged into the downstream plain area.
【ChatGPT】
【Midjourney】
地面沉降
land subsidence caused by excessive groundwater extraction. The image features a coastal area with tilted and sinking buildings, cracked ground, and a lowered surface layer. Arrows indicate the upward flow of water from underground, and the subsurface layers show depleted aquifers. The background includes a mix of land and water, emphasizing the environmental impacts of over-extraction on urban and rural landscapes.
【ChatGPT】
【Midjourney】
三、出图效果对比
(1)出图效果对比
Midjourney的效果整体比ChatGPT更好,速度更快,更加方便二次修改
(2)成本对比
官方售价均在人民币100元左右,二次市场的共享账号方面,Midjourney略便宜。
【ChatGPT会员】
【Midjourney会员】
(3)风格差异
ChatGPT的绘图风格是绮丽绚烂的梦,通常具有较高的饱和度,色彩鲜明、对比度较强
MidJourney的绘图风格则相对柔和,饱和度较低,色调更为平衡一些
参考链接
1.[60张高清地质用图] https://mp.weixin.qq.com/s/spozxpFLvkstA7wZOZKAsA