Wan 2.7 Image API

alibaba/wan-2.7-image

by Alibaba Group•release date: 4/1/2026

Wan 2.7 Image is Alibaba's unified model for realistic face customization, text-to-image, precise color control, editing, and multilingual text rendering.

$0.048per request

Wan 2.7 Image API - Background

Overview

Wan 2.7 Image, also known as Wan2.7-Image or 万相2.7图像生成模型, is a next-generation unified image generation and editing AI model developed by Alibaba Group's Tongyi/Qwen team. Released in April 2026, it is designed to address common challenges in AI image synthesis, such as generic facial outputs, imprecise color control, blurry text rendering, and poor adherence to complex instructions. The model emphasizes usability, precision, and production-grade control, making it highly suitable for business and creative applications via the Wan 2.7 Image API.

Development History

The Wan 2.7 Image model is part of the Wan (万相) series, reflecting Alibaba's ongoing investment in advanced generative AI. Developed by the Tongyi/Qwen team, the model was officially launched in April 2026. Its architecture and feature set were shaped by extensive feedback from design, marketing, and content creation industries, focusing on practical pain points such as facial diversity, color accuracy, and robust text rendering. The API-first approach ensures seamless integration for developers and enterprise users.

Key Innovations

Shared latent space enabling unified text-to-image, image editing, and multi-image fusion
Realistic face customization with fine-grained control over facial features
Advanced color palette extraction and application for brand consistency

Wan 2.7 Image API - Technical Specifications

Architecture

Wan 2.7 Image employs a shared latent space architecture, supporting both text-to-image and image editing tasks within a single framework. The model integrates chain-of-thought reasoning for logical consistency and supports interactive editing through natural language and reference images. It is optimized for API deployment, allowing flexible input parameters and batch processing.

Parameters

The exact parameter count is not disclosed, as the focus is on production usability rather than sheer scale. The model supports high-resolution outputs up to 4K (4096x4096) in the Pro version and flexible aspect ratios, with efficient performance for API-based workflows.

Capabilities

Realistic face customization with prompt-based fine control
Precise color palette extraction and application from reference images or HEX arrays
Multilingual and high-fidelity text rendering for up to 3000 tokens in 12 languages

Limitations

Artistic style diversity may be less pronounced compared to some aesthetic-focused models
Slightly increased generation time when using advanced reasoning (Thinking Mode)

Wan 2.7 Image API - Performance

Strengths

Exceptional text rendering clarity and multilingual support
High accuracy in color reproduction and brand consistency

Real-world Effectiveness

In real-world deployments, the Wan 2.7 Image API delivers robust performance for business-critical applications such as product imagery, marketing collateral, and branded visual assets. Its ability to follow complex prompts, render precise text, and maintain color consistency makes it a preferred choice for enterprises seeking reliable, scalable image generation and editing. The API supports batch processing, seed control, and interactive editing, ensuring high productivity and output consistency.

Wan 2.7 Image API - When to Use

Scenarios

You have a need to generate branded product images for e-commerce or marketing campaigns. The Wan 2.7 Image API excels at extracting and applying precise color palettes from reference images or HEX codes, ensuring brand consistency across all outputs. This reduces manual editing time and ensures visual uniformity for large-scale product catalogs.
You are designing promotional materials, posters, or infographics that require clear, multilingual text rendering. The Wan 2.7 Image API supports up to 3000 tokens of readable text in 12 languages, outperforming most models in text clarity and layout. This is ideal for businesses targeting diverse markets or producing information-rich visuals.
You need to create diverse, realistic avatars or character illustrations for gaming, social media, or virtual events. The Wan 2.7 Image API enables fine-grained control over facial features, avoiding generic outputs and allowing for unique, lifelike portraits. This enhances user engagement and personalization at scale.

Best Practices

Leverage the API's colorPalette and prompt parameters to ensure precise control over visual style and brand alignment.
Utilize Thinking Mode for complex scene compositions or when logical consistency between multiple elements is critical.

Technical Specs

Context Length5,000

Release Date4/1/2026

Input Formats

textimage

Output Formats

image

Capabilities & Features

Capabilities

text to-image generationrealistic human face customizationprecise color palette controlmultilingual text renderingimage editingmulti image fusion and referenceimage set/sequential generationchain of-thought 'Thinking Mode'marquee/box pixel level editinghigh consistency batch output

Supported File Types

.png.jpg.webp

← Back to Search