Qwen-Image Edit gives Photoshop a run to work in second

Want smart insight into your inbox? Enterprise AI, only what matters to data and security leaders, sign up for our weekly newspapers. Subscribe now

Adobe Photoshop is one of the most recognizable pieces of software made so far, used by more than 90% of the world’s creative professionals. Photutorial,

So the fact is that New open source ai model , Qwen-Image EditAI researchers’ Chinese e-commerce giant Alibaba’s Quven team has been released tomorrow- Now able to complete a large number of editing jobs like Photoshop with text input aloneThere is a notable achievement.

Made on the 20 billion-layer-the-image foundation model released earlier this month, QWEN-Emage-Endition, the text rendering expands the unique strength of the system in text rendering to cover a wide spectrum of editing functions, which ranges from micro-appearance changes to comprehensive semantic changes.

Just upload an initial image – I tried one by myself Final annual conversion conference of venturebeat In San Francisco-and then type the instructions of what you want to change, and the Quven-Emins-edit will return a new image with those editing.

AI scaling hits its boundaries

Power caps, rising token costs, and entrance delays are re -shaping Enterprise AI. Join our exclusive salons to learn about top teams:

Transform energy into a strategic profit

Architecting efficient estimates for real thrruput benefits

Unlocking competitive ROI with sustainable AI system

Secure your location to stay ahead,

Input image example:

Photo Credit: Michael O’Donal Photography

Output image examples with the prompt: “Make man wearing Tuxedo.”

The model is now available in many platforms, including Qwen chat, Throat face, Modelcope, GithubAnd through Alibaba Cloud Application Programming Interface (API)The latter that allows any third-party developer or enterprise to integrate this new model in its own applications and workflows.

I set my examples above Qwen chatRival to the Chatgpt of the qwen team’s openai, however, any aspiring users should be noted that it should be noted that the generations are limited to about 8 free jobs (input/output) over a 12 -hour period before resetting generations. Paying users may have access to more jobs.

With support for both English and Chinese input, and a dual focus on both semantic meanings and visual loyalty, Qwen-Emage-Editing The purpose is to reduce obstacles for professional-grade visual content manufacturing.

And given that the model is available as an open source code Apache 2.0 under licenseIt is safe for enterprises to take, download and set on their own hardware or virtual clouds/machines, resulting in a huge cost saving from proprietary software such as Photoshop.

As “It can remove a strand of hair, very delicate image modification.”

The team’s declaration resonates this feeling, which does not introduce Qwen-Emage-Edit as a new system, but as a natural expansion of Qwen-image that applies its unique text rendering and double encoding approach to direct editing functions.

Dual encoding allows to preserve the style and material of the original image

The foundation is built on the foundation installed by Qwen-Emage-Edit Cowen-imageWhich was introduced as a large -scale model earlier this year, specializing in both the image generation and lesson rendering.

The technical report of Qwen-Emage highlighted its ability to handle complex tasks such as paragraph-level text rendering, Chinese and English characters and accuracy with multi-line layouts.

The report also emphasized Dual hesitant systemA variant autoocheer (VAE) for feeding images simultaneously in Qwen2.5-Vl for cementic control and reconstruct expansion. This approach allows editing that remains loyal to both signs and looks of the original image.

The same architectural options underline Qwen-Image-Editing. By taking advantage of dual encoding, the model can accommodate at two levels: Semantic editing This changes the meaning or structure of a view, and Attendance editing Introducing or removing the elements while keeping the rest untouched.

Semantic editing Creating new intellectual property, rotating 90 or 180 degrees to reveal individual ideas, or transforming an input into another style such as studio adjacent art. These edits usually modify many pixels but preserve the underlying identity of goods.

Is here An example of semantic editing From Sridhar Ethinarayanan, an engineer from the AI application platform, who used a replica-hosted implementation or “estimation” of Quven to resume a picture of Manhattan to look like a toy Lego set.

Attendance editing Accurately focuses on local changes. In these cases, most of the image remains unchanged while specific objects are replaced. Demonstration involves adding a signboard that produces a reflection in water, removes stray hair strands from a picture, and a lesson replaces the color of the single letter in the image.

A good example of editing attendance with Quven-Imez Edit comes from Uttarai’s co-founder and CEO Thomas Hill who posted Side-by-side on X Showing his wife under a radical in his wedding dress and covered with frescoes with one and the same archway:

In presenting Chinese and English text, the editing-centric system combined with the established strength of Quven, has been deployed as a flexible tool for the creators that require more than simple generative imagines.

Dual control over cementic scope and appearance loyalty means that the same device can meet very different needs, from creative IP development to production-level photo retchings.

Adopt the images

There is another standout capacity Bilingual text editingQwen-image-edit allows users to add, remove or modify the text in both Chinese and English, preserving the font, size and style.

This strong text extends over the reputation of Quven-image for rendering, especially in challenging scenarios such as complex Chinese characters.

In practice, it allows for accurate editing of posters, signs, T-shirts, or calligraphy artifacts, where small text details matters, as seen as seen Another example of repeating below,

A performance involved correcting errors in a piece of sugar calligraphy generated through a step-by-step chain editing process.

Users can highlight wrong areas, instruct the system to fix them, and then refine the details until the right characters are provided. This recurrence approach shows how accuracy is necessary where the model can be implemented on high-day editing functions.

Use applications and cases

QWEN team has highlighted a series of potential applications:

Creative Design and IP ExtensionLike a mascot-based emoji pack.

Advertisement and material manufactureWhere logo, signage and text-Haavi visuals can be adapted.

Virtual avatar and artWith style transfer that supports unique character representation.

Photography and personal useIncluding background adjustment, cloth changes and object removal.

Cultural protectionClassical calligraphy was displayed through correcting the functions.

By bridging fine editing with widespread creative changes, Qwen-Image-Edit fulfills professionals who require controls while being acceptable for accidental use.

Benchmarking and performance

According to the Quven team, the evaluation in the public benchmark indicates that the Quven-Emins delivers the Edit Sophisticated performance In image editing.

This is as follows by comprehensive covane-image technical evaluation, where the Aadhaar model achieved leading results in both the general image generation and lesson rendering functions.

While the specific editing benchmark figures in release were not detailed, Quven-images gave themselves excessively in independent assessment such as AI Arena, where human rats compared the output in the model of various providers.

API pricing and availability

Through Alibaba Cloud Model StudioDevelopers can use covane-image-edit as an API. Pricing is set $ 0.045 per imageWith a free quota 100 images are valid for 180 days After activation.

Service is initially available Singapore regionWith the rate limit of Five requests per second and by Two concurrent work per account,

To use API, developers should get a model studio API key and can call the model via HTTP or through dashskop SDK in Python or Java.

Images can be presented as the URL or base 64 format, in which the supported resolution range from 512 to 4,096 pixels and file size 10 MB. The output images are hosted on the Alibaba cloud object storage with a valid link for 24 hours, which requires users to immediately download and save.

What’s next for Qwen?

Qwen image the image as a step tawlerD reduce obstacles for visual material manufacture. Accurate, style-compatible editing more accessible, model, model Can support applications from design studios for casual users refining individual projects.

System AI also indicates a comprehensive tendency in development: moving from a single-obvious generation towards editing, improvement and integration devices.

With both cementic flexibility and appearance-level precision, the Quven-Emez-edit indicates this change, which inserts the general strength of large models with the reliability required for professional editing.

Daily insights on business use cases with VB daily

If you want to impress your boss, VB daily has covered you. We give you the scoop inside what companies are doing with generative AI, from regulatory changes to practical deployment, so you can share insight for maximum ROI.

Read our privacy policy

Thanks for membership. See more VB newsletters here.

There was an error.

What's Hot

I tried 0patch as a last resort for my Windows 10 PC – here’s how it compares to its promises

A PC Expert Explains Why Don’t Use Your Router’s USB Port When These Options Are Present

New ‘Remote Labor Index’ shows AI fails 97% of the time in freelancer tasks

Adopt the images

Use applications and cases

Benchmarking and performance

API pricing and availability

What’s next for Qwen?

I Found the Best Way to Run an Internet Speed Test (And Use the Results for Better Wi-Fi)

OpenAI, Anthropic and Google all have new AI healthcare tools – here’s how they work

Advertisements are coming on Chatgpt. Here’s how they’ll work

Microsoft’s new text editor is a VIM and Nano option

The best luxury car for buyers for the first time in 2025

Massives Datenleck in Cloud-Spichenn | CSO online

Most Popular

10,000 steps or Japanese walk? We ask experts if you should walk ahead or fast

FIFA Club World Cup Soccer: Stream Palmirus vs. Porto lives from anywhere

Google tests AI-operated audio overview in search results for some questions

Our Picks