Product photographyAugust 22, 202524 min read

State of generative AI technology for product photography: creating lifestyle perfume shots with AI

Discover how generative AI is transforming product photography. We test 5 AI background generator tools/models to create lifestyle perfume shots—no edits, just raw results. See which delivers the most authentic product visuals.

Table of contents

Generative AI creates new visuals from scratch: imagined objects, places, and scenes. But product photography plays by different rules. It’s not about inventing; it’s about showing the product as it is.

That raises some interesting questions:

Can a lifestyle photoshoot be replaced entirely by generative AI?
Which AI background generator is best to achieve an authentic lifestyle shot?
Can these images be trusted to represent real products accurately?

Before you start

This article takes a close look at how generative AI is being used to create lifestyle perfume bottle shots and what that means for the future of product photography.We’ll compare 5 different AI background generator tools/models using a single prompt, with zero additional edits. As if the photos were generated by average users, who aren’t experts, and expect to achieve acceptable results as the tools promise. This approach lets us test how AI technology performs in a realistic scenario.

Can generative AI be a game-changer for lifestyle product photography?

Today’s business is all about finding quick, cost-efficient, and effective ways of producing content. Up to recently, lifestyle photography required meticulous planning, budgeting, finding a studio location, proper photo equipment, and an expert photographer. Now, generative AI promises a potentially simpler and more efficient way: all you need is a packshot, a generative image-to-image AI tool, and a good prompt. The promised result is a perfect lifestyle image with a stunning AI-generated background in no time and at a fraction of the cost. But is that really the case?

Time for a test: 4 different fragrance bottles, 4 challenges for AI

To thoroughly test how generative AI models can handle virtual photoshoots, we decided to select perfumes as a representative example. Perfume bottles, being transparent, reflective with distinct branding, pose challenges for AI algorithms for proper lighting, blending with the environment, maintaining authentic branding, and captions.

We opted for four different fragrances, each representing a different style and challenge for algorithms, from metallic reflections, transparency to intricate ornamentation, and non-standard shapes.

Although perfumes are used as a primary example, the results of this research can be applied broadly to other types of products.

Time for a test: perfumes

Just Cavalli (Roberto Cavalli)— an elegant bottle with a metallic finish and a distinctive logo that reflects its surroundings in the light. Why we chose this: Good to test how different models blend reflective products with the environment. Additionally, the bottle features a futuristic design, making it ideal for a CGI scene with a sci-fi aesthetic. We immediately wanted to create something that resembles a 3D rendering.

Qaed Al Fursan (Lattafa) — a square bottle in an oriental style with intricate gold and black graphics and Arabic inscriptions. Why we chose this: We wanted to test how well non-Latin texts and patterns are replicated by the AI tool.

Spicebomb Extreme (Viktor&Rolf) – a designer grenade-shaped bottle with a matte black finish and a copper-colored metallic band. Why we chose this: Generic, simple product that shouldn’t create issues for a generative AI tool.

Devotion (Dolce & Gabbana) – a classic transparent bottle with a decorative gold heart-shaped plaque in a vintage style. Why we chose this: Chosen for its transparency as well as complicated and distinctive ornament with branding.

Time for a test: AI tools

Generative image-to-image AI technologies are creating a new image, based on the input image and the prompt. By design, a genAI model “wants” to change the input image and specifically the product within it. Older technologies struggled to maintain product fidelity in the newly generated scene, and the original product was usually distorted. When fidelity was preserved, the product often appeared artificially blended with the environment. The most advanced tools can balance this by preserving product authenticity in the new image while seamlessly integrating it into the new environment through realistic reflections, shadows, adapted lighting, and transparency.

There are hundreds of virtual photoshoot tools out there. Most of them rely on the same base technologies/AI models. We decided to pick the most popular AI models and tools that promise high-fidelity results.

Midjourney - an advanced AI image generator known for creating extremely realistic, stylized, and artistically stunning backgrounds. Its biggest advantage is a deep visual style, which attracts creators, graphic designers, and marketers.
ChatGPT model 5 - an image generator integrated with ChatGPT based on the gpt-image-1 model. It creates images based on text descriptions or with image input. It’s easy to use, and to some degree, output image fidelity can be controlled.
Flux.1 Kontext Pro - a model for generating scenes and editing images that promises high input image fidelity. Specifically designed to maintain high product fidelity (in this context). There are two options Flux.1 Kontext Pro or Flux.1 Kontext Max. We decided to go for the “Pro” variant, which is less expensive, supposedly less accurate, but we found it generated better results for our test.
Flair AI - an image background generator and photo editor for product photos. Claims to create “photorealistic product images that are indistinguishable from professional photography. Accurately renders textures, reflections, and lighting to create stunning product visualizations.”
Nano Banana (Gemini 2.5 Flash Image) – an intelligent image generator and editor model from Google, designed for conversational use. Its key strengths are consistency of characters across multiple edits, seamless image blending, and extremely fast performance (“instant Photoshop”). All outputs are watermarked and embedded with SynthID for traceability and safety. Perfect for creators who want natural, intuitive image editing in a single tool.

Time for a test: input packshots

All packshots were taken in high resolution, in PNG format with a transparent background, maintaining semi-transparency in bottles. We used our automated photo studio ALPHASHOT PRO G2 with Orbitvu Station software.

High-quality input images are crucial for maintaining precision when generating AI backgrounds. This quality allows for an accurate assessment of how algorithms handle details, edges, and integration with the generated scene.

Comparison of D&G

So, we have 4 products and 5 popular AI background generators. For every perfume, we prepared a separate prompt describing a lifestyle scene, generated 2-4 photos, and picked the best one. To measure the quality of AI models, we took into account key lifestyle photography features, assigning points for each:

Product fidelity (max. 10 pts.): The ideal generated image should accurately maintain the product's shape, colors, and distinct features, like transparency and reflection. Maintaining product branding, captions, and ornaments is crucial. A score of 10 points means that no additional post-production would be required to achieve a result comparable to traditional methods, which is crucial in lifestyle product photography.
Environment blending (max. 8 pts.): The product should blend in naturally with the generated environment/background. Reflections, colors, lighting, and shadows should all match the generated surroundings. This is important for the perceived quality of lifestyle photography, but not as important as product fidelity. An 8-point score indicates results comparable to a traditional photoshoot.
Scene aesthetics (max. 7 pts.): This includes composition, the creativity of the scenery, and the natural appearance of the scene. It’s our subjective measure.
Prompt adherence (max. 5 pts.): The scene should be generated as described, and the product's position should be maintained. While important for a stylist's workflow, this is less critical than product fidelity. Max. 5 points for 100% prompt following.

Comparison of D&G

The prompt:

“A luxurious Mediterranean terrace overlooking the sea, with a panoramic view of a sunlit coastline and deep blue water. Elegant stone surface in the foreground, surrounded by blooming citrus flowers, green glossy leaves with morning dew, and subtle elements like vanilla pods and candied fruit pieces. Bright, clear sky, a few yachts sailing in the distance. Sophisticated, warm summer atmosphere — perfect backdrop for a high-end fragrance product. Keep the original angle, position, and perspective of the perfume bottle from the uploaded image exactly as it is. Create in resolution 16:9, maintain original identity, and input fidelity to high.”

Midjourney

AI background generator: Midjourney

Our take: Bottle shape and proportions, logotype, and ornament are only slightly distorted. Overall, product features are well-preserved. The product doesn’t blend in perfectly with the background: the reflections in the cup are studio-like (like reflections from the environment), the transparency is somehow handled, but in reality, the bottle is less transparent (real transparency was provided in the input image). Also, the shadow is a little too big for a small transparent bottle. The position of the bottle is maintained as requested in the prompt. The scenery, however, is clearly artificial, and the prompt regarding the perfume ingredients hasn’t been fully followed. Overall score: 63%

Flux.1 Kontext PRO

AI background generator: Flux.1 Kontext PRO

Our take: The product's proportions in the image differ from the real product, appearing wider and bulkier. While the fluid color is slightly altered, this may be an adaptation to the scene lighting. The product nicely blends into the new scene, featuring a pleasing reflection from the light in the bottom left corner. Transparency is well highlighted and aligns with the actual product. Although the reflection in the cup is modified and doesn't match the environment, it still surpasses other models. The shot's perspective was modified from the straight-on packshot. We tried several other attempts modifying the prompt, but somehow the model “insists” on the angled diagonal shot of the fragrance. Overall, the scene looks natural and pleasing. Overall score: 70%

Chat GPT model

AI background generator: Chat GPT model 5

Our take: The fragrance proportions and shape in the image differ significantly from the real product: the cup is longer and thinner, and the bottle is bulkier. The branding and ornament are well-maintained. Fluid color is altered too much, even considering the scene lighting. The product blends well into the new scene, with natural shadow and semi-transparency in the bottle. The reflection in the cup is modified and doesn't match the environment, nor the lighting, which is coming from left, not right. Position is not maintained. Again, this model also tries to “improve” it. Apart from that, the AI model followed all the prompt instructions. When it comes to aesthetics, the scene looks quite artificial, especially the flowers and oversaturated colors. Overall score: 57%

Flair AI

AI background generator: Flair AI

Our take: The bottle cup proportions and shape differ significantly from the real product: the cup is longer and thinner in the original image. The branding and ornament are distorted: the ornament and logotype are “reinvented” by the model. Fluid color is altered too much: oversaturated. The product blends well into the new scene, with natural shadow and semi-transparency in the bottle, which distorts elements behind the bottle. The reflection in the cup is modified; it doesn't match the environment and the lighting, which is coming from the left, not from both sides. Position isn’t maintained. This model also changes the product position, although instructed to maintain the one from the input image. The AI model followed all the prompt instructions. As for aesthetics, the scene looks quite artificial, especially the flowers and oversaturated colors, similar to ChatGPT. Overall score: 50%

Nano Banana

AI background generator: Nano Banana

Our take:The generated image of the D&G fragrance bottle is a strong and faithful reproduction of the original. The proportions of the cap and bottle are preserved accurately, and the ornate heart-shaped emblem with the DG monogram is well-rendered, maintaining the brand’s recognizable detailing. The liquid color, while slightly richer, is natural and fits the warm tone of the overall composition rather than feeling oversaturated. As for the blending with the background, the bottle integrates naturally into the bright coastal background, with realistic shadowing and convincing semi-transparency in the glass that distorts the view behind it. The lighting direction is coherent, and reflections on the cap, though stylized, don’t break the visual harmony. The added flowers, sugared fruit, and vanilla sticks enrich the storytelling but look somewhat artificial. Overall, this result balances product fidelity with an aesthetically pleasing scene. Overall score: 87%

Comparison of Spice Bomb

The prompt: “A high-end dramatic studio background with large autumn leaves bursting from the center, water splashes surrounding the base, cinematic lighting with a gradient grey-to-white backdrop, hyperrealistic detail, luxury advertising style. Do not modify the original perfume bottle; leave it exactly as it is. Create in resolution 16:9, maintain original identity and input fidelity to high.”

Midjourney

AI background generator: Midjourney

Our take: Although at first glance, the image looks very appealing, there are many issues. The bottle proportions differ significantly from the real product: the generated perfume is slimmer, when in reality it’s bulkier. The branding is distorted. Moreover, the model added the SKORTEO M5 caption, which doesn’t exist in the real product. The bottle has no transparency, but Midjourney added it to the lower part of the bottle. The product blending with the new scene is ok, but nothing sophisticated. Product position is well maintained. The AI model followed the prompt instructions well (apart from product alteration). Overall, the scene looks appealing, and the model was very creative in generating it. Overall score: 53%

Flux.1 Kontext PRO

AI background generator: Flux.1 Kontext PRO

Our take: Not as appealing as Midjourney, and without the “wow effect”. The bottle proportions differ only slightly from the real product. The branding is a bit distorted and blurred. The bottle opacity is preserved. The product blends quite well with the new scene, but the product was made darker and lost many details. The reflective surfaces don’t catch reflections from the environment. The position is well maintained. The prompt instructions were well adhered to. Overall, even if the bottle is too dark, the scene doesn’t look that bad and, in our opinion, better than ChatGPT or Flair.AI. Overall score: 67%

Chat GPT model 5

AI background generator: Chat GPT model 5

Our take: It’s even less appealing than the Flux model. The bottle proportions differ slightly from the real product: it’s made slimmer by ChatGPT. The branding is distorted: a different font, “O” letter instead of “&” inside “O”. The product blends with the new scene; however, there are no reflections from the environment. The lighting looks good, and the product details are highlighted. The position is well-maintained, and the prompt was followed, except for branding. The scene looks very artificial and AI-generated-like. Overall score: 57%

Flair AI

AI background generator: Flair AI

Our take: The bottle proportions differ from the real product: it’s made bulkier by Flair.ai. There is a collar missing at the spray part. The branding is altered: “&” letter instead of “&” inside “O”. The product blends well with the new scene but lacks authenticity - there are no reflections from the environment. The lighting looks good and natural. The position is well maintained, and the prompt was generally followed. The scene looks unnatural, sort of made in a studio with the floor and background clearly visible. Overall score: 53%

Nano Banana

AI background generator: Nano Banana

Our take: The generated version of the Spicebomb Extreme bottle remains faithful to the original in terms of proportions, shape, and detailing, accurately reproducing the grenade-inspired design and metallic band. The logo and typography are oversharpened and well-preserved, with a small mishap: “&” in a circle is replaced with a “$” sign. As for the creative scene setting, the product is surrounded by autumn leaves and dynamic (but somewhat poorly-looking) splashes of water, which add energy and a seasonal context but also create a more stylized, less photorealistic look. The lighting and reflections on the bottle are consistent with the central studio-style illumination, though the added background elements introduce a contrast that feels slightly artificial. Overall, the integration is visually striking and enhances the product’s identity, but it emphasizes aesthetics over realism. Overall score: 77%

Comparison of Just Cavalli

The prompt: “Create a cinematic, futuristic background environment with a high-tech, metallic aesthetic. The rendering scene should feature smooth, reflective steel surfaces, glowing blue ambient lights, and layered geometric architecture with concentric rings, panels, and structural depth — evoking a luxurious sci-fi atmosphere. The lighting should be dramatic, with cool-toned reflections that enhance the sleekness of the setting. Avoid clutter — the environment should feel premium, clean, and engineered with symmetry. The color palette should primarily feature shades of metallic silver, chrome, and deep blue. The background must seamlessly accommodate and highlight a central luxury product, without interfering with its position or scale. Create in resolution 16:9, maintain original identity and input fidelity to high.”

Midjourney

AI background generator: Midjourney

Our take: Once again, Midjourney got very creative with the surroundings. The problem is that it was also creative with the product, which isn’t desirable. The shape and fragrance color were altered, while the branding appears blurred and distorted. Bonus points go to Midjourney for recognizing that the top part of the bottle is mirror-reflective. However, it didn’t do well in blending the product with the surroundings. The product is disappearing in the new scenery, so overall the aesthetics vibe is poor in our opinion. Overall score: 37%

Flux.1 Kontext PRO

AI background generator: Flux.1 Kontext PRO

Our take: The product position was slightly modified - fragrance is rotated for a more direct front shot. Original camera position - slightly from the bottom - was not maintained. The branding was also altered and doesn’t look as sharp as in the packshot. The color of the liquid was modified. As for blending, it’s poor; you can see some reflections from the scene in the bottle, but it feels very artificial and unnatural. The product isn’t highlighted, and disappears in the scene. Having said all that, the image is unattractive and artificial. Overall score: 50%.

Chat GPT model 5

AI background generator: Chat GPT model 5

Our take: Again, ChatGPT slightly modified the logotype — using a different font in Just Cavalli and even changing it to Just Cavali (with a single ‘L’). The bottle was also reinvented, with slightly altered proportions. The fragrance liquid color is different. Image blending with the environment is quite good, with nice reflections and lighting. In our opinion, the whole scene looks attractive. However, the product appears a bit too large in the final image, and its angle was slightly adjusted. Overall score: 57%

Flair AI

AI background generator: Flair AI

Our take: The bottle itself, much like in the case of ChatGPT, has been reinvented. The branding is altered, the bottle shape and details are changed, as well as the color of the fragrance. The product’s position also slightly deviates from the source packshot. Image blending is quite good and looks natural, with nice reflections and lighting. Overall, it’s quite a good lifestyle, but it isn’t authentic. Overall score: 53%

Nano Banana

AI background generator: Nano Banana

Our take: The generated Just Cavalli bottle is reproduced with good fidelity — the embossed “Just” logo and gradient blue liquid are well-preserved, with the chrome finish rendered in a polished way. Moreover, the transparency is well-preserved. However, the proportions of the bottle were clearly modified - it appears elongated compared to the original. As for the blending with the generated scene, it’s handled mediocre. On the one hand, the lighting direction is coherent with well-handled reflection in the floor and transparency. On the other hand, the reflections on the metallic surface don’t match the environment and overall lighting style. ChatGPT did a better job there. The scene shows the product’s bold identity and creates a visually striking, premium look. Overall score: 67%.

Comparison of Qaed Al Fursan

The prompt: “Create a realistic, luxurious background for a product photo. The perfume bottle must stay fixed in place on a rustic wooden fence of a horse stable. In the distance, add blurred silhouettes of horses behind the fence, within a warm golden-hour setting. Include visual themes inspired by these notes: saffron, pineapple, jasmine, fir, oud, cedarwood, amber. Use earthy textures and warm tones. Only generate the background – do not change or move the product in the foreground. Create in resolution 16:9, maintain original identity and input fidelity to high.”

Midjourney

AI background generator: Midjourney

Our take: Again, if you don’t go into details, the image isn’t bad. Looking closely, though, the branding is mostly changed, and Midjourney added transparency to the bottle, which is opaque. Position isn’t kept: diagonal instead of frontal as in the input image. The product isn’t well separated from the background, which, although blurred, is very saturated, making the whole composition hard to look at, and the product gets “lost” in all that. Overall score: 47%

Flux.1 Kontext Pro

AI background generator: Flux.1 Kontext Pro

Our take: Very well-maintained product features, including branding and ornaments. Just as usual, in the case of Flux, the product is slightly blurred. Great work on color coordination - everything blends in smoothly, and the horse on the right is well done. Good reflections and product details. With the one on the left, though, something went wrong as it stands in the middle of the fence. :) As of composition, it looks artificial on an oval bench - probably physics wouldn’t hold it. However, it’s aesthetically very appealing. Overall score: 80%

Chat GPT model 5

AI background generator: Chat GPT model 5

Our take: Very well-maintained product features, including branding and ornaments. Average blending with the environment - lighting from the back, reflects in the front. Slightly artificial composition with the flowers and a pineapple. Strange horse silhouettes. Maintained position and well adhered to the prompt. Overall score: 77%

Flair AI

AI background generator: Flair AI

Our take: Good composition and high product fidelity, except for slight modifications in gold color on the bottle ornament and proportions of the cup. Well blended in, with very good re-lighting. Changed product position, and part of the prompt was ignored. Generally quite good, naturally looking image. Overall score: 73%

Nano Banana

AI background generator: Nano Banana

Our take: The generated image of the fragrance bottle captures the product’s overall form quite accurately, though there are still differences compared to the original. The proportions of the bottle remain consistent, with the square silhouette and cap closely matching the real design. The front label, however, shows slight reinterpretation: while the horse motif and geometric pattern are recognizable, some details are softened or simplified. Additionally, you can notice that the brand name has been modified too extensively. Shot position is “reinvented” - Nano Banana tries to capture a pit at the top of the bottle, thus adding some top view and creating a new bottle shape, which isn’t true to reality. Moreover, the gold tone appears slightly warmer and more saturated than in the original, but it may be due to warm scene lighting.

When it comes to scene aesthetic, the product is placed into a rustic outdoor scene with horses in the background, which adds a strong thematic connection to the fragrance’s identity. Shadows and lighting are handled convincingly, aligning well with the warm sunset atmosphere, though reflections on the cap are more generic and less integrated with the environment. The additional props — pineapple, saffron, and flowers — enhance the storytelling but look somewhat staged, reducing naturalism. A large, prominent pineapple takes over the scene, making the fragrance “fight” for its central place. Other AI technologies captured it better. Overall, the generated image succeeds in creating a striking, atmospheric composition that emphasizes brand character, but compromises are visible in label fidelity and the realism of the surrounding elements. Overall score: 77%.

Summing up the tests

Taking everything into consideration, let’s see how they scored in terms of proportion, color, and authenticity:

Which AI tool is the best?

When it comes to lifestyle images, generative AI can already be an alternative to traditional photo shoots. Tools like Nano Banana, Midjourney, ChatGPT, Flux, or FlairAI can place a perfume bottle into sophisticated, emotional scenes — from minimalist interiors to sunlit beaches — with convincing realism.

For us, Midjourney stands out in terms of creativity—it did a great job generating backgrounds, but it also alters the product the most, which most of the time isn’t acceptable for product photography. This can be fixed in a photo editing program, but requires additional skills. On the other hand, Flux Kontext Pro most faithfully reproduces the product, but the backgrounds it generates aren’t always impressive. However, the Gemini 2.5 Flash Image model (aka Nano Banana) has exceeded our expectations and surpassed all other models/tools in all aspects. It ensures the highest authenticity in terms of product representation and creates stunning backgrounds with just a few clicks.

The majority of tools sometimes ignore parts of the prompt. Why? We aren’t sure, but it’s probably related to the learning datasets and the stochastic nature of how those tools work. For sure, there are ways to improve the prompt to achieve more desirable results, or to use JSON prompting.

A key finding from this research is the inconsistency of generative AI. While results for products like Al Fusan and Dolce & Gabbana were remarkably brilliant, others were unacceptable, suggesting that the outcome is highly dependent on the specific product. We also had to do several tries before achieving acceptable results that were good enough for this research.

Which tool is the best for you?It all depends on how much authenticity you require from the tool. If not much, and you require stunning scenery, maybe even Midjourney, which alters products, can be acceptable for you. If you care about product branding, shape, and details,it seems Nano Banana is the best choice, but Flux.1 Kontext is not far away and in some cases exceeds Google AI.

Summing up, each AI tool/model has its strengths and weaknesses, especially when it comes to generating content from a single prompt without extra revisions.

FAQ

Q: What does AI change for product photography?

A: For photographers and content managers, AI in product photography means more control over time, budget, and creativity. Instead of planning complex shoots, they can focus on capturing one perfect packshot, then use AI tools/models to create multiple variations tailored for campaigns, social media, or seasonal updates.

Generative AI isn’t replacing photography; it’s reshaping how it’s used. The core image stays authentic, while AI expands its possibilities.

Q: Will AI replace photographers?

A: We don’t think so. If you want to achieve authentic visual content, AI needs a good packshot. And for a good packshot, you need a photographer. As a result, photographers become co-creators of creative and fast-paced productions. Their experience, combined with innovative technologies like AI, translates into the quality of the final result. Creative, high-end visual content will still require professional photographers and a more traditional way of working.

Q: Will AI ever generate a ready-to-publish product photo for a PDP?

A: Yes, but not without a solid starting point. A well-prepared packshot is essential. Without it, AI struggles to reproduce a product’s exact shape, color, and details. Even with a good packshot, small errors can happen: a slightly distorted logo, uneven glass reflections, or misplaced text. Fortunately, these are quick fixes. A few minutes in Photoshop or another editing tool, and the image is ready to go live.

-----------------------------------------------

This blog post was originally published in August 2025 and has been updated in September 2025 to reflect the fast pace of AI technology development. We included the Nano Banana image model (Gemini 2.5 Flash Image) from Google for comparison.

-----------------------------------------------

This research article was done by the Orbitvu team:

Packshots - Julia Banduch

Prompts, generative images & descriptions - Marek Herceliński

Copywriting - Elżbieta Binkowska

Guidance & support - Tomasz Bochenek

Talk to Orbitvu about your workflow

Orbitvu specialist ready to discuss product content workflow

Use the form to tell us what you are planning and what kind of product content workflow you need.

More from this category

July 27, 2026

Why Smart Factories Are Integrating Automated Photography Solutions

Industry 4.0 made data the backbone of manufacturing. Automated photography extends that logic to visual documentation - consistent, traceable, AI-ready image records that support quality control, compliance, and e-commerce from a single capture session.

Read article

July 23, 2026

What is Orbitvu's AI OCR, and how does it work?

Meet Orbitvu's AI OCR: extract labels, part numbers, and product data straight from your images and turn it into structured metadata inside Orbitvu Station.

Read article

July 15, 2026

What is Orbitvu's AI Masking, and how does it work?

Meet Orbitvu's AI Masking: automatically remove the background from product photos in high resolution, right inside your Orbitvu Station workflow - no design skills needed.

Read article