Images. Video. Voice. Text. One integration.

Add AI media generation to your product

Give your users images, video, voiceovers, and copy in one embedded workflow. Ship faster than building it yourself, keep the experience inside your app, and skip months of AI infrastructure work.

index.html
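<!-- Images, video, audio, text: one tag. -->
<!-- Custom elements need an explicit closing tag in HTML; a self-closing form is not parsed as closed. -->
<image-layer
  api-key="il_live_..."
  api-url="https://api.imagelayer.app"
></image-layer>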

Everyone's shipping AI. Are you?

Your users already expect AI features. Here's what stops most teams from shipping them fast.

Your competitors are already shipping AI features

AI is in every product around you. Your users expect it. Every month you wait is a month your competitors pull ahead.

Building it yourself costs a fortune

Model selection, API integration, rate limiting, billing, brand controls — the real cost isn't the API call, it's the engineering time to make it production-ready.

Sending users to external tools kills the experience

When users leave your product for ChatGPT or Canva, you lose engagement, you lose context, and the output doesn't match your brand.

ImageLayer handles the infrastructure, guardrails, and delivery — so your team can ship the feature.

4 modes: Image · Video · Audio · Text
1 tag to embed
0 ML expertise needed
Full brand compliance
Features

Everything you need. Nothing to build.

One widget. Images, video, voice, text. Your brand, your rules.

Text-to-Image

Your users describe what they need. The widget returns a polished, on-brand visual — right inside your product.

Example outputs: an AI-generated stat card (Stat highlight) and an AI-generated blog promo card (Blog promo).

See full case study

Sketch-to-Image (Starter+)

Users upload a rough sketch or wireframe. Get a brand-aligned result — without leaving your app.

Sketch input: rough wireframe sketch of a product launch layout
Result: polished product launch announcement generated from the sketch
See full case study

AI Text Generation (New)

Captions, product descriptions, social copy — generated alongside your visuals or on their own. Same prompt pipeline, same brand rules.

Prompt (brand tone applied): Launch the new serum with a calm, premium tone for Instagram and PDP copy.

Caption (Instagram): Golden-hour glow. Solene serum, soft rose finish, ready for your morning ritual.

Product blurb (PDP): Lightweight hydration with a refined botanical finish and a clean editorial feel.

Video Generation (New)

Cinematic product demos, social teasers, brand intros — generated from text in under two minutes. Powered by Veo 3.1 on Vertex AI.

Read the guide
Video generation preview: product demo (Veo 3.1 Fast, 8s, 720p)

AI Voiceovers & TTS (New)

Professional narrations, product voiceovers, and audio ads, with eight distinct voices and a choice of model tiers. Pair with video for narrated content.

Read the guide
Sample: Kore voice, Flash TTS, 22s, MP3

“Meet Prism — the smart lamp that shifts from warm focus to evening calm.”

Podcast Snippets (Pro+, New)

Generate podcast intros, two-host discussions, and interview segments. Automatic voice pairing creates natural multi-speaker audio.

Read the guide
Sample (voices: Achird and Sulafat, two-host):

Host A: AI video is not replacing a crew. It is collapsing the time to first usable draft.

Host B: Exactly. A small team can turn one script into a polished two-voice segment in minutes.

Content Types

Stat highlights, quote cards, blog promos, announcements — users pick a template, fill in the fields, and generate. No prompt engineering needed.

Templates: Stat · Quote · Blog · News
Example (Stat): value 42%, label "growth rate"

Platform Presets

LinkedIn, Instagram, X, blog headers, email banners — one click sets the right dimensions and optimizes the composition for each channel.

AI Brand Enforcement (Pro+)

Define your style, colors, and tone. The AI follows your brand from the first draft — not just with your logo added at the end.

Read the guide
Modern · Clean · Professional

Usage Analytics (Pro+)

Track who generates what, monitor credits across users, and manage costs from one dashboard.

Sample dashboard: 1,284 credits (+12%), charted by day of the week

Works Everywhere

A native Web Component that drops into any codebase. React, Vue, Next.js, Angular, Svelte, Astro — or just vanilla HTML. No wrappers, no framework lock-in.

React · Vue · Next.js · Angular · Svelte · Astro · HTML + any stack
How it works

Up and running in three steps

No AI infrastructure. No tool sprawl. Just results.

01

Set up your brand guardrails

Upload your logo, pick your colors, and define the style and tone you want the AI to follow. One setup, every workflow stays on-brand.

02

Embed the widget

Drop one <image-layer> tag into your app. It's a native Web Component — one integration for images, video, voice, and text.

03

Your users start creating

Users pick the workflow they need, add a prompt or structured inputs, and generate on-brand images, videos, voiceovers, or copy without leaving your product.

Interactive preview

Explore the actual widget

Switch through image, video, audio, and text modes, inspect sample history, and explore the full widget surface your customers would actually embed.

Explore every mode and screen here. Real generations, voiceovers, and exports unlock in the playground after sign-up.

Want to run the real pipeline? Sign in to the playground.
Open the playground

Use cases

Fits the surfaces your users already work in

ImageLayer belongs inside the workflow where people draft, review, and publish content, not in a separate AI tab.

Pricing

Start free. Unlock broader AI workflows as you grow.

No credit card required. Start with image workflows, then expand into video, voice, and text as your product grows.

Free

Try the widget in your product before wiring up paid workflows.

$0

≈ 5–10 images

Start Building Free
  • 50 credits / month
  • Up to 1 end user
  • 72-hour temporary history
  • Image generation
  • Email support

Starter

For teams adding image, video, voice, and text workflows without building AI infrastructure.

$59 /mo

≈ 100–180 images or 69–128 short videos

Get Starter
  • 900 credits / month
  • Up to 20 end users
  • 10GB archive storage (30-day archive)
  • Image generation
  • Sketch inputs & Reference inputs
  • Video generation (720p)
  • AI voiceovers
  • AI text generation
  • Email support

Pro (Popular)

For growing teams that need richer media workflows, brand control, and usage visibility.

$199 /mo

≈ 333–600 images or 230–428 short videos

Get Pro
  • 3,000 credits / month
  • Up to 50 end users
  • 100GB archive storage (12-month archive)
  • Image generation
  • Sketch inputs & Reference inputs
  • Video generation (1080p)
  • AI voiceovers
  • Podcast snippets
  • AI text generation
  • Brand guidelines
  • Usage analytics
  • Priority support

Enterprise

For organizations that need scale, rollout support, and custom workflow control.

Custom
Talk to Sales
  • Custom credit allocation
  • Custom user limits
  • 1,024GB archive storage (Custom archive)
  • Image generation
  • Sketch inputs & Reference inputs
  • Video generation (1080p)
  • AI voiceovers
  • Podcast snippets
  • AI text generation
  • Brand guidelines
  • Usage analytics
  • Custom integrations
  • Dedicated onboarding

Credit costs by generation type

One plan, multiple AI workflows. Here's how credits translate to generations.

Type                     Credits   Generations on Starter (900 credits)
Image (standard)         3–5       180–300
Image (pro quality)      9         100
Short video (4s, 720p)   7–13      69–128
Long video (8s, 1080p)   14–92     9–64
Voiceover (per minute)   1–2       450–900
Text generation          1–2       450–900

Credit cost depends on model, duration, and resolution. Most image workflows use 5 credits per generation.
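
To see how the ranges in the table follow from a monthly allowance, here is a minimal sketch that divides a plan's credits by the per-generation cost and rounds down. The function name is hypothetical and not part of any ImageLayer API; the credit figures are the ones listed above.

```python
def generations_range(monthly_credits: int, min_cost: int, max_cost: int) -> tuple[int, int]:
    """Return (fewest, most) whole generations that a credit allowance covers.

    Hypothetical helper for illustration only: the fewest generations come
    from the highest per-generation cost, the most from the lowest.
    """
    if min_cost <= 0 or max_cost < min_cost:
        raise ValueError("costs must satisfy 0 < min_cost <= max_cost")
    fewest = monthly_credits // max_cost
    most = monthly_credits // min_cost
    return fewest, most


STARTER_CREDITS = 900  # Starter plan allowance from the pricing section

print(generations_range(STARTER_CREDITS, 3, 5))    # (180, 300) standard images
print(generations_range(STARTER_CREDITS, 7, 13))   # (69, 128) short videos
print(generations_range(STARTER_CREDITS, 14, 92))  # (9, 64) long videos
```

The same division with 3,000 credits gives the Pro plan estimate of 230 to 428 short videos.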

Security & Trust

Enterprise-grade security built in

Your data, your clients, your reputation. Built for products that can't afford a security incident.

Encrypted in transit

Connections use HTTPS/TLS. API keys are SHA-256 hashed and passwords are stored as bcrypt hashes.
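
For readers who want a picture of what storing only a SHA-256 hash of an API key can look like, here is a minimal sketch using Python's standard library. It is an illustration under assumptions, not ImageLayer's actual implementation: the function names are hypothetical, and the `il_live_` prefix is borrowed from the sample key at the top of this page. Password hashing with bcrypt needs a third-party library and is not shown.

```python
import hashlib
import hmac
import secrets


def new_api_key(prefix: str = "il_live_") -> str:
    """Generate a random key. The prefix mirrors the sample key on this page (assumption)."""
    return prefix + secrets.token_urlsafe(32)


def hash_api_key(api_key: str) -> str:
    """Return the hex SHA-256 digest; only this value would be stored."""
    return hashlib.sha256(api_key.encode("utf-8")).hexdigest()


def verify_api_key(presented_key: str, stored_hash: str) -> bool:
    """Hash the presented key and compare in constant time against the stored hash."""
    return hmac.compare_digest(hash_api_key(presented_key), stored_hash)


key = new_api_key()
stored = hash_api_key(key)           # the plaintext key is never persisted
print(verify_api_key(key, stored))   # True
```

A fast hash such as SHA-256 is a reasonable fit here because the key is long and random, whereas human-chosen passwords call for a slow hash such as bcrypt.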

Email-verified accounts

Organizations must verify their email before creating API keys or using live API flows.

Rate-limited & monitored

Built-in rate limiting with fail-closed behavior. Every auth event is logged for security analysis.
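
"Fail-closed" means that when the limiter itself cannot answer, the request is denied, not waved through. The sketch below shows that idea with a simple in-memory fixed-window counter. The class and function names are hypothetical, and this is not a description of how ImageLayer's limiter is built.

```python
import time
from typing import Callable, Dict, Tuple


class FixedWindowLimiter:
    """In-memory fixed-window counter keyed by client id (illustrative only)."""

    def __init__(self, limit: int, window_seconds: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.limit = limit
        self.window_seconds = window_seconds
        self.clock = clock
        self._windows: Dict[str, Tuple[float, int]] = {}

    def hit(self, client_id: str) -> bool:
        """Count one request and report whether it is within the limit."""
        now = self.clock()
        start, count = self._windows.get(client_id, (now, 0))
        if now - start >= self.window_seconds:
            start, count = now, 0  # the previous window has ended; start a new one
        count += 1
        self._windows[client_id] = (start, count)
        return count <= self.limit


def allow_request(limiter, client_id: str) -> bool:
    """Fail closed: if the limiter raises, deny the request."""
    try:
        return limiter.hit(client_id)
    except Exception:
        return False


limiter = FixedWindowLimiter(limit=2, window_seconds=60)
print([allow_request(limiter, "client-a") for _ in range(3)])  # [True, True, False]
```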

Short-lived generated media

Temporary generated media is cleaned up on a rolling TTL. Our current target is roughly 72 hours for generated assets.
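
A rolling TTL can be pictured as a periodic sweep that selects every asset older than the cutoff. The sketch below assumes a simple mapping of asset ids to creation times; the names are hypothetical, and the 72-hour value is the approximate target stated above, not a guaranteed deadline.

```python
from datetime import datetime, timedelta, timezone
from typing import Dict, List

ASSET_TTL = timedelta(hours=72)  # approximate target stated above


def expired_asset_ids(created_at: Dict[str, datetime], now: datetime,
                      ttl: timedelta = ASSET_TTL) -> List[str]:
    """Return the ids of assets whose age has reached the TTL, sorted for stable output."""
    return sorted(asset_id for asset_id, ts in created_at.items() if now - ts >= ttl)


now = datetime(2025, 1, 4, tzinfo=timezone.utc)
assets = {
    "img-old": now - timedelta(hours=73),
    "img-fresh": now - timedelta(hours=1),
}
print(expired_asset_ids(assets, now))  # ['img-old']
```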

IDOR protection

API requests are scoped to the authenticated organization, with signed-access patterns and tenant checks on stored assets.
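
As an illustration of tenant checks combined with signed access, the sketch below refuses any asset whose owning organization differs from the authenticated one, and validates an expiring HMAC signature bound to the organization and asset. All names, the message layout, and the assumption that ids never contain ":" are hypothetical; this is a generic pattern, not ImageLayer's actual scheme.

```python
import hashlib
import hmac
from dataclasses import dataclass


@dataclass(frozen=True)
class Asset:
    asset_id: str
    org_id: str  # the organization that owns the asset


def can_access(asset: Asset, authenticated_org_id: str) -> bool:
    """Tenant check: knowing an asset id is not enough, the caller's org must own it."""
    return asset.org_id == authenticated_org_id


def sign_access(secret: bytes, org_id: str, asset_id: str, expires_at: int) -> str:
    """Sign org, asset, and expiry together (ids are assumed not to contain ':')."""
    message = f"{org_id}:{asset_id}:{expires_at}".encode("utf-8")
    return hmac.new(secret, message, hashlib.sha256).hexdigest()


def verify_access(secret: bytes, org_id: str, asset_id: str,
                  expires_at: int, signature: str, now: int) -> bool:
    """Reject expired links, then compare signatures in constant time."""
    if now >= expires_at:
        return False
    expected = sign_access(secret, org_id, asset_id, expires_at)
    return hmac.compare_digest(expected, signature)


secret = b"demo-secret"  # placeholder; a real secret would come from secure config
sig = sign_access(secret, "org-1", "asset-9", expires_at=1_000)
print(verify_access(secret, "org-1", "asset-9", 1_000, sig, now=500))  # True
print(verify_access(secret, "org-2", "asset-9", 1_000, sig, now=500))  # False
```

Because the organization id is part of the signed message, a signature issued to one tenant does not validate for another, even if the asset id is guessed.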

Bot & abuse prevention

Cloudflare Turnstile on registration. Disposable emails are blocked. IP-based daily limits prevent mass sign-ups.

Want to learn more about our security practices? Read our security page

FAQ

Questions before you start

If your question isn't here, reach out — we respond within a business day.

How does ImageLayer enforce my brand?

Two layers of control. First, you set AI brand guidelines: style description, color palette, tone keywords, and elements to avoid. Those instructions shape supported workflows so generated visuals and copy stay aligned with your brand. Second, you can add logo and layout overlays to image outputs in the branding editor after generation. Your users can't bypass either layer.

What AI models power ImageLayer?

ImageLayer runs production generations on Google Vertex AI: Gemini image models for visuals, Gemini text models for copy, Veo 3.1 Lite/Fast/Quality for video, and Gemini TTS for voice. We also use Gemma 4 on Cloudflare Workers AI for moderation and prompt enrichment. You can try a limited public preview without signing in, then switch the playground to your live workspace after authentication.

What can ImageLayer generate?

Images, video, voiceovers, podcast-style audio, and text. Plan controls decide which modes are available to your users. Most teams start with image workflows, then unlock video, voice, and text as they expand the experience.

What are content type templates?

Pre-built forms for common creative outputs — stat highlights, quote cards, blog promos, announcements, and infographics. Instead of writing a prompt from scratch, users pick a template, fill in a few fields, and generate. Depending on your plan, you can create up to 3, 10, or 100 custom templates for your specific use cases.

Does ImageLayer support social media image sizes?

Yes. Built-in platform presets cover LinkedIn (post, carousel, banner), Instagram (post, story), X/Twitter, Facebook (post, story), Google Business, WhatsApp Status, Telegram Post, Pinterest Pin, TikTok Cover, YouTube Thumbnail, blog headers, email banners, and custom dimensions. Each preset sets the correct dimensions and optimizes the composition for that platform. Choose Custom if you need a different aspect ratio.

How hard is the integration?

Load the widget script and add one <image-layer> HTML tag. That's the entire integration. It's a native Web Component, so it works with any framework: React, Vue, Angular, Svelte, Next.js, or plain HTML. The same widget can expose image, video, voice, and text workflows. Most teams go from sign-up to live widget in under 15 minutes.

Does it work with my tech stack?

Yes. The widget is a native Web Component, built on a standard that browsers support natively. It works out of the box with React, Vue, Next.js, Nuxt, Angular, Svelte, SvelteKit, Astro, Remix, and any other framework. It also drops into WordPress, Webflow, and other no-code tools. No npm package required: you can load it from our CDN with a single script tag.

Can I control what my users generate?

Yes. You can control brand rules, content templates, enabled modes, and other dashboard settings for each workspace. Your dashboard also shows who created what, when, and how many credits were used.

What happens if I hit my credit limit?

You'll get a notification before you run out. You can upgrade your plan anytime with no downtime. Enterprise customers get volume pricing, custom credit pools, and SLAs.

What video models does ImageLayer use?

We use Google Veo 3.1 on Vertex AI for video generation, with Lite, Fast, and higher-quality tiers depending on the workflow. It produces videos up to 8 seconds long at 720p, or at 1080p on Pro and higher plans. Lite starts at 7 credits, Fast typically costs 13–25 credits depending on duration and resolution, and the highest-quality tier costs more.

How does the text-to-speech work?

ImageLayer offers 8 distinct AI voices powered by Google Gemini TTS — from firm and professional to warm and conversational. Choose between Flash Lite (lowest-cost single-speaker), Flash (faster general-purpose), and Pro (richer intonation) models. Output is MP3 or WAV format.

Can I generate podcast-style content with multiple speakers?

Yes. Users on Pro and higher plans can generate two-host discussion segments and interview-format audio with multi-speaker TTS models like Gemini 2.5 Flash TTS and Gemini 2.5 Pro TTS. The system automatically pairs complementary voices for natural-sounding conversations. Write your script in a conversational style and the TTS handles speaker alternation.

Is my data secure?

Connections use HTTPS/TLS. API keys are SHA-256 hashed and never stored in plaintext, passwords are stored as bcrypt hashes, and temporary generated media is cleaned up on a rolling TTL. We require email verification before live API use, apply rate limits and tenant scoping, and do not use your prompts or generated assets to train our own models. See our Security page for full details.

Ship AI-powered creative content in your product

Images, video, audio, and text — embedded in your product. Free credits included. No credit card. No ML team needed.