Deploy visual pipelines with our API Engine.
A predictable layout and rendering infrastructure with unified SDK client packages, webhooks, and structured response streaming. Built to handle production traffic.
curl https://api.veltneon.com/v1/images \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"prompt": "a futuristic city at golden hour, cinematic",
"style": "photoreal",
"resolution": "4K"
}'{
"id": "img_7x8932pq19",
"status": "succeeded",
"created_at": 1782439120,
"completed_at": 1782439121,
"model": "veltneon-lumen-v2",
"style": "photoreal",
"resolution": "4096x2160",
"output": {
"url": "https://cdn.veltneon.com/renders/7x8932pq19.png",
"mime_type": "image/png",
"size_bytes": 4821034
},
"inference_metrics": {
"compute_node": "NV-H100-DGX-04B",
"engine": "TensorRT / Triton",
"latency_ms": 1940,
"precision": "FP8",
"cost_tokens": 12
}
}SDKs
Official SDK libraries.
JavaScript / TypeScript
npm install @veltneon/sdkPython
pip install veltneonGo
go get github.com/veltneon/goWebhook lifecycle flow.
Integrate asynchronous callbacks to resolve high-resolution renderings inside your database records.
1. API Request
Payload includes `webhook_url` endpoint
2. Render Queued
Gateway posts `render.pending` check
3. Processing
H100 Node resolves latent canvas layers
4. Success Dispatch
Gateway posts `render.succeeded` with PNG URL
Server-Sent Events (SSE)
For real-time UI previews, developers can listen to layout token changes progressively using our EventStream API path. Progressive previews are yielded while denoising passes execute.
{
"layers": [
{
"id": "bg_layer_01",
"type": "background",
"prompt": "studio soft ambient background",
"weight": 0.85
},
{
"id": "product_mesh_01",
"type": "foreground_silhouette",
"source_contour_url": "https://assets.brand.com/bottle_mesh.obj",
"position": { "x": 0.5, "y": 0.6, "scale": 1.2 },
"light_angle": 120
}
],
"global_overrides": {
"color_palette": ["#8B5CF6", "#EC4899"],
"locked_contours": true
}
}Dynamic Composition Objects
veltneon supports rich layer lists where objects hold dynamic bounds, reference shapes, and custom light angles. This structured JSON representation parses directly to intermediate tensor maps in Triton nodes.
Token-bucket rate limits
In order to protect H100 computing channels, API keys are governed by Token-Bucket boundaries. Default sandbox keys process up to 10 image requests per minute with a bucket buffer size of 20 tokens.
CI/CD asset auto-compiles
Link visual updates directly into code reviews. Git hooks trigger veltneon automated API builds to test design layouts, check compliance boundaries, and render production assets automatically.
Local Sandbox Emulator
Test your API routes without incurring server cost or hitting GPU queues. Our sandbox emulator package mocks all endpoints locally, returning mock schemas and layout contours.
npx @veltneon/emulator startEndpoints
Simple API endpoints.

Compute Specs
Hardware Acceleration
NVIDIA H100-DGX Accelerated Backend.
We compile and scale generative models directly on enterprise-grade NVIDIA DGX compute nodes. By leveraging CUDA-X deep learning layers, custom FP8 precision models, and NVIDIA Triton Inference Server, veltneon renders multi-layer 4K image assets in less than 2 seconds.
Execution Pipeline
Inference Architecture Flow
How requests are parsed, balanced across NVIDIA clusters, compiled, and resolved in real-time.
API Gateway
Parses user specs, applies auth tokens, checks guidelines.
NVIDIA Triton
Schedules dynamic batch queues, loads balance metrics.
Hopper Compute
Runs latent diffusion processing via TensorRT FP8 kernels.
CDN Delivery
Delivers finished high-fidelity 4K layout png payload.
Developer views
Model internals, prompts and response traces.
Begin building with veltneon.
Get started with sandbox credentials. Try the Canvas and Engine API today.
Create an accountBuilt with developer-first tools