◇ Vision API · Beta · Free
CPU-only vision pipeline — CLIP concepts, YOLO objects, OCR, face emotion, all in a single 900 ms call. Stateless, horizontally scalable, beta-free while you kick the tyres.
Full pipeline runs on a 4-vCPU node. No GPU required, no warm-up latency on warm workers.
Retina-native gives you concepts/objects/OCR/faces structured. Gemini-compat is a one-line swap.
50 images per key per day. No credit card. Real production use, not a demo sandbox.
◇ Drop-in usage
If you're already using the Gemini SDK, point the API endpoint at retina.frank.ink and keep the same request/response shape.
from google import generativeai as genai
genai.configure(
api_key="rk_live_...",
transport="rest",
client_options={
"api_endpoint": "retina.frank.ink",
},
)
model = genai.GenerativeModel("gemini-flash-latest")
result = model.generate_content(
["Describe this image", img]
)
print(result.text)