Frontier models
Post-train smaller, deployable models on the proprietary data and workflows that frontier labs skip: finance, legal, operations, and beyond.
Fine-Tuning
Frontier RL environments don't cover your 10-Ks, your trading desks, or your underwriting playbooks. We do — turning your domain into a model that beats a vanilla commercial LLM on the work that matters to you.
Domain data
Proprietary corpora, traces, and tool calls become the training signal.
RSI
Our mission is to push post-training beyond reinforcement learning to true generalization — models that reason on your domain, not just on academic benchmarks. We get there by discovering and iterating state-of-the-art methods through recursive agentic self-improvement.
discover
Agents scan SOTA model releases, papers, and method traces for candidates worth testing.
experiment
We run rollouts, ablations, and reward checks until a method improves on realistic tasks.
ship
The best variants become endpoints you can call, compose, and evaluate in your domain.
feedback
Production traces and evals feed the next search cycle, so the system improves itself.
The loop
Three primitives, one closed loop, repeated until the model generalizes.
Define environments, actions, tools, and rewards in a typed, versioned API.
Run thousands of parallel rollouts; every step traced, every reward attributed.
Tempera explores post-training methods on your data and ships the model that generalizes best.
API
Compose environments, rollouts, and training in a single typed surface.
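As a rough illustration of what a typed surface for environments and rollouts could look like, here is a minimal Python sketch. Every name in it (`Environment`, `Step`, `Trace`, `TickerLookupEnv`, `rollout`, `run_parallel`) is hypothetical and chosen for illustration; none of it is taken from Tempera's actual API, and the toy environment stands in for a real domain task.

```python
# Hypothetical sketch: these names are illustrative, not Tempera's API.
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass, field
from typing import Callable, Protocol, Sequence


class Environment(Protocol):
    """Typed, versioned environment: reset to an observation, step with an action."""

    version: str

    def reset(self) -> str: ...

    def step(self, action: str) -> tuple[str, float, bool]: ...


@dataclass
class Step:
    """One traced step: what the policy saw, what it did, what it earned."""

    observation: str
    action: str
    reward: float


@dataclass
class Trace:
    """Full record of one rollout, with rewards attributed per step."""

    env_version: str
    steps: list[Step] = field(default_factory=list)

    @property
    def total_reward(self) -> float:
        return sum(s.reward for s in self.steps)


@dataclass
class TickerLookupEnv:
    """Toy environment: reward 1.0 when the action equals the target ticker."""

    target: str
    version: str = "0.1.0"
    max_steps: int = 3
    _turn: int = 0

    def reset(self) -> str:
        self._turn = 0
        return f"Which ticker is {self.target}?"

    def step(self, action: str) -> tuple[str, float, bool]:
        self._turn += 1
        correct = action == self.target
        reward = 1.0 if correct else 0.0
        done = correct or self._turn >= self.max_steps
        return ("correct" if correct else "try again"), reward, done


Policy = Callable[[str], str]


def rollout(env: Environment, policy: Policy) -> Trace:
    """Run one episode and record every step in a Trace."""
    trace = Trace(env_version=env.version)
    obs, done = env.reset(), False
    while not done:
        action = policy(obs)
        next_obs, reward, done = env.step(action)
        trace.steps.append(Step(obs, action, reward))
        obs = next_obs
    return trace


def run_parallel(envs: Sequence[Environment], policy: Policy, workers: int = 8) -> list[Trace]:
    """Run one rollout per environment on a thread pool, preserving input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda env: rollout(env, policy), envs))


if __name__ == "__main__":
    envs = [TickerLookupEnv(target="ACME") for _ in range(4)]
    traces = run_parallel(envs, policy=lambda obs: "ACME")
    print([t.total_reward for t in traces])
```

The sketch keeps each rollout's steps and rewards in a `Trace` tagged with the environment version, so a reward can be traced back to the exact step and environment definition that produced it. A thread pool is used only to keep the example short; a production system running thousands of rollouts would more plausibly use a distributed scheduler.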
Deployment
Fine-tuned models ship where your data lives. Pick the isolation model that matches your security and compliance posture.
A dedicated control and data plane in your cloud account. No shared compute, no shared weights.
Shared managed control plane with isolated data planes per customer. Faster to onboard, lower TCO.
Air-gapped deployment on your hardware for the most regulated environments.
Careers
Small team, large ideas, infinite mission.
Perform research on post-training, generalization, world models, and recursive self-improvement.
Apply → founders@tempera.dev
Build comprehensive scheduling, distributed training systems, and observability to power research.
Apply → founders@tempera.dev
Turn research into production.
Apply → founders@tempera.dev
We're always looking for talented people across all disciplines.
Apply → founders@tempera.dev
Waitlist
Join the waitlist for the first cohort, or email the founders directly.