Ai engineer for llm ops & evaluation (m/f/d)
MünchenAuxilius.ai
...optimization loop, and the production integration that turns experiments into reliable customer-facing features Design evaluation strategy per output type: Decide when to use deterministic evals (exact match, schema validation, embeddings) vs. LLM-as-judge, and build the rubrics, test datasets, and human-review loops that make [...]
Kategorie IT / Informationstechnologie