Ai engineer for llm ops & evaluation (m/f/d)
MünchenAuxilius.ai
...evals (exact match, schema validation, embeddings) vs. LLM-as-judge, and build the rubrics, test datasets, and human-review loops that make the system trustworthy Drive prompt engineering and optimization across all LLM operations in the product: Moving from hand-tuned prompts to a measurable, iterative process Pick the right [...]
Kategorie IT / Informationstechnologie