Working student - genai / llm evaluation - agentic ai / nlp (f/m/d)
KarlsruheCinemo GmbH
...features, including qualitative and quantitative analysis. Create, curate, and maintain datasets for benchmarking, regression testing, and scenario coverage. Extend and improve internal evaluation frameworks (metrics, dashboards, automated test runs). Contribute to end-to-end testing of GenAI features within the in-car [...]
Kategorie Medien / Verlag / Redaktion