Generative AI

Senior Research Engineer, Model Evaluation

Cohere · Toronto

CORE_AI_JOBashby2026-06-16
Warum echter KI-Job:
The role is explicitly focused on developing and improving LLM evaluation methods, benchmarks, and infrastructure. The core responsibilities revolve around pushing the state-of-the-art in LLM evaluation, building tools for analysis, and working with LLM judges. This is a highly technical and research-oriented position directly related to core AI/ML concepts.

Auszug

Who are we? Cohere is the leading security-first enterprise AI company. We build cutting-edge foundation AI models and end-to-end products that are designed to solve real-world business problems. We’re training and deploying frontier models for enterprises who are building AI systems. We believe that our work is instrumental to the widespread adoption of AI and we are looking for folks that want to be part of that. We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they…

Ähnliche Jobs

Mountain View, CA (HQ)USAgreenhouse2026-06-08

Warum echter KI-Job: Die Rolle ist klar auf die Entwicklung und Optimierung von LLM-basierten Agenten, Reinforcement Learning, Evaluationsrahmen und agilen Architekturen ausgerichtet. KI ist der überwiegende Kern der Tätigkeit, sowohl in Forschung als auch in Produktionsumgebung.

About Glean: Glean is the Work AI platform that helps everyone work smarter with AI. What began as the industry’s most advanced enterprise search has evolved into a full-scale Work AI ecosystem, powering intelligent Search, an AI Assistant, and scalable AI agents on one secure, open platform. With…

Details Quelle / Bewerbung öffnen

San Francisco, CAUSAgreenhouse2026-06-08

Warum echter KI-Job: Die Rolle ist eindeutig auf führende Forschung in KI, insbesondere in Generative AI, Agenten und RL-Systemen ausgerichtet. Die Aufgaben beinhalten die Leitung einer Forschungsgruppe, die Entwicklung von Forschungsroadmaps und die Umsetzung von Prototypen in r…

Scale AI accelerates the development of AI systems by providing the data, infrastructure, and tooling that power the most advanced models in the world. Our teams operate at the intersection of cutting-edge research, large-scale engineering, and real-world deployment, partnering with leading frontie…

Details Quelle / Bewerbung öffnen

San Francisco, CAUSAgreenhouse2026-06-04

Warum echter KI-Job: Die Rolle ist zentral auf die Entwicklung und Forschung von Evaluationsmethoden fuer LLMs ausgerichtet. Die Aufgaben beinhalten die Gestaltung von Benchmarks, die Entwicklung von Metriken, die Implementierung skalierbarer Pipelines und die Veröffentlichung vo…

As the leading data and evaluation partner for frontier AI companies, Scale is dedicated to advancing the evaluation and benchmarking of large language models (LLMs). We are building industry-leading LLM evals, setting new standards for model performance assessment. Our mission is to develop rigoro…

Details Quelle / Bewerbung öffnen

Principal Machine Learning Engineer, App SW

Wayve · Sunnyvale, California USA

98/100
Sunnyvale, California USAUKgreenhouse2026-05-28

Warum echter KI-Job: Die Rolle eines Principal ML Engineers bei Wayve ist klar auf die Entwicklung und Bereitstellung leistungsstarker, robusten und generalisierbaren ML-Modelle für autonome Fahrzeuge ausgerichtet. Die Aufgaben umfassen die gesamte Kette von Modellarchitektur, Da…

About us Founded in 2017, Wayve is the leading developer of Embodied AI technology. Our advanced AI software and foundation models enable vehicles to perceive, understand, and navigate any complex environment, enhancing the usability and safety of automated driving systems. Our vision is to create…

Details Quelle / Bewerbung öffnen

Sr. Machine Learning Engineer

Databricks · San Francisco, California

98/100
San Francisco, CaliforniaUSAgreenhouse2026-05-26

Warum echter KI-Job: Die Rolle eines Senior Machine Learning Engineers bei Databricks ist stark auf die Entwicklung und Bereitstellung von GenAI-Produkten ausgerichtet. Die Aufgaben beinhalten die Entwicklung von LLMs, die Gestaltung von AI-Features, die Erstellung von ML-Pipelin…

P-1131 The Applied AI team at Databricks sits at the forefront of advancing GenAI-powered products. Over the past years, we’ve launched Databricks Assistant , AI/BI Genie , and Agent Bricks working with product teams, and made significant strides in LLM quality for these products. These products ar…

Details Quelle / Bewerbung öffnen

Sunnyvale, CAChinagreenhouse2026-06-16

Warum echter KI-Job: The role explicitly focuses on architecting machine learning systems, building distributed AI training systems, and optimizing hardware/software for ML products. The qualifications heavily emphasize ML experience and knowledge.

Do you want to be part of the AI revolution? Do you want to think out of the box, thriving on challenges in AI industry and the desire to solve them? Do you want to work with a world-class team to explore the fast-growing AI hardware opportunities and impact on AI industry? We’re looking forward to…

Details Quelle / Bewerbung öffnen