Generative AI

Senior Research Engineer, Model Evaluation

Cohere · Toronto

CORE_AI_JOB · ashby · 2026-06-16

LLM Machine Learning Data Science Model Evaluation NLP Data Analysis Software Engineering

Why this is a genuine AI job:
The role is explicitly focused on developing and improving LLM evaluation methods, benchmarks, and infrastructure. The core responsibilities revolve around pushing the state-of-the-art in LLM evaluation, building tools for analysis, and working with LLM judges. This is a highly technical and research-oriented position directly related to core AI/ML concepts.

Excerpt

Who are we? Cohere is the leading security-first enterprise AI company. We build cutting-edge foundation AI models and end-to-end products that are designed to solve real-world business problems. We’re training and deploying frontier models for enterprises who are building AI systems. We believe that our work is instrumental to the widespread adoption of AI and we are looking for folks that want to be part of that. We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they…

Similar jobs

Machine Learning Engineer, AI Assistant & Autonomous AI Agents

Glean · Mountain View, CA (HQ)

98/100

Mountain View, CA (HQ) · USA · greenhouse · 2026-06-08

LLM Agentic AI Reinforcement Learning Machine Learning Engineering NLP LLM Orchestration Evaluation Frameworks Fine-tuning Memory Augmented LLMs

Why this is a genuine AI job: The role is clearly focused on developing and optimizing LLM-based agents, reinforcement learning, evaluation frameworks, and agile architectures. AI is the predominant core of the work, in both research and production environments.

About Glean: Glean is the Work AI platform that helps everyone work smarter with AI. What began as the industry’s most advanced enterprise search has evolved into a full-scale Work AI ecosystem, powering intelligent Search, an AI Assistant, and scalable AI agents on one secure, open platform. With…


Manager, Machine Learning Research Scientist, GenAI

Scale AI · San Francisco, CA

98/100

San Francisco, CA · USA · greenhouse · 2026-06-08

Machine Learning Generative AI Research Science Deep Learning Agent/RL Systems Evaluation Methodologies Post-Training Agentic/RL Environments Large Language Models

Why this is a genuine AI job: The role is clearly focused on leading-edge AI research, particularly in generative AI, agents, and RL systems. The tasks include leading a research group, developing research roadmaps, and turning prototypes into …

Scale AI accelerates the development of AI systems by providing the data, infrastructure, and tooling that power the most advanced models in the world. Our teams operate at the intersection of cutting-edge research, large-scale engineering, and real-world deployment, partnering with leading frontie…


Staff Machine Learning Research Scientist, LLM Evals

Scale AI · San Francisco, CA

98/100

San Francisco, CA · USA · greenhouse · 2026-06-04

Machine Learning Large Language Models (LLMs) NLP Transformer Modeling Evaluation Methodologies Benchmarking ML Research MLOps Reproducible Pipelines AI Research Publication

Why this is a genuine AI job: The role centers on developing and researching evaluation methods for LLMs. The tasks include designing benchmarks, developing metrics, implementing scalable pipelines, and publishing …

As the leading data and evaluation partner for frontier AI companies, Scale is dedicated to advancing the evaluation and benchmarking of large language models (LLMs). We are building industry-leading LLM evals, setting new standards for model performance assessment. Our mission is to develop rigoro…


Principal Machine Learning Engineer, App SW

Wayve · Sunnyvale, California USA

98/100

Sunnyvale, California USA · UK · greenhouse · 2026-05-28

Machine Learning Deep Learning Model Architecture Data Pipelines Evaluation Frameworks Real-world Deployment MLOps PyTorch Python C++

Why this is a genuine AI job: The Principal ML Engineer role at Wayve is clearly focused on developing and deploying high-performance, robust, and generalizable ML models for autonomous vehicles. The tasks span the entire chain from model architecture, …

About us Founded in 2017, Wayve is the leading developer of Embodied AI technology. Our advanced AI software and foundation models enable vehicles to perceive, understand, and navigate any complex environment, enhancing the usability and safety of automated driving systems. Our vision is to create…


Sr. Machine Learning Engineer

Databricks · San Francisco, California

98/100

San Francisco, California · USA · greenhouse · 2026-05-26

Machine Learning LLM (Large Language Models) GenAI (Generative AI) Fine-tuning Prompt Engineering RAG (Retrieval-Augmented Generation) MLOps Model Deployment Python TensorFlow

Why this is a genuine AI job: The Senior Machine Learning Engineer role at Databricks is strongly focused on developing and deploying GenAI products. The tasks include developing LLMs, designing AI features, building ML pipelines…

P-1131 The Applied AI team at Databricks sits at the forefront of advancing GenAI-powered products. Over the past years, we’ve launched Databricks Assistant, AI/BI Genie, and Agent Bricks working with product teams, and made significant strides in LLM quality for these products. These products ar…


Machine Learning System Software Architect

Baidu Talent · Sunnyvale, CA

95/100

Sunnyvale, CA · China · greenhouse · 2026-06-16

Machine Learning Deep Learning TensorFlow PyTorch PaddlePaddle MLOps Data Science System Architecture C++ Performance Optimization

Why this is a genuine AI job: The role explicitly focuses on architecting machine learning systems, building distributed AI training systems, and optimizing hardware/software for ML products. The qualifications heavily emphasize ML experience and knowledge.

Do you want to be part of the AI revolution? Do you want to think outside the box, thrive on challenges in the AI industry, and have the desire to solve them? Do you want to work with a world-class team to explore fast-growing AI hardware opportunities and make an impact on the AI industry? We’re looking forward to…
