AI Job Radar

Model Serving Jobs

Current AI jobs involving Model Serving, with matching learning paths and application guidance.

How to use Model Serving in your applications

If a job requires Model Serving, the skill should not appear in your CV as a keyword only. A short project reference, a course certificate, or a portfolio example is better. The application check verifies whether the skill is actually backed by evidence in your résumé.

10 matches · 3 companies · 93.5 average score · 1 remote

10 matches on this page, 10 in total. More matches are available through the page navigation and the company, skill, and job detail pages.

San Francisco · USA · Full-time · ashby · 2026-06-16

Why this is a genuine AI job: The role is explicitly focused on building, scaling, and optimizing LLM inference workloads. The team is a 'Forward Deployed Engineering' team working directly with customers on AI deployments. The requirements clearly state experience with LLMs and ML infere…

ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the fronti…


San Francisco · USA · greenhouse · 2026-06-16

Why this is a genuine AI job: The role is explicitly focused on building and optimizing the model serving layer for voice applications, working with state-of-the-art voice models and inference engines. The responsibilities are heavily centered around ML engineering tasks.

About the Role Together AI is building the best inference infrastructure for voice applications. Our Voice AI platform powers production-grade, real-time voice agents and applications — serving speech-to-text and text-to-speech models with best-in-class latency and reliability. We're looking for a…


San Francisco · USA · greenhouse · 2026-06-16

Why this is a genuine AI job: The role is entirely focused on building and optimizing the model serving layer for voice applications, including LLMs, STT, and TTS. It requires deep expertise in ML engineering, inference optimization, and GPU utilization. The responsibilities and requireme…

About the Role Together AI is building the best inference infrastructure for voice applications. Our Voice AI platform powers production-grade, real-time voice agents and applications — serving speech-to-text and text-to-speech models with best-in-class latency and reliability. We're looking for a…


Staff Software Engineer - GenAI inference

Databricks · San Francisco, California

95/100
San Francisco, California · USA · greenhouse · 2026-06-16

Why this is a genuine AI job: The role is explicitly focused on the architecture, development, and optimization of the inference engine for large language models (LLMs). The job description details deep technical requirements related to ML inference internals, GPU programming, and distrib…

P-1285 About This Role As a staff software engineer for GenAI inference, you will lead the architecture, development, and optimization of the inference engine that powers Databricks Foundation Model API. You’ll bridge research advances and production demands, ensuring high throughput, low latency,…


Software Engineer - GenAI inference

Databricks · San Francisco, California

95/100
San Francisco, California · USA · greenhouse · 2026-06-16

Why this is a genuine AI job: The role is explicitly focused on designing, developing, and optimizing the inference engine for Databricks' Foundation Model API (LLMs). The job description details deep technical work with model architectures, optimization, and distributed systems – all cor…

P-1284 About This Role As a software engineer for GenAI inference, you will help design, develop, and optimize the inference engine that powers Databricks’ Foundation Model API. You’ll work at the intersection of research and production, ensuring our large language model (LLM) serving systems are f…


Senior Applied AI Engineer

Databricks · Belgrade, Serbia

95/100
Belgrade, Serbia · USA · greenhouse · 2026-06-16

Why this is a genuine AI job: The role explicitly focuses on building and deploying ML/AI models and systems, improving the performance of AI-powered products, and working with foundational models. The description highlights core AI/ML engineering tasks.

P-1439 As a Senior Applied ML/AI Engineer at Databricks, you will apply machine learning and optimization algorithms to improve the usability and efficiency of the current AutoML and several other user-facing products that will benefit from better classification, regression, forecasting, and recomm…


San Francisco, California · USA · greenhouse · 2026-06-16

Why this is a genuine AI job: The role is explicitly focused on building a generative AI platform, including all stages of the ML lifecycle (data generation, training, evaluation, serving, agent-building). The job description heavily emphasizes ML and AI technologies.

P-984 Founded in late 2020 by a small group of machine learning engineers and researchers, Mosaic AI enables companies to securely fine-tune, train and deploy custom AI models on their own data, for maximum security and control. Compatible with all major cloud providers, the Mosaic AI platform prov…


New York City, New York · USA · greenhouse · 2026-05-26

Why this is a genuine AI job: The role deals directly with developing and leading products that cover LLM endpoints, Model Serving, and AI governance. The title and description show clear technical and conceptual AI work, particularly in the area of LLM Inferenc…

RDQ127R255 At Databricks, we are passionate about enabling data teams to solve the world’s toughest problems — from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the world’s best data and AI infrastructu…


Dallas, Texas · USA · greenhouse · 2026-05-26

Why this is a genuine AI job: The role involves in-depth work with Data & AI technologies, including analyzing and optimizing AI workflows, model serving, Spark, and Delta. The job is strongly focused on developing, maintaining, and supporting complex Data & AI systems…

P-1398 Note: this is a hybrid role and requires ~3 days in the office in Plano, Tx. Mission As a Staff Data & AI Technical Solutions Engineer, you will personally drive and mentor others in producing Data & AI technical solutions for any issues reported by customers - including deep diving into pro…


Brazil · USA · greenhouse · 2026-05-26

Why this is a genuine AI job: The role involves in-depth work with AI workflows, data pipelines, Spark, Delta, Model Serving, and ML/AI applications. The TSE analyzes code-level problems, optimizes performance, and supports customers in using AI technologies on the Databr…

P-993 Mission As a Data & AI Technical Solutions Engineer, you play a critical role by helping customers debug and maintain stable production data pipelines, AI workflows, and more using the Databricks platform. You will develop product expertise in a couple of areas by advising a broad set of cust…
