Generative AI Inference Engineer at Stability AI | AI Jobs | Neura Market

    Generative AI Inference Engineer

    Stability AI

    United States

    Senior-level / Expert
    Full-time
    On-site
    3/20/2026

    About the role:

    We are seeking passionate Machine Learning Engineers to join our Inference team, focusing on the creative applications of generative AI models. The ideal candidate will have substantial experience developing and running inference for multi-modal models. A deep understanding of diffusion model architectures and familiarity with workflow tools like ComfyUI are a big plus. You will be expected to leverage and push the boundaries of state-of-the-art inference optimization techniques for multi-modal generative models. This role offers the opportunity to work alongside top researchers and engineers, utilizing cutting-edge high-performance computing resources to make a significant impact in the rapidly evolving field of generative AI.

    Responsibilities:

    • Lead the design and development of customer-facing multi-modal ML inference systems.
    • Work with the Platform and Inference teams on building inference systems for the next generation of models, where you will work on areas such as optimization, model tuning and deployment.
    • Partner with leading cloud providers to deliver hosted Stability AI inference solutions.
    • Be a strategic thought partner for leaders across the organization on driving business impact through machine learning.
    • Be part of the team that brings new Stability models and pipelines into existence.
    • Prototype and productionize inference platform improvements and new features.

    Qualifications:

    • 7+ years of experience productionizing machine learning systems, including inference pipeline development
    • Expert-level knowledge of writing and running Python services at scale
    • 5+ years working with the Python scientific stack, PyTorch, and at least one high-performance inference framework (e.g., Triton or TensorRT)
    • Deep understanding of diffusion model architectures
    • Experience profiling and optimizing deep neural networks on NVIDIA GPUs, using profiling tools such as NVIDIA Nsight
    • Experience with Python-based image manipulation/encoding/decoding frameworks, such as OpenCV
    • Experience deploying to cloud orchestration systems such as Kubernetes and cloud providers such as AWS, GCP, and Azure
    • Experience with Docker
    • Ability to rapidly prototype solutions and iterate on them with tight product deadlines
    • Strong communication, collaboration, and documentation skills
    • Experience with the open-source ML ecosystem (Hugging Face, W&B, etc.)

    Equal Employment Opportunity:

    We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses.

    Skills & Tech Stack

    Python, PyTorch, Kubernetes, Docker, AWS, GCP, Azure, Hugging Face, Diffusion Models, Triton

    Roles

    Engineer

    Location

    Region

    North America

    Country

    United States

    Topics

    Technical
