Member of Technical Staff (Data Intelligence) at Reka AI — AI Jobs | Neura Market | Neura Market
    Neura Market
    Neura Market
    Marketplace
    Directories
    Resources
    AI JobsMember of Technical Staff (Data Intelligence)
    Reka AI

    Member of Technical Staff (Data Intelligence)

    Reka AI

    US, UK, Singapore, Remote

    Marketplace

    • Prompts
    • Workflows
    • Agents Store
    • Bundles
    • Templates
    • Categories
    • Marketplace

    Directories

    • AI Tools Directory
    • ChatGPT
    • Claude
    • Gemini
    • Cursor
    • Grok
    • DeepSeek
    • Perplexity
    • CoPilot
    • Midjourney
    • Stable Diffusion
    • MCP Servers
    • .md Directory
    • All Directories

    Free Tools

    • AI Text Humanizer
    • AI Content Detector
    • Workflow Generator
    • Model Comparison
    • AI Pricing Calculator
    • AI Benchmarks
    • ROI Calculator
    • All Free Tools

    Resources

    • AI News
    • Blog
    • AI Models
    • Integrations
    • Alternatives
    • Resource Library
    • Documentation

    Community

    • AI Jobs
    • AI Events
    • AI Companies
    • Start Selling
    • Creator Guide
    • Advertise
    • Affiliates

    Company

    • About
    • Contact
    • Help
    • Careers
    • Pricing
    • Terms
    • Privacy
    • License
    • DMCA

    Stay Updated

    Get the latest AI tools and insights delivered to your inbox.

    Neura Market Logoneuramarket

    © 2026 Neura Market. All rights reserved.

    Senior-level / Expert
    Full-time
    Remote
    5/14/2026
    Apply

    About This Role

    In this role, you’ll work closely with model researchers, data infrastructure engineers, and cross-functional partners to make sure our data is high quality and can be produced at petabyte scale in a reliable, efficient way. From understanding how data choices show up in model behavior, to building processing pipelines and running the compute behind them, you’ll help ensure our models are trained on the best data we can get.

    What you’ll do

    • Work with model researchers to define what “good data” means for our models, including quality metrics, validation checks, and acceptance thresholds

    • Explore open source datasets and create internal ones most suitable to build fundamental World Models

    • Build algorithms for automated data quality assessment, data domain mixtures, and domain adaptation from synthetic to real data.

    • Track datasets, metadata, provenance, and versions so experiments are reproducible and it’s clear what data went into which training and evaluation runs

    • Own CI/CD and development tooling for the data stack (GitHub, Python, PyTorch), and automate repetitive workflows to reduce friction

    • Track and optimize throughput, storage, and compute utilization across pipelines and related assets

    What we’re looking for

    • Strong ML and deep learning fundamentals with experience building and operating large-scale data and/or compute systems

    • Comfortable moving between research questions and production engineering: you can dig into data, run analyses, and also ship reliable systems

    • Demonstrated research experience with data compositions, quality, and dataset releases

    • Ability to design and execute experiments with convincing unbiased outcomes

    • Practical experience with distributed processing and orchestration (Spark, Ray, Airflow, or equivalents)

    • Solid Python skills, and familiarity with the tooling around modern model training workflows (datasets, checkpoints, experiment tracking)

    • Strong instincts around data quality: how to measure it, how to monitor it, and how to prevent regressions as things scale

    • Able to work in a fast-moving environment, prioritize what matters, and communicate clearly with both researchers and engineers

    • Bonus: experience with large video datasets, dataset curation for training, or building internal tooling for evaluation/analysis in ML environments

    Tasks

    • •Work with model researchers to define what “good data” means for our models, including quality metrics, validation checks, and acceptance thresholds
    • •Explore open source datasets and create internal ones most suitable to build fundamental World Models
    • •Build algorithms for automated data quality assessment, data domain mixtures, and domain adaptation from synthetic to real data.
    • •Track datasets, metadata, provenance, and versions so experiments are reproducible and it’s clear what data went into which training and evaluation runs
    • •Own CI/CD and development tooling for the data stack (GitHub, Python, PyTorch), and automate repetitive workflows to reduce friction
    • •Track and optimize throughput, storage, and compute utilization across pipelines and related assets
    • •Strong ML and deep learning fundamentals with experience building and operating large-scale data and/or compute systems
    • •Comfortable moving between research questions and production engineering: you can dig into data, run analyses, and also ship reliable systems
    • •Demonstrated research experience with data compositions, quality, and dataset releases
    • •Ability to design and execute experiments with convincing unbiased outcomes
    • •Practical experience with distributed processing and orchestration (Spark, Ray, Airflow, or equivalents)
    • •Solid Python skills, and familiarity with the tooling around modern model training workflows (datasets, checkpoints, experiment tracking)
    • •Strong instincts around data quality: how to measure it, how to monitor it, and how to prevent regressions as things scale
    • •Able to work in a fast-moving environment, prioritize what matters, and communicate clearly with both researchers and engineers
    • •Bonus: experience with large video datasets, dataset curation for training, or building internal tooling for evaluation/analysis in ML environments

    Skills & Tech Stack

    PythonPyTorchApache SparkCI/CD

    Location

    Region

    Asia/Pacific

    Country

    Singapore

    Topics

    Technical

    Related AI Jobs

    Stability AI

    Senior Product Engineer, Growth & Lifecycle Infrastructure - Music & Audio

    Stability AI·Full-time·Los Angeles, CA or Remote (United States)
    Technical
    Stability AI

    Junior Software Engineer

    Stability AI·Full-time·United States or Canada
    Technical
    Reka AI

    Member of Technical Staff (Product Engineer)

    Reka AI·Full-time·US, UK, Remote
    Technical
    Stability AI

    Senior Site Reliability Engineer

    Stability AI·Full-time·United States
    Technical
    Stability AI

    Generative AI Inference Engineer

    Stability AI·Full-time·United States
    Technical
    Stability AI

    Director of Data Strategy & Operations

    Stability AI·Full-time·United States
    Technical
    ← Back to all jobs