Staff Machine Learning Systems Engineer

New

Skills

Design end-to-end ML model lifecycle patterns (MLOps) to boost velocity Zero-to-one dev and support of a graph ML codebase/platform Collaborate on performance tuning: faster training, efficiency, GPU costs Optimize batch data processing with Beam, Spark, Ray Data Architect pipelines for billions of nodes and edges in graph data 8+ years in ML infrastructure incl. training and deployments Memory and GPU profiling for ML optimization Cloud-based ML platform experience: GCP BigQuery, GCS, Terraform MLOps tools: MLflow or Wandb, experiment tracking and model serving Proficiency: Python, PyTorch, TensorFlow Distributed training with Ray and Kubernetes Equity in RSUs Potential commission eligibility Medical, dental, and vision insurance 401(k) with employer matching Generous vacation and parental leave Reddit careers page for full benefits

No forms. Your profile is generated instantly.

Job Type: Remote

Salary: Not Disclosed

Experience: Entry

Duration: Months

Share this job: