Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Engineering Member - Fault Tolerance image - Rise Careers
Job details

Engineering Member - Fault Tolerance

Job Summary

A company is looking for a Member of Engineering focused on pre-training and inference fault tolerance.

Key Responsibilities
  • Identify, study, and troubleshoot hardware problems during training at scale
  • Minimize GPU idle time during faults, both operationally and strategically
  • Design and develop tools and add-ons to accelerate training recovery
Required Qualifications
  • Strong engineering background with programming experience in Linux API and Linux kernel
  • Basic understanding of Large Language Models (LLM) and deep learning fundamentals
  • Proficiency in Python (PyTorch), C/C++, and CUDA API
  • Knowledge of distributed systems, reliability, and fault-tolerance
  • Experience with NCCL and modern development tools

Average salary estimate

$0 / YEARLY (est.)
min
max
$0K
$0K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs
Photo of the Rise User
Posted 3 days ago
Photo of the Rise User
Posted 2 days ago
Photo of the Rise User
Remote Jobs Remote No location specified
Posted 3 days ago
Photo of the Rise User
Posted 11 days ago
Photo of the Rise User
Remote Jobs Remote No location specified
Posted 4 days ago
Photo of the Rise User
Remote Jobs Remote No location specified
Posted 8 days ago
Photo of the Rise User
Remote Jobs Remote No location specified
Posted 2 days ago
Photo of the Rise User
Posted 9 days ago
Photo of the Rise User
Remote Jobs Remote No location specified
Posted 7 days ago
Photo of the Rise User
Posted 10 days ago
Photo of the Rise User
Posted 11 days ago
MATCH
VIEW MATCH
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
No info
EMPLOYMENT TYPE
Full-time, remote
DATE POSTED
August 21, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!