Machine Learning Engineer Job at Evolve Group, Hayward, CA

SGxvN1pFYnVKMnlDeEFhUTYrSTZncllibVE9PQ==
  • Evolve Group
  • Hayward, CA

Job Description

Machine Learning Engineer

Tech start-up

San Fransisco based

We’ve partnered with one of the most ambitious and technically rigorous AI research labs in the world. Based in San Francisco, this team is building foundation models entirely from scratch.

They are now hiring ML Infrastructure Engineers to design and scale the systems that power large-scale, distributed model training. If you’ve built infrastructure that runs across hundreds of GPUs, thrive under technical complexity, and want to work side-by-side with elite AI researchers — this is the role.

Key Responsibilities:

  • Build and scale distributed training systems for large-scale model training across LLMs, vision, and robotics.
  • Set up and run large-scale training across many GPUs using tools like Kubernetes, DeepSpeed, and FSDP.
  • Troubleshoot system issues (GPU errors, network problems) and build tools to monitor and recover from failures.
  • Optimize PyTorch pipelines, sharding, and sampling strategies.
  • Collaborate closely with researchers to support novel model training at scale.

Requirements:

  • 3–15 years in ML infrastructure, systems, or research engineering roles.
  • Proven experience scaling distributed training for large models.
  • Strong with PyTorch, CUDA, NCCL, Kubernetes.
  • Familiar with setting up distributed training clusters.
  • Deep understanding of PyTorch dataloaders, data sharding, and sampling.
  • Strong communicator with a collaborative, mission-driven mindset.

This is a fully in-person role based in San Francisco , it's ideal for engineers excited to build at the edge of what's possible in AI.

Job Tags

Immediate start,

Similar Jobs

LD Consulting

Sr. Energy Efficiency Program Management Consultant Job at LD Consulting

 ...Job Title: Sr. Energy Efficiency Program Management Consultant Location: Portland, Oregon Job Type: Full-Time Work from home part-time (must reside in Oregon or Washington state) Travel: 10% travel to client sites Key Details Pay: $125,000-$150,000... 

Adfero

Marketing Copywriter- Full Time Job at Adfero

 ...Adfero seeks a full-time copywriter to develop compelling, creative copy and liaise across our talented account and creative teams. In this role, you will: Lead copywriting across multiple public affairs and issue advocacy-focused accounts. Develop messaging... 

Inserso Corporation

Market Research Analyst Job at Inserso Corporation

 ...Analyst to support our business development activities. This individual will work closely with our business development team to research target customers, opportunities, incumbent vendors, and incumbent staff. They will identify and investigate upcoming opportunities and... 

Prysmian

Maintenance Electrician Job at Prysmian

 ...manufactures thousands of miles of underground and submarine cables and systems for power transmission and distribution, as well as medium low voltage cables for the construction and infrastructure sectors. We also produce a comprehensive range of optical fibers, copper cables... 

Masis Professional Group

Senior Construction Project Manager Job at Masis Professional Group

 ...The Administrator/Project Manager oversees the planning, design, construction, commissioning, and operation of a large-scale water infrastructure project delivering treated water to multiple communities. The project is a large capital investment extending from Reservoir...