AI Performance Software Engineer Job at Signify Technology, San Francisco, CA

SFY4d1pVSG1JV3FDeHdPUTdPSXhpcm9mbkE9PQ==
  • Signify Technology
  • San Francisco, CA

Job Description

AI Performance Engineer – CUDA & PyTorch Focus

Location: San Fransisco, CA

Compensation: $200,000-$300,000

A stealth-mode AI systems company is reimagining how large-scale inference is done. With generative AI workloads scaling rapidly, inference efficiency has become a critical bottleneck. We're building an integrated hardware-software platform that brings breakthrough performance and usability to production-scale LLM applications.

This is an opportunity to work on a highly technical team spun out of top-tier academic research, focused on the cutting edge of AI, distributed systems, and performance optimization.

What You’ll Do:

  • Drive core research and implementation of performance optimizations for modern AI models
  • Implement advanced techniques like FlashAttention, KV caching, quantization, and model compression
  • Design and build scalable, distributed compute strategies across GPU-based systems
  • Profile, benchmark, and optimize CUDA kernels and AI runtime performance across inference stacks
  • Work across frameworks like PyTorch, ONNX, and vLLM to improve end-to-end efficiency

What We're Looking For:

  • Strong background in CUDA and low-level GPU performance tuning
  • Proven experience building with PyTorch and deploying high-performance ML models
  • Proficiency in Python and C++
  • Experience with large-scale distributed systems in cloud environments (AWS, GCP, or Azure)
  • Exposure to AI compilers or frameworks like MLIR is a plus
  • Interest in system design, scalability, and accelerating LLM workloads in real production environments

If you’ve spent your time making large models faster, leaner, and more efficient—and want to solve hard technical problems at the core of GenAI infrastructure—this role is for you.

Reach out to learn more.

Job Tags

Similar Jobs

Dawson

Evening Recruiter Job at Dawson

 ...Recruiter Base Salary + Commission Permanent Opportunity Monday-Friday, 8:00am-5:00pm Grandview, Ohio (onsite) The main...  ...attitude, willingness to learn, and dependability Some local travel as needed to conduct client and candidate meetings will be required... 

Liberty

Engineer, Planning Job at Liberty

 ...responsibilities. Provide technical support and mentorship to lower-level engineers, contributing to their development and ensuring project continuity. Collaborate with entry-level and consulting engineers to coordinate design integrity, material procurement, and... 

Lakeshore Talent

Non-Profit Case Manager (Hybrid) Job at Lakeshore Talent

 ...electronic health records (EHR). Valid drivers license, clean driving record (MVR), and reliable transportation. Must currently live in Colorado. Benefits: Competitive pay with performance-based increase after 6 months. Fully paid medical benefits we... 

Tandym Group

Training Instructor Job at Tandym Group

 ...financial services company in Florida is actively seeking a hardworking professional to join their staff in Pensacola as a Training Instructor. About the Opportunity: Assignment Length: open-ended contract Schedule: Monday to Friday (onsite 2 days a week) Hours... 

Lumiere Systems

Power BI Developer Job at Lumiere Systems

 ...Job Description/Responsibilities/Duties Below: Power BI developer builds complex dimensional data models and reports from the bottom up, visualizes compelling data stories on the report canvas, collaborates with other teams to engineer revolutionary agency wide...