AI Inference Engineer Job at Signify Technology, Alameda, CA

SGxzMFkwSGhKV2lCeEFHZTdlbzFqcm9ibWc9PQ==
  • Signify Technology
  • Alameda, CA

Job Description

AI Inference Engineer – Stealth Startup | San Fransisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.

Job Tags

Similar Jobs

OneCruit

Corporate Associate Job at OneCruit

 ...Qualifications: ~48 years of experience practicing corporate law with a strong emphasis on M&A and private equity transactions. ~ Deep understanding of corporate and transactional law, as well as current market practices and trends. ~ Strong drafting and... 

TimeWise Cleaning

Cleaning Professional (Full-Time) Job at TimeWise Cleaning

 ...Love where you work! At TimeWise Cleaning, we're not just another cleaning company we're a team that cares about people . Since 1994...  ...Perform residential cleaning services (dusting, vacuuming, mopping, kitchens, bathrooms, etc.) Follow checklists and company procedures to... 

The Oaks Club

Carpenter/Maintenance Technician Job at The Oaks Club

 ...long term employees at The Oaks Club.! This is a full time position (40 hours per week, 8 hour shifts). We are seeking a full time Carpenter to perform a variety of skilled work in facility maintenance and repair. This position requires extensive and precise finished and... 

Isthmus Partners, LLC

Equity Research Analyst Job at Isthmus Partners, LLC

 ...seriously and wake up every day aiming to do a little better in helping our clients reach their goals. Job Title: Senior Equity Research Analyst / Equity Research Analyst Location: Madison, WI Summary: Primarily responsible for monitoring owned securities... 

Insurance Resourcing LLC

Commercial Lines Account Manager: Work From Home for WA Residents Only Job at Insurance Resourcing LLC

 ...and know EPIC well. They will ship you all IT needed to do the job. They have a mix of employees that work in the office as well as remote. The agency pays 100% of your gold level benefits and 50% of your dependents. They also have a matching 401K. They would...