AI Inference Engineer Job at Signify Technology, Alameda, CA

SGxzMFkwSGhKV2lCeEFHZTdlbzFqcm9ibWc9PQ==
  • Signify Technology
  • Alameda, CA

Job Description

AI Inference Engineer – Stealth Startup | San Fransisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.

Job Tags

Similar Jobs

Worldwide Flight Services

Airline Ramp Agent - Part Time - San Diego Job at Worldwide Flight Services

 ...highly trained, highly skilled, and confident airport service professionals who are supported...  ...Services family and contribute to the timely delivery of cargo shipment, luggage, business...  ...care? Multiple options for both full and part-time employees!* Want WFS Employee... 

Medical Services of America

Registered Nurse Hospice Job at Medical Services of America

 ...We are excited to welcome you to our hospice team! At Medi we are passionate about putting patients first in everything we do. Join a team...  ...Wilkes) NC. As a member of the multidisciplinary team, the RN works under the general direction of the Director of... 

Emerging Minds Montessori Academy Inc

Assistant Head of School Job at Emerging Minds Montessori Academy Inc

 ...Manage staff during and oversee morning and afternoon car line Oversee orderliness of office and assist in maintaining safe and clean conditions on campus Be available to assist with sick children and contact parents as needed Meet regularly with HOS to inform... 

JUSTIN Vineyards & Winery

Harvest Cellar Worker Job at JUSTIN Vineyards & Winery

 ...FIJI water you can drink at work Wine Education Program Fun Work Environment Located in Paso Robles, JUSTIN Vineyards & Winery was founded in 1981 and is known for crafting world- class wines using Bordeaux grape varieties, including the iconic ISOSCELES blend... 

Leisure Care

Senior Living Sales Manager Job at Leisure Care

 ...Leisure Care managed communities, were all about inspiring older adults to live life to the fullestwhile making sure our team members do the same. With nearly 50 years of experience reimagining senior living, we offer careers that are meaningful, rewarding, and filled...