AI Inference Engineer Job at Signify Technology, Santa Clara, CA

VFdUbVM2MGltbTNPM25kTnZkUGJuY1FiSkE9PQ==
  • Signify Technology
  • Santa Clara, CA

Job Description

AI Inference Engineer – Stealth Startup | San Fransisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.

Job Tags

Similar Jobs

UPMC - Pittsburgh Medical Center

Graduate Nurse Rotational Program: Surgical Track (Spring 2026 Graduates) Job at UPMC - Pittsburgh Medical Center

 ...Job Description Graduate Nurse Rotational Program: Surgical Track \n \n Job Title: Graduate Nurse Rotational Program: UPMC Passavant - Surgical Track (Spring 2026 Graduates)\n \n Description:\n \n Are you graduating in Spring 2026 from nursing school... 

Evergreen Talent Partners

Trainee Technician Job at Evergreen Talent Partners

Junior HVAC Technician $32 per hour + Overtime + Mon-Thurs (12- hour day shift) We are seeking a skilled and dedicated HVAC Technician to join our team. The ideal candidate will have a strong background in heating, ventilation, and air conditioning systems. ...

HB Travels

Work From Home Vacation Planner Job at HB Travels

 ...Work from Home as a Vacation Planner Turn Your Passion for Travel into a Career Do you have a passion for travel and enjoy helping others plan their ideal getaways? We're looking for motivated individuals to join our team as Work-from-Home Vacation Planner . This... 

Plona Partners

Litigation Legal Secretary, Big Law Job at Plona Partners

 ...Secretary, Litigation Support Model: 7 Attorneys to 1 Professional Assistant Target...  ...Creates, edits, formats and proofreads documents. Prepares legal documents for e-Filing...  ...specific to Florida matrimonial matters. Reviews proformas and edits bills according to... 

PDS Health

Dentist, Associate Job at PDS Health

 ...experienced professionals. If you're ready to take your career to the next level and gain valuable experience, apply today! The Associate Dentist role is for any qualified individual, including recent dental school graduates. You will be given a unique opportunity to provide...