AI Performance Software Engineer Job at Signify Technology, San Francisco, CA

ZGxHSnVZbmc1YjVrNGNiQW9qV08wR3BodlE9PQ==
  • Signify Technology
  • San Francisco, CA

Job Description

AI Inference Software Engineer — Stealth AI Systems Startup

Base Salary Range: $200,000-$300,000

Location: San Fransisco (Onsite)

A stealth-stage AI systems company is redefining the performance boundaries of inference at scale. As generative AI models become larger and more complex, inference is emerging as the core bottleneck in production environments. This team is building a vertically integrated stack—from low-level GPU kernels to developer-friendly APIs—that dramatically improves inference speed, efficiency, and scalability.

Spun out of cutting-edge academic research and backed by deep industry experience across distributed systems, machine learning infrastructure, and hardware design, they are focused on enabling production-grade AI with minimal latency and maximal throughput. Their platform integrates seamlessly with modern ML frameworks like PyTorch and LangChain, allowing teams to deploy and monitor workloads in seconds.

They are looking for a Software Engineer focused on AI inference performance to help build and optimize the core runtime infrastructure powering these systems. This role sits at the intersection of deep learning, systems engineering, and GPU performance.

What You’ll Do

  • Implement and evaluate advanced inference optimization techniques, including quantization, KV caching, and FlashAttention
  • Design and build systems for distributing inference workloads efficiently across multiple GPUs and nodes
  • Profile and benchmark large-scale models to identify bottlenecks across the software and hardware stack
  • Optimize CUDA kernels and GPU memory usage to improve performance across a wide variety of AI models
  • Collaborate closely with research and systems engineers to push the limits of model serving infrastructure

What They’re Looking For

  • Proficiency with CUDA and experience writing or optimizing GPU kernels
  • Strong background in Python and C++ development
  • Hands-on experience with PyTorch, TensorFlow, or similar deep learning frameworks
  • Knowledge of distributed systems or model-serving platforms at scale
  • Familiarity with performance tuning, benchmarking tools, and profiling techniques

Nice to Have

  • Graduate degree in computer science, engineering, or a related field
  • Experience with compiler frameworks such as MLIR or Triton
  • Exposure to vLLM, ONNX, or custom model runtimes

This is a rare opportunity to work on core infrastructure for AI systems at a team solving some of the hardest performance challenges in the field.

Job Tags

Similar Jobs

Poudre School District

English Language Development Teacher Job at Poudre School District

 ...ELLs. 6. Communicate students progress, and needs with parents/guardians and other staff as needed. 7. Collaborate: a. With teachers, support personnel, administrators, and colleagues to enhance instruction and improve student outcomes. b. With district ELD program... 

Macdonald Devin Madden Kenefick & Harris, P.C.

Litigation Paralegal Job at Macdonald Devin Madden Kenefick & Harris, P.C.

 ...counselors who are dedicated to client service and legal excellence. Role Description We are seeking an on-site Litigation Paralegal to play a vital role in supporting our attorneys through various stages of the litigation process. The ideal candidate will have... 

Calabitek

Mainframe Tester Job at Calabitek

 ...Mainframe Tester Jersey City, NJ (5 days onsite) Hiring Manager notes: The ask is to be able to query Mainframe systems for data and validate the same with API responses. So, we need who have done backend testing (Non UI), understand Payment systems, they should... 

HAN Staffing

Lead Sharepoint Developer Job at HAN Staffing

 ...Key Responsibilities: Design and develop scalable SharePoint Framework (SPFx) web parts and extensions using React and Fluent UI, aligned with modern SharePoint Online architecture. Build responsive, accessible, and high-performance user interfaces integrated with... 

Glenmark Pharmaceuticals

In Process Quality Assurance Specialist Job at Glenmark Pharmaceuticals

 ...for ANY employer in the U.S. We are unable to sponsor or take over sponsorship of an employment Visa at this time. Glenmark Pharmaceuticals Inc., USA is a subsidiary of Glenmark Pharmaceuticals Ltd., a leading player in the discovery of new molecules both New...