Member of Technical Staff - Inference Job at Acceler8 Talent, Palo Alto, CA

  • Acceler8 Talent
  • Palo Alto, CA

Job Description

Inference Software Engineer

About Us

We are at the forefront of AI innovation, driving scalable and efficient solutions for enterprise AI workloads. The Inference team focuses on expanding the capabilities of deployable GPU architectures, optimizing performance, and building tools for efficient operations. Our work currently targets inference, with potential expansion into fine-tuning in the future.

Responsibilities

As an Inference Software Engineer, you will:

  • Design, develop, and optimize GPU kernels from scratch and fine-tune existing kernels for both NVIDIA and non-NVIDIA platforms.
  • Leverage CUDA and NCCL for distributed networking on NVIDIA GPUs and extend solutions to other architectures.
  • Write and maintain code that distributes machine learning workloads across multi-node systems.
  • Contribute at lower levels (e.g., kernel or network programming).
  • Contribute at higher levels (e.g., Kubernetes, operators, and ML frameworks built on Kubernetes).
  • Collaborate with cross-functional teams to expand the footprint of deployable GPU architectures.
  • Optimize inference pipelines for performance and scalability.
  • Develop tools and workflows for efficient operation of GPU-based inference systems, with a future focus on supporting fine-tuning workloads.

Qualifications

We’re looking for someone with:

  • Expertise in GPU kernel programming, including experience in CUDA and familiarity with NCCL for distributed networking.
  • Proficiency in programming for distributed systems, with a strong foundation in building scalable ML solutions.
  • Experience working with GPU architectures beyond NVIDIA.
  • A solid understanding of systems engineering, with hands-on experience in one or more of the following areas:
      • Kernel or network-level programming for distributed systems.
      • Higher-level tools like Kubernetes, ML operators, or frameworks built on Kubernetes.
  • Proficiency in programming languages such as C++, Python, or similar.
  • Familiarity with ML frameworks like TensorFlow, PyTorch, or ONNX (a plus).
  • A Bachelor’s, Master’s, or Ph.D. in Computer Science, Electrical Engineering, or a related field (or equivalent experience).

Preferred Skills

  • Experience optimizing inference workloads across diverse GPU architectures.
  • Hands-on knowledge of distributed networking tools and protocols, especially in ML contexts.
  • Familiarity with quantization, pruning, or other model optimization techniques.
  • Experience with profiling tools such as NVIDIA Nsight or AMD ROCm tools.

Why Join Us?

  • Tackle cutting-edge challenges in GPU programming, distributed systems, and ML optimization.
  • Collaborate with a dynamic, innovative team driving the future of enterprise AI.
  • Enjoy competitive compensation and benefits, with significant opportunities for impact and growth.
