Group 1160449358(2).png

Company Description

Apex Compute is a trailblazer in redefining AI compute architectures. Our mission is to push the boundaries of machine learning performance by developing innovative hardware and software solutions. We pride ourselves on fostering a culture of innovation, collaboration, and excellence, and offer company stock options so that every team member shares in our long-term success.

Role Description

We are seeking a highly skilled ML Performance Engineer to join our dynamic team full-time. This hybrid role is based in Mountain View, CA, with some flexibility for remote work. In this role, you will focus on optimizing transformer architectures and other advanced ML models to achieve breakthrough performance improvements. You will leverage your expertise in C/C++ and Python, combined with a deep understanding of compiler technologies, memory scheduling, and numeric operations. Experience with MLIR, StableHLO, and the llama.cpp repository is highly desirable. Your contributions will directly impact the efficiency and scalability of our AI solutions.

Responsibilities

Qualifications

Why Join Us?

You’ll have the opportunity to work on revolutionary AI hardware alongside a talented, passionate, and ambitious team. This role provides a chance to grow your technical skills, gain valuable hands-on experience, and contribute to groundbreaking innovations in AI compute hardware. Additionally, as part of our team, you’ll be eligible for company stock options, allowing you to share in our long-term success.

If you think you are a good fit, please send your resume to [email protected].