Inference Compiler Engineer

February 16, 2026
Application ends: May 17, 2026

Job Description

REQUIREMENTS

  • Degree in Engineering or Computer Science, or equivalent experience with evidence of exceptional ability
  • Strong experience with Python and C++
  • Experience with PyTorch and the Hugging Face Transformers library
  • Knowledge of and experience with Large Language Models (understanding of Transformer architecture variations, the generation cycle, etc.)

RESPONSIBILITIES

  • Analysis of new models from the generative AI field and of their impact on the compilation stack
  • Implementation of compiler and frontend features to support new models and to improve inference characteristics and the Cerebras user experience
  • Collaboration with other teams throughout feature implementation
  • Research on new methods for model optimization to improve Cerebras inference

