Inference Compiler Engineer
Job Description
REQUIREMENTS
- Degree in Engineering or Computer Science, or equivalent experience and evidence of exceptional ability
- Strong experience with Python and C++
- Experience with PyTorch and the HuggingFace Transformers library
- Knowledge of and experience with Large Language Models (understanding Transformer architecture variations, generation cycle, etc.)
RESPONSIBILITIES
- Analysis of new models from the generative AI field and understanding of their impact on the compilation stack
- Implementation of compiler and frontend features to support new models and to improve inference characteristics and the Cerebras user experience
- Collaboration with other teams throughout feature implementation
- Research on new methods for model optimization to improve Cerebras inference