Careers

Embedded AI Engineer

Location: USA, India

Company: QFocus Technologies, QFocus AI Private Limited

Domain: Embedded software, artificial intelligence

Description

We are seeking an experienced Embedded AI Engineer to join our team in validating PyTorch-based Large Language Models (LLMs) using CUDA SDK APIs. The successful candidate will be responsible for debugging, extending, and replacing the underlying CUDA code to ensure seamless functionality on our company-specific AI processors.

Key Responsibilities

  • Validate PyTorch-based LLMs on company-specific AI processors using CUDA SDK APIs
  • Debug and troubleshoot issues related to CUDA code integration with PyTorch models
  • Extend and modify CUDA code to optimize performance on company-specific AI processors
  • Replace existing CUDA code with custom implementations to meet specific requirements
  • Collaborate with cross-functional teams to ensure successful integration of LLMs with company-specific AI processors
  • Develop and maintain validation frameworks and tools for PyTorch-based LLMs
  • Analyze and optimize the performance of LLMs on company-specific AI processors

Requirements

  • Bachelor's or Master's degree in Computer Science, Electrical Engineering, or related fields
  • Strong experience with CUDA programming and PyTorch framework
  • In-depth knowledge of deep learning models, particularly Large Language Models (LLMs)
  • Proficiency in C++ and Python programming languages
  • Experience with debugging and troubleshooting complex software issues
  • Excellent problem-solving skills and attention to detail
  • Strong communication and collaboration skills

Nice to Have

  • Experience with AI processor architecture and design
  • Knowledge of other deep learning frameworks, such as TensorFlow