This page was automatically translated and may contain errors. View in English.
Qualcomm

Senior Model Accuracy Development and Test Engineer (Datacentre AI Engineering)

Qualcomm

Riyadh, Riyadh Province, Saudi Arabia · पूरा समय

अप्लाय करने वाले प्रथम बनिए

अनुभव
2–10 yrs
वेतन
उद्घाटन
1
की तैनाती
4 पहले
कार्य मोड
कार्यालय में हूँ
शिक्षा
Bachelor's degree
Eligibility
Applicants with the specified degrees and software engineering/programming experience may apply. The role also considers candidates with equivalent experience who can demonstrate the required competencies. Candidates needing workplace or recruitment-process accommodations due to disability are supp…
Resume
Required to apply

Where you'll work

नौकरी का विवरण

About the Company

Qualcomm Middle East Information Technology Company LLC is expanding its footprint in Riyadh and is looking for data centre engineering talent to help strengthen its regional infrastructure. The company is investing in advanced computing and data centre capability to support AI, cloud, and next-generation connectivity as Saudi Arabia advances its Vision 2030 digital agenda.

Role Overview

This position is for a senior inference accuracy engineer focused on designing, building, and validating the accuracy of deep learning models running at scale. The work involves deep-dive accuracy investigation, troubleshooting, evaluation, and recovery during inference on large data-centre hardware systems. Success in this role requires strong analytical thinking, advanced Python coding ability, and practical experience with inference workflows.

Key Responsibilities

  • Set and apply accuracy KPIs across different precision configurations.
  • Create scalable Python tools and automated workflows for accuracy measurement.
  • Build accuracy-safe optimizations for inference stacks such as TensorRT, ONNX Runtime, AITemplate, and Triton.
  • Develop and maintain automated accuracy validation pipelines across ONNX, TensorFlow, and PyTorch.
  • Design reusable plugins for data preprocessing, output post-processing, and metric computation.
  • Run end-to-end accuracy testing for large-scale model families including LLMs, vision systems, and diffusion models.
  • Check accuracy behavior under FP32, FP16, and INT8 settings.
  • Inspect model behavior with attention to architecture details such as layers, attention blocks, and parameter setup.
  • Spot architecture-related accuracy regressions and recommend mitigation approaches.
  • Investigate issues caused by preprocessing drift, tokenization mismatches, operator fallback, and quantization side effects.
  • Compare accuracy across hardware targets, firmware releases, and runtime backends.
  • Perform slice-based analysis using variables such as batch size, concurrency, sequence length, and domain changes.
  • Plan and execute accuracy recovery experiments using fine-tuning, calibration, and hyperparameter tuning.
  • Debug failures by tracing root causes through data preparation, model layers, quantization, and deployment pipelines.
  • Benchmark results across different hardware and software combinations and translate findings into clear actions.
  • Document procedures, keep dashboards updated, and share accuracy outputs with stakeholders.

Required Skills and Experience

  • Strong foundation in AI/ML model evaluation and accuracy metrics.
  • Good understanding of model families such as transformers, CNNs, RNNs, and MoE, and how they affect accuracy.
  • Practical exposure to large language models and generative AI accuracy validation.
  • Working knowledge of inference runtimes including TensorRT, ONNX Runtime, and Triton.
  • Understanding of quantization approaches such as INT8, FP8, and INT4, along with calibration, QAT, and accuracy trade-offs.
  • Experience converting models through graph workflows such as PyTorch to ONNX to backend engines.
  • Hands-on experience building automated accuracy pipelines.
  • Strong Python programming skills plus familiarity with ML frameworks such as ONNX Runtime, TensorFlow, and PyTorch.
  • Ability to analyze accuracy statistically and present findings with visualization tools.
  • Experience designing experiments to restore accuracy and diagnose failures effectively.
  • Knowledge of mixed-precision workflows and quantization methods.
  • Strong analytical and debugging skills for isolating complex model accuracy issues.

Preferred Experience

  • Exposure to video generation model validation and multimodal benchmarking.
  • Experience with data-centre accelerators such as NVIDIA A100, H100, B200, AI100 Ultra, Gaudi, or TPU.
  • Familiarity with LLM evaluation tools such as lm-eval, HELM, and synthetic benchmark suites.
  • Knowledge of distributed deployment environments such as Kubernetes and cloud inference services.

Qualifications

  • Bachelor’s or Master’s degree in Engineering, Machine Learning, AI, Information Systems, Computer Science, or a related discipline.
  • 4 to 10 years of software engineering or related experience.
  • 4 to 10 years of programming experience in languages such as C, C++, or Python.

Benefits and What’s Offered

  • Base salary with housing and transport allowance.
  • Stock grants (RSUs) and performance-linked bonus.
  • 16 weeks of fully paid maternity leave.
  • 6 weeks of fully paid paternity leave.
  • Employee stock purchase programme.
  • Child education allowance.
  • Relocation and immigration assistance, if required.
  • Life and medical insurance coverage.
  • Live+ Well reimbursement for health and recreational membership fees.

Minimum Eligibility

Candidates are expected to meet one of the following education-and-experience combinations:

  • Bachelor’s degree in Engineering, Information Systems, Computer Science, or a related field with at least 4 years of software engineering or related experience.
  • Master’s degree in Engineering, Information Systems, Computer Science, or a related field with at least 3 years of software engineering or related experience.
  • PhD in Engineering, Information Systems, Computer Science, or a related field with at least 2 years of software engineering or related experience.
  • At least 2 years of programming experience in languages such as C, C++, Java, or Python.
  • Candidates with equivalent experience may also be considered if they can demonstrate the capability to perform the core duties and meet the required competencies.

Additional Notes

The employer is an equal opportunity organisation and can provide reasonable accommodation during the hiring process for candidates with disabilities. Workplace access and accommodations are supported where required. Employees are also expected to follow all applicable company policies and procedures, including those relating to security and the protection of confidential and proprietary information, where permitted by law.

Staffing and recruiting agencies are not authorised to submit applications or profiles for this role. Unsolicited resumes or applications from agencies will not be accepted, and the company is not responsible for any associated fees. For role-related enquiries, candidates may contact the careers team.

यदि आपको उत्तर चाहिए तो इसे छोड़ दें — हम इसका उपयोग किसी और चीज के लिए नहीं करेंगे।

ब्राउज़ करने के लिए क्लिक करेंड्रैग एंड ड्रॉप करें, या चिपकाएं एक स्क्रीनशॉट

PNG, JPG, GIF, MP4, WebM, MOV · प्रत्येक फ़ाइल का अधिकतम आकार 20MB · अधिकतम 5 फ़ाइलें