Gen AI Data Scientist
RiDiK (a Subsidiary of CLPS. Nasdaq: CLPS)
Dhahran, Eastern Province, Saudi Arabia · Full Time
- Experience: 5+ yrs
- Salary: not specified
- Openings: 1
- Posted: 1 day ago
- Work mode: In office
- Eligibility: Candidates who can join immediately or within 15 days' notice and are available for an on-site role in Dhahran, Saudi Arabia.
- Resume: Required to apply
Job description
Role Overview
This opening is for a Gen AI Data Scientist based in Dhahran, Saudi Arabia, and requires at least 5 years of experience. The expected notice period is immediate to 15 days. The position is full-time and on-site.
What You Will Do
- Build, train, and deploy machine learning and deep learning solutions for time series forecasting, NLP use cases, and sensor or IoT datasets.
- Adapt and deploy advanced models, including large language models, using Hugging Face and fine-tuning approaches such as LoRA, QLoRA, and full fine-tuning.
- Apply ensemble techniques such as bagging and boosting, including XGBoost, LightGBM, and CatBoost, for structured and sensor-driven data.
- Build and refine neural network architectures such as CNNs, RNNs, Transformers, and other attention-based models with PyTorch.
- Measure model effectiveness using suitable evaluation metrics and validation methods.
- Work with smart sensor and IoT data to derive features and develop predictive systems for anomaly detection, failure prediction, and optimization.
- Serve model inference through REST APIs using FastAPI or comparable frameworks.
- Partner with DevOps and engineering teams to move models into production using Docker, CI/CD workflows, and cloud environments.
- Support and enhance data pipelines, feature engineering processes, and production model monitoring.
Requirements
- At least 5 years of experience in data science, machine learning, or related applied AI work.
- Strong hands-on knowledge of deep learning, time series forecasting, NLP, and sensor or IoT analytics.
- Practical experience with Hugging Face and techniques such as LoRA, QLoRA, and full fine-tuning of LLMs.
- Ability to work with ensemble methods like XGBoost, LightGBM, and CatBoost.
- Proficiency in PyTorch and familiarity with CNNs, RNNs, Transformers, and attention mechanisms.
- Experience building model validation, testing, and performance evaluation workflows.
- Understanding of feature engineering, data pipelines, and production monitoring for ML systems.
- Knowledge of API development with FastAPI or similar web frameworks.
- Experience collaborating with DevOps teams, including Docker, CI/CD, and cloud deployment practices.
Additional Information
Notice period requirement: immediate to 15 days.
Location: Dhahran, Eastern Province, Saudi Arabia.
Employer: RiDiK, a subsidiary of CLPS (Nasdaq: CLPS).
Employment Details
This is a full-time, on-site role.