Senior ML Scientist (Optimization & Reinforcement Learning)

Toronto, ON

Senior ML Scientist (Optimization & Reinforcement Learning) – US/Canada

Role Overview

We seek a Senior ML Scientist to drive innovation in AI ML-based dynamic pricing algorithms and personalized offer experiences. This role will focus on designing and implementing advanced machine learning models, including reinforcement learning techniques like Contextual Bandits, Q-learning, SARSA, and more. By leveraging algorithmic expertise in classical ML and statistical methods, you will develop solutions that optimize pricing strategies, improve customer value, and drive measurable business impact.

Key Responsibilities

· Algorithm Development: Conceptualize, design, and implement state-of-the-art ML models for dynamic pricing and personalized recommendations.

· Reinforcement Learning Expertise: Develop and apply RL techniques, including Contextual Bandits, Q-learning, SARSA, and concepts like Thompson Sampling and Bayesian Optimization, to solve pricing and optimization challenges.

· AI Agents for Pricing: Build AI-driven pricing agents that incorporate consumer behaviour, demand elasticity, and competitive insights to optimize revenue and conversion.

· Rapid ML Prototyping: Experience in quickly building, testing, and iterating on ML prototypes to validate ideas and refine algorithms.

· Feature Engineering: Engineer large-scale consumer behavioural feature stores to support ML models, ensuring scalability and performance.

· Cross-Functional Collaboration: Work closely with Marketing, Product, and Sales teams to ensure solutions align with strategic objectives and deliver measurable impact.

· Controlled Experiments: Design, analyze, and troubleshoot A/B and multivariate tests to validate the effectiveness of your models.

Qualifications

· 8+ years in machine learning, 5+ years in reinforcement learning, recommendation systems, pricing algorithms, pattern recognition, or artificial intelligence.

· Expertise in classical ML techniques (e.g., Classification, Clustering, Regression) using algorithms like XGBoost, Random Forest, SVM, and KMeans, with hands-on experience in RL methods such as Contextual Bandits, Q-learning, SARSA, and Bayesian approaches for pricing optimization.

· Proficiency in handling tabular data, including sparsity, cardinality analysis, standardization, and encoding.

· Proficient in Python and SQL (including Window Functions, Group By, Joins, and Partitioning).

Senior ML Scientist (Optimization & Reinforcement Learning)

Share This Job