Researcher – Reinforcement Learning and LLM Reasoning

Huawei Technologies Canada Co., Ltd.

nearmejobs.eu

Our team has a 12-month contract opening for a Researcher.

Responsibilities:

  • Conduct cutting-edge research in the field of Natural Language Processing /Large Language Models (LLMs), focusing on advancing model reasoning.
  • Leverage Reinforcement learning and Deep Learning to enhance reasoning capabilities and decision-making of LLMs.
  • Design, implement, and experiment with novel methods to improve model and performance and efficiency in real-world applications.
  • Publish high-quality research in top-tier AI/ML conferences (e.g., NeurIPS, ICML, ICLR) and contribute to the broader machine learning community.

Requirements

What you’ll bring to the team:

  • PhD or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, Mathematics, or a related technical field.
  • Strong research background with a track record of publications in top-tier AI conferences (NeurIPS, ICML, ICLR, ACL, etc.).
  • Expertise in Large Language Models (LLMs) and hands-on experience with frameworks like PyTorch or TensorFlow.
  • Hands-on experience with fine-tuning, RLHF, and applying advanced reasoning methods such as Chain of Thought and In-Context Learning.
  • Proficient programming skills in Python and strong experience with model development and optimization.
  • Effective analytical, problem-solving, and troubleshooting skills with a focus on innovative research solutions.
  • Strong communication skills, both written and verbal, and a demonstrated ability to convey complex research findings to a variety of audiences.

Apply now
To help us track our recruitment effort, please indicate in your cover/motivation letter where (nearmejobs.eu) you saw this job posting.

Job Location