Research Engineer (Agentic Models)

Added
10 minutes ago
Type
Full time
Salary
Salary not provided

Related skills

kubernetes pytorch airflow llm kubeflow

๐Ÿ“‹ Description

  • Design, implement, and maintain SFT/RL post-training pipelines for agents.
  • Train and adapt LLMs for agent workflows, including planning.
  • Build evaluation/simulation environments for agents to act and be measured.
  • Design evaluation metrics for agent behavior; analyze traces for improvements.
  • Analyze training results to improve model architectures, training recipes, and datasets.
  • Work with distributed GPU clusters and MapReduce-style data processing.

๐ŸŽฏ Requirements

  • Hands-on experience training LLMs (pre-, fine-, or post-training).
  • Deep expertise in PyTorch and specialized LLM stacks (Megatron, NeMo, etc.).
  • LLM fundamentals: architectures, tokenization, data pipelines, batching.
  • Ability to own projects end to end from problem to iteration.
  • Product-aware mindset: translate product needs into modeling and evaluation.
  • 3+ years of Python experience in modern ML codebases.
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest โ€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs โ†’