Member of Technical Staff (Research Engineering)

Core team

$180K - $500K/yr compensation

Required Skills

Reinforcement Learning

ML-Oriented Data Design

RL Environments

RL Workflows

About micro1

micro1 connects domain experts to the development of frontier AI models. Real-world expertise is turned into training data, evaluations, and feedback loops that improve how models perform. AI labs and enterprises use micro1 to train models and build reliable AI agents through advanced evaluations and reinforcement learning environments. Experts contribute directly to how AI systems learn, reason, and perform across domains like finance, healthcare, engineering, and more. Our platform identifies and vets top talent through an AI recruiter, enabling high-quality contributions at scale.

Our goal is to enable 1 billion people to do meaningful work by applying their expertise to AI. We’ve raised $40M+ in funding, and our AI recruiter has powered over 1 million AI-led interviews as our global network of experts grows into the human intelligence layer for AI.

Job Description

Job Title: Member of Technical Staff (Research Engineering)

Job Type: Full-time

Location: Remote

The Role

We are seeking a Research Engineer to operate at the frontier of Reinforcement Learning (RL), developing novel environments, training pipelines, and evaluation systems that advance the capabilities of modern AI models. This role sits at the intersection of research and production, translating experimental ideas into scalable, high-performance systems.

What You’ll Work On

Architect self-contained RL environments that capture complex, real-world tasks, including reward functions, verifiers, and evaluation logic.
Design and scale episode pipelines and multi-component training processes (MCPs) to support reproducible experimentation.
Build automated data generation systems, leveraging synthetic data to accelerate training cycles without compromising quality.
Develop and integrate AI-driven evaluation and quality assurance systems for automated grading, validation, and feedback loops.
Fine-tune and optimize open-source RL models using internally generated datasets and custom training strategies.
Establish benchmarking frameworks to measure model capability, robustness, and data quality across tasks.
Contribute to the release and analysis of evaluations on internal and external benchmark platforms (e.g., micro1 benchmarks).

What We're Looking For

Deep experience in Reinforcement Learning, including environment design and training dynamics.
Strong track record of building and scaling RL systems, pipelines, or experimentation frameworks.
Proficient in automation and data generation, including synthetic data pipelines.
Familiar with automated evaluation systems, model validation, and quality assurance workflows.
Experienced in fine-tuning and evaluating open-source ML models.
Clear, concise communicator with strong technical writing skills.
Comfortable operating in fast-paced, research-driven, and highly collaborative environments.

Preferred

Experience publishing benchmarks, evaluations, or research artifacts.
Familiarity with evaluation ecosystems (e.g., micro1 benchmarks or similar frameworks).
Background in scalable infrastructure for large-scale RL experimentation.

Apply now

First name

Last name

Phone number

Linkedin profile URL

Upload your resume (in English)

Click to upload or drag & drop (.pdf)

Please note that after completing the interview process, you’ll be added to our talent pool and considered for this and other roles that match your skills.

Have any questions? See FAQs

Refer and Earn$1500