Systems & Infrastructure Specialist

Pay: $40 - $70/hour

Required Skills

Terminal-Native Problem Solving
Dynamic Infrastructure Recovery
Containerized Environment Mastery
Systems Multilingualism

About micro1

micro1 connects domain experts to the development of frontier AI models. Real-world expertise is turned into training data, evaluations, and feedback loops that improve how models perform. AI labs and enterprises use micro1 to train models and build reliable AI agents through advanced evaluations and reinforcement learning environments. Experts contribute directly to how AI systems learn, reason, and perform across domains like finance, healthcare, engineering, and more. Our platform identifies and vets top talent through an AI recruiter, enabling high-quality contributions at scale.
Our goal is to enable 1 billion people to do meaningful work by applying their expertise to AI. We’ve raised $40M+ in funding, and our AI recruiter has powered over 1 million AI-led interviews as our global network of experts grows into the human intelligence layer for AI.

Job Description

Job Title: Systems & Infrastructure Specialist


Job Type: Contractor


Location: Remote


Job Summary:

Join our customer's team as a Systems & Infrastructure Specialist for a high-intensity, expert-level project focused on training and optimizing AI models within intricate, containerized environments. In this terminal-intensive role, you'll apply a systems-first mindset to solve complex infrastructure challenges in real time. This one-time project offers significant opportunities for extension or transition into future phases for those who demonstrate elite technical execution.


Key Responsibilities:

• Navigate, troubleshoot, and recover dynamic infrastructure and long-running processes in real time using command-line tools.

• Master and manage highly containerized environments, including orchestrating Dockerized sandboxes and CI/CD workflows.

• Build, maintain, and optimize systems for AI model training and high-throughput compute environments.

• Respond swiftly to system errors, executing dynamic mid-operation replanning and recovery.

• Collaborate with engineering and AI teams to ensure seamless integration, reliability, and performance.

• Document system architectures, incident responses, and recovery protocols with meticulous clarity.

• Contribute expertise to evolving project needs, adapting to new technologies and scaling strategies as required.


Required Skills and Qualifications:

• Demonstrated expert proficiency working in terminal environments for system builds, server administration, and infrastructure management.

• Advanced problem-solving skills for multi-step troubleshooting, filesystem navigation, and process management within containerized settings.

• Hands-on experience with Python, Bash, JavaScript/TypeScript, Go, Rust, and/or C/C++.

• Deep familiarity with build systems, package managers, databases, web servers, ML frameworks, version control, and cryptography tools.

• Proven ability to execute dynamic infrastructure recovery and optimize long-running processes under pressure.

• Strong written and verbal communication skills, with a passion for precise technical documentation.

• Systems multilingualism: versatility across operating systems, languages, and emerging DevOps tools.


Preferred Qualifications:

• Prior experience in high-compute environments for AI/ML workloads.

• Background in Site Reliability Engineering or DevOps roles focused on mission-critical infrastructure.

• Familiarity with advanced container orchestration and distributed system design.

Please note that after completing the interview process, you’ll be added to our talent pool and considered for this and other roles that match your skills.
