Data Engineer- AI trainer

Required Skills

big data
Hadoop
Spark
data engineering
Kafka
cloud platform
data pipelines
ai/lm applications
prompt engineering
generative ai
data curration
written communication
remote collaboration
verabl communication
problem solving
troubleshooting

Job Description

Job Title: Data Engineer- AI trainer


Job Type: Contract full-time or part-time


Location: Remote


Job Summary:

For UK-Based candidates.

Join our customer’s team as a Data Engineer- AI trainer, where you will play a pivotal role in shaping the next generation of AI systems through expert data engineering. As part of an innovative, remote-first environment, you will collaborate with cross-functional teams to build robust, scalable big data solutions and help train AI models for real-world impact.


Key Responsibilities:

  1. Design, develop, and optimize large-scale data pipelines using Hadoop, Spark, and related technologies.
  2. Build and maintain efficient data architectures to support AI model training and analytics.
  3. Integrate real-time data streams via Kafka and ensure high data quality and reliability.
  4. Leverage cloud platforms to deploy, orchestrate, and monitor distributed data processing workloads.
  5. Collaborate closely with data scientists and machine learning engineers to deliver seamless data solutions for AI initiatives.
  6. Document complex data workflows and provide clear training resources to empower team members.
  7. Champion best practices in data engineering, ensuring security, scalability, and performance.



Required Skills and Qualifications:

  1. BSc in Computer Science, Data Engineering, or related field.
  2. Proven expertise in big data technologies including Hadoop and Spark.
  3. Strong experience with Kafka for stream processing and integration.
  4. Solid background in data engineering with proficiency in building and scaling ETL pipelines.
  5. Hands-on experience working with leading cloud platforms (AWS, GCP, Azure, etc.).
  6. Exceptional written and verbal communication skills to articulate technical concepts with clarity.
  7. Proficient in scripting/programming languages (e.g., Python, Scala, or Java).
  8. Demonstrated ability to work independently in a remote collaboration environment.



Preferred Qualifications:

  1. Prior experience as an AI trainer or in AI/ML prompt engineering projects.
  2. Advanced degree in Computer Science, Data Engineering, or related field.
  3. Industry certifications in cloud or big data technologies.

Please note that by applying & completing our interview process, you will be added to our talent pool. This means you’ll be considered for this and all other possible roles that may match your skills. These potential opportunities will be sent your way as a micro1 certified candidate.

Have any questions? See FAQs