
Data Engineer- AI trainer
Required Skills
big data
Hadoop
Spark
data engineering
Kafka
cloud platform
data pipelines
ai/lm applications
prompt engineering
generative ai
data curration
written communication
remote collaboration
verabl communication
problem solving
troubleshooting
Job Description
Job Title: Data Engineer- AI trainer
Job Type: Contract full-time or part-time
Location: Remote
Job Summary:
For UK-Based candidates.
Join our customer’s team as a Data Engineer- AI trainer, where you will play a pivotal role in shaping the next generation of AI systems through expert data engineering. As part of an innovative, remote-first environment, you will collaborate with cross-functional teams to build robust, scalable big data solutions and help train AI models for real-world impact.
Key Responsibilities:
- Design, develop, and optimize large-scale data pipelines using Hadoop, Spark, and related technologies.
- Build and maintain efficient data architectures to support AI model training and analytics.
- Integrate real-time data streams via Kafka and ensure high data quality and reliability.
- Leverage cloud platforms to deploy, orchestrate, and monitor distributed data processing workloads.
- Collaborate closely with data scientists and machine learning engineers to deliver seamless data solutions for AI initiatives.
- Document complex data workflows and provide clear training resources to empower team members.
- Champion best practices in data engineering, ensuring security, scalability, and performance.
Required Skills and Qualifications:
- BSc in Computer Science, Data Engineering, or related field.
- Proven expertise in big data technologies including Hadoop and Spark.
- Strong experience with Kafka for stream processing and integration.
- Solid background in data engineering with proficiency in building and scaling ETL pipelines.
- Hands-on experience working with leading cloud platforms (AWS, GCP, Azure, etc.).
- Exceptional written and verbal communication skills to articulate technical concepts with clarity.
- Proficient in scripting/programming languages (e.g., Python, Scala, or Java).
- Demonstrated ability to work independently in a remote collaboration environment.
Preferred Qualifications:
- Prior experience as an AI trainer or in AI/ML prompt engineering projects.
- Advanced degree in Computer Science, Data Engineering, or related field.
- Industry certifications in cloud or big data technologies.