AWS Data Engineer
Required Skills
AWS
Python
Glue
Job Description
Job Title: AWS Data Engineer
Job Type: Full-time
Location: On-site Malvern, Pennsylvania, United States, Chester County
Job Summary:
Join our dynamic team as an AWS Data Engineer, where you will architect and implement robust data solutions using cutting-edge AWS services. You will play a key role in building a scalable, high-performing data infrastructure, empowering our analytics efforts and driving business insights. Our async culture values clear written communication and innovation in data engineering best practices.
Key Responsibilities:
- Design, develop, and optimize scalable data pipelines and ETL workflows using AWS Glue, PySpark, and Python.
- Integrate a variety of AWS services—such as Lambda, DynamoDB, EC2, RDS, S3, Athena, Data Pipeline, API Gateway, Glue, and EMR—within reporting applications to enhance data accessibility and efficiency.
- Collaborate with the Data and Analytics team to architect and implement a robust Data Lake using AWS, Apache Airflow, PySpark, and Hive.
- Write, maintain, and troubleshoot SQL, Spark, and Python scripts to ensure reliable and high-performance data processing.
- Develop and deploy ETL jobs that handle data extraction, transformation, and loading from diverse sources—both homogeneous and heterogeneous.
- Monitor, tune, and document data workflows, maintaining best practices for data quality, scalability, and security.
- Champion an asynchronous work culture through proactive, clear written communication with stakeholders and team members.
Required Skills and Qualifications:
- Proven hands-on experience in AWS data engineering, with deep expertise in AWS Glue, S3, Lambda, and related services.
- Advanced proficiency in Python and PySpark for data transformation and automation tasks.
- Strong command of SQL for querying and managing large datasets.
- Experience with scalable ETL pipeline design, deployment, and troubleshooting.
- Solid understanding of data warehousing, Data Lake concepts, and big data technologies (EMR, Hive, Airflow).
- Demonstrated ability to integrate multiple AWS services within a data ecosystem.
- Exceptional written communication skills aligned with an asynchronous work environment.
Preferred Qualifications:
- Bachelor’s or Master’s degree in Computer Science, Engineering, or a related discipline.
- Experience building data solutions for analytics or reporting applications in a fast-paced environment.
- Relevant AWS certifications (e.g., AWS Certified Data Analytics, AWS Certified Solutions Architect).