AWS Data Engineer

Required Skills

AWS
Python
Glue

Job Description

Job Title: AWS Data Engineer


Job Type: Full-time


Location: On-site Malvern, Pennsylvania, United States, Chester County


Job Summary:

Join our dynamic team as an AWS Data Engineer, where you will architect and implement robust data solutions using cutting-edge AWS services. You will play a key role in building a scalable, high-performing data infrastructure, empowering our analytics efforts and driving business insights. Our async culture values clear written communication and innovation in data engineering best practices.


Key Responsibilities:

  1. Design, develop, and optimize scalable data pipelines and ETL workflows using AWS Glue, PySpark, and Python.
  2. Integrate a variety of AWS services—such as Lambda, DynamoDB, EC2, RDS, S3, Athena, Data Pipeline, API Gateway, Glue, and EMR—within reporting applications to enhance data accessibility and efficiency.
  3. Collaborate with the Data and Analytics team to architect and implement a robust Data Lake using AWS, Apache Airflow, PySpark, and Hive.
  4. Write, maintain, and troubleshoot SQL, Spark, and Python scripts to ensure reliable and high-performance data processing.
  5. Develop and deploy ETL jobs that handle data extraction, transformation, and loading from diverse sources—both homogeneous and heterogeneous.
  6. Monitor, tune, and document data workflows, maintaining best practices for data quality, scalability, and security.
  7. Champion an asynchronous work culture through proactive, clear written communication with stakeholders and team members.



Required Skills and Qualifications:

  1. Proven hands-on experience in AWS data engineering, with deep expertise in AWS Glue, S3, Lambda, and related services.
  2. Advanced proficiency in Python and PySpark for data transformation and automation tasks.
  3. Strong command of SQL for querying and managing large datasets.
  4. Experience with scalable ETL pipeline design, deployment, and troubleshooting.
  5. Solid understanding of data warehousing, Data Lake concepts, and big data technologies (EMR, Hive, Airflow).
  6. Demonstrated ability to integrate multiple AWS services within a data ecosystem.
  7. Exceptional written communication skills aligned with an asynchronous work environment.



Preferred Qualifications:

  1. Bachelor’s or Master’s degree in Computer Science, Engineering, or a related discipline.
  2. Experience building data solutions for analytics or reporting applications in a fast-paced environment.
  3. Relevant AWS certifications (e.g., AWS Certified Data Analytics, AWS Certified Solutions Architect).