
Data Engineer - Big Data
Required Skills
Hadoop Ecosystem
Cloudera
Job Description
Job Title: Data Engineer - Big Data
Job Type: Full-Time, Contract (6 Months, Renewable)
Location: Remote (Offshore – India)
Job Summary:
Are you passionate about building and optimizing large-scale data platforms? Join our team as a Data Engineer – Big Data, where you will lead cutting-edge data engineering initiatives for a global banking and financial services client. As a technical leader, you will have the opportunity to design, implement, and optimize distributed data systems in a collaborative, asynchronous remote environment.
Key Responsibilities:
- Lead, mentor, and guide our team of data engineers, driving delivery of end-to-end Big Data solutions.
- Design, develop, and optimize scalable data pipelines using Hadoop Ecosystem tools (HDFS, MapReduce, Hive, Pig, HBase, Flume, Sqoop).
- Build, tune, and maintain high-performance Java, MapReduce, Hive, and Spark jobs across the Cloudera Hadoop platform.
- Own the development of dimensional data models and oversee the deployment and auditing of data models for accuracy and quality.
- Develop solutions utilizing NoSQL databases (Cassandra, HBase) and stream-processing frameworks (Spark-Streaming).
- Collaborate with business intelligence teams and vendor partners to prioritize initiatives and deliver robust, business-driven dashboards.
- Champion written communication and asynchronous collaboration to ensure smooth project execution in a global, distributed team.
Required Skills and Qualifications:
- 7-10+ years of proven experience as a Data Engineer specializing in Big Data technologies.
- Expert-level proficiency in Hadoop (preferably Cloudera), Spark, Pig, Hive, and Impala.
- Advanced programming skills in Java and scripting languages such as Python or Scala.
- In-depth knowledge of database technologies: relational SQL (Postgres) and NoSQL (Cassandra, HBase).
- Track record of designing and implementing scalable dimensional data models and stream-processing systems.
- Exceptional written communication skills, with a strong preference for async collaboration.
- Experience managing teams of engineers and third-party vendors in a remote, offshore setting.
Preferred Qualifications:
- Hands-on experience with Azure Cloud services and deployments.
- Background in the Banking or Financial Services industry.
- Familiarity with data science, machine learning tools, and additional scripting languages (C++, Linux, Ruby, PHP, R).