GCP Data Engineer
Location: Irving, TX (Hybrid)
Duration: 12 Month Contract
Overview:
This position is focused on IAM Data Lake -Data Engineering responsibilities. We are seeking a Data Engineer with hands-on experience building Data Lake in Google Cloud Platform using big data tools and technologies. Experience with Hadoop/HDFS is highly desirable. Detailed requirements are listed below. Preferred location is Dallas, TX; Ohio is also an acceptable alternative.
- Strong understanding of Google Cloud architecture, including bucket structuring, naming standards, lifecycle management policies, and access control mechanisms.
- Hands-on experience working with columnar data formats such as Parquet, Avro, and ORC, along with associated compression strategies.
- Experience building both batch and streaming ingestion pipelines using GCP-native services.
Knowledge of Pub/Sub–based streaming architectures, including event schema design, schema evolution, and versioning best practices.
- Familiarity with incremental ingestion techniques and Change Data Capture (CDC) patterns.
- Understanding of data consumption and exposure patterns including views, APIs, and curated analytical datasets.
Skills & Experience:
- Airflow 2 - 4 Years
- API 4 - 6 Years
- CI/CD 2 - 4 Years
- Data Modeling 2 - 4 Years
- data pipelines 4 - 6 Years
- Data Processing 4 - 6 Years
- Google Cloud Platform 4 - 6 Years
- pyspark 4 - 6 Years
- Hadoop Ecosystem 1 - 2 Years