Job Title: Python Developer / Senior Data Engineer
Client: Apex System/CVS
Location: Remote
Duration: 12 Months (Contract to Hire)
Openings: 5
Job Description
• Design, develop, and support scalable data ingestion and processing pipelines on Google Cloud Platform (Google Cloud Platform).
• Migrate and modernize existing big data workloads from Hadoop to Google Cloud Platform.
• Automate data ingestion workflows currently handled through manual processes.
• Build and maintain data pipelines using Python and Spark / PySpark.
• Execute large-scale data processing jobs using Dataproc.
• Develop and run Python scripts using Google Cloud Functions and Google Kubernetes Engine (GKE).
• Ingest data from multiple external sources such as providers, healthcare groups, and hospitals.
• Process and manage structured, semi-structured, and unstructured data, including denormalized datasets.
• Work with cloud-based relational databases such as Cloud SQL (Postgres, Oracle, or similar).
• Ensure data quality, reliability, and performance across pipelines.
• Collaborate with cross-functional teams to support application-facing data needs.
• Contribute to AI-driven initiatives, including data preparation and pipeline development for AI/ML use cases.
Must Have Skills
• Strong hands-on Python programming experience.
• Solid experience with Spark / PySpark for big data processing.
• Experience working in Google Cloud Platform (Google Cloud Platform) environments.
• Knowledge of Google Cloud Platform services such as Dataproc, BigQuery, Cloud Functions, GKE, Cloud SQL.
• Experience handling large-scale, denormalized, structured & unstructured datasets.
• Strong understanding of data ingestion, transformation, and automation.
Nice to Have Skills
• Experience building or supporting AI / ML pipelines.
• Exposure to AI model development or inference pipelines using Python.
• Background in cloud-native data architecture and automation.
Apply Now
Apply Now