Databricks Engineer

Remote Full-time
About Us:
At RELI Group, our work is grounded in purpose. We partner with government agencies to solve complex challenges, improve public health, strengthen national security, and make government services more effective and efficient. Our team of over 500 professionals brings deep expertise and a shared commitment to delivering meaningful outcomes. Behind every solution is a group of experts who care deeply about impact, whether we’re supporting data-driven decisions, modernizing systems or safeguarding critical programs.

Position Summary:
RELI Group is seeking a highly skilled Data Engineer to support the Centers for Medicare & Medicaid Services (CMS) Multidimensional Information Data Analytics System (MIDAS) Program. This role will play a critical part in the modernization of the MIDAS data platform, building and maintaining scalable, high-performing data pipelines that drive enterprise analytics, reporting, and operational decision-making. The Data Engineer will focus on developing data ingestion, transformation, and processing pipelines across AWS and Databricks-based environments, with an emphasis on Delta Lake lakehouse architectures.

Responsibilities:
• Design, build, and maintain robust ETL/ELT pipelines that ingest, transform, and curate data for analytics and reporting solutions.
• Develop efficient data processing workflows using Python, SQL, and PySpark within Databricks and Delta Lake environments.
• Implement scalable data lakehouse architectures supporting complex healthcare datasets and enterprise analytics.
• Collaborate with Data Architects, Automation Test Engineers, QA Analysts, and Business Analysts to define data transformation logic and data integration standards.
• Build reusable code and frameworks for data ingestion, data quality validation, and exception handling.
• Optimize data processing jobs for performance, reliability, scalability, and cost efficiency in AWS and Databricks cloud platforms.
• Participate in schema design, data modeling, and version-controlled data pipeline development.
• Contribute to data governance, metadata management, and data lineage documentation to support audit, compliance, and FISMA reporting requirements.
• Partner with DevOps teams to integrate data pipelines into CI/CD workflows using GitHub, Databricks Repos, and related tools.
• Perform root cause analysis and resolution of data issues across staging, integration, and production environments.

Qualifications:
• Bachelor’s degree in Computer Science, Information Systems, Engineering, or related technical field
• 6+ years of experience in data engineering or data pipeline development roles
• Strong hands-on experience developing data pipelines using Python, SQL, and PySpark
• Experience in Java is a plus, to understand existing code and convert it to Python and Databricks Notebooks
• Extensive experience with Databricks, including Databricks Workflows, Delta Lake, notebooks, and distributed computing
• Proven experience with ETL/ELT design, development, and optimization for large-scale data processing
• Solid understanding of data lakehouse architecture, data partitioning, and Delta Lake versioning
• Experience with AWS data services such as S3, Redshift, Glue, IAM, and RDS
• Familiarity with CI/CD processes for data pipelines (GitHub Actions, Jenkins, Databricks Repos)
• Experience with data quality validation, data profiling, and debugging data pipeline failures
• Strong collaboration and communication skills with ability to translate business needs into technical requirements

Preferred Qualifications:
• Experience supporting CMS programs and understanding of the CMS Technical Reference Architecture (TRA) and Target Lifecycle (TLC).
• Exposure to BI platforms such as QuickSight, Tableau, or Power BI for data consumption and reporting validation.
• Familiarity with Unity Catalog, data governance, and access control within Databricks.
• Experience supporting regulated environments requiring audit trails, FISMA audits, or CMS Acceptable Risk Safeguards (ARS 5.0).
• Exposure to healthcare datasets such as ACA, QHP, HICS, or HIOS.

Summary of Core Technologies:
• Python, SQL, PySpark
• Databricks, Delta Lake, Databricks Workflows
• AWS: S3, Redshift, Glue, RDS
• CI/CD: GitHub Actions, Jenkins, Databricks Repos
• ETL/ELT pipelines and data lakehouse transformations
• Distributed data processing and optimization

EEO Employer:
RELI Group is an Equal Employment Opportunity / Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to race, color, national origin, ancestry, citizenship status, military status, protected veteran status, religion, creed, physical or mental disability, medical condition, marital status, sex, sexual orientation, gender, gender identity or expression, age, genetic information, or any other basis protected by law, ordinance, or regulation.

HUBZone:
We encourage all candidates who live in a HUBZone to apply. You can check to see if your address is located in a HUBZone by accessing the SBA HUBZone Map.

The annual salary range for this position is $100,000 to $150,000. Actual compensation will depend on a range of factors, including but not limited to the individual’s skills, experience, qualifications, certifications, location, other business and organizational needs, and applicable employment laws. The estimate displayed represents the typical salary range for this position and is just one component of the total compensation package for employees. RELI Group provides a variety of additional benefits to its employees. For additional details on the benefits that RELI Group offers, click here.