Azure Data Fabric and ML Engineer (MPI Program)

New Today

Fabric Data & ML Engineer Remote ok
6 months, potential to extend zure Data Fabric and ML Engineer (MPI Program)
Summary: Hiring a Data & ML Engineer to support the person matching and identity resolution workflows of the MPI initiative, leveraging Microsoft Fabric, Synapse, and ML capabilities. This role involves creating data pipelines, cleansing and linking records, and operationalizing ML-based entity resolution models.
Key Responsibilities:
Build data pipelines and ML workflows within Microsoft Fabric for entity matching and deduplication across data domains. Implement and optimize MLOps pipelines (training, scoring, and retraining). Integrate data from multiple sources: CRM, EHRs, finance, HR, etc. Develop reusable modules for fuzzy matching, rule-based, and ML-based identity resolution. Collaborate with data scientists and SMEs to operationalize models using SynapseML, PySpark, or Azure ML. Required Experience: 5+ years of experience in data engineering and machine learning in the Azure ecosystem. Proficient with Microsoft Fabric (Lakehouse, Pipelines, Notebooks), Synapse, and Azure ML. Solid understanding of identity resolution techniques, especially ML-based approaches. Strong programming skills in Python and PySpark. Familiarity with data privacy, governance, and ethics in ML Fabric Data & ML Engineer Remote ok
6 months, potential to extend zure Data Fabric and ML Engineer (MPI Program)
Summary: Hiring a Data & ML Engineer to support the person matching and identity resolution workflows of the MPI initiative, leveraging Microsoft Fabric, Synapse, and ML capabilities. This role involves creating data pipelines, cleansing and linking records, and operationalizing ML-based entity resolution models.
Key Responsibilities: Build data pipelines and ML workflows within Microsoft Fabric for entity matching and deduplication across data domains. Implement and optimize MLOps pipelines (training, scoring, and retraining). Integrate data from multiple sources: CRM, EHRs, finance, HR, etc. Develop reusable modules for fuzzy matching, rule-based, and ML-based identity resolution. Collaborate with data scientists and SMEs to operationalize models using SynapseML, PySpark, or Azure ML. Required Experience: 5+ years of experience in data engineering and machine learning in the Azure ecosystem. Proficient with Microsoft Fabric (Lakehouse, Pipelines, Notebooks), Synapse, and Azure ML. Solid understanding of identity resolution techniques, especially ML-based approaches. Strong programming skills in Python and PySpark. Familiarity with data privacy, governance, and ethics in ML
Location:
Austin, TX, United States
Category:
Computer And Mathematical Occupations

We found some similar jobs based on your search