Data Solutions Architect at Endpoint Clinical
Job Description
📋 Description
- Architect Lakehouse solutions using Azure Data Lake Storage Gen2 (ADLS Gen2), Databricks, and Delta Lake.
- Design data models and ingestion pipelines with CDC and schema evolution.
- Implement data governance, security, and compliance (GxP, HIPAA).
- Enable ML workflows with MLflow, Feature Store, and curated datasets.
- Collaborate with clinical operations and biometrics teams.
🎯 Requirements
- 15+ years in data architecture/engineering; 5+ years with Azure; 3+ years with Databricks.
- Azure: ADLS Gen2, Data Factory/Fabric, Synapse/SQL, Event Hubs, Key Vault, VNets.
- Databricks: Spark (PySpark/SQL), Unity Catalog, Delta Live Tables, MLflow.
- Data Modeling: Star/Snowflake, Data Vault, CDC, schema evolution.
- Programming: PySpark, SQL; bonus: Python (pandas), Scala, dbt.
- Governance & Security: IAM/RBAC/ABAC, row- and column-level security, encryption, auditing.
