BACK

Hadoop to Databricks Migration onto the cloud with AWS

Our Client is a leading life sciences and healthcare company operating one of the largest clinical laboratory networks in the world. The company offers world-class diagnostics to improve patient care and accelerate drug development.

Challenges

  • Highly complex and legacy Big-Data platform
  • Impacted operations that needed real-time insights
  • Existing platform incurred higher maintenance costs and limited flexibilities
  • Lacked solid data-driven enterprise.

Implementation

  • Ingested, stored, processed, enriched, and served data and insights from different sources
  • Migrated healthcare-sensitive data from an on-premise system using AWS Direct Connect, AWS VPC and AWS Data sync
  • Data Warehouse was built for structured data and a data lake for semi-structured & unstructured data

Results

  • Reduced OPEX by 35%
  • Increased productivity, by combining access to 15+ sources and 20k+ tables in near-real-time
  • By migrating to the Databricks Lakehouse, client was able to innovate faster (new use case implementation in minutes)
  • Robust modern data driven architecture enabled them to integrate ML, AI, & analytics capabilities

Tech Stack