Database Engine Internals - Staff Software Engineer

New Today

P-955 Our mission at Databricks is to radically simplify the whole data lifecycle from ingestion to ETL, BI, and all the way up to ML/AI with a unified platform. To achieve this goal, we believe the data warehouse architecture as we know it today will be replaced by a new architectural pattern, Lakehouse (), open platforms that unify data warehousing and advanced analytics. The new architecture will help address several major challenges, including data staleness, reliability, total cost of ownership, data lock-in, and limited use-case support.
A critical part of realizing this vision is the next generation (decoupled) query engine and structured storage system that can outperform specialized data warehouses in relational query performance, yet retain the expressiveness and of general purpose systems such as Apache Sparkā„¢ to support diverse workloads ranging from ETL to data science.
As part of this team, you will be working in one or more of the following areas to design and implement these next gen systems that leapfrog state-of-the-art: Query compilation and optimization
Distributed query execution and scheduling
Vectorized execution engine
Data security
Resource management
Transaction coordination
Efficient storage structures (encodings, indexes)
Automatic physical data optimization What we look for: A passion for database systems, storage systems, distributed systems, language design, or performance optimization
Experience working towards a multi-year vision with incremental deliverables
Motivated by delivering customer value and impact
8+ years of experience working in a related system (preferred)
Optional: PhD in databases or distributed systems
Location:
Bellevue

We found some similar jobs based on your search