Avatar for Databricks
One cloud platform for massive scale data engineering and collaborative data science
  • B2B
  • Scale Stage
    Rapidly increasing operations
  • Top Investors
    This company has received a significant amount of investment from top investors
  • +3

Senior Software Engineer - Distributed Data Systems

Posted: 4 years ago
Visa Sponsorship

Not Available

RelocationAllowed

About the job

At Databricks, we are passionate about enabling data teams to solve the world's toughest problems — from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the world's best data and AI infrastructure platform so our customers can use deep data insights to improve their business. Founded by engineers — and customer obsessed — we leap at every opportunity to solve technical challenges, from designing next-gen UI/UX for interfacing with data to scaling our services and infrastructure across millions of virtual machines. And we're only getting started.

Modern data analysis employs sophisticated methods such as machine learning that go well beyond the roll-up and drill-down capabilities of traditional SQL query engines. As a software engineer on the Runtime team at Databricks, you will be building the next generation distributed data storage and processing systems that can outperform specialized SQL query engines in relational query performance, yet provide the expressiveness and programming abstractions to support diverse workloads ranging from ETL to data science.

Below are some example projects:

Apache Spark: Develop the de facto open source standard framework for big data.

Data Plane Storage: Provide reliable and high performance services and client libraries for storing and accessing humongous amount of data on cloud storage backends, e.g., AWS S3, Azure Blob Store.

Delta Lake: A storage management system that combines the scale and cost-efficiency of data lakes, the performance and reliability of a data warehouse, and the low latency of streaming. Its higher level abstractions and guarantees, including ACID transactions and time travel, drastically simplify the complexity of real-world data engineering architecture.

Delta Pipelines: It's difficult to manage even a single data engineering pipeline. The goal of the Delta Pipelines project is to make it simple and possible to orchestrate and operate tens of thousands of data pipelines. It provides a higher level abstraction for expressing data pipelines and enables customers to deploy, test & upgrade pipelines and eliminate operational burdens for managing and building high quality data pipelines.

Performance Engineering: Build the next generation query optimizer and execution engine that's fast, tuning free, scalable, and robust.

What we look for:

  • BS (or higher) in Computer Science, related technical field or equivalent practical experience.
  • Comfortable working towards a multi-year vision with incremental deliverables.
  • Motivated by delivering customer value and impact.
  • 5+ years of production level experience in either Java, Scala or C++.
  • Strong foundation in algorithms and data structures and their real-world use cases.
  • Experience with distributed systems, databases, and big data systems (Spark, Hadoop).

Benefits

  • Comprehensive health coverage including medical, dental, and vision
  • 401(k) Plan
  • Equity awards
  • Flexible time off
  • Paid parental leave
  • Family Planning
  • Gym reimbursement
  • Annual personal development fund
  • Work headphones reimbursement
  • Employee Assistance Program (EAP)
  • Business travel accident insurance

COVID-19 Vaccination Requirement

As a federal government contractor, Databricks requires new U.S. employees to be fully vaccinated against COVID-19. Proof of vaccination will be required as a condition of employment. Databricks will make reasonable accommodations based on medical conditions or religious grounds for qualified candidates in accordance with applicable law.

About Databricks

Databricks is the data and AI company. More than 7,000 organizations worldwide — including Comcast, Condé Nast, H&M, and over 40% of the Fortune 500 — rely on the Databricks Lakehouse Platform to unify their data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe. Founded by the original creators of Apache Spark™, Delta Lake and MLflow, Databricks is on a mission to help data teams solve the world’s toughest problems. To learn more, follow Databricks on Twitter, LinkedIn and Facebook.

Our Commitment to Diversity and Inclusion

At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics.

About the company

Databricks company logo
One cloud platform for massive scale data engineering and collaborative data science1001-5000 Employees
  • B2B
  • Scale Stage
    Rapidly increasing operations
  • Top Investors
    This company has received a significant amount of investment from top investors
  • Valuation $1B+
    This company has a valuation of $1B or more
  • 4.6
    Highly rated
    Databricks is highly rated on Glassdoor, with 4.6 out of 5 stars
  • 4.7
    Strong Leadership
    Employees rate Databricks 4.7/5 on Glassdoor for faith in leadership
Learn more about Databricks image

Funding

AMOUNT RAISED
$990M
FUNDED OVER
8 rounds
Rounds
F
$400,000,000
Series F - Oct 2019+7

Founders

Quinn Hodges
Founder • 3 years
San Francisco
image
View the team image

Similar Jobs

Albeado company logo
Albeado
Breakthrough causal AI predictions, optimizations and interventions - in real time
GVOS  company logo
GVOS
An Edge Cloud for Autonomous Driving
Zoox company logo
Zoox
We are building a new kind of transportation
JetInsight company logo
JetInsight
Best-in-class quoting and fleet management software for aircraft charter operators
C3.ai company logo
C3.ai
C3 AI is a leading enterprise AI software provider for accelerating digital transformation
Finch company logo
Finch
Unifying payroll, HR, and benefits under a single API
Enki company logo
Enki
The AI-powered skills coach for professionals and teams
Feathery company logo
Feathery
Collect & activate any (un)structured data