Cloudera

Staff Software Engineer - Apache Spark

New

PayCompetitive
LocationRemote
Employment typeFull-Time

What's your preference?

Apply with job updates
  • Job Description

      Req#: 250523

      Business Area:

      Engineering

      Seniority Level:

      Mid-Senior level

      Job Description:

      At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.

      Cloudera is looking for a Staff Software Engineer with strong distributed systems expertise to work on the Cloudera distribution of Apache Spark. We are looking for engineers with experience in large-scale, distributed systems and data processing to help build our enterprise-grade system, designed for customers running Spark on thousands of nodes and processing petabytes of data. You will be working with a distributed team, spread across the United States and Hungary, including multiple committers on Apache Spark.

      As a Staff Software Engineer, you will:

      • Implement new features for Cloudera’s Data Engineering Experience, delivered to customers in production on thousands of nodes

      • Contribute to Apache Spark

      • Develop new features in Scala/Java/Python on a modern platforms

      • Gain expertise in distributed data processing, from SQL planners and optimizers, to data layout and table formats like Apache Parquet and Iceberg, to fault tolerance in distributed systems.

      • Gain a solid understanding and deep technical knowledge of components across the Cloudera Data Engineering Experience stack, but focusing on Iceberg and Spark, which you can utilize in your daily tasks

      • Get to work on large scale distributed systems, from 100s to 1000s of nodes, in production clusters

      • Debug system level deployment issues, root cause analysis, perform system test analysis and resolve failures

      • Work on improving internal infrastructure

      • Collaborate with other team members and stakeholders

      We are excited about you if you have:

      • Bachelor’s or Master degree in Computer Science or equivalent, 4-6 years of experience.

      • Experience leading and delivering complex product enhancements.

      • We use Java/Scala/Python/GoLang in projects, you should have a strong understanding of at least one of the following languages: Java, Scala, GoLang, Rust, C++, Python. And interested to learn the languages we’re using.

      • Experience with systems design, development.

      • Passionate about programming, clean coding habits, attention to detail, and focus on quality

      • Strong oral and written communication skills.

      • Strong ability to research and solve problems independently without constant supervision

      • (Most importantly) Open-minded, desire to learn new things and build great products.

      You may also have:

      • Experience with using/developing Apache Spark/Airflow or other related technologies.

      • Experience building and maintaining containerized applications on Kubernetes.

      • Experience with large-scale, distributed systems design and development with an understanding of scaling, performance, and scheduling.

      • Solid experience with at least one cloud service (AWS, Azure, GCP, OpenShift)

      • Contributors to open-source projects.

      What you can expect from us:

      • Generous PTO Policy

      • Support work life balance with Unplugged Days

      • Flexible WFH Policy

      • Mental & Physical Wellness programs

      • Phone and Internet Reimbursement program

      • Access to Continued Career Development

      • Comprehensive Benefits and Competitive Packages

      • Paid Volunteer Time

      • Employee Resource Groups

      EEO/VEVRAA

      # LI-SZ1
      #LI-Remote

  • About the company

      Cloudera, Inc. Cloudera started as a hybrid open-source Apache Hadoop distribution, CDH, that targeted enterprise-class deployments of that technology.

Notice

Talentify is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or protected veteran status.

Talentify provides reasonable accommodations to qualified applicants with disabilities, including disabled veterans. Request assistance at accessibility@talentify.io or 407-000-0000.

Federal law requires every new hire to complete Form I-9 and present proof of identity and U.S. work eligibility.

An Automated Employment Decision Tool (AEDT) will score your job-related skills and responses. Bias-audit & data-use details: www.talentify.io/bias-audit-report. NYC applicants may request an alternative process or accommodation at aedt@talentify.io or 407-000-0000.