Senior Development Manager, Data Reliability Engineering (Remote - Americas)
Shopify
Senior Development Manager, Data Reliability Engineering (Remote - Americas)
About the role
Data plays a crucial role in enabling Shopify to achieve its mission and the data platform team helps our users extract value from the data and surface the right insights to the right people, at the right time. The Data Reliability Engineering group focuses on operating and scaling the data platform. To do this you will look at making impact along the vectors of availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning. You will be working with our core ETL stack that consists of a Python-based batch processing engine built on Spark with a SQL-based transformation engine built on dbt and BigQuery (also moving to Spark). Our streaming transformations engine is built on Beam and Dataflow, with plans to move to Flink in the near future. For interactive or ad-hoc analytics, we use Presto along with OLAP analytic stores which include home-grown systems built on BigTable and Druid (for real-time analytics).
In this role, you will head the reliability engineering group. You will be the bridge between the data product teams and operation teams by bringing a software-centric mindset to reliability problems.
As a Senior Development Manager, you’ll be working on things like :
- Managing several teams and helping them prioritize, build and ship the most impactful features to our merchants and community of developers.
- Building and growing high-performing teams both through internal development and hiring
- Work on problems with your teams, roll up your sleeves, form an opinion and advocate for data engineering-specific roadmap items
- You will strengthen relationships across the entire company and leverage these relationships to unblock teams and deliver amazing products to our users
- You will lead the data reliability group consisting of an embedded product reliability team, tooling and infrastructure operations
Qualifications
- Proven experience leading reliability engineering teams that scale and optimize distributed systems
- Proven technical leadership skills using technologies and frameworks similar to Shopify's
- Experience hiring and mentoring high-caliber, data-focused engineers and managers with diverse technical strengths and backgrounds optimizing for both breadth and depth.
- Highly technical and familiar with all components of a typical data stack (ranging from data acquisition, storage, modeling and process, to serving). You don’t mind getting into the nitty gritty or jumping into the code.
- Strong software centric approach to solving system scaling and reliability problems. Automation and continuous improvement mindset with ability to propagate that culture through their teams.
- Exceptional communication skills, you can easily translate technical concepts into easy to understand language for our stakeholders and partners, both technical and non-technical.
- Ability to design cross-org performance and improvement metrics and turn that into roadmaps for the teams
- You have successfully built and grown large teams of engineers during your career.
- A commitment and drive for quality, technical excellence and results.
It would be great if you have :
- Experience working with remote teams in different timezones
- Experience with providing reliability to data ecosystems at the petabyte scale
Full Time