Site Reliability Engineer II
Instacart
We're transforming the grocery industry
At Instacart, we invite the world to share love through food because we believe everyone should have access to the food they love and more time to enjoy it together. Where others see a simple need for grocery delivery, we see exciting complexity and endless opportunity to serve the varied needs of our community. We work to deliver an essential service that customers rely on to get their groceries and household goods, while also offering safe and flexible earnings opportunities to Instacart Personal Shoppers.
Instacart has become a lifeline for millions of people, and we’re building the team to help push our shopping cart forward. If you’re ready to do the best work of your life, come join our table.
Instacart is a Flex First team
There’s no one-size-fits-all approach to how we do our best work. Our employees have the flexibility to choose where they do their best work, whether it’s from home, an office, or their favorite coffee shop, while staying connected and building community through regular in-person events. Learn more about our flexible approach to where we work.
Overview
About the Role
Are you passionate about technology and ready to dive into the world of Site Reliability Engineering? We are looking for enthusiastic individuals to join our team as a Site Reliability Engineer II, where you'll play a crucial role in ensuring the reliability and performance of our platform.
This is a top-notch opportunity to learn from skilled engineers and contribute to solving complex technical challenges. You will be involved in monitoring systems, responding to incidents, and developing automation to streamline operations. We are looking for someone eager to grow their skills, learn new technologies, and contribute to a culture of reliability.
About the Team
The Site Reliability Engineering (SRE) team combines software and systems engineering to design and manage large-scale, distributed, and fault-tolerant systems. This team is tasked with ensuring high reliability, optimal system performance, and continuous improvement for both Instacart's critical internal services and externally facing systems.
SRE focuses on optimizing existing systems, building robust infrastructure, and automating processes to minimize manual effort. Joining the SRE team means facing unique scaling challenges while leveraging expertise in coding, algorithms, complexity analysis, and large-scale system design.
The team thrives within a culture of intellectual curiosity, problem-solving, and collaboration. With members from diverse backgrounds and experiences, SRE fosters a supportive and risk-tolerant environment where individuals are encouraged to think big, take on impactful projects, and grow with mentorship and guidance.
About the Job
- Participate in monitoring systems and responding to alerts, escalating issues as needed.
- Assist in incident management, following established protocols and documenting steps.
- Help maintain and improve documentation for processes and procedures.
- Contribute to the development and maintenance of automation scripts and tools.
- Learn and apply best practices in site reliability engineering.
- Support the deployment of applications and services, ensuring smooth and reliable releases.
- Collaborate with senior engineers to troubleshoot and tackle technical issues.
About You
Minimum Qualifications
- Understanding of programming concepts and scripting languages.
- Strong interest in system administration and troubleshooting.
- Excellent problem-solving and analytical skills.
- Strong sense of ownership over incident handling.
- Ability to work well in a team environment and communicate effectively.
- Eagerness to learn and grow in the field of Site Reliability Engineering.
Preferred Qualifications
- Experience with Ruby or Go.
- Familiarity with cloud platforms (e.g., AWS, GCP, Azure) is a plus.
- Previous SRE experience.
Instacart provides highly market-competitive compensation and benefits in each location where our employees work. This role is remote and the base pay range for a successful candidate is dependent on their permanent work location. Please review our Flex First remote work policy here. Currently, we are only hiring in the following provinces: Ontario, Alberta, British Columbia and Nova Scotia.
Offers may vary based on many factors, such as candidate experience and skills required for the role. Please read more about our benefits offerings here.
For Canada-based candidates, the base pay ranges for a successful candidate are listed below.