Staff Site Reliability Engineer, Platform

Gemini

San Francisco, United States of America

Remote within United States

Posted over 1 year ago

At Gemini, our mission is to unlock the next era of financial, creative, and personal freedom.

Tech stack

About the Company

Gemini is a global crypto and Web3 platform founded by Tyler Winklevoss and Cameron Winklevoss in 2014. Gemini offers a wide range of crypto products and services for individuals and institutions in over 70 countries.

Crypto is about giving you greater choice, independence, and opportunity. We are here to help you on your journey. We build crypto products that are simple, elegant, and secure. Whether you are an individual or an institution, we help you buy, sell, and store your bitcoin and cryptocurrency.

At Gemini, our mission is to unlock the next era of financial, creative, and personal freedom.

The Department: Platform

Our Platform organization’s purpose is to enable Gemini to scale effectively and empower our engineering teams to focus on building innovative financial products and experiences for individuals around the world. Within Platform, the Site Reliability Engineering team is responsible for partnering with Gemini’s other engineering teams to ensure all our systems are architected, engineered and deployed to be resilient, reliable and performant.

The Embedded SRE team is a part of Site Reliability Engineering with a focus on engaging directly with our other engineering teams to onboard them onto our platform systems, reviewing and recommending design and architectural decisions, and guiding our engineering teams on how to implement the tooling provided by the larger Platform organization required to ensure systems can scale and react to changing conditions, with continuous improvement loops.

The Role: Staff Site Reliability Engineer

You will be an integral part of leading Gemini’s engineering teams towards modern DevOps practices, both by developing and providing modern automation and operational tooling, and working cross functionally across Gemini’s engineering teams to influence and shape our development practices and culture.

Responsibilities:

Provide primary operational support and engineering for various Gemini services
Improve reliability, quality and time-to-market across all Gemini services and offerings
Guide engineering teams onto the various supported services provided by Platform
Run on-going performance evaluations and improvements for Gemini systems
Provide architecture recommendations and engagement as part of SDLC
Create “Production-ready Scorecards” to evaluate the health of systems pre-launch
Implement and teaching monitoring, alerting and automated resolution best practices
Define SLIs, SLOs with Engineering teams
Educate and guide Engineering teams on reliability and resiliency best practices, like statelessness, chaos testing, blue/green deployments etc.
Build operational tooling and automations

Qualifications:

7+ years using monitoring, alerting, and automation tooling to understand and remediate performance and health issues in systems at scale
Good knowledge for various cloud technology providers like AWS, GCP, or Azure
Experience in a code-first environment, developing automated solutions to solve support and operational issues
Experience as a Technical Leader within a team, helping evaluating and making tech decisions for the team
Experience working with containerization such as Nomad, EKS (k8s), Docker, etc.
Experience working with Configuration Management such as Ansible, Chef, Puppet
Experience writing scripts or cli tools that help increase Developer Productivity in high-level languages like Python, Go, etc.
Experience analyzing system and application performance, identifying bottlenecks, and recommending architectural or systemic improvements
Experience working with Engineering teams, teaching, training, and mentoring on how to implement best-practice technical solutions
Experience working in a code-drive, automation-first public cloud infrastructure (Terraform)

It Pays to Work Here

The compensation & benefits package for this role includes:

Competitive starting salary
A discretionary annual bonus
Long-term incentive in the form of a new hire equity grant
Comprehensive health plans
401K with company matching
Paid Parental Leave
Flexible time off

What makes you a perfect
candidate for this role

An academic degree in the relevant field is good to have
7+
years of commercial experience
Corresponding level of skills:
AWS / GCP
advanced
Python
intermediate
Docker
advanced
Ansible
advanced
Terraform
advanced
Go
intermediate
Language skills:
English
advanced

Compensation

$172K - 241K + Equity

Role type

Full time

Visa sponsorship

Not provided

Benefits & perks

Flexible Working
Dental Insurance
Relaxed office environment
Competitive Salary
Amazing office
Profit interests
Medical insurance
Industry leading healthcare
Diversity Dedicated Staff
Macbook pros for all
Brand new offices
Equity
Unlimited pto
La colombe nitro cold brew
Parental Leave
Profit Share Scheme
401(K)
Snacks
Company Outings
Casual Dress
Up to 4% 401k matching

Similar roles that might interest you

Ground Floor, Verse Building, 18 Brunswick Place, London, N1 6DZ

108 E 16th Street, New York, NY 10003

Subscribe to our newsletter

Join over 111,000 others and get access to exclusive content, job opportunities and more!

Staff Site Reliability Engineer, Platform

Gemini

Tech stack

What makes you a perfect candidate for this role

Gemini

Similar roles that might interest you

Audius

Systems Engineer (Urbit)

Audius

Systems Engineer (Urbit)

Audius

Systems Engineer (Urbit)

WorksHub

For companies

Jobs

Locations

Articles

Subscribe to our newsletter

What makes you a perfect
candidate for this role