Senior Site Reliability Engineer job in New York, NY | Majo...

Senior Site Reliability Engineer
Major League BaseballNew York, NYa month ago
The Site Reliability Engineer position is responsible for creating the infrastructure powering the baseball experience as part of MLB.com. Launched in 2001 as the tech arm of Major League Baseball, MLB.com is renowned for creating experiences that baseball fans love - and we're just getting started!

This role offers the opportunity to collaborate with other world-class engineers, product developers, and designers; contribute to award-winning and complex apps and systems; influence the innovation of products used by millions globally; and work in a highly collaborative, results-oriented, team environment.

Using cutting edge technology, our software is consumed by fans, broadcasters, stadiums, MLB Clubs and the League itself. We are looking for Site Reliability Engineers / SRE's that are passionate about building new technologies for the baseball industry.

The ideal [email protected] has a focus on providing consulting services to any and all engineering teams throughout the organization to help in certain key areas:


  • Incident response

  • Kubernetes operations

  • User experience optimization through SLIs and SLOs

  • Observability tooling

  • Debugging running systems and providing tools to assist runtime debugging

  • Optimizations for cost control

  • High Availability and Disaster recovery planning


As an SRE at MLB you will:


  • Be an technical evangelist for best practices

  • Write software in python, Golang, bash, Typescript, Javascript, HCL, and C/C++.

  • Extensively utilize Terraform for infrastructure as code

  • Use and administer Grafana including writing and maintaining plugins

  • Write and maintain tools that help software engineering teams function better

  • Provide advice around HA and DR initiatives and cost reduction techniques

  • Consult with engineering teams in the key areas outlined above

  • Help enhance the next generation of APM tooling

  • Use Google Cloud Platform

  • Automate telemetry collection

  • Be available for rotational on-call for SRE managed services


Things you will almost certainly work on:


  • Understanding the user experience for services run by the consulted engineering team

  • Helping set target objectives for user experience specific to the team/service

  • Understanding the cost of running services and set targets accordingly

  • Understanding relative service health to help guide engineering effort (SLOs)

  • Enhancing observability tooling to make the above possible

  • Enhancing alerting tooling to make the above actionable

  • Gaining deep understanding of service operation to guide disaster recovery and HA

  • Building 'batteries included' solutions that are extensible across the organization


A good fit for this role will:


  • Prioritize unblocking your teammates, collaboration and knowledge sharing

  • Dedicated to continuous improvement of yourself and our SRE capabilities

  • Passionate about the value of SRE, but accepting of our role as a patient influencer

  • Resourceful in finding extensible and strategic solutions to common problems and creative in engineering custom solutions

  • Have a knack for automation and a passion for reducing toil

  • Have written code in a compiled language that runs in production somewhere

  • Have written code in interpreted languages

  • Have worked with time series data extensively and specifically, Circonus/ IronDB (CAQL experience is a bonus)

  • Have worked with o11y tools extensively

  • Have experience with Terraform

  • Understand that the documentation is a product

  • Have worked with and gained a solid understanding of Grafana and its ecosystem

  • Have experience with Kafka, Helm Charts and Kubernetes

  • Understand basic concepts of APM such as tracing


    • OpenTelemetry experience is a bonus






What's it like to work at MLB?

Major League Baseball (MLB) is the most historic of the major professional sports leagues in the United States and Canada. Employees love working at MLB because of the culture of growth, teamwork, and professionalism. Employees who are most successful at MLB take initiative, know how to identify problems and provide solutions, and always put the Team first. For those ready to step up to the plate and join the Major Leagues, MLB takes the same approach as teams do with their players: empowering them to be at their best by engineering experiences that put employees in the best position to succeed. Major League Baseball is looking for candidates who are passionate about growing America's pastime to best serve its fans for decades to come.

MLB's vision is to be the global sport of choice for youth to play, fans of all backgrounds to enjoy and a desired destination for employment. With a belief that the journey to growth and greatness is ongoing, MLB gives employees the opportunity to continue learning and honing their skills with programs such as: tuition reimbursement; mentorship programs; lunch and learns; online course subscriptions; paid industry certifications; business resource groups; and more.

MLB provides its employees with exceptional medical, dental, and vision coverage. Premiums are 100% employer covered to help employees focus on being their best!

All in-office and ballpark-based positions are subject to MLB's mandatory Covid-19 vaccine policy.
All in-office and ballpark-based positions are subject to MLB's mandatory Covid-19 vaccine policy

Site Reliability Engineering Manager, Trello (Storage Layer)

Atlassian

New York, NY

Tue, 28 Jun 2022 15:38:46 GMT
You’re familiar with system design, site reliability engineering a...
Senior DevOps Engineer, VP - hybrid

MUFG

Jersey City, NJ

Tue, 28 Jun 2022 14:02:06 GMT
Experience implementing enterprise systems with security best practices and s...
Site Reliability Engineer

Jotform

Manhattan, NY

Tue, 28 Jun 2022 09:42:24 GMT
This is a full-time, fully remote opportunity in the Pacific time zone, though a...
Site Reliability/DevOps Engineer - Opportunity for Working Remotely New York, NY

VMware

New York, NY

Tue, 28 Jun 2022 00:20:28 GMT
You will be responsible for improving the reliability and resiliency of m...
Site Reliability Engineer, Americas

Canonical - Jobs

New York, NY

Mon, 27 Jun 2022 08:46:53 GMT
Our site reliability engineers bring Python software-engineering s...
Site Reliability/DevOps Engineer - Opportunity for Working Remotely Newark, NJ

VMware

Newark, NJ

Tue, 28 Jun 2022 00:20:28 GMT
You will be responsible for improving the reliability and resiliency of m...
Infrastructure Site Reliability Engineer

Schrödinger

New York, NY

Tue, 28 Jun 2022 01:05:35 GMT
This position presents the unique opportunity to support researchers and develop...
Site Reliability Engineer / SRE : 10+ years exp needed

PC Services inc

New York, NY

Mon, 27 Jun 2022 23:58:52 GMT
Design, implement and monitor the Service Level Objectives (SLOs) and Service Le...
Site Reliability Engineer

JPMorgan Chase Bank, N.A.

Jersey City, NJ

Sun, 26 Jun 2022 04:14:08 GMT
Engage with development team throughout the life cycle to help develop software ...
Site Reliability Engineer - Private Cloud

JPMorgan Chase Bank, N.A.

Jersey City, NJ

Sun, 26 Jun 2022 04:14:08 GMT
§ Apply standards of cloud compliance to application design to achieve reliab...
Site Reliability Engineer

CVS Health

New York, NY

Sat, 25 Jun 2022 14:11:21 GMT
Improve the reliability of our systems and processes with a keen focus on...
Site Reliability Engineer

Fiserv, Inc.

Short Hills, NJ

Sat, 25 Jun 2022 13:04:31 GMT
Assess the current state of the environment and drive initiatives in collaborati...
Senior Site Reliability Engineer

CVS Health

New York, NY

Sat, 25 Jun 2022 14:11:21 GMT
IT leaders to build digital assets, provide expert level support to resolve comp...
Senior Dev Ops Engineer

PMC

New York, NY

Sat, 25 Jun 2022 04:24:16 GMT
Managing CDN and other infrastructure technologies to ensure site perform...
Site Reliability Engineer (SRE)- Java / Python / Linux Production / Operations / Reliability Engineering

JPMorgan Chase Bank, N.A.

Jersey City, NJ

Sat, 25 Jun 2022 04:15:35 GMT
Engage with development team throughout the life cycle to help develop software ...