Site Reliability Engineer
Recruiting From Scratch, Santa Monica, CA

Who is Recruiting from Scratch:


Recruiting from Scratch is a premier talent firm that focuses on placing the best product management, software, and hardware talent at innovative companies. Our team is 100% remote, and we work with teams across the United States to help them hire. We work with companies funded by the best investors, including Sequoia Capital, Lightspeed Ventures, Tiger Global Management, A16Z, Accel, DFJ, and more.


If you are a fit, the team will reach out to you about this role or any others that may be a fit for our clients.


Our Client


Our client is revolutionizing how businesses leverage and enhance consumer data. Their platform (APIs, components, and rules engine) enables innovative companies and developers to seamlessly integrate credit and identity data into their apps, websites, or workflows. Founded by serial entrepreneurs (with several exits), they’ve been nearly doubling revenue annually. They're looking for passionate and creative teammates to help them scale and supercharge the company. If you’re looking for autonomy, impact, and cutting-edge FinTech, they’d love to hear from you. Though fully remote, our client's team is the foundation of their success. They strive for diverse backgrounds, opinions, and approaches. They encourage respectful dissent and digging for the truth so that they can deliver the best product for their clients and users. Continuous improvement, experimentation, and a clear mission stretch them individually and together.


They are seeking experienced, results-driven, and passionate engineers to join their infrastructure project team. Their ideal candidate is a self-starter and has excellent communication skills. Their collaborative environment relies heavily on innovation, technical savvy, and problem-solving skills. This is a full-time remote position within the US. As their newest Site Reliability Engineer, you’ll be a major contributor to the company’s success. You’ll work with teams across the organization to build and maintain monitorable, performant, reliable, and highly scalable software systems. Your technical contributions will help protect dozens of brands, facilitate continuous delivery, and support their SLA for high-traffic websites and mobile applications in the credit score and reporting industry.

Responsibilities



  • Evangelize best practices for building and operating highly reliable systems.

  • Consult on system design to meet reliability and capacity requirements.

  • Continuously optimize performance and reliability.

  • Support application deployments, building new systems and upgrading and patching existing ones through DevOps methodologies.

  • Automate infrastructure and configuration management.

  • Conduct timely post-mortems of production infrastructure incidents.

  • Assist with all aspects of operational security and PCI compliance.

  • Seek out potential threats to security and reliability and advocate for solutions.



Job Requirements



  • Passion for reliable, scalable, observable software with a strong sense of ownership.

  • 5+ years of experience developing and monitoring mission-critical systems.

  • Hands-on experience with Docker and docker-compose.

  • Proficiency in working with and understanding a containerized development workflow.

  • Strong background in Linux/UNIX administration (e.g. RedHat/CentOS 7/Alpine Linux).

  • Experience with configuration management tools like Ansible.

  • Experience with Infrastructure as Code (IaC) tools like Packer and Terraform.

  • Experience in deploying large-scale Docker-based environments with OpenShift, Kubernetes, or a similar product.

  • Experience with languages like Bash, Python, or Node.js.

  • Experience implementing application clustering and load balancing concepts and technologies.

  • Experience using DevOps tooling/modules with VMware vSphere’s API.

  • Proficiency administering a CI/CD pipeline (we use GitLab).

  • Proficiency with networking fundamentals, diagnostics, and troubleshooting.

  • Proficient in using command line tools to quickly triage and fix production issues.

  • Understanding of protocols/technologies like HTTP, SSL, LDAP, SQL, HTML, and XML.


Nice to Have


  • Experience implementing CI/CD blue/green deployments using GitLab CI/CD

  • You're a wizard with Terraform

  • Project Atomic (Red Hat and Fedora atomic host)

  • Consul

  • Build and maintain data stores with PostgreSQL

  • Implement Keepalived / Linux HA in a scalable environment

  • In-depth knowledge of distributed computing and data systems, multi-region presence, high traffic websites.

  • In-depth knowledge of immutable infrastructure, composable infrastructure, and/or serverless computing.







