Site Reliability Engineering (SRE) --- Only W2 Job at Algebra IT, New Jersey

  • Algebra IT
  • New Jersey

Job Description

Hi,

I hope you are doing well. Below is the job description; please review it and let me know if you are interested.

Role: Site Reliability Engineering (SRE)

Location: NYC, NY (Hybrid, 3 days onsite)

Position Overview:

We are seeking a highly skilled and experienced Senior Site Reliability Engineer (SRE) to lead our SRE team in ensuring the reliability, scalability, and performance of our production systems. The ideal candidate will have a strong background in cloud infrastructure, automation, and system monitoring, along with the leadership and communication skills needed to collaborate across teams and foster a culture of operational excellence.

Key Responsibilities:

  • Design and develop enterprise-grade APIs and configuration solutions.
  • Contribute to enterprise and application architecture design.
  • Lead observability initiatives including monitoring, alerting, and incident response.
  • Build and maintain dashboards and alerting systems using Grafana, Prometheus, Splunk, etc.
  • Create and maintain detailed runbooks for operational procedures and incident handling.
  • Define and monitor SLAs, SLOs, and KPIs for critical services.
  • Collaborate with architecture, development, and security teams to ensure system reliability.
  • Evaluate and adopt new technologies to improve system performance and maintainability.

Required Skills:

  • Strong background in IT infrastructure, cloud platforms (AWS, Azure, GCP), and SRE practices.
  • Experience in enterprise and application architecture.
  • Proven experience in building APIs and backend services.
  • Hands-on experience with the following tools:
      • Monitoring & Observability: Grafana, Prometheus, Splunk
      • ITSM & Operations: ServiceNow, OpsRamp
      • Project & Incident Tracking: JIRA
  • Experience in building alerts, dashboards, and operational runbooks.
  • Experience managing distributed systems and large-scale production environments.
  • Strong leadership, communication, and problem-solving skills.
  • Ability to quickly learn and adapt to new technologies and environments.

Preferred:

  • Exposure to OpenShift and Azure cloud platforms.
  • Certifications: SRE Foundation, ITIL, or relevant cloud certifications.
