INSPYR
Site Reliability Engineer
Job Description
- Req#: 25-13887
Site Reliability Engineer (SRE)
Location: Remote (U.S. based)
Employment Type: 6-month contract with possible extension
Industry: Automotive / Internal Tools
Work Requirements: U.S. citizens, green card holders, or others authorized to work in the U.S.
Rate: $55-$64.10/hr
Team: Engineering (working alongside DevOps, Product, and Back-End/Front-End developers)
About Us
We're developing a mission-critical internal tool for a major automotive service provider, currently live in 20+ stores with plans to scale to over 2,000 nationwide. With a growing user base and a need for real-time operational insights, we're adding a dedicated Site Reliability Engineer (SRE) to our engineering team to help ensure performance, reliability, and observability at scale.
About the Role
We're looking for a Site Reliability Engineer with a strong foundation in observability, particularly with DataDog, to partner with our DevOps engineer and broader development team. You'll be instrumental in helping us understand how users interact with our application, how our systems respond in real time, and how we can scale with confidence.
This is a hands-on role that balances infrastructure insight, system reliability, and the strategic implementation of monitoring, alerting, and recovery practices.
Key Responsibilities
- Observability & Monitoring
  - Build, refine, and maintain monitoring, alerting, and dashboards in DataDog to surface application and infrastructure performance metrics.
  - Work closely with product and engineering to define and track SLIs/SLOs.
  - Identify and instrument key user interaction points to improve system visibility.
- Infrastructure & Reliability
  - Contribute to system reliability efforts for our Azure-based back-end and Vercel-hosted front-end.
  - Support disaster recovery planning and implementation.
  - Help define best practices for error budgets, incident response, and availability targets.
- Scalability & Performance
  - Assist in load testing initiatives to prepare the application for broader deployment across thousands of locations.
  - Collaborate with DevOps to enhance deployment pipelines and infrastructure scalability.
Required Qualifications
- 3+ years of experience in a Site Reliability Engineering, DevOps, or related role.
- Hands-on experience with DataDog, including custom dashboards, alerts, and APM features.
- Strong grasp of observability principles (SLIs/SLOs, alert fatigue, tracing).
- Working knowledge of Microsoft Azure services and environments.
- Familiarity with incident response, root cause analysis, and postmortems.
Nice-to-Have Experience
- Load testing tools (e.g., k6, Gatling, Locust).
- Infrastructure as Code (Terraform, Bicep, ARM templates).
- CI/CD pipeline development and optimization.
- Vercel deployment and configuration practices.
What We Offer
- Opportunity to shape observability and reliability for a high-impact internal tool.
- Collaborative, low-ego team environment.
- Remote-friendly work culture.
- Exposure to modern tech stacks and scalable architecture challenges.
About the company
Genuent provides an innovative approach to the delivery of information technology talent. Genuent is becoming INSPYR Solutions.
Notice
Talentify is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or protected veteran status.
Talentify provides reasonable accommodations to qualified applicants with disabilities, including disabled veterans. Request assistance at accessibility@talentify.io or 407-000-0000.
Federal law requires every new hire to complete Form I-9 and present proof of identity and U.S. work eligibility.
An Automated Employment Decision Tool (AEDT) will score your job-related skills and responses. Bias-audit & data-use details: www.talentify.io/bias-audit-report. NYC applicants may request an alternative process or accommodation at aedt@talentify.io or 407-000-0000.