Site Reliability Engineering (SRE) Monthly DevOps and Maintenance
For Ruby on Rails products with serious reliability and operations needs, we assign a full time SRE or DevOps Engineer. Your Cloud hosted site remains available and responsive while maintaining a rapid pace of feature development. We improve availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning.
Features
- Mentor product teams and stakeholders to adopt the SRE mindset
- Implement monitoring and alerting to ensure KPIs are met
- Improve performance and scaling for applications to meet SLOs
- Improve CI/CD pipelines to allow fearless deployment to production
- Deploy new infrastructure meeting scaling, security, and compliance needs
- Implement infrastructure as code to ensure long-term maintainability
- Drive cultural adoption of practices for continually improving services.
- Upskill client’s staff to take on SRE functions
Benefits
- Improve app performance and scaling
- Speed up feature deployment while reducing errors/bugs
- Improve long term stability and maintenance
- Relieve operational teams of incident response
- Reduce the Mean Time-To-Failure (MTTF)
- Increased levels of innovation and experimentation.
- Increased levels of knowledge sharing.
- Facilitates team decision making without reliance on other teams.
- Improved quality of software and better capacity planning.
Pricing
£85 to £200 a unit an hour
- Education pricing available
Service documents
Request an accessible format
Framework
G-Cloud 14
Service ID
1 6 8 6 0 6 2 4 0 3 3 9 9 2 3
Contact
thoughtbot Limited
Kirsten Hurley
Telephone: 0 20 3807 0560
Email: kirsten@thoughtbot.com
Planning
- Planning service
- Yes
- How the planning service works
-
Through a discovery process with our clients we determine the most appropriate solution for their Site Reliability Engineering requirements and whether this can be as a service or augmentation.
Our approach is tailored to the maturity of the customer’s SRE approach and the investment they have already made in platforms and tooling. Certain prerequisite apply. Server infrastructure support including maintenance of CI/CD pipeline for staging and production environments, configuration for running services in a containerized environment, and upgrade components and workload platform as necessary. Monitor running infrastructure and intervene when essential services aren't responding. Add new metrics and alarms as necessary. Scale up services as required to handle application traffic. - Planning service works with specific services
- Yes
- Hosting or software services the planning service works with
-
- Amazon Web Services (AWS)
- Microsoft Azure
- Google Cloud
Training
- Training service provided
- Yes
- How the training service works
- Working closely with our clients to understand their needs, our consultants offer a wealth of knowledge and experience. Our services include sharing learnings and best practices to level up your team and we provide documentation and training to implement and follow capacity planning. Your team has access to ongoing support for questions about the functionality and operation of the systems and services.
- Training is tied to specific services
- Yes
- Services the training service works with
-
- Amazon Web Services (AWS)
- Microsoft Azure
- Kubernetes
Setup and migration
- Setup or migration service available
- No
Quality assurance and performance testing
- Quality assurance and performance testing service
- Yes
- How the quality assurance and performance testing works
-
Our testing and quality assurance processes include:
1. The design and build of continuous integration and performance testing pipelines to ensure the appropriate level of quality assurance is applied depending on the extent of the code change being committed
2. Strategic guidance to devise and implement a comprehensive testing strategy
3. Developer assistance in authoring tests and implementing test-driven development practices
Security testing
- Security services
- Yes
- Security services type
-
- Security strategy
- Security risk management
- Security design
- Security testing
- Security incident management
- Security audit services
- Certified security testers
- No
Ongoing support
- Ongoing support service
- No
Service scope
- Service constraints
- Support is provided remotely
User support
- Email or online ticketing support
- Email or online ticketing
- Support response times
- We intervene on service alerts, outages, bugs, and all other issues affecting the reliable operation of client’s systems within 12 hours
- User can manage status and priority of support tickets
- No
- Phone support
- Yes
- Phone support availability
- 24 hours, 7 days a week
- Web chat support
- No
- Support levels
- We intervene on service alerts, outages, bugs, and all other issues affecting the reliable operation of client’s systems within 12 hours. For clients that meet our prerequisites, we can provide clients with 24x7 monitoring and support for to make sure their applications are always available.
Resellers
- Supplier type
- Not a reseller
Staff security
- Staff security clearance
- Staff screening not performed
- Government security clearance
- Up to Baseline Personnel Security Standard (BPSS)
Standards and certifications
- ISO/IEC 27001 certification
- No
- ISO 28000:2007 certification
- No
- CSA STAR certification
- No
- PCI certification
- No
- Cyber essentials
- Yes
- Cyber essentials plus
- No
- Other security certifications
- No
Social Value
- Social Value
-
Social Value
- Fighting climate change
- Covid-19 recovery
- Tackling economic inequality
- Equal opportunity
- Wellbeing
Fighting climate change
Our team is actively looking for opportunities to partner with companies in the Green Tech Space. Most recently, we partnered with BeeODiversity, a Belgian startup committed to helping companies preserve biodiversity and reduce on-site pollution.Covid-19 recovery
thoughtbot primarily supported the architecture and implementation of the application (front and backend) as well as the creation of the corresponding APIs powering the new loan schemes. Given the amount of activity expected from the many businesses applying for loans, ensuring our product could scale to meet demand from the lending banks was essential.
In April 2021, thoughtbot moved on to supporting the rollout of the Recovery Loan Scheme, the successor to the three COVID-19 schemes. In all, the COVID-19 and RLS schemes have resulted in British Business Bank offering financial support to nearly 80% of the UK's SME market.
In parallel to our loan scheme and application development efforts, thoughtbot also supported a cloud migration from UK cloud to Azure. thoughtbot worked closely with BBB's operations partners to reconfigure the application and seamlessly migrate to the new Azure platform with minimal downtime and disruption.Tackling economic inequality
Tackling economic inequality - create new business/jobs OR increase supply chain resilience/capacity
thoughtbot services enable companies and organizations to meet business goals including operating profitably or meeting other fiscal responsibilities, creating business growth, or improving efficiency. This in turn drives continued employment and growing employment opportunities with in the industry, geography, and/or supply chain.Equal opportunity
When working with our clients, we set out to be true partners. That means being proactive about asking questions, making recommendations, and raising concerns. Although our primary focus is building great software, how we do that and the relationships we grow along the way are greatly impacted by DEI. DEI best practices are another area we are happy to discuss, and through those honest conversations, we build mutual trust as a collective team.Wellbeing
One of our core values at thoughtbot is to do client work at a sustainable pace, which ultimately allows increased productivity, and a happier team. In addition to supporting the holistic well-being of the team through a variety of policies, our client work has been often focused well being. We have done work with many fitness, healthcare, and wellness clients. This includes UK-based clients Samaritans, Be Inspired, and Steel Warriors.
Pricing
- Price
- £85 to £200 a unit an hour
- Discount for educational organisations
- Yes