Our blog
Resources and insights
The latest industry news, interviews, technologies, and resources.
Featured post
Automation at Dataiku Eliminates DevOps Work and Improves Customer Experience
Almost 170 remediations were automatically triggered last month, conservatively saving over 20 FTE days of DevOps work, while improving app performance.
Louis-Philippe Kronek
•
July 14, 2022
Yuvraj Mehta
•
August 8, 2022
Simplifying Automation at Production Scale with Shoreline.io
Shoreline announces customer-driven enhancements that give enterprise customers critical safeguards against human error when executing large-scale automations across their multi-cloud infrastructure.
Product
Charles Cary
•
July 29, 2022
Self-Healing: The Key to Fixing the Most Common Kubernetes Issues
Here are three tips for automatically fixing the most common Kubernetes issues through mastering Kubernetes, self-healing, and staying proactive.
Reliability
Incident Automation
Louis-Philippe Kronek
•
July 13, 2022
Automation at Dataiku Eliminates DevOps Work and Improves Customer Experience
Almost 170 remediations were automatically triggered last month, conservatively saving over 20 FTE days of DevOps work, while improving app performance.
Reliability
Incident Automation
Chris Newton
•
June 2, 2022
How to Reduce SRE Toil
Get actionable strategies for reducing toil so your SRE and DevOps teams can spend more time on projects that create net-new value for your business.
Reliability
Incident Automation
Charles Cary
•
May 25, 2022
Operations at the Edge
Processing, analyzing, and then acting on observability data entirely within your own environment offers a cheaper, faster, more fault-tolerant, and more secure alternative.
Product
Chris Newton
•
April 7, 2022
What Is a Runbook?
This guide provides a basic understanding of runbooks and how to build one that saves your company valuable time and resources.
Reliability
Sanjit Kalapatapu
•
March 22, 2022
Code to resize a disk in Kubernetes on AWS
Learn how to manually and automatically resize a disk, including error handling.
Reliability
Chris Newton
•
March 21, 2022
What Is Runbook Automation?
Discover how runbook automation can help you save valuable time and money by shortening incident response time, reducing toil, and boosting innovation.
Reliability
Incident Automation
Chris Newton
•
March 21, 2022
What Is SRE (Site Reliability Engineering)?
Curious about site reliability engineering (SRE) and how it can help you iron out incidents more efficiently and consistently? Read this guide to learn more.
Reliability
Adnan Dosani
•
February 16, 2022
Multi-Cloud Operations can be easy!
Shoreline’s platform hides complexity, eliminates the pain, and makes multi-cloud operations easy on the team.
Product
Brian Scheuermann
•
December 3, 2021
Solving Advent of Code Puzzles with Shoreline
Using Shoreline's Oplang and Metrics System to Solve Advent of Code Puzzles
Product
Tanay Menezes
•
September 24, 2021
Building Shoreline's Azure Agent During My Summer Internship
Important lessons and valuable experiences while developing Shoreline's Azure Agent.
Company
Charles Cary
•
September 15, 2021
Infrastructure as Code for Production Ops
DevOps leaders can apply infrastructure as code lessons and tooling to production ops, use solutions like Terraform + Shoreline to automate repeatable tasks, and make hero-level institutional knowledge accessible to anyone.
Incident Automation
Gabe Wyatt
•
September 10, 2021
Analyze and act upon remediation incidents with Shoreline Events
Learn how to filter, sort, analyze, and act upon remediation incidents with Shoreline Events.
Product
Jainam Shah
•
September 9, 2021
Intern Spotlight: Jainam Shah on Shoreline Notebooks
Jupyter Notebooks for DevOps
Company
Joe Kuo
•
September 1, 2021
Automatically Resolve Kubernetes DNS Issues with the CoreDNS Op Pack
Learn how to resolve Kubernetes DNS issues with Shoreline's CoreDNS Op Pack.
Product
Amanda Palamar
•
August 30, 2021
Intern Spotlight: Amanda Palamar on Time Series Search
For our Intern Spotlight series, we’ll showcase the work of a summer intern at Shoreline with a technical deep dive.
Company
Joe Kuo
•
August 24, 2021
Prevent Kubernetes IP Exhaustion with Shoreline’s Argo Op Pack
Shoreline’s Argo Op Pack is purpose-built to remediate IP exhaustion related to Argo workflows automatically.
Product
Sergiu Iacob
•
August 23, 2021
Minimizing Mean Time to Detect: Real Time Alarms with IREE
Execute thousands of alarms on-box, with one second of delay.
Product
Gabe Wyatt
•
August 20, 2021
Fleetwide Debugging in 3 Easy Steps
Learn how to rapidly debug and resolve issues across your entire infrastructure.
Incident Automation
Anurag Gupta
•
July 27, 2021
Why I Built Shoreline Incident Automation
The increasing fleet size and complexity of production environments have created an explosion in on-call incidents. You can dramatically reduce on-call fatigue and improve availability using Shoreline’s incident automation platform.
Incident Automation
Charles Cary
•
July 27, 2021
Using Shoreline.io to root-cause transient issues (like JVM garbage collection)
Shoreline makes it easy to collect diagnostic information when you're doing a root-cause analysis of an issue. This example shows how to automatically capture debugging information for slow Java garbage collection and then automatically bounce the process to alleviate customer pain.
Product
Company
Sergiu Iacob
•
July 12, 2021
Shoreline Accelerates Ops with JAX & XLA
Shoreline’s metrics team has leveraged two machine learning technologies from Google, JAX and XLA, to accelerate metric query and data analysis.
Product
Austin Gunter
•
July 6, 2021
What is DevOps Automation
DevOps automation is the process of getting machines to handle repetitive work in the software deployment and operations lifecycle so that operators can deploy iterative updates faster and their systems operate more reliably.
Incident Automation
Austin Gunter
•
May 19, 2021
Why systems fail and what you can do about it
Anurag Gupta spoke at the CTO Summit on Reliability to share his new talk “Why systems fail and what you can do about it.”
Reliability
Incident Automation
Narendra Nath Challa
•
April 26, 2021
Tutorial: automating Kubernetes worker node retirement
This guide shows you how to manually decommission Kubernetes worker nodes and replace them with a new host and then shows you how to automate that process with a Shoreline Op Pack.
Product
Charles Cary
•
April 1, 2021
Advice for someone moving from SRE to backend engineering
SRE and Backend Engineering have a lot of overlap, and you can swap between roles relatively easily. This post addresses the pros and cons of leaving SRE for Backend work.
Reliability
Austin Gunter
•
March 4, 2021
Runbooks vs Playbooks: explaining the difference
The terms runbooks and playbooks are often used interchangeably by SREs. They are similar, but this post explains the differences so you can pair the two together as part of your operational excellence.
Reliability
Charles Cary
•
February 25, 2021
The Guide to Automating Runbook Execution
Creating your runbooks is only the first step. Automating runbook execution to run based on an alarm, without human intervention, is the real goal.
Incident Automation
Charles Cary
•
February 15, 2021
Restarts and rollbacks don't fix everything
Every iteration of automating and streamlining operational procedures has been advertised as the cure-all solution to every ailment, including resolving incidents. While declarative infrastructure, programmatic deployments, and repeatable automations are desirable, they aren’t capable
Reliability
Incident Automation
Explore Shoreline’s resources
Looking for more information? Visit our other resource sections
Webinars
Learn from industry leaders, explore our product, and revolutionize your production ops.
Events
See Shoreline up close and in person at some of the biggest industry events of the year.
News
Read all the latest Shoreline news and media coverage.