Google Stackdriver performs monitoring, logging and diagnostics to help businesses ensure optimal performance and availability. The service gathers performance metrics and metadata from multiple cloud accounts and allows IT teams to view that data through custom dashboard, charts and reports.
Google Stackdriver is natively integrated with Google Cloud Platform and hosted on Google infrastructure, but the monitoring capabilities can also be used for applications and virtual machines (VMs) that run on Amazon Web Services Elastic Compute Cloud (AWS EC2). In addition, it can pull performance data from open source systems, such as Cassandra, Apache Web Server and ElasticSearch.
Google Stackdriver’s main features include:
- Stackdriver Monitoring measures the health of cloud resources and applications by providing visibility into metrics such as CPU usage, disk I/O, memory, network traffic and uptime. It is based on collectd, an open source daemon that collects system and application performance metrics. Users can receive customizable alerts when Stackdriver Monitoring discovers performance issues. It is used to monitor Google Compute Engine and Amazon EC2 VMs.
- Stackdriver Error Reporting identifies and analyzes cloud application errors. A centralized error management interface provides IT teams with real-time visibility into production errors with cloud applications, as well as the ability to sort and filter content based on the number of error occurrences, when the error was first and last seen, and where the error is located.
- Stackdriver Debugger inspects the state of an application, deployed in Google App Engine or Google Compute Engine, using production data and source code. During production, snapshots can be taken of an application’s state and linked back to a specific line location in the source code, without having to add logging statements. This inspection can occur without affecting the performance of the production application.
- Stackdriver Trace collects network latency data from applications deployed in Google App Engine. Trace data is gathered, analyzed and used to create performance reports to identify network bottlenecks. Trace API and Trace SDK can be used to trace, analyze and optimize custom workloads, as well.
- Stackdriver Logging provides real-time log management and analysis for cloud applications. Log data can be kept for longer periods of time by archiving it with Google Cloud Storage. The service works with both Google and AWS, and can gather logs from Google Compute Engine, Google App Engine and Amazon EC2.
Google Stackdriver was originally founded in 2012 by Dan Belcher and Izzy Azeri. In May 2014, it was acquired by Google.
Google Stackdriver is used by cloud administrators, engineers and developers. Other cloud application monitoring tools include Datadog, LogicMonitor, Sumo Logic and New Relic.