ObservabilityCON 2022 On-demand sessions
How Grafana Cloud enables near real-time visibility into the Just Eat Takeaway.com platform
Just Eat Takeaway.com (JET), a leading global online food delivery marketplace, has experienced an explosion in telemetry data during the past five years of hypergrowth. With the help of Grafana Cloud Metrics, the JET team has gone from self-hosted, single-point-of-failure Graphite stacks to a high-availability monitoring infrastructure that handles millions of active series per day and provides near real-time visibility into the JET platform. In this talk, Principal Platform Engineer Andrew Marwood and Senior Technology Manager Alex Murray will discuss the pitfalls they overcame, the lessons learned, and their plans for the future, including adoption of OpenTelemetry and tracing with Grafana Tempo.
- Andrew Marwood, Principal Platform Engineer, Just Eat Takeaway.com
- Alex Murray, Senior Technology Manager, Just Eat Takeaway.com
Wells Fargo's observability transformation, powered by Grafana Enterprise and Grafana Cloud
One of the “big four” banks in the U.S., Wells Fargo embarked on an observability transformation powered by Grafana Enterprise and Grafana Cloud. To better serve its customers, the bank modernized its observability tools, standardizing on Prometheus metrics and Grafana dashboards that are provided as a service to everyone from developers and SREs to business analysts and executives. In this session, Senior Software Engineering Manager Nikhilesh Tekwani will talk about the team’s journey to a single pane of glass that highlights the customer base, provides visibility into current vs. expected volume, and gives insight into the strength of their applications.
- Nikhilesh Tekwani, Head of Observability Engineering, Wells Fargo
Incident response made easier with Grafana Alerting, OnCall, and Incident
When things go wrong, Grafana dashboards are often the first place teams go to find answers in metrics, logs, and traces, and the last place they look to put together a postmortem. Grafana naturally sits at the heart of incident response management, and with the recent releases of Grafana OnCall and Grafana Incident and improvements to Grafana Alerting, we’ve made it even easier to integrate IRM into the Grafana workflows you already know and love.
In this session, we’ll walk through the new incident declaration and resolution workflow in Grafana Cloud. Clearco Senior Software Engineer Hudson Ash will also share the wins his team has notched moving from PagerDuty to Grafana Alerting, OnCall, and Incident.
- Mat Ryer, Engineering Director, Grafana Labs
- Matvey Kukuy, Senior Engineering Manager, Grafana Labs
- Farhan Manjiyani, Product Marketing Manager, Grafana Labs
- Marc Chipouras, Senior Director of Engineering, Grafana Labs
- Hudson Ash, Senior Software Engineer, Clearco
Observability strategy at Adobe, with OpenTelemetry, Grafana, Mimir, and Tempo
After making a commitment to open standards, Adobe began shifting from closed-source tools to OpenTelemetry and the Grafana stack. Sachin Garg and Chris Featherstone will discuss how Adobe has developed a push-button Grafana deployment that displays metrics and traces collected via OpenTelemetry and stored in Mimir and Tempo.
- Sachin Garg, Senior Manager of Observability and Tooling, Adobe
- Chris Featherstone, Senior Manager, Software Development, Adobe
Deep dive into Grafana Cloud integrations for easy-to-use infrastructure monitoring
Grafana Cloud provides a seamless and easy-to-use infrastructure monitoring solution with an expansive ecosystem of integrations. With prebuilt dashboards that can monitor Kafka clusters, Kubernetes pods, AWS and GCP deployments, and much more – and alerting rules that can proactively flag issues – Grafana Cloud’s infrastructure integrations take users from zero to observability in just a few clicks.
In this session, Grafana Cloud engineers will discuss the latest infrastructure integrations and demo the new Kubernetes Monitoring solution.
- Richard Lam, Senior Group Product Manager, Grafana Labs
- Coleman Rollins, Software Engineer, Grafana Labs
- Jake Swiss, Senior Product Marketing Manager, Grafana Labs
How Banco Itaú solves infrastructure puzzles with 1B pieces of metrics using Grafana
At Banco Itaú in Brazil, 1.3 billion Prometheus metrics are ingested daily. To make sense of this amount of data, Itaú’s 12,000 engineers use Grafana every day. In this session, Ana Paula Genari Martin from Itaú’s SRE team will share how they use Grafana to observe all of the organization’s AWS services, building traffic lights and alerts to easily see the health of its infrastructure – and even let AWS know that they might have a problem.
- Ana Paula Genari Martin, SRE Manager, Banco Itaú
Catch issues before your customers do: Shift left with k6 and Grafana
Last year, k6 performance testing was added to the Grafana observability stack, a “shift left” that helps organizations develop highly performant, reliable, and stable applications.
In this session, you’ll learn how we have built tighter integrations between k6 and Grafana and hear about new features that will further unify k6 testing and Grafana observability.
- Mark Meier, Senior Product Manager, Grafana Labs
- Lukasz Gut, Senior Software Engineer, Grafana Labs
- Wei Li, Product Marketing Manager, Grafana Labs
Monitoring on steroids: How JPMorgan Chase uses Grafana for their trading platform to spot issues quickly and proactively
The concept: Use trade volumes, synthetic transactions, and proprietary alerting mechanisms derived from SRE precepts to monitor JPMorgan Chase’s technical landscape. Using Grafana, the JPMC team built a comprehensive tool that records trends and highlights issues proactively in real time. Support teams can now track down the root cause in minutes, compared to historical cases where the same issue took hours to detect. The team uses AIOps to generate dynamic thresholds that can vary with such factors as time of day, week, or month, and can take into account historic trends to adapt over time. Critical revenue-impacting issues are remediated in a timely manner, and downtime is minimized. In this session, Askari Imam and Crystal Sorensen will give a technical overview of the tool and share how JPMC incorporates error budgets to help users understand why certain issues arise and how they correlate to types of errors, and to enable leadership to address those issues in a data-driven way.
- Askari Imam, VP, Asset Wealth Management - Product & Integration Delivery, JPMorgan Chase
- Crystal Sorensen, Certified Scrum Product Owner, JPMorgan Chase
- Rich Mirsberger, Vice President, Global Technology, JPMorgan Chase
LGTM: Scale observability with Mimir, Loki, and Tempo
For organizations transitioning to cloud native infrastructures and microservice architectures, observability data can quickly grow out of control. Scalable and highly performant backend databases for metrics, logs, and traces are no longer just nice to have; they’re critical.
Join the Grafana Labs backend telemetry teams for demos of the latest features in Grafana Mimir for metrics, Grafana Loki for logs, and Grafana Tempo for traces. You’ll hear how we’ve increased database scale and performance and improved ease of use.
- Jen Villa, Senior Group Product Manager, Grafana Labs
- Cyril Tovena, Principal Software Engineer, Grafana Labs
- Ed Welch, Principal Software Engineer, Grafana Labs
- Fiona Liao, Senior Software Engineer, Grafana Labs
- Joe Elliott, Principal Software Engineer, Grafana Labs