Company: Dojo
Industry: Financial Services (Fintech)
Dojo is a UK-based payments provider offering fast, secure card terminals and transaction management apps for small and enterprise businesses. With point-to-point encryption and seamless EPOS integration, Dojo enables easy and secure payment processing. The company is expanding internationally, with operations in Spain and Ireland.
Challenge
Dojo’s observability setup initially centered on metrics, with dashboards and alerting based on this single pillar. However, the team recognized that faster troubleshooting required correlation across metrics, traces, and logs. They had some traces in a self-hosted Jaeger instance and logs in GCP, but this fragmented approach was difficult to scale and maintain.
Solution
Grafana Cloud offered Dojo a unified observability solution to centralize metrics, traces, and logs. The managed platform also made it easy to correlate these signals and drill down between them, streamlining troubleshooting and improving operational efficiency.
“We can very easily surface relevant issues for teams, and the troubleshooting experience is so quick.”
Roberto Nunes, Staff Engineer
Impact
- Cloud-agnostic flexibility and vendor independence: Grafana Tempo and Loki provided Dojo with a fully cloud-agnostic solution, eliminating vendor lock-in and offering increased flexibility.
- Accelerated troubleshooting: With the Grafana Stack, Dojo significantly simplified its troubleshooting process by correlating data across observability pillars. Engineers can now drill down from metrics to traces to logs and back, using features like exemplars to surface relevant issues quickly. This streamlined workflow, from alert to dashboard, exemplar, trace, and log, reduced resolution time and improved team efficiency.
- Streamlined alerting with global templates and Grafana Agent: To address challenges with symptom-based alerting and reduce alert fatigue, Dojo implemented a global alert templating system. It gives teams standardized, out-of-the-box alerts, such as CPU throttling, across all domains and systems. The approach encouraged awareness and gradual adoption of more advanced, symptom-based alerts while cutting redundant alerts. It also empowered platform teams to ship products with pre-configured alerts, scaling observability to over 400 workloads, 80 agents, and 200 active users across engineering, ops, and product teams.
- Enhanced debugging with profiling: Dojo plans to expand into the fourth pillar of observability by integrating profiling tools like Grafana Pyroscope. Early use cases with Pyroscope have proven invaluable, helping to identify and resolve complex issues like memory leaks.
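
As a rough illustration of the global alert templating idea described above, the sketch below renders one standardized CPU-throttling rule per workload from a single shared template. It is not Dojo's implementation. The function names, rule layout, threshold values, and the `payments` and `terminals` workloads are hypothetical. The only real names used are the two standard cAdvisor container metrics in the query.

```python
from string import Template

# Shared template for a standardized CPU-throttling alert expression.
# The metric names are the standard cAdvisor container metrics; the
# threshold and rule layout are illustrative assumptions.
CPU_THROTTLING_EXPR = Template(
    'sum(rate(container_cpu_cfs_throttled_periods_total{namespace="$namespace"}[5m]))'
    ' / sum(rate(container_cpu_cfs_periods_total{namespace="$namespace"}[5m]))'
    ' > $threshold'
)


def render_cpu_throttling_alert(namespace, team, threshold=0.25):
    """Return one Prometheus-style alert rule as a plain dict (hypothetical helper)."""
    return {
        "alert": "CPUThrottlingHigh",
        "expr": CPU_THROTTLING_EXPR.substitute(namespace=namespace, threshold=threshold),
        "for": "15m",
        "labels": {"severity": "warning", "team": team},
        "annotations": {"summary": f"High CPU throttling in namespace {namespace}"},
    }


def render_for_workloads(workloads):
    """Apply the same template to every workload so all teams get the same baseline alert."""
    return [render_cpu_throttling_alert(**workload) for workload in workloads]


# Hypothetical workloads; the second one overrides the default threshold.
rules = render_for_workloads([
    {"namespace": "payments", "team": "payments-platform"},
    {"namespace": "terminals", "team": "devices", "threshold": 0.5},
])
```

Keeping the query in one place means a fix to the template reaches every workload at once, while a team can still override individual parameters such as the threshold.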
