Loading video player...
Are you spending too much time firefighting issues in your Elasticsearch or OpenSearch clusters? The daily toil of monitoring dashboards, diagnosing performance bottlenecks, and optimizing costs can be overwhelming. What if you could automate this process with an AI Site Reliability Engineer (SRE)? In this episode, we're joined by Itamar, founder of Big Data Boutique and creator of Pulse, an AI SRE platform designed specifically for Elasticsearch and OpenSearch. With over 15 years of experience, Itamar shares his journey from being a power user to building a tool that automates diagnostics, provides actionable recommendations, and helps prevent downtime. Join us as we discuss: š¹ The evolution from manual monitoring to AI-powered SRE. š¹ The pros and cons of open-source vs. proprietary observability solutions. š¹ A deep dive into key metrics that *actually* matter for cluster health. š¹ A live demo of the Pulse platform, showcasing its health assessments, cost optimizer, and advanced dashboards. Whether you're a DevOps professional, a platform engineer, or an SRE, this conversation is packed with expert insights to help you manage your data platforms more efficiently. **Chapters:** (00:00) Introduction to AI for Observability (02:27) Meet Itamar: From Elasticsearch User to Creator of Pulse (07:20) Open Source vs. Vendor Lock-in (11:27) Comparing Search Platforms: Elasticsearch, OpenSearch & ClickHouse (22:50) The Real Pain of Cluster Maintenance & The Cost of Downtime (26:15) What Key Metrics to Monitor in Your Elasticsearch/OpenSearch Cluster (44:44) Demo: Pulse the AI SRE for Elasticsearch & OpenSearch (55:50) Demo: Deep Dive into Advanced Dashboards & The Cost Optimizer (1:06:55) Top Tips for Managing Elasticsearch & OpenSearch Clusters (1:09:10) Final Thoughts #Elasticsearch #OpenSearch #SRE #AIOps #Observability #DevOps #SiteReliability #Monitoring