We helped a major U.S.-based NASDAQ-listed streaming company optimize their AWS cloud costs across multiple teams and products by automating waste identification, managing cost anomalies, and improving security governance for cloud resources.
Problem Statement/Definition:
A leading streaming platform listed on NASDAQ, operating with numerous products and globally distributed teams, faced challenges managing cloud costs and maintaining security across their AWS infrastructure. The company lacked clear visibility into resource ownership and struggled to respond to cost anomalies and security issues, such as expired RDS certificates, in a timely manner. They needed a solution that could automate these processes, reduce cloud waste, and improve governance without adding to their operational overhead.
Proposed Solution & Architecture:
Wiv.ai implemented a custom AWS FinOps automation solution that addressed their need for
centralized management of cloud costs and security across multiple teams and products:
- Cloud Waste Identification Across Products and Teams: We deployed automated detection of cloud waste, including idle EC2 instances, underutilized RDS databases, and excessive S3 storage. The system flagged cost anomalies in real-time and automatically identified the responsible owners and product owners across various teams, streamlining accountability for resource usage.
- Cost Alerts and Anomaly Handling: Our solution integrated an anomaly detection handler that monitored for unusual cost spikes or drops. Upon detection, the system would automatically route alerts to the relevant owner, with built-in escalation processes for product owners to ensure that high-priority issues were addressed. This process reduced delays in handling cost overruns and provided early detection of potential misconfigurations
- Monthly Cost Tracking: The solution provided continuous tracking of cloud costs, offering monthly reports that detailed both individual team expenditures and total organizational cloud costs. These reports highlighted key areas of spending, efficiency opportunities, and cost impact metrics, giving leadership a clear view of cloud cost performance.
- Security and Compliance Management: To address security concerns, such as expired RDS certificates, our system automatically detected certification issues and alerted the responsible team members to take action. This automated monitoring ensured that the company’s cloud infrastructure stayed compliant with AWS security best practices, minimizing the risk of disruptions or security breaches.
Outcomes of Project & Success Metrics:
- Cloud Waste Reduction: Automated waste identification and real-time anomaly handling led to a 40% reduction in unnecessary cloud expenditures within the first six months, streamlining costs across various products and teams.
- Improved Governance: Automatically identifying and assigning ownership for cloud resources and cost anomalies improved accountability, leading to quicker remediation times and a 20% reduction in time spent managing cost-related issues
- Security Compliance: The system’s proactive management of RDS certification expirations and other security concerns ensured 100% compliance with cloud security protocols, minimizing operational risks and ensuring infrastructure stability.
- Cost Impact Measurement: Monthly cost tracking provided insights into spending patterns and efficiency opportunities, helping the company measure the financial impact of their cloud management practices and make data-driven decisions.
Lessons Learned:
Automating cloud waste identification and cost anomaly handling is essential in large, multi-team organizations to maintain operational efficiency. Proactively managing security issues, such as expired certificates, ensures compliance and minimizes risk. Monthly tracking of cloud costs is vital for understanding and measuring the ongoing impact of cloud optimization efforts.