Case Study

How Rapyd solved observability gaps to cut Kubernetes costs by 40%

Client
Rapyd
Industry
Financial Services & Insurance

Region
EMEA, UKI
Country
United Kingdom
Technologies
Amazon Elastic Kubernetes Service (EKS), AWS

Up to 40%

Decrease in EKS cost

Meet Rapyd

Rapyd is a leading fintech company that unifies global payment technologies and networks on a single platform. The company enables businesses to accept, process, and disburse payments in more than 100 countries and to send payouts in over 190 countries.

By integrating a wide variety of payment methods with card issuance (both virtual and physical), foreign exchange, and money management capabilities, the Rapyd platform enables fast, secure, and seamless global commerce, providing a foundation for innovation and scale in fintech.

The Challenge

Rapyd set out to migrate from AWS EC2 to EKS to improve performance and reduce cloud spend, without compromising the resilience essential to fintech operations. Kubernetes was already central to Rapyd’s infrastructure, spanning more than 15 clusters and enabling the speed and agility needed to innovate at scale.

To ensure a smooth migration, the CI/CD, Infrastructure, and SRE teams initially over-provisioned resources across nodes, pods, and clusters. While this safeguarded stability, it also limited the elasticity and cost-efficiency Kubernetes could offer. Existing observability tools lacked the depth to support precise optimization, leaving potential savings on the table.

To fully realize their performance and efficiency goals, Rapyd needed a solution that could detect and right-size Kubernetes resource consumption with actionable, granular insights.

The Solution

Under the leadership of DevOps Team Leader Boris Isakov, Rapyd turned to PerfectScale by DoiT for a more innovative and scalable approach to Kubernetes optimization.

Smarter Optimization, Built-In

PerfectScale was introduced early in the service lifecycle: each new service ran for a few days so that its real usage could be observed, and only then was optimization initiated. This workflow, combined with PerfectScale’s tools such as PodFit for workload rightsizing and InfraFit for node optimization, enabled Rapyd to quickly achieve better cloud resource utilization.

“The initial implementation of PerfectScale to our global environments was a revelation. It exposed the vast extent of our resource waste, significant deviations from Kubernetes best practices,” said Boris Isakov. “With PerfectScale, we are now on a path to efficient scaling and optimization.”

AI-Driven Performance Gains

Beyond cost savings, PerfectScale enhanced the performance and resilience of Rapyd’s platform. Data-driven and infrastructure-focused recommendations empowered the team to fine-tune resource requests and limits to improve performance while reducing spend.
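
To make the idea of tuning requests and limits from usage data concrete, here is a minimal Python sketch. It is not PerfectScale's algorithm, which the case study does not describe. The policy (95th-percentile CPU request, peak memory request, memory limit with 20% headroom), the function names, and the sample numbers are all assumptions chosen for illustration.

```python
import math
from dataclasses import dataclass


@dataclass
class Recommendation:
    cpu_request_millicores: int
    memory_request_mib: int
    memory_limit_mib: int


def percentile(samples, pct):
    """Nearest-rank percentile (pct is an integer 1..100) of a non-empty list."""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def recommend(cpu_millicores, memory_mib, headroom_pct=20):
    """Suggest requests and limits from observed usage samples.

    Hypothetical policy, for illustration only:
      - CPU request    = 95th percentile of observed CPU usage
      - memory request = peak observed memory usage
      - memory limit   = peak plus headroom, so a brief spike
                         is less likely to end in an OOM kill
    """
    peak_memory = max(memory_mib)
    return Recommendation(
        cpu_request_millicores=math.ceil(percentile(cpu_millicores, 95)),
        memory_request_mib=math.ceil(peak_memory),
        memory_limit_mib=math.ceil(peak_memory * (100 + headroom_pct) / 100),
    )


# Illustrative samples, not Rapyd data.
cpu_samples = list(range(100, 300, 10))   # 20 samples: 100, 110, ..., 290
memory_samples = [300, 310, 350, 340]
print(recommend(cpu_samples, memory_samples))
```

A service provisioned well above these figures is a candidate for right-sizing. In practice the percentile and headroom would be tuned per workload and by how much risk the team is willing to accept.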

“We understood that it’s not all about the cost. It’s about the performance of the cluster,” Boris shared. “The cost is going down, and performance is getting better.”

PerfectScale’s advanced alerting system proactively identified issues before they impacted end users. Integrated with Slack, the alerts were categorized by severity and highlighted critical problems, like out-of-memory errors, that were missed by other observability tools.
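
As a rough illustration of severity-based alerting, the Python sketch below classifies workload events and formats them as chat messages. It does not reflect PerfectScale's implementation. The severity rules, the WorkloadEvent type, and the workload names are hypothetical; the only external conventions assumed are the Kubernetes termination reason "OOMKilled" and the plain "text" field accepted by Slack incoming webhooks.

```python
import json
from dataclasses import dataclass
from enum import IntEnum


class Severity(IntEnum):
    INFO = 1
    WARNING = 2
    CRITICAL = 3


@dataclass
class WorkloadEvent:
    workload: str           # hypothetical workload name
    reason: str             # e.g. a container termination reason
    restart_count: int = 0


def classify(event):
    """Map an event to a severity (illustrative rules, not a product's)."""
    if event.reason == "OOMKilled":
        return Severity.CRITICAL
    if event.restart_count >= 3:
        return Severity.WARNING
    return Severity.INFO


def triage(events):
    """Return events ordered from most to least severe."""
    return sorted(events, key=classify, reverse=True)


def to_slack_payload(event):
    """Build the JSON body for a Slack incoming webhook (not sent here)."""
    text = (
        f"[{classify(event).name}] {event.workload}: "
        f"{event.reason} (restarts: {event.restart_count})"
    )
    return json.dumps({"text": text})


events = [
    WorkloadEvent("api", "Completed"),
    WorkloadEvent("ledger", "OOMKilled", 1),
    WorkloadEvent("worker", "Error", 4),
]
for event in triage(events):
    print(to_slack_payload(event))
```

Sorting by severity puts out-of-memory kills at the top of the channel, which is the kind of prioritization the paragraph above describes.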

“We got alerts from PerfectScale that we did not get from our other solutions. These allowed us to address issues proactively across our entire environment – averting issues before they affected our customers.”

This proactive approach ensured business continuity in a high-stakes industry where milliseconds matter.

A Trusted Partnership

Throughout the process, the PerfectScale team delivered hands-on guidance and deep Kubernetes expertise that further extended the capabilities of their technology. This support was key in helping Rapyd adopt and scale best practices across teams.

“The PerfectScale team was very professional, helped explain the features, and helped guide us through the optimization process.”

The Results

Rapyd’s collaboration with PerfectScale is driving a smarter, more resilient Kubernetes strategy that balances performance, cost, and scalability.

Key Outcomes:

  • 35–40% reduction in EKS costs
    Optimization led to significant savings through more intelligent resource allocation.
  • Workload rightsizing & node optimization
    PodFit and InfraFit helped Rapyd fine-tune clusters with precision.
  • Improved platform performance
    Adjustments based on PerfectScale’s insights enhanced service reliability.
  • Advanced alerting system
    Severity-based alerts integrated with Slack helped the team resolve critical issues faster, before they affected service quality.

“The initial implementation of PerfectScale to our global environments was a revelation. It exposed the vast extent of our resource waste – including excessive provisioning across nodes, pods, and clusters. These are significant deviations from Kubernetes best practices. With PerfectScale, we are now on a path to efficient scaling and optimization.”
Boris Isakov, DevOps Team Leader at Rapyd
