Enterprise Cloud Cost Management - Part 3
In this final part of our series, we explore how to optimize costs, infrastructures, and billing by effectively combining two approaches: manual action by app teams (with in-depth operational visibility) and automatic resource cleanup (when no manual action is taken).
-
-
-
-
URL copied!
In Part 3 of our blog series, we defined a comprehensive cost management framework for enterprise clouds and took a deep dive into initial planning processes and operational visibility. In this final part of our series, we explore how to optimize costs, infrastructures, and billing by effectively combinin two approaches: manual action by app teams (with in-depth operational visibility) and automatic resource cleanup (when no manual action is taken).
Cost Optimization
Stakeholders should be given multiple opportunities to take action or register exceptions for their apps. If no stakeholder action is taken on dev/test environments, then recommendations can be automatically actioned. These two approaches, used in combination, will lead to better awareness and accountability.
Infrastructure Optimization
This section discusses various opportunities to optimize the cloud infrastructure landscape to fit the given utilization. This follows the cloud’s tenet on provisioning only what you need, and paying for only what you use.
Instance Rightsizing
Upsize or downsize the instances based on actual utilization trends, so that the peak average utilization hovers around the optimal (70%-80%) range.
Cleanup of Unused Resources
Remove any orphaned resources that are no longer being used. Some of these include:
- Unattached disks (delete)
- Orphaned snapshots (delete)
- Unallocated IPs (release)
- Unused Storage (recommend moving to Glacier/ColdLine)
Cleanup of Underutilized Resources
Identify and recommend clean up of resources that have been provisioned but are not being actively used. A common example is dev environments that were not deleted after testing. Metrics that can be used to identify these type of resources are:
- Minimal or no CPU utilization
- Minimal or no disk activity
- Minimal or no IO activity
Instance Scheduling
Turn resources on and off based on when they are needed, rather than running them all the time. Considerations include:
- Based on spikes in usages patterns
- Instance scheduling for dev/test servers that don't need to be run 24/7
Instance Modernization
Cloud providers regularly release new versions of their instance families. These are based on the latest hardware and are often faster and cheaper than the older instance families. Modernizing instance families to the latest versions can optimize both performance and costs.
Cleanup of Other Cloud Services
For managed services provided by the cloud provider, metrics can be used to identify if services are being used, and released if they are not needed.
Billing Optimization
To optimize billing processes, (1) leverage reserved/committed use discounts in the production environment and (2) enable committed use and spot/pre-emptible instances in the dev/test environment. This allows users to fully utilize the usage discounts provided by cloud platforms. Some of these discount categories make sense for specific application environments. Details are below:
Production Environment
- Start with 30% - 40% servers to achieve immediate cost savings before the app stabilizes in the cloud.
- End with 100% servers after the app stabilizes in the cloud.
Dev/Test Environment
- Start with 10% servers that need to run 24x7 (e.g., build servers).
- End with 100% servers after the app stabilizes in the cloud.
- Use spot/pre-emptible instances for environments that can be torn down and recreated.
- Integrate the use of spot/pre-emptible instances with DevOps build processes.
Automation Approach & Opportunities
If automation tools are not available, then you should build them in-house. Start small and grow the automation catalogue. Remember, no single tool will solve all cost management problems — build and integrate tools as services. Below are areas in which you can apply automation, along with some tips on how to do it.
Tagging and Labeling
- Report on tag non-conformance
- Automatically add certain missing tags such as “created-by” (use to track creators of orphaned resources)
- Create and maintain virtual tags for cloud services that don't yet support tags in the inventory management system
Reporting
- Send daily reports directly to stakeholders on costs, projections, violations, and non-conformance
Resource Scheduling
- Detect usage patterns and suggest server start/shut down schedules (to be used only during their usage periods)
- Inform stakeholders and automatically implement scheduling for dev/test environments
Resource Cleanup
- Automatically shut down instances/resources that don't have the required tags
- Automatically shut down instances/resources that are not being used
- Recommend and implement auto instance scheduling based on usage patterns
- Remove unattached volumes and old snapshots (unless tagged)
- Clean up other resources
Reservation Planning (committed use)
- Track usage patterns and recommend instances for committed use
- Track usage commitments and renew automatically (inform stakeholders of reservation expiry)
- Track total savings and ROI for committed use discounts
Instance Modernization
- Recommend instances that can be modernized to new instance types (i.e., cheaper and more efficient)
Spot/Pre-Emptible Instances
- Track CPU load patterns for dev/test environments and recommend spot/pre-emptible instances
Tools Reference
The following table shows a representative list of tools that can be used for cost management at the various stages of cloud adoption. This is not an exhaustive list, as there are other tools in the market that fulfill niche requirements.
Concern |
AWS | Azure | GCP | Third Party/Custom |
Initial Sizing | •AWS Cost calculator | •Azure Pricing Calculator | •GCP Pricing Calculator | •GL’s custom tools |
Operational Visibility and Forecasting | •Trusted Advisor
•Tags |
•Azure Advisor
|
•GCP Labels | •Cloudability |
Cost Optimization | •Trusted Advisor | •Azure Automation | •Google Cloud Functions |
Conclusion
We hope that this blog series has helped you start thinking about cost management holistically. The information given in this blog is not limited to any one cloud, either — these principles can be applied to all public clouds. With private clouds, some of these principles can be used to optimize resource densification, rather than the direct cost itself. If you would like more information about how GlobalLogic can help your business with cloud adoption, please email us at practice-cloud@globallogic.com.
Top Insights
Best practices for selecting a software engineering partner
SecurityDigital TransformationDevOpsCloudMediaMy Intro to the Amazing Partnership Between the...
Experience DesignPerspectiveCommunicationsMediaTechnologyAdaptive and Intuitive Design: Disrupting Sports Broadcasting
Experience DesignSecurityMobilityDigital TransformationCloudBig Data & AnalyticsMediaLet’s Work Together
Related Content
Unlock the Power of the Intelligent Healthcare Ecosystem
Welcome to the future of healthcare The healthcare industry is on the cusp of a revolutionary transformation. As we move beyond digital connectivity and data integration, the next decade will be defined by the emergence of the Intelligent Healthcare Ecosystem. This is more than a technological shift—it's a fundamental change in how we deliver, experience, … Continue reading Enterprise Cloud Cost Management – Part 3 →
Learn More
Power & Utilities – Changing Landscape
Power & utilities industry have gone through a supplied, and utilized, but it also poses threats to unparalleled shift in recent years, fueled by rapid established, legacy business models and regulatory technological advances, rising consumer demands, frameworks.
Learn More
The Chromatic Symphony: Unveiling the Palette of Productivity
We observe how color affects our ability to capture information quickly, laying the groundwork for understanding the importance of color in our environment. The blog concludes by posing intriguing questions about the prevalence of specific colors in corporate branding and its connection to the psychology of color theory.
Learn More
A Lakehouse Implementation using Delta Lake
A data lake is a centralized repository that enables a cost–effective storage of large volumes of data that provides a single source of truth (SOT). However, organizations face numerous challenges when using data lakes built on top of cloud-native storage solutions.
Learn More
Luxury Fashion Industry
This blog is an attempt to understand the difference in approach that a technology partner like GlobalLogic should consider while providing tech solutions to Luxury fashion customers. Let us first understand the Global market, challenges and opportunities for this market segment.
Learn More
Share this page:
-
-
-
-
URL copied!