Architectural Approaches to Scaling Distributed Microservice Systems in The Cloud
Kumar Avinash , Software Development Engineer, Google Seattle, USAAbstract
This article explores how distributed microservice systems in the cloud are scaled. The goal is to piece together, from scattered research, working solutions and what might still be experimental. The focus is on design and deployment strategies that aim for reliability, elasticity, and cost-effective growth. Open-access, peer-reviewed papers published since 2021 were reviewed with a special emphasis placed on those with empirical tests, diagrams, or case studies to see how ideas play out in practice. Across the papers certain themes keep reappearing. Infrastructure work revolves around horizontal and vertical scaling, while orchestration and autoscaling — Kubernetes HPA/VPA, serverless computing, various service meshes — are interpreted as part of the broader field. The unique contribution is the attempt to frame old and new together — to see classical ideas such as service modularity alongside hybrid autoscalers, energy-aware verification, probabilistic checks. In some cases, these tactics reinforce one another; in others they collide, or create unexpected trade-offs. Observing them in the same frame highlights that scaling microservices is as an ongoing experiment, where established patterns and cutting-edge proposals coexist. Taken together, the available studies do not present a single blueprint but instead sketch what looks like a layered approach. In many cases services are kept as stateless and decoupled as possible; in others that ideal is only partly met. Horizontal and vertical scaling appear side by side — sometimes combined in the same system — with their usefulness varying by workload. Orchestration policies are often automated, yet the literature also warns that energy use and security concerns grow quietly in the background as systems expand and, if left unmanaged, can erode the gains of scaling. This article will provide value to cloud architects, DevOps engineers, developers, and researchers interested in gaining awareness of current practices as well as possible venues of where the field might be heading next.
Keywords
Microservices, Cloud computing, Horizontal scaling, Vertical scaling, Kubernetes, Autoscaling, Serverless computing, Reinforcement learning, Architectural design, Scalability
References
Alharthi, S., Alshamsi, A., Alseiari, A., & Alwarafy, A. (2024). Auto-Scaling Techniques in Cloud Computing: Issues and Research Directions. Sensors, 24(17), 5551. https://doi.org/10.3390/s24175551
Berardi, D., Giallorenzo, S., Mauro, J., Melis, A., Montesi, F., & Prandini, M. (2022). Microservice security: a systematic literature review. PeerJ Computer Science, 8, e779. https://doi.org/10.7717/peerj-cs.779
Blinowski, G. J., Ojdowska, A., & Przybyłek, A. (2022). Monolithic vs. microservice architecture: A performance and scalability evaluation. IEEE Access, 10, 20357–20374. https://doi.org/10.1109/ACCESS.2022.3152803
Chavan, A. (2023). Managing Scalability and Cost in Microservices Architecture – Balancing Infinite Scalability with Financial Constraints. Journal of Artificial Intelligence & Cloud Computing, 2(4), 1–14. https://doi.org/10.47363/JMHC/2023(5)E102
Domakonda, D. (2025). Secure and Scalable Microservices Architecture: Principles, Benefits, and Challenges. International Journal of Scientific Research in CSEIT, 11(2), 1897–1902. https://doi.org/10.32628/CSEIT23112569
Filippone, G., Pompilio, C., Autili, M., & Tivoli, M. (2022). An architectural style for scalable choreography-based microservice-oriented distributed systems. Computing, 105(9), 1933–1956. https://doi.org/10.32628/CSWEIT23112569
Agos Jawaddi, S.N., Ismail, A., Sulaiman, M.S. et al. Analyzing Energy-Efficient and Kubernetes-Based Autoscaling of Microservices Using Probabilistic Model Checking. J Grid Computing, 23, 3 (2025). https://doi.org/10.1007/s10723-024-09789-9
Dileep Kumar Pandiya. (2021). Scalability Patterns for Microservices Architecture. Educational Administration: Theory and Practice, 27(3), 1178–1183. https://doi.org/10.53555/kuey.v27i3.6897
Saurav Sharma. (2025). The Impact of Microservices Architecture on System Scalability. American Scientific Research Journal for Engineering, Technology, and Sciences, 102(1), 140-148. https://asrjetsjournal.org/American_Scientific_Journal/article/view/11677
Xu, M., Song, C., Ilager, S., Gill, S. S., Zhao, J., Ye, K., & Xu, C. (2022). CoScal: Multi-faceted scaling of microservices with reinforcement learning. IEEE Transactions on Network and Service Management, 19(4), 3995–4009.https://doi.org/10.1109/TNSM.2022.3210211
Article Statistics
Copyright License
Copyright (c) 2025 Kumar Avinash

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors retain the copyright of their manuscripts, and all Open Access articles are disseminated under the terms of the Creative Commons Attribution License 4.0 (CC-BY), which licenses unrestricted use, distribution, and reproduction in any medium, provided that the original work is appropriately cited. The use of general descriptive names, trade names, trademarks, and so forth in this publication, even if not specifically identified, does not imply that these names are not protected by the relevant laws and regulations.


Engineering and Technology
| Open Access |
DOI: