Engineering and Technology | Open Access | DOI: https://doi.org/10.37547/tajet/Volume07Issue10-12

Architectural Approaches to Scaling Distributed Microservice Systems in The Cloud

Kumar Avinash , Software Development Engineer, Google Seattle, USA

Abstract

This article explores how distributed microservice systems in the cloud are scaled. The goal is to piece together, from scattered research, working solutions and what might still be experimental. The focus is on design and deployment strategies that aim for reliability, elasticity, and cost-effective growth. Open-access, peer-reviewed papers published since 2021 were reviewed with a special emphasis placed on those with empirical tests, diagrams, or case studies to see how ideas play out in practice. Across the papers certain themes keep reappearing. Infrastructure work revolves around horizontal and vertical scaling, while orchestration and autoscaling — Kubernetes HPA/VPA, serverless computing, various service meshes — are interpreted as part of the broader field. The unique contribution is the attempt to frame old and new together — to see classical ideas such as service modularity alongside hybrid autoscalers, energy-aware verification, probabilistic checks.  In some cases, these tactics reinforce one another; in others they collide, or create unexpected trade-offs. Observing them in the same frame highlights that scaling microservices is as an ongoing experiment, where established patterns and cutting-edge proposals coexist. Taken together, the available studies do not present a single blueprint but instead sketch what looks like a layered approach. In many cases services are kept as stateless and decoupled as possible; in others that ideal is only partly met. Horizontal and vertical scaling appear side by side — sometimes combined in the same system — with their usefulness varying by workload. Orchestration policies are often automated, yet the literature also warns that energy use and security concerns grow quietly in the background as systems expand and, if left unmanaged, can erode the gains of scaling. This article will provide value to cloud architects, DevOps engineers, developers, and researchers interested in gaining awareness of current practices as well as possible venues of where the field might be heading next.

Keywords

Microservices, Cloud computing, Horizontal scaling, Vertical scaling, Kubernetes, Autoscaling, Serverless computing, Reinforcement learning, Architectural design, Scalability

References

Alharthi, S., Alshamsi, A., Alseiari, A., & Alwarafy, A. (2024). Auto-Scaling Techniques in Cloud Computing: Issues and Research Directions. Sensors, 24(17), 5551. https://doi.org/10.3390/s24175551

Berardi, D., Giallorenzo, S., Mauro, J., Melis, A., Montesi, F., & Prandini, M. (2022). Microservice security: a systematic literature review. PeerJ Computer Science, 8, e779. https://doi.org/10.7717/peerj-cs.779

Blinowski, G. J., Ojdowska, A., & Przybyłek, A. (2022). Monolithic vs. microservice architecture: A performance and scalability evaluation. IEEE Access, 10, 20357–20374. https://doi.org/10.1109/ACCESS.2022.3152803

Chavan, A. (2023). Managing Scalability and Cost in Microservices Architecture – Balancing Infinite Scalability with Financial Constraints. Journal of Artificial Intelligence & Cloud Computing, 2(4), 1–14. https://doi.org/10.47363/JMHC/2023(5)E102

Domakonda, D. (2025). Secure and Scalable Microservices Architecture: Principles, Benefits, and Challenges. International Journal of Scientific Research in CSEIT, 11(2), 1897–1902. https://doi.org/10.32628/CSEIT23112569

Filippone, G., Pompilio, C., Autili, M., & Tivoli, M. (2022). An architectural style for scalable choreography-based microservice-oriented distributed systems. Computing, 105(9), 1933–1956. https://doi.org/10.32628/CSWEIT23112569

Agos Jawaddi, S.N., Ismail, A., Sulaiman, M.S. et al. Analyzing Energy-Efficient and Kubernetes-Based Autoscaling of Microservices Using Probabilistic Model Checking. J Grid Computing, 23, 3 (2025). https://doi.org/10.1007/s10723-024-09789-9

Dileep Kumar Pandiya. (2021). Scalability Patterns for Microservices Architecture. Educational Administration: Theory and Practice, 27(3), 1178–1183. https://doi.org/10.53555/kuey.v27i3.6897

Saurav Sharma. (2025). The Impact of Microservices Architecture on System Scalability. American Scientific Research Journal for Engineering, Technology, and Sciences, 102(1), 140-148. https://asrjetsjournal.org/American_Scientific_Journal/article/view/11677

Xu, M., Song, C., Ilager, S., Gill, S. S., Zhao, J., Ye, K., & Xu, C. (2022). CoScal: Multi-faceted scaling of microservices with reinforcement learning. IEEE Transactions on Network and Service Management, 19(4), 3995–4009.https://doi.org/10.1109/TNSM.2022.3210211

Article Statistics

Copyright License

Download Citations

How to Cite

Kumar Avinash. (2025). Architectural Approaches to Scaling Distributed Microservice Systems in The Cloud. The American Journal of Engineering and Technology, 7(10), 90–98. https://doi.org/10.37547/tajet/Volume07Issue10-12