Articles | Open Access | DOI: https://doi.org/10.37547/tajet/Volume07Issue04-15

Conceptual Approaches to Optimizing ETL Processes in Distributed Systems

Yura Abharian , Software Engineer at SeekOut Bellevue, WA, United States

Abstract

This article explores conceptual approaches to optimizing ETL processes in distributed systems using a hybrid algorithmic solution based on the integration of Grey Wolf Optimizer (GWO) and Tabu Search (TS) methods. The study analyzes the characteristics of ETL under cloud-based architectures and identifies key challenges, such as high computational complexity, data redundancy, and the difficulty of clustering when handling large volumes of information. The results confirm the hypothesis that the synergy between GWO and TS algorithms leads to more efficient ETL processes, which is especially relevant for modern distributed systems and cloud computing environments. The article will be of interest to other researchers and graduate students specializing in distributed computing systems, big data processing, and ETL process optimization, as it presents an analysis of methodological approaches aimed at improving data integration efficiency within scalable architectures. The findings are also valuable for IT practitioners, enterprise system architects, and developers seeking to integrate advanced ETL optimization methods into modern information systems to enhance their performance and resilience.

Keywords

ETL, optimization, distributed systems, cloud computing, Grey Wolf Optimizer

References

Dinesh L., Devi K. G. An efficient hybrid optimization of ETL process in data warehouse of cloud architecture //Journal of Cloud Computing. – 2024. – Vol. 13 (1). – pp. 12.

Li S. et al. Hybrid method with parallel-factor theory, a support vector machine, and particle filter optimization for intelligent machinery failure identification //Machines. – 2023. – Vol. 11 (8). – pp. 837.

Peng Y., Zhao Y., Hu J. On the role of community structure in evolution of opinion formation: A new bounded confidence opinion dynamics //Information Sciences. – 2023. – Vol. 621. – pp. 672-690.

Zhao K. et al. Multi-scale integrated deep self-attention network for predicting remaining useful life of aero-engine //Engineering Applications of Artificial Intelligence. – 2023. – Vol. 120. – pp. 1-10.

Kossmann F, Wu Z, Lai E, Tatbul N, Cao L, Kraska T, Madden S.Extract-transform-load for video streams. Proc VLDB Endow.- 2023.- Vol. - 16(9). – pp. 2302–2315.

Zhang J. et al. Forecast-assisted service function chain dynamic deployment for SDN/NFV-enabled cloud management systems //IEEE Systems Journal. – 2023. – Vol. 17 (3). – pp. 4371-4382.

Fan W., Yang L., Bouguila N. Unsupervised grouped axial data modeling via hierarchical Bayesian nonparametric models with Watson distributions //IEEE Transactions on Pattern Analysis and Machine Intelligence. – 2021. – Vol. 44 (12). – pp. 9654-9668.

Zhang X. et al. A hybrid-convolution spatial–temporal recurrent network for traffic flow prediction //The Computer Journal. – 2024. – Vol. 67 (1). – pp. 236-252.

Wang Y., Han X., Jin S. MAP based modeling method and performance study of a task offloading scheme with time-correlated traffic and VM repair in MEC systems //Wireless Networks. – 2023. – Vol. 29 (1). – pp. 47-68.

Guo F. et al. Path extension similarity link prediction method based on matrix algebra in directed networks //Computer Communications. – 2022. – Vol. 187. – pp. 83-92.

Article Statistics

Copyright License

Download Citations

How to Cite

Yura Abharian. (2025). Conceptual Approaches to Optimizing ETL Processes in Distributed Systems. The American Journal of Engineering and Technology, 7(04), 113–118. https://doi.org/10.37547/tajet/Volume07Issue04-15