Engineering and Technology | Open Access |

Modeling Oracle Performance Degradation in Production: Predictive Analytics Based on Wait Events, I/O, Memory/Sga/Pga, Background Process, And Storage Parameters

Olga Badiukova , Oracle DBA ETS, LLC "Сі Ес Тек" Kyiv, Ukraine

Abstract

The paper proposes an integrated methodological approach to modeling and forecasting Oracle DBMS performance degradation under high-load industrial environments. Against the backdrop of digital transformation and the consolidation of the AIOps paradigm, there is a shift from reactive monitoring toward proactive intelligent support mechanisms focused on early detection of adverse indicators and incident prevention. A central role is assigned to the integration of multidimensional telemetry: wait events, performance metrics of the input/output (I/O) subsystem, temporal profiles of memory consumption in the SGA and PGA areas, as well as behavioral characteristics of key background processes, including LGWR, DBWR, and ARCH, are taken into account. The proposed methodology for the first time combines classical Oracle diagnostics (AWR, ASH, V$ views) with machine learning and AIOps approaches to predict future degradations before they manifest at the user level.

The architecture of a predictive solution is described, based on a combination of machine learning methods (Random Forest, XGBoost) and deep learning models (LSTM, BERT). This composition ensures robust extraction of latent patterns in operational data streams and enables detection of anomalous states 7–12 minutes before the formation of a critical failure. Substantial attention is devoted to the reconstruction of causal relationships between infrastructural parameters and application-level indicators, which makes it possible to interpret degradation not as a set of disparate symptoms, but as a result of the interaction of resources, internal DBMS mechanisms, and the load profile. Empirical verification on experimental data demonstrates a significant improvement in operational diagnostics, including a MTTD 63%. The material is intended for practicing database operations specialists, designers of high-load solutions, and researchers developing intelligent methods for IT operations.

Keywords

Oracle Database, predictive analytics, machine learning, wait events, SGA, PGA, input/output performance, AIOps, background processes, production environments, performance degradation, anomaly detection, root cause analysis

References

Oracle. (2019, February 4). Oracle Database 19c introduction and overview. Retrieved from: https://www.oracle.com/a/tech/docs/database19c-wp.pdf (date accessed: March 18, 2022).

International Data Corporation. (2022, May 12). Worldwide digital transformation investments forecast to reach $1.8 trillion in 2022, according to new IDC Spending Guide. Business Wire. Retrieved from: https://www.businesswire.com/news/home/20220512005848/en/Worldwide-Digital-Transformation-Investments-Forecast-to-Reach-%241.8-Trillion-in-2022-According-to-New-IDC-Spending-Guide (date accessed: June 3, 2022).

Uptime Institute. (2022, April 29). Annual outage analysis 2022. Uptime Intelligence. Retrieved from: https://intelligence.uptimeinstitute.com/index.php/resource/annual-outage-analysis-2022 (date accessed: June 15, 2022).

Uptime Institute. (2022, April 29). Annual outage analysis 2022. Retrieved from: https://intelligence.uptimeinstitute.com/index.php/resource/annual-outage-analysis-2022 (date accessed: June 15, 2022).

Oracle. (2021). Oracle Database performance method. Retrieved from: https://docs.oracle.com/en/database/oracle/oracle-database/21/tdppt/oracle-database-performance-method.html (date accessed: March 21, 2022).

Research and Markets. (2021). Global AIOps market 2021–2026. Retrieved from: https://www.researchandmarkets.com/reports/5360397/global-aiops-market-2021-2026 (date accessed: July 1, 2022).

Notaro, P., Cardellini, V., Casalicchio, E., & Lo Presti, F. (2021). A survey of AIOps methods for failure management. ACM Transactions on Intelligent Systems and Technology, 12(6), Article 71. https://doi.org/10.1145/3483424

Pang, G., Shen, C., Cao, L., & van den Hengel, A. (2021). Deep learning for anomaly detection: A review. ACM Computing Surveys, 54(2), Article 38. https://doi.org/10.1145/3439950

Soldani, J., Brogi, A., & Wang, J. (2021). Anomaly detection and failure root cause analysis in (micro)service-based cloud applications: A survey. arXiv. https://doi.org/10.48550/arXiv.2105.12378

Marcus, R., Negi, P., Mao, H., Tatbul, N., Alizadeh, M., & Kraska, T. (2021). Bao: Making learned query optimization practical. In Proceedings of the 2021 International Conference on Management of Data (pp. 1275–1288). https://doi.org/10.1145/3448016.3452838

Van Aken, D., Yang, D., Brillard, S., Fiorino, A., Zhang, B., Pavlo, A., & Gordon, G. J. (2021). An inquiry into machine learning-based automatic configuration tuning services on real-world database management systems. Proceedings of the VLDB Endowment, 14(7), 1241–1253. https://doi.org/10.14778/3450980.3450992

Nordal, H., & El-Thalji, I. (2021). Assessing the technical specifications of predictive maintenance: A case study of centrifugal compressor. Applied Sciences, 11(4), Article 1527. https://doi.org/10.3390/app11041527

Oracle. (2021). Descriptions of wait events. Retrieved from: https://docs.oracle.com/en/database/oracle/oracle-database/21/refrn/descriptions-of-wait-events.html (date accessed: March 29, 2022).

Oracle. (2021). Automatic performance diagnostics. Retrieved from: https://docs.oracle.com/en/database/oracle/oracle-database/21/tgdba/automatic-performance-diagnostics.html (date accessed: April 4, 2022).

Oracle. (2021). Gathering optimizer statistics. Retrieved from: https://docs.oracle.com/en/database/oracle/oracle-database/21/tgsql/gathering-optimizer-statistics.html (date accessed: April 11, 2022).

Pavlo, A., Butrovich, M., Joshi, A., Ma, L., Menon, P., Misra, P., Ryan, G., Stonebraker, M., & Zdonik, S. (2021). Make your database system dream of electric sheep: Towards self-driving operation. Proceedings of the VLDB Endowment, 14(12), 3211–3221. https://doi.org/10.14778/3476311.3476411

Oracle. (2021). Automatic database performance monitoring. Retrieved from: https://docs.oracle.com/en/database/oracle/oracle-database/21/tdppt/automatic-database-performance-monitoring.html (date accessed: April 18, 2022).

Datadog. (n.d.). Oracle database. Retrieved from: https://docs.datadoghq.com/integrations/oracle/ (date accessed: April 26, 2022).

Guo, H., Yuan, S., & Wu, X. (2021). LogBERT: Log anomaly detection via BERT. In 2021 International Joint Conference on Neural Networks (IJCNN) (pp. 1–8). https://doi.org/10.1109/IJCNN52387.2021.9534113

Oracle. (2022, April 29). 2-steps OCI Logging and OCI Log Analytics enablement for OCI services. Retrieved from: https://docs.oracle.com/iaas/releasenotes/changes/35ef3492-4164-4e38-b9c4-c80bb15a589e/index.htm (date accessed: May 2, 2022).

Li, G., Zhou, X., Sun, J., & Aken, D. V. (2021). Machine learning for databases. Proceedings of the VLDB Endowment, 14(12), 3190–3193. https://doi.org/10.14778/3476311.3476405

Oracle. (2021). Database performance tuning guide. Retrieved from: https://docs.oracle.com/en/database/oracle/oracle-database/21/tgdba/database-performance-tuning-guide.pdf (date accessed: May 9, 2022).

Li, G., Zhou, X., Sun, J., Yu, X., Han, Y., Jin, L., Li, W., Wang, T., & Li, S. (2021). openGauss: An autonomous database system. Proceedings of the VLDB Endowment, 14(12), 3028–3041. https://doi.org/10.14778/3476311.3476380

Oracle. (2021). Real application clusters administration and deployment guide. Retrieved from: https://docs.oracle.com/en/database/oracle/oracle-database/21/racad/real-application-clusters-administration-and-deployment-guide.pdf (date accessed: May 16, 2022).

Zhou, X., Jin, L., Sun, J., Zhao, X., Wang, T., & Li, S. (2021). DBMind: A self-driving platform in openGauss. Proceedings of the VLDB Endowment, 14(12), 2743–2746. https://doi.org/10.14778/3476311.3476334

Oracle. (2021). V$EVENT_NAME. Retrieved from: https://docs.oracle.com/en/database/oracle/oracle-database/21/refrn/V-EVENT_NAME.html (date accessed: May 23, 2022).

Oracle. (2021). Configuring options for optimizer statistics gathering. Retrieved from: https://docs.oracle.com/en/database/oracle/oracle-database/21/tgsql/options-for-optimizer-statistics-gathering.html (date accessed: May 31, 2022).

Oracle. (2021). Monitoring real-time database performance. Retrieved from: https://docs.oracle.com/en/database/oracle/oracle-database/21/tdppt/monitoring-real-time-database-performance.html (date accessed: June 6, 2022).

Oracle. (2021). Managing memory. Retrieved from: https://docs.oracle.com/en/database/oracle/oracle-database/21/admin/managing-memory.html (date accessed: June 13, 2022).

Oracle. (2021). PGA_AGGREGATE_TARGET. Retrieved from: https://docs.oracle.com/en/database/oracle/oracle-database/21/refrn/PGA_AGGREGATE_TARGET.html (date accessed: June 20, 2022).

Oracle. (2021). Oracle Database concepts. Retrieved from: https://docs.oracle.com/en/database/oracle/oracle-database/21/cncpt/database-concepts.pdf (date accessed: June 27, 2022).

Oracle. (2021). Oracle Database reference. Retrieved from: https://docs.oracle.com/en/database/oracle/oracle-database/21/refrn/database-reference.pdf (date accessed: July 4, 2022).

Oracle. (2021). Database performance tuning guide. Retrieved from: https://docs.oracle.com/en/database/oracle/oracle-database/21/tgdba/database-performance-tuning-guide.pdf (date accessed: July 8, 2022).

Oracle. (2019). Thinking autonomous: What’s your business’s data worth? Retrieved from: https://www.oracle.com/a/ocom/docs/thinking-autonomous-business-data-worth.pdf (date accessed: July 12, 2022).

Oracle. (2022, July). Using Oracle Analytics Day by Day. Retrieved from: https://docs.oracle.com/en/cloud/paas/analytics-cloud/biday/index.html (date accessed: July 18, 2022).

Oracle. (2020, October 20). Ops Insights. Retrieved from: https://docs.oracle.com/iaas/releasenotes/services/operations-insights/ (date accessed: July 25, 2022).

Download and View Statistics

Views: 0   |   Downloads: 0

Copyright License

Download Citations

How to Cite

Badiukova, O. (2022). Modeling Oracle Performance Degradation in Production: Predictive Analytics Based on Wait Events, I/O, Memory/Sga/Pga, Background Process, And Storage Parameters. The American Journal of Engineering and Technology, 4(09), 17–27. Retrieved from https://theamericanjournals.com/index.php/tajet/article/view/7651