Machine Learning and Decision Support in The Exploitation of Mdx Queries: Case of Banking Products and Services
Isi Artikel Utama
Abstrak
The increasing complexity of analytical workloads in banking systems challenges traditional query scheduling mechanisms, particularly in OLAP environments where MDX queries exhibit heterogeneous computational costs and business criticality. This study proposes a semantic-aware scheduling framework based on incremental machine learning to dynamically prioritize MDX queries in real time. The approach models query prioritization as a classification problem, integrating both technical features and business-driven criticality.
A Hoeffding Tree algorithm is employed to enable continuous learning from streaming query data without requiring retraining. The model is evaluated using a simulated dataset of 10,000 MDX queries reflecting realistic banking scenarios, including risk monitoring and regulatory reporting. Experimental results show that the proposed approach achieves a classification accuracy of 94.1% and significantly reduces processing latency for high-priority queries, with improvements reaching 42.4% compared to FIFO scheduling. The inference overhead remains negligible, ensuring compatibility with real-time system constraints.
These findings demonstrate the effectiveness of integrating incremental learning into query scheduling and highlight the potential of semantic-driven optimization in decision support systems. The study contributes to bridging the gap between learned database systems and business-aware query management.
Rincian Artikel
Referensi
[1] Abelló, A., Romero, O., & Vassiliadis, P. (2021). Evolution of OLAP: From data cubes to data lakes. Foundations and Trends in Databases, 11(1–2), 1–189.
[2] Arnaboldi, F., et al. (2021). Data Quality in Banking Risk Reporting: The Role of BCBS 239. Journal of Financial Regulation and Compliance.
[3] Bahri, S., et al. (2021). Machine Learning for the Internet of Things: Data Streams and Real-time Analytics. IEEE Internet of Things Journal, 8(20).
[4] Bifet, A., Holmes, G., Kirkby, R., & Pfahringer, B. (2010). MOA: Massive Online Analysis. Journal of Machine Learning Research, 11, 1601–1604.
[5] Bifet, A., Read, J., Žliobaitė, I., & Pfahringer, B. (2023). Machine learning for data streams: With practical examples in MOA. MIT Press.
[6] Domingos, P., & Hulten, G. (2000). Mining high-speed data streams. In Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 71–80).
[7] Gama, J., Žliobaitė, I., Bifet, A., Pechenizkiy, M., & Bouchachia, A. (2014). A survey on concept drift adaptation. ACM Computing Surveys, 46(4), 44.
[8] Gomes, H. M., Read, J., Bifet, A., Barddal, J. P., & Gama, J. (2023). Machine learning for streaming data: State of the art, challenges, and opportunities. ACM Computing Surveys, 55(13), 1–36
[9} Hammami, M., et al. (2021). Adaptive Hoeffding Tree for Non-Stationary Data Stream Classification. International Journal of Interactive Multimedia and Artificial Intelligence, 6(5).
.
[10] Krishnan, K., Rengaswamy, S., Agrawal, D., & El Abbadi, A. (2019). Prediction-based concurrency control. In Proceedings of the ACM SIGMOD Conference.
[11} Li, J., et al. (2024). Dynamic Resource Allocation for Cloud OLAP Systems using Deep Reinforcement Learning. Information Sciences, 642, 119-135.
[12] Marcus, R., & Papaemmanouil, O. (2021). Bao: Making learned query optimization practical. In Proceedings of the 2021 International Conference on Management of Data (SIGMOD) (pp. 1275–1288).
[13] Vaisman, A., & Zimányi, E. (2022). Data warehouse systems: Design and implementation (2nd ed.). Springer.
[14] Zhou, X., Chawla, S., Jin, W., & Li, Y. (2022). Database meets AI: A survey. IEEE Transactions on Knowledge and Data Engineering, 34(3), 1096–1116.