Learning Causal and Sequential Patterns in Student Knowledge Tracing via Transformer-Augmented Bayesian Networks

Rachel Kim

doi:10.54097/wznpy315

Authors

Rachel Kim

DOI:

https://doi.org/10.54097/wznpy315

Keywords:

Knowledge Tracing, Causal Learning, Sequential Patterns, Transformer Networks, Bayesian Networks, Educational Data Mining, Probabilistic Graphical Models, Attention Mechanisms

Abstract

Student knowledge tracing requires sophisticated modeling approaches that can capture both the causal relationships between learning concepts and the sequential patterns of knowledge acquisition over time. Traditional knowledge tracing methods struggle to simultaneously model causal dependencies and temporal learning dynamics while maintaining interpretability for educational practitioners and maintaining computational efficiency for real-time applications. The challenge lies in developing frameworks that can learn complex causal structures from educational data while effectively capturing the sequential nature of learning processes and providing actionable insights for personalized education. This study proposes a novel Transformer-Augmented Bayesian Network (TABN) framework that integrates transformer architectures with probabilistic graphical models to enable comprehensive modeling of causal and sequential patterns in student knowledge tracing. The framework employs transformer networks to capture long-range sequential dependencies in learning trajectories while utilizing Bayesian networks to model causal relationships between knowledge concepts. The integrated approach enables joint learning of causal structures and sequential patterns through end-to-end optimization while maintaining probabilistic interpretability essential for educational applications. Experimental evaluation using large-scale educational datasets demonstrates that the proposed framework achieves 41% improvement in knowledge tracing accuracy compared to traditional methods. The TABN approach results in 36% better prediction of learning outcomes and 44% improvement in causal relationship discovery between knowledge concepts. The framework successfully combines the sequential modeling capabilities of transformers with the causal reasoning advantages of Bayesian networks, resulting in 32% better interpretability scores and 28% improvement in educational decision support compared to existing knowledge tracing approaches.

Downloads

Download data is not yet available.

References

[1] Khor, E. T., & K, M. (2023). A systematic review of the role of learning analytics in supporting personalized learning. Education sciences, 14(1), 51.

[2] Mai, N., & Cao, W. (2025). Personalized Learning and Adaptive Systems: AI-Driven Educational Innovation and Student Outcome Enhancement. International Journal of Education and Humanities.

[3] Abdelrahman, G., Wang, Q., & Nunes, B. (2023). Knowledge tracing: A survey. ACM Computing Surveys, 55(11), 1-37.

[4] Saqr, M., & López-Pernas, S. (2023). The temporal dynamics of online problem-based learning: Why and when sequence matters. International Journal of Computer-Supported Collaborative Learning, 18(1), 11-37.

[5] Muhamedyev, R., Yakunin, K., Kuchin, Y. A., Symagulov, A., Buldybayev, T., Murzakhmetov, S., & Abdurazakov, A. (2020). The use of machine learning ‘black boxes’ explanation systems to improve the quality of school education. Cogent Engineering, 7(1), 1769349.

[6] Šarić-Grgić, I., Grubišić, A., & Gašpar, A. (2024). Twenty-Five Years of Bayesian knowledge tracing: a systematic review. User modeling and user-adapted interaction, 34(4), 1127-1173.

[7] Gervet, T., Koedinger, K., Schneider, J., & Mitchell, T. (2020). When is deep learning the best approach to knowledge tracing?. Journal of Educational Data Mining, 12(3), 31-54.

[8] Lyu, L., Wang, Z., Yun, H., Yang, Z., & Li, Y. (2022). Deep knowledge tracing based on spatial and temporal representation learning for learning performance prediction. Applied Sciences, 12(14), 7188.

[9] Choi, S. R., & Lee, M. (2023). Transformer architecture and attention mechanisms in genome data analysis: a comprehensive review. Biology, 12(7), 1033.

[10] Grimsley, C., Mayfield, E., & Bursten, J. (2020). Why attention is not explanation: Surgical intervention and causal reasoning about neural models.

[11] Cao, W., Mai, N., & Liu, W. (2025). Adaptive Knowledge Assessment via Symmetric Hierarchical Bayesian Neural Networks with Graph Symmetry-Aware Concept Dependencies. Symmetry.

[12] Topuz, K., Jones, B. D., Sahbaz, S., & Moqbel, M. (2021). Methodology to combine theoretical knowledge with a data-driven probabilistic graphical model. Journal of Business Analytics, 4(2), 125-139.

[13] Kondo, N., & Hatanaka, T. (2019). Modeling of learning process based on Bayesian networks. Educational technology research, 41(1), 57-67.

[14] Hooshyar, D., & Druzdzel, M. J. (2024). Memory-Based Dynamic Bayesian Networks for Learner Modeling: Towards Early Prediction of Learners’Performance in Computational Thinking. Education Sciences, 14(8), 917.

[15] Bernacki, M. L., Greene, M. J., & Lobczowski, N. G. (2021). A systematic review of research on personalized learning: Personalized by whom, to what, how, and for what purpose (s)?. Educational Psychology Review, 33(4), 1675-1715.

[16] Gan, W., Dao, M. S., Zettsu, K., & Sun, Y. (2022, June). IoT-based multimodal analysis for smart education: Current status, challenges and opportunities. In Proceedings of the 3rd ACM Workshop on Intelligent Cross-Data Analysis and Retrieval (pp. 32-40).

[17] Kumar, G., Basri, S., Imam, A. A., Khowaja, S. A., Capretz, L. F., & Balogun, A. O. (2021). Data harmonization for heterogeneous datasets: a systematic literature review. Applied Sciences, 11(17), 8275.

[18] Xing, S., & Wang, Y. (2025). Proactive Data Placement in Heterogeneous Storage Systems via Predictive Multi-Objective Reinforcement Learning. IEEE Access.

[19] Ying, Z., Bourgeois, D., You, J., Zitnik, M., & Leskovec, J. (2019). Gnnexplainer: Generating explanations for graph neural networks. Advances in neural information processing systems, 32.

[20] Wang, M., Zhang, X., Yang, Y., & Wang, J. (2025). Explainable Machine Learning in Risk Management: Balancing Accuracy and Interpretability. Journal of Financial Risk Management, 14(3), 185-198.

[21] Zheng, W., Tan, Y., Jiang, B., & Wang, J. (2025). Integrating Machine Learning into Financial Forensics for Smarter Fraud Prevention. Technology and Investment, 16(3), 79-90.

[22] Arpaci, I. (2019). A hybrid modeling approach for predicting the educational use of mobile cloud computing services in higher education. Computers in human Behavior, 90, 181-187.

[23] Zhang, H., Ge, Y., Zhao, X., & Wang, J. (2025). Hierarchical Deep Reinforcement Learning for Multi-Objective Integrated Circuit Physical Layout Optimization with Congestion-Aware Reward Shaping. IEEE Access

[24] Cao, J., Zheng, W., Ge, Y., & Wang, J. (2025). DriftShield: Autonomous Fraud Detection via Actor-Critic Reinforcement Learning with Dynamic Feature Reweighting. IEEE Open Journal of the Computer Society.