Efficient Machine Learning Acceleration with Randomized Linear Algebra for Big Data

Authors

DOI:

https://doi.org/10.33050/rw6c2y67

Keywords:

Machine Learning, Big Data, Randomized Linear Algebra, Computational Efficiency, Scalable Algorithms

Abstract

The rapid growth of big data has significantly increased the computational complexity of machine learning models, particularly due to intensive linear algebra operations that limit scalability and efficiency. This study aims to investigate the effectiveness of Randomized Linear Algebra (RLA) as an acceleration strategy for machine learning in large scale data environments. The research adopts an experimental methodology by integrating randomized techniques such as matrix sketching and random projection into standard machine learning pipelines and evaluating their performance against deterministic baseline approaches. Experiments are conducted on large dimensional datasets using multiple machine learning models, with performance assessed in terms of computational time, memory usage, model accuracy, and scalability. The results demonstrate that the proposed RLA based approach substantially reduces computational cost and memory consumption while maintaining comparable predictive accuracy to conventional methods. These findings indicate that randomized techniques provide an effective trade off between efficiency and accuracy, enabling scalable machine learning for big data applications. In conclusion, this study contributes to the advancement of efficient Artificial Intelligence (AI) systems by demonstrating that RLA can serve as a practical and scalable solution for accelerating machine learning computations in big data contexts, aligning with the growing demand for resource efficient and high performance AI infrastructures.

References

M. Derezi´nski and M. W. Mahoney, “Recent and upcoming developments in randomized numerical linear algebra for machine learning,” in Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024, pp. 6470–6479.

E. Gelvez-Almeida, M. Mora, R. J. Barrientos, R. Hern´andez-Garc´ıa, K. Vilches-Ponce, and M. Vera, “A review on large-scale data processing with parallel and distributed randomized extreme learning machine neural networks,” Mathematical and Computational Applications, vol. 29, no. 3, p. 40, 2024.

T. Diamandis, Z. Frangella, S. Zhao, B. Stellato, and M. Udell, “Genios: an (almost) second-order operator-splitting solver for large-scale convex optimization,” Mathematical Programming Computation, pp. 1–49, 2026.

D. Robert, F. Oganda, A. Sutarman, W. Hidayat, and A. Fitriani, “Machine learning techniques for predicting the success of ai-enabled startups in the digital economy,” in CORISINTA, vol. 1, no. 1, 2024, pp. 61–69.

Y. Cho, J. Demmel, M. Derezi´nski, H. Li, H. Luo, M. Mahoney, and R. Murray, “Surrogate-based auto- tuning for randomized sketching algorithms in regression problems,” SIAM Journal on Matrix Analysis and Applications, vol. 46, no. 2, pp. 1247–1279, 2025.

B. Carrel, “Randomized methods for dynamical low-rank approximation,” Journal of Computational Physics, p. 114421, 2025.

R. Murray, J. Demmel, M. W. Mahoney, N. B. Erichson, M. Melnichenko, O. A. Malik, L. Grigori, P. Luszczek, M. Derezi´nski, M. E. Lopes et al., “Randomized numerical linear algebra: A perspective on the field with an eye to software,” arXiv preprint arXiv:2302.11474, 2023.

U. Rahardja, M. Miftah, M. Rakhmansyah, and J. Zanubiya, “Revolutionizing financial services with big data and fintech: A scalable approach to innovation,” ADI Journal on Recent Innovation, vol. 6, no. 2, pp. 118–129, 2025.

Y. Nakatsukasa and J. A. Tropp, “Fast and accurate randomized algorithms for linear systems and eigenvalue problems,” SIAM journal on matrix analysis and applications, vol. 45, no. 2, pp. 1183–1214, 2024.

S. Garg, K. Tan, and M. Derezi´nski, “Distributed least squares in small space via sketching and bias reduction,” Advances in Neural Information Processing Systems, vol. 37, pp. 73 745–73 782, 2024.

S. Na, M. Derezi´nski, and M. W. Mahoney, “Hessian averaging in stochastic newton methods achieves superlinear convergence,” Mathematical Programming, vol. 201, no. 1, pp. 473–520, 2023.

M. Kuswari, R. Gantino, and J. Maratis, “Maximizing healthcare service information system: Under- standing the influence of integration on efficiency,” ADI Journal on Recent Innovation, vol. 6, no. 2, pp. 108–117, 2025.

M. A. Ibrahim, S. Aga, A. Li, S. Pati, and M. Islam, “Just-in-time quantization with processing-in-memory

for efficient ml training,” arXiv preprint arXiv:2311.05034, 2023.

H. Ootomo and R. Yokota, “Mixed-precision random projection for randnla on tensor cores,” in Proceedings of the Platform for Advanced Scientific Computing Conference, 2023, pp. 1–11.

H. Wang, Y. Zhu, J. Li, X. Liu, Y. Ma, Y. Yao, J. Zhang, and L. Pan, “Machine learning-accelerated exploration on element doping-triggering material performance improvement for energy conversion and storage applications,” Journal of Materials Chemistry A, vol. 13, no. 23, pp. 17 197–17 213, 2025.

S. Shaumiwaty, H. R. C. MOCHAMAD, and N. HENI, “Enhancing personalized learning using artificial intelligence and machine learning approaches,” BLOCKCHAIN FRONTIER TECHNOLOGY: Pandawan, vol. 4, no. 2, pp. 156–170, 2025.

M. Akram, S. U. Bazai, M. I. Ghafoor, S. Akram, Q. M. Ilyas, A. Mehmood, S. Iqbal, and M. A. Rafique, “Eemlcr: Energy-efficient machine learning-based clustering and routing for wireless sensor networks,” IEEE Access, 2025.

C. Ajmal, S. Yerram, V. Abishek, V. M. Nizam, G. Aglave, J. D. Patnam, R. S. Raghuvanshi, and S. Srivastava, “Innovative approaches in regulatory affairs: leveraging artificial intelligence and machine learning for efficient compliance and decision-making,” The AAPS Journal, vol. 27, no. 1, p. 22, 2025.

L. Ding, D. Hong, M. Zhao, H. Chen, C. Li, J. Deng, N. Yokoya, L. Bruzzone, and J. Chanussot, “A survey of sample-efficient deep learning for change detection in remote sensing: Tasks, strategies, and challenges,” IEEE Geoscience and Remote Sensing Magazine, 2025.

N. P. L. Santoso, B. Rawat, S. R. Ratri, D. Danang, D. F. C. Kumoro, R. Supriati, and E. A. Natalia, “Transformation of indonesian language in social media using ai expert systems and machine learning,” International Transactions on Artificial Intelligence, vol. 3, no. 2, pp. 130–139, 2025.

K. Zhu, E. Trizio, J. Zhang, R. Hu, L. Jiang, T. Hou, and L. Bonati, “Enhanced sampling in the age of machine learning: algorithms and applications,” Chemical Reviews, vol. 126, no. 1, pp. 671–713, 2025.

O. Azeez and A. Abdulazeez, “Classification of brain tumor based on machine learning algorithms: a review,” Journal of Applied Science and Technology Trends, vol. 6, no. 1, pp. 01–15, 2025.

V. V. R. Karna, V. R. Karna, V. Janamala, V. K. R. Devana, V. R. S. Ch, and A. B. Tummala, “A comprehensive review on heart disease risk prediction using machine learning and deep learning algorithms,” Archives of Computational Methods in Engineering, vol. 32, no. 3, pp. 1763–1795, 2025.

S. Purnama, C. S. Bangun, and E. P. Mahadewi, “Predicting consumer purchase intention in personal shopper services using big data analytics and sem,” International Journal of Cyber and IT Service Management (IJCITSM), vol. 5, no. 1, pp. 105–119, 2025.

A. Jaya, F. Saputra, Derlina, D. N. Ramadhan, and T. Green, “Efficient machine learning acceleration with randomized linear algebra for big data,” Journal of Computer Science and

Technology Application (CORISINTA), vol. 3, no. 1, pp. 1–10, 2026. [Online]. Available: https://doi.org/10.33050/corisinta.v3n1.147

R. F. Dzikra and D. Syarifah, “Linear algebra in geometric transformations,” INFORMATECH: Jurnal Ilmiah Informatika dan Komputer, 2025, accessed: Feb. 19, 2026. [Online]. Available: https://garuda.kemdiktisaintek.go.id/documents/detail/5313022

V. Patel, D. A. Maldonado, M. Melnichenko, N. Pritchard, V. Rao, E. Rebrova, S. Sankararaman, and M. Schweitzer, “Scientific applications leveraging randomized linear algebra,” arXiv preprint arXiv:2506.16457, 2025.

C. Cama˜no, E. N. Epperly, R. A. Meyer, and J. A. Tropp, “Faster linear algebra algorithms with structured random matrices,” arXiv preprint arXiv:2508.21189, 2025.

E. N. Epperly, “Make the most of what you have: Resource-efficient randomized algorithms for matrix computations,” arXiv preprint arXiv:2512.15929, 2025.

A. Yesypenko and P.-G. Martinsson, “Randomized strong recursive skeletonization: Simultaneous compression and lu factorization of hierarchical matrices using matrix–vector products: A. yesypenko, p.-g. martinsson,” Journal of Scientific Computing, vol. 106, no. 3, p. 63, 2026.

C. Sanderson and R. Curtin, “Armadillo: an efficient framework for numerical linear algebra,” in 2025 17th International Conference on Computer and Automation Engineering (ICCAE). IEEE, 2025, pp. 303–307.

A. Lavaee, “Sketch’n solve: An efficient python package for large-scale least squares using randomized numerical linear algebra,” arXiv preprint arXiv:2409.14309, 2024.

H. M. Cekirge, “Algebraic σ-based (cekirge) model for deterministic and energy-efficient unsupervised machine learning,” Am. J. Artif. Intell, vol. 9, no. 2, pp. 198–205, 2025.

M. Choiri, E. S. Pramudito, F. Sutisna, and R. S. Sean, “Business artificial intelligence for enhancing sustainable decision intelligence,” IAIC Transactions on Sustainable Digital Innovation (ITSDI), vol. 7, no. 1, pp. 106–116, 2025.

K. Hellmuth, R. Jin, Q. Li, and S. J. Wright, “Data selection: at the interface of pde-based inverse problem and randomized linear algebra,” arXiv preprint arXiv:2510.01567, 2025.

R. Afiyanto, E. D. Astuti, A. A. Kamal, and N. P. L. Santoso, “Reliability assessment of attendance systems based on face recognition under varying lighting conditions,” International Transactions on Artificial Intelligence, vol. 4, no. 1, pp. 25–37, 2025.

Z. Frangella, Randomized numerical linear algebra for large-scale optimization. Stanford University, 2025.

C. Zhang, “Mathematical computation on high-dimensional data via array programming and parallel acceleration,” arXiv preprint arXiv:2506.22929, 2025.

L. Theodorakopoulos, A. Theodoropoulou, A. Tsimakis, and C. Halkiopoulos, “Big data-driven distributed machine learning for scalable credit card fraud detection using pyspark, xgboost, and catboost,” Electronics, vol. 14, no. 9, p. 1754, 2025.

R. G. Rahmadani, O. D. Nurhayati, and D. M. K. Nugraheni, “Oriented enterprise architecture for enhancing digital governance and technopreneurship in regional governments,” Aptisi Transactions on Technopreneurship (ATT), vol. 7, no. 3, pp. 942–956, 2025.

A. Rajeev, R. Shah, P. Shah, M. Shah, and R. Nanavaty, “The potential of big data and machine learning

for ground water quality assessment and prediction: A. rajeev et al.” Archives of Computational Methods in Engineering, vol. 32, no. 2, pp. 927–941, 2025.

A. Pagliaro, “Artificial intelligence vs. efficient markets: A critical reassessment of predictive models in the big data era,” Electronics, vol. 14, no. 9, p. 1721, 2025.

M. Shamsa, J. D. Martin, M. Phielipp, T. Macieira, L. Lingappan, B. Kelly, D. Lerner, M. Tucknott, and E. Hansen, “Improved silent data error detection through test optimization using reinforcement learning,” in 2025 IEEE International Reliability Physics Symposium (IRPS). IEEE, 2025, pp. 8C–1.

M. Gates, A. Abdelfattah, K. Akbudak, M. Al Farhan, R. Alomairy, D. Bielich, T. Burgess, S. Cayrols, N. Lindquist, D. Sukkari et al., “Evolution of the slate linear algebra library,” The International Journal of High Performance Computing Applications, vol. 39, no. 1, pp. 3–17, 2025.

R. Z. Ikhsan, S. Rahayu, A. H. Arribathi, and N. Azizah, “Integrating artificial intelligence with 3d printing technology in healthcare: Sustainable solutions for clinical training optimization,” ADI Journal on Recent Innovation, vol. 6, no. 2, pp. 99–107, 2025.

A. K. Saibaba and A. Miedlar, “Randomized low-rank approximations beyond gaussian random matrices,” SIAM Journal on Mathematics of Data Science, vol. 7, no. 1, pp. 136–162, 2025.

O. R. Dunbar, N. H. Nelsen, and M. Mutic, “Hyperparameter optimization for randomized algorithms: a case study on random features,” Statistics and Computing, vol. 35, no. 3, p. 56, 2025.

U. Bakhtiar, D. Joo, and B. Asgari, “Pipirima: Predicting patterns in sparsity to accelerate matrix algebra,” in 2025 62nd ACM/IEEE Design Automation Conference (DAC). IEEE, 2025, pp. 1–7.

M. O. Syaidina, R. Fahrudin, and I. A. Mutiara, “Implementation of ethics of using artificial intelligence in the education system in indonesia,” Blockchain Frontier Technology, vol. 4, no. 1, pp. 63–71, 2024.

J. Xie, H.-D. Qi, and D. Han, “Randomized iterative methods for generalized absolute value equations: Solvability and error bounds,” SIAM Journal on Optimization, vol. 35, no. 3, pp. 1731–1760, 2025.

Downloads

Published

2026-02-27

Issue

Section

Articles

How to Cite

Efficient Machine Learning Acceleration with Randomized Linear Algebra for Big Data. (2026). Journal of Computer Science and Technology Application, 3(1), 105-116. https://doi.org/10.33050/rw6c2y67