CFP last date
20 May 2025
Reseach Article

Hardware Support for Intelligent Text Analysis using FPGA for Accelerating Random Forest-based Classification

by Vishniakou Uladzimir Anatol'evich, Yu ChuYue
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 186 - Number 81
Year of Publication: 2025
Authors: Vishniakou Uladzimir Anatol'evich, Yu ChuYue
10.5120/ijca2025924782

Vishniakou Uladzimir Anatol'evich, Yu ChuYue . Hardware Support for Intelligent Text Analysis using FPGA for Accelerating Random Forest-based Classification. International Journal of Computer Applications. 186, 81 ( Apr 2025), 49-54. DOI=10.5120/ijca2025924782

@article{ 10.5120/ijca2025924782,
author = { Vishniakou Uladzimir Anatol'evich, Yu ChuYue },
title = { Hardware Support for Intelligent Text Analysis using FPGA for Accelerating Random Forest-based Classification },
journal = { International Journal of Computer Applications },
issue_date = { Apr 2025 },
volume = { 186 },
number = { 81 },
month = { Apr },
year = { 2025 },
issn = { 0975-8887 },
pages = { 49-54 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume186/number81/hardware-support-for-intelligent-text-analysis-using-fpga-for-accelerating-random-forest-based-classification/ },
doi = { 10.5120/ijca2025924782 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2025-04-26T02:19:43.125308+05:30
%A Vishniakou Uladzimir Anatol'evich
%A Yu ChuYue
%T Hardware Support for Intelligent Text Analysis using FPGA for Accelerating Random Forest-based Classification
%J International Journal of Computer Applications
%@ 0975-8887
%V 186
%N 81
%P 49-54
%D 2025
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Efficient analysis and classification of text performed at the edge of a network, especially on platforms with limited resources such as embedded systems and FPGA devices, creates computational challenges. Traditional CPU and GPU-based natural language processing (NLP) methods struggle to meet the real-time and energy efficiency requirements of peripheral computing scenarios. To eliminate these limitations, this study suggests hardware support for an FPGA-based random forest algorithm for text classification. To meet the resource constraints inherent in embedded and FPGA-based systems, the proposed methodology includes model compression, simplified algorithmic optimization, fixed-parameter configurations, fixed-point computing, and dimensionality reduction techniques, which effectively reduces both computational complexity and memory consumption. A hybrid CPU-FPGA pipelining architecture has been developed, in which the central processor performs text preprocessing tasks, including tokenization, TF-IDF vector computing, and function normalization, while the FPGA accelerates data output from the random forest algorithm using parallel computing and pipelining strategies. The FPGA implementation has been thoroughly tested for compliance with the Python-based reference processor model through a joint software and hardware verification process. The results demonstrated a high degree of numerical consistency, reaching a similarity of 0.9990, which confirms the correctness of the end-to-end logic of feature extraction and classification. The proposed FPGA architecture provides a scalable solution for high-performance, low-latency NLP applications suitable for deployment in peripheral computing environments.

References
  1. Andrzej Janowski. Natural Language Processing Techniques for Clinical Text Analysis in Healthcare. Journal of Advanced Analytics in Healthcare Management, 7(1):51–76, Mar. 2023.
  2. Rehana H, Çam NB, Basmaci M, Zheng J, Jemiyo C, He Y, Özgür A, Hur J. Evaluation of GPT and BERT-based models on identifying proteinprotein interactions in biomedical text. ArXiv, 2023: arXiv: 2303.17728 v2.
  3. Martin Wisniewski, Lucas, Jean-Michel Bec, Guillaume Boguszewski, and Abdoulaye Gamatié. Hardware Solutions for Low-Power Smart Edge Computing. Journal of Low Power Electronics and Applications, 12(4):61, 2022.
  4. Movva, Rajiv, Jinhao Lei, Shayne Longpre, Ajay Gupta, and Chris DuBois. Combining Compressions for Multiplicative Size Scaling on Natural Language Tasks. arXiv preprint, 2022. arXiv:2208.09684.
  5. Liao, Youqi, Shuhao Kang, Jianping Li, Yang Liu, Yun Liu, Zhen Dong, Bisheng Yang, and Xieyuanli Chen. Mobile-seed: Joint semantic segmentation and boundary detection for mobile robots. IEEE Robotics and Automation Letters, 2024.
  6. Liu, Linyuan, Haibin Zhu, Tianxing Wang, and Mingwei Tang. A Fast and Efficient Task Offloading Approach in Edge-Cloud Collaboration Environment. Electronics, 13(2): 313, 2024.
  7. Zhang, Chaoyu, Hexuan Yu, Yuchen Zhou, and Hai Jiang. High-Performance and Energy-Efficient FPGA-GPU-CPU Heterogeneous System Implementation. In Advances in Parallel & Distributed Processing, and Applications: Proceedings from PDPTA'20, CSC'20, MSV'20, and GCC'20, pages 477–492. Springer, 2021.
  8. Mouri Zadeh Khaki A, Choi A. Optimizing Deep Learning Acceleration on FPGA for Real-Time and Resource-Efficient Image Classification. Applied Sciences, 15(1), 2025.
  9. Hamza Khan, Asma Khan, Zainab Khan, Lun Bin Huang, Kun Wang, and Lei He. NPE: An FPGA-based Overlay Processor for Natural Language Processing. arXiv preprint arXiv:2104.06535, 2021.
  10. Vishniakou U.A and Chuyue Yu. Using Machine Learning for Recognition of Alzheimer's Disease Based on Transcription Information. Reports of BSUIR, 21(6): 106–112, 2023.
  11. Saturnino Luz, Fasih Haider, Sofia de la Fuente, Davida Fromm, and Brian MacWhinney. Alzheimer’s dementia recognition through spontaneous speech: The ADReSS challenge. arXiv preprint arXiv:2004.06833, 2020.
Index Terms

Computer Science
Information Sciences
Algorithms
Hardware Acceleration
Natural Language Processing
FPGA
Embedded Systems
Edge Computing
Machine Learning
Performance Optimization
Verification.

Keywords

FPGA Random Forest Text Analytics TF-IDF Hardware Acceleration.