Hardware Support for Intelligent Text Analysis using FPGA for Accelerating Random Forest-based Classification

Vishniakou Uladzimir Anatol'evich; Yu ChuYue

Call for Paper

July Edition

IJCA solicits high quality original research papers for the upcoming July edition of the journal. The last date of research paper submission is 20 June 2025

Submit your paper

Know more

The week's pick

Designing Multi-Tenant E-Learning Systems in the Cloud: A Process-Oriented Approach for Higher Education

Sameh Azouzi Sonia Ayachi Ghannouchi

Random Articles

Data Mining using Modified GFMM Neural Network

April

2015

Monitoring System using GSM

May

2015

ON Tiling Patterns Involving Islamic Stars with an Odd Number of Vertices

March

2013

Design and Implementation of Scalable, Fully Distributed Web Crawler for a Web Search Engine

February

2011

Reseach Article

Hardware Support for Intelligent Text Analysis using FPGA for Accelerating Random Forest-based Classification

by Vishniakou Uladzimir Anatol'evich, Yu ChuYue

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 186 - Number 81

Year of Publication: 2025

Authors: Vishniakou Uladzimir Anatol'evich, Yu ChuYue

10.5120/ijca2025924782

Vishniakou Uladzimir Anatol'evich, Yu ChuYue . Hardware Support for Intelligent Text Analysis using FPGA for Accelerating Random Forest-based Classification. International Journal of Computer Applications. 186, 81 ( Apr 2025), 49-54. DOI=10.5120/ijca2025924782

@article{ 10.5120/ijca2025924782,

author = { Vishniakou Uladzimir Anatol'evich, Yu ChuYue },

title = { Hardware Support for Intelligent Text Analysis using FPGA for Accelerating Random Forest-based Classification },

journal = { International Journal of Computer Applications },

issue_date = { Apr 2025 },

volume = { 186 },

number = { 81 },

month = { Apr },

year = { 2025 },

issn = { 0975-8887 },

pages = { 49-54 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume186/number81/hardware-support-for-intelligent-text-analysis-using-fpga-for-accelerating-random-forest-based-classification/ },

doi = { 10.5120/ijca2025924782 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2025-04-26T02:19:43.125308+05:30

%A Vishniakou Uladzimir Anatol'evich

%A Yu ChuYue

%T Hardware Support for Intelligent Text Analysis using FPGA for Accelerating Random Forest-based Classification

%J International Journal of Computer Applications

%@ 0975-8887

%V 186

%N 81

%P 49-54

%D 2025

%I Foundation of Computer Science (FCS), NY, USA

Abstract

Efficient analysis and classification of text performed at the edge of a network, especially on platforms with limited resources such as embedded systems and FPGA devices, creates computational challenges. Traditional CPU and GPU-based natural language processing (NLP) methods struggle to meet the real-time and energy efficiency requirements of peripheral computing scenarios. To eliminate these limitations, this study suggests hardware support for an FPGA-based random forest algorithm for text classification. To meet the resource constraints inherent in embedded and FPGA-based systems, the proposed methodology includes model compression, simplified algorithmic optimization, fixed-parameter configurations, fixed-point computing, and dimensionality reduction techniques, which effectively reduces both computational complexity and memory consumption. A hybrid CPU-FPGA pipelining architecture has been developed, in which the central processor performs text preprocessing tasks, including tokenization, TF-IDF vector computing, and function normalization, while the FPGA accelerates data output from the random forest algorithm using parallel computing and pipelining strategies. The FPGA implementation has been thoroughly tested for compliance with the Python-based reference processor model through a joint software and hardware verification process. The results demonstrated a high degree of numerical consistency, reaching a similarity of 0.9990, which confirms the correctness of the end-to-end logic of feature extraction and classification. The proposed FPGA architecture provides a scalable solution for high-performance, low-latency NLP applications suitable for deployment in peripheral computing environments.

References

Andrzej Janowski. Natural Language Processing Techniques for Clinical Text Analysis in Healthcare. Journal of Advanced Analytics in Healthcare Management, 7(1):51–76, Mar. 2023.
Rehana H, Çam NB, Basmaci M, Zheng J, Jemiyo C, He Y, Özgür A, Hur J. Evaluation of GPT and BERT-based models on identifying proteinprotein interactions in biomedical text. ArXiv, 2023: arXiv: 2303.17728 v2.
Martin Wisniewski, Lucas, Jean-Michel Bec, Guillaume Boguszewski, and Abdoulaye Gamatié. Hardware Solutions for Low-Power Smart Edge Computing. Journal of Low Power Electronics and Applications, 12(4):61, 2022.
Movva, Rajiv, Jinhao Lei, Shayne Longpre, Ajay Gupta, and Chris DuBois. Combining Compressions for Multiplicative Size Scaling on Natural Language Tasks. arXiv preprint, 2022. arXiv:2208.09684.
Liao, Youqi, Shuhao Kang, Jianping Li, Yang Liu, Yun Liu, Zhen Dong, Bisheng Yang, and Xieyuanli Chen. Mobile-seed: Joint semantic segmentation and boundary detection for mobile robots. IEEE Robotics and Automation Letters, 2024.
Liu, Linyuan, Haibin Zhu, Tianxing Wang, and Mingwei Tang. A Fast and Efficient Task Offloading Approach in Edge-Cloud Collaboration Environment. Electronics, 13(2): 313, 2024.
Zhang, Chaoyu, Hexuan Yu, Yuchen Zhou, and Hai Jiang. High-Performance and Energy-Efficient FPGA-GPU-CPU Heterogeneous System Implementation. In Advances in Parallel & Distributed Processing, and Applications: Proceedings from PDPTA'20, CSC'20, MSV'20, and GCC'20, pages 477–492. Springer, 2021.
Mouri Zadeh Khaki A, Choi A. Optimizing Deep Learning Acceleration on FPGA for Real-Time and Resource-Efficient Image Classification. Applied Sciences, 15(1), 2025.
Hamza Khan, Asma Khan, Zainab Khan, Lun Bin Huang, Kun Wang, and Lei He. NPE: An FPGA-based Overlay Processor for Natural Language Processing. arXiv preprint arXiv:2104.06535, 2021.
Vishniakou U.A and Chuyue Yu. Using Machine Learning for Recognition of Alzheimer's Disease Based on Transcription Information. Reports of BSUIR, 21(6): 106–112, 2023.
Saturnino Luz, Fasih Haider, Sofia de la Fuente, Davida Fromm, and Brian MacWhinney. Alzheimer’s dementia recognition through spontaneous speech: The ADReSS challenge. arXiv preprint arXiv:2004.06833, 2020.

Index Terms

Computer Science

Information Sciences

Algorithms

Hardware Acceleration

Natural Language Processing

FPGA

Embedded Systems

Edge Computing

Machine Learning

Performance Optimization

Verification.

Keywords

FPGA Random Forest Text Analytics TF-IDF Hardware Acceleration.