| International Journal of Computer Applications |
| Foundation of Computer Science (FCS), NY, USA |
| Volume 187 - Number 90 |
| Year of Publication: 2026 |
| Authors: Rodiah, Diana Tri Susetianingtias, Eka Patriya |
10.5120/ijca2026926592
|
Rodiah, Diana Tri Susetianingtias, Eka Patriya . Integrated Framework for House Price and Price-Zone Prediction with Natural Language Processing Chatbot. International Journal of Computer Applications. 187, 90 ( Mar 2026), 36-44. DOI=10.5120/ijca2026926592
Accurate housing price estimation is essential for supporting real estate decision making and urban economic planning. This study proposes an integrated framework that combines ensemble machine learning models with a Natural Language Processing (NLP) based conversational interface for housing price prediction and price-zone classification in the JABODETABEK region. A dataset of 3,553 property listings was preprocessed through data cleaning, missing value handling, outlier detection using the Interquartile Range (IQR) method, logarithmic transformation, and feature engineering. Comparative experiments were conducted using Linear Regression, Random Forest, Gradient Boosting, and XGBoost for regression tasks, and Random Forest, Decision Tree, K-Nearest Neighbors, and Gradient Boosting for classification tasks. XGBoost achieved the best regression performance with approximately 96% predictive accuracy, while Random Forest demonstrated superior classification performance with an accuracy of 87.46%. The NLP intent classification module, developed using a Bag-of-Words representation and Multinomial Naïve Bayes, achieved 94.82% training accuracy and 90.20% testing accuracy. All components were integrated into a Command Line Interface (CLI)-based chatbot capable of interpreting user queries and generating automated price estimations and price-zone classifications. The results demonstrate that the proposed unified framework provides robust predictive performance while enhancing user accessibility through conversational interaction.