
www.rsisinternational.org
INTERNATIONAL JOURNAL OF LATEST TECHNOLOGY IN ENGINEERING,
MANAGEMENT & APPLIED SCIENCE (IJLTEMAS)
ISSN 2278-2540 | DOI: 10.51583/IJLTEMAS | Volume XV, Issue III, March 2026
Machine learning techniques like logistic regression, support vector machines (SVMs), K nearest neighbors
(KNNs), decision trees and aggregation methods, are widely used to predict heart disease. These techniques
allow automated prediction by observing from existing patient data and identifying patterns connected
to the emergence of illnesses. Despite this, many previous studies face challenges such as limited prediction
accuracy and inadequate treatment of characteristic interactions, and insufficient generalization in different
datasets. Furthermore, some models are too modified or do not precisely adapt non-linear connections between
medical attributes.
This study aims to overcome these constraints proposes a model for the forecast of heart disease using many
machine learning algorithms, focusing mainly on a random forest classification. The model undergoes testing
and training on a UCI database containing relevant clinical features. To enhance data quality and model
performance, a variety of preprocessing methods are employed. Our study shows that the KNN model achieves
best performance which is 91.8 % accuracy, indicating its effectiveness in heart disease prediction. Our aims are
to provide a reliable and efficient decision support system to help medical professionals identify and assess risks
early. Our approach uses AI&ML which contributes to improving prediction accuracy and supporting better
health outcomes.
LITERATURE REVIEW
In recent years, various studies have been performed to improve early forecasting cardiovascular disease using
machine learning techniques. Researchers have studied data-driven approaches for analyzing clinical data sets
and identifying patterns associated with cardiovascular diseases. These methods are intended to assist medical
professionals in making accurate and timely decisions and ultimately reduce mortality rates. Machine learning
algorithms have demonstrated considerable promise in deriving valuable insights from intricate medical data
that traditional methods cannot capture. Previous research focused on classical classification algorithms like
Logistic regression and decision trees for predicting heart disease. These techniques' simplicity and clarity serve
as a foundation for forecasting modeling.
Logistic regression is widely used in binary classification problems, and decision trees represent effective in
handling nonlinear relationships between features. However, when applied to large and complex datasets, these
models frequently experience issues with limitations such as lower accuracy and sensitivity to data change. To
achieve overcome these challenges, advanced machine learning algorithms, such as support vector machines
(SVMs) and K-nearest neighbors (KNNs), have been introduced. SVM is well-known for establishing optimal
and managing high-dimensional data boundaries for medical classification tasks. Even so, the
prediction performance is improved by these techniques, they may require careful parameter adjustments and
require computational costs for large data sets.
In recent times, combined learning techniques such as random forest have received widespread attention due to,
they have improved prediction accuracy and reduced over-adaptation. Random forests integrate numerous
decision trees to produce more robust and reliable predictions. Various research has shown that ensemble models
have the advantage of outperforming individual classifications by recording complex character interactions and
reducing variances. In addition, techniques for choosing features are utilized in many research to find out the
most important clinical traits, making models even better. Despite progress in this region, there exist still some
limitations. Many studies indicate problems such as imbalanced data sets, insufficient generalization between
different populations, and the lack of complex model interpretation.
Furthermore, some models do not effectively use all available clinical features, which can affect predictive
accuracy. These difficulties underscore the need for greater efficient and scalable approaches to providing
precise and trustworthy predictions. Considering this situation, the current study centers on creating a heart
disease predicting system employing a range of machine learning algorithms, with a focus on KNN
classification. This study aims to achieve higher prediction accuracy and better performance by applying
appropriate pre-processing techniques and evaluating different models. The results contribute to ongoing
research into intelligent health systems and show the efficacy of group learning in medical diagnosis.