Articles

Access the latest knowledge in applied science, electrical engineering, computer science and information technology, education, and health.

Filter Icon

Filters article

Years

FAQ Arrow
0
0

Source Title

FAQ Arrow

Authors

FAQ Arrow

28,451 Article Results

Data-driven support vector regression-genetic algorithm model for predicting the diphtheria distribution

10.11591/ijai.v14.i4.pp2909-2921
Wiwik Anggraeni , Yeyen Sudiarti , Muhammad Ilham Perdana , Edwin Riksakomara , Adri Gabriel Sooai
Indonesia is one of the countries with the largest number of diphtheria sufferers in the world. Diphtheria is a case of re-emerging disease, especially in Indonesia. Diphtheria can be prevented by immunization. Diphtheria immunization has drastically reduced mortality and susceptibility to diphtheria, but it is still a significant childhood health problem. This study predicted the number of diphtheria patients in several regions using support vector regression (SVR) combined with the genetic algorithm (GA) for parameter optimization. The area is grouped into 3 clusters based on the number of cases. The proposed method is proven to overcome overfitting and avoid local optima. Model robustness tests were carried out in several other regions in each cluster. Based on the experiments in three scenarios and 12 areas, the hybrid model shows good forecasting results with an average mean squared error (MSE) of 0.036 and a symmetric mean absolute percentage error (SMAPE) of 41.2% with a standard deviation of 0.075 and 0.442, respectively. Based on experiments in various scenarios, the SVR-GA model shows better performance than others. Compares two- means tests on MSE and SMAPE were given to prove that SVR-GA models have better performance. The results of this forecasting can be used as a basis for policy-making to minimize the spread of diphtheria cases.
Volume: 14
Issue: 4
Page: 2909-2921
Publish at: 2025-08-01

An innovative approach for predictive modeling and staging of chronic kidney disease

10.11591/ijict.v14i2.pp684-707
Safa Boughougal , Mohamed Ridda Laouar , Abderrahim Siam , Sean Eom
Diagnosing silent diseases such as chronic kidney disease (CKD) at an early stage is challenging due to the absence of symptoms, making early detection crucial to slowing disease progression. This study addresses this challenge by introducing a novel feature, the estimated glomerular filtration rate (eGFR), calculated using the modification of diet in renal disease (MDRD) formula. We enriched our dataset by incorporating this feature, effectively increasing the volume of data at our disposal. eGFR serves as a critical indicator for diagnosing CKD and assessing its progression, thereby guiding clinical management. Our focus is on developing machine learning and deep learning models for the efficient and precise prediction of CKD. To ensure the reliability of our approach, we employed robust data collection and preprocessing techniques, resulting in refined information for model training. Our methodology integrates various machine learning and deep learning models, including four machine learning algorithms: adaptive boosting (AdaBoost), random forest (RF), Bagging, and artificial neural network (ANN), as well as a hybrid model. Our proposed ANN_AdaBoost model not only introduces a novel perspective by addressing an identified gap but significantly enhances CKD prediction.
Volume: 14
Issue: 2
Page: 684-707
Publish at: 2025-08-01

Domain-specific knowledge and context in large language models: challenges, concerns, and solutions

10.11591/ijai.v14.i4.pp2568-2578
Kiran Mayee Adavala , Om Adavala
Large language models (LLMs) are ubiquitous today with major usage in the fields of industry, research, and academia. LLMs involve unsupervised learning with large natural language data, obtained mostly from the internet. There are several challenges that arise because of these data sources. One such challenge is with respect to domain-specific knowledge and context. This paper deals with the major challenges faced by LLMs due to data sources, such as, lack of domain expertise, understanding specialized terminology, contextual understanding, data bias, and the limitations of transfer learning. This paper also discusses some solutions for the mitigation of these challenges such as pre-training LLMs on domain-specific corpora, expert annotations, improving transformer models with enhanced attention mechanisms, memory-augmented models, context-aware loss functions, balanced datasets, and the use of knowledge distillation techniques.
Volume: 14
Issue: 4
Page: 2568-2578
Publish at: 2025-08-01

Image analysis and machine learning techniques for accurate detection of common mango diseases in warm climates

10.11591/ijai.v14.i4.pp2935-2944
Md Abdullah Al Rahib , Naznin Sultana , Nirjhor Saha , Raju Mia , Monisha Sarkar , Abdus Sattar
Mangoes are valuable crops grown in warm climates, but they often suffer from diseases that harm both the trees and the fruits. This paper proposes a new way to use machine learning to detect these diseases early in mango plants. We focused on common issues like mango fruit diseases, leaf diseases, powdery mildew, anthracnose/blossom blight, and dieback, which are particularly problematic in places like Bangladesh. Our method starts by improving the quality of images of mango plants and then extracting important features from these images. We use a technique called k-means clustering to divide the images into meaningful parts for analysis. After extracting ten key features, we tested various ways to classify the diseases. The random forest algorithm stood out, accurately identifying diseases with a 97.44% success rate. This research is crucial for Bangladesh, where mango farming is essential for the economy. By spotting diseases early, we can improve mango production, quality, and the livelihoods of farmers. This automated system offers a practical way to manage mango diseases in regions with similar climates.
Volume: 14
Issue: 4
Page: 2935-2944
Publish at: 2025-08-01

Performance analysis and comparison of machine learning algorithms for predicting heart disease

10.11591/ijai.v14.i4.pp2849-2863
Neha Bhadu , Jaswinder Singh
Heart disease (HD) is a serious medical condition that has an enormous effect on people's quality of life. Early as well as accurate identification is crucial for preventing and treating HD. Traditional methods of diagnosis may not always be reliable. Non-intrusive methods like machine learning (ML) are proficient in distinguishing between patients with HD and those in good health. The prime objective of this study is to find a robust ML technique that can accurately detect the presence of HD. For this purpose, several ML algorithms were chosen based on the relevant literature studied. For this investigation, two different heart datasets the Cleveland and Statlog datasets were downloaded from Kaggle. The analysis was carried out utilizing the Waikato environment for knowledge analysis (WEKA) 3.9.6 software. To assess how well various algorithms predicted HD, the study employed a variety of performance evaluation metrics and error rates. The findings showed that for both the datasets radio frequency is a better option for predicting HD with an accuracy and receiver operating characteristic (ROC) values of 94% and 0.984 for the Cleveland dataset and 90% and 0.975 for the Statlog dataset. This work may aid researchers in creating early HD detection models and assist medical practitioners in identifying HD.
Volume: 14
Issue: 4
Page: 2849-2863
Publish at: 2025-08-01

Using the ResNet-50 pre-trained model to improve the classification output of a non-image kidney stone dataset

10.11591/ijai.v14.i4.pp3182-3191
Kazeem Oyebode , Anne Ngozi Odoh
Kidney stone detection based on urine samples seems to be a cost-effective way of detecting the formation of stones. Urine features are usually collected from patients to determine if there is a likelihood of kidney stone formation. There are existing machine learning models that can be used to classify if a stone exists in the kidney, such as the support vector machine (SVM) and deep learning (DL) models. We propose a DL network that works with a pre-trained (ResNet-50) model, making non-image urine features work with an image-based pre-trained model (ResNet-50). Six urine features collected from patients are projected onto 172,800 neurons. This output is then reshaped into a 240 by 240 by 3 tensors. The reshaped output serves as the input to the ResNet-50. The output of this is then sent into a binary classifier to determine if a kidney stone exists or not. The proposed model is benchmarked against the SVM, XGBoost, and two variants of DL networks, and it shows improved performance using the AUC-ROC, Accuracy and F1-score metrics. We demonstrate that combining non-image urine features with an image-based pre-trained model improves classification outcomes, highlighting the potential of integrating heterogeneous data sources for enhanced predictive accuracy.
Volume: 14
Issue: 4
Page: 3182-3191
Publish at: 2025-08-01

Synthesizing strategies and innovations in combating land degradation: a global perspective on sustainability and resilience

10.11591/ijai.v14.i4.pp3133-3142
Gangamma Hediyalad , Ashoka Kukkuvada , Govardhan Hegde Kota
This paper presents a comprehensive examination of land degradation, a critical environmental challenge with far-reaching implications for agricultural productivity, ecosystem sustainability, and socio-economic stability worldwide. With the backdrop of escalating human population pressures and the exacerbating impact of climate change. It delves into the causes and consequences of soil erosion, desertification, salinization, and biodiversity loss, highlighting the interplay between natural processes and anthropogenic activities. Through a detailed review of literature spanning various remediation technologies, conservation practices, and policy frameworks, the paper critically assesses the effectiveness of current land management approaches, including the utilization of biosurfactants, remote sensing technologies, and agroforestry systems. Furthermore, it identifies significant research gaps and future directions, emphasizing the need for quantitative assessments, exploration of socio-economic impacts, and evaluation of restoration techniques. By offering evidence-based recommendations for policymakers and practitioners, this paper contributes to the global dialogue on sustainable land management and aims to catalyze action towards halting the advance of land degradation, ensuring food security, and preserving biodiversity for future generations. This work not only advances our understanding of land degradation challenges but also outlines a path forward for research, policy, and practice in the pursuit of environmental sustainability and resilience.
Volume: 14
Issue: 4
Page: 3133-3142
Publish at: 2025-08-01

Enhancing precision agriculture: a comprehensive investigation into pathogen detection and management

10.11591/ijai.v14.i4.pp3121-3132
Shaista Farhat , Chokka Anuradha
Agriculture is an important sector of Indian agronomy for human livelihood. All areas are affected by the effects of environmental toxic farms, which makes managing various difficult situations more challenging. Agriculture must adopt new technology in accordance with daily environmental changes if it is going to benefit from a crop from the perspectives of farmers and end users. Farmers will benefit from early detection of agricultural diseases rather than risking their lives in dangerous circumstances. Computer technology will be very helpful in maintaining sustainable and healthy crops for the objective of identifying crop diseases in addition to the farmer's close observation. Deep learning (DL) techniques are very influential among various computing technologies. In this work, we explore several current approaches to precision agriculture, such as artificial intelligence (AI), DL, and machine learning (ML). The findings of the study make clear modern methods, their drawbacks, and the knowledge lacking that needs to be addressed to explore precision agriculture fully.
Volume: 14
Issue: 4
Page: 3121-3132
Publish at: 2025-08-01

Investigation on low-performance tuned-regressor of inhibitory concentration targeting the SARS-CoV-2 polyprotein 1ab

10.11591/ijai.v14.i4.pp3003-3013
Daniel Febrian Sengkey , Angelina Stevany Regina Masengi , Alwin Melkie Sambul , Trina Ekawati Tallei , Sherwin Reinaldo Unsratdianto Sompie
Hyperparameter tuning is a key optimization strategy in machine learning (ML), often used with GridSearchCV to find optimal hyperparameter combinations. This study aimed to predict the half-maximal inhibitory concentration (IC50) of small molecules targeting the SARS-CoV-2 replicase polyprotein 1ab (pp1ab) by optimizing three ML algorithms: histogram gradient boosting regressor (HGBR), light gradient boosting regressor (LGBR), and random forest regressor (RFR). Bioactivity data, including duplicates, were processed using three approaches: untreated, aggregation of quantitative bioactivity, and duplicate removal. Molecular features were encoded using twelve types of molecular fingerprints. To optimize the models, hyperparameter tuning with GridSearchCV was applied across a broad parameter space. The results showed that the performance of the models was inconsistent, despite comprehensive hyperparameter tuning. Further analysis showed that the distribution of Murcko fragments was uneven between the training and testing datasets. Key fragments were underrepresented in the testing phase, leading to a mismatch in model predictions. The study demonstrates that hyperparameter tuning alone may not be sufficient to achieve high predictive performance when the distribution of molecular fragments is unbalanced between training and testing datasets. Ensuring fragment diversity across datasets is crucial for improving model reliability in drug discovery applications.
Volume: 14
Issue: 4
Page: 3003-3013
Publish at: 2025-08-01

Non-small cell lung cancer active compounds discovery holding on protein expression using machine learning models

10.11591/ijai.v14.i4.pp2815-2825
Hamza Hanafi , M’hamed Aït Kbir , Badr Dine Rossi Hassani
Computational methods have transformed the field of drug discovery, which significantly helped in the development of new treatments. Nowadays, researchers are exploring a wide ranger of opportunities to identify new compounds using machine learning. We conducted a comparative study between multiple models capable of predicting compounds to target non-small cell lung cancer, we focused on integrating protein expressions to identify potential compounds that exhibit a high efficacy in targeting lung cancer cells. A dataset was constructed based on the trials available in the ChEMBL database. Then, molecular descriptors were calculated to extract structure-activity relationships from the selected compounds and feed into several machine learning models to learn from. We compared the performance of various algorithms. The multilayer perceptron model exhibited the highest F1 score, achieving an outstanding value of 0,861. Moreover, we present a list of 10 drugs predicted as active in lung cancer, all of which are supported by relevant scientific evidence in the medical literature. Our study showcases the potential of combining protein expression analysis and machine learning techniques to identify novel drugs. Our analytical approach contributes to the drug discovery pipeline, and opens new opportunities to explore and identify new targeted therapies.
Volume: 14
Issue: 4
Page: 2815-2825
Publish at: 2025-08-01

Traffic flow prediction using long short-term memory-Komodo Mlipir algorithm: metaheuristic optimization to multi-target vehicle detection

10.11591/ijai.v14.i4.pp3343-3353
Imam Ahmad Ashari , Wahyul Amien Syafei , Adi Wibowo
Multi-target vehicle detection in urban traffic faces challenges such as poor lighting, small object sizes, and diverse vehicle types, impacting traffic flow prediction accuracy. This study introduces an optimized long short-term memory (LSTM) model using the Komodo Mlipir algorithm (KMA) to enhance prediction accuracy. Traffic video data are processed with YOLO for vehicle classification and object counting. The LSTM model, trained to capture traffic patterns, employs parameters optimized by KMA, including learning rate, neuron count, and epochs. KMA integrates mutation and crossover strategies to enable adaptive selection in global and local searches. The model's performance was evaluated on an urban traffic dataset with uniform configurations for population size and key LSTM parameters, ensuring consistent evaluation. Results showed LSTM-KMA achieved a root mean square error (RMSE) of 14.5319, outperforming LSTM (16.6827), LSTM-improved dung beetle optimization (IDBO) (15.0946), and LSTM-particle swarm optimization (PSO) (15.0368). Its mean absolute error (MAE), at 8.7041, also surpassed LSTM (9.9903), LSTM-IDBO (9.0328), and LSTM-PSO (9.0015). LSTM-KMA effectively tackles multi-target detection challenges, improving prediction accuracy and transportation system efficiency. This reliable solution supports real-time urban traffic management, addressing the demands of dynamic urban environments.
Volume: 14
Issue: 4
Page: 3343-3353
Publish at: 2025-08-01

Survey on 3D biometric traits for human identification

10.11591/ijai.v14.i4.pp3143-3152
Divya Gangachannaiah , Mamatha Aruvanalli Shivaraj , Honganur Chandrasekharaiah Nagaraj , Prasanna Gururaj Paga
Individuals are verified and identified using Biometric technology based on their biological or behavioral traits. Biometric-based personal authentication systems are more reliable and user friendly, overruns the traditional personal authentication systems. The physiological biometric traits get abraded due to aging and massive work, while the behavioral biometric traits are having high variations due to external factors such as fatigue, and mood. Among the physiological biometric traits, Finger geometry patterns are widely deployed authentication system reason being its stability, user acceptability and uniqueness. Recent trends in Biometrics attempt to incorporate 3D domain traits, 3D reconstruction is done using 2D multiple images. 3D images are usually more robust and illumination invariant as compared to their 2D counterparts. 3D reconstruction algorithms are compared by finding mean square error (MSE).
Volume: 14
Issue: 4
Page: 3143-3152
Publish at: 2025-08-01

An IoT-based approach for microclimate surveillance in greenhouse environments

10.11591/ijict.v14i2.pp717-727
Irfan Ardiansah , Sophia Dwiratna Nur Perwitasari , Roni Kastaman , Totok Pujianto
As the demand for efficient and cost-effective greenhouse microclimate surveillance has increased, we developed a microclimate surveillance system using microcontroller technology that automatically collects temperature and relative humidity data and transmits it to a cloud server for remote surveillance and data analysis. 1971 microclimate data points were acquired over a 20-day evaluation period, providing insights into greenhouse environmental conditions with a negative linear regression between air temperature and relative humidity within the greenhouse and an R-squared of 0.73. This illustrates the system’s ability to record and quantify environmental conditions data. Additionally, we derived a predictive model to manage microclimate conditions using the regression formula y = -6.12X + 238.33, where X represents the air temperature and y represents the relative humidity. All the results show that the acquired data can be used to make decisions to optimize plant growth. The prototype we developed can be an alternative option for small and medium-sized farms that need a greenhouse surveillance system to improve operational efficiency and reduce surveillance costs. The system can be further developed by implementing additional sensors to monitor light intensity, wind speed, or soil moisture and further data analysis models to optimize greenhouse management.
Volume: 14
Issue: 2
Page: 717-727
Publish at: 2025-08-01

Federated deep learning intrusion detection system on software defined-network based internet of things

10.11591/ijai.v14.i4.pp3109-3120
Heba Dhirar , Ali H. Hamad
The internet of things (IoT) and software-defined networks (SDN) play a significant role in enhancing efficiency and productivity. However, they encounter possible risks. Artificial intelligence (AI) has recently been employed in intrusion detection systems (IDSs), serving as an important instrument for improving security. Nevertheless, the necessity to store data on a centralized server poses a potential threat. Federated learning (FL) addresses this problem by training models locally. In this work, a network intrusion detection system (NIDS) is implemented on multi-controller SDN-based IoT networks. The interplanetary file system (IPFS) FL has been employed to share and train deep learning (DL) models. Several clients participated in the training process using custom generated dataset IoT-SDN by training the model locally and sharing the parameters in an encrypted format, improving the overall effectiveness, safety, and security of the network. The model has successfully identified several types of attacks, including distributed denial of service (DDoS), denial of service (DoS), botnet, brute force, exploitation, malware, probe, web-based, spoofing, recon, and achieving an accuracy of 99.89% and a loss of 0.005.
Volume: 14
Issue: 4
Page: 3109-3120
Publish at: 2025-08-01

Lightweight mutual authentication protocol for resource-constrained radio frequency identification tags with PRINCE cipher

10.11591/ijai.v14.i4.pp3435-3443
Mahendra Shridhar Naik , Desai Karanam Sreekantha , Kanduri V S S S S Sairam , Chaitra Soppinahally Nataraju
Radio frequency identification (RFID) is a key technology for the internet of things (IoT), with widespread applications in the commercial, healthcare, enterprise, and community sectors. However, privacy and security concerns remain with RFID systems. This manuscript presents a novel RFID-based mutual authentication protocol (MAP) using the PRINCE cipher to address these concerns. The proposed MAP leverages a PRINCE cipher architecture capable of both encryption and decryption based on a mode signal. It performs five encryption and two decryption processes during tag and reader mutual authentication, with updated seed values ensuring synchronization and secure data communication. The PRINCE cipher implementation utilizes less than 1% of slices, operates at 226 MHz with a latency of 3.5 clock cycles (CC), and has a throughput of 4.125 Gbps. The complete RFID-based MAP consumes 721 mW of power, occupies 2% of the chip area, and achieves a latency of 35.5 CC and a throughput of 262 Mbps. This represents a 25% reduction in latency, a 40% increase in throughput, and a 30% decrease in execution time compared to existing MAP approaches. The findings demonstrate the potential of the proposed MAP to enhance latency, throughput, and execution time, offering a promising solution for secure and efficient RFID authentication.
Volume: 14
Issue: 4
Page: 3435-3443
Publish at: 2025-08-01
Show 49 of 1897

Discover Our Library

Embark on a journey through our expansive collection of articles and let curiosity lead your path to innovation.

Explore Now
Library 3D Ilustration