Articles

Access the latest knowledge in applied science, electrical engineering, computer science and information technology, education, and health.

Filter Icon

Filters article

Years

FAQ Arrow
0
0

Source Title

FAQ Arrow

Authors

FAQ Arrow

30,185 Article Results

Explainable social media disaster image classification using a lightweight attention-based deep learning approach

10.11591/ijai.v15.i2.pp1464-1472
Rashmi Kangokar Taranath , Geeta Chidanandappa Mara
In recent years, the rapid dissemination of social media content during natural and man-made disasters has created a need for automated and accurate disaster image classification systems. This paper proposes lightweight explainable attention-based disaster network (LEAD-Net), a deep learning (DL) model designed for classifying disaster-related images with high accuracy and interpretability. The system integrates an EfficientNet-B0 backbone enhanced with squeeze-and-excitation (SE) attention modules and a lightweight neural architecture search (NAS-lite) strategy for tuning the classifier head and training hyperparameters. The model was evaluated on two benchmark datasets comprehensive disaster dataset (CDD) and damage multimodal dataset (DMD) achieving 96% and 87% accuracy, respectively, outperforming several established convolutional neural network (CNN) baselines. To ensure transparency, gradient-weighted class activation mapping (Grad-CAM) was employed to generate visual explanations of the model’s decisions, confirming its focus on semantically relevant image regions.
Volume: 15
Issue: 2
Page: 1464-1472
Publish at: 2026-04-01

The effects of data imbalance on fraud detection model accuracy

10.11591/ijai.v15.i2.pp1402-1408
Rusma Anieza Ruslan , Nureize Arbaiy , Pei-Chun Lin
Machine learning (ML) model performance is often assessed by accuracy, but the quality and balance of data also play crucial roles. Imbalanced datasets, where the minority class has fewer samples than the majority class, can lead to biased predictions favoring the majority class. This study addresses the issue of class imbalance through resampling techniques, including random undersampling (RUS) and random oversampling (ROS), specifically applied to a fraud detection dataset. We classify the resampled datasets using random forest (RF) and gradient boosting (GB) models. Our findings indicate that the RF model, when combined with ROS, achieves an accuracy of 97.4%, surpassing the 96.1% accuracy of the GB model with RUS. This approach demonstrates the importance of addressing class imbalance to improve prediction accuracy in ML.
Volume: 15
Issue: 2
Page: 1402-1408
Publish at: 2026-04-01

Drone-assisted deep learning weed detection for sustainable agriculture and environmental resilience

10.11591/ijai.v15.i2.pp1428-1440
Agustan Latif , Handaru Jati , Herman Dwi Surjono , Mani Yusuf
Effective weed detection plays a crucial role in sustainable agriculture, boosting crop productivity and supporting environmental conservation. This study compares three deep learning models—YOLOv5, YOLO-NAS, and mask region-based convolutional neural network (Mask R-CNN)-against traditional methods in terms of accuracy, processing speed, and adaptability in tropical agricultural conditions, with Merauke, Indonesia, as the case study. The results show that YOLO-NAS delivers the highest accuracy at 96% with a processing time of 25 ms per image, making it suitable for high precision applications. YOLOv5 balances strong accuracy (94%) with faster processing at 12 ms per image, establishing it as the most effective for real time scenarios. Mask R-CNN also achieves 94% accuracy and provides advanced segmentation capabilities, but its slower processing speed of 31 ms limits large-scale implementation. Traditional methods perform poorly in comparison, with only 85% accuracy and processing time above 50 ms per image. These findings highlight the transformative potential of artificial intelligence (AI)-based weed detection for precision agriculture, particularly in tropical regions like Merauke. Adoption of models such as YOLOv5 reduces manual labor dependence while advancing efficient, eco-friendly weed management. Future research should expand datasets and explore newer models like YOLOv8, YOLO-NAS, vision transformers (ViTs), and hybrid approaches.
Volume: 15
Issue: 2
Page: 1428-1440
Publish at: 2026-04-01

NN-SVM: a hybrid neural network–support vector machine framework for accurate pneumonia detection from chest X-rays

10.11591/ijai.v15.i2.pp1349-1361
Santosh Kumar Jankatti , Raghavendra Srinivasaiah , Mohammad Shahina Parveen , Harish H. Kenchannavar , Danthuluri Sudha , Srihari Sharma Karigiri Narah , Mahadev Shivaraj
We present neural network (NN)–support vector machine (SVM), hybrid NN-SVM framework for three-class pneumonia detection (normal, bacterial, and viral) from chest X-rays (CXRs). Pretrained NN backbone is fine-tuned for radiographic textures; global average pooling (GAP) yields embeddings that feed calibrated radial basis function (RBF)-SVM. Standardized preprocessing (resize, normalization) and class-aware augmentation are applied. We report accuracy, precision, recall, F1-score, area under the curve (AUC), confusion matrices, and per-class receiver operating characteristic (ROC). Statistical significance is assessed via DeLong (AUC), McNemar (accuracy), and paired bootstrap (F1-score). Gradient-weighted class activation mapping (grad-CAM) supports interpretability; external validation and domain adaptation (batch normalization re-estimation and temperature scaling) assess robustness. NN-SVM attains 97.46% accuracy with strong macro-F1 and AUC. Compared with SoftMax head, SVM improves margin separation and calibration. We present NN-SVM, hybrid deep learning approach that combines transfer-learned convolutional neural networks (CNNs) with SVM classifier to automatically diagnose pneumonia from CXRs into three clinically relevant categories: viral pneumonia, bacterial pneumonia, and normal. We use pre-trained CNN to extract robust image embeddings after standardized preprocessing (resizing and normalization) and train RBF-kernel SVM on resulting features. Performance is evaluated with accuracy, precision, recall, F1-score, and confusion matrices. On labeled CXR dataset, NN-SVM achieves 97.46% accuracy, demonstrating strong diagnostic capability that can reduce radiologist burden and support timely clinical decision-making.
Volume: 15
Issue: 2
Page: 1349-1361
Publish at: 2026-04-01

An intelligent and explainable IoT-Edge-Cloud architecture for real-time water quality monitoring

10.11591/ijai.v15.i2.pp1109-1120
Sara Bouziane , Badraddine Aghoutane , Aniss Moumen , Anas El Ouali , Ali Essahlaoui , Abdellah El Hmaidi
Continuous and reliable monitoring of water quality is critical for early detection of environmental deterioration, yet conventional monitoring approaches are often slow and lack timely data availability. This study proposes an intelligent and explainable internet of things (IoT)–Edge–Cloud architecture to monitor water quality in real time, using IoT sensing, edge based artificial intelligence (Edge AI), cloud-stream processing, and explainable artificial intelligence (XAI). The system calculates the water quality index (WQI) directly at the edge and predicts its evolution using a stacking ensemble model trained on physicochemical measurements taken from the Moulouya River Basin in Morocco. An explainability module based on Shapley additive explanations (SHAP) values gives a clearer image of the contribution of various parameters to WQI predictions, providing transparency of the features, which builds trust in the model’s output. The proposed architecture was implemented as an end-to-end prototype and validated using a simulation-based experimental that mimicked realistic sensor dynamics and connectivity interruptions. The experimental results show strong predictive performance (R² =0.945), stable system operations, and reliable interpretability highlighting the potential of the proposed approach for scalable, intelligent, and transparent environmental monitoring.
Volume: 15
Issue: 2
Page: 1109-1120
Publish at: 2026-04-01

Deep learning ensembles for lung cancer detection in thoracic CT scans leveraging generative adversarial network technology

10.11591/ijai.v15.i2.pp1605-1612
Bineesh Moozhippurath , Jayapandian Natarajan
Effective treatment of lung cancer depends on early and accurate detection, which continues to be a major cause of cancer-related fatalities globally. Conventional diagnostic techniques are useful, but their efficacy in handling large amounts of thoracic computed tomography (CT) scan data is limited by their time-consuming nature and susceptibility to human error. The research here suggests a new deep learning model that integrates generative adversarial networks (GANs) for data improvement with a sophisticated ensemble approach to classification. GANs are employed to generate realistic synthetic CT images, addressing the challenges of limited datasets. The backbone of the proposed approach is a consensus-guided adaptive blending (CGAB) ensemble model that learns to dynamically combine the predictions of three top-performing convolutional neural networks (CNNs): ResNet-152, DenseNet-169, and EfficientNet-B7. The CGAB model improves prediction accuracy through model contribution weighting based on confidence scores and inter-model consensus, while a conflict-resolving auxiliary decision model is used. The approach was tested using the lung image database consortium and the image database resource initiative (LIDC-IDRI) dataset with a detection rate of 97.35%, surpassing single model and traditional ensemble methods. The current work provides a solid and scalable approach to lung cancer detection with better generalization, increased diagnostic consistency, and applicability for clinical use.
Volume: 15
Issue: 2
Page: 1605-1612
Publish at: 2026-04-01

Image feature extraction for road surface damage classification

10.11591/ijai.v15.i2.pp1578-1592
Octaviani Hutapea , Sarifuddin Madenda , Hustinawaty Hustinawaty , Iffatul Mardhiyah
Road surface deterioration poses a critical risk to driving safety and comfort, necessitating timely and accurate detection to support effective maintenance. Manual inspection methods are often inefficient, underscoring the need for automated approaches based on computer vision. This study investigates the integration of feature extraction techniques histogram of oriented gradients (HOG) and local binary pattern (LBP) with convolutional neural network (CNN) architectures ResNet50 and InceptionV3 for the classification of road damage. A dataset of 1,580 images was categorized into five damage types: alligator crack, longitudinal crack, other crack, patching, and potholes. Experimental results indicate that HOG–ResNet50 achieved 79% accuracy, while LBP–InceptionV3 yielded the best performance at 97%. The contributions of this study are threefold: i) an automated framework is proposed that combines texture-based features with deep learning for road damage detection, ii) the LBP–InceptionV3 combination is shown to provide superior accuracy compared to conventional pairings, and iii) the approach offers a scalable and reliable alternative to manual inspection methods, supporting more efficient road maintenance planning.
Volume: 15
Issue: 2
Page: 1578-1592
Publish at: 2026-04-01

A hybrid model for enhanced aspect-based sentiment analysis using large language models

10.11591/ijai.v15.i2.pp1825-1838
Mohammed Ziaulla , Arun Biradar
Aspect-based sentiment analysis (ABSA) is a crucial task within natural language processing (NLP), enabling fine-grained opinion mining by identifying sentiments associated with specific aspects of a product or service. While transformer-based models like bidirectional encoder representations from transformers (BERT) have improved sentiment classification, they still struggle with limited contextual adaptability, especially in customer reviews containing complex expressions. Most existing approaches rely heavily on benchmark datasets such as semantic evaluation (SemEval) and multi-aspect multi-sentiment (MAMS), which do not fully capture the diversity of real-world review scenarios. Hence, this research addresses these limitations by proposing a novel hybrid model, called as hybrid-BERT (H-BERT), that integrates span-aware BERT (SpanBERT) with bidirectional long short-term memory (BiLSTM), conditional random field (CRF), and large language models (LLMs). The objective is to enhance aspect extraction and sentiment classification performance using both annotated and synthetic data. The methodology includes preprocessing, hybrid model training, and evaluation using the SemEval 2014 dataset. Experimental results show that H-BERT achieved 90.58% accuracy and 90.56% F-score in the laptop domain and 91.21% accuracy with a 92.03% F-score in the restaurant domain. These results outperform existing models, confirming H-BERT’s robustness and effectiveness. In conclusion, H-BERT improves sentiment understanding in customer reviews.
Volume: 15
Issue: 2
Page: 1825-1838
Publish at: 2026-04-01

Accurate stroke area classification using extreme gradient boosting with multi-feature extraction

10.11591/ijai.v15.i2.pp1390-1401
Kavikondala Praveen Kumar Rao , Maha Lakshmi Bondla , Bommaraju Srinivasa Rao , Ambidi Naveena , K. V. Balaramakrishna , Srinivasarao Goda
Stroke, one of the most common neurological disorders leading to long-term disability and mortality, requires accurate detection of affected brain regions for timely treatment planning. However, conventional deep learning models face challenges in achieving precise segmentation and robust classification due to noisy inputs, weak feature representation, and poor generalization. To address these gaps, this study introduces a hybrid framework that integrates the ConvNeXt architecture for stroke region segmentation with XGBoost based classification, strengthened through three complementary feature extraction methods: local binary patterns (LBP), adaptive threshold directional binary gradient matrix (AT-DBGM), and wavelet packet transform (WPT). These methods capture textural, directional, and multi resolution features, which are concatenated into a stacked vector and classified using XGBoost. Preprocessing steps, including normalization and resizing, ensure improved input consistency. Experimental evaluations on benchmark stroke imaging datasets show that the proposed framework achieves 98.56% Dice similarity coefficient (DSC), 12.96 mm Hausdorff distance (HD), 99.12% accuracy, 98.69% sensitivity, 99.06% specificity, 98.98% precision, and 98.85% F1-score.
Volume: 15
Issue: 2
Page: 1390-1401
Publish at: 2026-04-01

Deep learning for early detection of cardiovascular diseases via auscultation sound classification

10.11591/ijai.v15.i2.pp1746-1761
Shreyas Kasture , Sudhanshu Maurya , Amit Kumar Sharma , Santhosh Chitraju Gopal Varma , Kashish Mirza , Firdous Sadaf Mohammad Ismail
Heart diseases are one of the most prominent causes of death globally, which requires immediate and accurate diagnosis. The auscultation methods used in conventional medical practice, where the doctor listens to the sounds produced by the body without intervention is very ineffective because of the limitations in the actual skills and perception of the doctor. The main goal of this project will be designing a mobile-based system for the early detection of cardiovascular disease (CVD) by utilizing deep learning for auscultation sound classification. The approach involves the use of deep learning structures to classify cardiac sounds into normal and abnormal patterns on its own. Wavelet transformations, time-frequency representations, and Mel frequency cepstral coefficients (MFCC) have been used in feature extraction. The ResNet152V2 model showed high classification performance with area under the receiver operating characteristic curve (AUROC) of 0.9797 and 0.9636 on two datasets. Contrary to that, data augmentation, hyperparameter optimization, attention mechanisms, as well as input-output residual connections, led to better functionality and interpretability. This research seeks to overcome the limitations of traditional stethoscope use through the incorporation of sophisticated algorithms and the availability of mobile technology that could result in early diagnosis and prevention of CVDs, especially in underprivileged areas.
Volume: 15
Issue: 2
Page: 1746-1761
Publish at: 2026-04-01

An efficient approach for cyber-attack detection by using machine learning and deep learning algorithms

10.11591/ijai.v15.i2.pp1219-1235
Yasir Hussein Shakir , Mahmoud Mohamed Abdelhamied , Eshaq Aziz Awadh AL Mandhari , Ali Alkhazraji , Naglaa M. Reda
Cybercrime gained traction in the late 20th century. The capabilities of cyber-attackers have improved dramatically. One of the biggest challenges facing cybersecurity developers is safeguarding consumers' security and privacy. Interest in using AI approaches in cybersecurity has grown significantly because of the incredible proficiency these techniques have demonstrated across all domains. Even while machine learning algorithms are very effective at identifying malicious activity, there are still certain issues that lower performance accuracy. This paper has the novelty of deploying the Artificial Bee Colony (ABC) meta-heuristic algorithm with the K-Nearest Neighbors (KNN) classifier to detect cyber-attacks. It proposes a variant approach called KNN+Bee that detects attacks efficiently, achieving 99.86% overall accuracy. The NSL-KDD dataset of cyberattacks has been leveraged in the training and testing phases. The proposed approach has been contrasted with the most popular machine learning. According to experimental findings, the suggested model delves deeper into the identification of cyberattacks. It achieves unprecedented performance, outperforming other models in terms of precision, Recall, F-score and MCC. Furthermore, popular deep learning models have been implemented and examined on the same dataset. Results prove that GRU is the most accurate, reaching 99.71%.
Volume: 15
Issue: 2
Page: 1219-1235
Publish at: 2026-04-01

Adaptive control of ball and beam system using SNA-PID combined with recurrent fuzzy neural network identifier

10.11591/ijai.v15.i2.pp1202-1210
Minh-Thanh Le , Chi-Ngon Nguyen
The ball and beam system is a nonlinear and inherently unstable single input, multiple-output (SIMO) system, which poses significant challenges for control design. Intelligent control algorithms are often applied to autonomously control complex systems when there are changes in parameters or the control environment. Therefore, in this paper, we research and develop two methods: proportional integral derivative (PID) and single neuron adaptive (SNA)-PID-recurrent fuzzy neural network identifier (RFNNI) to control the ball and beam system. Simulation results on MATLAB/Simulink show that the SNA-PID-RFNNI controller provides a more stable output signal than the traditional PID controller, with minimal overshoot and a settling time of about 15 seconds. Next, we will conduct real-time experiments on the object using the proposed algorithm through the MEGA2560 control board with an ultrasonic positioning mechanism.
Volume: 15
Issue: 2
Page: 1202-1210
Publish at: 2026-04-01

Automated bacteria and fungi classification using convolutional neural network on embedded system

10.11591/ijai.v15.i2.pp1132-1142
Tarik Bouganssa , Maryem Ait Moulay , Samar Aarabi , Abedelali Lasfar , Abdelatif EL Afia
In this study, we created and applied novel concepts for hardware-based image identification and categorization. For artificial intelligence (AI) and image recognition applications, this includes putting algorithms for recognizing colors, textures, and shapes into practice. Our contribution uses an embedded device with a camera and a microcomputer (Raspberry-Pi4 type) to replace the optical assessment of Petri dishes. Our object recognition system processes images efficiently by using a state-of-the-art kernel function and a new neighborhood architecture. Using the well-known convolutional neural network (CNN) architecture, YOLOv8, as a pre-trained model, we evaluated the proposed CNN-based method for object recognition in a number of demanding scenarios. Several Petri plates, uncontrolled settings, and different backgrounds and illumination were used to evaluate the technology. Our dynamic mode integrates a CNN network with an attention mask to highlight the traits of bacteria and fungi, ensuring robust recognition. We implemented our algorithm on a Raspberry Pi 400, connected to a CMOS 3.0 camera sensor and a human-machine interface (HMI) for instant display of results.
Volume: 15
Issue: 2
Page: 1132-1142
Publish at: 2026-04-01

Identification of chemical markers for species differentiation in Aquilaria essential oils using self-organizing maps

10.11591/ijai.v15.i2.pp1339-1348
Nur Athirah Syafiqah Noramli , Muhammad Ikhsan Roslan , Noor Aida Syakira Ahmad Sabri , Nurlaila Ismail , Zakiah Mohd Yusoff , Mohd Nasir Taib
This study analyzes the chemical diversity of essential oils from four Aquilaria species, A. beccariana, A. malaccensis, A. crassna, and A. subintegra, which are important sources of agarwood used in perfumery and traditional medicine. Despite their economic and ecological value, the chemical profiles of these species remain insufficiently characterized, hindering accurate species differentiation and resource management. This research aims to identify distinctive chemical patterns to improve species classification. Self-organizing maps (SOMs) were employed to analyze complex chemical composition data and to identify significant compounds responsible for species separation. The analysis revealed several compounds with strong discriminatory power and species-specific distribution patterns, with compounds C, D, and E identified as the most significant markers. These findings demonstrate substantial biochemical diversity among Aquilaria species and confirm the effectiveness of SOM for essential oil profiling. The results support improved species identification and have important implications for ecological conservation, sustainable agarwood management, and pharmacological development.
Volume: 15
Issue: 2
Page: 1339-1348
Publish at: 2026-04-01

Blockchain-enabled framework using diversity mutation with siberian tiger optimization for offloading in fog computing

10.11591/ijai.v15.i2.pp1371-1380
Srikanta Murthy Rajini , Reginald Shilpa
Fog computing has developed as a promising framework to support latency sensitive internet of things (IoT) applications for mobile devices operating in dynamic environments. During the offloading process, malicious activities interrupt the existing methods, which increases the execution time. Therefore, this research proposes a diversity mutation with siberian tiger optimization (DM-STO) for computation offloading in blockchain based fog computing. The blockchain is used to secure offload and attain quality of service (QoS) mobile users with less energy consumption and execution time. The DM-STO can balance workloads among local devices and fog servers. The diversity mutation operation improves the exploration ability to dynamic network conditions, leading to efficient computational offloading in fog computing. The execution time, service cost and energy consumption are evaluated to calculate the performance of the proposed DM-STO with varying numbers of IoT requests such as 50, 100, 200, and 300. For 50 IoT requests with a fixed fog server of 10, the DM-STO achieves an execution time of 18 s, a service cost of 10$ and energy consumption of 5 mJ compared to the BAT algorithm.
Volume: 15
Issue: 2
Page: 1371-1380
Publish at: 2026-04-01
Show 31 of 2013

Discover Our Library

Embark on a journey through our expansive collection of articles and let curiosity lead your path to innovation.

Explore Now
Library 3D Ilustration