Articles

Access the latest knowledge in applied science, electrical engineering, computer science and information technology, education, and health.

Filter Icon

Filters article

Years

FAQ Arrow
0
0

Source Title

FAQ Arrow

Authors

FAQ Arrow

30,096 Article Results

Energy-efficient virtual machine allocation using directional and boundary-aware bobcat optimization

10.11591/ijai.v15.i2.pp1286-1299
Nida Kousar Gouse , Gopala Krishnan Chandrasekaran
Cloud computing (CC) has gained significant traction due to its ability to deliver services in a scalable and adaptable manner, catering to diverse user requirements. However, in virtualization technology, one of the primary challenges is managing the energy consumption required to maintain service quality, as it directly impacts the operational expenses of data centers. To address this challenge, this research proposes a directional movement and boundary-aware strategy-based bobcat optimization algorithm (DMBABOA) for energy-efficient virtual machine (VM) allocation aimed at minimizing energy consumption in cloud environments. The directional search and boundary-aware correction enhance convergence and ensure feasible resource distribution. This ensures effective utilization of resources, improved virtualization management, and substantial energy savings. The experimental findings establish that the proposed DMBABOA optimizer reaches a minimum execution time of 134.48 s when the number of VMs is equal to 1,200 with 200 users, compared to existing methods such as the metaheuristic VM allocation approach to power efficiency of sustainable cloud environment (MV-PESC).
Volume: 15
Issue: 2
Page: 1286-1299
Publish at: 2026-04-01

Unimodal and multimodal techniques for depression diagnosis: a comprehensive survey

10.11591/ijai.v15.i2.pp1947-1954
Swathy Jayasree , Yashawini Sridhar
Depression is a common and major mental health condition that affects individuals across all age groups and any backgrounds, severely reducing their physical, emotional, and cognitive functioning. It goes beyond typical mood swings and requires a timely and accurate diagnosis to prevent severe consequences such as suicidal tendencies, self-harm, and long-term mental decline. The improving performance of deep learning and machine learning techniques has significantly enhanced the speed and accuracy of depression diagnosis using both unimodal and multimodal features. This comprehensive study gives a complete overview of the unimodal and multimodal methods used to diagnose depression in its early stages. Additionally, this survey summarizes the dataset, methods, and limitations of previous work presented in the domain of depression diagnosis and serves as a suitable reference for future analysis.
Volume: 15
Issue: 2
Page: 1947-1954
Publish at: 2026-04-01

Breast cancer detection using residual DenseNets in deep learning

10.11591/ijai.v15.i2.pp1632-1645
Naganandini Gururajarao , Vishwanath R. Hulipalled
Breast cancer, the leading cause of cancer-related deaths among women globally, requires a prompt and precise diagnosis in order to increase survival rates via therapy. There is a possibility of bias and inconsistency in the results of traditional diagnostic procedures like mammography, ultrasound, and histological testing since they rely on the expertise of radiologists and pathologists. There are exciting new opportunities for breast cancer diagnostics to be enhanced by artificial intelligence (AI) and deep learning. The purpose of this research is to examine the feasibility of using convolutional neural networks (CNNs) and residual dense networks (ResDenseNets) used for breast cancer automated detection in medical images. Because of their superior capacity to learn hierarchical features from raw image data, CNNs are ideal for medical image interpretation. By including residual connections, which allow for the training of considerably deeper models, ResDenseNets—an extension of CNNs—mitigate the problem of vanishing gradient in deep networks. ResDenseNet and CNNs considerably enhance the accuracy of breast cancer diagnosis in comparison to conventional approaches, according to the findings. Notably, ResDenseNets outperform other types of networks because they are able to learn intricate and nuanced properties directly from the data.
Volume: 15
Issue: 2
Page: 1632-1645
Publish at: 2026-04-01

Hybrid kernel support vector machine with cuckoo search optimization for malaria detection from blood smear images

10.11591/ijai.v15.i2.pp1316-1326
Sri Huning Anwariningsih , Indrarini Dyah Irawati
Microscopic image-based malaria detection still struggles to capture complex features due to variations in lighting and color. The support vector machine (SVM) method is often used in medical image detection, but its performance depends heavily on the selection of optimal kernel and hyperparameters (C and gamma). Conventional approaches, with single kernels and manual tuning, have limitations in capturing both spatial information and color distribution simultaneously. Therefore, this research proposes hybrid kernel support vector machine-cuckoo search algorithm (HKSVM-CSA) method that combines the radial basis function (RBF) kernel and histogram intersection for SVM, along with hyperparameter optimization using the CSA. The dataset used is malaria cell images, which contains parasitized and uninfected images of blood cells. The proposed method comprises five main steps: dataset preparation, feature extraction, HKSVM, hyperparameter optimization, and model evaluation. Experiments demonstrate that the proposed model achieves 94% accuracy, 93% sensitivity, 94% specificity, and area under the curve (AUC) of 0.98, which is significantly better than standard SVM, SVM-genetic algorithm (GA), and k-nearest neighbors (KNN). These results show that combining kernel and CSA significantly improves detection accuracy. This approach is promising for image-based automatic systems for infectious disease diagnosis.
Volume: 15
Issue: 2
Page: 1316-1326
Publish at: 2026-04-01

Analysis of tuberculosis detection using deep learning technique and explainable artificial intelligence

10.11591/ijai.v15.i2.pp1623-1631
Shashikiran Srinivas , Kavita Avinash Patil , Kushalatha Monappa Rama , Sudha Venkateshlu , Jayanthi Muthuswamy , Srinivas Babu Narayanappa
Tuberculosis (TB) affects the health of many individuals and is still a prime worldwide health concern despite having so many advanced treatments, as it still lacks technical advancement in its treatment and diagnosis. Accuracy in identification and early detection is essential to reduce the spread and improve treatment outcomes. Traditional methods of diagnosis, such as sputum microscopy and culture, are labor-dependent and subject to human mistakes as it is done by lab technicians. Recent improvements in deep learning have demonstrated significant potential for enhancing and automating diagnostic accuracy. Our research proposes a deep learning based technique that detects TB from chest X-rays after image processing techniques like augmentation. After training on big data, our model pulls off an astonishing accuracy of 97.42% and a loss of 7.17%, outperforming traditional methods. The model uses convolutional neural network (CNN) as a base and transfer learning method, like DenseNet-121, and explainable artificial intelligence (XAI) technique, like Grad-CAM, to recognize TB related patterns effectively and with low false positives. This approach has the ability to revolutionize the diagnosis of TB and offer more dependable, scalable, and timely solutions to healthcare systems worldwide.
Volume: 15
Issue: 2
Page: 1623-1631
Publish at: 2026-04-01

Efficient text detection and recognition in natural scene images using novel blended ensemble deep learning

10.11591/ijai.v15.i2.pp1664-1679
Rajeswari Reddy Patil , Aradhana Dammergidda
Text detection and recognition in natural scene images is a critical task in computer vision, with applications ranging from document analysis to autonomous navigation. This work presents a robust and efficient pipeline that integrates YOLOv8 for text detection and EasyOCR for recognition, enhanced by an adaptive preprocessing mechanism between the two stages. The YOLOv8 model is trained on a custom dataset with polygonal annotations converted into YOLO format ensures precise bounding box formations around the text regions. An adaptive preprocessing module dynamically optimizes the detected regions adjusting resolution, noise reduction, and orientation before passing them to EasyOCR, significantly improving robustness. The lightweight yet powerful EasyOCR engine then recognizes text across diverse fonts, styles, and orientations. Evaluated on the benchmark Total-Text dataset, the proposed method demonstrates superior performance in detection accuracy, recognition precision, and computational efficiency. Additionally, this work provides a detailed analysis of training metrics, to validate the model’s robustness. The proposed system is scalable and can be integrated into real-time applications such as license plate recognition, document digitization, and assistive technologies for the visually impaired.
Volume: 15
Issue: 2
Page: 1664-1679
Publish at: 2026-04-01

Session click sentiment behavior aware personalized recommendations system

10.11591/ijai.v15.i2.pp1539-1547
Suraj Bevinahalli Suresh , Padma Muthalambikashettahally Cheluvegowda
Session-based recommendations use short-term behavior of users to provide personalized suggestions to consumers in ecommerce platform. However, cold start, considering newly joined users and sparsity issues, where not enough short-term behavior is available, and the performance of traditional session-based recommendations is significantly impacted. Deep learning (DL) like recurrent neural networks (RNNs), long short-term memory networks (LSTMs), and graph neural networks (GNNs) have been employed to capture session-clicks and enhance product recommendation accuracy. However, the current method is significantly affected due to the gradient descent problem in meeting convergence for top-K product recommendation. Further, the current method failed to capture product sentiment for session-clicks between inter-session and intra-session clicks. In addressing the research problems, the current research work introduced a session click sentiment behavior aware (SCSBA) personalized recommendation system using novel inter and intra session (IIS)-LSTM model. Finally, the objective function to recommend top K items to users is done using optimized Bayesian personalized ranking (OBPR) algorithm. Experiment outcome shows the SCSBA model achieves much better performance than state of art model, considering standard Tmall dataset.
Volume: 15
Issue: 2
Page: 1539-1547
Publish at: 2026-04-01

Development of rough set based machine learning approach to screen breast cancer

10.11591/ijai.v15.i2.pp1982-1998
Sangeetha Sivakumar , Shakeela Sathish , Debabrata Datta
One of the major causes of death for women is breast cancer. A substantial number of women diagnosed with breast cancer die due to inaccuracies in diagnosis and delays in treatment. Cancer prediction must be accurate in order to improve treatment quality and patient survival rates. This study evaluates logistic regression (LR), decision tree algorithm (DTA), and adaptive boosting (AdaBoost) (AB ensemble learning algorithm) in conjunction with rough set theory (RST) to enhance breast cancer classification using the Wisconsin diagnosis breast cancer dataset (WDBC). By employing rough set approximations, including the upper and lower bounds of features, this study introduces a novel rough AdaBoost (Rough AB) algorithm to improve classification accuracy. Various performance indices are compared across algorithms. The proposed Rough AB algorithm demonstrated superior performance, particularly in prediction accuracy for both benign and malignant cases. It incorporates roughness to determine the starting node of the decision stump, offering a significant improvement in ensemble learning techniques for medical diagnostics. It gives practical implications for clinical decision-making, potentially enabling more reliable and timely breast cancer diagnoses, which can significantly impact patient outcomes. The proposed method leverages rough set approximations to refine feature selection and improve prediction accuracy. Also, it positions RST as an explainable artificial intelligence (XAI) technique, highlighting its interpretability, ethical transparency, and potential integration with deep learning for clinical deployment.
Volume: 15
Issue: 2
Page: 1982-1998
Publish at: 2026-04-01

Semantic-syntactic graph network for aspect-based sentiment analysis

10.11591/ijai.v15.i2.pp1814-1824
Rekha Bdurga Harish , Neelambike Siddalingaiah
Aspect-based sentiment analysis (ABSA) is a fine-grained sentiment analysis task that identifies sentiment polarities toward specific aspects within a sentence. While conventional models have achieved progress, they often neglect to jointly consider both semantic context and syntactic structure, limiting performance in complex linguistic scenarios. Nevertheless, most existing graph convolutional network (GCN)-based approaches have recently focused on either semantic or syntactic information individually, leading to suboptimal sentiment classification accuracy. Hence, this work aims to design an effective ABSA model that simultaneously captures both semantic relationships and syntactic dependencies for enhanced aspect-level sentiment analysis. For solving issues of GCN-based approaches, this work proposed a model called sentiment semantic syntactic network (SentSemSynNet), which constructs a unified graph by integrating semantic and syntactic features and applies graph neural networks to learn rich, aspect-specific representations. The model was evaluated on the SemEval2014 restaurant and laptop datasets. It achieved 88.25% accuracy and 82.95% macro-F-score for restaurant, and 84.52% accuracy and 80.26% macro-F-score for laptop. The model’s unique integration of both semantic and syntactic importance through a unified graph structure improved sentiment detection accuracy.
Volume: 15
Issue: 2
Page: 1814-1824
Publish at: 2026-04-01

Correlation-based assessment of 4G LTE network performance during rainfall events in tropical regions

10.11591/ijeecs.v42.i1.pp105-114
Ngozi C. Eli-Chukwu , Uma Uzubi Uma , Handel Emezue , Ogechi Akudo Nwogu , Ogah E. Oga , Calister N. Ogbonna-Mba , Samuel I. Ezichi
This paper presents a performance evaluation of a fourth-generation (4G) cel lular network under adverse weather conditions in a tropical region. While the impact of rainfall on frequencies above 10 GHz is well documented, this study addresses the research gap concerning 4G LTE performance (sub-6 GHz) in high-precipitation environments such as Nigeria. Using a drive-test approach with TEMS Investigation software (v16.3), measurements were collected over 48 days between July and September 2025 along a fixed 15 km route in the Lagos metropolis on the MTN Nigeria network. Samples were recorded at 1 second intervals. Four critical key performance indicators (KPIs)—reference signal received power (RSRP), reference signal received quality (RSRQ), signal to-interference-plus-noise ratio (SINR), and received signal strength indicator (RSSI)—wereanalyzedtodeterminetheir influence on the network performance index (NPI). Correlation analysis revealed that while RSRP exhibits no sig nificant correlation with NPI during rainfall (rs = 0.009), SINR and RSRQ demonstrate strong positive correlations (rs = 0.828 and rs = 0.824, respec tively). Despite these high correlations, average performance values remained low (mean SINR = 23.72%), indicating significant rain-induced degradation. These findings provide a novel empirical basis for the development of weather aware adaptive algorithms in tropical 4G network deployments.
Volume: 42
Issue: 1
Page: 105-114
Publish at: 2026-04-01

Genetic algorithm for generalized time-window assignment problem

10.11591/ijai.v15.i2.pp1261-1274
Ali Kansou , Bilal Kanso , Houssein Wehbe , Haydar Bazzi , Ali Mcheik
This paper presents a hybrid genetic algorithm (GA) for the generalized time-window assignment problem (GTWAP), a complex artificial intelligence (AI) scheduling challenge that involves assigning agents to resources under strict temporal and capacity constraints. Our method integrates a problem specific heuristics and a repair mechanism to generate feasible and high quality solutions. We provide a mathematical formulation for GTWAP and introduce a new public benchmark set, using CPLEX to obtain exact solutions. Computational experiments demonstrate that our GA is highly competitive with CPLEX, often matching its performance. This effectiveness makes our method a practical and scalable AI-driven tool for complex scheduling in domains like logistics and healthcare.
Volume: 15
Issue: 2
Page: 1261-1274
Publish at: 2026-04-01

A sequential attention-enhanced deep learning framework for robust potato leaf disease diagnosis under real field conditions

10.11591/ijai.v15.i2.pp1790-1803
Watcharkorn Yoochomboon , Nithizethe Mhuadthongon , Piyaporn Krachodnok
Diagnosing potato leaf diseases from images collected in real-life field settings is challenging, mainly because of uneven lighting, complex backgrounds, and disease symptoms that are often subtle or visually inconsistent. In this study, a deep learning-based framework was developed to support potato leaf disease diagnosis, with particular attention given to improving generalization and interpretation. Several convolutional neural network (CNN) architectures were first examined under the same experimental conditions, and ResNeXt-50 showed the most stable overall performance. The model was then extended by applying efficient channel attention (ECA), followed by spatial attention adapted from the convolutional block attention module (CBAM). Test results indicate that this sequential attention design performs better than the baseline model as well as variants using only a single attention mechanism. Additional evaluation using 300 real-field images collected under different field conditions suggests improved robustness, while visualization results from gradient weighted class activation mapping (Grad-CAM) show clearer focus on lesion-related regions. Overall, the findings suggest that combining channel wise and spatial attention can improve both prediction reliability and interpretability, making the approach suitable for practical agricultural use.
Volume: 15
Issue: 2
Page: 1790-1803
Publish at: 2026-04-01

Evaluating hybrid and standard deep learning models for maximum temperature forecasting in a semi-arid region

10.11591/ijeecs.v42.i1.pp183-193
Oussama Zemnazi , Sanaa El Filali , Sara Ouahabi , Abderrahim Mouhtadi
Temperature forecasting is important for industries affected by climate, especially in semi-arid regions where the weather can change quickly and is hard to predict over time. Many studies have examined various deep learning models, including long short-term memory (LSTM), gated recurrent unit (GRU), convolutional neural networks (CNNs), and transformer-based hybrids. However, their performance in data-limited semi-arid environments is often unclear and inconsistent. This study compares six deep learning methods for predicting daily maximum temperatures in Settat, Morocco. It uses 11 years of ground-observed meteorological data. The models examined include a baseline artificial neural network (ANN) and five hybrid structures: ANN-LSTM, ANN-GRU, ANN-CNN, ANN–random forest (RF), and ANN-transformer. The results indicate that the ANN performs the best overall, with MAE = 0.0432, root mean square error (RMSE) = 0.0543, and R² = 0.8820. It surpasses all hybrid models. When using a relative improvement metric, the ANN shows accuracy gains of 32% to 42% compared to the recurrent, convolutional, and attention-based hybrids. These results suggest that in semi-arid climates, where maximum temperature mainly depends on the same-day atmospheric conditions, simpler feedforward models work better than more complex temporal models. The study underscores the need to match model complexity with climatic factors and dataset size, offering a useful benchmark for temperature forecasting in regions with limited data.
Volume: 42
Issue: 1
Page: 183-193
Publish at: 2026-04-01

Automated classification of apple bruises from hyperspectral images: an approach for fruit quality assessment

10.11591/ijai.v15.i2.pp1381-1389
Peddireddy Venkateswara Reddy , Alaguchamy Parivazhagan
Apple bruise detection plays a crucial role in post-harvest quality control; however, conventional manual inspection remains labor-intensive, subjective, and unsuitable for large-scale industrial deployment. This study proposes an automated classification framework for identifying bruised regions in apples using hyperspectral imaging combined with deep learning and adaptive optimization techniques. The proposed model integrates a long short-term memory (LSTM) network optimized using an adaptive sand cat swarm optimization (ASCSO) algorithm, along with a ResNet-50 feature extraction backbone. The adaptive behavior embedded within ASCSO dynamically adjusts the optimization parameters to enhance convergence and prevent premature stagnation during LSTM hyperparameter tuning. Hyperspectral images were processed to extract relevant spectral–spatial features, which were subsequently fed into the optimized classifier. Experimental evaluations demonstrate that the proposed hybrid model significantly outperforms conventional and baseline deep learning approaches, achieving a classification accuracy of 98.0% while maintaining robustness across varying bruise patterns and intensity levels. The results highlight the effectiveness of combining hyperspectral imaging with adaptive deep learning optimization for high-precision fruit quality assessment. This research contributes a reliable, scalable solution for automated bruise detection and quality grading in the fruit supply chain, offering strong potential to reduce post-harvest losses and improve operational efficiency in the agro-food industry.
Volume: 15
Issue: 2
Page: 1381-1389
Publish at: 2026-04-01

TMA-Net: a transformer-based multi-modal attention network for abnormal behavior detection

10.11591/ijai.v15.i2.pp1441-1450
Huong-Giang Doan , Ngoc-Trung Nguyen
Abnormal behavior detection in crowded environments remains challenging due to complex motion patterns, occlusions, and domain variability. This paper presents transformer-based multi-modal attention network (TMA-Net), a unified framework that integrates red, green, and blue (RGB), optical flow (OF), and heat map (HM) modalities through a dual-stage attention fusion mechanism. The system employs you only look once version 11 (YOLOv11) for human localization and vision transformer (ViT)-B/16 for feature encoding, followed by intra-modal self-attention and cross-modal fusion to capture fine-grained spatial–temporal and motion energy dependencies. Extensive experiments on six public benchmarks as UMN, Crowd-11, UBNormal, ShanghaiTech, CUHK Avenue, UCSD Ped2, and EPUAbN dataset, demonstrate that TMA-Net achieves up to 97.5% area under the curve (AUC) and 96–100% accuracy, outperforming previous other state-of-the-art approaches. These results highlight the framework’s strong generalization and robustness across both single- and cross-dataset evaluations, underscoring its potential for reliable deployment in real intelligent surveillance systems.
Volume: 15
Issue: 2
Page: 1441-1450
Publish at: 2026-04-01
Show 16 of 2007

Discover Our Library

Embark on a journey through our expansive collection of articles and let curiosity lead your path to innovation.

Explore Now
Library 3D Ilustration