Articles

Access the latest knowledge in applied science, electrical engineering, computer science and information technology, education, and health.

Filter Icon

Filters article

Years

FAQ Arrow
0
0

Source Title

FAQ Arrow

Authors

FAQ Arrow

30,376 Article Results

Dynamic service-aware network selection framework for multi objective optimization in 5G-advanced heterogeneous wireless networks

10.11591/ijai.v14.i6.pp4993-5007
Bhavana Srinivas , Nadig Vijayendra Uma Reddy
The increasing complexity of heterogeneous wireless networks (HWNs) and the diverse requirements of mobility patterns and service classes necessitate advanced solutions for network selection and resource optimization. Existing models often fall short in addressing dynamic mobility scenarios and service differentiation, leading to inefficiencies in resource allocation, suboptimal throughput, and increased latency. To overcome these limitations, this study proposes a dynamic service-aware network selector (DSANS) framework for 5G-advanced environments. The framework integrates an adaptive deep decision network (ADDN) for multi-objective optimization, addressing critical quality of service (QoS) metrics such as throughput, delay, and energy efficiency while enhancing quality of experience (QoE) for applications like enhanced mobile broadband (eMBB), ultra-reliable low latency communication (URLLC), and internet of things (IoT). The DSANS framework dynamically adapts to mobility patterns and varying network conditions, ensuring efficient resource estimation and optimal network selection. Simulation results highlight its superiority, achieving up to 25% improvement in throughput and a 15% reduction in latency compared to state-of-the-art algorithms. These findings validate DSANS as a robust solution for mitigating the limitations of existing models, optimizing network performance, and meeting the stringent demands of next-generation HWNs.
Volume: 14
Issue: 6
Page: 4993-5007
Publish at: 2025-12-01

Catalysing precision in bone x-ray analysis for image detection and classification: the triple context attention model advancement

10.11591/ijai.v14.i6.pp4957-4970
Tabassum N. Sultana , Nagaratna P. Hegde , Asma Parveen
Accurate detection and classification of fractures in bone x-ray images are crucial for effective medical diagnosis and treatment. In this study, we propose the triple context attention model (TCAN) as a novel approach to address the challenges in this domain. TCAN offers several key contributions that significantly enhance the accuracy and efficiency of bone x-ray image recognition and classification. Firstly, TCAN introduces the coordination attention mechanism, which considers both horizontal and vertical positional data during the recognition process. Secondly, TCAN mitigates the common issue of mislabelling fractures in bone x-ray images, particularly in the you only look once (YOLO) model, due to the absence of positional data during training. Thirdly, TCAN efficiently enhances positional data by focusing on weights, and increasing feature dimension while maintaining a manageable model size. This allows for effective utilization of positional data without computational overhead. Lastly, TCAN combines the visual attention network (VAN) with its capabilities, resulting in a comprehensive system that can handle diverse image dimensions and accurately classify various types of fractures across different body regions. Overall, TCAN presents a promising advancement in medical image analysis, improving fracture detection accuracy and classification efficiency in bone x-ray images, thus aiding in more effective clinical decision-making.
Volume: 14
Issue: 6
Page: 4957-4970
Publish at: 2025-12-01

Kannada handwritten numeral recognition through deep learning and optimized hyperparameter tuning

10.11591/ijai.v14.i6.pp5038-5048
Ujwala B. S. , Pramod Kumar S. , H. R. Mahadevaswamy , Sumathi K.
The classification of handwritten numerals is a vital and challenging task in developing automated systems, including postal address sorting and license plate recognition. The present study elucidates a new methodology for recognizing Kannada handwritten numerals using deep learning ResNet and VGG architecture with transfer learning. The challenge in Kannada handwritten recognition is complicated structural hierarchy and large vocabulary. The major problem in deep neural networks is vanishing gradient, which can lead to degradation in character recognition, and was addressed using our new methodology using ResNet architecture. We apply the proposed ResNet method in various real-world applications and compare it with convolutional neural networks (CNN) architecture, VGG. The experiment was implemented with the Google Colab software version on a self-created dataset, with handwritten Kannada numerals fed as the input to the recognition process. Our proposed method achieved a high accuracy of 99.20% on training samples and a generalization accuracy of 97.5% on test samples, indicating our method's effectiveness in recognizing handwritten Kannada numerals.
Volume: 14
Issue: 6
Page: 5038-5048
Publish at: 2025-12-01

Hybrid N-gram-based framework for payload distributed denial of service detection and classification

10.11591/ijai.v14.i6.pp4763-4774
Andi Maslan , Cik Feresa Mohd Foozy , Kamaruddin Malik Bin Mohamad , Abdul Hamid , Dedy Fitriawan , Joni Hasugian
There are three primary approaches to DDoS detection: anomaly-based, pattern-based, and heuristic-based. The heuristic-based method integrates both anomaly- and pattern-based techniques. However, existing DDoS detection systems face challenges in performing HTTP payload-level analysis, mainly due to high false positive rates and insufficient granularity in current datasets. To address this, the study introduces a novel heuristic approach based on a hybrid N-Gram model. This hybrid combines two components: CSDPayload+N-Gram and CSPayload+N-Gram. CSDPayload represents the gap (measured via Chi-Square Distance) between a given payload and normal traffic payloads, while CSPayload reflects the similarity (measured via Cosine Similarity) between them. These metrics form a new feature set evaluated using three datasets: CIC2019, MIB2016, and H2N-Payload. The methodology begins with packet extraction and conversion of TCP/IP traffic—specifically HTTP traffic—into hexadecimal payloads. N-Gram analysis (from 1-Gram to 6-Gram) is then applied to these payloads. For each N-Gram, frequency counts are computed, followed by calculations of Chi-Square Distance (CSD), Cosine Similarity (CS), and Pearson’s Chi-Square test to classify payloads as either benign or malicious. Subsequently, feature selection is performed using weight correlation, and the resulting features are fed into three machine learning classifiers: Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Neural Network. Experimental results demonstrate high detection accuracy, particularly in the 4-Gram feature category: Neural Network achieves 99.65%, KNN 95.14%, and SVM 99.73% accuracy on average.
Volume: 14
Issue: 6
Page: 4763-4774
Publish at: 2025-12-01

Transmission line fault detection using empirical mode decomposition in presence of wind intermittency

10.11591/ijape.v14.i4.pp960-969
Venkata Krishna Bokka , E. R. Biju , Sai Veerraju Mortha , Majahar Hussain Mahammad , Shaik Mohammad Irshad
The regular fault detection approaches are failed to detect the faults in wind integrated transmission networks due to intermittency nature of the wind energy. More reliable schemes are required to accomplish the detection of faults in presence wind. This article proposed empirical mode decomposition (EMD) based fault detection scheme to detect various faults in wind integrated transmission lines during the normal and stressed conditions of the system. The instantaneous current measurements available at either sending or receiving end are processed through EMD to decompose it into a series of intrinsic mode functions (IMFs) and IMF2 is identified as a dominated IMF with numerous case wise investigations. 1/4th cycle moving window is used to calculate the absolute sum of the IMF2 coefficients to detect the faults with the support of a predefined threshold. The efficacy of the method is tested on different types of faults during the normal condition in presence of wind and later extended to stressed conditions such as power swing. The method is reliable during the typical cases and includes remote end and high resistance faults. All the experiments are carried out in Simulink to generate the measurement data and programs are executed in MATLAB.
Volume: 14
Issue: 4
Page: 960-969
Publish at: 2025-12-01

A blended ensemble approach for accurate human activity recognition

10.11591/ijai.v14.i6.pp5131-5139
Rezwana Karim , Afsana Begum , Miskatul Jannat , Abu Kowshir Bitto
Human activity recognition (HAR) is a novel computer vision area with applications in fashion, entertainment, healthcare, and urban planning. Previously, convolutional neural networks (CNNs) were used in HAR due to their ability to extract spatial features from images. However, CNNs are not effective in processing varying input sizes and long-range dependencies in complex human motions. This work examines another approach using vision transformers (ViT) and swin transformers (SwinT) that process images as patch sequences and perform self-attention. These models particularly excel in learning global relationships and minor motion changes in body motion and are therefore very well-suited to variegated and subtle activity detection. To further enhance recognition performance, we propose a hybrid ensemble method by combining ViT and SwinT models with different scales (small, base, and large). Experimental outcomes show that while single transformer models are competitive, the hybrid ensemble beats them across the board with the highest accuracy and balanced precision, recall, and F1-score. These findings confirm that the intended ensemble model provides a more scalable and robust solution than either single-model or CNN-based approaches, and this encourages accurate human activity recognition.
Volume: 14
Issue: 6
Page: 5131-5139
Publish at: 2025-12-01

Revolutionizing human activity recognition with prophet algorithm and deep learning

10.11591/ijict.v14i3.pp1108-1118
Jaykumar S. Dhage , Avinash K. Gulve
Various industries, such as healthcare and surveillance, depend heavily on the ability to recognize human activity. The “human activity recognition (HAR) using smartphones data set” can be found in the UCI online repository and includes accelerometer and gyroscope readings recorded during a variety of human activities. The accelerometer and gyroscope signals are also subjected to a band-pass filter to eliminate unwanted frequencies and background noise. This method effectively decreases the dimensionality of the feature space while improving the model's accuracy and efficiency. “Convolutional neural networks (CNNs)” and “long shortterm memory (LSTM)” networks are combined to create pyramidal dilated convolutional memory network (PDCMN), which is the final proposal. Results from experiments demonstrate the effectiveness and reliability of the suggested method, demonstrating its potential for precise and effective HAR in actuality schemes.
Volume: 14
Issue: 3
Page: 1108-1118
Publish at: 2025-12-01

Unified BERT-LSTM framework enhances machine learning in fraud detection, financial sentiment, and biomedical classification

10.11591/ijai.v14.i6.pp5081-5095
Oussama Ndama , Ismail Bensassi , Safae Ndama , El Mokhtar En-Naimi
The current paper proposes a hybrid framework based on the bidirectional encoder representations from transformers (BERT) and long short-term memory (LSTM) networks for classification tasks in three diverse domains: credit card fraud detection (CCFD), financial news sentiment analysis (FNSA), and biomedical paper abstract classification (BPAC). The model leverages the strengths of BERT regarding the learning of contextual embeddings and those of LSTM in capturing sequential dependencies, thus setting the new state-of-the-art performance in each of the three domains. In the CCFD use case, the model was able to achieve an accuracy of 99.11%, considerably outperforming all the competing systems in fraud transaction detection. The BERT-LSTM model achieved a performance of 96.74% for FNSA, improving significantly in sentiment analysis. Finally, the use case of BPAC was robust, with 88.42% accuracy, which clearly classified biomedical abstract sections correctly. It is evident from the findings that this framework generalizes to a wide range of tasks and hence is an adaptable but strong tool in combating challenges of cross-domain classification.
Volume: 14
Issue: 6
Page: 5081-5095
Publish at: 2025-12-01

Electric load forecasting using ARIMA model for time series data

10.11591/ijict.v14i3.pp830-836
Balasubramanian Belshanth , Haran Prasad , Thirumalaivasal Devanathan Sudhakar
Any country's economic progress is heavily reliant on its power infrastructure, network, and availability, as energy has become an essential component of daily living in today's globe. Electricity's distinctive quality is that it cannot be stored in huge quantities, which explains why global demand for home and commercial electricity has grown at an astonishing rate. On the other hand, electricity costs have varied in recent years, and there is insufficient electricity output to meet global and local demand. The solution is a series of case studies designed to forecast future residential and commercial electricity demand so that power producers, transformers, distributors, and suppliers may efficiently plan and encourage energy savings for consumers. However, load prognosticasting has been one of the most difficult issues confronting the energy business since the inception of electricity. This study covers a new one–dimensional approach algorithm that is essential for the creation of a short–term load prognosticasting module for distribution system design and operation. It has numerous operations, including energy purchase, generation, and infrastructure construction. We have numerous time series forecasting methods of which autoregressive integrated moving average (ARIMA) outperforms the others. The auto–regressive integrated moving average model, or ARIMA, outperforms all other techniques for load forecasting.
Volume: 14
Issue: 3
Page: 830-836
Publish at: 2025-12-01

Evaluation of midwifery educated mobile applications for labor guidance and a roadmap for future developers

10.11591/ijai.v14.i6.pp5268-5278
Seeta Devi , Swapnil Vitthal Rahane , Lily Podder , Sangeetha X. , Kumari Dimple
The objective of the study was to review the midwifery guided mobile apps for labor advice, assessing features, functions, and content relevance. In February to March 2024, midwifery labor-guided applications were reviewed in mobile platforms such as the Google Play Store and Apple iTunes Store. We used multimodal evaluation tools, such as the mobile app rating scale (MARS), specific statements, and IQVIA ratings, to assess the quality of these applications. The study evaluated midwifery-guided applications, resulting in an average objective quality score of 3.96±0.96 out of 5. 'Safe delivery' scored the highest rating of 4.94, followed by 'Pregnancy mentor' (4.89), 'Hypno-birthing' (4.61), 'Obstetrics 6th edition' (4.68), and 'MSD manual guide to obstetrics' (4.56). Functionality received the highest score (4.16±0.865), followed by information (3.99±0.97), engagement (3.88±1.07), and aesthetics (3.82±0.28) areas. Subjective quality score was 3.6±1.18 out of 5 for an overall MARS score of 3.76±1.02. Most applications received favorable reviews, indicating good quality, and it is recommended that future app developers design applications that include comprehensive information on labor management.
Volume: 14
Issue: 6
Page: 5268-5278
Publish at: 2025-12-01

Shellcode classification analysis with binary classification-based machine learning

10.11591/ijict.v14i3.pp923-932
Jaka Naufal Semendawai , Deris Stiawan , Iwan Pahendra Anto Saputra , Mohamed Shenify , Rahmat Budiarto
The internet enables people to connect through their devices. While it offers numerous benefits, it also has adverse effects. A prime example is malware, which can damage or even destroy a device or harm its users, highlighting the importance of cyber security. Various methods can be employed to prevent or detect malware, including machine learning techniques. The experiments are based on training and testing data from the UNSW_NB15 dataset. K-nearest neighbor (KNN), decision tree, and Naïve Bayes classifiers determine whether a record in the test data represents a Shellcode attack or a non-Shellcode attack. The KNN, decision tree, and Naïve Bayes classifiers reached accuracy rates of 96.26%, 97.19%, and 57.57%, respectively. This study's findings aim to offer valuable insights into the application of machine learning to detect or classify malware and other forms of cyberattacks.
Volume: 14
Issue: 3
Page: 923-932
Publish at: 2025-12-01

A review of driver distraction detection while driving based on convolutional neural networks

10.11591/ijai.v14.i6.pp4415-4426
Ghady Alhamad , Mohamad-Bassam Kurdy
Driver distraction represents a major cause of traffic accidents, posing a serious threat to human life. In this review, we present the latest research findings of driver distraction detection based on convolutional neural networks (CNNs). In general, the analysis of driver behavior while driving is represented by either detecting driver drowsiness or attention diversion from driving by other activities, all of which fall under the definition of driver distraction. Facial features are often the basis for detecting driver drowsiness. In most papers, it is typically done by eye blinking, yawning, and head movement. As for the driver attention diversion, it is through the position of the hand and face. It involves many activities, text messages, making phone calls, adjusting the radio, consuming beverages, reaching for objects behind the driver, applying makeup, interacting with passengers, and other similar distractions. However, suggesting new methodologies in driver distraction detection and choosing appropriate CNN-based techniques is a big challenge given the wide variety experiments and studies in this field. Therefore, previous papers should be revisited to produce new methods by taking advantage of the techniques used. As a result, this paper reviews research approaches and reveals the effectiveness of CNN in detecting driver distraction. Finally, the article lists techniques that can be used as benchmarks in this context.
Volume: 14
Issue: 6
Page: 4415-4426
Publish at: 2025-12-01

Optimizing sparse ternary compression with thresholds for communication-efficient federated learning

10.11591/ijai.v14.i6.pp4902-4912
Nithyanianjan Murthy Chittaiah , Manjula Sunkadakatte Haladappa
Federated learning (FL) enables decentralized model training while preserving client data privacy, yet suffers from significant communication overhead due to frequent parameter exchanges. This study investigates how varying sparse ternary compression (STC) thresholds impact communication efficiency and model accuracy across the CIFAR-10 and MedMNIST datasets. Experiments tested thresholds ranging from 1.0 to 1.9 and batch sizes of 10, 15, and 20. Results demonstrated that selecting thresholds between 1.2 and 1.5 reduced total communication costs by approximately 10–15%, while maintaining acceptable accuracy levels. These findings suggest that careful threshold tuning can achieve substantial communication savings with minimal compromise in model performance, offering practical guidance for improving the efficiency and scalability of FL systems.
Volume: 14
Issue: 6
Page: 4902-4912
Publish at: 2025-12-01

Comparative analysis of u-net architectures and variants for hand gesture segmentation in parkinson’s patients

10.11591/ijict.v14i3.pp972-982
Avadhoot Ramgonda Telepatil , Jayashree Sathyanarayana Vaddin
U-Net is a well-known method for image segmentation, and has proven effective for a variety of segmentation challenges. A deep learning architecture for segmenting hand gestures in parkinson’s disease is explored in this paper. We prepared and compared four custom models: a simple U-Net, a three-layer U-Net, an auto encoder-decoder architecture, and a U-Net with dense skip pathways, using a custom dataset of 1,000 hand gesture images and their corresponding masks. Our primary goal was to achieve accurate segmentation of parkinsonian hand gestures, which is crucial for automated diagnosis and monitoring in healthcare. Using metrics including accuracy, precision, recall, intersection over union (IoU), and dice score, we demonstrated that our architectures were effective in delineating hand gestures under different conditions. We also compared the performance of our custom models against pretrained deep learning architectures such as ResNet and VGGNet. Our findings indicate that the custom models effectively address the segmentation task, showcasing promising potential for practical applications in medical diagnostics and healthcare. This work highlights the versatility of our architectures in tackling the unique segmentation challenges associated with parkinson’s disease research and clinical practice.
Volume: 14
Issue: 3
Page: 972-982
Publish at: 2025-12-01

Semantic search-enhanced healthcare chatbot for hospital information management system using vector database and transformer models

10.11591/ijai.v14.i6.pp4600-4613
Erda Guslinar Perdana , Arya Adhi Nugraha
Healthcare chatbots are increasingly used to assist hospital staff, yet most existing systems rely on rule-based or generic machine learning (ML) approaches that lack the ability to comprehend natural language queries, while proprietary deep learning systems often incur high licensing costs. This work addresses this gap by proposing a cost-effective and scalable semantic vector retrieval solution for user intent recognition in a hospital information management system (HIMS) helpdesk chatbot. The MPNet based transformer model is employed to convert user inquiries and predefined intents into feature vectors, enabling highly accurate natural language understanding through cosine similarity retrieval within a dedicated vector database. The proposed vector search method was validated via an ablation study, achieving an accuracy of 0.70 for intent recognition, which demonstrates a significant performance gain of 28.0 percentage points over a traditional keyword-based search baseline. Usability testing across developer and doctor groups yielded an average score of 7.78 on a 10-point Likert scale. This study concludes that integrating semantic vector retrieval with a vector database is highly effective for recognizing specialized clinical intents, offering a more accurate solution that significantly reduces the manual helpdesk workload and enhances 24-hour assistance in healthcare.
Volume: 14
Issue: 6
Page: 4600-4613
Publish at: 2025-12-01
Show 92 of 2026

Discover Our Library

Embark on a journey through our expansive collection of articles and let curiosity lead your path to innovation.

Explore Now
Library 3D Ilustration