Articles

Access the latest knowledge in applied science, electrical engineering, computer science and information technology, education, and health.

Filter Icon

Filters article

Years

FAQ Arrow
0
0

Source Title

FAQ Arrow

Authors

FAQ Arrow

30,096 Article Results

Deep learning for early detection of cardiovascular diseases via auscultation sound classification

10.11591/ijai.v15.i2.pp1746-1761
Shreyas Kasture , Sudhanshu Maurya , Amit Kumar Sharma , Santhosh Chitraju Gopal Varma , Kashish Mirza , Firdous Sadaf Mohammad Ismail
Heart diseases are one of the most prominent causes of death globally, which requires immediate and accurate diagnosis. The auscultation methods used in conventional medical practice, where the doctor listens to the sounds produced by the body without intervention is very ineffective because of the limitations in the actual skills and perception of the doctor. The main goal of this project will be designing a mobile-based system for the early detection of cardiovascular disease (CVD) by utilizing deep learning for auscultation sound classification. The approach involves the use of deep learning structures to classify cardiac sounds into normal and abnormal patterns on its own. Wavelet transformations, time-frequency representations, and Mel frequency cepstral coefficients (MFCC) have been used in feature extraction. The ResNet152V2 model showed high classification performance with area under the receiver operating characteristic curve (AUROC) of 0.9797 and 0.9636 on two datasets. Contrary to that, data augmentation, hyperparameter optimization, attention mechanisms, as well as input-output residual connections, led to better functionality and interpretability. This research seeks to overcome the limitations of traditional stethoscope use through the incorporation of sophisticated algorithms and the availability of mobile technology that could result in early diagnosis and prevention of CVDs, especially in underprivileged areas.
Volume: 15
Issue: 2
Page: 1746-1761
Publish at: 2026-04-01

Adaptive control of ball and beam system using SNA-PID combined with recurrent fuzzy neural network identifier

10.11591/ijai.v15.i2.pp1202-1210
Minh-Thanh Le , Chi-Ngon Nguyen
The ball and beam system is a nonlinear and inherently unstable single input, multiple-output (SIMO) system, which poses significant challenges for control design. Intelligent control algorithms are often applied to autonomously control complex systems when there are changes in parameters or the control environment. Therefore, in this paper, we research and develop two methods: proportional integral derivative (PID) and single neuron adaptive (SNA)-PID-recurrent fuzzy neural network identifier (RFNNI) to control the ball and beam system. Simulation results on MATLAB/Simulink show that the SNA-PID-RFNNI controller provides a more stable output signal than the traditional PID controller, with minimal overshoot and a settling time of about 15 seconds. Next, we will conduct real-time experiments on the object using the proposed algorithm through the MEGA2560 control board with an ultrasonic positioning mechanism.
Volume: 15
Issue: 2
Page: 1202-1210
Publish at: 2026-04-01

Fetal organ detection using feature enhancement with attention and residual block

10.11591/ijai.v15.i2.pp1593-1604
Nuswil Bernolian , Siti Nurmaini , Ade Iriani Sapitri , Annisa Darmawahyuni , Muhammad Naufal Rachmatullah , Bambang Tutuko , Firdaus Firdaus
The rapid advancements in fetal ultrasonography have significantly enhanced prenatal diagnosis in recent years. Deep learning (DL) architectures have further streamlined the process of organ detection, improved diagnostic accuracy, and reduced observer dependency. This study proposes a computer-aided DL approach for fetal organ segmentation using the you only look once (YOLO) algorithm, a state-of-the-art method for object detection and image segmentation. This study identified and classified 15 fetal organs, including the umbilical vein, stomach, abdomen, brain (trans-cerebellum, trans-thalamic, and trans-ventricular regions), femur, head, thorax (chest cavity), heart (circumference, left atrium, left ventricle, right atrium, right ventricle), and aorta. We compared the performance of YOLOv7, YOLOv8, YOLOv9, and YOLOv11 architectures. The results showed that YOLOv9 outperformed YOLOv7, YOLOv8, and YOLOv11 achieving mAP50 and mAP95 scores of 91.90% and 94.50%, respectively. This performance surpasses previous studies that focused on classifying only a limited number of fetal organs.
Volume: 15
Issue: 2
Page: 1593-1604
Publish at: 2026-04-01

Blockchain-enabled framework using diversity mutation with siberian tiger optimization for offloading in fog computing

10.11591/ijai.v15.i2.pp1371-1380
Srikanta Murthy Rajini , Reginald Shilpa
Fog computing has developed as a promising framework to support latency sensitive internet of things (IoT) applications for mobile devices operating in dynamic environments. During the offloading process, malicious activities interrupt the existing methods, which increases the execution time. Therefore, this research proposes a diversity mutation with siberian tiger optimization (DM-STO) for computation offloading in blockchain based fog computing. The blockchain is used to secure offload and attain quality of service (QoS) mobile users with less energy consumption and execution time. The DM-STO can balance workloads among local devices and fog servers. The diversity mutation operation improves the exploration ability to dynamic network conditions, leading to efficient computational offloading in fog computing. The execution time, service cost and energy consumption are evaluated to calculate the performance of the proposed DM-STO with varying numbers of IoT requests such as 50, 100, 200, and 300. For 50 IoT requests with a fixed fog server of 10, the DM-STO achieves an execution time of 18 s, a service cost of 10$ and energy consumption of 5 mJ compared to the BAT algorithm.
Volume: 15
Issue: 2
Page: 1371-1380
Publish at: 2026-04-01

TunDC: a public benchmark dataset for sentiment analysis and language modeling in the Tunisian dialect

10.11591/ijai.v15.i2.pp1891-1908
Ahmed Khalil Boulahia , Mourad Mars
The development of natural language processing (NLP) applications has increasingly focused on dialectal variations of languages. The Tunisian dialect (TD), a widely spoken variant of Arabic, poses unique linguistic challenges due to its lack of standardized writing conventions and influences from multiple languages, including French, Italian, Turkish, and Berber. In this work, we introduce TunDC, a dataset of 20,044 labeled comments designed to advance NLP research on the TD. The dataset covers diverse linguistic forms (Arabic, Latin, and mixed scripts), and each comment was manually annotated for positive or negative sentiment by native speakers, achieving high inter-annotator agreement. To evaluate its effectiveness, we fine-tuned various models on TunDC. The bert-base-arabic-TunDC-mixed model achieved an accuracy of 0.84 and a macro-averaged F1-score of 0.83, demonstrating strong generalization across sentiment categories and writing systems. A stratified data-splitting strategy considering both sentiment and script type further improved accuracy by approximately 8% compared to standard splits. As a publicly available resource, TunDC contributes to the computational linguistics community, fostering advancements in language modeling and applications tailored to the TD.
Volume: 15
Issue: 2
Page: 1891-1908
Publish at: 2026-04-01

A novel approach to detect tomato leaf disease using vision transformer

10.11591/ijai.v15.i2.pp1548-1565
Sanjeela Sagar , Jaswinder Singh
Tomatoes are one of the most widely consumed vegetables across the world. However, tomatoes are prone to diseases. Recognizing and classifying tomato leaf diseases is crucial task. Various deep learning (DL) methods have been developed by several researchers, but they have some complex issues like noise in images, high computational complexity, poor accuracy, and limited feature selection. The main goal of this research is to present novel DL based tomato leaf disease classification framework with neural network based gated vision transformer (G-ViT) model assisted attention mechanism. The proposed framework uses dilated convolution with bidirectional long short-term memory (Bi-DLSTM) used for efficient feature extraction to enhance the classification. An effective chaotic spider wasp optimization (CSWO) is used for feature selection. Further, novel attention based gated vision transformer (A-GVT) is used to classify tomato leaf diseases which integrates strengths of attention mechanism and G-ViT models. Further, to improve the generalizability of classification model, its parameters are tuned with black widow optimization (BWO) algorithm. The experimental findings shows that proposed framework outperformed previous studies on tomato leaf disease identification and classification models in terms of accuracy, precision, recall, F1-score, specificity, mean absolute error (MAE), and root mean square error (RMSE) with 99.7%, 98.29%, 98.22%, 98.25%, 99.19%, 0.03, and 0.25 respectively. The proposed study can pave a way for new agricultural revolution.
Volume: 15
Issue: 2
Page: 1548-1565
Publish at: 2026-04-01

Breast cancer detection using residual DenseNets in deep learning

10.11591/ijai.v15.i2.pp1632-1645
Naganandini Gururajarao , Vishwanath R. Hulipalled
Breast cancer, the leading cause of cancer-related deaths among women globally, requires a prompt and precise diagnosis in order to increase survival rates via therapy. There is a possibility of bias and inconsistency in the results of traditional diagnostic procedures like mammography, ultrasound, and histological testing since they rely on the expertise of radiologists and pathologists. There are exciting new opportunities for breast cancer diagnostics to be enhanced by artificial intelligence (AI) and deep learning. The purpose of this research is to examine the feasibility of using convolutional neural networks (CNNs) and residual dense networks (ResDenseNets) used for breast cancer automated detection in medical images. Because of their superior capacity to learn hierarchical features from raw image data, CNNs are ideal for medical image interpretation. By including residual connections, which allow for the training of considerably deeper models, ResDenseNets—an extension of CNNs—mitigate the problem of vanishing gradient in deep networks. ResDenseNet and CNNs considerably enhance the accuracy of breast cancer diagnosis in comparison to conventional approaches, according to the findings. Notably, ResDenseNets outperform other types of networks because they are able to learn intricate and nuanced properties directly from the data.
Volume: 15
Issue: 2
Page: 1632-1645
Publish at: 2026-04-01

TMA-Net: a transformer-based multi-modal attention network for abnormal behavior detection

10.11591/ijai.v15.i2.pp1441-1450
Huong-Giang Doan , Ngoc-Trung Nguyen
Abnormal behavior detection in crowded environments remains challenging due to complex motion patterns, occlusions, and domain variability. This paper presents transformer-based multi-modal attention network (TMA-Net), a unified framework that integrates red, green, and blue (RGB), optical flow (OF), and heat map (HM) modalities through a dual-stage attention fusion mechanism. The system employs you only look once version 11 (YOLOv11) for human localization and vision transformer (ViT)-B/16 for feature encoding, followed by intra-modal self-attention and cross-modal fusion to capture fine-grained spatial–temporal and motion energy dependencies. Extensive experiments on six public benchmarks as UMN, Crowd-11, UBNormal, ShanghaiTech, CUHK Avenue, UCSD Ped2, and EPUAbN dataset, demonstrate that TMA-Net achieves up to 97.5% area under the curve (AUC) and 96–100% accuracy, outperforming previous other state-of-the-art approaches. These results highlight the framework’s strong generalization and robustness across both single- and cross-dataset evaluations, underscoring its potential for reliable deployment in real intelligent surveillance systems.
Volume: 15
Issue: 2
Page: 1441-1450
Publish at: 2026-04-01

Usability analysis of the individual creativity assessment tool using the adjusted system usability scale

10.11591/ijai.v15.i2.pp1955-1962
Mohamad Rahimi Mohamad Rosman , Noor Arina Md Arifin , Siti Aishah Mokhtar , Nur Ainatul Mardiah Mat Nawi , Huda Hamidon , Salliza Md Radzi
Creativity is a critical element in the learning environment, which leads to innovation and research advancement in higher education. However, assessing creativity is challenging due to its diverse nature and the lack of standardized tools. The existing assessment tools often overlook the critical role of organizational culture in shaping individual creativity within academic settings. To address this gap, the individual creativity assessment tool (i-CAT) was developed based on the framework of organizational culture to assess its contribution to creativity among Malaysian academicians. This study aimed to i) assess the usability of i-CAT and ii) determine the significant effect of demographic factors on its usability assessment. A quantitative methodology, utilizing expert sampling and the system usability scale (SUS), was employed as the primary evaluation method. 20 experts with relevant professional and academic experience were selected for the validation. The results showed excellent usability, with 95% of experts rating the information system as functionally acceptable. A one way analysis of variance (ANOVA) found no significant difference in usability based on profession or education levels, but a significant difference was observed for experience levels. These findings confirm that i-CAT is a functional, user-friendly, and culturally relevant tool for creativity assessment within Malaysia’s higher education institutions.
Volume: 15
Issue: 2
Page: 1955-1962
Publish at: 2026-04-01

AI-induced fatigue among students in higher education: a latent profile analysis

10.11591/ijai.v15.i2.pp1963-1971
Dynah D. Soriano , Jordan L. Salenga , John Paul P. Miranda , Juvy C. Grume , Emerson Q. Fernando , Jr., Amado B. Martinez , Raymond A. Cabrera , Jaymark A. Yambao
The integration of artificial intelligence (AI) tools in education offers significant benefits but also introduces challenges, including AI-induced fatigue among students. This study aimed to classify students’ experiences with AI tools using latent profile analysis (LPA). A quantitative cross sectional design and referral approach were used to collect survey data from 388 college students who actively used AI tools for academic purposes from November to December 2024. The survey measured AI usage intensity, AI literacy, self-efficacy, perceived usefulness, cognitive load, technostress, sleep quality, general fatigue levels, and attitude toward AI. Descriptive results indicated moderate levels of AI usage intensity, AI literacy, perceived usefulness, cognitive load, sleep quality, and general fatigue, with technostress and attitude toward AI also at moderate levels. Model selection considered Akaike information criterion (AIC), Bayesian information criterion (BIC), entropy, and profile size adequacy, and expert review supported the retained six-profile structure. The LPA identified six interpretable user groups: competent but sleep-deprived users, overwhelmed and high-strain users, stable moderate users, strained moderate users, high intensity strained users, and low-strain selective users. The findings show differences in patterns of competence, strain, fatigue, and sleep outcomes associated with AI tool use, which supports the development of profile specific strategies to manage technostress, cognitive load, fatigue, and sleep disruption among higher education students.
Volume: 15
Issue: 2
Page: 1963-1971
Publish at: 2026-04-01

Automated classification of apple bruises from hyperspectral images: an approach for fruit quality assessment

10.11591/ijai.v15.i2.pp1381-1389
Peddireddy Venkateswara Reddy , Alaguchamy Parivazhagan
Apple bruise detection plays a crucial role in post-harvest quality control; however, conventional manual inspection remains labor-intensive, subjective, and unsuitable for large-scale industrial deployment. This study proposes an automated classification framework for identifying bruised regions in apples using hyperspectral imaging combined with deep learning and adaptive optimization techniques. The proposed model integrates a long short-term memory (LSTM) network optimized using an adaptive sand cat swarm optimization (ASCSO) algorithm, along with a ResNet-50 feature extraction backbone. The adaptive behavior embedded within ASCSO dynamically adjusts the optimization parameters to enhance convergence and prevent premature stagnation during LSTM hyperparameter tuning. Hyperspectral images were processed to extract relevant spectral–spatial features, which were subsequently fed into the optimized classifier. Experimental evaluations demonstrate that the proposed hybrid model significantly outperforms conventional and baseline deep learning approaches, achieving a classification accuracy of 98.0% while maintaining robustness across varying bruise patterns and intensity levels. The results highlight the effectiveness of combining hyperspectral imaging with adaptive deep learning optimization for high-precision fruit quality assessment. This research contributes a reliable, scalable solution for automated bruise detection and quality grading in the fruit supply chain, offering strong potential to reduce post-harvest losses and improve operational efficiency in the agro-food industry.
Volume: 15
Issue: 2
Page: 1381-1389
Publish at: 2026-04-01

Exponential long short-term memory with Levy flight optimization for lung nodule classification

10.11591/ijai.v15.i2.pp1451-1463
Kaliba Gowthami , Kamalakannan Jayaseelan
Lung cancer, which commonly appears as lung nodules is a deadly type of cancer that develops in a lung. Early detection of lung cancer is critical and challenging task due to presence of overlapping structures, which make it challenging to differentiate the benign and malignant regions. This research proposes long short-term memory (LSTM) with exponential linear unit (ELU) method for the classification of different classes of lung nodules. The hyperparameters of the LSTM network are optimized using the developed dynamic Levy flight – Archimedes optimization algorithm (DLF-AOA), which effectively identifies the optimal parameters for classification. The ResNet-18 method is used for the extraction of high-level features to differentiate various classes of lung nodules. Furthermore, Bayesian active contour (BAC) is employed for the segmentation of images as containing cancerous and non-cancerous regions of lung nodules. The LSTM with ELU method achieves 98.56% accuracy, 97.54% sensitivity, 98.22% specificity, 96.93% precision, 96.33% F1-score, and 1.44 error rate in IQ-OTH/NCCD lung cancer dataset.
Volume: 15
Issue: 2
Page: 1451-1463
Publish at: 2026-04-01

Double-hop of reconfigurable intelligent surfaces-aided for wireless optical link under log-normal fading channels

10.11591/ijai.v15.i2.pp1174-1180
Duong Huu Ai , Van Loi Nguyen , Khanh Ty Luong
In optical wireless communication (OWC), the reconfigurable intelligent surfaces (RIS) are used to manipulate optical signals by controlling the phase shifts or amplitude of reflected beams, which helps improve signal quality. RIS units can be tailored to increase the strength and reliability of the communication link, especially in challenging fading conditions. The double-hop scenario involves two RIS-assisted segments, such as transmitter to RIS-1 and RIS-1 to RIS-2 or a receiver. Each hop encounters log-normal fading, which impacts the overall link performance. Log-normal fading models the irradiance fluctuation caused by turbulence, which is significant in free-space optical (FSO) systems, this fading model assumes that the received optical signal’s amplitude varies with a log-normal distribution, making it more suited for weak to moderate turbulence. Numerical results are obtained under different of link distance, subcarrier quadrature amplitude modulation (QAM) is displayed quantitatively illustrate the average symbol error rate in the absence of RIS and with double-hop of RIS.
Volume: 15
Issue: 2
Page: 1174-1180
Publish at: 2026-04-01

A comparative study of Arabic morphological analyzers

10.11591/ijai.v15.i2.pp1876-1890
Omar Saadiyeh , Alaaeddine Ramadan , Chamseddine Zaki , Mohamad Hajjar , Gilles Bernard
The field of Arabic natural language processing (NLP) has witnessed significant advancements, driven by the development of various morphological analyzers. This paper compares several major Arabic morphological analyzers and examines their ability to handle word ambiguities, process dialects, operate efficiently, and support downstream NLP tasks. By reviewing previous studies, we identify key gaps, including the limited resources for dialects, the shortage of annotated corpora, and challenges related to system scalability. The study also highlights future directions, such as building larger and more diverse corpora, adapting neural models for dialects, and developing analyzers that are more interpretable and trustworthy. Overall, this comparative overview aims to provide a clearer understanding of the current state of Arabic morphological analyzers, synthesize existing research, and offer practical recommendations for future work in this area.
Volume: 15
Issue: 2
Page: 1876-1890
Publish at: 2026-04-01

A deep learning-based approach for hearing loss detection

10.11591/ijai.v15.i2.pp1701-1708
Deepa Deepa , Manjula Gururaj Rao
Millions of people across the world are affected by hearing loss and early detection is very important for effective intervention. The traditional hearing screening methods are effective but they often rely on specialized equipment and clinical resources, making them less accessible to common people. Hearing loss is a state that affects the ability to communicate, socially interact and overall quality of life. The advancements in recent years have aimed to enhance the accessibility and efficiency of hearing tests, mainly in remote areas. The accurate classification of hearing loss is essential for effective detection and treatment in audiology. This study presents a deep learning (DL)-based approach based on a feedforward neural network (FNN). This paper focuses on common causes like cerumen impaction, otitis media, and otosclerosis. The study tries to explore ways to improve the diagnosis of hearing loss. The goal is to develop solutions that make hearing screenings more accessible and cost-effective for populations with limited access to healthcare resources. The results show the advantages of DL models in supporting automated accurate classification of hearing loss for intelligent diagnostic systems in audiological healthcare.
Volume: 15
Issue: 2
Page: 1701-1708
Publish at: 2026-04-01
Show 18 of 2007

Discover Our Library

Embark on a journey through our expansive collection of articles and let curiosity lead your path to innovation.

Explore Now
Library 3D Ilustration