Abstract: Traffic sign recognition (TSR) is one of the most important visual perception tasks for autonomous vehicles. Traffic signs situated far away occupy only a few pixels in the images captured ...
Audio analysis application designed to distinguish between Human-Made and AI-Generated music using custom CNN trained on Mel spectrograms and OpenL3 embeddings. - mtp0326/SynthToSoul ...
Abstract: The identification and analysis of human daily activities has garnered substantial attention in recent years, driven by its expansive applications in areas including healthcare, surveillance ...
Abstract: Handwritten Optical Character Recognition (OCR) for the Tifinagh script remains a challenging task due to the script’s geometric complexity, high similarity among rotated characters, and the ...
Abstract: The rapid progress in audio synthesis technologies allows the generating of artificial audio deep-fakes that sound convincingly real, thus causing security, privacy, and authenticity ...
Abstract: Emotion recognition is essential for improving user experience and interaction quality in human-centered applications. While recent studies have leveraged both event and traditional cameras ...
Abstract: Music genre classification is an important task in music information retrieval with uses in recommendation, indexing and streaming. This paper proposes a CNN-based framework that classifies ...
Abstract: Spiking neural networks (SNNs) are one of the best practices for efficient event-driven object recognition. To achieve high recognition accuracy, existing methods generally accumulate ...
Abstract: Medical image segmentation is increasingly reliant on deep learning techniques, yet the promising performance often come with high annotation costs. This paper introduces Weak-Mamba-UNet, an ...
Abstract: Electroencephalography EEG - based decoding of visual stimuli has gained traction in computational neuroscience and brain-computer interface applications. This study explores the feasibility ...
Abstract: Most existing studies on table tennis training focus on either action recognition or scoring alone, lacking a systematic way to model both tasks together, which limits the ability to provide ...
Abstract: Bronchoscopy is central to diagnosing central lung cancers but remains limited by reliance on operator expertise and variability in visual interpretation. In this work, we adapt and evaluate ...