Abstract: Traffic sign recognition (TSR) is one of the most important visual perception tasks for autonomous vehicles. Traffic signs situated far away occupy only a few pixels in the images captured ...
Audio analysis application designed to distinguish between Human-Made and AI-Generated music using a custom CNN trained on Mel spectrograms and OpenL3 embeddings. - mtp0326/SynthToSoul ...
Abstract: The identification and analysis of human daily activities has garnered substantial attention in recent years, driven by its expansive applications in areas including healthcare, surveillance ...
Abstract: Handwritten Optical Character Recognition (OCR) for the Tifinagh script remains a challenging task due to the script’s geometric complexity, high similarity among rotated characters, and the ...
Abstract: The rapid progress in audio synthesis technologies allows the generation of artificial audio deepfakes that sound convincingly real, thus causing security, privacy, and authenticity ...
Abstract: Emotion recognition is essential for improving user experience and interaction quality in human-centered applications. While recent studies have leveraged both event and traditional cameras ...
Abstract: Music genre classification is an important task in music information retrieval with uses in recommendation, indexing and streaming. This paper proposes a CNN-based framework that classifies ...
Abstract: Spiking neural networks (SNNs) are one of the best practices for efficient event-driven object recognition. To achieve high recognition accuracy, existing methods generally accumulate ...
Abstract: Medical image segmentation is increasingly reliant on deep learning techniques, yet the promising performance often comes with high annotation costs. This paper introduces Weak-Mamba-UNet, an ...
Abstract: Electroencephalography (EEG)-based decoding of visual stimuli has gained traction in computational neuroscience and brain-computer interface applications. This study explores the feasibility ...
Abstract: Most existing studies on table tennis training focus on either action recognition or scoring alone, lacking a systematic way to model both tasks together, which limits the ability to provide ...
Abstract: Bronchoscopy is central to diagnosing central lung cancers but remains limited by reliance on operator expertise and variability in visual interpretation. In this work, we adapt and evaluate ...