Search results for: MUSICAL GENRE RECOGNITION

Search results for: MUSICAL GENRE RECOGNITION

results on page:
embed this view on your website

Filters

total: 935

clear all filters disabled

Recognition of environmentally important ions
Publication
- N. Łukasik
- E. Wagner-Wysiecka
- V. Hubscher-Bruder
- M. Bocheńska
- S. Michel
- Logistyka - Year 2013
..
Comparison of Acoustic and Visual Voice Activity Detection for Noisy Speech Recognition
Publication
- Year 2016
The problem of accurate differentiating between the speaker utterance and the noise parts in a speech signal is considered. The influence of utilizing a voice activity detection in speech signals on the accuracy of the automatic speech recognition (ASR) system is presented. The examined methods of voice activity detection are based on acoustic and visual modalities. The problem of detecting the voice activity in clean and noisy...
Influence of Thermal Imagery Resolution on Accuracy of Deep Learning based Face Recognition
Publication
- Year 2019
Human-system interactions frequently require a retrieval of the key context information about the user and the environment. Image processing techniques have been widely applied in this area, providing details about recognized objects, people and actions. Considering remote diagnostics solutions, e.g. non-contact vital signs estimation and smart home monitoring systems that utilize person’s identity, security is a very important factor....

Full text available to download
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
Publication
- Year 2014
The problem of video framerate and audio/video synchronization in audio-visual speech recognition is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
Deep Learning: A Case Study for Image Recognition Using Transfer Learning
Publication
- S. Erpolat Tasabat
- O. Aydin
- Year 2021
Deep learning (DL) is a rising star of machine learning (ML) and artificial intelligence (AI) domains. Until 2006, many researchers had attempted to build deep neural networks (DNN), but most of them failed. In 2006, it was proven that deep neural networks are one of the most crucial inventions for the 21st century. Nowadays, DNN are being used as a key technology for many different domains: self-driven vehicles, smart cities,...

Full text to download in external service
From Linear Classifier to Convolutional Neural Network for Hand Pose Recognition
Publication
- P. Rościszewski
- Computer Science - Year 2017
Recently gathered image datasets and the new capabilities of high-performance computing systems have allowed developing new artificial neural network models and training algorithms. Using the new machine learning models, computer vision tasks can be accomplished based on the raw values of image pixels instead of specific features. The principle of operation of deep neural networks resembles more and more what we believe to be happening...

Full text available to download
Automatic singing quality recognition employing artificial neural networks
Publication
- P. Żwan
- Archives of Acoustics - Year 2008
Celem artykułu jest udowodnienie możliwości automatycznej oceny jakości technicznej głosów śpiewaczych. Pokrótce zaprezentowano w nim stworzoną bazę danych głosów śpiewaczych oraz zaimplementowane parametry. Przy pomocy sztucznych sieci neuronowych zaprojektowano system decyzyjny, który oceniono w pięciostopniowej skali jakość techniczną głosu. Przy pomocy metod statystycznych udowodniono, że wyniki generowane przez ten system...

Full text available to download
Journal of the Royal Musical Association

Journals

ISSN: 0269-0403 , eISSN: 1471-6933
Journal of the Musical Arts in Africa

Journals

ISSN: 1812-1004 , eISSN: 2070-626X
Greek and Roman Musical Studies

Journals

ISSN: 2212-974X , eISSN: 2212-9758
Recognition of Emotions in Speech Using Convolutional Neural Networks on Different Datasets
Publication
- Electronics - Year 2022
Artificial Neural Network (ANN) models, specifically Convolutional Neural Networks (CNN), were applied to extract emotions based on spectrograms and mel-spectrograms. This study uses spectrograms and mel-spectrograms to investigate which feature extraction method better represents emotions and how big the differences in efficiency are in this context. The conducted studies demonstrated that mel-spectrograms are a better-suited...

Full text available to download
Real-time working gas recognition system based on the array of semiconductor gas sensors and portable computer Raspberry PI
Publication
- Year 2013
The gas-analyzing systems based on the array of partially selective gas sensors and pattern-recognition techniques are potentially fast and low-cost alternative for other devices, like gas analysers. They give the possibility of recognition the type and the concentration of measured volatile compounds in their working environment. In this work we present the implementation of gas recognition system, in which the signals from an...
A Novel IoT-Perceptive Human Activity Recognition (HAR) Approach Using Multi-Head Convolutional Attention
Publication
- H. Zhang
- Z. Xiao
- J. Wang
- F. Li
- E. Szczerbicki
- IEEE Internet of Things Journal - Year 2019
Together with fast advancement of the Internet of Things (IoT), smart healthcare applications and systems are equipped with increasingly more wearable sensors and mobile devices. These sensors are used not only to collect data, but also, and more importantly, to assist in daily activity tracking and analyzing of their users. Various human activity recognition (HAR) approaches are used to enhance such tracking. Most of the existing...

Full text available to download
Improving Traffic Light Recognition Methods using Shifting Time-Windows
Publication
- A. Blokus
- H. Krawczyk
- Year 2018
We propose a novel method of improving algorithms recognizing traffic lights in video sequences. Our focus is on algorithms for applications which notify the driver of a light in sight. Many existing methods process images in the recording separately. Our method bases on the observation that real-life videos depict underlying continuous processes. We named our method FSA (Frame Sequence Analyzed). It is applicable for any underlying...

Full text to download in external service
Unraveling the Interplay between DNA and Proteins: A Computational Exploration of Sequence and Structure-Specific Recognition Mechanisms
Publication
- K. A. Hossain
- Year 2023
My PhD dissertation focused on DNA-protein interactions and the recognition of specific DNA sequences and structures. I discovered that acidic amino acid residues (Asp/Glu) play a crucial role by exhibiting a preference for cytosine. Their contribution to binding affinity depends on nearby cytosines, balancing electrostatic repulsion with specific interactions. Acidic residues act as negative selectors, discouraging non-cytosine...

Full text available to download
Musical Instrument Classification and Duet Analysis Employing Music Information Retrieval Techniques.
Publication
- B. Kostek
- Year 2004
Artykuł przedstawia w sposób przeglądowy prace Katedry Systemów Multimedialnych Politechniki Gdańskiej związane z wyszukiwaniem informacji muzycznej, a w szczególności z klasyfikacją dźwięków instrumentów muzycznych. W opisywanych eksperymentach wykorzystano sztuczne sieci neuronowe.
Musical Metadata Retrieval with Flow Graphs, in Rough Sets and Current Trends in Computing.
Publication
- A. Czyżewski
- B. Kostek
- Year 2004
W pracy opisano metody wyszukiwania muzyki w Internecie w oparciu o opis semantyczny. W eksperymentach wykorzystano opis muzyczny stosowany w bazie CDDB. Zaprezentowano metodę grafów przepływowych zaproponowaną przez Pawlaka.
Optimizing Medical Personnel Speech Recognition Models Using Speech Synthesis and Reinforcement Learning
Publication
- A. Czyżewski
- Journal of the Acoustical Society of America - Year 2023
Text-to-Speech synthesis (TTS) can be used to generate training data for building Automatic Speech Recognition models (ASR). Access to medical speech data is because it is sensitive data that is difficult to obtain for privacy reasons; TTS can help expand the data set. Speech can be synthesized by mimicking different accents, dialects, and speaking styles that may occur in a medical language. Reinforcement Learning (RL), in the...

Full text available to download
Adaptive system for recognition of sounds indicating threats to security of people and property employing parallel processing of audio data streams
Publication
- K. Łopatka
- Year 2015
A system for recognition of threatening acoustic events employing parallel processing on a supercomputing cluster is featured. The methods for detection, parameterization and classication of acoustic events are introduced. The recognition engine is based onthreshold-based detection with adaptive threshold and Support Vector Machine classifcation. Spectral, temporal and mel-frequency descriptors are used as signal features. The...
Karolina Zielińska-Dąbkowska dr inż. arch.

People

Department of Urban Architecture and Waterscapes

Karolina M. Zielinska-Dabkowska, Ph.D., Eng. Arch., M. Arch., is an Assistant Professor at the Faculty of Architecture of Gdańsk University of Technology (GUT). In 2002, she completed her studies of Architecture and Urban Planning at Gdańsk University of Technology (Gdańsk Tech) and in 2004, Architectural Engineering at the University of Applied Sciences and Arts (HAWK) in Hildesheim, Germany. After graduation, she worked for several...
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
Publication
- Year 2014
The problem of video framerate and audio/video synchronization in audio-visual speech recogni-tion is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
Role of cholesterol in substrate recognition by -secretase
Publication
- Scientific Reports - Year 2021
-Secretase is an enzyme known to cleave multiple substrates within their transmembrane domains, with the amyloid precursor protein of Alzheimer’s Disease among the most prominent examples. The activity of -secretase strictly depends on the membrane cholesterol content, yet the mechanistic role of cholesterol in the substrate binding and cleavage remains unclear. In this work, we used all-atom molecular dynamics simulations to examine...

Full text available to download
Viruses, cancer and non-self recognition
Publication
- M. Padariya
- U. Kalathiya
- S. Mikac
- K. Dziubek
- M. Tovar
- E. Sroka
- R. Fahraeus
- A. Sznarkowska
- Open Biology - Year 2021
Full text to download in external service
Face Recognition: Shape versus Texture
Publication
- M. Smiatacz
- Year 2015
This paper describes experiments related to the application of well-known techniques of the texture feature extraction (Local Binary Patterns and Gabor filtering) to the problem of automatic face verification. Results of the tests show that simple image normalization strategy based on the eye center detection and a regular grid of fiducial points outperforms the more complicated approach, employing active models that are able to...

Full text to download in external service
Balance recognition on the basis of EEG measurement.
Publication
- Annals of Computer Science and Information Systems - Year 2016
Although electroencephalography (EEG) is not typically used for verifying the sense of balance, it can be used for analysing cortical signals responsible for this phenomenon. Simple balance tasks can be proposed as a good indicator of whether the sense of balance is acting more or less actively. This article presents preliminary results for the potential of using EEG to balance sensing....

Full text available to download
MPEG-7-based low level descriptor effectiveness in the automatic musical sound classification.
Publication
- Year 2004
Celem referatu jest określenie, które z parametrów opisowych MPEG-7 są najbardziej przydatne w klasyfikacji dźwięków instrumentów muzycznych. Określana jest wysokość dźwięku a następnie wyznaczane są wartości parametrów zawartych w standardzie MPEG-7. Otrzymany wektor parametrów poddawany jest analizie statystycznej w celu wyeliminowania danych nadmiarowych. Do celów automatycznej klasyfikacji i testów zaprojektowano dwa systemy...
Musical instrument sound separation methods supported by artificial nueural network decision system
Publication
- M. Dziubiński
- Year 2006
Rozprawa doktorska (27 czerwica 2006).Celem prowadzonych prac badawczych było opracowanie algorytmów separacji dźwięków instrumentów muzycznych. Dodatkowo dobrano zestaw parametrów tak aby możliwe było wytrenowanie sztucznej sieci neuronowej w celu automatycznego rozpoznawania odseparowanych sygnałów. Zaproponowano również aby algorytm decyzyjny odpowiedzialny za klasyfikacje dźwięków pełnił funkcję automatycznej metody oceny algorytmów...
Automatic recognition of males and females among web browser users based on behavioural patterns of peripherals usage
Publication
- A. Kołakowska
- A. Landowska
- P. Jarmolkowicz
- M. Jarmolkowicz
- K. Sobota
- Internet Research - Year 2016
Purpose The purpose of this paper is to answer the question whether it is possible to recognise the gender of a web browser user on the basis of keystroke dynamics and mouse movements. Design/methodology/approach An experiment was organised in order to track mouse and keyboard usage using a special web browser plug-in. After collecting the data, a number of parameters describing the users’ keystrokes, mouse movements and clicks...

Full text to download in external service
Artur Gańcza dr inż.

People

Department of Marine Electronic Systems

I received the M.Sc. degree from the Gdańsk University of Technology (GUT), Gdańsk, Poland, in 2019. I am currently a Ph.D. student at GUT, with the Department of Automatic Control, Faculty of Electronics, Telecommunications and Informatics. My professional interests include speech recognition, system identification, adaptive signal processing and linear algebra.
The Influence of Selecting Regions from Endoscopic Video Frames on The Efficiency of Large Bowel Disease Recognition Algorithms
Publication
- Year 2012
The article presents our research in the field of the automatic diagnosis of large intestine diseases on endoscopic video. It focuses on the methods of selecting regions of interest from endoscopic video frames for further analysis by specialized disease recognition algorithms. Four methods of selecting regions of interest have been discussed: a. trivial, b. with the deletion of characteristic, endoscope specific additions to the...
Examining Feature Vector for Phoneme Recognition / Analiza parametrów w kontekście automatycznej klasyfikacji fonemów
Publication
- G. Korvel
- B. Kostek
- Year 2017
The aim of this paper is to analyze usability of descriptors coming from music information retrieval to the phoneme analysis. The case study presented consists in several steps. First, a short overview of parameters utilized in speech analysis is given. Then, a set of time and frequency domain-based parameters is selected and discussed in the context of stop consonant acoustical characteristics. A toolbox created for this purpose...
International Journal of Applied Pattern Recognition

Journals

ISSN: 2049-887X , eISSN: 2049-8888
World Research Journal of Pattern Recognition

Journals

ISSN: 2278-8557
International Journal on Document Analysis and Recognition

Journals

ISSN: 1433-2833 , eISSN: 1433-2825
1D convolutional context-aware architectures for acoustic sensing and recognition of passing vehicle type
Publication
- Year 2020
A network architecture that may be employed to sensing and recognition of a type of vehicle on the basis of audio recordings made in the proximity of a road is proposed in the paper. The analyzed road traffic consists of both passenger cars and heavier vehicles. Excerpts from recordings that do not contain vehicles passing sounds are also taken into account and marked as ones containing silence....
Determination of toxic gases based on the responses of a single electrocatalytic sensor and pattern recognition techniques
Publication
- MEASUREMENT SCIENCE & TECHNOLOGY - Year 2014
A response from an electrocatalytic gas sensor contains fingerprint information about the type of gas and its concentration. As a result, a single gas sensor can be used for the determination of different gases. However, information about the type of gas and its concentration is hidden in the unique shape of the current–voltage response and it is quite difficult to explore. One of the ways to get precise information about the measured...

Full text to download in external service
Vowel recognition based on acoustic and visual features
Publication
- Archives of Acoustics - Year 2006
W artykule zaprezentowano metodę, która może ułatwić naukę mowy dla osób z wadami słuchu. Opracowany system rozpoznawania samogłosek wykorzystuje łączną analizę parametrów akustycznych i wizualnych sygnału mowy. Parametry akustyczne bazują na współczynnikach mel-cepstralnych. Do wyznaczenia parametrów wizualnych z kształtu i ruchu ust zastosowano Active Shape Models. Jako klasyfikator użyto sztuczną sieć neuronową. Działanie systemu...

Full text available to download
Recognition, understanding and aestheticization of freehand drawing flowcharts
Publication
- W. Szwoch
- Year 2007
In this paper a concept of FCA, a system for recognizing, understanding and aestheticization of freehand drawing flow charts is described. The system is based on a proposed by the author FlowGram graph grammar describing flow charts drawing. An open format FlowChartML for flow charts description is also proposed. The aestheticization criterion is formulated that allows for automatic beautification of flow charts. First experiments...
Speech recognition system for hearing impaired people.
Publication
- P. Dalka
- A. Czyżewski
- Year 2005
Praca przedstawia wyniki badań z zakresu rozpoznawania mowy. Tworzony system wykorzystujący dane wizualne i akustyczne będzie ułatwiał trening poprawnego mówienia dla osób po operacji transplantacji ślimaka i innych osób wykazujących poważne uszkodzenia słuchu. Active Shape models zostały wykorzystane do wyznaczania parametrów wizualnych na podstawie analizy kształtu i ruchu ust w nagraniach wideo. Parametry akustyczne bazują na...
Acylic congener of cucurbituril: synthesis and recognition properties.
Publication
- C. A. Burnett
- D. Witt
- J. C. Fettinger
- L. Isaacs
- Asian Journal of Organic Chemistry - Year 2003
Zaprezentowano syntezę analogów acyklicznych cucurbiturilu oraz ich zdolności do kompleksowania wybranych 16 amin, dioli, kwasów dikarboksylowych, pochodnych guanidyny oraz pirydyny. Obserwowane tworzenie kompleksów przebiegało około 180 razy słabiej niż dla cucurbiturilu. Wyniki te świadczą o potencjalnych możliwościach zbliżonych do analogów cyklicznych pod względem tworzenia kompleksów i rozpoznawania wyżej wymienionych...
Multimodal Audio-Visual Recognition of Traffic Events
Publication
- Year 2011
Przedstawiono demonstrator systemu wykrywania niebezpiecznych zdarzeń w ruchu drogowym oparty na jednoczesnej analizie danych wizyjnych i akustycznych. System jest częścią systemu automatycznego nadzoru bezpieczeństwa. Wykorzystuje on kamery i mikrofony jako źródła danych. Przedstawiono wykorzystane algorytmy - algorytmy rozpoznawania zdarzeń dźwiękowych oraz analizy obrazu. Zaprezentowano wyniki działania algorytmów na przykładzie...
Gazetteer compression technique based on substructure recognition
Publication
- J. Daciuk
- J. Piskorski
- Year 2006
Automaty skończone są najlepszą formą reprezentacji słowników do przetwarzania języka naturalnego. Przedstawiamy nową technikę kompresji, która jest szczególnie użyteczna w stosunku do pewnego rodzaju słowników. Zastępujemy wielokrotnie występujące podstruktury ich niepowtarzalnymi reprezentantami. Do ich znalezienia traktujemy wektor przejść jako tekst i stosujemy technikę kompresji tekstu w stylu Ziv-Lempel, która znajduje powtórzenia...

Full text to download in external service
Royal Musical Association Research Chronicle

Journals

ISSN: 1472-3808 , eISSN: 2167-4027
Journal of the American Musical Instrument Society

Journals

ISSN: 0362-3300
Information and Communication Technology in Musical Field

Journals

ISSN: 2067-9408 , eISSN: 2069-654X
Revista Internacional de Educacion Musical

Journals

ISSN: 2307-4841
Interactions of telomeric proteins with nucleic acids: sequence recognition on intact and oxidatively damaged telomeres
Publication
- M. Wieczór
- Year 2019
Telomeres are complex nucleoprotein assemblies that play a vital role in the maintenance of functional ends of linear chromosomes. Telomeric DNA, composed of tandem repeats of the 5'-TTAGGG-3' motif, solves the so-called end replication problem: as chromosomes shorten with each cell division, no information is lost, and the telomere can be re-extended. In the cell, many protein factors regulate telomere length, nuclear positioning...

Full text available to download
PATTERN RECOGNITION LETTERS

Journals

ISSN: 0167-8655 , eISSN: 1872-7344
Automatic Watercraft Recognition and Identification on Water Areas Covered by Video Monitoring as Extension for Sea and River Traffic Supervision Systems
Publication
- N. Wawrzyniak
- A. Stateczny
- Polish Maritime Research - Year 2018
The article presents the watercraft recognition and identification system as an extension for the presently used visual water area monitoring systems, such as VTS (Vessel Traffic Service) or RIS (River Information Service). The watercraft identification systems (AIS - Automatic Identification Systems) which are presently used in both sea and inland navigation require purchase and installation of relatively expensive transceivers...

Full text to download in external service
Luminescence recognition material as an INHIBIT logic gate in presence of Pb2+ and Cu2+ ions in aqueous solutions
Publication
- M. Orłowska
- A. Kłonkowski
- J. Jezierska
- J. Ryl
- SENSORS AND ACTUATORS B-CHEMICAL - Year 2013
A recognition material consisting of silica xerogel with amino-modified surface selectively recognizes Pb2+ and Cu2+ (but only in presence of Pb2+ ions) in aqueous solutions of other metal ions. The analytical action of the material is based on a significant change in luminescence emission spectra of the material after chemisorption of Pb2+ ions. In the presence of Pb2+ in octahedral coordination environment, a new broad and strong...

Full text to download in external service

Search

Filters

Catalog

Search results for: MUSICAL GENRE RECOGNITION

Karolina Zielińska-Dąbkowska dr inż. arch.

Artur Gańcza dr inż.