Search results for: SOUND RECOGNITION
-
Automatic sound recognition for security purposes
PublicationIn the paper an automatic sound recognition system is presented. It forms a part of a bigger security system developed in order to monitor outdoor places for non-typical audio-visual events. The analyzed audio signal is being recorded from a microphone mounted in an outdoor place thus a non stationary noise of a significant energy is present in it. In the paper an especially designed algorithm for outdoor noise reduction is presented,...
-
Acceleration of decision making in sound event recognition employing supercomputing cluster
PublicationParallel processing of audio data streams is introduced to shorten the decision making time in hazardous sound event recognition. A supercomputing cluster environment with a framework dedicated to processing multimedia data streams in real time is used. The sound event recognition algorithms employed are based on detecting foreground events, calculating their features in short time frames, and classifying the events with Support...
-
Dangerous sound event recognition using Support Vector Machine classifiers
PublicationA method of recognizing events connected to danger based on their acoustic representation through Support Vector Machine classification is presented. The method proposed is particularly useful in an automatic surveillance system. The set of 28 parameters used in the classifier consists of dedicated parameters and MPEG-7 features. Methods for parameter calculation are presented, as well as a design of SVM model used for classification....
-
Examining Classifiers Applied to Static Hand Gesture Recognition in Novel Sound Mixing System
PublicationThe main objective of the chapter is to present the methodology and results of examining various classifiers (Nearest Neighbor-like algorithm with non-nested generalization (NNge), Naive Bayes, C4.5 (J48), Random Tree, Random Forests, Artificial Neural Networks (Multilayer Perceptron), Support Vector Machine (SVM) used for static gesture recognition. A problem of effective gesture recognition is outlined in the context of the system...
-
Multimodal English corpus for automatic speech recognition
PublicationA multimodal corpus developed for research of speech recognition based on audio-visual data is presented. Besides usual video and sound excerpts, the prepared database contains also thermovision images and depth maps. All streams were recorded simultaneously, therefore the corpus enables to examine the importance of the information provided by different modalities. Based on the recordings, it is also possible to develop a speech...
-
Language material for English audiovisual speech recognition system developmen . Materiał językowy do wykorzystania w systemie audiowizualnego rozpoznawania mowy angielskiej
PublicationThe bi-modal speech recognition system requires a 2-sample language input for training and for testing algorithms which precisely depicts natural English speech. For the purposes of the audio-visual recordings, a training data base of 264 sentences (1730 words without repetitions; 5685 sounds) has been created. The language sample reflects vowel and consonant frequencies in natural speech. The recording material reflects both the...
-
Enhanced voice user interface employing spatial filtration of signals from acoustic vector sensor
PublicationSpatial filtration of sound is introduced to enhance speech recognition accuracy in noisy conditions. An acoustic vector sensor (AVS) is employed. The signals from the AVS probe are processed in order to attenuate the surrounding noise. As a result the signal to noise ratio is increased. An experiment is featured in which speech signals are disturbed by babble noise. The signals before and after spatial filtration are processed...
-
Further developments of parameterization methods of audio stream analysis for secuirty purposes
PublicationThe paper presents an automatic sound recognition algorithm intended for application in an audiovisual security monitoring system. A distributed character of security systems does not allow for simultaneous observation of multiple multimedia streams, thus an automatic recognition algorithm must be introduced. In the paper, a module for the parameterization and automatic detection of audio events is described. The spectral analyses...
-
PHONEME DISTORTION IN PUBLIC ADDRESS SYSTEMS
PublicationThe quality of voice messages in speech reinforcement and public address systems is often poor. The sound engineering projects of such systems take care of sound intensity and possible reverberation phenomena in public space without, however, considering the influence of acoustic interference related to the number and distribution of loudspeakers. This paper presents the results of measurements and numerical simulations of the...
-
The Innovative Faculty for Innovative Technologies
PublicationA leaflet describing Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology. Multimedia Systems Department described laboratories and prototypes of: Auditory-visual attention stimulator, Automatic video event detection, Object re-identification application for multi-camera surveillance systems, Object Tracking and Automatic Master-Slave PTZ Camera Positioning System, Passive Acoustic Radar,...
-
Detection, classification and localization of acoustic events in the presence of background noise for acoustic surveillance of hazardous situations
PublicationEvaluation of sound event detection, classification and localization of hazardous acoustic events in the presence of background noise of different types and changing intensities is presented. The methods for discerning between the events being in focus and the acoustic background are introduced. The classifier, based on a Support Vector Machine algorithm, is described. The set of features and samples used for the training of the...
-
Metoda i algorytmy sterowania procesami miksowania dźwięku za pomocą gestów w oparciu o analizę obrazu wizyjnego
PublicationGłównym celem rozprawy było opracowanie systemu miksowania dźwięku za pomocą gestów rąk wykonywanych w powietrzu oraz zbadanie możliwości oferowanych przez takie rozwiązanie w porównaniu ze współczesną metodą miksowania sygnałów fonicznych, wykorzystującą środowisko komputera. Opracowany system rozpoznaje zarówno dynamiczne jak i statyczne gesty rąk. Rozpoznawanie gestów dynamicznych zrealizowano w oparciu o metody logiki rozmytej...
-
Bożena Kostek prof. dr hab. inż.
People -
Sound signals generated during lapping of technical ceramics using electroplated tools with diamond grains
Open Research DataData contains the recordings of sound generated during single-sided lapping with the use of electroplated diamond tools. This relationship was examined with the use of spectral analysis of the sound signal in the frequency domain with a focus on the Ra parameter of the surface roughness. The estimated sound coefficient increased as the surface roughness...
-
Buzz-based recognition of the honeybee colony circadian rhythm
PublicationHoneybees are one of the highly valued pollinators. Their work as individuals is appreciated for crops pollination and honey production. It is believed that work of an entire bee colony is intense and almost continuous. The goal of the work presented in this paper is identification of bees circadian rhythm with a use of sound-based analysis. In our research as a source of information on bee colony we use their buzz that have been...
-
Verification of the Parameterization Methods in the Context of Automatic Recognition of Sounds Related to Danger
PublicationW artykule opisano aplikację, która automatycznie wykrywa zdarzenia dźwiękowe takie jak: rozbita szyba, wystrzał, wybuch i krzyk. Opisany system składa się z bloku parametryzacji i klasyfikatora. W artykule dokonano porównania parametrów dedykowanych dla tego zastosowania oraz standardowych deskryptorów MPEG-7. Porównano też dwa klasyfikatory: Jeden oparty o Percetron (sieci neuronowe) i drugi oparty o Maszynę wektorów wspierających....
-
Audible noise in ship’s electrical drive systems with frequency converters
Open Research DataThe presented dataset is part of research focusing on the impact of the ship's electrical drive systems with frequency converters on vibrations and the level of audible noise on ships.
-
Data recorded for the purpose of the 3D sound intensity visualization around the organ pipe (des sound)
Open Research DataThe set contains data recorded using the Cartesian robot and multichannel acoustic vector sensor (from Microflown) for the purpose of the 3D sound intensity visualization of radiated acoustic energy around the organ pipe.
-
Audio content analysis in the urban area telemonitoring system
PublicationArtykuł przedstawia możliwości rozwinięcie monitoringu miejskiego o automatyczną analizę dźwięku. Przedstawiono metody parametryzacji dźwięku, które możliwe są do zastosowania w takim systemie oraz omówiono aspekty techniczne implementacji. W kolejnej części przedstawiono system decyzyjny oparty na drzewach zastosowany w systemie. System ten rozpoznaje dźwięki niebezpieczne (strzał, rozbita szyba, krzyk) wśród dźwięków zarejestrowanych...
-
Decoding imagined speech for EEG-based BCI
PublicationBrain–computer interfaces (BCIs) are systems that transform the brain's electrical activity into commands to control a device. To create a BCI, it is necessary to establish the relationship between a certain stimulus, internal or external, and the brain activity it provokes. A common approach in BCIs is motor imagery, which involves imagining limb movement. Unfortunately, this approach allows few commands. As an alternative, this...
-
The AC motor voltage and audible noise waveforms in ship’s electrical drive systems with frequency converters
Open Research DataThe presented dataset is part of research focusing on the impact of the ship's electrical drive systems with frequency converters on vibrations and the level of audible noise on ships.
-
Underwater noise recorded in Hornsund Fjord, Spitsbergen, at the front of the Hans Glacier
Open Research DataThe data includes underwater noise recorded in Hornsund Fjord, Spitsbergen at the front of the Hans Glacier between 05/09/2013 and 30/09/2013. A description of the data is in the article
-
A model of stealth maritime object having some innovative solutions concerning the object form, structure and materials.
Open Research DataThe aim of the project is to work out a model of the stealth maritime object which will have innovative solutions concerning the object form, structure and materials. These solutions should enable a modification of combinations of the object features defining the object stealth characteristics (difficulty of the object detection in the water). It is...
-
Emotions in polish speech recordings
Open Research DataThe data set presents emotions recorded in sound files that are expressions of Polish speech. Statements were made by people aged 21-23, young voices of 5 men. Each person said the following words / nie – no, oddaj - give back, podaj – pass, stop - stop, tak - yes, trzymaj -hold / five times representing a specific emotion - one of three - anger (a),...
-
Detection of the acoustic interferences during AFM operation
Open Research DataAtomic force microscopy is a particularly complicated surface imaging technique due to the large number of factors that affect the quality of the resulting images. They are obviously difficult and sometimes even impossible to control at the same time. One of such factors may even be the seismological location of the building or the influence of mechanical...
-
Mode shapes of a beam and plate with defects, obtained by experimental modal analysis
Open Research DataThe DataSet contains the experimental results of the first mode shape for a beam and a plate.
-
The power spectral density of audible noise and electric disturbances in ship’s electrical drive systems with frequency converters
Open Research DataThe presented dataset is part of research focusing on the impact of the ship's electrical drive systems with frequency converters on vibrations and the level of audible noise on ships.
-
Frequency Response Functions for Sandvik CoroBore 825 XL boring tool, 733 mm
Open Research DataFrequency Response Functions (FRF) obtained for Sandvik CoroBore® 825 XL mounted at WHN 13-15 TOS VARNSDORF CNC machine at the minimum extension of the spindle adaptor. Tool set to bore a hole with a diameter 733mm
-
CTD Gdańsk Deep_2001_2005
Open Research DataDataset includes measurements of conductivity (mS cm-1), temperature (°C), sound speed (m s-1) and salinity (PSU) made with the probe Falmouth Scientific Inc. The research was carried out in 2001 (at the turn of May and June), 2003 (beginning of May), 2005 (end of April) and in 2002 and 2004 (at the turn of September and October). In 2002, only temperature...
-
Recognition of hazardous acoustic events employing parallel processing on a supercomputing cluster . Rozpoznawanie niebezpiecznych zdarzeń dźwiękowych z wykorzystaniem równoległego przetwarzania na klastrze superkomputerowym
PublicationA method for automatic recognition of hazardous acoustic events operating on a super computing cluster is introduced. The methods employed for detecting and classifying the acoustic events are outlined. The evaluation of the recognition engine is provided: both on the training set and using real-life signals. The algorithms yield sufficient performance in practical conditions to be employed in security surveillance systems. The...