total: 3051
displaying 1000 best results
Search results for: AUDIO PROCESSING OBJECTS
-
The generalization of objects representing groups of buildings in the Kartuzy district by simplification operator with the Simplify Building tool - scale 1:10000. Data from OSM.
Open Research Data: The process of automatic generalization is one of the elements of spatial data preparation for the purpose of creating digital cartographic studies. The presented data include a part of the process of generalization of building groups obtained from the OpenStreetMap (OSM) databases [1].
-
Verification of the Parameterization Methods in the Context of Automatic Recognition of Sounds Related to Danger
Publication: The article describes an application that automatically detects sound events such as glass breaking, gunshots, explosions, and screams. The described system consists of a parameterization block and a classifier. The article compares parameters dedicated to this application with standard MPEG-7 descriptors. Two classifiers are also compared: one based on a perceptron (neural networks) and the other based on a support vector machine...
-
Computer vision techniques applied for reconstruction of seafloor 3D images from side scan and synthetic aperture sonars data
Publication: Side Scan Sonar and Synthetic Aperture Sonar are well-known echo signal processing technologies that produce 2D images of the seafloor. Both systems combine a number of acoustic pings to form a high-resolution image of the seafloor. It was shown in numerous papers that 2D images acquired by such systems can be transformed into 3D models of the seafloor surface by an algorithmic approach using intensity information contained in a grayscaled...
-
Framework for Structural Health Monitoring of Steel Bridges by Computer Vision
Publication: The monitoring of a structural condition of steel bridges is an important issue. Good condition of infrastructure facilities ensures the safety and economic well-being of society. At the same time, due to the continuous development, rising wealth of the society and socio-economic integration of countries, the number of infrastructural objects is growing. Therefore, there is a need to introduce an easy-to-use and relatively low-cost...
-
INFLUENCE OF DATA NORMALIZATION ON THE EFFECTIVENESS OF NEURAL NETWORKS APPLIED TO CLASSIFICATION OF PAVEMENT CONDITIONS – CASE STUDY
Publication: In recent years automatic classification employing machine learning seems to be in high demand for tele-informatic-based solutions. Examples of such solutions are intelligent transportation systems (ITS), in which various factors are taken into account. The subject of the study presented is the impact of data pre-processing and normalization on the accuracy and training effectiveness of artificial neural networks in the case...
-
Bimodal classification of English allophones employing acoustic speech signal and facial motion capture
Publication: A method for automatic transcription of English speech into the International Phonetic Alphabet (IPA) is developed and studied. The principal objective of the study is to evaluate to what extent the visual data related to lip reading can enhance recognition accuracy of the transcription of English consonantal and vocalic allophones. To this end, motion capture markers were placed on the faces of seven speakers to obtain lip...
-
IFE: NN-aided Instantaneous Pitch Estimation
Publication: Pitch estimation is still an open issue in contemporary signal processing research. Nowadays, the growing momentum of machine learning applications in a data-driven society allows this problem to be tackled from a new perspective. This work leverages such an opportunity to propose a refined Instantaneous Frequency and power based pitch Estimator method called IFE. It incorporates deep neural network based pitch estimation...
-
Analiza stanu nawierzchni i klas pojazdów na podstawie parametrów ekstrahowanych z sygnału fonicznego [Analysis of road surface condition and vehicle classes based on parameters extracted from the audio signal]
Publication: The aim of the research is to find feature-vector parameters extracted from the audio signal in the context of automatic recognition of road surface condition and vehicle type. First, the influence of weather conditions on the spectral characteristics of the audio signal recorded near passing vehicles is presented. Next, the audio signal was parameterized and a correlation analysis was carried out in order to...
-
Improving listeners' experience for movie playback through enhancing dialogue clarity in soundtracks
Publication: This paper presents a method for improving users' quality of experience through processing of movie soundtracks. The dialogue clarity enhancement algorithms were introduced for detecting dialogue in movie soundtrack mixes and then for amplifying the dialogue components. The front channel signals (left, right, center) are analyzed in the frequency domain. The selected partials in the center channel signal, which yield high disparity...
-
Visual Data Encryption for Privacy Enhancement in Surveillance Systems
Publication: In this paper a methodology for employing reversible visual encryption of data is proposed. The developed algorithms are focused on privacy enhancement in distributed surveillance architectures. First, the motivation for the study and a short review of preexisting methods of privacy enhancement are presented. The algorithmic background, system architecture along with a solution for anonymization of sensitive regions of interest...
-
Environmental Protection in Energetics, PG_00049751, W, PE-ET, sem.1, winter 2023/24
e-Learning Courses: Environmental aspects of energy production and processing.
-
Environmental Protection in Energetics (PG_00049751), W, PE-ET, sem.1, winter 2023/24
e-Learning Courses: Environmental aspects of energy production and processing.
-
Examining Acoustic Emission of Engineered Ultrasound Loudspeakers
Publication: Measurement results of the sound emitted from an ultrasound custom-made system with high spatial directivity are presented. The proposed system uses modulated ultrasound waves which demodulate in a nonlinear medium, resulting in audible sound. The system is aimed at enhancing the users’ personal audio space, therefore the measurements are performed using the Head and Torso Simulator which provides the realistic reproduction of...
-
Surface Science - 2021/2022
e-Learning Courses: The goal of the subject is the presentation of basic problems resulting from the existence of an interface between material objects and their surroundings. Discussion of the consequences arising from the existence of surface energy. Analysis of possible applications of surface phenomena in technology. Understanding of problems and benefits resulting from decreasing dimensions of objects with special emphasis on the semiconductor band...
-
A concept of Signal Equalization Method Based on Music Genre and the Listener's Room Characteristics
Publication: A research study that investigates the influence of the room acoustics environment on the frequency characteristic of the audio signal playback is presented. First, a novel spectral equalization method of the room acoustic conditions is introduced. On the basis of the frequency response of the room, a system for room acoustics compensation based on an eight-band equalizer is proposed. The system settings depend on music genre. In...
-
Damage localisation in a stiffened plate structure using a propagating wave
Publication: The paper presents an application of changes in propagating waves for damage detection in a stiffened aluminium plate. The experimental investigation was conducted on an aluminium plate with two riveted L-shaped stiffeners. The wave was excited with a piezoelectric transducer and measured with the Laser Scanning Doppler Vibrometer. Recorded signals were analysed using special signal processing techniques developed for damage...
-
Measurements and Simulations of Engineered Ultrasound Loudspeakers
Publication: Simulation and measurement results of the sound emitted from an ultrasound custom-made system with high spatial directivity are presented. The proposed system uses modulated ultrasound waves which demodulate in a nonlinear medium, resulting in audible sound. The system is aimed at enhancing the users’ personal audio space, therefore the measurements are performed using the Head and Torso Simulator which provides realistic reproduction...
-
A Method of Object Re-identification Applicable to Multicamera Surveillance Systems
Publication: The paper addresses some challenges pertaining to the methods for tracking of objects in multi-camera systems. The tracking methods related to a single Field of Vision (FOV) are quite different from inter-camera tracking, especially in case of non-overlapping FOVs. In this case, the processing is directed to determine the probability of a particular object’s identity seen in a pair of cameras in the presence of places non-observed...
-
Intelligent multimedia solutions supporting special education needs.
Publication: The role of computers in school education is briefly discussed. The history of multimodal interface development is briefly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality....
-
Bimodal deep learning model for subjectively enhanced emotion classification in films
Publication: This research delves into the concept of color grading in film, focusing on how color influences the emotional response of the audience. The study commenced by recalling state-of-the-art works that process audio-video signals and associated emotions by machine learning. Then, assumptions of subjective tests for refining and validating an emotion model for assigning specific emotional labels to selected film excerpts were presented....
-
Environmental Protection in Energetics (PG_00049751), W, ET, sem.1, winter 2024/2025
e-Learning Courses: Environmental aspects of energy production and processing.
-
Usage of parametric echosounder with emphasis on buried object searching.
Publication: The purpose of this article is to present the results of an investigation into searching for buried objects. The paper will contain echograms and other means of visualization from a buried pipe placed between the area of Władysławowo and a gas platform and interesting in terms of the number of small and medium-sized unidentified objects found in the muddy bottom at different depths localized in the Gulf of Puck - results will be presented also...
-
Comparative study on the effectiveness of various types of road traffic intensity detectors
Publication: Vehicle detection and speed measurements are crucial tasks in traffic monitoring systems. In this work, we focus on several types of electronic sensors, operating on different physical principles in order to compare their effectiveness in real traffic conditions. Commercial solutions are based on road tubes, microwave sensors, LiDARs, and video cameras. Distributed traffic monitoring systems require a high number of monitoring...
-
Wow defect reduction based on interpolation techniques
Publication: The paper presents the results of a study of various interpolation techniques used in wow reduction. The study used linear interpolation, two polynomial interpolation techniques (Hermite and spline), and summation of windowed sinc functions. Reconstruction quality was examined using an artificially prepared audio signal reconstructed with the listed interpolation methods. Reconstruction quality was assessed using...
-
Geometry Modeling and Processing
Conferences
-
Symposium on Geometry Processing
Conferences
-
Przetwarzanie rozproszone [Distributed Processing]
e-Learning Courses: Foundations and rules of distributed and parallel processing in networked computer systems.
-
Transmitting Alarm Information in DAB+ Broadcasting System
Publication: The main goal of digital broadcasting is to deliver high-quality content with the lowest possible bitrate. This paper is focused on transmitting alarm information, such as emergency warning and alerting, in the DAB+ (Digital Audio Broadcasting plus) broadcasting system. These additional services should be available at the lowest possible bitrate, in order to provide a clear and understandable voice message to people. Furthermore, additional...
-
Influence of Low-Level Features Extracted from Rhythmic and Harmonic Sections on Music Genre Classification
Publication: We present a comprehensive evaluation of the influence of 'harmonic' and rhythmic sections contained in an audio file on automatic music genre classification. The study is performed using the ISMIS database composed of music files, which are represented by vectors of acoustic parameters describing low-level music features. Non-negative Matrix Factorization serves for blind separation of instrument components. Rhythmic components...
-
Dynamic mass measurement in checkweighers using a discrete time-variant low-pass filter
Publication: Conveyor belt type checkweighers are complex mechanical systems consisting of a weighing sensor (strain gauge load cell, electrodynamically compensated load cell), packages (of different shapes, made of different materials) and a transport system (motors, gears, rollers). Disturbances generated by the vibrating parts of such a system are reflected in the signal power spectra in the form of strong spectral peaks, located usually in...
-
Statistically efficient smoothing algorithm for time-varying frequency estimation
Publication: The problem of extraction/elimination of a nonstationary sinusoidal signal from noisy measurements is considered. This problem is usually solved using adaptive notch filtering (ANF) algorithms. It is shown that the accuracy of frequency estimates can be significantly increased if the results obtained from ANF are backward-time filtered by an appropriately designed lowpass filter. The resulting adaptive notch smoothing (ANS) algorithm...
-
The instantaneous frequency rate spectrogram
Publication: An accelerogram of the instantaneous phase of signal components referred to as an instantaneous frequency rate spectrogram (IFRS) is presented as a joint time-frequency distribution. The distribution is directly obtained by processing the short-time Fourier transform (STFT) locally. A novel approach to amplitude demodulation based upon the reassignment method is introduced as a useful by-product. Additionally, an estimator of energy...
-
Extraction of stable foreground image regions for unattended luggage detection
Publication: A novel approach to detection of stationary objects in the video stream is presented. Stationary objects are those separated from the static background, but remaining motionless for a prolonged time. Extraction of stationary objects from images is useful in automatic detection of unattended luggage. The proposed algorithm is based on detection of image regions containing foreground image pixels having stable values in time and...
-
Spectral measurement of birefringence using particle swarm optimization analysis
Publication: The measurement of birefringence is useful for the examination of both technical and biological objects. One of the main problems is that the polarization state of light in birefringent media changes periodically. Without the knowledge of the period number, the birefringence of a given medium cannot be determined reliably. We propose to analyse the spectrum of light in order to determine the birefringence. We use a Particle Swarm...
-
Method for the correlation coefficient estimation of the bottom echo signal in the shallow water application using interferometric echo sounder
Publication: The article presents a new method for the assessment of bottom echo correlation coefficient in the presence of multiple echoes. Bottom correlation coefficient is a parameter that characterizes spatial properties of echo signal. Large variability of the bottom shape or properties (for example caused by the presence of bottom objects) and the presence of the acoustic shadow strongly influence the value of the correlation coefficient....
-
Identification of Emotional States Using Phantom Miro M310 Camera
Publication: The purpose of this paper is to present the possibilities associated with the use of remote sensing methods in identifying human emotional states, and to present the results of the research conducted by the authors in this field. The studies presented involved the use of advanced image analysis to identify areas on the human face that change their activity along with emotional expression. Most of the research carried out in laboratories...
-
Network and Operating System Support for Digital Audio and Video (Network and OS Support for Digital A/V)
Conferences
-
Rough Sets Applied to Mood of Music Recognition
Publication: With the growth of accessible digital music libraries over the past decade, there is a need for research into automated systems for searching, organizing and recommending music. Mood of music is considered as one of the most intuitive criteria for listeners, thus this work is focused on the emotional content of music and its automatic recognition. The research study presented in this work contains an attempt at music emotion recognition...
-
English Language Learning Employing Developments in Multimedia IS
Publication: In the realm of the development of information systems related to education, integrating multimedia technologies offers novel ways to enhance foreign language learning. This study investigates audio-video processing methods that leverage real-time speech rate adjustment and dynamic captioning to support English language acquisition. Through a mixed-methods analysis involving participants from a language school, we explore the impact...
-
Exception handling model influence factors for distributed systems. In: Proceedings. PPAM 2003. Parallel Processing and Applied Mathematics. 5th International Conference. Częstochowa, 7-10 September 2003. Model obsługi wyjątków uwzględniający wpływ czynników systemu rozproszonego.
Publication: Program specification is clearly defined in sequential systems, where it has standard and exceptional transitions. The paper presents an extended system specification model for distributed environments that takes into account a number of specific factors. The model includes an analysis of the specification with respect to exception handling for distributed data and interprocessor communication. The general model was implemented in the environment...
-
International Journal of Image Processing and Visual Communication
Journals
-
Building Knowledge for the Purpose of Lip Speech Identification
Publication: Consecutive stages of building knowledge for automatic lip speech identification are shown in this study. The main objective is to prepare audio-visual material for phonetic analysis and transcription. First, approximately 260 sentences of natural English were prepared taking into account the frequencies of occurrence of all English phonemes. Five native speakers from different countries read the selected sentences in front of...
-
Generalized adaptive notch smoothers for real-valued signals and systems
Publication: Systems with quasi-periodically varying coefficients can be tracked using the algorithms known as generalized adaptive notch filters (GANFs). GANF algorithms can be considered an extension, to the system case, of classical adaptive notch filters (ANFs). We show that estimation accuracy of the existing algorithms, as well as their robustness to the choice of design parameters, can be considerably improved by means of compensating...
-
Krzysztof Piotr Okarma prof. dr hab. inż.
People: Scientific and teaching specialties: applied computer science; image processing and analysis; computer vision; machine vision in automation and robotics; signal processing; numerical methods and computational techniques. Scientific degrees, positions, and academic titles: 29.02.2024 - professor (engineering and technology - disciplines: automation, electronics, electrical engineering and space technologies; information and communication...
-
Endoscopic Videos Deinterlacing and On-Screen Text and Light Flashes Removal and Its Influence on Image Analysis Algorithms' Efficiency
Publication: In this article, methods for deinterlacing endoscopic video images and for removing on-screen text and light flashes from them are discussed. The research is intended to improve disease recognition algorithms' performance. In the article, four configurations of deinterlacing methods and another four configurations of text and flashes removal methods are described and examined. The efficiency of endoscopic video analysis algorithms is measured...
-
Iwona Kochańska dr hab. inż.
People: Iwona Kochańska is a graduate of the Faculty of Electronics, Telecommunications and Informatics, Gdansk University of Technology. She received an M.Sc. degree in Automation Control and Robotics, specializing in the control of mobile objects. In 2012 she received a Ph.D. degree in hydroacoustics. In the same year she started working as an assistant professor in the Department of Marine Electronic Systems. The main area of interest is hydroacoustics, ...
-
Shore Construction Detection by Automotive Radar for the Needs of Autonomous Surface Vehicle Navigation
Publication: Autonomous surface vehicles (ASVs) are becoming more and more popular for performing hydrographic and navigational tasks. One of the key aspects of autonomous navigation is the need to avoid collisions with other objects, including shore structures. During a mission, an ASV should be able to automatically detect obstacles and perform suitable maneuvers. This situation also arises in near-coastal areas, where shore structures like...
-
Postprodukcja nagrania wideo z dźwiękiem dookólnym [Post-production of a video recording with surround sound]
Publication: One of the aims of this paper is to present issues related to audio-video correlation. This is presented on the basis of a short film realization employing surround microphone techniques. First, some related works in the domain of sound and vision correlation are presented. Then assumptions concerning scene creation related to both audio and video are briefly described. Another objective is to discuss results of subjective tests...
-
Frequency Guided Generalized Adaptive Notch Filtering - Tracking Analysis and Optimization
Publication: Generalized adaptive notch filters (GANFs) are estimators of coefficients of quasi-periodically time-varying systems, encountered e.g., in RF applications when the Doppler effect takes place. Current state-of-the-art GANFs can deliver highly accurate estimates of system variations’ frequency, but underperform in terms of accuracy of system coefficient estimates. The paper proposes a novel multistage GANF with improved coefficient...
-
1D convolutional context-aware architectures for acoustic sensing and recognition of passing vehicle type
Publication: A network architecture that may be employed for sensing and recognizing the type of a vehicle on the basis of audio recordings made in the proximity of a road is proposed in the paper. The analyzed road traffic consists of both passenger cars and heavier vehicles. Excerpts from recordings that do not contain sounds of passing vehicles are also taken into account and marked as ones containing silence....