Search results for: audit
-
Classification of Music Genres Based on Music Separation into Harmonic and Drum Components. Klasyfikacja gatunków muzycznych wykorzystująca separację instrumentów muzycznych [Music genre classification using separation of musical instruments]
Publication: This article presents a study on music genre classification based on music separation into harmonic and drum components. For this purpose, audio signal separation is executed to extend the overall vector of parameters by new descriptors extracted from harmonic and/or drum music content. The study is performed using the ISMIS database of music files represented by vectors of parameters containing music features. The Support Vector...
-
Examining Classifiers Applied to Static Hand Gesture Recognition in Novel Sound Mixing System
Publication: The main objective of the chapter is to present the methodology and results of examining various classifiers (Nearest Neighbor-like algorithm with non-nested generalization (NNge), Naive Bayes, C4.5 (J48), Random Tree, Random Forests, Artificial Neural Networks (Multilayer Perceptron), Support Vector Machine (SVM)) used for static gesture recognition. A problem of effective gesture recognition is outlined in the context of the system...
-
DAB vs DAB+ Radio Broadcasting: a Subjective Comparative Study
Publication: In the age of digital media, delivering high quality content to consumers is one of the most demanding tasks. There exist numerous broadcasting standards, with different pros and cons, and the DAB/DAB+ (Digital Audio Broadcasting) system is one of the most popular among them. From an engineer’s perspective, efficient resource management under limited bandwidth conditions has always been a challenge. In this paper a subjective quality...
-
Estimation of the short-term predictor parameters of speech under noisy conditions
Publication
-
Genre-Based Music Language Modeling with Latent Hierarchical Pitman-Yor Process Allocation
Publication: In this work we present a new Bayesian topic model: latent hierarchical Pitman-Yor process allocation (LHPYA), which uses hierarchical Pitman-Yor process priors for both word and topic distributions, and generalizes a few of the existing topic models, including the latent Dirichlet allocation (LDA), the bigram topic model and the hierarchical Pitman-Yor topic model. Using such priors allows for integration of n-grams with a topic model,...
-
Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling
Publication: Symbolic pitch modeling is a way of incorporating knowledge about relations between pitches into the process of analyzing musical information or signals. In this paper, we propose a family of probabilistic symbolic polyphonic pitch models, which account for both the “horizontal” and the “vertical” pitch structure. These models are formulated as linear or log-linear interpolations of up to five sub-models, each of which is...
-
New approach for determining the QoS of MP3-coded voice signals in IP networks
Publication: Present-day IP transport platforms being what they are, it will never be possible to rule out conflicts between the available services. The logical consequence of this assertion is the inevitable conclusion that the quality of service (QoS) must always be quantifiable no matter what. This paper focuses on one method to determine QoS. It defines an innovative, simple model that can evaluate the QoS of MP3-coded voice data transported...
-
Sebastian Wilczewski dr inż.
People: Dr inż. Sebastian Wilczewski is employed as an assistant professor (adiunkt) in the Department of Applications of Informatics in Management (Katedra Zastosowań Informatyki w Zarządzaniu), Faculty of Management and Economics. In addition, for over 20 years he has worked in IT companies, where he has held positions ranging from engineer and consultant, through Project Manager, Portfolio Manager and Process Owner, to Team Leader of a team of about 30 IT analysts. The knowledge and experience he has gained...
-
Health Enhancing Physical Activity Policies in Poland: Findings from the HEPA PAT Survey
Publication: Insufficient physical activity (PA) is one of the major risk factors for serious diseases and premature mortality worldwide. Public policies to enhance PA across society are recognized as an effective tool against the problem. This paper presents the results of a comprehensive assessment of the national-level PA policy approach in Poland. A standardized survey of the World Health Organization named the Health-Enhancing Physical Activity Policy...
-
Variable Ratio Sample Rate Conversion Based on Fractional Delay Filter
Publication: In this paper a sample rate conversion algorithm which allows for a continuously changing resampling ratio has been presented. The proposed implementation is based on a variable fractional delay filter which is implemented by means of a Farrow structure. Coefficients of this structure are computed on the basis of fractional delay filters which are designed using the offset window method. The proposed approach allows us to freely...
-
Geospatial Coverage and Signal Quality Measurements of Terrestrial DAB+ Network in Northern Poland
Publication: Modern signal coverage maps are prepared based on industry-standard radio propagation models, which take into account a number of parameters, including: type of antenna, distance from the transmitter, type of terrain, etc. However, such simulations are prone to location-specific inaccuracies, and should be verified with in-situ measurements. This paper presents results of a field test of a terrestrial DAB+ (Digital Audio Broadcasting...
-
Stradar - Multimedia Dispatcher and Teleinformation System for the Border Guard
Publication: Security of national borders requires utilization of multimedia surveillance systems automatically gathering, processing and sharing various data. The paper presents such a system developed for the Maritime Division of the Polish Border Guard within the STRADAR project. The system, apart from providing communication means, gathers data, such as map data from AIS, GPS and radar receivers, videos and photos from camera or audio from...
-
Preventive methods and tools in road design and operation
Publication: Among the many systemic elements of road infrastructure safety management, four types of preventive measures can be distinguished, namely: road safety impact assessment, road safety audit, road network safety management, and road inspections. These measures, as preventive means in the design and operation of roads, should form a coherent system of assessments of designed...
-
Prevention methods and tools in road designing and maintenance
Publication: Among the many systemic elements of road infrastructure safety management, four types of preventive measures can be distinguished, namely: road safety impact assessment, road safety audit, road network safety management, and road inspections. These measures, as preventive means in the design and operation of roads, should form a coherent system of assessments of designed...
-
Modern approaches to file system integrity checking
Publication: One of the means to detect an intruder's activity is to trace all unauthorized changes in a file system. Programs which fulfill this functionality are called file integrity checkers. This paper concerns a modern approach to file system integrity checking. It reviews the architecture of popular systems that are widely used in production environments as well as scientific projects, which not only detect intruders but also take actions to stop their...
-
Low-Level Music Feature Vectors Embedded as Watermarks
Publication: In this paper a method consisting in embedding low-level music feature vectors as watermarks into a musical signal is proposed. First, a review of some recent watermarking techniques and the main goals of development of digital watermarking research are provided. Then, a short overview of parameterization employed in the area of Music Information Retrieval is given. A methodology of non-blind watermarking applied to music-content...
-
Multi-Aspect Quality Assessment Of Mobile Image Classifiers For Companion Applications In The Publishing Sector
Publication: The paper presents the problem of quality assessment of image classifiers used in mobile phones for complimentary companion applications. The advantages of using this kind of application are described, with a Narrator on Demand (NoD) functionality given as one example, where the application plays an audio file related to a book page that is physically in front of the phone's camera. For such a NoD application,...
-
Information meeting for Doctoral School candidates
Events: On 29 June at 11:00 (UTC+2), an information meeting for candidates will be held concerning admissions to the PG Doctoral School for the 2021/2022 academic year. Password-protected access: PhD
-
The concept of corporate social responsibility in insurance services
Publication: The aim of the article is to present the concept of corporate social responsibility (CSR) in insurance services. To this end, the basic expectations of stakeholders and the most important areas of CSR application in insurance institutions were identified. The basic relationship between socially responsible activities carried out by insurance institutions and their image was also presented. Based on...
-
Authentication and authorization in the STRADAR system
Publication: A solution for the authentication and authorization (AA) server in the distributed STRADAR system is presented. The system provides functionality for conducting the operational activities of the Maritime Division of the Border Guard. At the event visualization station (stanowisko wizualizacji zdarzeń, SWZ), it enables presentation of the current and archival situation on a map (AIS, radars), camera images, photos, notes, telephone calls, and files and text messages (SMS)...
-
Context-oriented security model for Internet systems
Publication: The dissertation analyzes service-oriented Internet systems in terms of security. It presents the author's original universal security models, CoRBAC and its extension TCoRBAC, which develop the traditional RBAC model. The analysis takes into account the broadly understood context in which the system operates and the system's level of trust in the user. A method was developed for determining two parameters: trust in the system (λ), which is...
-
Recognition of Emotions in Speech Using Convolutional Neural Networks on Different Datasets
Publication: Artificial Neural Network (ANN) models, specifically Convolutional Neural Networks (CNN), were applied to extract emotions based on spectrograms and mel-spectrograms. This study uses spectrograms and mel-spectrograms to investigate which feature extraction method better represents emotions and how big the differences in efficiency are in this context. The conducted studies demonstrated that mel-spectrograms are a better-suited...
-
Bimodal classification of English allophones employing acoustic speech signal and facial motion capture
Publication: A method for automatic transcription of English speech into the International Phonetic Alphabet (IPA) system is developed and studied. The principal objective of the study is to evaluate to what extent the visual data related to lip reading can enhance recognition accuracy of the transcription of English consonantal and vocalic allophones. To this end, motion capture markers were placed on the faces of seven speakers to obtain lip...
-
A comparative study of English viseme recognition methods and algorithm
Publication: An elementary visual unit, the viseme, is considered in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative study of their effectiveness. In the course of the study an optimal feature vector...
-
INFLUENCE OF DATA NORMALIZATION ON THE EFFECTIVENESS OF NEURAL NETWORKS APPLIED TO CLASSIFICATION OF PAVEMENT CONDITIONS – CASE STUDY
Publication: In recent years automatic classification employing machine learning seems to be in high demand for tele-informatic-based solutions. An example of such solutions are intelligent transportation systems (ITS), in which various factors are taken into account. The subject of the study presented is the impact of data pre-processing and normalization on the accuracy and training effectiveness of artificial neural networks in the case...
-
Detection, classification and localization of acoustic events in the presence of background noise for acoustic surveillance of hazardous situations
Publication: Evaluation of sound event detection, classification and localization of hazardous acoustic events in the presence of background noise of different types and changing intensities is presented. The methods for discerning between the events being in focus and the acoustic background are introduced. The classifier, based on a Support Vector Machine algorithm, is described. The set of features and samples used for the training of the...
-
Loudness Scaling Tests in Hearing Problems Detection
Publication: The number of people using portable audio players has increased significantly over recent years. This implies a rise in the number of people having hearing loss problems. Therefore, there is a need to find appropriate procedures that simplify the process of hearing problem detection. Investigations performed show that audiometric tests may not be sufficient to assess hearing in young people. Contrarily, the obtained results...
-
A comparative study of English viseme recognition methods and algorithms
Publication: An elementary visual unit, the viseme, is considered in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative study of their effectiveness. In the course of the study an optimal feature vector construction...
-
IFE: NN-aided Instantaneous Pitch Estimation
Publication: Pitch estimation is still an open issue in contemporary signal processing research. Nowadays, the growing momentum of machine learning techniques in the data-driven society allows for tackling this problem from a new perspective. This work leverages such an opportunity to propose a refined Instantaneous Frequency and power based pitch Estimator method called IFE. It incorporates deep neural network based pitch estimation...
-
Igor Garnik dr inż.
People: He is a graduate of the Faculty of Electronics of Gdańsk University of Technology (1992). He has been professionally associated with Gdańsk University of Technology since 1997, employed first as an assistant in the Department of Ergonomics and Maintenance of Technical Systems (Zakład Ergonomii i Eksploatacji Systemów Technicznych) at the Faculty of Management and Economics, and then, after obtaining his doctoral degree in 2006, as an assistant professor (adiunkt). In 2009–2015 he served as coordinator of the Baltic Science Festival at the Faculty of Management and Economics....
-
Multimodal Surveillance Based Personal Protection System
Publication: A novel, multimodal approach for automatic detection of abduction of a protected individual, employing a dedicated personal protection device and a city monitoring system, is proposed and overviewed. The solution is based on combining four modalities (signals coming from: Bluetooth, fixed and PTZ cameras, thermal camera, acoustic sensors). The Bluetooth signal is used continuously to monitor the protected person's presence, and in case...
-
Testbed analysis of video and VoIP transmission performance in IEEE 802.11 b/g/n networks
Publication: The aim of the work is to analyze capabilities and limitations of different implementations of IEEE 802.11 technologies (IEEE 802.11 b/g/n), utilized for both video streaming and VoIP calls directed to mobile devices. Our preliminary research showed that results obtained with currently popular simulation tools can be drastically different from those possible in a real-world environment, so, in order to correctly evaluate performance...
-
Multimodal system for diagnosis and polysensory stimulation of subjects with communication disorders
Publication: An experimental multimodal system, designed for polysensory diagnosis and stimulation of persons with impaired communication skills or even non-communicative subjects, is presented. The user interface includes an eye tracking device and EEG monitoring of the subject. Furthermore, the system consists of a device for objective hearing testing and an autostereoscopic projection system designed to stimulate subjects through their...
-
ANALYSIS OF SPEECH SIGNAL PARAMETERS IN THE CONTEXT OF THEIR USEFULNESS IN AUTOMATIC ASSESSMENT OF SINGING EXPRESSION QUALITY
Publication: The work concerns the approach to parameterization for the classification of emotions in singing and a comparison with the classification of emotions in speech. For this purpose, the RAVDESS (Ryerson Audio-Visual Database of Emotional Speech and Song) database of emotionally charged speech and singing was used, containing recordings of professional actors presenting six different emotions. Next, mel-cepstral coefficients (MFCC) and selected descriptors were calculated...
-
A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces
Publication: In this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset of more than 10,000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation...
-
Ranking Speech Features for Their Usage in Singing Emotion Classification
Publication: This paper aims to retrieve speech descriptors that may be useful for the classification of emotions in singing. For this purpose, Mel Frequency Cepstral Coefficients (MFCC) and selected Low-Level MPEG 7 descriptors were calculated based on the RAVDESS dataset. The database contains recordings of emotional speech and singing of professional actors presenting six different emotions. Employing the algorithm of Feature Selection based...
-
Andrzej Czyżewski prof. dr hab. inż.
People: Prof. zw. dr hab. inż. Andrzej Czyżewski is a graduate of the Faculty of Electronics of PG (he completed his master's studies in 1982). He defended his doctoral thesis, on a topic related to digital sound, with distinction at the Faculty of Electronics of PG in 1987. In 1992 he presented his habilitation dissertation entitled „Cyfrowe operacje na sygnałach fonicznych” ("Digital operations on audio signals"). His habilitation colloquium was accepted unanimously in June 1992 at Akademia Górniczo-Hutnicza...
-
Assessing Tram Infrastructure Safety Using the Example of the City of Gdańsk
Publication: Analysis of Gdańsk’s tram network statistics shows that incidents are quite frequent (about 650 within 5 years) and mostly involve collisions and crashes. As well as reducing the tram systems’ efficiency and reliability, incidents have a negative effect on road safety. As Polish cities extend their tram networks, they must also ensure that their existing networks are safe. This is to be achieved by conducting safety assessments....
-
New Applications of Multimodal Human-Computer Interfaces
Publication: Multimodal computer interfaces and examples of their applications to education software and for disabled people are presented. The proposed interfaces include the interactive electronic whiteboard based on video image analysis, an application for controlling computers with gestures and the audio interface for speech stretching for hearing impaired and stuttering people. Application of the eye-gaze tracking system to awareness...
-
Analysis of findings from audits in food industry enterprises
Publication: Data were analyzed from 54 enterprises whose conformity with the implemented ISO 22000 system was confirmed in 2005–2013 by the certification body DNV GL–BA. The most common categories of nonconformities found during certification audits in enterprises belonging to the food chain are presented. The problems occurring during the production process that affect food safety were analyzed. The authors calculated...
-
Architecture Design of a Networked Music Performance Platform for a Chamber Choir
Publication: This paper describes an architecture design process for a Networked Music Performance (NMP) platform for medium-sized conducted music ensembles, based on remote rehearsals of the Academic Choir of Gdańsk University of Technology. The issues of real-time remote communication, in-person music performance, and NMP are described. Three iterative steps defining and extending the architecture of the NMP platform with additional features to...
-
Towards a process based management system for oil port infrastructure in context of insurance
Publication: This article addresses selected methodological aspects of a process based management system based on analysis of hazards and threats and risk evaluation for an oil port infrastructure in the context of insurance. The oil port terminal is regarded as an important system of the critical infrastructure that requires a careful system-oriented approach to deal with integrated aspects of environmental, safety and security management to reduce...
-
Rough Sets Applied to Mood of Music Recognition
Publication: With the growth of accessible digital music libraries over the past decade, there is a need for research into automated systems for searching, organizing and recommending music. Mood of music is considered one of the most intuitive criteria for listeners, thus this work is focused on the emotional content of music and its automatic recognition. The research study presented in this work contains an attempt at music emotion recognition...
-
Subjective and Objective Quality Evaluation Study of BPL-PLC Wired Medium
Publication: This paper presents results of research on the effectiveness of bi-directional voice transmission in a 6 kV mine cable network using BPL-PLC (Broadband over Power Line - Power Line Communication) technology. It concerns both emergency cable state (supply outage with cable shorted at both ends) and loaded with distorted current waveforms. The narrowband (0.5 MHz–15 MHz) and broadband (two different modes, frequency range of 3 MHz–7.5...
-
Study on CPU and RAM Resource Consumption of Mobile Devices using Streaming Services
Publication: Streaming multimedia services have become very popular in recent years, due to the development of wireless networks. With the growing number of mobile devices worldwide, service providers offer dedicated applications that deliver on-demand audio and video content anytime and anywhere. The aim of this study was to compare different streaming services and investigate their impact on the CPU and RAM resources, with respect...
-
Bimodal Emotion Recognition Based on Vocal and Facial Features
Publication: Emotion recognition is a crucial aspect of human communication, with applications in fields such as psychology, education, and healthcare. Identifying emotions accurately is challenging, as people use a variety of signals to express and perceive emotions. In this study, we address the problem of multimodal emotion recognition using both audio and video signals, to develop a robust and reliable system that can recognize emotions...
-
Musical Instrument Identification Using Deep Learning Approach
Publication: The work aims to propose a novel approach for automatically identifying all instruments present in an audio excerpt using sets of individual convolutional neural networks (CNNs) per tested instrument. The paper starts with a review of tasks related to musical instrument identification. It focuses on tasks performed, input type, algorithms employed, and metrics used. The paper starts with the background presentation, i.e., metadata...
-
A nanostructured carbon-reinforced polyisobutylene-based thermoplastic elastomer
Publication
-
Chatbot hallucinations and truth: main strands of the debate and their interpretations
Publication: Generative artificial intelligence (AI) systems are able to create media content by applying machine learning to large amounts of training data. These new data may include text (e.g. Google's Bard, Meta's LLaMa or OpenAI's ChatGPT) as well as visual elements (e.g. Stable Diffusion or OpenAI's DALL-E) and sound (e.g. Microsoft's VALL-E). The degree of sophistication of this content may make it indistinguishable...
-
Problems threatening the safety of children and their parents in road traffic during travel to school, using the example of Przymorze Małe in Gdańsk
Publication: The article describes problems that negatively affect road traffic safety in the Przymorze Małe area, and presents ideas whose implementation should improve the safety of pedestrians, cyclists and motor vehicle drivers moving through this area. The first part presents information on the current state of the analyzed area, describes the educational institutions located here, and also...