Katedra Systemów Multimedialnych

Publikacje

Rok 2020

Projekt INZNAK - aktywne znaki drogowe
Publikacja
- A. Czyżewski
- Magazyn Autostrady - Rok 2020
W Politechnice Gdańskiej na Wydziale Elektroniki, Telekomunikacji i Informatyki we współpracy z Akademią Górniczo-Hutniczą w Krakowie i dwiema firmami z województwa pomorskiego (Siled Sp. z o.o. i Microsystems Sp. z o.o.) od 2017 r. realizowany jest projekt badawczy pt. „INZNAK – inteligentne znaki drogowe do adaptacyjnego sterowania ruchem pojazdów, komunikujące się w technologii V2X”. Projekt jest dofinansowywany przez NCBR w...

Pełny tekst do pobrania w serwisie zewnętrznym
O nadjeżdżającej rewolucji w transporcie
Publikacja
- P. Gora
- Pismo PG - Rok 2020
1,3 miliona – tyle osób rocznie na świecie ginie w wypadkach drogowych. Ponad 20 milionów zostaje rannych! 4 miliardy złotych – prawie tyle rocznie tracą kierowcy w 7 największych miastach w Polsce z powodu korków (a są to jedynie szacowane koszty straconego czasu i paliwa, bez uwzględnienia np. negatywnego wpływu na środowisko). Czy możemy coś z tym zrobić?

Pełny tekst do pobrania w portalu
Musical Instrument Tagging Using Data Augmentation and Effective Noisy Data Processing
Publikacja
- D. Koszewski
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2020
Developing signal processing methods to extract information automatically has potential in several applications, for example searching for multimedia based on its audio content, making context-aware mobile applications (e.g., tuning apps), or pre-processing for an automatic mixing system. However, the last-mentioned application needs a significant amount of research to reliably recognize real musical instruments in recordings....

Pełny tekst do pobrania w portalu
Multimedia Communications, Services and Security MCSS. 10th International Conference, MCSS 2020, Preface
Publikacja
- A. Dziech
- W. Mees
- A. Czyżewski
- Rok 2020
Multimedia surrounds us everywhere. It is estimated that only a part of the recorded resources are processed and analyzed. These resources offer enormous opportunities to improve the quality of life of citizens. As a result, of the introduction of a new type of algorithms to improve security by maintaining a high level of privacy protection. Among the many articles, there are examples of solutions for improving the operation of...

Pełny tekst do pobrania w serwisie zewnętrznym
Multifactor consciousness level assessment of participants with acquired brain injuries employing human–computer interfaces
Publikacja
- Biomedical Engineering Online - Rok 2020
Background A lack of communication with people suffering from acquired brain injuries may lead to drawing erroneous conclusions regarding the diagnosis or therapy of patients. Information technology and neuroscience make it possible to enhance the diagnostic and rehabilitation process of patients with traumatic brain injury or post-hypoxia. In this paper, we present a new method for evaluation possibility of communication and the...

Pełny tekst do pobrania w portalu
Microscopic traffic simulation models for connected and automated vehicles (CAVs) – state-of-the-art
Publikacja
- P. Gora
- C. Kartakazas
- A. Drabicki
- F. Islam
- P. Ostaszewski
- Procedia Computer Science - Rok 2020
Research on connected and automated vehicles (CAVs) has been gaining substantial momentum in recent years. However, thevast amount of literature sources results in a wide range of applied tools and datasets, assumed methodology to investigate thepotential impacts of future CAVs traffic, and, consequently, differences in the obtained findings. This limits the scope of theircomparability and applicability and calls for a proper standardization...

Pełny tekst do pobrania w portalu
Investigating Feature Spaces for Isolated Word Recognition
Publikacja
- P. Treigys
- G. Korvel
- G. Tamulevicius
- J. Bernataviciene
- B. Kostek
- Rok 2020
The study addresses the issues related to the appropriateness of a two-dimensional representation of speech signal for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and time-frequency signal representation converted to the investigated feature spaces. In particular, waveforms and fractal dimension features of the signal were chosen for the time domain, and...

Pełny tekst do pobrania w serwisie zewnętrznym
Improving Objective Speech Quality Indicators in Noise Conditions
Publikacja
- K. Kąkol
- G. Korvel
- B. Kostek
- Rok 2020
This work aims at modifying speech signal samples and test them with objective speech quality indicators after mixing the original signals with noise or with an interfering signal. Modifications that are applied to the signal are related to the Lombard speech characteristics, i.e., pitch shifting, utterance duration changes, vocal tract scaling, manipulation of formants. A set of words and sentences in Polish, recorded in silence,...

Pełny tekst do pobrania w serwisie zewnętrznym
Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement
Publikacja
- G. Korvel
- K. Kąkol
- O. Kurasova
- B. Kostek
- IEEE Access - Rok 2020
The Lombard effect is one of the most well-known effects of noise on speech production. Speech with the Lombard effect is more easily recognizable in noisy environments than normal natural speech. Our previous investigations showed that speech synthesis models might retain Lombard-effect characteristics. In this study, we investigate several speech models, such as harmonic, source-filter, and sinusoidal, applied to Lombard speech...

Pełny tekst do pobrania w portalu
Evaluating calibration and robustness of pedestrian detectors
Publikacja
- S. Cygert
- A. Czyżewski
- Rok 2020
In this work robustness and calibration of modern pedestrian detectors are evaluated. Pedestrian detection is a crucial perception com- ponent in autonomous driving and here we study its performance under different image corruptions. Furthermore, we provide analysis of classifi- cation calibration of pedestrian detectors and we show a positive effect of using style-transfer augmentation technique. Our analysis is aimed as a step...

Pełny tekst do pobrania w serwisie zewnętrznym
Employing Subjective Tests and Deep Learning for Discovering the Relationship between Personality Types and Preferred Music Genres
Publikacja
- Electronics - Rok 2020
The purpose of this research is two-fold: (a) to explore the relationship between the listeners’ personality trait, i.e., extraverts and introverts and their preferred music genres, and (b) to predict the personality trait of potential listeners on the basis of a musical excerpt by employing several classification algorithms. We assume that this may help match songs according to the listener’s personality in social music networks....

Pełny tekst do pobrania w portalu
Constructing a Dataset of Speech Recordingswith Lombard Effect
Publikacja
- D. Weber
- S. Zaporowski
- D. Korzekwa
- Rok 2020
Thepurpose of therecordings was to create a speech corpus based on the ISLEdataset, extended with video and Lombard speech. Selected from a set of 165sentences, 10, evaluatedas having thehighest possibility to occur in the context ofthe Lombard effect,were repeated in the presence of the so-called babble speech to obtain Lombard speech features. Altogether,15speakers were recorded, and speech parameterswere...
Comparison of two methods of sound extraction from guitar string video recordings
Publikacja
- M. Zaporowska (dawniej: M. Stefaniak)
- A. Czyżewski
- Rok 2020
A comparison of two sound extraction methods from guitar string video recordings is presented in the paper. A brief overview of highframe rate camera technology and possible applications are included. The method using the image analysis from two such cameras is presented. The cameras are placed at the angle of 90 degrees for recording the image in three planes. The results achieved...
Comparison of sound of organ pipes in contemporary and historical instruments
Publikacja
- Rok 2020
The aim of this research is to examine the differences in the timbre of organ pipes’ sound between a historical and a contemporary organ instrument. The historical instrument is the Oliwa organ from Gdansk, Poland, and the contemporary one is from Kartuzy, Poland. Recordings are made of single notes played by an open labial pipe that belongs to the Principal rank. The analyses and comparison of several sound features compatible...

Pełny tekst do pobrania w serwisie zewnętrznym
Comparing traffic intensity estimates employing passive acoustic radar and microwave Doppler radar sensor
Publikacja
- A. Czyżewski
- Journal of the Acoustical Society of America - Rok 2020
The purpose of our applied research project is to develop an autonomous road sign with built-in radar devices of our design. In this paper, we show that it is possible to calibrate the acoustic vector sensor so that it can be used to measure traffic volume and count the vehicles involved in the traffic through the analysis of the noise emitted by them. Signals obtained from a Doppler radar are used as a reference source. Although...

Pełny tekst do pobrania w serwisie zewnętrznym
Chór wirtualny
Publikacja
- M. Mróz
- B. Mróz
- Rok 2020
Wiosna roku 2020 została zapisana emocjami, które należy zaliczać do tych niepożądanych. Praca on-line stała się jedyną możliwą formą pracy z zespołem. Prekursorem pomysłu wirtualnego chóru był amerykański kompozytor i dyrygent Eric Whitacre. Eric wybrał do wykonania przez chór wirtualny utwory posiadające wspólne cechy. Kolejnym poruszanym zagadnieniem jest stworzenie przestrzennego dźwięku. Technologia na której opiera się dźwięk...

Pełny tekst do pobrania w serwisie zewnętrznym
Automatic Marking of Allophone Boundaries in Isolated English spoken Words
Publikacja
- J. Rafałko
- A. Czyżewski
- Rok 2020
The work presents a method that allows delimiting the borders of allophones in isolated English words. The described method is based on the DTW algorithm combining two signals, a reference signal and an analyzed one. As the reference signal, recordings from the MODALITY database were used, from which the words were extracted. This database was also used for tests, which were described. Test results show that the automatic determination...

Pełny tekst do pobrania w portalu
Audio Feature Analysis for Precise Vocalic Segments Classification in English
Publikacja
- S. Zaporowski
- A. Czyżewski
- Rok 2020
An approach to identifying the most meaningful Mel-Frequency Cepstral Coefficients representing selected allophones and vocalic segments for their classification is presented in the paper. For this purpose, experiments were carried out using algorithms such as Principal Component Analysis, Feature Importance, and Recursive Parameter Elimination. The data used were recordings made within the ALOFON corpus containing audio signal...

Pełny tekst do pobrania w serwisie zewnętrznym
Analyzing the Effectiveness of the Brain–Computer Interface for Task Discerning Based on Machine Learning
Publikacja
- SENSORS - Rok 2020
The aim of the study is to compare electroencephalographic (EEG) signal feature extraction methods in the context of the effectiveness of the classification of brain activities. For classification, electroencephalographic signals were obtained using an EEG device from 17 subjects in three mental states (relaxation, excitation, and solving logical task). Blind source separation employing independent component analysis (ICA) was...

Pełny tekst do pobrania w portalu
Analiza ruchu drogowego z wykorzystaniem analizy akustycznej
Publikacja
- K. Marciniuk
- B. Kostek
- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Rok 2020
Tematyka pracy porusza zagadnienia dotyczące pozyskiwania informacji o ruchu drogowym z wykorzystaniem monitoringu akustycznego. Przybliżono podstawowe techniki nadzoru nad ruchem drogowym. Przedstawiono założenia akustycznego detektora ruchu i zbadano jego skuteczność na trzech płaszczyznach działania – zliczania pojazdów, klasyfikacji rodzajowej i klasyfikacji warunków pogodowych panujących na nawierzchni

Pełny tekst do pobrania w serwisie zewnętrznym
Ambisoniczna mapa wybranych miejsc w Trójmieście
Publikacja
- C. Pietrzak
- P. Odya
- Rok 2020
Projekt miał na celu stworzenie ambisonicznej mapy Trójmiasta w formie aplikacji internetowej. Materiały wideo w technologii 360 z dźwiękiem w postaci sygnału ambisonicznego zostały zarejestrowane w lokalizacjach Trójmiasta, które uznano za charakterystyczne dla tej aglomeracji. Celem badawczym projektu było porównanie dostępnych algorytmów miksowania sygnałów ambisonicznych poprzez przeprowadzenie testów odsłuchowych. Przeprowadzono...

Pełny tekst do pobrania w portalu
Adaptive traffic optimization using Variable Speed Limits; Adaptacyjna optymalizacja ruchu drogowego przy pomocy zmiennych ograniczeń prędkości
Publikacja
- P. Gora
- Rok 2020
Variable speed limits (VSL) is an intelligent transportation system (ITS) solution for traffic management. The speed limits can be changed dynamically in order to adapt to traffic, weather, or road surface conditions. This paper presents an approach for such an adaptive traffic control where the primary goal is to ensure traffic safety and efficiency of the traffic control system (fast response to dynamically changing traffic,...

Pełny tekst do pobrania w serwisie zewnętrznym
A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces
Publikacja
- G. Tamulevicius
- G. Korvel
- A. B. Yayak
- P. Treigys
- J. Bernataviciene
- B. Kostek
- Electronics - Rok 2020
In this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset with the size of more than 10.000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation...

Pełny tekst do pobrania w portalu
1D convolutional context-aware architectures for acoustic sensing and recognition of passing vehicle type
Publikacja
- Rok 2020
A network architecture that may be employed to sensing and recognition of a type of vehicle on the basis of audio recordings made in the proximity of a road is proposed in the paper. The analyzed road traffic consists of both passenger cars and heavier vehicles. Excerpts from recordings that do not contain vehicles passing sounds are also taken into account and marked as ones containing silence....

Rok 2019

Wykorzystanie sieci neuronowych do syntezy mowy wyrażającej emocje
Publikacja
- S. Zaporowski
- Rok 2019
W niniejszym artykule przedstawiono analizę rozwiązań do rozpoznawania emocji opratych na mowie i możliwości ich wykprzystania w syntezie mowy z emocjami stosując do tego celu sieci neuronowe. Wskazano również przydatnośc parametrów typowo stosowanych do rozpoznawania mowy w detekcji emocji w śpiewie i rozróżnianiu tych emocji w obu przypadkach. Przedstawiono aktualne rozwiązania dotyczące rozpoznawania emocji w mowie i metod syntezy...
Wpływ kolorystyki ujęć oraz ścieżki dźwiękowej na emocje widza - wstępne eksperymenty
Publikacja
- D. Weber
- Rok 2019
Brak
Wind Turbines Modeling as the Tool for Developing Algorithms of Processing their Video Recordings
Publikacja
- Rok 2019
In the real world, many factors exist disturbing observation of the examined phenomena and causing various noises and distortions in recorded signals. It very often makes it difficult or even impossible to optimize various signal processing algorithms, through finding appropriate parameters. In this paper, we show an application, that retrieves wind turbine rotor speed from recorded video. Next, we describe the process of reduction...
Wind Turbines Modeling as the Tool for Developing Algorithms of Processing their Video Recordings
Publikacja
- P. Sokolowski
- S. Cygert
- M. Szczodrak
- Rok 2019
In the real world, many factors exist disturbing observation of the examined phenomena and causing various noises and distortions in recorded signals. It very often makes it difficult or even impossible to optimize various signal processing algorithms, through finding appropriate parameters. In this paper, we show an application, that retrieves wind turbine rotor speed from recorded video. Next, we describe the process of reduction...

Pełny tekst do pobrania w portalu
Weryfikacja autentyczności kolorów na zdjęciach wykonanych w technice analogowej
Publikacja
- P. Sokołowski
- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Rok 2019
W artykule opisano zagadnienie odróżniania historycznych fotografii pomiędzy oryginalnie kolorowe a koloryzowane. Rozważono problem doboru zdjęć pod względem technologii, w jakiej zostały wykonane. Następnie wykorzystując sieci neuronowe już w części wyuczone na innych zbiorach danych, sprawdzono ich efektywność w rozwiązywaniu badanego problemu. Rozważono wpływ rozmiaru obrazu podanego na wejściu, architektury zastosowanej sieci,...

Pełny tekst do pobrania w portalu
Vehicle detector training with minimal supervision
Publikacja
- S. Cygert
- A. Czyżewski
- Rok 2019
Recently many efficient object detectors based on convolutional neural networks (CNN) have been developed and they achieved impressive performance on many computer vision tasks. However, in order to achieve practical results, CNNs require really large annotated datasets for training. While many such databases are available, many of them can only be used for research purposes. Also some problems exist where such datasets are not...
Variable length sliding models for banking clients face biometry
Publikacja
- MULTIMEDIA TOOLS AND APPLICATIONS - Rok 2019
An experiment was organized in 100 bank branches to acquire biometric samples from nearly 5000 clients including face images. A procedure for creating face verification models based on continuously expanding database of biometric samples is proposed, implemented, and tested. The presented model applies to circumstances where it is possible to collect and to take into account new biometric samples after each positive verification...

Pełny tekst do pobrania w portalu
Validating data acquired with experimental multimodal biometric system installed in bank branches
Publikacja
- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Rok 2019
An experimental system was engineered and implemented in 100 copies inside a real banking environment comprising: dynamic handwritten signature verification, face recognition, bank client voice recognition and hand vein distribution verification. The main purpose of the presented research was to analyze questionnaire responses reflecting user opinions on: comfort, ergonomics, intuitiveness and other aspects of the biometric enrollment...

Pełny tekst do pobrania w portalu
Unsupervised machine-learning classification of electrophysiologically active electrodes during human cognitive task performance
Publikacja
- K. Saboo
- Y. Varatharajah
- B. M. Berry
- V. Kremen
- M. R. Sperling
- K. A. Davis
- B. C. Jobst
- R. E. Gross
- B. C. Lega
- S. A. Sheth... i 3 innych
- Scientific Reports - Rok 2019
Identification of active electrodes that record task-relevant neurophysiological activity is needed for clinical and industrial applications as well as for investigating brain functions. We developed an unsupervised, fully automated approach to classify active electrodes showing event-related intracranial EEG (iEEG) responses from 115 patients performing a free recall verbal memory task. Our approach employed new interpretable...

Pełny tekst do pobrania w portalu
Subjective tests for gathering konwledge for applaying color grading to video clips automatically
Publikacja
- D. Weber
- B. Kostek
- Rok 2019
The analysis of film music concerning caused emotions may allow for a more accurate adaptation of the color of the film in the context of color grading. Therefore, this paper aims to gather knowledge on the correlation between the applied color palette to a video clip, music associated with a particular shot,and emotions evoked. For that purpose, subjective tests are prepared in which several video clips are presented with...

Pełny tekst do pobrania w serwisie zewnętrznym
Subjective tests for gathering knowledge for applying color grading to video clips automatically
Publikacja
- D. Weber
- B. Kostek
- Rok 2019
The analysis of film music concerning caused emotions may allow for a more accurate adaptation of the color of the film in the context of color grading. Therefore, this paper aims to gather knowledge on the correlation between the applied color palette to a video clip, music associated with a particular shot, and emotions evoked. For that purpose, subjective tests are prepared in which several video clips are presented with or...

Pełny tekst do pobrania w portalu
Style Transfer for Detecting Vehicles with Thermal Camera
Publikacja
- S. Cygert
- A. Czyżewski
- Rok 2019
In this work we focus on nighttime vehicle detection for intelligent traffic monitoring from the thermal camera. To train a Convolutional Neural Network (CNN) detector we create a stylized version of COCO (Common Objects in Context) dataset using Style Transfer technique that imitates images obtained from thermal cameras. This new dataset is further used for fine-tuning of the model and as a result detection accuracy on images...
Speech Analytics Based on Machine Learning
Publikacja
- Rok 2019
In this chapter, the process of speech data preparation for machine learning is discussed in detail. Examples of speech analytics methods applied to phonemes and allophones are shown. Further, an approach to automatic phoneme recognition involving optimized parametrization and a classifier belonging to machine learning algorithms is discussed. Feature vectors are built on the basis of descriptors coming from the music information...

Pełny tekst do pobrania w serwisie zewnętrznym
Special techniques and future perspectives: Simultaneous macro- and micro-electrode recordings
Publikacja
- M. T. Kucewicz
- B. M. Berry
- G. A. Worrell
- Rok 2019
There are many approaches to studying the inner workings of the brain and its highly interconnected circuits. One can look at the global activity in different brain structures using non-invasive technologies like positron emission tomography (PET) or functional magnetic resonance imaging (fMRI), which measure physiological changes, e.g. in the glucose uptake or blood flow. These can be very effectively used to localize active patches...

Pełny tekst do pobrania w serwisie zewnętrznym
Sound engineering as our commitment to its creators in Poland
Publikacja
- B. Kostek
- A. Czyżewski
- Archives of Acoustics - Rok 2019
Sound engineering is an interdisciplinary and rapidly expanding domain. It covers many aspects, such as sound perception, studio and sound mastering technology, music information retrieval including content-based search systems and automatic music transcription frameworks, sound synthesis, sound restoration, electroacoustics, and other ones constituting multimedia technology. Moreover, machine learning methods applied to the topics...

Pełny tekst do pobrania w serwisie zewnętrznym
Relationship between album cover design and music genres.
Publikacja
- A. Dorochowicz
- B. Kostek
- Rok 2019
The aim of the study is to find out whether there exists a relationship between typographic, compositional and coloristic elements of the music album cover design and music contained in the album. The research study involves basic statistical analysis of the manually extracted data coming from the worldwide album covers. The samples represent 34 different music genres, coming from nine countries from around the world. There are...
Recovering Sound Produced by Wind Turbine Structures Employing Video Motion Magnification
Publikacja
- Rok 2019
The recordings were made with a fast video camera and with a microphone. Using fast cameras allowed for observation of the micro vibrations of the object structure. Motion-magnified video recordings of wind turbines on a wind farm were made for the purpose of building a damage prediction system. An idea was to use video to recover sound & vibrations in order to obtain a contactless diagnostic method for wind turbines. The recovered signals...

Pełny tekst do pobrania w serwisie zewnętrznym
Real and Virtual Instruments in Machine Learning – Training and Comparison of Classification Results
Publikacja
- Rok 2019
The continuous growth of the computing power of processors, as well as the fact that computational clusters can be created from combined machines, allows for increasing the complexity of algorithms that can be trained. The process, however, requires expanding the basis of the training sets. One of the main obstacles in music classification is the lack of high-quality, real-life recording database for every instrument with a variety...
Projektowanie oraz implementacja cyfrowego multiefektu gitarowego z wykorzystaniem procesora sygnałowego
Publikacja
- Ł. Pindor
- P. Piesik
- Rok 2019
W artykule został przedstawiony proces projektowania i realizacji cyfrowego multiefektu gitarowego z wykorzystaniem procesora sygnałowegoTMS320C5535 firmy Texas Instruments, dla którego oprogramowanie napisano w języku C. Omówiono zasady działania oraz algorytmy wybranych efektów dźwiękowych, które zostały zaimplementowane w procesorze sygnałowym. Zaprojektowano również uniwersalny moduł wejściowy zawierający wzmacniacz z regulowanym...
Post-comatose patients with minimal consciousness tend to preserve reading comprehension skills but neglect syntax and spelling
Publikacja
- A. Kwiatkowska
- M. Lech
- P. Odya
- A. Czyżewski
- Scientific Reports - Rok 2019
Modern eye tracking technology provides a means for communication with patients suffering from disorders of consciousness (DoC) or remaining in locked-in-state. However, being able to use an eye tracker for controlling text-based contents by such patients requires preserved reading ability in the first place. To our knowledge, this aspect, although of great social importance, so far has seemed to be neglected. In the paper, we...

Pełny tekst do pobrania w portalu
Porównanie klasycznych i ewolucyjnych metod projektowanie dyfuzorów akustycznych w warunkach odsłuchowych symulowanych metodą FDTD
Publikacja
- A. Kurowski
- Rok 2019
W niniejszym rozdzaiel przedstawione zostanie porównanie dwóch podejść do projektowania dyfuzorów akustycznych. Pierwsze z nich bazuje na klasycznych założeniach dotyczących wykorzystywania sekwewncji pseudolosowych. Drugie z nich wykorzystuje automatyczne podejście bazujące na wykorzystaniu algorytmów genetycznych. Porównanie takie pozwala na określenie zalet i wad podejśc klasycznych oraz podejśc bazujących na zastosowaniu algorytmu...
New applications of sound and vision engineering
Publikacja
- A. Czyżewski
- Rok 2019
Multimedia, Sound & Vision Engineering are relatively new fields within the area of science and technology, but teaching and research in this area has been carried out at Gdansk University of Technology (Gdansk, Poland) for nearly 5 decades. Current project carried-out in the Multimedia Systems Department are in the scope of the paper.

Pełny tekst do pobrania w serwisie zewnętrznym
Music signal equalization in a changing environment
Publikacja
- P. Hoffmann
- Rok 2019
The paper presents the concept of an automatic system for music signal correction, considering room frequency response and music genre being played. The proposed algorithm, based on the room frequency response, compensates acoustic conditions surrounding the sound source. Additionally, the compensation process considers the signal content by recognizing music genre. As part of the described research, a series of subjective tests...
Music information retrieval—The impact of technology, crowdsourcing, big data, and the cloud in art.
Publikacja
- B. Kostek
- Journal of the Acoustical Society of America - Rok 2019
The exponential growth of computer processing power, cloud data storage, and crowdsourcing model of gathering data bring new possibilities to music information retrieval (mir) field. Mir is no longer music content retrieval only; the area also comprises the discovery of expressing feelings and emotions contained in music, incorporating other than hearing modalities for helping this issue, users’ profiling, merging music with social...

Pełny tekst do pobrania w portalu
MULTIMODALNE POMIARY DRGAŃ STRUNY
Publikacja
- M. Zaporowska (dawniej: M. Stefaniak)
- W. Nosorowski
- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Rok 2019
W artykule zostały przedstawione badania drgań struny zrealizowane przy użyciu szybkich kamer wizyjnych, mikrofonu oraz akcelerometru. Obiektem badań były instrumenty muzyczne. Opisano zjawiska zachodzące w instrumencie podczas tworzenia się i wydobywania z niego dźwięku. Celem pracy było zbadanie różnic w wynikach otrzymanych poprzez pomiary wykonane z użyciem zróżnicowanych reprezentacji obrazowych i sygnałowych. Zaproponowano...

Pełny tekst do pobrania w portalu
Metoda i system adaptacyjnego sterowania parametrami algorytmu syntezy niskich częstotliwości dźwięków muzycznych
Publikacja
- P. Hoffmann
- Rok 2019
W ostatnich latach można zaobserwować bardzo wyraźny i systematyczny wzrost wykorzystywania urządzeń mobilnych jako środka do odtwarzania muzyki, czy odtwarzania filmów w dowolnych warunkach akustycznych. Ich użytkownicy oczekują przy tym jak najlepszych walorów brzmieniowych dźwięku. W niniejszej rozprawie zostały zaproponowane metody, mające na celu poprawę brzmienia urządzeń mobilnych w zakresie niskich częstotliwości i korekcji...

Pełny tekst do pobrania w portalu

Wyszukiwarka

Katedra Systemów Multimedialnych

Publikacje

Filtry

Kategoria

Rok

Opcje

Katalog Publikacji

Rok 2020

Rok 2019