Search results for: ARCHIWIZACJA AUDIO-WIDEO - Bridge of Knowledge

  • Emotions in Polish speech recordings

    Open Research Data
    open access

    The data set presents emotions recorded in sound files containing Polish speech. The statements were made by five men aged 21-23 (young voices). Each person said the following words: nie (no), oddaj (give back), podaj (pass), stop (stop), tak (yes), trzymaj (hold), five times, representing a specific emotion, one of three: anger (a),...

  • Musical Instrument Identification Using Deep Learning Approach

    Publication

    - SENSORS - Year 2022

    The work aims to propose a novel approach for automatically identifying all instruments present in an audio excerpt using sets of individual convolutional neural networks (CNNs) per tested instrument. The paper starts with a review of tasks related to musical instrument identification. It focuses on tasks performed, input type, algorithms employed, and metrics used. The paper starts with the background presentation, i.e., metadata...

    Full text available to download

  • Architecture Design of a Networked Music Performance Platform for a Chamber Choir

    This paper describes an architecture design process for a Networked Music Performance (NMP) platform for medium-sized conducted music ensembles, based on remote rehearsals of the Academic Choir of Gdańsk University of Technology. The issues of real-time remote communication, in-person music performance, and NMP are described. Three iterative steps defining and extending the architecture of the NMP platform with additional features to...

    Full text to download in external service

  • Study on CPU and RAM Resource Consumption of Mobile Devices using Streaming Services

    Publication

    Streaming multimedia services have become very popular in recent years, due to the development of wireless networks. With the growing number of mobile devices worldwide, service providers offer dedicated applications that deliver on-demand audio and video content anytime and everywhere. The aim of this study was to compare different streaming services and investigate their impact on the CPU and RAM resources, with respect...

    Full text to download in external service

  • Bimodal Emotion Recognition Based on Vocal and Facial Features

    Emotion recognition is a crucial aspect of human communication, with applications in fields such as psychology, education, and healthcare. Identifying emotions accurately is challenging, as people use a variety of signals to express and perceive emotions. In this study, we address the problem of multimodal emotion recognition using both audio and video signals, to develop a robust and reliable system that can recognize emotions...

    Full text available to download

  • Subjective and Objective Quality Evaluation Study of BPL-PLC Wired Medium

    Publication

    - Elektronika Ir Elektrotechnika - Year 2020

    This paper presents results of research on the effectiveness of bi-directional voice transmission in a 6 kV mine cable network using BPL-PLC (Broadband over Power Line - Power Line Communication) technology. It concerns both emergency cable state (supply outage with cable shorted at both ends) and loaded with distorted current waveforms. The narrowband (0.5 MHz–15 MHz) and broadband (two different modes, frequency range of 3 MHz–7.5...

    Full text available to download

  • Rough Sets Applied to Mood of Music Recognition

    Publication

    - Year 2016

    With the growth of accessible digital music libraries over the past decade, there is a need for research into automated systems for searching, organizing and recommending music. Mood of music is considered one of the most intuitive criteria for listeners, thus this work is focused on the emotional content of music and its automatic recognition. The research study presented in this work contains an attempt at music emotion recognition...

  • Halucynacje chatbotów a prawda: główne nurty debaty i ich interpretacje (Chatbot hallucinations and truth: main currents of the debate and their interpretations)

    Publication
    • J. Kreft
    • M. Boguszewicz-Kreft
    • B. Cyrek

    - Roczniki Nauk Społecznych - Year 2024

    Generative artificial intelligence (AI) systems are able to create media content by applying machine learning to large amounts of training data. This new data can include text (e.g., Google's Bard, Meta's LLaMa, or OpenAI's ChatGPT) as well as visual elements (e.g., Stable Diffusion or OpenAI's DALL-E) and sound (e.g., Microsoft's VALL-E). The degree of sophistication of this content can make it indistinguishable...

    Full text available to download

  • Studenci dla Ukrainy! Koncert charytatywny w AK Kwadratowa (Students for Ukraine! Charity concert at AK Kwadratowa)

    Events

    24-04-2022 17:00 - 24-04-2022 20:00

    The Student Government of Gdańsk University of Technology invites you to a special charity concert, “Studenci dla Ukrainy” (Students for Ukraine), performed by 12 artists from the Tricity.

  • Video recordings of bees at entrance to hives

    Open Research Data
    open access - series: Bees

    Video recordings of bees at entrance to hives from 2017-04-22, 2017-04-23 and 2018-05-22. All recordings were made using a hand-held full HD camera (Samsung Galaxy S3) and encoded using the H.264 video codec (Standard Baseline Profile for mov files from 2017, High Profile for mp4 files from 2018), 30 FPS and bit rate 14478 kb/s (mov files from 2017) or 16869 kb/s...

  • Automatic Breath Analysis System Using Convolutional Neural Networks

    Publication

    Diseases related to the human respiratory system have always been a burden for the entire society. The situation has become particularly difficult now after the outbreak of the COVID-19 pandemic. Even now, however, it is common for people to consult their doctor too late, after the disease has developed. To protect patients from severe disease, it is recommended that any symptoms disturbing the respiratory system be detected as...

    Full text to download in external service

  • Automatic Breath Analysis System Using Convolutional Neural Networks

    Publication

    Diseases related to the human respiratory system have always been a burden for the entire society. The situation has become particularly difficult now after the outbreak of the COVID-19 pandemic. Even now, however, it is not uncommon for people to consult their doctor too late, after the disease has developed. To protect patients from severe disease, it is recommended that any symptoms disturbing the respiratory system be detected...

    Full text to download in external service

  • A Review of Emotion Recognition Methods Based on Data Acquired via Smartphone Sensors

    Publication

    In recent years, emotion recognition algorithms have achieved high efficiency, allowing the development of various affective and affect-aware applications. This advancement has taken place mainly in the environment of personal computers offering the appropriate hardware and sufficient power to process complex data from video, audio, and other channels. However, the increase in computing and communication capabilities of smartphones,...

    Full text available to download

  • Broadening the scope of measurement and analysis of vibrations of an organ pipe employing intensity probe, simulations, and highspeed camera

    Publication

    This paper shows an integrated approach to measure, analyze, and model phenomena occurring in an organ pipe driven by pressurized air. The aim of this paper is two-fold, i.e., to measure the pressure signal and the intensity field around the mouth by means of an intensity probe and to visualize and observe the motion of the air jet, which represents the excitation mechanism of the system. This is realized through two techniques,...

    Full text to download in external service

  • TRANSPORT POSSIBILITY FOR MPEG-4/AVC- AND MPEG-2-ENCODED VIDEO DATA IN IPTV: A COMPARISON STUDY

    Publication

    - Year 2013

    IPTV (Television over IP) is a modern service with a great potential to expand. It uses the IP transport platform, that is already in worldwide operation. At the time of writing, two techniques are used to transport the video and audio data of IPTV: MPEG-2 TS and Native RTP. The two techniques quite definitely have an influence on both quality of service (QoS) and quality of experience (QoE). This paper sets out to demonstrate...

  • Smart Virtual Bass Synthesis Algorithm Based on Music Genre Classification

    Publication

    The aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The proposed algorithm employed automatic music genre recognition to determine the optimum parameters for the synthesis of additional frequencies. The synthesis was carried out using the non-linear device (NLD) and phase vocoder (PV) methods depending on the music excerpt genre. Classification of musical...

  • Speech Analytics Based on Machine Learning

    Publication

    In this chapter, the process of speech data preparation for machine learning is discussed in detail. Examples of speech analytics methods applied to phonemes and allophones are shown. Further, an approach to automatic phoneme recognition involving optimized parametrization and a classifier belonging to machine learning algorithms is discussed. Feature vectors are built on the basis of descriptors coming from the music information...

    Full text to download in external service

  • Personalized avatar animation for virtual reality

    Publication

    - Year 2008

    The paper presents a method for creating a personalized avatar animation for virtual reality applications such as multiplayer on-line games. Animation is stored in a simplified version, containing only keyframes for important avatar poses. This version defines key movements, i.e. roughly describes the avatar's action. Animation is enriched by the user with new motion phases utilizing fuzzy descriptors. Various degrees of motion...

  • e-lecture "Fizyk pod wodą" (A Physicist Underwater) - Brygida Mielewska (FTiMS)

    e-Learning Courses
    • B. Mielewska

    The course contains lecture material entitled "Fizyk pod wodą" (A Physicist Underwater) on the physical and biophysical aspects of diving. The lecture supplements the content of the "Biophysics" course and can also serve as stand-alone popular-science material that requires no specialist knowledge. The course includes a three-part audio lecture in SCORM format, supporting materials for note-taking, and short thematic quizzes for each part. To use the full...

  • SYNAT Music Genre Parameters PCA 19

    Open Research Data

    The dataset contains feature vectors obtained after performing Principal Component Analysis (PCA), so there are 11 music genres and 19-element vectors derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier research studies carried out by the team of authors [1-6]. A collection of 52532 music excerpts described...

  • SYNAT_PCA_48

    Open Research Data

    There is a series of datasets containing feature vectors derived from music tracks. The dataset contains 51582 music tracks (22 music genres) and feature vectors obtained after performing Principal Component Analysis (PCA), so there are 48-element vectors derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier...

  • SYNAT_PCA_11

    Open Research Data

    The dataset contains 51582 music tracks (22 music genres) and feature vectors obtained after performing Principal Component Analysis (PCA), so there are 11-element vectors derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier research studies carried out by the team of authors [1-6]. A collection of more than...

  • Creating a Remote Choir Performance Recording Based on an Ambisonic Approach

    Publication

    The aim of this paper is three-fold. First, the basics of binaural and ambisonic techniques are briefly presented. Then, details related to audio-visual recordings of a remote performance of the Academic Choir of the Gdańsk University of Technology are shown. Due to the COVID-19 pandemic, artists had a choice, namely, to stay at home and not perform or stay at home and perform. In fact, staying at home brought in the possibility...

    Full text available to download

  • Comparing traffic intensity estimates employing passive acoustic radar and microwave Doppler radar sensor

    The purpose of our applied research project is to develop an autonomous road sign with built-in radar devices of our design. In this paper, we show that it is possible to calibrate the acoustic vector sensor so that it can be used to measure traffic volume and count the vehicles involved in the traffic through the analysis of the noise emitted by them. Signals obtained from a Doppler radar are used as a reference source. Although...

    Full text to download in external service

  • Study Analysis of Transmission Efficiency in DAB+ Broadcasting System

    Publication

    - Year 2018

    DAB+ is a very innovative and universal multimedia broadcasting system. Thanks to its updated multimedia technologies and metadata options, digital radio keeps pace with changing consumer expectations and the impact of media convergence. Analog and digital radio broadcasting differ in the devices used on both the transmitting and receiving sides, as well as in content processing mechanisms. However, the biggest difference is...

    Full text available to download

  • Language material for English audiovisual speech recognition system development. Materiał językowy do wykorzystania w systemie audiowizualnego rozpoznawania mowy angielskiej

    Publication

    - Year 2013

    The bi-modal speech recognition system requires a 2-sample language input for training and for testing algorithms which precisely depicts natural English speech. For the purposes of the audio-visual recordings, a training data base of 264 sentences (1730 words without repetitions; 5685 sounds) has been created. The language sample reflects vowel and consonant frequencies in natural speech. The recording material reflects both the...

  • Akustyczna analiza parametrów ruchu drogowego z wykorzystaniem informacji o hałasie oraz uczenia maszynowego (Acoustic analysis of road traffic parameters using noise information and machine learning)

    Publication

    - Year 2018

    The aim of the dissertation was to develop an acoustic method for analyzing road traffic parameters. The operating principle of acoustic road traffic analysis provides a passive method of monitoring traffic intensity. The work presents selected machine learning methods in the context of sound analysis (machine hearing). A methodology for classifying road traffic events using machine learning is presented. The work outlines the basic...

    Full text available to download

  • Buzz-based honeybee colony fingerprint

    Non-intrusive remote monitoring has its applications in a variety of areas. For industrial surveillance case, devices are capable of detecting anomalies that may threaten machine operation. Similarly, agricultural monitoring devices are used to supervise livestock or provide higher yields. Modern IoT devices are often coupled with Machine Learning models, which provide valuable insights into device operation. However, the data...

    Full text available to download

  • Evaluation of aspiration problems in L2 English pronunciation employing machine learning

    The approach proposed in this study includes methods specifically dedicated to the detection of allophonic variation in English. This study aims to find an efficient method for automatic evaluation of aspiration in the case of Polish second-language (L2) English speakers’ pronunciation when whole words are analyzed instead of particular allophones extracted from words. Sample words including aspirated and unaspirated allophones...

    Full text available to download

  • Akustyczna analiza natężenia ruchu drogowego dla systemów zarządzania ruchem (Acoustic analysis of road traffic intensity for traffic management systems)

    Publication

    - Year 2019

    The paper outlines selected issues in road transport management in Poland and worldwide. In this context, it presents market needs, requirements, and capabilities for acquiring information on the current state of road networks. An acoustic method of supervising road traffic is proposed, along with its capabilities in the context of traffic management systems. A signal acquisition scheme together with reference data is presented....

  • Układ do prowadzenia telekonferencji (A system for conducting teleconferences)

    The subject of the invention is a system for conducting videoconferences. The system is intended to improve conferences among several users by means of an application that adjusts the displayed image to the user's gaze fixation point, so that the user's eyes are in the center of the frame. The camera is also equipped with horizontal and vertical position controllers.

  • Fully Automated AI-powered Contactless Cough Detection based on Pixel Value Dynamics Occurring within Facial Regions

    Publication

    - Year 2021

    Increased interest in non-contact evaluation of the health state has led to higher expectations for delivering automated and reliable solutions that can be conveniently used during daily activities. Although some solutions for cough detection exist, they suffer from a series of limitations. Some of them rely on gesture or body pose recognition, which might not be possible in cases of occlusions, closer camera distances or impediments...

    Full text to download in external service

  • Comparative study on the effectiveness of various types of road traffic intensity detectors

    Publication

    - Year 2019

    Vehicle detection and speed measurements are crucial tasks in traffic monitoring systems. In this work, we focus on several types of electronic sensors, operating on different physical principles in order to compare their effectiveness in real traffic conditions. Commercial solutions are based on road tubes, microwave sensors, LiDARs, and video cameras. Distributed traffic monitoring systems require a high number of monitoring...

    Full text to download in external service

  • MACHINE LEARNING–BASED ANALYSIS OF ENGLISH LATERAL ALLOPHONES

    Automatic classification methods, such as artificial neural networks (ANNs), the k-nearest neighbor (kNN) and self-organizing maps (SOMs), are applied to allophone analysis based on recorded speech. A list of 650 words was created for that purpose, containing positionally and/or contextually conditioned allophones. For each word, a group of 16 native and non-native speakers were audio-video recorded, from which seven native speakers’...

    Full text available to download

  • Improving automatic surveillance by sound analysis

    Publication

    An automatic surveillance system, based on event detection in the video image can be improved by implementing algorithms for audio analysis. Dangerous or illegal actions are often connected with distinctive sound events like screams or sudden bursts of energy. A method for detection and classification of alarming sound events is presented. Detection is based on the observation of sudden changes in sound level in distinctive sub-bands...

  • Pomorskie drogi ku Niepodległej (Pomeranian roads to independence)

    Events

    05-03-2019 17:00 - 05-03-2019 19:00

    Politechnika Otwarta invites you to the premiere screening of Jan Butowski's film „Pomorskie drogi ku Niepodległej” (Pomeranian roads to independence). We will get to know the figures, places, and events that played a significant role on the road to independence.

  • ALOFON corpus

    The ALOFON corpus is one of the multimodal databases of word recordings in English, available at http://www.modality-corpus.org/. The ALOFON corpus is oriented towards the recording of the speech equivalence variants. For this purpose, a total of 7 people who are or speak English with native speaker fluency and a variety of Standard Southern British...

  • Analysis of Lombard speech using parameterization and the objective quality indicators in noise conditions

    Publication

    - Year 2018

    The aim of the work is to analyze the Lombard speech effect in recordings and then modify the speech signal in order to improve objective speech quality indicators after mixing the useful signal with noise or with an interfering signal. The modifications made to the signal are based on the characteristics of the Lombard speech, and in particular on the effect of increasing the fundamental frequency...

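  • Metoda i algorytmy sterowania procesami miksowania dźwięku za pomocą gestów w oparciu o analizę obrazu wizyjnego (A method and algorithms for controlling sound mixing processes with gestures, based on video image analysis)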

    Publication

    - Year 2013

    The main aim of the dissertation was to develop a system for mixing sound by means of hand gestures performed in the air, and to examine the possibilities offered by such a solution in comparison with the contemporary method of mixing audio signals, which uses a computer environment. The developed system recognizes both dynamic and static hand gestures. Dynamic gesture recognition was implemented using fuzzy logic methods...

  • Badania kliniczne z udziałem ludzi (Clinical trials involving humans)

    Events

    02-03-2021 11:00 - 02-03-2021 13:00

    Centrum Transferu Wiedzy i Technologii PG invites you to the second part of a webinar packed with practical knowledge, this time on clinical trials involving humans. Registration is required.

  • 2 termin webinarium: Badania kliniczne z udziałem ludzi (Second webinar date: Clinical trials involving humans)

    Events

    16-03-2021 11:00 - 16-03-2021 13:30

    Centrum Transferu Wiedzy i Technologii PG invites you to the second edition of a webinar packed with practical knowledge on clinical trials involving humans. Registration is required.

  • Multimedia i Interfejsy 2022 (Multimedia and Interfaces 2022)

    e-Learning Courses
    • J. Daciuk
    • W. Szwoch
    • M. Szwoch

    The aim of the course is to familiarize students with: types of multimedia data and methods of acquiring them; multimedia data formats and standards; multimedia data compression methods; the basics of multimedia data processing and recognition; programming of multimedia applications, including video games; types of user interfaces in computer systems; methods of description and principles of...

  • Multimedia i Interfejsy 2023 (Multimedia and Interfaces 2023)

    e-Learning Courses
    • J. Daciuk
    • W. Szwoch
    • M. Szwoch

    The aim of the course is to familiarize students with: types of multimedia data and methods of acquiring them; multimedia data formats and standards; multimedia data compression methods; the basics of multimedia data processing and recognition; programming of multimedia applications, including video games; types of user interfaces in computer systems; methods of description and principles of...

  • ZINTEGROWANY SYSTEM DOMOWEGO MONITORINGU PARAMETRÓW MEDYCZNYCH OSÓB STARSZYCH I CHORYCH (Integrated system for home monitoring of medical parameters of elderly and ill people)

    The proposed solutions are intended to support elderly and ill people so that they can live independently for as long as possible, with an increased sense of security from knowing that they are monitored and will not be left without help in a sudden life-threatening emergency. At the same time, the system does not infringe on the sense of privacy and intimacy, because neither video cameras nor continuous audio listening are used for monitoring. Additionally, the collected information...

  • Algorytmy analizy i przetwarzania danych z sonarów wielowiązkowych w rozproszonych systemach GIS (Algorithms for analyzing and processing multibeam sonar data in distributed GIS systems)

    Publication

    - Year 2011

    Marine telemonitoring and broadly understood marine research are an important part of human activity in the spheres of research, science, and the economy. Activities related to seabed mapping, inspection of quays and shoreline reinforcements, and the study of marine fauna make it possible to understand the processes occurring in the marine environment and contribute to the development of many branches of the economy, such as maritime transport, safety, port protection, and others. As part of...

  • Zdolni z Pomorza 2022/23 - Podstawy programowania urządzeń brzegowych na Raspberry Pi w języku Python (Basics of programming edge devices on Raspberry Pi in Python)

    e-Learning Courses
    • T. Neumann

    Course objective: The aim of the course is to introduce pupils to the basics of programming in Python, including edge devices connected to the Raspberry Pi 4B minicomputer that make it possible to measure physical, environmental, or biomedical quantities. Course description: Python is one of the most popular programming languages, which, combined with additional tools, can be used to build websites, process...

  • Zdolni z Pomorza 2022/23 - Podstawy programowania urządzeń brzegowych na Raspberry Pi w języku Python - arch. (Basics of programming edge devices on Raspberry Pi in Python, archived)

    e-Learning Courses
    • T. Neumann
    • B. Wikieł

    Course objective: The aim of the course is to introduce pupils to the basics of programming in Python, including edge devices connected to the Raspberry Pi 4B minicomputer that make it possible to measure physical, environmental, or biomedical quantities. Course description: Python is one of the most popular programming languages, which, combined with additional tools, can be used to build websites, process...