Search results for: KODOWANIE AUDIO

Displayed results came from alternative search method.

total: 812

MODALITY corpus - SPEAKER 27 - SEQUENCE S4
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - COMMANDS C6
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 35 - SEQUENCE S6
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 35 - SEQUENCE S4
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 33 - COMMANDS C3
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - COMMANDS C2
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 35 - COMMANDS C4
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 32 - SEQUENCE S4
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 32 - COMMANDS C3
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 27 - SEQUENCE S6
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 35 - SEQUENCE S5
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - COMMANDS C4
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 27 - SEQUENCE S5
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 32 - COMMANDS C2
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 35 - COMMANDS C2
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 33 - COMMANDS C5
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 27 - COMMANDS C2
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 33 - SEQUENCE S6
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 35 - COMMANDS C6
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 32 - COMMANDS C5
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 35 - COMMANDS C5
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 33 - COMMANDS C4
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 27 - SEQUENCE S3
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 27 - COMMANDS C3
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 33 - SEQUENCE S5
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 27 - SEQUENCE S2
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
Metaheuristics in scheduling time-dependent tasks (Metaheurystyki w szeregowaniu zadań uwarunkowanych czasowo)
Publication
- K. Ocetkiewicz
- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Year 2008
This article investigates the use of metaheuristic algorithms in time-dependent task scheduling problems. It compares the results obtained by a genetic algorithm, differential evolution, and simulated annealing, using permutation-based, rule-priority-based, and interval-coding solution representations, when solving the NP-hard problem $1 \mid p_i = a_i + b_i s_i \mid \sum w_i C_i$. Where possible, the results were compared with solutions...
On the Consumption of Multimedia Content Using Mobile Devices: a Year to Year User Case Study
Publication
- P. Falkowski-Gilski
- Archives of Acoustics - Year 2020
In the early days, consumption of multimedia content related to audio signals was only possible in a stationary manner. The music player was located at home, with a necessary physical drive. An alternative way for an individual was to attend a live performance at a concert hall or host a private concert at home. To sum up, audio-visual effects were reserved for only a narrow group of recipients. Today, thanks to portable players,...

Full text available to download
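Subjective quality measurement of speech and music signals in local DAB+ radio multiplexes in Gdańsk and Wrocław (Subiektywny pomiar jakości sygnałów mowy i muzyki w lokalnych multipleksach radiofonii DAB+ w Gdańsku i Wrocławiu)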
Publication
- P. Falkowski-Gilski
- S. Brachmański
- Year 2021
DAB+ (Digital Audio Broadcasting plus) digital radio has been available to listeners in Poland since 2013. The standard offers broad options for configuring local multiplexes, in terms of both the number and the quality of the broadcast radio programmes. This makes it possible to adjust the parameters of the transmitted signals to meet the expectations of end users. Unlike analogue FM radio...

Full text to download in external service
A new digital speech signal transmission system operating at 16 kbit/s (Nowy system cyfrowej transmisji sygnału mowy o szybkości 16 kbit/s)
Publication
- Ł. Waga
- Elektronika : konstrukcje, technologie, zastosowania - Year 2003
The aim of the paper is to present a new digital speech transmission system that uses a digital channel with a transmission rate of 16 kbit/s. The new speech coding method proposed in the paper reduces the required transmission rate fourfold compared with digital telephony while maintaining acceptable quality of the transmitted speech, and at the same time does not require the computationally expensive algorithms used in...
Sample Rate Conversion with Fluctuating Resampling Ratio
Publication
- M. Blok
- Year 2012
In this paper a sample rate conversion with continuously changing resampling ratio has been presented. The proposed implementation is based on a variable fractional delay filter implemented using a Farrow structure. It has been demonstrated that using the proposed approach the instantaneous resampling ratio can be freely changed. This allows for simulation of audio recorded on magnetic tape with nonuniform velocity as well as removal...
Online Sound Restoration for Digital Library Applications
Publication
- Year 2012
A system for sound restoration was conceived and engineered having the following features: no special sound restoration software is needed to perform audio restoration by the user, the process of restoration employs automatic reduction of noise, wow and impulse distortions performed in the online mode, no skills in digital signal processing from the user are needed. The principles of the created system and its features as well...

Full text to download in external service
Joanna Mytnik dr hab.

People

Center for Innovative Education, Gdańsk University of Technology

Director of the Center for Innovative Education (Centrum Nowoczesnej Edukacji) at Gdańsk University of Technology, passionate about designing learning processes with non-standard methods and tools (UX and design thinking). She has over 20 years of experience teaching students and teachers. Her passion is teaching, which she understands as organizing an educational space that meets the needs of every student. When designing a learning environment, she relies on...
Factors generating change in improving the competitiveness of enterprises (Czynniki generujące zmiany w podnoszeniu konkurencyjności przedsiębiorstw)
Publication
- J. Łopatowska
- G. Zieliński
- PRACE I MATERIAŁY WYDZIAŁU ZARZĄDZANIA UNIWERSYTETU GDAŃSKIEGO - Year 2012
In this publication the authors present the basic groups of factors that generate change in improving the competitiveness of business entities. Three key entities that influence the creation of these factors are taken into account: suppliers, the company, and customers.
Visualization of events using various kinds of synchronized data for the Border Guard
Publication
- B. Czaplewski
- S. Kaczmarek
- J. A. Litka
- M. Miszewski
- Zeszyty Naukowe Akademii Marynarki Wojennej - Year 2017
The STRADAR project is dedicated to streaming real-time data in a distributed dispatcher and teleinformation system of the Border Guard. The Events Visualization Post is software designed for simultaneous visualization of data of different types in BG headquarters. The software allows the operator to visualize files, images, SMS, SDS, video, audio, and current or archival data on the naval situation on digital maps. All the visualized...

Full text available to download
Sample Rate Conversion with Fluctuating Resampling Ratio
Publication
- M. Blok
- Archives of Acoustics - Year 2012
In this paper a sample rate conversion with continuously changing resampling ratio has been presented. The proposed implementation is based on a variable fractional delay filter implemented using a Farrow structure. It has been demonstrated that using the proposed approach the instantaneous resampling ratio can be freely changed. This allows for simulation of audio recorded on magnetic tape with nonuniform velocity as well as removal...

Full text available to download
Workflow scenarios coupled with automatic data acquisition (Scenariusze przepływu pracy sprzężone z automatyczną akwizycją danych)
Publication
- T. Dziubich
- Year 2010
The topic of smart workflows is presented, together with applications based on smart workflow scenarios: audio system control, monitoring of environmental conditions in a room, and a dynamic context-dependent task list. The component-based architecture of the system is described, as are the stages that extend the design and implementation process. Problems that occur while running these applications are pointed out...
Distributed storage of data backups (Rozproszone przechowywanie zapasowych kopii danych)
Publication
- J. Kuchta
- Year 2012
A method is shown for using a distributed processing system to protect an institution against the effects of a hacker attack combined with the destruction of the institution's database. The method consists in interleaving data packets into audio-video material downloaded by Internet users of Video-on-Demand film services and storing the data dispersed across hundreds or even thousands of computers.

Full text to download in external service
Learning and memory processes in autonomous agents using an intelligent system of decision-making
Publication
- Year 2016
This paper analyzes the functions and structures of the memory that is an indispensable part of an Intelligent System of Decision-making (ISD), developed as a universal engine for autonomous robotics. A simplified way of processing and coding information in human cognitive processes is modelled and adopted for use in autonomous systems. Based on such a knowledge structure, an artificial model of reality representation and a model...

Full text available to download
Quantum channel capacities: multiparty communication
Publication
- M. Demianowicz
- P. Horodecki
- PHYSICAL REVIEW A - Year 2006
Various aspects of multi-user communication over memoryless quantum channels are analyzed. Certain known results concerning quantum communication in the one-sender, one-receiver setting are generalized. In particular, the paper shows the uselessness of forward classical communication in the transmission of quantum information, as well as the equivalence of capacity region definitions based on different measures of transmission fidelity:...

Full text available to download
Comparative control of the bioactivity of some frequently consumed vegetables subjected to different processing conditions
Publication
- S. Gorinstein
- Z. Jastrzębski
- H. Leontowicz
- M. Leontowicz
- J. Namieśnik
- K. Najman
- Y. Park
- B. Heo
- J. Cho
- J. Bae
- FOOD CONTROL - Year 2009
The aim of the research was to determine the effect of pre-consumption processing conditions on the antioxidant properties of white and red onion and of garlic processed by cooking and blanching. The ABTS, DPPH, FRAP, and CUPRAC assays were used to determine the level of antioxidant activity.
The value of knowledge for cooperation between science and business (Wartość wiedzy dla współpracy nauki i biznesu)
Publication
- I. Richter
- Year 2011
Building competitive advantage on the basis of human capital is the essence of the knowledge-based economy. Knowledge and intellectual capital are therefore the critical assets of the modern enterprise. Managing them effectively directly affects the level of creativity and innovation. Consequently, it forms the backbone of the process of building an enterprise's competitive position in a dynamically changing environment....
Verification of the Parameterization Methods in the Context of Automatic Recognition of Sounds Related to Danger
Publication
- P. Żwan
- A. Czyżewski
- Journal of Digital Forensic Practice - Year 2010
The article describes an application that automatically detects sound events such as broken glass, a gunshot, an explosion, and a scream. The described system consists of a parameterization block and a classifier. The article compares parameters dedicated to this application with standard MPEG-7 descriptors. Two classifiers are also compared: one based on a perceptron (neural networks) and the other based on a support vector machine....

Full text to download in external service
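Integration of heterogeneous wireless IP networks to improve the efficiency of data transmission at sea (Integracja bezprzewodowych heterogenicznych sieci IP dla poprawy efektywności transmisji danych na morzu)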
Publication
- M. Hoeft
- Year 2023
As the importance of the marine environment in our everyday life grows, for example through the increased volume of transport carried by sea or intensified work on observing and monitoring the marine environment, so does the need to develop efficient communication systems dedicated to this environment. Heterogeneous wireless communication systems integrated at the network layer level...

Full text available to download
Discovering Rule-Based Learning Systems for the Purpose of Music Analysis
Publication
- G. Korvel
- B. Kostek
- Journal of the Acoustical Society of America - Year 2019
Music analysis and processing aims at understanding information retrieved from music (Music Information Retrieval). For the purpose of music data mining, machine learning (ML) methods or a statistical approach are employed. Their primary task is recognition of musical instrument sounds, music genre or emotion contained in music, identification of audio, assessment of audio content, etc. In terms of computational approach, music databases...

Full text available to download
Stepwise development of distributed interactive simulation systems.
Publication
- T. Orłowski
- B. Wiszniewski
- Year 2004
The stepwise method makes it possible to build efficient and scalable distributed simulation systems for real-world objects such as off-road vehicles, cars, and helicopters. By taking into account operational parameters and the parameters of visualization tools, the number of messages sent between objects can be significantly reduced.
Technical design and construction of a quadrocopter-type flying platform (Projekt techniczny i budowa platformy latającej typu quadrocopter)
Publication
- Pomiary Automatyka Robotyka - Year 2014
The quadrocopter is one of many types of flying platform. Advances in technology make it possible to build structures that move along multiple axes. The article presents the design, construction, and software of a quadrocopter vehicle. In addition, the measurement signals were filtered and a control algorithm was developed.

Full text available to download
Gesture-controlled Sound Mixing System With a Sonified Interface
Publication
- M. Lech
- B. Kostek
- Year 2013
In this paper the authors present a novel approach to sound mixing. It is materialized in a system that makes it possible to mix sound with hand gestures recognized in a video stream. The system has been developed in such a way that mixing operations can be performed both with and without visual support. To check the hypothesis that the mixing process needs only an auditory display, the influence of audio information visualization on sound...

Full text to download in external service
Implementation Of The Innovative Radiolocalization System VCS-MLAT (Voice Communication System Multilateration)
Publication
- Year 2020
In the article the concept of the radiolocalization subsystem of the VHF communication for aviation VCS-MLAT (Voice Communication System – Multilateration) is presented. The distributed localization system can estimate the position of the aircraft using the audio signals from aircraft transmitters in the VHF band (118-136 MHz). This paper shows initial verification of the possibility to use voice airband communication to estimate...

Full text to download in external service
An English speech corpus for multimodal automatic speech recognition (KORPUS MOWY ANGIELSKIEJ DO CELÓW MULTIMODALNEGO AUTOMATYCZNEGO ROZPOZNAWANIA MOWY)
Publication
- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Year 2016
The paper presents an audiovisual speech corpus containing 31 hours of English speech recordings. The corpus is intended for automatic audiovisual speech recognition. It contains video recordings from a high-frame-rate stereoscopic camera and audio recorded by a microphone array and a laptop microphone. Thanks to the inclusion of recordings made in noisy conditions, the corpus...
