Wyniki wyszukiwania dla: AUDIO PROCESSING OBJECTS

Wyniki wyszukiwania dla: AUDIO PROCESSING OBJECTS

wyników na stronę:
osadź ten widok na swojej stronie

Filtry

wszystkich: 2980

wyczyść wszystkie filtry niedostępne

wyświetlamy 1000 najlepszych wyników Pomoc

AITP - AI Thermal Pedestrians Dataset
Dane Badawcze
open access
- A. Górska
- P. Guzal
- I. Namiotko
- A. Wędołowska
- M. Włoszczyńska
- J. Rumiński
AITP is a pedestrian detection dataset consisting of 9178 annotated thermal images. The training set contains 7801 images on which15448 pedestrians were labeled. The test set has 1377 images on which 2731 objects were marked. All images are in PNG file format (120x160) captured with FLIR Lepton Thermal Camera on the streets of Gdańsk, Poland. All pedestrians...
Methodology and technology for the polymodal allophonic speech transcription
Publikacja
- Journal of the Acoustical Society of America - Rok 2016
A method for automatic audiovisual transcription of speech employing: acoustic and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e. the changes in the articulatory setting of speech organs for...

Pełny tekst do pobrania w serwisie zewnętrznym
Methodology and technology for the polymodal allophonic speech transcription
Publikacja
- Journal of the Acoustical Society of America - Rok 2016
A method for automatic audiovisual transcription of speech employing: acoustic, electromagnetical articulography and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e., the changes in the articulatory...

Pełny tekst do pobrania w serwisie zewnętrznym
Sound engineering as our commitment to its creators in Poland
Publikacja
- B. Kostek
- A. Czyżewski
- Archives of Acoustics - Rok 2019
Sound engineering is an interdisciplinary and rapidly expanding domain. It covers many aspects, such as sound perception, studio and sound mastering technology, music information retrieval including content-based search systems and automatic music transcription frameworks, sound synthesis, sound restoration, electroacoustics, and other ones constituting multimedia technology. Moreover, machine learning methods applied to the topics...

Pełny tekst do pobrania w serwisie zewnętrznym
The impact of cooking method on the phenolic composition, total antioxidant activity and starch digestibility of rice (Oryza sativa L.)
Publikacja
- T. Chmiel
- I. E. Saputro
- B. Kusznierewicz
- A. Bartoszek-Pączkowska
- JOURNAL OF FOOD PROCESSING AND PRESERVATION - Rok 2018
This study investigated changes in the phenolic composition, total antioxidant activity (TAA) and starch digestibility in white and brown rice due to three different cooking procedures, and subsequent reheating of cooked rice after storage. Among the analyzed samples, brown rice showed the highest TAA and phenolic content (622.5 mg kg-1 DW). All cooking methods resulted in significant decrease of phenolic content and TAA of rice...

Pełny tekst do pobrania w portalu
The Importance of Contextual Topology in the Process of Harmonization of the Spatial Databases on Example BDOT500
Publikacja
- A. Inglot
- K. Kozioł
- Rok 2016
In this work, we present two detailed problems of topological errors in spatial database. Both issues are inconsistencies in the database, i.e. interior topological relationships layers of buildings and the relationship between the buildings layer and the layer of plots. That inconsistency is related to the residual polygons that arise as a result of overlapping objects, or gaps between objects. The occurrence of this type of error...

Pełny tekst do pobrania w portalu
MODALITY corpus - SPEAKER 17 - SEQUENCE S1
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - SEQUENCE S4
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - SEQUENCE S2
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - SEQUENCE S5
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - SEQUENCE S3
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - SEQUENCE S6
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
Resolving conflicts in object tracking for automatic detection of events in video
Publikacja
- Rok 2010
W referacie przedstawiono algorytm rozwiązywania konfliktów w śledzeniu obiektów ruchomych. Proponowana metoda wykorzystuje predykcję stanu obiektu obliczaną przez filtry Kalmana oraz dopasowuje wykryte obiekty do struktur śledzących ich ruch na podstawie deskryptorów koloru i tekstury. Omówiono specyficzne sytuacje powodujące konflikty, takie jak rozdzielanie obiektów. Przedstawiono wyniki testów. Algorytm może być zastosowany...
Cultural Heritage in Spatial Planning
Publikacja
- K. Rzasa
- M. Ogryzek
- M. Kulawiak
- Rok 2016
The cultural heritage objects of each country should have a major impact on the development of space. Unfortunately, most often the investment needs prevail and only the most precious historical objects are protected. Thus often a monument is preserved, but its surroundings (which put it in context) are lost forever. This article addressed the issues of cultural heritage in relation to the spatial planning system in Poland. The...

Pełny tekst do pobrania w serwisie zewnętrznym
Robustness in Compressed Neural Networks for Object Detection
Publikacja
- S. Cygert
- A. Czyżewski
- Rok 2021
Model compression techniques allow to significantly reduce the computational cost associated with data processing by deep neural networks with only a minor decrease in average accuracy. Simultaneously, reducing the model size may have a large effect on noisy cases or objects belonging to less frequent classes. It is a crucial problem from the perspective of the models' safety, especially for object detection in the autonomous driving...

Pełny tekst do pobrania w portalu
Methods of data extraction from sub-bottom profiler's signal
Publikacja
- HYDROACOUSTICS - Rok 2010
Data obtain during sounding Gdansk Bay with SES-2000 Standard parametric sub-bottom profiler has two types of information: envelope and pure signal. First is used to plot echograms in real time and contain envelope of echo. The second one is stored during sounding and can be processed after recording data. Comparison of results will be shown and discussed. First step in investigation was proper configuration of small measurement...

Pełny tekst do pobrania w portalu
Design and numerical testing of 5-box gfrp shell footbridge
Publikacja
- M. Klasztorny
- J. Chróścielewski
- P. Szurgott
- R. Romanowski
- Rok 2014
The paper formulates new design conditions for composite footbridges, taking into account the material and structural specificity of these objects, i.e. the strength condition, the serviceability condition and the frequency condition. A GFRP composite footbridge labelled with CFB2 code has been designed, with the original superstructure in the form of an opened-closed 5-box girder. The foot and cycle track bridge is simply supported,...
Measurements of OF QoS/QoE parameters for media streaming in a PMIPv6 TESTBED WITH 802.11 b/g/n WLANs
Publikacja
- Metrology and Measurement Systems - Rok 2012
A growing number of mobile devices and the increasing popularity of multimedia services result in a new challenge of providing mobility in access networks. The paper describes experimental research on media (audio and video) streaming in a mobile IEEE 802.11 b/g/n environment realizing network-based mobility. It is an approach to mobility that requires little or no modification of the mobile terminal. Assessment of relevant parameters...

Pełny tekst do pobrania w portalu
Multisensor System for the Protection of Critical Infrastructure of Seaport
Publikacja
- M. Kastek
- R. Dulski
- M. Życzkowski
- M. Szustakowski
- P. Trzaskawka
- W. Ciurapiński
- P. Markowski
- M. Karol
- G. Grelowska
- I. Gloza... i 2 innych
- Rok 2013
There are many separated infrastructural objects within a harbor area that may be considered “critical”, such as gas and oil terminals or anchored naval vessels. Those objects require special protection, including security systems capable of monitoring both surface and underwater areas, because an intrusion into the protected area may be attempted using small surface vehicles (boats, kayaks, rafts, floating devices with weapons...
Jerzy Proficz dr hab. inż.

Osoby

Centrum Informat. Trójmiejskiej Akadem.Sieci Komputerowej, Katedra Architektury Systemów Komputerowych

Jerzy Proficz – dyrektor Centrum Informatycznego Trójmiejskiej Akademickiej Sieci Komputerowej (CI TASK) na Politechnice Gdańskiej. Uzyskał stopień naukowy doktora habilitowanego (2022) w dyscyplinie: Informatyka techniczna i telekomunikacja. Autor i współautor ponad 50 artykułów w czasopismach i na konferencjach naukowych związanych głównie z równoległym przetwarzaniem danych na komputerach dużej mocy (HPC, chmura obliczeniowa). Udział...
Automatically created and partially veriffied Wikipedia - WordNet mappings
Dane Badawcze
open access
- T. Boiński
- J. Szymański
Mapping between Wikipedia articles and WordNet synsets. The mappings between Wikipedia articles and WordNet synsets were obtained automatically using 4 algorithms of data processing. The automatically generated mappings were than a subject of verification by a group of volunteers using crowdsourcing approach through so called Games with a Purpose. The...
Expert system for automatic classification and quality assessment of singing voices
Publikacja
- P. Żwan
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2006
.

Pełny tekst do pobrania w serwisie zewnętrznym
DSP techniques for determining ''Wow'' distortions
Publikacja
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2007
Artykuł przedstawia opis algorytmów do wyznaczania charakterystyki zniekształceń kołysania dźwięku. Są to algorytmy: śledzenia przydźwięku sieciowego, śledzenia pozostałości magnetycznej prądu podkładu wielkich częstotliwości, adaptacyjnej analizy środka ciężkości widma dla wybranej części zniekształconego sygnału. Przedstawione algorytmy pozwalają na implementację programową i sprzętową.
System for automatic singing voice recognition
Publikacja
- P. Żwan
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2008
W artykule przedstawiono system automatycznego rozpoznawania jakości i typu głosu śpiewaczego. Przedstawiono bazę danych oraz zaimplementowane parametry. Algorytmem decyzyjnym jest algorytm sztucznych sieci neuronowych. Wytrenowany system decyzyjny osiąga skuteczność ok. 90% w obydwu kategoriach rozpoznawania. Dodatkowo wykazano przy pomocy metod statystycznych, że wyniki działania systemu automatycznej oceny jakości technicznej...
Tonality Estimation and Frequency Tracking of Modulated Tonal Components
Publikacja
- M. Kulesza
- A. Czyżewski
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2009
A novel method for tonality estimation and frequency tracking of tonal components modulated in frequency and amplitude is presented. The algorithm detects the local maxima of magnitude spectra corresponding to three contiguous frames of a signal and matches them into the tonal track candidates. The magnitude-based and phase-based methods are used to estimate the frequency jumps between spectrum maxima belonging to the tonal track...

Pełny tekst do pobrania w serwisie zewnętrznym
Measurements and Visualization of Sound Intensity Around the Human Head in Free Field Using Acoustic Vector Sensor
Publikacja
- J. Kotus
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2015
This paper presents measurements and visualization of sound intensity around the human head simulator in a free field. A Cartesian robot, applied for precise positioning of the acoustic vector sensor, was used to measure sound intensity. Measurements were performed in a free field using a head and torso simulator and the setup consisting of four different loudspeaker configurations. The acoustic vector sensor was positioned around...

Pełny tekst do pobrania w portalu
Data Analysis in Bridge of Data
Publikacja
- Rok 2022
The chapter presents the data analysis aspects of the Bridge of Data project. The software framework used, Jupyter, and its configuration are presented. The solution’s architecture, including the TRYTON supercomputer as the underlying infrastructure, is described. The use case templates provided by the Stat-reducer application are presented, including data analysis related to spatial points’ cloud-, audio- and wind-related research.

Pełny tekst do pobrania w portalu
Mitigating Time-Constrained Stolen-Credentials Content Poisoning in an NDN Setting
Publikacja
- J. Konorski
- Rok 2019
NDN is a content-centric networking architecture using globally addressable information objects, created by publishers and cached by network nodes to be later accessed by subscribers. Content poisoning attacks consist in the substi-tution by an intruder publisher of bogus objects for genuine ones created by an honest publisher. With valid credentials stolen from an honest publisher, such attacks seem unstoppa-ble unless object...

Pełny tekst do pobrania w portalu
Suppression of distortions in signals received from Doppler sensor for vehicle speed measurement
Publikacja
- G. Szwoch
- Rok 2018
Doppler sensors are commonly used for movement detection and speed measurement. However, electromagnetic interference and imperfections in sensor construction result in degradation of the signal to noise ratio. As a result, detection of signals reflected from moving objects becomes problematic. The paper proposes an algorithm for reduction of distortions and noise in the signal received from a simple, dual-channel type of a Doppler...

Pełny tekst do pobrania w portalu
Architecture and implementation of distributed data storage using Web Services, CORBA i PVM. W: Proceedings. PPAM 2003. Parallel Processing and Applied Mathematics. Fifth International Conference. Częstochowa, 7-10 September 2003. Architektura i implementacja rozproszonego zarządzania danymi używając systemów Web Services, CORBA i PVN.
Publikacja
- P. Czarnul
- LECTURE NOTES IN COMPUTER SCIENCE - Rok 2003
Proponujemy architekturę i jej implementację PVMWeb Cluster I/O przeznaczoną do rozproszonego zarządzania danymi. Dane zapisywane są w systemie Web Services z geograficznie odległych klientów lub przez wywołania CORBA z wewnątrz danego klastra co oferuje lepsze osiągi.
Efkleidis Katsaros

Osoby

Efklidis Katsaros received the B.Sc. degree in mathematics from the Aristotle University of Thessaloniki, Greece, in 2016, and the M.Sc. degree (cum laude) in data science: statistical science from Leiden University, The Netherlands, in 2019. He is currently pursuing the Ph.D. degree in deep video multi-task learning with the Department of Biomedical Engineering, Gdańsk University of Technology, Poland. Since 2020, he has been...
MODALITY corpus - SPEAKER 35 - COMMANDS C1
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 21 - SEQUENCE S6
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 21 - COMMANDS C5
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 21 - SEQUENCE S4
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 10 - SEQUENCE S1
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 01 - SEQUENCE S2
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 39 - COMMANDS C1
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 01 - SEQUENCE S3
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 01 - COMMANDS C3
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 21 - SEQUENCE S2
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 33 - SEQUENCE S1
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 01 - COMMANDS C2
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 21 - COMMANDS C3
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 01 - SEQUENCE S4
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 01 - SEQUENCE S6
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 21 - SEQUENCE S5
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 01 - COMMANDS C4
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 21 - COMMANDS C4
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 01 - COMMANDS C5
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...

Wyszukiwarka

Filtry

Katalog

Wyniki wyszukiwania dla: AUDIO PROCESSING OBJECTS

Jerzy Proficz dr hab. inż.