Filters
total: 945
-
Catalog
- Publications 487 available results
- Journals 25 available results
- Conferences 6 available results
- Publishing Houses 1 available results
- People 32 available results
- Inventions 1 available results
- Projects 2 available results
- e-Learning Courses 25 available results
- Events 6 available results
- Open Research Data 360 available results
Search results for: video restoration
-
MODALITY corpus - SPEAKER 27 - COMMANDS C2
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 10 - COMMANDS C3
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 33 - SEQUENCE S6
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 35 - COMMANDS C6
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 32 - COMMANDS C5
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 35 - COMMANDS C5
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 33 - COMMANDS C4
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 27 - SEQUENCE S3
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 27 - COMMANDS C3
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 33 - SEQUENCE S5
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 27 - SEQUENCE S2
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
Tagged images with bees
Open Research DataImages taken from bee hive with tagged bees. The images are prepared for training yolo5 deep neural network (supplied with the data).
-
Measuring and Analyzing Audio Levels in Film, Commercials, and Movie Trailers Using Leq(A) Values and the LUFS Loudness Model . Analiza pomiarów dźwięku w filmie oraz w reklamach filmowych z wykorzystaniem modelu głośności
PublicationThe purpose of this paper is to describe the measurement of loudness levels in movies, movie trailers, and commercials displayed before feature films at movie theaters. In the initial section, the paper discusses the issues related to measurement of loudness levels, provides recommendations regarding permissible loudness levels during movie screenings, and mentions the applied units of measurement. The following section of the...
-
Visual perception of vowels from static and dynamic cues
PublicationThe purpose of the study was to analyse human identification of Polish vowels from static and dynamic durationally slowed visual cues. A total of 152 participants identified 6 Polish vowels produced by 4 speakers from static (still images) and dynamic (videos) cues. The results show that 59% of static vowels and 63% of dynamic vowels were successfully identified. There was a strong confusion between vowels within front, central,...
-
Nauka w świecie cyfrowym okiem młodego inżyniera - początki techniki wirtualnej rzeczywistości
PublicationIstnieje wiele definicji wirtualnej rzeczywistości (VR – Virtual Reality), które mniej lub bardziej pokrywają się ze sobą w różnych obszarach naukowych. Obecnie, gdy używamy określenia „VR”, odnosi się ono konkretnie do obrazów generowanych komputerowo, które zostały specjalnie zaprojektowane tak, aby dostarczyć jak najbardziej immersyjnych wrażeń. Sporo opracowań mówi również, że VR musi być interaktywna. To odróżniałoby ją od...
-
An new method of audio-visual correlation analysis
PublicationThis paper presents a new methodology of conducting the audio-visual correlation analysis employing the gaze tracking system. Interaction between two perceptual modalities, seeing and hearing, their interaction and mutual reinforcement in a complex relationship was a subject of many research studies. Earlier stage of the carried out experiments at the Multimedia Systems Department (MSD) showed that there exists a relationship between...
-
Bimodal Emotion Recognition Based on Vocal and Facial Features
PublicationEmotion recognition is a crucial aspect of human communication, with applications in fields such as psychology, education, and healthcare. Identifying emotions accurately is challenging, as people use a variety of signals to express and perceive emotions. In this study, we address the problem of multimodal emotion recognition using both audio and video signals, to develop a robust and reliable system that can recognize emotions...
-
Modelling and Analysis of the Positioning Accuracy in the Loading Systems of Mobile Cranes
PublicationIn this work, the authors analyse the influence of the order and range of sequential movements of a crane's working members on the accuracy of the final cargo positioning. The analysis was conducted on the basis of a specially developed method in which the authors proposed the introduction of a geometrical indicator of positioning the load in the intermediate positions (after completing each movement sequence) and in the target...
-
QR CODE JAKO NARZĘDZIE KOMUNIKACJI Z KLIENTAMI
PublicationKażdą nową technologię użytkownicy muszą zacząć postrzegać jako użyteczną, aby mogła się ona upowszechnić. W ten sposób działania z nią związane stają się codziennością. Jednak oprócz użyteczności dla konsumentów istotna jest też prostota użytkowania technologii. Obie cechy dotyczą technologii mobilnych i związanych z nimi działań określanych mianem marketingu mobilnego . Zalicza się do nich: wysyłanie SMS i MMS, włączanie bluetooth...
-
English Language Learning Employing Developments in Multimedia IS
PublicationIn the realm of the development of information systems related to education, integrating multimedia technologies offers novel ways to enhance foreign language learning. This study investigates audio-video processing methods that leverage real-time speech rate adjustment and dynamic captioning to support English language acquisition. Through a mixed-methods analysis involving participants from a language school, we explore the impact...
-
Cognition and Decisional Experience to Support Safety Management in Workplaces
PublicationHazards are present in all workplaces and can result in serious injuries, short and long-term illnesses, or death. In this context, management of safety is essential to ensure the occupational health of workers. Aiming to assist the safety management process, especially in industrial environments, a Cognitive Vision Platform for Hazard Control (CVP-HC) is proposed. This platform is a Cyber Physical system, capable of identifying...
-
Study on CPU and RAM Resource Consumption of Mobile Devices using Streaming Services
PublicationStreaming multimedia services have become very popular in recent years, due to the development of wireless networks. With the growing number of mobile devices worldwide, service providers offer dedicated applications that allow to deliver on-demand audio and video content anytime and everywhere. The aim of this study was to compare different streaming services and investigate their impact on the CPU and RAM resources, with respect...
-
Comparison of the effectiveness of automatic EEG signal class separation algorithms
PublicationIn this paper, an algorithm for automatic brain activity class identification of EEG (electroencephalographic) signals is presented. EEG signals are gathered from seventeen subjects performing one of the three tasks: resting, watching a music video and playing a simple logic game. The methodology applied consists of several steps, namely: signal acquisition, signal processing utilizing z-score normalization, parametrization and...
-
From creative writing, virtual environments to nature-based solutions: linking research and education to facilitate transition from sustainable to regenerative cities
PublicationChallenges related to the climate crisis and its consequences, such as rising sea levels, urban heat islands or floods, engender pressure on architectural education. Sustainable design often inclines to regenerative one - an emerging trend focused on the restorative power of architecture. The question appears upon the tools and methods that would facilitate both students and academics to address new challenges. This article offers...
-
Oprogramowanie mobilnego komunikatora multimedialnego
PublicationArtykuł przedstawia efekty prac nad stworzeniem oprogramowania dla mobilnego komunikatora multimedialnego. Projektowane urządzenie ma umożliwić użytkownikom swobodną komunikację (tekstową, głosową, wideo) oraz możliwość lokalizowania innych użytkowników dzięki działającej w tle wymianie informacji o pozycji. W referacie zaprezentowano architekturę systemu oraz oprogramowania stworzonego w środowisku Qt realizującego założoną funkcjonalność....
-
ELECTIVE PROJECT II _sem 5_Green Story - Free Time Space
e-Learning CoursesThe topic of the course - Green Story - Free Time Space, joins green architecture and a place to spend free time - inside and outside – to read, to eat, to relax. The idea is to design green – to give back the greenery to the public square – to make a city space more friendly for users and more friendly to the environment. You can design a story, to make a space more attractive. You can design a Green Story, to make people more...
-
Dorota Dominika Kamrowska-Załuska dr hab. inż. arch.
PeopleProf. Dorota Kamrowska-Zaluska, architect and urban planner, Associate Professor and Director of mid-career program on Urban development and management of metropolitan areas, at the Department of Urban Design and Regional Planning at Faculty of Architecture, Gdansk University of Technology; Visiting Scholar and Research Fellow at several research institutions incl. Massachusetts Institute of Technology (2013), Charted Urban Planner...
-
Wykorzystanie narzędzi pracy zdalnej w działaniach Koła Naukowego Konstruktorów Pojazdów
PublicationNiniejszy artykuł stanowi opis działalności Koła Naukowego Konstruktorów Pojazdów, w którego działaniach wykorzystywane są nowoczesne narzędzia pracy zdalnej. Dzięki takiemu podejściu, możliwe staje się wyeliminowanie niedogodnień, z którymi borykano się stosując standardowe, starsze podejście do realizacji zadań projektowych w jednostkach badawczo-rozwojowych. Podane przykłady ilustrują, w jaki sposób powszechny obecnie dostęp...
-
Long-term comparative evaluation of an acoustic climate in selected schools before and after the acoustic treatment
PublicationThe results of long-term continuous noise measurements in two selected schools are presented in the paper. Noise characteristics were measured continuously there for approximately 16 months. Measurements started eight months prior to the acoustic treatment of the school corridors of both schools. An evaluation of the acoustic climates in both schools, before and after the acoustic treatment, was performed based on comparison of...
-
Lossless Compression of Binary Trees with Correlated Vertex Names
PublicationCompression schemes for advanced data structures have become the challenge of today. Information theory has traditionally dealt with conventional data such as text, image, or video. In contrast, most data available today is multitype and context-dependent. To meet this challenge, we have recently initiated a systematic study of advanced data structures such as unlabeled graphs [1]. In this paper, we continue this program by considering...
-
Evaluating the Use of Edge Device Towards Fall Detection in Smart City Environment
PublicationThis paper presents the development and preliminary testing of a fall detection algorithm that leverages OpenPose for real-time human pose estimation from video feeds. The system is designed to function optimally within a range of up to 7 meters from ground-level cameras, focusing exclusively on detected human silhouettes to enhance processing efficiency. The performance of the proposed approach was evaluated using accuracy values...
-
Broadening the scope of measurement and analysis of vibrations of an organ pipe employing intensity probe, simulations, and highspeed camera
PublicationThis paper shows an integrated approach to measure, analyze, and model phenomena occurring in an organ pipe driven by pressurized air. The aim of this paper is two-fold, i.e., to measure the pressure signal and the intensity field around the mouth by means of an intensity probe and to visualize and observe the motion of the air jet, which represents the excitation mechanism of the system. This is realized through two techniques,...
-
DevEmo—Software Developers’ Facial Expression Dataset
PublicationThe COVID-19 pandemic has increased the relevance of remote activities and digital tools for education, work, and other aspects of daily life. This reality has highlighted the need for emotion recognition technology to better understand the emotions of computer users and provide support in remote environments. Emotion recognition can play a critical role in improving the remote experience and ensuring that individuals are able...
-
Estimation of DC motor parameters using a simple CMOS camera
PublicationDifferent components of control systems for mobile robots are based on dynamic models. In low-cost solutions such a robot is wheeled and equipped with DC motors, which have to be included in the model of the robot. The model is fairly simple but determination of its parameters needs not to be easy. For instance, DC motor parameters are typically identified indirectly using suitable measurements, concerning engine voltage, current,...
-
A Review of Emotion Recognition Methods Based on Data Acquired via Smartphone Sensors
PublicationIn recent years, emotion recognition algorithms have achieved high efficiency, allowing the development of various affective and affect-aware applications. This advancement has taken place mainly in the environment of personal computers offering the appropriate hardware and sufficient power to process complex data from video, audio, and other channels. However, the increase in computing and communication capabilities of smartphones,...
-
On the Consumption of Multimedia Content Using Mobile Devices: a Year to Year User Case Study
PublicationIn the early days, consumption of multimedia content related with audio signals was only possible in a stationary manner. The music player was located at home, with a necessary physical drive. An alternative way for an individual was to attend a live performance at a concert hall or host a private concert at home. To sum up, audio-visual effects were only reserved for a narrow group of recipients. Today, thanks to portable players,...
-
Low-Power Receivers for Wireless Capacitive Coupling Transmission in 3-D-Integrated Massively Parallel CMOS Imager
PublicationThe paper presents pixel receivers for massively parallel transmission of video signal between capacitive coupled integrated circuits (ICs). The receivers meet the key requirements for massively parallel transmission, namely low-power consumption below a single μW, small area of less than 205 μm2, high sensitivity better than 160 mV, and good immunity to crosstalk. The receivers were implemented and measured in a 3-D IC (two face-to-face...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S1
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S4
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S2
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S5
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S3
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S6
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
Pomiar obrotów i przemieszczenia cząstek T-S w zlokalizowanej strefie deformacji.
PublicationPrzedstawiono nowe stanowisko do badań eksperymentalnych w warunkach dwuosiowego ściskania z materiałem Taylor-Schneebeli (T-S). Unikatowe stanowisko badawcze, w skali badań mechaniki ośrodków rozdrobnionych w ogóle, wykorzystuje oryginalną technikę pomiarów cyfrowych dla określenia obrotów i przemieszczenia cząstek T-S w skali mikro. Cyfrowa technika pomiarów wykorzystuje konwencjonalny sprzęt wideo kamer lub aparatów...
-
Speech recognition system for hearing impaired people.
PublicationPraca przedstawia wyniki badań z zakresu rozpoznawania mowy. Tworzony system wykorzystujący dane wizualne i akustyczne będzie ułatwiał trening poprawnego mówienia dla osób po operacji transplantacji ślimaka i innych osób wykazujących poważne uszkodzenia słuchu. Active Shape models zostały wykorzystane do wyznaczania parametrów wizualnych na podstawie analizy kształtu i ruchu ust w nagraniach wideo. Parametry akustyczne bazują na...
-
Przedmiot wyrównawczy - Analityk Danych - Matematyka II st.
e-Learning CoursesKurs do przedmiotu wyrównawczego dla studentów kierunku Matematyka st. II sem. I Osoba odpowiedzialna za przedmiot: dr hab. Karol Dziedziul Osoby współprowadzące zajęcia: mgr Wiktor Florek email: wikflore@pg.edu.pl, inne dane do kontaktu wkrótce godz. konsultacji: poniedziałek po zajęciach (zdalnie - czat/rozmowa wideo na Teamsie lub Skypie)
-
Mining Knowledge of Respiratory Rate Quantification and Abnormal Pattern Prediction
PublicationThe described application of granular computing is motivated because cardiovascular disease (CVD) remains a major killer globally. There is increasing evidence that abnormal respiratory patterns might contribute to the development and progression of CVD. Consequently, a method that would support a physician in respiratory pattern evaluation should be developed. Group decision-making, tri-way reasoning, and rough set–based analysis...
-
Rewitalizacja dróg wodnych delty Wisły jako podstawa nowej perspektywy osiedleńczej
PublicationW artykule opisano badania dotyczące rewitalizacji dróg wodnych Delty Wisły. Problematyka badawcza dotyczy rozwoju obszaru na styku lądu i wody. Wpływ lokalizacji nadmorskiej, i związana z nią problematyka wynikająca z położenia poniżej poziomu morza są dostrzegalne nawet do stu kilometrów od wybrzeża. Dodatkowym aspektem, który ujemnie wpływa na rozwój obszaru delty po II wojnie światowej jest utrata tożsamości regionalnej. Dlatego...
-
AAM toolkit: a system for visual object appearance modeling
PublicationAktywne modele wyglądu (AAM) mogą być traktowane jako zaawansowana metoda analizy informacji multimedialnych, pozwalająca na lokalizowanie i rozpoznawanie obiektów w obrazach statycznych i sekwencjach wideo. Pomimo tego że ukazało się wiele publikacji dotyczących AAM, przejście od koncepcji teoretycznych do działającej implementacji stanowi nadal duże wyzwanie. W pracy przedstawiono przygotowany przez autorów pakiet oprogramowania...
-
Multimedialny system nadzoru dla straży granicznej – projekt STRADAR
PublicationSTRADAR jest systemem nadzoru przeznaczonym do wspierania działań operacyjnych morskiej straży granicznej, umożliwiającym zbieranie, przetwarzanie i udostępnianie informacji i danych pochodzących z takich sensorów, jak radary, kamery wideo, AIS, GPS, aparaty fotograficzne oraz z połączeń audio, wiadomości SMS, plików i notatek. Informacje te mogą być udostępniane na bieżąco oraz archiwalnie z synchronizacją zdarzeń lub bez synchronizacji....