Filters
total: 2939
displaying 1000 best results Help
Search results for: AUDIO PROCESSING OBJECTS
-
Special forms of echo visual representation in an ahead looking sonar.
PublicationThe paper discusses ways to organise visual representation in a multi-beam ahead looking sonars whose function is to detect objects on the bottom and in pelagic zones. Forms of visual representation are shown and illustrated on the basic screen (panoramic representation and setting, alarms) and on the auxiliary screen (type A, B and special). Special forms of visual representation are mainly used in detecting objects in difficult...
-
Detection of vehicles stopping in restricted zones in video from surveillance cameras
PublicationAn algorithm for detection of vehicles that stop in restricted areas, e.g. excluded by traffic rules, is proposed. Classic approaches based on object tracking are inefficient in high traffic scenes because of tracking errors caused by frequent object merging and splitting. The proposed algorithm uses the background subtraction results for detection of moving objects, then pixels belonging to moving objects are tested for stability....
-
Research into the Movements of Surface Water Masses in the Basins Adjacent to the Port
PublicationThis paper presents the results of the practical and simulation research into determining the routes of movement of small objects moving together with surface water masses in basins adjacent to the port. The results of this research were referenced against the modelling of routes of small objects in port channel basins. The results of practical research concerning the movement of small objects in basins adjacent to the port were...
-
MINERAL MATTER IN MUNICIPAL SOLID WASTE
PublicationMunicipal solid waste (MSW) contains mineral materials which are seldom considered as a potential resource. Currently, the waste management sector pays attention to recyclable parts, biodegradable material, waste-to-energy fraction, and residues after waste reuse and recycle. In contrast, this study focus as on the mineral matter in MSW. The aim was to analyze and discuss the sources of mineral matter in MSW, the impact which the...
-
Henryk Krawczyk prof. dr hab. inż.
PeopleDyscyplina naukowa: informatyka Sprawował urząd rektora od 2008 do 2016. Urodził się 20 maja 1946 r. w Dybowie. Studia wyższe ukończył w 1969 r. na Wydziale Elektroniki Politechniki Gdańskiej, uzyskując tytuł magistra inżyniera w zakresie informatyki. W latach 1969–1972 pracował w Przemysłowym Instytucie Telekomunikacji. W 1972 r. rozpoczął pracę na Wydziale Elektroniki Politechniki Gdańskiej, gdzie w 1976 r. uzyskał doktorat,...
-
Time for temporariness! Temporary architecture - whim or necessity?
PublicationWhat are the qualities of temporary architectural objects that make them helpful instruments for evolving the image of contemporary urban space? Six hypotheses provide rationales for why current temporary architecture seems to be a remedy for dysfunctional city structures. The research was limited to the temporary architectural objects constructed in open-air city zones, including the architecture of events. The temporary objects'...
-
Sacral sound-engineering
PublicationOrganologic and campanologic acoustical problems due to applications to sacral objects are characterized on ground of numerous reviewed publications and engineering reports. Participations of several involved research centres, mostly Polish, at solving these problems are evaluated. Some desirable future developments are indicated. Appendices bring examples of documentation on selected investigated objects.
-
An Approach to Bass Enhancement in Portable Computers Employing Smart Virtual Bass Synthesis Algorithms
PublicationThe aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The developed algorithms are related to intelligent, rule-based setting of synthesis parameters according to music genre of an audio excerpt and to the type of a portable device in use. To find optimum synthesis parameters of the VBS algorithms, subjective listening tests based on a parametric procedure...
-
Sparse autoregressive modeling
PublicationIn the paper the comparison of the popular pitch determination (PD) algorithms for thepurpose of elimination of clicks from archive audio signals using sparse autoregressive (SAR)modeling is presented. The SAR signal representation has been widely used in code-excitedlinear prediction (CELP) systems. The appropriate construction of the SAR model is requiredto guarantee model stability. For this reason the signal representation...
-
Distributed Framework for Visual Event Detection in Parking Lot Area
PublicationThe paper presents the framework for automatic detection of various events occurring in a parking lot basing on multiple camera video analysis. The framework is massively distributed, both in the logical and physical sense. It consists of several entities called node stations that use XMPP protocol for internal communication and SRTP protocol with Jingle extension for video streaming. Recognized events include detecting parking...
-
Innovative method of localization airplanes in VCS (VCS-MLAT) distributed system
PublicationThe article presents the concept and the structure of the localization module. The prototype module is the part of the VCS (VCS-MLAT) localization distributed system. The device receives the audio signal transmitted in airplanes band (118 MHz – 136 MHz). Received data with the timestamps are send to the main server. The data from multiple devices estimates the localization of the airplane. The main aim of the project is the analysis...
-
Advances in Neural Information Processing Systems (Advances in Neural Information Processing Systems [NIPS])
Conferences -
The use of mathematical models for diagnosis of activated sludge systems in WWTP
PublicationIn this study diagnosis of activated sludge systems in wastewater treatment plant (WWTP) was investigated. Diagnosis of technical objects can be realized in many ways. One of the divisions of the diagnostic methods include modelling with or without a model of the object. The first of these is the analysis of the symptoms for which, based on the parameter values, the abnormality in the diagnosed objects are sought. Another way is...
-
Historical Wreck Inventory
Open Research DataThe measurement solution results in a point cloud obtained from the Leica P30 laser scanner. Another element is the processing of photos into point clouds with the Zenmuse P1 camera of the unmanned Matrice 300 RTK aircraft. The measurement provides complete geometrical information about the wreck. The measurement took place as part of the Photogrammetry...
-
Literature Review on Conceptualisation of Online Consumer Engagement
PublicationThe purpose of the current study is to develop a literature review on “online consumer engagement” (OCE). Articles from 2006 to 2016 published in the marketing journals and other related journals have been reviewed to summarise the OCE concept. Although there is not an agreed definition and conceptualisation of OCE, this study classified the concept as either behavioural or psychological within the dimensions of cognitive, emotional,...
-
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention
PublicationThis paper describes two novel complementary techniques that improve the detection of lexical stress errors in non-native (L2) English speech: attention-based feature extraction and data augmentation based on Neural Text-To-Speech (TTS). In a classical approach, audio features are usually extracted from fixed regions of speech such as the syllable nucleus. We propose an attention-based deep learning model that automatically de...
-
Subjective and Objective Comparative Study of DAB+ Broadcast System
PublicationBroadcasting services seek to optimize their use of bandwidth in order to maximize user’s quality of experience. They aim to transmit high-quality digital speech and music signals at the lowest bitrate. They intend to offer the best quality under available conditions. Due to bandwidth limitations, audio quality is in conflict with the number of transmitted radio programs. This paper analyzes whether the quality of real-time digital...
-
Cross-domain applications of multimodal human-computer interfaces
PublicationDeveloped multimodal interfaces for education applications and for disabled people are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with mouth gestures and audio interface for speech stretching for hearing impaired and stuttering people and intelligent pen allowing for diagnosing and ameliorating developmental dyslexia. The eye-gaze tracking system named...
-
Determination of Mechanical Energy Loss in Steady Flow by Means of Dissipation Power
PublicationWhen systems of simple geometry like pipes or regular channels are considered, the mechan- ical energy loss of the fluid flow can be expressed by local and longitudinal empirical energy loss coefficients. However, in the case of large spatially distributed objects, there are no simple approaches to this task. In practice, general recommendations addressing different types of objects are used, but they usually provide very coarse...
-
AITP - AI Thermal Pedestrians Dataset
Open Research DataAITP is a pedestrian detection dataset consisting of 9178 annotated thermal images. The training set contains 7801 images on which15448 pedestrians were labeled. The test set has 1377 images on which 2731 objects were marked. All images are in PNG file format (120x160) captured with FLIR Lepton Thermal Camera on the streets of Gdańsk, Poland. All pedestrians...
-
Virtual Engineering Factory: Creating Experience Base for Industry 4.0
PublicationABSTRACT In recent times, traditional manufacturing is upgrading and adopting Industry 4.0, which supports computerization of manufacturing by round-the-clock connection and communica- tion of engineering objects. Consequently, Decisional DNA- based knowledge representation of manufacturing objects, processes, and system is achieved by virtual engineering objects (VEO), virtual engineering processes (VEP), and virtual engineering...
-
Application of Shape From Shading Technique for Side Scan Sonar Images
PublicationSide scan sonar (SSS) is one of the most widely used imaging systems in the underwater environment. It is relatively cheap and easy to deploy in comparison with more powerful sensors like multibeam echosounder or synthetic aperture sonar. Although, the SSS does not provide directly the seafloor bathymetry measurements. Its outputs are usually in a form of grey level acoustic images of seafloor. However, the analysis of such images...
-
Multi-Camera Vehicle Tracking Using Local Image Features and Neural Networks
PublicationA method for tracking moving objects crossing fields of view of multiple cameras is presented. The algorithm utilizes Artificial Neural Networks (ANNs). Each ANN is trained to recognize images of one moving object acquired by a single camera. Local image features calculated in the vicinity of automatically detected interest points are used as object image parameters. Next, ANNs are employed to identify the same objects captured...
-
The Empirical Application of Automotive 3D Radar Sensor for Target Detection for an Autonomous Surface Vehicle’s Navigation
PublicationAvoiding collisions with other objects is one of the most basic safety tasks undertaken in the operation of floating vehicles. Addressing this challenge is essential, especially during unmanned vehicle navigation processes in autonomous missions. This paper provides an empirical analysis of the surface target detection possibilities in a water environment, which can be used for the future development of tracking and anti-collision...
-
Empirical Methods in Natural Language Processing
Conferences -
International Conference on Image Analysis and Processing
Conferences -
Processing Declarative Knowledge, International Workshop
Conferences -
International Conference on Image and Signal Processing
Conferences -
International Conference on Neural Information Processing
Conferences -
Symposium on Frontiers of Massively Parallel Processing
Conferences -
IEEE International Conference on Image Processing
Conferences -
Natural Language Processing and Knowledge Engineering
Conferences -
International Workshop on Multimedia Signal Processing
Conferences -
Conference on Algorithms and Hardware for Parallel Processing
Conferences -
American Federation of Information Processing Societies
Conferences -
Optical and structural properties of polycrystalline CVD diamond films grown on fused silica optical fibres pre-treated by high-power sonication seeding
PublicationIn this paper, the growth of polycrystalline chemical vapour deposition (CVD) diamond thin films on fused silica optical fibres has been investigated. The research results show that the effective substrate seeding process can lower defect nucleation, and it simultaneously increases surface encapsulation. However, the growth process on glass requires high seeding density. The effects of suspension type and ultrasonic power were...
-
Marek Czachor prof. dr hab.
People -
Methodology and technology for the polymodal allophonic speech transcription
PublicationA method for automatic audiovisual transcription of speech employing: acoustic and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e. the changes in the articulatory setting of speech organs for...
-
Methodology and technology for the polymodal allophonic speech transcription
PublicationA method for automatic audiovisual transcription of speech employing: acoustic, electromagnetical articulography and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e., the changes in the articulatory...
-
Sound engineering as our commitment to its creators in Poland
PublicationSound engineering is an interdisciplinary and rapidly expanding domain. It covers many aspects, such as sound perception, studio and sound mastering technology, music information retrieval including content-based search systems and automatic music transcription frameworks, sound synthesis, sound restoration, electroacoustics, and other ones constituting multimedia technology. Moreover, machine learning methods applied to the topics...
-
The impact of cooking method on the phenolic composition, total antioxidant activity and starch digestibility of rice (Oryza sativa L.)
PublicationThis study investigated changes in the phenolic composition, total antioxidant activity (TAA) and starch digestibility in white and brown rice due to three different cooking procedures, and subsequent reheating of cooked rice after storage. Among the analyzed samples, brown rice showed the highest TAA and phenolic content (622.5 mg kg-1 DW). All cooking methods resulted in significant decrease of phenolic content and TAA of rice...
-
The Importance of Contextual Topology in the Process of Harmonization of the Spatial Databases on Example BDOT500
PublicationIn this work, we present two detailed problems of topological errors in spatial database. Both issues are inconsistencies in the database, i.e. interior topological relationships layers of buildings and the relationship between the buildings layer and the layer of plots. That inconsistency is related to the residual polygons that arise as a result of overlapping objects, or gaps between objects. The occurrence of this type of error...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S1
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S4
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S2
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S5
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S3
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S6
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
Resolving conflicts in object tracking for automatic detection of events in video
PublicationW referacie przedstawiono algorytm rozwiązywania konfliktów w śledzeniu obiektów ruchomych. Proponowana metoda wykorzystuje predykcję stanu obiektu obliczaną przez filtry Kalmana oraz dopasowuje wykryte obiekty do struktur śledzących ich ruch na podstawie deskryptorów koloru i tekstury. Omówiono specyficzne sytuacje powodujące konflikty, takie jak rozdzielanie obiektów. Przedstawiono wyniki testów. Algorytm może być zastosowany...
-
Cultural Heritage in Spatial Planning
PublicationThe cultural heritage objects of each country should have a major impact on the development of space. Unfortunately, most often the investment needs prevail and only the most precious historical objects are protected. Thus often a monument is preserved, but its surroundings (which put it in context) are lost forever. This article addressed the issues of cultural heritage in relation to the spatial planning system in Poland. The...