Search results for: REAL-TIME SPEECH STRETCHING
-
Virtual keyboard controlled by eye gaze employing speech synthesis
PublicationThe article presents the speech synthesis integrated into the eye gaze tracking system. This approach can significantly improve the quality of life of physically disabled people who are unable to communicate. The virtual keyboard (QWERTY) is an interface which allows for entering the text for the speech synthesizer. First, this article describes a methodology of determining the fixation point on a computer screen. Then it presents...
-
Virtual Keyboard controlled by eye gaze employing speech synthesis
PublicationThe article presents the speech synthesis integrated into the eye gaze tracking system. This approach can significantly improve the quality of life of physically disabled people who are unable to communicate. The virtual keyboard (QWERTY) is an interface which allows for entering the text for the speech synthesizer. First, this article describes a methodology of determining the fixation point on a computer screen. Then it presents...
-
Subjective Quality Evaluation of Speech Signals Transmitted via BPL-PLC Wired System
PublicationThe broadband over power line – power line communication (BPL-PLC) cable is resistant to electricity stoppage and partial damage of phase conductors. It maintains continuity of transmission in case of an emergency. These features make it an ideal solution for delivering data, e.g. in an underground mine environment, especially clear and easily understandable voice messages. This paper describes a subjective quality evaluation of...
-
Secured wired BPL voice transmission system
PublicationDesigning a secured voice transmission system is not a trivial task. Wired media, thanks to their reliability and resistance to mechanical damage, seem an ideal solution. The BPL (Broadband over Power Line) cable is resistant to electricity stoppage and partial damage of phase conductors, ensuring continuity of transmission in case of an emergency. It seems an appropriate tool for delivering critical data, mostly clear and understandable...
-
Speech Intelligibility Measurements in Auditorium
PublicationSpeech intelligibility was measured in Auditorium Novum at the Technical University of Gdansk (seating capacity 408, volume 3300 m³). Articulation tests were conducted; STI and Early Decay Time (EDT) coefficients were measured. The negative contribution of noise to speech intelligibility was taken into account. Subjective measurements and objective tests reveal high speech intelligibility at most seats in the auditorium. Correlation was found between...
-
Marking the Allophones Boundaries Based on the DTW Algorithm
PublicationThe paper presents an approach to marking the boundaries of allophones in the speech signal based on the Dynamic Time Warping (DTW) algorithm. Setting and marking of allophones boundaries in continuous speech is a difficult issue due to the mutual influence of adjacent phonemes on each other. It is this neighborhood on the one hand that creates variants of phonemes that is allophones, and on the other hand it affects that the border...
-
An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics
PublicationThe speech with the Lombard effect has been extensively studied in the context of speech recognition or speech enhancement. However, few studies have investigated the Lombard effect in the context of speech synthesis. The aim of this paper is to create a mathematical model that allows for retaining the Lombard effect. These models could be used as a basis of a formant speech synthesizer. The proposed models are based on dividing...
-
Investigating Feature Spaces for Isolated Word Recognition
PublicationThe study addresses the issues related to the appropriateness of a two-dimensional representation of speech signal for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and time-frequency signal representation converted to the investigated feature spaces. In particular, waveforms and fractal dimension features of the signal were chosen for the time domain, and...
-
Detection and localization of selected acoustic events in acoustic field for smart surveillance applications
PublicationA method for automatic determination of position of chosen sound events such as speech signals and impulse sounds in 3-dimensional space is presented. The events are localized in the presence of sound reflections employing acoustic vector sensors. Human voice and impulsive sounds are detected using adaptive detectors based on modified peak-valley difference (PVD) parameter and sound pressure level. Localization based on signals...
-
Detection and localization of selected acoustic events in 3D acoustic field for smart surveillance applications
PublicationA method for automatic determination of position of chosen sound events such as speech signals and impulse sounds in 3-dimensional space is presented. The events are localized in the presence of sound reflections employing acoustic vector sensors. Human voice and impulsive sounds are detected using adaptive detectors based on modified peak-valley difference (PVD) parameter and sound pressure level. Localization based on signals...
-
Auditory-visual attention stimulator
PublicationNew approach to lateralization irregularities formation was proposed. The emphasis is put on the relationship between visual and auditory attention stimulation. In this approach hearing is stimulated using time scale modified speech and sight is stimulated by rendering the text of the currently heard speech. Moreover, displayed text is modified using several techniques i.e. zooming, highlighting etc. In the experimental part of...
-
Subjective and Objective Comparative Study of DAB+ Broadcast System
PublicationBroadcasting services seek to optimize their use of bandwidth in order to maximize user’s quality of experience. They aim to transmit high-quality digital speech and music signals at the lowest bitrate. They intend to offer the best quality under available conditions. Due to bandwidth limitations, audio quality is in conflict with the number of transmitted radio programs. This paper analyzes whether the quality of real-time digital...
-
Computer Controlled Systems Lab - 2023/2024
e-Learning CoursesComputer Controlled Systems Lab The course includes 5 individual projects and their laboratory implementation. The topics:- Job analysis and tuning digital servo- Usage of a PC computer and MatLab package for controlling the dynamic object as a model of the tethered helicopter- Use of C language and the PC to control the plant in real time- Use of assembly language, and a microcontroller to control the plant in real time- Use...
-
Computer Controlled Systems Lab - Nowy - Nowy
e-Learning CoursesComputer Controlled Systems Lab The course includes 5 individual projects and their laboratory implementation. The topics:- Job analysis and tuning digital servo- Usage of a PC computer and MatLab package for controlling the dynamic object as a model of the tethered helicopter- Use of C language and the PC to control the plant in real time- Use of assembly language, and a microcontroller to control the plant in real time- Use...
-
An audio-visual corpus for multimodal automatic speech recognition
PublicationA review of available audio-visual speech corpora and a description of a new multimodal corpus of English speech recordings is provided. The new corpus containing 31 hours of recordings was created specifically to assist audio-visual speech recognition systems (AVSR) development. The database related to the corpus includes high-resolution, high-framerate stereoscopic video streams from RGB cameras, depth imaging stream utilizing Time-of-Flight...
-
Noise profiling for speech enhancement employing machine learning models
PublicationThis paper aims to propose a noise profiling method that can be performed in near real-time based on machine learning (ML). To address challenges related to noise profiling effectively, we start with a critical review of the literature background. Then, we outline the experiment performed consisting of two parts. The first part concerns the noise recognition model built upon several baseline classifiers and noise signal features...
-
Multimodal English corpus for automatic speech recognition
PublicationA multimodal corpus developed for research of speech recognition based on audio-visual data is presented. Besides usual video and sound excerpts, the prepared database contains also thermovision images and depth maps. All streams were recorded simultaneously, therefore the corpus enables to examine the importance of the information provided by different modalities. Based on the recordings, it is also possible to develop a speech...
-
Comparative analysis of various transformation techniques for voiceless consonants modeling
PublicationIn this paper, a comparison of various transformation techniques, namely Discrete Fourier Transform (DFT), Discrete Cosine Transform (DCT) and Discrete Walsh Hadamard Transform (DWHT), is performed in the context of their application to voiceless consonant modeling. Speech features based on these transformation techniques are extracted. These features are mean and derivative values of cepstrum coefficients, derived from each transformation....
-
Distortion of speech signals in the listening area: its mechanism and measurements
PublicationThe paper deals with the problem of how the number and distribution of loudspeakers in speech reinforcement systems influence the quality of publicly addressed voice messages, namely speech intelligibility in the listening area. Linear superposition of time-shifted broadband waves of the same form and slightly different magnitudes that reach a listener from numerous coherent sources is accompanied by interference effects...
-
Speech Analytics Based on Machine Learning
PublicationIn this chapter, the process of speech data preparation for machine learning is discussed in detail. Examples of speech analytics methods applied to phonemes and allophones are shown. Further, an approach to automatic phoneme recognition involving optimized parametrization and a classifier belonging to machine learning algorithms is discussed. Feature vectors are built on the basis of descriptors coming from the music information...
-
Experimental data of galvanic electric cells measurements
Open Research DataInternal temperature of an electric cell can be measured and monitored using microsphere-based fiber-optic sensors with thin ALD ZnO coating. Their compact size will allow to integrate them easily and effectively within the electric cells. Utilization of presented sensors allows to detect, in real time, damages to the structure of the sensor head that...
-
Performance Analysis of the OpenCL Environment on Mobile Platforms
PublicationToday’s smartphones have more and more features that so far were only assigned to personal computers. Every year these devices are composed of better and more efficient components. Everything indicates that modern smartphones are replacing ordinary computers in various activities. High computing power is required for tasks such as image processing, speech recognition and object detection. This paper analyses the performance of...
-
Measurement spectrum obtained with the use of ZnO coated (100 nm) microsphere-based fiber-optic sensor - 200 Celsius degrees
Open Research DataApplication of a microsphere-based fiber-optic sensor with 100 nm zinc oxide (ZnO) coating, deposited by Atomic Layer Deposition (ALD) method, for temperature measurements between 100°C and 300°C, is presented. The main advantage of integrating a fiber-optic microsphere with a sensing device is the possibility of monitoring the integrity of the sensor...
-
Measurement spectrum obtained with the use of ZnO coated (100 nm) microsphere-based fiber-optic sensor - 100 Celsius degrees
Open Research DataApplication of a microsphere-based fiber-optic sensor with 100 nm zinc oxide (ZnO) coating, deposited by Atomic Layer Deposition (ALD) method, for temperature measurements between 100°C and 300°C, is presented. The main advantage of integrating a fiber-optic microsphere with a sensing device is the possibility of monitoring the integrity of the sensor...
-
Measurement spectrum obtained with the use of ZnO coated (100 nm) microsphere-based fiber-optic sensor - 300 Celsius degrees
Open Research DataApplication of a microsphere-based fiber-optic sensor with 100 nm zinc oxide (ZnO) coating, deposited by Atomic Layer Deposition (ALD) method, for temperature measurements between 100°C and 300°C, is presented. The main advantage of integrating a fiber-optic microsphere with a sensing device is the possibility of monitoring the integrity of the sensor...
-
Modeling and Designing Acoustical Conditions of the Interior – Case Study
PublicationThe primary aim of this research study was to model acoustic conditions of the Courtyard of the Gdańsk University of Technology Main Building, and then to design a sound reinforcement system for this interior. First, results of measurements of the parameters of the acoustic field are presented. Then, the comparison between measured and predicted values using the ODEON program is shown. Collected data indicate a long reverberation...
-
Antiviral activity of bee bread derived from Polish apiaries.
Open Research DataBee bread is a product of fermentation of bee-collected pollen and revealed a high nutritional value. Other bee products, such as honey and propolis, are known for their antiviral activity, but bee bread is still under investigation, thus its antiviral potential is still unspecified. For investigation antiviral activity of bee bread samples, cytotoxicity...
-
A Fortran-95 algorithm to solve the three-dimensional Higgs boson equation in the de Sitter space-time
Open Research DataA numerically efficient finite-difference technique for the solution of a fractional extension of the Higgs boson equation in the de Sitter space-time is designed. The model under investigation is a multidimensional equation with Riesz fractional derivatives of orders in (0,1)U(1,2], which considers a generalized potential and a time-dependent diffusion...
-
Ultrawideband transmission in physical channels: a broadband interference view
PublicationThe superposition of multipath components (MPC) of an emitted wave, formed by reflections from limiting surfaces and obstacles in the propagation area, strongly affects communication signals. In the case of modern wideband systems, the effect should be seen as a broadband counterpart of classical interference which is the cause of fading in narrowband systems. This paper shows that in wideband communications, the time- and frequency-domain...
-
Threshold photoelectron studies of isoxazole over the energy range 9.9-30 eV
PublicationThe threshold photoelectron spectrum of the isoxazole molecule, C3H3NO has been measured over the photon energy range 9.9-30 eV with the use of synchrotron radiation. In the 9.9-10.8 eV range, corresponding to photoionization from the highest occupied molecular orbital 3a"(π3), seven well resolved vibrational series have been observed and their modes are tentatively assigned. A strong adiabatic ionization, with an energy of 11.132...
-
Emilia Miszewska dr inż.
PeopleEmilia Miszewska was born in 1986 in Gdańsk. She graduated from Primary School No. 17 in Gdańsk with sports classes specializing in swimming and Janusz Kusociński Sports Secondary School No. 11 in Gdańsk. In 2005, she started uniform master's studies at the Faculty of Civil and Environmental Engineering, which she completed in 2011, defending her diploma thesis entitled "Analysis and development of fire protection guidelines and...
-
Vident-real: an intra-oral video dataset for multi-task learning
Open Research DataWe introduce Vident-real, a large dataset of 100 video sequences of intra-oral scenes from real conservative dental treatments performed at the Medical University of Gdańsk, Poland. The dataset can be used for multi-task learning methods including:
-
Automated Classifier Development Process for Recognizing Book Pages from Video Frames
PublicationOne of the latest developments made by publishing companies is introducing mixed and augmented reality to their printed media (e.g. to produce augmented books). An important computer vision problem that they are facing is classification of book pages from video frames. The problem is non-trivial, especially considering that typical training data is limited to only one digital original per book page, while the trained classifier...
-
Flock behavior and control
PublicationIn this paper we present the results of the Flock Behaviour and Control workshop cluster during “Shapes of Logic Conference 2015”. During the event, students got familiar with the techniques of both visual and sound real-time data processing. The second topic presented to students was a behaviour-based approach to the design process, mainly based on the mathematical rules set up by Craig Reynolds on swarm behaviour. The aim of the...
-
A Novel Approach to the Assessment of Cough Incidence
PublicationIn this paper we consider the problem of identification of cough events in patients suffering from chronic respiratory diseases. The information about frequency of cough events is necessary for medical treatment. The proposed approach is based on bidirectional processing of a measured vibration signal - cough events are localized by combining the results of forward-time and backward-time analysis. The signal is at first transformed...
-
Quality Evaluation of Novel DTD Algorithm Based on Audio Watermarking
PublicationEcho cancellers typically employ a doubletalk detection (DTD) algorithm in order to keep the adaptive filter from diverging in the presence of near-end speech signal or other disruptive sounds in the microphone signal. A novel doubletalk detection algorithm based on techniques similar to those used for audio signal watermarking was introduced by the authors. The application of the described DTD algorithm within acoustic echo cancellation...
-
Determining the noise impact on hearing using psychoacoustical noise dosimeter
PublicationThis research study presents the designed noise dosimeter based on psychoacoustical properties of the human hearing system and, at the same time, evaluation of time and frequency characteristics of noise. The designed noise dosimeter enables assessing temporary threshold shift (TTS) in critical bands in real time. In this way it is possible to monitor the hearing threshold shift continuously for people who stay in the harmful noise...
-
Hardware-Software Implementation of Basic Principles Simulator of Nuclear Reactor Processes
PublicationThe paper presents implementation process of basic principle simulators of a nuclear reactor processes. Simulators are based on point-models of processes: kinetics of neutrons, heat generation and exchange, poisoning and burning-up nuclear fuel. Reference simulator was developed in MATLAB/Simulink without taking into account real-time operation. Second simulator was built using the toolbox xPC with hard real-time requirements....
-
Vident-synth: a synthetic intra-oral video dataset for optical flow estimation
Open Research DataWe introduce Vident-synth, a large dataset of synthetic dental videos with corresponding ground truth forward and backward optical flows and occlusion masks. It can be used for:
-
Neural network based algorithm for hand gesture detection in a low-cost microprocessor applications
PublicationIn this paper the simple architecture of neural network for hand gesture classification was presented. The network classifies the previously calculated parameters of EMG signals. The main goal of this project was to develop simple solution that is not computationally complex and can be implemented on microprocessors in low-cost 3D printed prosthetic arms. As the part of conducted research the data set EMG signals corresponding...
-
Measurement spectrum obtained with the use of ZnO coated microsphere-based fiber-optic sensor - 140 Celsius degrees
Open Research DataApplication of a microsphere-based fiber-optic sensor with 200 nm zinc oxide (ZnO) coating, deposited by Atomic Layer Deposition (ALD) method, for temperature measurements between 100°C and 300°C, is presented. The main advantage of integrating a fiber-optic microsphere with a sensing device is the possibility of monitoring the integrity of the sensor...
-
Measurement spectrum obtained with the use of ZnO coated microsphere-based fiber-optic sensor - 160 Celsius degrees
Open Research DataApplication of a microsphere-based fiber-optic sensor with 200 nm zinc oxide (ZnO) coating, deposited by Atomic Layer Deposition (ALD) method, for temperature measurements between 100°C and 300°C, is presented. The main advantage of integrating a fiber-optic microsphere with a sensing device is the possibility of monitoring the integrity of the sensor...
-
Measurement spectrum obtained with the use of ZnO coated microsphere-based fiber-optic sensor - 180 Celsius degrees
Open Research DataApplication of a microsphere-based fiber-optic sensor with 200 nm zinc oxide (ZnO) coating, deposited by Atomic Layer Deposition (ALD) method, for temperature measurements between 100°C and 300°C, is presented. The main advantage of integrating a fiber-optic microsphere with a sensing device is the possibility of monitoring the integrity of the sensor...
-
Measurement spectrum obtained with the use of ZnO coated microsphere-based fiber-optic sensor - 220 Celsius degrees
Open Research DataApplication of a microsphere-based fiber-optic sensor with 200 nm zinc oxide (ZnO) coating, deposited by Atomic Layer Deposition (ALD) method, for temperature measurements between 100°C and 300°C, is presented. The main advantage of integrating a fiber-optic microsphere with a sensing device is the possibility of monitoring the integrity of the sensor...
-
Measurement spectrum obtained with the use of ZnO coated microsphere-based fiber-optic sensor - 200 Celsius degrees
Open Research DataApplication of a microsphere-based fiber-optic sensor with 200 nm zinc oxide (ZnO) coating, deposited by Atomic Layer Deposition (ALD) method, for temperature measurements between 100°C and 300°C, is presented. The main advantage of integrating a fiber-optic microsphere with a sensing device is the possibility of monitoring the integrity of the sensor...
-
Measurement spectrum obtained with the use of ZnO coated microsphere-based fiber-optic sensor - 250 Celsius degrees
Open Research DataApplication of a microsphere-based fiber-optic sensor with 200 nm zinc oxide (ZnO) coating, deposited by Atomic Layer Deposition (ALD) method, for temperature measurements between 100°C and 300°C, is presented. The main advantage of integrating a fiber-optic microsphere with a sensing device is the possibility of monitoring the integrity of the sensor...
-
Measurement spectrum obtained with the use of ZnO coated microsphere-based fiber-optic sensor - 210 Celsius degrees
Open Research DataApplication of a microsphere-based fiber-optic sensor with 200 nm zinc oxide (ZnO) coating, deposited by Atomic Layer Deposition (ALD) method, for temperature measurements between 100°C and 300°C, is presented. The main advantage of integrating a fiber-optic microsphere with a sensing device is the possibility of monitoring the integrity of the sensor...
-
Measurement spectrum obtained with the use of ZnO coated microsphere-based fiber-optic sensor - 300 Celsius degrees
Open Research DataApplication of a microsphere-based fiber-optic sensor with 200 nm zinc oxide (ZnO) coating, deposited by Atomic Layer Deposition (ALD) method, for temperature measurements between 100°C and 300°C, is presented. The main advantage of integrating a fiber-optic microsphere with a sensing device is the possibility of monitoring the integrity of the sensor...
-
Measurement spectrum obtained with the use of ZnO coated microsphere-based fiber-optic sensor - 270 Celsius degrees
Open Research DataApplication of a microsphere-based fiber-optic sensor with 200 nm zinc oxide (ZnO) coating, deposited by Atomic Layer Deposition (ALD) method, for temperature measurements between 100°C and 300°C, is presented. The main advantage of integrating a fiber-optic microsphere with a sensing device is the possibility of monitoring the integrity of the sensor...
-
Measurement spectrum obtained with the use of ZnO coated microsphere-based fiber-optic sensor - 190 Celsius degrees
Open Research DataApplication of a microsphere-based fiber-optic sensor with 200 nm zinc oxide (ZnO) coating, deposited by Atomic Layer Deposition (ALD) method, for temperature measurements between 100°C and 300°C, is presented. The main advantage of integrating a fiber-optic microsphere with a sensing device is the possibility of monitoring the integrity of the sensor...