displaying 1000 best results Help
Search results for: rate of speech estimation
-
Examining Feature Vector for Phoneme Recognition
PublicationThe aim of this paper is to analyze usability of descriptors coming from music information retrieval to the phoneme analysis. The case study presented consists in several steps. First, a short overview of parameters utilized in speech analysis is given. Then, a set of time and frequency domain-based parameters is selected and discussed in the context of stop consonant acoustical characteristics. A toolbox created for this purpose...
-
Ultrawideband transmission in physical channels: a broadband interference view
PublicationThe superposition of multipath components (MPC) of an emitted wave, formed by reflections from limiting surfaces and obstacles in the propagation area, strongly affects communication signals. In the case of modern wideband systems, the effect should be seen as a broadband counterpart of classical interference which is the cause of fading in narrowband systems. This paper shows that in wideband communications, the time- and frequency-domain...
-
Determining Pronunciation Differences in English Allophones Utilizing Audio Signal Parameterization
PublicationAn allophonic description of English plosive consonants, based on audio-visual recordings of 600 specially selected words, was developed. First, several speakers were recorded while reading words from a teleprompter. Then, every word was played back from the previously recorded sample read by a phonology expert and each examined speaker repeated a particular word trying to imitate correct pronunciation. The next step consisted...
-
Emotions in polish speech recordings
Open Research DataThe data set presents emotions recorded in sound files that are expressions of Polish speech. Statements were made by people aged 21-23, young voices of 5 men. Each person said the following words / nie – no, oddaj - give back, podaj – pass, stop - stop, tak - yes, trzymaj -hold / five times representing a specific emotion - one of three - anger (a),...
-
Detection and localization of selected acoustic events in 3D acoustic field for smart surveillance applications
PublicationA method for automatic determination of position of chosen sound events such as speech signals and impulse sounds in 3-dimensional space is presented. The events are localized in the presence of sound reflections employing acoustic vector sensors. Human voice and impulsive sounds are detected using adaptive detectors based on modified peak-valley difference (PVD) parameter and sound pressure level. Localization based on signals...
-
Detection and localization of selected acoustic events in acoustic field for smart surveillance applications
PublicationA method for automatic determination of position of chosen sound events such as speech signals and impulse sounds in 3-dimensional space is presented. The evens are localized in the presence of sound reflections employing acoustic vector sensors. Human voice and impulsive sounds are detected using adaptive detectors based on modified peak-valley difference (PVD) parameter and sound pressure level. Localization based on signals...
-
A Novel Approach to the Assessment of Cough Incidence
PublicationIn this paper we consider the problem of identication of cough events in patients suffering from chronic respiratory diseases. The information about frequency of cough events is necessary to medical treatment. The proposed approach is based on bidirectional processing of a measured vibration signal - cough events are localized by combining the results of forward-time and backward-time analysis. The signal is at rst transformed...
-
Quality Evaluation of Novel DTD Algorithm Based on Audio Watermarking
PublicationEcho cancellers typically employ a doubletalk detection (DTD) algorithm in order to keep the adaptive filter from diverging in the presence of near-end speech signal or other disruptive sounds in the microphone signal. A novel doubletalk detection algorithm based on techniques similar to those used for audio signal watermarking was introduced by the authors. The application of the described DTD algorithm within acoustic echo cancellation...
-
Extracting concepts from the software requirements specification using natural language processing
PublicationExtracting concepts from the software require¬ments is one of the first step on the way to automating the software development process. This task is difficult due to the ambiguity of the natural language used to express the requirements specification. The methods used so far consist mainly of statistical analysis of words and matching expressions with a specific ontology of the domain in which the planned software will be applicable....
-
Measuring Pulse Rate with a Webcam
PublicationIn this paper a simple method of measuring the pulse rate is presented. Elaborated algorithm allows for efficient pulse rate registration directly from face images captured from a webcam. The desired signal is obtained by proper channel selection and principal component analysis. To determine the accuracy of the method an ECG signal is collected together with a video recordings. The effectiveness of the algorithm is considered...
-
The instantaneous frequency rate spectogram
PublicationAn accelerogram of the instantaneous phase of signal components referred to as an instantaneous frequency rate spectrogram (IFRS) is presented as a joint time-frequency distribution. The distribution is directly obtained by processing the short-time Fourier transform (STFT) locally. A novel approach to amplitude demodulation based upon the reassignment method is introduced as a useful by-product. Additionally, an estimator of energy...
-
Identification of Unstable Reference Points and Estimation of Displacements Using Squared Msplit Estimation
PublicationThe article presents a new version of the method for estimating parameters in a split functional model, which enables the determination of displacements of geodetic network points with constrained datum. The main aim of the study is to present theoretical foundations of Msplit CD estimation and its basic properties and possible applications. Particular attention was paid to the efficacy of the method in the context of geodetic...
-
Cross-domain applications of multimodal human-computer interfaces
PublicationDeveloped multimodal interfaces for education applications and for disabled people are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with mouth gestures and audio interface for speech stretching for hearing impaired and stuttering people and intelligent pen allowing for diagnosing and ameliorating developmental dyslexia. The eye-gaze tracking system named...
-
Subjective and Objective Comparative Study of DAB+ Broadcast System
PublicationBroadcasting services seek to optimize their use of bandwidth in order to maximize user’s quality of experience. They aim to transmit high-quality digital speech and music signals at the lowest bitrate. They intend to offer the best quality under available conditions. Due to bandwidth limitations, audio quality is in conflict with the number of transmitted radio programs. This paper analyzes whether the quality of real-time digital...
-
Impact of the glazed roof on acoustics of historic interiors
PublicationThe paper discusses the adverse acoustic phenomena occurring in the semi-open interiors (courtyards, yards) covered with a glass roof. Particularly negative is the rever-beration noise, which leads to the degradation of the utility functions of the resulting spaces. It involves the drastically reducing the intelligibility of speech, loss of natural sounding of music, problems with the sound system, as well as disturbances in the...
-
Analysis of a caustic formed by a spherical reflector: Impact of a caustic on architectural acoustics
PublicationFocusing sound in rooms intended for listening to music or speech is an acoustic defect. Design recommendations provide remedial steps to effectively prevent this. However, there is a category of objects of high historical or architectural value in which the sound focus correction is limited or even abandoned. This also applies to indoor or outdoor concert shells, installations for teaching and acoustic presentations, etc. The...
-
Balkan Stock Exchanges – Consideration of the Length of the Estimation Window in Similar Markets
PublicationPurpose: We study if capital markets in the Balkan are closely and positively related in terms of rate of return, risk, efficiency, and maximum cumulative loss in relation to different lengths of the estimation window. Design/Methodology/Approach: The research was carried out for the period from 01/01/2017 to 31/12/2019 using portfolio analysis. It was divided into an estimation window (01/01/2019 to 31/12/2019) and another with...
-
Lattice filter based autoregressive spectrum estimation with joint model order and estimation bandwidth adaptation
PublicationThe problem of parametric, autoregressive model based estimation of a time-varying spectral density function of a nonstationary process is considered. It is shown that estimation results can be considerably improved if identification of the autoregressive model is carried out using the two-sided doubly exponentially weighted lattice algorithm which combines results yielded by two one-sided lattice algorithms running forward in...
-
Radar Signal Parameters Estimation Using Phase Accelerogram in the Time-Frequency Domain
PublicationRadar signal parameter estimation, in the context of the reconstruction of the received signal in a passive radar utilizing other radars as a source of illumination, is one of the fundamental steps in the signal processing chain in such a device. The task is also a crucial one in electronic reconnaissance systems, e.g. ELINT (Electronic Intelligence) systems. In order to obtain accurate results it is important to measure, estimate...
-
Speech and Drama
Journals -
LANGUAGE AND SPEECH
Journals -
Highlighting interlanguage phoneme differences based on similarity matrices and convolutional neural network
PublicationThe goal of this research is to find a way of highlighting the acoustic differences between consonant phonemes of the Polish and Lithuanian languages. For this purpose, similarity matrices are employed based on speech acoustic parameters combined with a convolutional neural network (CNN). In the first experiment, we compare the effectiveness of the similarity matrices applied to discerning acoustic differences between consonant...
-
Analyzing the relationship between sound, color, and emotion based on subjective and machine-learning approaches
PublicationThe aim of the research is to analyze the relationship between sound, color, and emotion. For this purpose, a survey application was prepared, enabling the assignment of a color to a given speaker’s/singer’s voice recordings. Subjective tests were then conducted, enabling the respondents to assign colors to voice/singing samples. In addition, a database of voice/singing recordings of people speaking in a natural way and with expressed...
-
Smartphone application supporting independent movement of the blind
PublicationImproving comfort of life of blind people is a problem of great importance. Neither a white canenor a guide dog, although both very useful, can be considered as a tool for achieving fullindependence in everyday movement around the city. On the market there are some navigation toolsinspired by car navigation systems, but they have many flaws, ranging from positioninginaccuracies to high prices. The authors present their own solution...
-
ANALIZA PARAMETRÓW SYGNAŁU MOWY W KONTEKŚCIE ICH PRZYDATNOŚCI W AUTOMATYCZNEJ OCENIE JAKOŚCI EKSPRESJI ŚPIEWU
PublicationPraca dotyczy podejścia do parametryzacji w przypadku klasyfikacji emocji w śpiewie oraz porównania z klasyfikacją emocji w mowie. Do tego celu wykorzystano bazę mowy i śpiewu nacechowanego emocjonalnie RAVDESS (Ryerson Audio-Visual Database of Emotional Speech and Song), zawierającą nagrania profesjonalnych aktorów prezentujących sześć różnych emocji. Następnie obliczono współczynniki mel-cepstralne (MFCC) oraz wybrane deskryptory...
-
Pursuing Analytically the Influence of Hearing Aid Use on Auditory Perception in Various Acoustic Situations
PublicationThe paper presents the development of a method for assessing auditory perception and the effectiveness of applying hearing aids for hard-of-hearing people during short-term (up to 7 days) and longer-term (up to 3 months) use. The method consists of a survey based on the APHAB questionnaire. Additional criteria such as the degree of hearing loss, technological level of hearing aids used, as well as the user experience are taken...
-
Two-Stage Identification of Locally Stationary Autoregressive Processes and its Application to the Parametric Spectrum Estimation
PublicationThe problem of identification of a nonstationary autoregressive process with unknown, and possibly time-varying, rate of parameter changes, is considered and solved using the parallel estimation approach. The proposed two-stage estimation scheme, which combines the local estimation approach with the basis function one, offers both quantitative and qualitative improvements compared with the currently used single-stage methods.
-
Lattice filter based multivariate autoregressive spectral estimation with joint model order and estimation bandwidth adaptation
PublicationThe problem of parametric, autoregressive model based estimation of a time-varying spectral density function of a multivariate nonstationary process is considered. It is shown that estimation results can be considerably improved if identification of the autoregressive model is carried out using the two-sided doubly exponentially weighted lattice algorithm which combines results yielded by two one-sided lattice algorithms running...
-
Ordinal pattern statistics for the assessment of heart rate variability
PublicationThe recognition of all main features of a healthy heart rhythm (the so-called sinus rhythm) is still one of the biggest challenges in contemporary cardiology. Recently the interesting physiological phenomenon of heart rate asymmetry has been observed. This phenomenon is related to unbalanced contributions of heart rate decelerations and accelerations to heart rate variability. In this paper we apply methods based on the concept...
-
Artur Gańcza dr inż.
PeopleI received the M.Sc. degree from the Gdańsk University of Technology (GUT), Gdańsk, Poland, in 2019. I am currently a Ph.D. student at GUT, with the Department of Automatic Control, Faculty of Electronics, Telecommunications and Informatics. My professional interests include speech recognition, system identification, adaptive signal processing and linear algebra.
-
Preliminary estimation of groundwater recharge on Brda river outwash plain
PublicationEstimation of groundwater recharge is one of the most challenging subjects in hydrogeology. It is a critical factor influencing the pollution migration, assessment of aquifer vulnerability to contamination, small-scale groundwater budget calculation, modeling of nutrient cycling and detailed flow path calculations. In Poland an infiltration rate method is widely used, which depends on a system of rate coefficients referring to...
-
Audio Content and Crowdsourcing: A Subjective Quality Evaluation of Radio Programs Streamed Online
PublicationRadio broadcasting has been present in our lives for over 100 years. The transmission of speech and music signals accompanies us from an early age. Broadcasts provide the latest information from home and abroad. They also shape musical tastes and allow many artists to share their creativity. Modern distribution involves transmission over a number of terrestrial systems. The most popular are analog FM (Frequency Modulation) and...
-
Empirical analyses of robustness of the square Msplit estimation
PublicationThe paper presents Msplit estimation as an alternative to methods in the class of robust M-estimation. The analysis conducted showed that Msplit estimation is highly efficient in the identification of observations encumbered by gross errors, especially those of small or moderate values. The classical methods of robust estimation provide then unsatisfactory results. Msplit estimation also shows high robustness to single gross errors...
-
Analysis of the Accuracy of Pulse Estimation Using Smart Watches
PublicationThe purpose of this paper is to perform an analysis of the accuracy of the pulse estimation by comparing readings from a smartwatch with readings from medical devices. The study required writing applications that allow continuous pulse measurement. As a result, two applications were created for the smartwatch. The first one is dedicated to Android Wear devices, while the other one is compatible with Tizen watches. The next step...
-
Denitrifcation rate in the mainstream deammonification
PublicationThe conventional processes of biological nitrogen removal based on nitrification and denitrification does not fit properly into the concept of the circular economy. As the alternative one should consider the deammonification process, which is a combination of partial nitrification (nitritation) and Anammox processes. It consists of removing ammonium nitrogen from wastewater under anaerobic conditions by a group of autotrophic microorganisms....
-
Discovery of Stylistic Patterns in Business Process Textual Descriptions: IT Ticket Case
PublicationGrowing IT complexity and related problems, which are reflected in IT tickets,create a need for new qualitative approaches. The goal isto automate the extraction of main topics described in tickets in order to provide high quality support for the IT process workers and enablea smooth service delivery to the end user. Present paper proposes a method of knowledge extraction in a form of stylistic patterns in business...
-
Engineering Challenges in the Design of Cochlear Implants
PublicationHearing aids such as cochlear implants have been used by both adults and children for a long time. In addition, cochlear implants are used by patients who have severe hearing loss either by birth or after an accident. This paper aims to investigate the engineering challenges bounding the design of cochlear implants and present its possible solution...
-
A low complexity double-talk detector based on the signal envelope
PublicationA new algorithm for double-talk detection, intended for use in the acoustic echo canceller for voice communication applications, is proposed. The communication system developed by the authors required the use of a double-talk detection algorithm with low complexity and good accuracy. The authors propose an approach to doubletalk detection based on the signal envelopes. For each of three signals: the far-end speech, the microphone...
-
Analysis of allophones based on audio signal recordings and parameterization
PublicationThe aim of this study is to develop an allophonic description of English plosive consonants based on recordings of 600 specially selected words. Allophonic variations addressed in the study may have two sources: positional and contextual. The former one depends on the syllabic or prosodic position in which a particular phoneme occurs. Contextual allophony is conditioned by the local phonetic environment. Co-articulation overlapping...
-
Sample Rate Conversion with Fluctuating Resampling Ratio
PublicationIn this paper a sample rate conversion with continuouslychanging resampling ratio has been presented. The proposed implementation is based on variable fractional delay filter implemented using a Farrow structure. It have been demonstrated that using the proposed approach instantaneous resampling ratio can be freely changed. This allows for simulation of audio recored on magnetic tape with nonuniform velocity as well as removal...
-
Sample Rate Conversion with Fluctuating Resampling Ratio
PublicationIn this paper a sample rate conversion with continuously changing resampling ratio has been presented. The proposed implementation is based on variable fractional delay filter implemented using a Farrow structure. It have been demonstrated that using the proposed approach instantaneous resampling ratio can be freely changed. This allows for simulation of audio recored on magnetic tape with nonuniform velocity as well as removal...
-
New results on estimation bandwidth adaptation
PublicationThe problem of identification of a nonstationary autoregressive signal using non-causal estimation schemes is considered. Noncausal estimators can be used in applications that are not time-critical, i.e., do not require real-time processing. A new adaptive estimation bandwidth selection rule based on evaluation of pseudoprediction errors is proposed, allowing one to adjust tracking characteristics of noncausal estimators to unknown...
-
Estimation of electrode contact in capacitive ECG measurement
PublicationIn the paper a method of electrode’s contact estimation in capacitive electrocardiogram (CECG) is presented. Proposed solution allows estimation of contact quality for each individual electrode. This enables construction of multi-electrode CECG systems, where electrode pairs can be selected on the basis of the individual electrode contact quality.
-
Inflation rate (In percentage) in Iran
Open Research DataCurrently, a major concern of the Iranian economy is high inflation. Increasing prices of basic goods, and their increase in wages do not follow. At the same time the unemployment rate has raised. The following data set present the inflation rate between 2010-2020 (forecast).
-
Intelligent processing of stuttered speech.
PublicationW artykule zaprezentowano kilka metod analizy i automatycznego zliczania potknięć artykulacyjnych, związanych z jąkaniem się, opartych na wykorzystaniu algorytmów uczących się sztucznych sieci neuronowych i zbiorów przybliżonych.
-
Development Of Dynamic Method For Evaluation Of Corrosion Rate On The Example Of Organic Corrosion İnhibitor
PublicationMeasurements of the corrosion rate belong to the most important aspects of materials science. In order to reduce material loss corrosion inhibitors are used. However selection of proper inhibitor should be based on evaluation of its mechanism and effective concentrations. Mechanism of inhibition usually has dynamic character so physicochemical parameters are changing in time. Most of actually used methods...
-
The influence of time of hearing aid use on auditory perception in various acoustic situations
PublicationThe assessment of sound perception in hearing aids, especially in the context of benefits that a prosthesis can bring, is a complex issue. The objective parameters of the hearing aids can easily be determined. These parameters, however, do not always have a direct and decisive influence on the subjective assessment of quality of the patient’s hearing while using a hearing aid. The paper presents the development of a method for...
-
Separability Assessment of Selected Types of Vehicle-Associated Noise
PublicationMusic Information Retrieval (MIR) area as well as development of speech and environmental information recognition techniques brought various tools in-tended for recognizing low-level features of acoustic signals based on a set of calculated parameters. In this study, the MIRtoolbox MATLAB tool, designed for music parameter extraction, is used to obtain a vector of parameters to check whether they are suitable for separation of...
-
Automatic Emotion Recognition in Children with Autism: A Systematic Literature Review
PublicationThe automatic emotion recognition domain brings new methods and technologies that might be used to enhance therapy of children with autism. The paper aims at the exploration of methods and tools used to recognize emotions in children. It presents a literature review study that was performed using a systematic approach and PRISMA methodology for reporting quantitative and qualitative results. Diverse observation channels and modalities...
-
Employing Blended E-Learning to Improve Rate of Assignments Handing-In
PublicationIt has been observed that students hand in homework assignments at a notably low rate in introductory C programming course. A survey has revealed that the real issue was not student learning but instructor work organization. Based on survey results, the physical course has been complemented with an e-learning component to guide the homework process. Assignment handing-in rate significantly improved, as e-learning allowed the homework...