Filters
total: 1948
filtered: 1519
displaying 1000 best results Help
Search results for: SPEECH STRETCHING
-
A survey of automatic speech recognition deep models performance for Polish medical terms
PublicationAmong the numerous applications of speech-to-text technology is the support of documentation created by medical personnel. There are many available speech recognition systems for doctors. Their effectiveness in languages such as Polish should be verified. In connection with our project in this field, we decided to check how well the popular speech recognition systems work, employing models trained for the general Polish language....
-
Database of speech and facial expressions recorded with optimized face motion capture settings
PublicationThe broad objective of the present research is the analysis of spoken English employing a multiplicity of modalities. An important stage of this process, discussed in the paper, is creating a database of speech accompanied with facial expressions. Recordings of speakers were made using an advanced system for capturing facial muscle motion. A brief historical outline, current applications, limitations and the ways of capturing face...
-
Elimination of clicks from archive speech signals using sparse autoregressive modeling
PublicationThis paper presents a new approach to elimination of impulsivedisturbances from archive speech signals. The proposedsparse autoregressive (SAR) signal representation is given ina factorized form - the model is a cascade of the so-called formantfilter and pitch filter. Such a technique has been widelyused in code-excited linear prediction (CELP) systems, as itguarantees model stability. After detection of noise pulses usinglinear...
-
Stochastic Integration and Long Term Predictor Estimation under Noisy Conditions for Speech Enhancement
Publication -
Subjective Quality Evaluation of Speech Signals Transmitted via BPL-PLC Wired System
PublicationThe broadband over power line – power line communication (BPL-PLC) cable is resistant to electricity stoppage and partial damage of phase conductors. It maintains continuity of transmission in case of an emergency. These features make it an ideal solution for delivering data, e.g. in an underground mine environment, especially clear and easily understandable voice messages. This paper describes a subjective quality evaluation of...
-
Difference in Perceived Speech Signal Quality Assessment Among Monolingual and Bilingual Teenage Students
PublicationThe user perceived quality is a mixture of factors, including the background of an individual. The process of auditory perception is discussed in a wide variety of fields, ranging from engineering to medicine. Many studies examine the difference between musicians and non-musicians. Since musical training develops musical hearing and other various auditory capabilities, similar enhancements should be observable in case of bilingual...
-
Weakly-Supervised Word-Level Pronunciation Error Detection in Non-Native English Speech
PublicationWe propose a weakly-supervised model for word-level mispronunciation detection in non-native (L2) English speech. To train this model, phonetically transcribed L2 speech is not required and we only need to mark mispronounced words. The lack of phonetic transcriptions for L2 speech means that the model has to learn only from a weak signal of word-level mispronunciations. Because of that and due to the limited amount of mispronounced...
-
Bimodal classification of English allophones employing acoustic speech signal and facial motion capture
PublicationA method for automatic transcription of English speech into International Phonetic Alphabet (IPA) system is developed and studied. The principal objective of the study is to evaluate to what extent the visual data related to lip reading can enhance recognition accuracy of the transcription of English consonantal and vocalic allophones. To this end, motion capture markers were placed on the faces of seven speakers to obtain lip...
-
Hybrid of Neural Networks and Hidden Markov Models as a modern approach to speech recognition systems
PublicationThe aim of this paper is to present a hybrid algorithm that combines the advantages ofartificial neural networks and hidden Markov models in speech recognition for control purpos-es. The scope of the paper includes review of currently used solutions, description and analysis of implementation of selected artificial neural network (NN) structures and hidden Markov mod-els (HMM). The main part of the paper consists of a description...
-
Automated detection of pronunciation errors in non-native English speech employing deep learning
PublicationDespite significant advances in recent years, the existing Computer-Assisted Pronunciation Training (CAPT) methods detect pronunciation errors with a relatively low accuracy (precision of 60% at 40%-80% recall). This Ph.D. work proposes novel deep learning methods for detecting pronunciation errors in non-native (L2) English speech, outperforming the state-of-the-art method in AUC metric (Area under the Curve) by 41%, i.e., from...
-
Akustyczny obraz słowa na tle mowy etnicznej [The acoustic image of ethnic speech words]
Publication -
A Novel Method for Intelligibility Assessment of Nonlinearly Processed Speech in Spaces Characterized by Long Reverberation Times
PublicationObjective assessment of speech intelligibility is a complex task that requires taking into account a number of factors such as different perception of each speech sub-bands by the human hearing sense or different physical properties of each frequency band of a speech signal. Currently, the state-of-the-art method used for assessing the quality of speech transmission is the speech transmission index (STI). It is a standardized way...
-
Intra-subject class-incremental deep learning approach for EEG-based imagined speech recognition
PublicationBrain–computer interfaces (BCIs) aim to decode brain signals and transform them into commands for device operation. The present study aimed to decode the brain activity during imagined speech. The BCI must identify imagined words within a given vocabulary and thus perform the requested action. A possible scenario when using this approach is the gradual addition of new words to the vocabulary using incremental learning methods....
-
Mowa nienawiści (hate speech) a odpowiedzialność dostawców usług internetowych w orzecznictwie sądów europejskich
PublicationThe article analyses the phenomenon of hate speech in the Internet contrasted with the problem of responsability of Internet Service Providers for cases of such abuses of freedom of expression. The text provides an analysis of jurisprudence of two European Courts. On the one hand it presents the position of the European Court of Human Rights on the problem of hate speech: its definition and the liability for it as an exception...
-
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
PublicationThe problem of video framerate and audio/video synchronization in audio-visual speech recogni-tion is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
-
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
PublicationThe problem of video framerate and audio/video synchronization in audio-visual speech recognition is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
-
The development of speech in early childhood in children from twin pregnancies with twin-twin transfusion syndrome (TTTS)
Publication -
Minimum mean square error estimation of speech short-term predictor parameters under noisy conditions
Publication -
Cross-Lingual Knowledge Distillation via Flow-Based Voice Conversion for Robust Polyglot Text-to-Speech
PublicationIn this work, we introduce a framework for cross-lingual speech synthesis, which involves an upstream Voice Conversion (VC) model and a downstream Text-To-Speech (TTS) model. The proposed framework consists of 4 stages. In the first two stages, we use a VC model to convert utterances in the target locale to the voice of the target speaker. In the third stage, the converted data is combined with the linguistic features and durations...
-
Estimation of time-frequency complex phase-based speech attributes using narrow band filter banks
PublicationIn this paper, we present nonlinear estimators of nonstationary and multicomponent signal attributes (parameters, properties) which are instantaneous frequency, spectral (or group) delay, and chirp-rate (also known as instantaneous frequency slope). We estimate all of these distributions in the time-frequency domain using both finite and infinite impulse response (FIR and IIR) narrow band filers for speech analysis. Then, we present...
-
The spindle speed control for high speed milling vibration surveillance
PublicationW pracy przedstawiono zmodyfikowaną metodę nadzorowania drgań za pomocą sterowania optymalno-liniowego prędkością obrotową wrzeciona. Zamieszczono opis dynamiki skrawania, niestacjonarnego modelu obliczeniowego oraz metody nadzorowania drgań. Wykorzystano model modalny wirującego narzędzia, którego parametry wyznaczono zarówno metodami symulacji komputerowych, jak i - za pomocą eksperymentalnej analizy modalnej. Zamieszczono przykład...
-
Cyfrowa analiza mowy etnicznej – ekstrakcja kodu informacji [A digital analysis of ethnic speech – deciphering the information code]
Publication -
Quality Evaluation of Speech Transmission via Two-way BPL-PLC Voice Communication System in an Underground Mine
PublicationIn order to design a stable and reliable voice communication system, it is essential to know how many resources are necessary for conveying quality content. These parameters may include objective quality of service (QoS) metrics, such as: available bandwidth, bit error rate (BER), delay, latency as well as subjective quality of experience (QoE) related to user expectations. QoE is expressed as clarity of speech and the ability...
-
Цифровой анализ сигналов речи как инструмент сравнительного языкознания [A digital analysis of speech signals as an instrument in comparative linguistics]
Publication -
System przetwarzania i wizualizacji sygnału mowy dla potrzeb lingwistycznych = System of speech signal processing and visualisation of the results
PublicationW artykule przedstawiono sposób przetwarzania i wizualizacji sygnału mowy w formie prostego w obsłudze i relatywnie niedrogiego urządzenia do nagrywania sygnału akustycznego oraz przetwarzania cyfrowego wyselekcjonowanych fragmentów i wizualizacji uzyskanych rezultatów przekształceń. Zastosowano do tego celu komputer z kartą dźwiękową. Przetwarzanie cyfrowe oraz wizualizacja dokonywana była w oparciu o program MATLAB bezpośrednio...
-
The sensitiveness of the speed of pile displacement to speed variations of hammer in beating down process
PublicationIn this paper there is presented dynamical system described speed of pile displacement during beating down process. Its response is determined by using Heaviside operator. There is introduced the convergence with regulator in partially ordered space. There is given an answer to the question, whetdisplacement is sensitive to hammer's speed variations.
-
Speed and load torque observer application in high-speed train electric drive
PublicationW artykule przedstawiono zastosowanie diagnostyczne obserwatorów prędkości obrotowej i momentu obciążenia w układzie napędowym z silnikiem asynchronicznym. System diagnostyczny jest dedykowany do napędu pociągu szybkiego. Celem diagnostycznym jest monitorowanie stanu czujnika prędkości obrotowej wału silnika oraz układu przeniesienia momentu trakcyjnego. Analiza sygnałów obliczanych w obserwatorach stanu pozwala na wykrycie uszkodzeń...
-
Vibration surveillance of high speed ball end milling by the spindle speed control
PublicationPrzedstawiono zmodyfikowaną metodę nadzorowania drgań smukłych wirujących narzedzi skrawających za pomocą sterowania optymalno-liniowego prędkością obrotową wrzeciona. Podano opis dynamiki skrawania, niestacjonarnego modelu obliczeniowego oraz zmodyfikowanej metody nadzorowania drgań. Modyfikacja polega na tym, że do symulacji procesu nadzorowania wykorzystano model modalny freza, zaś parametry tego modelu wyznaczono albo numerycznie,...
-
Application of speed and load torque observers in high-speed train drive for diagnostic purposes
PublicationW artykule przedstawiono zastosowanie diagnostyczne obserwatorów prędkości obrotowej i momentu obciążenia w układzie napędowym z silnikiem asynchronicznym. System diagnostyczny jest dedykowany do napędu pociągu szybkiego. Celem diagnostycznym jest monitorowanie stanu czujnika prędkości obrotowej wału silnika oraz układu przeniesienia momentu trakcyjnego. Analiza sygnałów obliczanych w obserwatorach stanu pozwala na wykrycie uszkodzeń...
-
Simulation and experiments of high speed milling vibration surveillence with a use of changing spindle speed
PublicationPraca poświęcona jest nowemu podejściu do nadzorowania drgań wirujących narzędzi w nowoczesnych frezarkach. Przeprowadzono analizy dynamiczne skrawania smukłym frezem kulistym. Opisano dynamikę sterowanego układu nistacjonarnego. Jako rezultat sterowania optymalno-liniowego otrzymano program chwilowych zmian prędkości obrotowej pozwalający uzyskać redukację drgań. Metodę zweryfikowano eksperymentalnie na frezarce Alcera Gambin...
-
Towards simulations and experiments of high speed milling vibration surveillance by the spindle speed control
PublicationPraca poświęcona jest nowemu podejściu do nadzorowania drgań wirujących narzędzi we współczesnych frezarkach. Przeprowadzono analizę skrawania smukłym frezem kulistym. Jako rezultat sterowania optymlano-liniowego otrzymano program chwilowych zmian prędkości obrotowej pozwalający uzyskać redukację drgań. Metodę zweryfikowano eksperymentalnie.
-
Speed observer of induction machine based on backstepping and sliding mode for low‐speed operation
PublicationThis paper presents a speed observer design based on backstepping and slidingmode approaches. The inputs to the observer are the stator current and thevoltage vector components. This observer structure is extended to the integra-tors. The observer stabilizing functions contain the appropriate sliding surfaceswhich result from the Lyapunov function. The rotor angular speed is obtainedfrom the non‐adaptive formula with a sliding...
-
Acoustic journal bearing – Performance under various load and speed conditions speed conditions
PublicationThe paper presents results of experimental testing aiming at finding out what effect system of piezo-electric actuators (PZTs)attached to an aerodynamic journal bearing has on the magnitude of shaft's motion within the bearing operating at specified speed and load. The results clearly demonstrate effectiveness of PZTs in mitigating the shaft's motion thus contributing to the increased stability of the bearing. This stabilizing...
-
System przetwarzania i wizualizacji sygnału mowy dla potrzeb lingwistycznych [A system of speech signal processing and visualisation for linguistic purposes]
Publication -
Improvement of speech intelligibility in the presence of noise interference using the Lombard effect and an automatic noise interference profiling based on deep learning
PublicationThe Lombard effect is a phenomenon that results in speech intelligibility improvement when applied to noise. There are many distinctive features of Lombard speech that were recalled in this dissertation. This work proposes the creation of a system capable of improving speech quality and intelligibility in real-time measured by objective metrics and subjective tests. This system consists of three main components: speech type detection,...
-
Determination of optimal rotational speeds of circular saws
PublicationW pracy przedstawiono możliwości diagnostyczne stanowiska HewSaw do badania zachowania się pił w funkcji prędkości obrotowej piły. Wykazano, ze wyznaczone wartości prędkości optymalnych badanych pił leżą poniżej wartości zalecanych przez producenta. Praca z prędkościami wyższymi od optymalnych może prowadzić do zwiększenia strat materiałowych, a także stwarzać zagrożenia dla obsługi.
-
Neurocontrolled Car Speed System
PublicationThe features of the synthesis of neural controllers for the car speed control system are considered in this article. The task of synthesis is to determine the weight coefficients of neural networks that provide the implementation of proportional and proportional-integralderivative control laws. The synthesis of controllers is based on an approach that uses a reversed model of the standard. A model of the car speed control system with...
-
Intelligent multimedia solutions supporting special education needs.
PublicationThe role of computers in school education is briefly discussed. Multimodal interfaces development history is shortly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality....
-
Intelligent video and audio applications for learning enhancement
PublicationThe role of computers in school education is briefly discussed. Multimodal interfaces development history is shortly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality....
-
Chatter Vibration Surveillance by the Optimal-linear Control of Spindle Speed and Randomly Varying Spindle Speed
PublicationW artykule opisano dynamikę procesu skrawania smukłym frezem kulistym. Jako metodę nadzorowania drgań chatter zastosowano zmienną prędkość obrotową wrzeciona. Jest ona dobierna z wykorzystaniem energetycznego wskaźnika jakości co prowadzi do optymalno-liniowego sterowania prędkością obrotową. Dodatkowo zastosowano nadzorowanie drgań chatter poprzez losowo zmienną prędkość obrotową.
-
Metoda i algorytmy modyfikacji sygnału do celu wspomagania rozumienia mowy przez osoby z pogorszoną rozdzielczością czasową słuchu
PublicationPrzedmiotem badań przeprowadzonych w ramach rozprawy są metody modyfikacji czasu trwania sygnału (ang. Time Scale Modification –TSM) mowy operujące w czasie rzeczywistym oraz ocena ich wpływu na rozumienie wypowiedzi przez osoby z pogorszoną rozdzielczością czasową słuchu. Pogorszona rozdzielczość słuchu jest jednym z symptomów związanych z ośrodkowymi zaburzeniami słuchu (ang. Cetnral Auditory Processing Disorder – CAPD). W odróżnieniu...
-
Critical Review on Robust Speed Control Techniques for Permanent Magnet Synchronous Motor (PMSM) Speed Regulation
PublicationThe permanent magnet synchronous motor (PMSM) is a highly efficient energy saving machine. Due to its simple structural characteristics, good heat radiation capability, and high efficiency, PMSMs are gradually replacing AC induction motors in many industrial applications. The PMSM has a nonlinear system and lies on parameters that differ over time with complex high-class dynamics. To achieve the excessive performance operation...
-
Chatter surveillance with the creation of a map of optimal spindle speeds
PublicationW pracy przedstawiono metodę nadzorowania drgań samowzbudnych typu chatter. Wyznaczono wartości optymalnych prędkości obrotowych wrzeciona dla poszczególnych punktów na powierzchni przedmiotu obrabianego, wykorzystując rożne techniki analizy modalnej. Badania eksperymentalne wykazały, że otrzymana w ten sposób mapa optymalnych prędkości obrotowych wrzeciona jest skutecznym narzędziem do eliminacji drgań chatter w procesie obróbki...
-
Speace frienly for the blind = Przestrzeń przyjazna dla niewidomych
PublicationThe article presents issues connected with accessibility of public space for people with eyesight disabilities. The use of the extravisual spatial stimuli in shaping the urban environment has been analysed. Spaces in which musltisensory spatial reception is feasible become user-friendly, as they come to meet the changing needs of their users. The article introduces a system of textures aiding spatial orientation, navigation and...
-
Variable speed small hydropower plant
Publication -
Optimisation of inland vessels' route speed.
PublicationW pracy przedstawiono rezultaty badań dotyczących metody doboru prędkości i oceny zapotrzebowanej mocy napędu statków śródlądowych, przydatnej zarówno we wstępnych etapach projektowania, jak i w zarządzaniu eksploatacją taboru śródlądowego, np. przy ustalaniu harmonogramu rejsów pasażerskich statków wycieczkowych. Do rozwiązania problemu zastosowano metodę optymalizacji nieliniowej z ograniczeniami, opartą na minimalizacji kosztu...
-
Control design for slow speed positioning
PublicationThe problem under study is a synthesis of position and heading control system for low frequency model of surface vessel described by 3 DOF mathematical model. The recursive vectorial backstepping control design was used to keep fixed position and heading in presence of wave disturbances. The controller has been simulated on computer model of scaled supply vessel. It has been assumed that the actuators produce generalized forces...
-
Language material for English audiovisual speech recognition system developmen . Materiał językowy do wykorzystania w systemie audiowizualnego rozpoznawania mowy angielskiej
PublicationThe bi-modal speech recognition system requires a 2-sample language input for training and for testing algorithms which precisely depicts natural English speech. For the purposes of the audio-visual recordings, a training data base of 264 sentences (1730 words without repetitions; 5685 sounds) has been created. The language sample reflects vowel and consonant frequencies in natural speech. The recording material reflects both the...
-
Failure analysis of a high-speed induction machine driven by a SiC-inverter and operating on a common shaft with a high-speed generator
PublicationDue to ongoing research work, a prototype test rig for testing high-speed motors/generators has been developed. Its design is quite unique as the two high- speed machines share a single shaft with no support bearings between them. A very high maximum operating speed, up to 80,000 rpm, was required. Because of the need to minimise vibration during operation at very high rotational speeds, rolling bearings were used. To eliminate...
-
Frequency characteristics of induction machine speed observers
PublicationWłaściwości napędu bezczujnikowego z silnikiem indukcyjnym zależą od struktury obserwatora prędkości. System ten wymaga starannej analizy w przypadku uszkodzenia maszyny, której celem jest odpowiedź na pytania: jak układ regulacji pracuje przy niesymetrii maszyny spowodowanej np. uszkodzeniem klatki wirnika oraz jak obserwator odtwarza pulsacje prędkości spowodowane niesymetrią maszyny. W artykule przedstawiono charakterystyki...