Filtry
wszystkich: 1966
wybranych: 1541
-
Katalog
Filtry wybranego katalogu
wyświetlamy 1000 najlepszych wyników Pomoc
Wyniki wyszukiwania dla: SPEECH PARAMETRIZATION
-
Database of speech and facial expressions recorded with optimized face motion capture settings
PublikacjaThe broad objective of the present research is the analysis of spoken English employing a multiplicity of modalities. An important stage of this process, discussed in the paper, is creating a database of speech accompanied with facial expressions. Recordings of speakers were made using an advanced system for capturing facial muscle motion. A brief historical outline, current applications, limitations and the ways of capturing face...
-
Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency
PublikacjaIn this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). New iterative algorithm for analysis of ICF of speech signal in presented. Obtained results are compared with commonly used methods to prove its accuracy and connection between ICF and pitch, particularly for narrowband-filtered speech signal.
-
Improving signal quality of a speech codec using hybrid perceptual-parametric algorithm
PublikacjaW artykule zaprezentowano hybrydową architekturę parametryczno-perceptualną kodeka mowy. Jego podstawę stanowi kodek CELP, który wspomagany jest kodekiem perceptualnym. Celem zastosowania proponowanej metody jest uzyskanie poprawy jakości kodowania sygnału mowy. Badaniom poddano dwie architektury, z których w jednej dźwięczne części sygnału rezydualnego kodeka CELP kodowane są perceptualnie. Drugi z proponowanych kodeków dokonuje...
-
Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency
PublikacjaIn this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). New iterative algorithm for analysis of ICF of speech signal in presented. Obtained results are compared with commonly used methods to prove its accuracy and connection between ICF and pitch, particularly for narrowband-filtered speech signal.
-
Elimination of clicks from archive speech signals using sparse autoregressive modeling
PublikacjaThis paper presents a new approach to elimination of impulsivedisturbances from archive speech signals. The proposedsparse autoregressive (SAR) signal representation is given ina factorized form - the model is a cascade of the so-called formantfilter and pitch filter. Such a technique has been widelyused in code-excited linear prediction (CELP) systems, as itguarantees model stability. After detection of noise pulses usinglinear...
-
A survey of automatic speech recognition deep models performance for Polish medical terms
PublikacjaAmong the numerous applications of speech-to-text technology is the support of documentation created by medical personnel. There are many available speech recognition systems for doctors. Their effectiveness in languages such as Polish should be verified. In connection with our project in this field, we decided to check how well the popular speech recognition systems work, employing models trained for the general Polish language....
-
Computer-assisted pronunciation training—Speech synthesis is almost all you need
PublikacjaThe research community has long studied computer-assisted pronunciation training (CAPT) methods in non-native speech. Researchers focused on studying various model architectures, such as Bayesian networks and deep learning methods, as well as on the analysis of different representations of the speech signal. Despite significant progress in recent years, existing CAPT methods are not able to detect pronunciation errors with high...
-
Study on Speech Transmission under Varying QoS Parameters in a OFDM Communication System
PublikacjaAlthough there has been an outbreak of multiple multimedia platforms worldwide, speech communication is still the most essential and important type of service. With the spoken word we can exchange ideas, provide descriptive information, as well as aid to another person. As the amount of available bandwidth continues to shrink, researchers focus on novel types of transmission, based most often on multi-valued modulations, multiple...
-
Combining visual and acoustic modalities to ease speech recognition by hearing impaired people
PublikacjaArtykuł prezentuje system, którego celem działania jest ułatwienie procesu treningu poprawnej wymowy dla osób z poważnymi wadami słuchu. W analizie mowy wykorzystane zostały parametry akutyczne i wizualne. Do wyznaczenia parametrów wizualnych na podstawie kształtu i ruchu ust zostały wykorzystane modele Active Shape Models. Parametry akustyczne bazują na współczynnikach melcepstralnych. Do klasyfikacji wypowiadanych głosek została...
-
Subjective Quality Evaluation of Speech Signals Transmitted via BPL-PLC Wired System
PublikacjaThe broadband over power line – power line communication (BPL-PLC) cable is resistant to electricity stoppage and partial damage of phase conductors. It maintains continuity of transmission in case of an emergency. These features make it an ideal solution for delivering data, e.g. in an underground mine environment, especially clear and easily understandable voice messages. This paper describes a subjective quality evaluation of...
-
Bimodal classification of English allophones employing acoustic speech signal and facial motion capture
PublikacjaA method for automatic transcription of English speech into International Phonetic Alphabet (IPA) system is developed and studied. The principal objective of the study is to evaluate to what extent the visual data related to lip reading can enhance recognition accuracy of the transcription of English consonantal and vocalic allophones. To this end, motion capture markers were placed on the faces of seven speakers to obtain lip...
-
Stochastic Integration and Long Term Predictor Estimation under Noisy Conditions for Speech Enhancement
Publikacja -
Hybrid of Neural Networks and Hidden Markov Models as a modern approach to speech recognition systems
PublikacjaThe aim of this paper is to present a hybrid algorithm that combines the advantages ofartificial neural networks and hidden Markov models in speech recognition for control purpos-es. The scope of the paper includes review of currently used solutions, description and analysis of implementation of selected artificial neural network (NN) structures and hidden Markov mod-els (HMM). The main part of the paper consists of a description...
-
Automated detection of pronunciation errors in non-native English speech employing deep learning
PublikacjaDespite significant advances in recent years, the existing Computer-Assisted Pronunciation Training (CAPT) methods detect pronunciation errors with a relatively low accuracy (precision of 60% at 40%-80% recall). This Ph.D. work proposes novel deep learning methods for detecting pronunciation errors in non-native (L2) English speech, outperforming the state-of-the-art method in AUC metric (Area under the Curve) by 41%, i.e., from...
-
Difference in Perceived Speech Signal Quality Assessment Among Monolingual and Bilingual Teenage Students
PublikacjaThe user perceived quality is a mixture of factors, including the background of an individual. The process of auditory perception is discussed in a wide variety of fields, ranging from engineering to medicine. Many studies examine the difference between musicians and non-musicians. Since musical training develops musical hearing and other various auditory capabilities, similar enhancements should be observable in case of bilingual...
-
Weakly-Supervised Word-Level Pronunciation Error Detection in Non-Native English Speech
PublikacjaWe propose a weakly-supervised model for word-level mispronunciation detection in non-native (L2) English speech. To train this model, phonetically transcribed L2 speech is not required and we only need to mark mispronounced words. The lack of phonetic transcriptions for L2 speech means that the model has to learn only from a weak signal of word-level mispronunciations. Because of that and due to the limited amount of mispronounced...
-
The Impact of Foreign Accents on the Performance of Whisper Family Models Using Medical Speech in Polish
PublikacjaThe article presents preliminary experiments investigating the impact of accent on the performance of the Whisper automatic speech recognition (ASR) system, specifically for the Polish language and medical data. The literature review revealed a scarcity of studies on the influence of accents on speech recognition systems in Polish, especially concerning medical terminology. The experiments involved voice cloning of selected individuals...
-
Akustyczny obraz słowa na tle mowy etnicznej [The acoustic image of ethnic speech words]
Publikacja -
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
PublikacjaThe problem of video framerate and audio/video synchronization in audio-visual speech recogni-tion is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
-
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
PublikacjaThe problem of video framerate and audio/video synchronization in audio-visual speech recognition is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
-
Mowa nienawiści (hate speech) a odpowiedzialność dostawców usług internetowych w orzecznictwie sądów europejskich
PublikacjaThe article analyses the phenomenon of hate speech in the Internet contrasted with the problem of responsability of Internet Service Providers for cases of such abuses of freedom of expression. The text provides an analysis of jurisprudence of two European Courts. On the one hand it presents the position of the European Court of Human Rights on the problem of hate speech: its definition and the liability for it as an exception...
-
Intra-subject class-incremental deep learning approach for EEG-based imagined speech recognition
PublikacjaBrain–computer interfaces (BCIs) aim to decode brain signals and transform them into commands for device operation. The present study aimed to decode the brain activity during imagined speech. The BCI must identify imagined words within a given vocabulary and thus perform the requested action. A possible scenario when using this approach is the gradual addition of new words to the vocabulary using incremental learning methods....
-
A Novel Method for Intelligibility Assessment of Nonlinearly Processed Speech in Spaces Characterized by Long Reverberation Times
PublikacjaObjective assessment of speech intelligibility is a complex task that requires taking into account a number of factors such as different perception of each speech sub-bands by the human hearing sense or different physical properties of each frequency band of a speech signal. Currently, the state-of-the-art method used for assessing the quality of speech transmission is the speech transmission index (STI). It is a standardized way...
-
The development of speech in early childhood in children from twin pregnancies with twin-twin transfusion syndrome (TTTS)
Publikacja -
Minimum mean square error estimation of speech short-term predictor parameters under noisy conditions
Publikacja -
Estimation of time-frequency complex phase-based speech attributes using narrow band filter banks
PublikacjaIn this paper, we present nonlinear estimators of nonstationary and multicomponent signal attributes (parameters, properties) which are instantaneous frequency, spectral (or group) delay, and chirp-rate (also known as instantaneous frequency slope). We estimate all of these distributions in the time-frequency domain using both finite and infinite impulse response (FIR and IIR) narrow band filers for speech analysis. Then, we present...
-
Cross-Lingual Knowledge Distillation via Flow-Based Voice Conversion for Robust Polyglot Text-to-Speech
PublikacjaIn this work, we introduce a framework for cross-lingual speech synthesis, which involves an upstream Voice Conversion (VC) model and a downstream Text-To-Speech (TTS) model. The proposed framework consists of 4 stages. In the first two stages, we use a VC model to convert utterances in the target locale to the voice of the target speaker. In the third stage, the converted data is combined with the linguistic features and durations...
-
The spindle speed control for high speed milling vibration surveillance
PublikacjaW pracy przedstawiono zmodyfikowaną metodę nadzorowania drgań za pomocą sterowania optymalno-liniowego prędkością obrotową wrzeciona. Zamieszczono opis dynamiki skrawania, niestacjonarnego modelu obliczeniowego oraz metody nadzorowania drgań. Wykorzystano model modalny wirującego narzędzia, którego parametry wyznaczono zarówno metodami symulacji komputerowych, jak i - za pomocą eksperymentalnej analizy modalnej. Zamieszczono przykład...
-
Simple Physics-Based Analytical Formulas for the Potentials of Mean Force for the Interaction of Amino Acid Side Chains in Water. 3. Calculation and Parameterization of the Potentials of Mean Force of Pairs of Identical Hydrophobic Side Chains
Publikacja -
Cyfrowa analiza mowy etnicznej – ekstrakcja kodu informacji [A digital analysis of ethnic speech – deciphering the information code]
Publikacja -
Quality Evaluation of Speech Transmission via Two-way BPL-PLC Voice Communication System in an Underground Mine
PublikacjaIn order to design a stable and reliable voice communication system, it is essential to know how many resources are necessary for conveying quality content. These parameters may include objective quality of service (QoS) metrics, such as: available bandwidth, bit error rate (BER), delay, latency as well as subjective quality of experience (QoE) related to user expectations. QoE is expressed as clarity of speech and the ability...
-
Цифровой анализ сигналов речи как инструмент сравнительного языкознания [A digital analysis of speech signals as an instrument in comparative linguistics]
Publikacja -
System przetwarzania i wizualizacji sygnału mowy dla potrzeb lingwistycznych = System of speech signal processing and visualisation of the results
PublikacjaW artykule przedstawiono sposób przetwarzania i wizualizacji sygnału mowy w formie prostego w obsłudze i relatywnie niedrogiego urządzenia do nagrywania sygnału akustycznego oraz przetwarzania cyfrowego wyselekcjonowanych fragmentów i wizualizacji uzyskanych rezultatów przekształceń. Zastosowano do tego celu komputer z kartą dźwiękową. Przetwarzanie cyfrowe oraz wizualizacja dokonywana była w oparciu o program MATLAB bezpośrednio...
-
The sensitiveness of the speed of pile displacement to speed variations of hammer in beating down process
PublikacjaIn this paper there is presented dynamical system described speed of pile displacement during beating down process. Its response is determined by using Heaviside operator. There is introduced the convergence with regulator in partially ordered space. There is given an answer to the question, whetdisplacement is sensitive to hammer's speed variations.
-
Speed and load torque observer application in high-speed train electric drive
PublikacjaW artykule przedstawiono zastosowanie diagnostyczne obserwatorów prędkości obrotowej i momentu obciążenia w układzie napędowym z silnikiem asynchronicznym. System diagnostyczny jest dedykowany do napędu pociągu szybkiego. Celem diagnostycznym jest monitorowanie stanu czujnika prędkości obrotowej wału silnika oraz układu przeniesienia momentu trakcyjnego. Analiza sygnałów obliczanych w obserwatorach stanu pozwala na wykrycie uszkodzeń...
-
Vibration surveillance of high speed ball end milling by the spindle speed control
PublikacjaPrzedstawiono zmodyfikowaną metodę nadzorowania drgań smukłych wirujących narzedzi skrawających za pomocą sterowania optymalno-liniowego prędkością obrotową wrzeciona. Podano opis dynamiki skrawania, niestacjonarnego modelu obliczeniowego oraz zmodyfikowanej metody nadzorowania drgań. Modyfikacja polega na tym, że do symulacji procesu nadzorowania wykorzystano model modalny freza, zaś parametry tego modelu wyznaczono albo numerycznie,...
-
System przetwarzania i wizualizacji sygnału mowy dla potrzeb lingwistycznych [A system of speech signal processing and visualisation for linguistic purposes]
Publikacja -
Improvement of speech intelligibility in the presence of noise interference using the Lombard effect and an automatic noise interference profiling based on deep learning
PublikacjaThe Lombard effect is a phenomenon that results in speech intelligibility improvement when applied to noise. There are many distinctive features of Lombard speech that were recalled in this dissertation. This work proposes the creation of a system capable of improving speech quality and intelligibility in real-time measured by objective metrics and subjective tests. This system consists of three main components: speech type detection,...
-
Speed observer of induction machine based on backstepping and sliding mode for low‐speed operation
PublikacjaThis paper presents a speed observer design based on backstepping and slidingmode approaches. The inputs to the observer are the stator current and thevoltage vector components. This observer structure is extended to the integra-tors. The observer stabilizing functions contain the appropriate sliding surfaceswhich result from the Lyapunov function. The rotor angular speed is obtainedfrom the non‐adaptive formula with a sliding...
-
Application of speed and load torque observers in high-speed train drive for diagnostic purposes
PublikacjaW artykule przedstawiono zastosowanie diagnostyczne obserwatorów prędkości obrotowej i momentu obciążenia w układzie napędowym z silnikiem asynchronicznym. System diagnostyczny jest dedykowany do napędu pociągu szybkiego. Celem diagnostycznym jest monitorowanie stanu czujnika prędkości obrotowej wału silnika oraz układu przeniesienia momentu trakcyjnego. Analiza sygnałów obliczanych w obserwatorach stanu pozwala na wykrycie uszkodzeń...
-
Simulation and experiments of high speed milling vibration surveillence with a use of changing spindle speed
PublikacjaPraca poświęcona jest nowemu podejściu do nadzorowania drgań wirujących narzędzi w nowoczesnych frezarkach. Przeprowadzono analizy dynamiczne skrawania smukłym frezem kulistym. Opisano dynamikę sterowanego układu nistacjonarnego. Jako rezultat sterowania optymalno-liniowego otrzymano program chwilowych zmian prędkości obrotowej pozwalający uzyskać redukację drgań. Metodę zweryfikowano eksperymentalnie na frezarce Alcera Gambin...
-
Acoustic journal bearing – Performance under various load and speed conditions speed conditions
PublikacjaThe paper presents results of experimental testing aiming at finding out what effect system of piezo-electric actuators (PZTs)attached to an aerodynamic journal bearing has on the magnitude of shaft's motion within the bearing operating at specified speed and load. The results clearly demonstrate effectiveness of PZTs in mitigating the shaft's motion thus contributing to the increased stability of the bearing. This stabilizing...
-
Towards simulations and experiments of high speed milling vibration surveillance by the spindle speed control
PublikacjaPraca poświęcona jest nowemu podejściu do nadzorowania drgań wirujących narzędzi we współczesnych frezarkach. Przeprowadzono analizę skrawania smukłym frezem kulistym. Jako rezultat sterowania optymlano-liniowego otrzymano program chwilowych zmian prędkości obrotowej pozwalający uzyskać redukację drgań. Metodę zweryfikowano eksperymentalnie.
-
Determination of optimal rotational speeds of circular saws
PublikacjaW pracy przedstawiono możliwości diagnostyczne stanowiska HewSaw do badania zachowania się pił w funkcji prędkości obrotowej piły. Wykazano, ze wyznaczone wartości prędkości optymalnych badanych pił leżą poniżej wartości zalecanych przez producenta. Praca z prędkościami wyższymi od optymalnych może prowadzić do zwiększenia strat materiałowych, a także stwarzać zagrożenia dla obsługi.
-
Neurocontrolled Car Speed System
PublikacjaThe features of the synthesis of neural controllers for the car speed control system are considered in this article. The task of synthesis is to determine the weight coefficients of neural networks that provide the implementation of proportional and proportional-integralderivative control laws. The synthesis of controllers is based on an approach that uses a reversed model of the standard. A model of the car speed control system with...
-
Chatter Vibration Surveillance by the Optimal-linear Control of Spindle Speed and Randomly Varying Spindle Speed
PublikacjaW artykule opisano dynamikę procesu skrawania smukłym frezem kulistym. Jako metodę nadzorowania drgań chatter zastosowano zmienną prędkość obrotową wrzeciona. Jest ona dobierna z wykorzystaniem energetycznego wskaźnika jakości co prowadzi do optymalno-liniowego sterowania prędkością obrotową. Dodatkowo zastosowano nadzorowanie drgań chatter poprzez losowo zmienną prędkość obrotową.
-
Automatic Clustering of EEG-Based Data Associated with Brain Activity
PublikacjaThe aim of this paper is to present a system for automatic assigning electroencephalographic (EEG) signals to appropriate classes associated with brain activity. The EEG signals are acquired from a headset consisting of 14 electrodes placed on skull. Data gathered are first processed by the Independent Component Analysis algorithm to obtain estimates of signals generated by primary sources reflecting the activity of the brain....
-
Critical Review on Robust Speed Control Techniques for Permanent Magnet Synchronous Motor (PMSM) Speed Regulation
PublikacjaThe permanent magnet synchronous motor (PMSM) is a highly efficient energy saving machine. Due to its simple structural characteristics, good heat radiation capability, and high efficiency, PMSMs are gradually replacing AC induction motors in many industrial applications. The PMSM has a nonlinear system and lies on parameters that differ over time with complex high-class dynamics. To achieve the excessive performance operation...
-
Chatter surveillance with the creation of a map of optimal spindle speeds
PublikacjaW pracy przedstawiono metodę nadzorowania drgań samowzbudnych typu chatter. Wyznaczono wartości optymalnych prędkości obrotowych wrzeciona dla poszczególnych punktów na powierzchni przedmiotu obrabianego, wykorzystując rożne techniki analizy modalnej. Badania eksperymentalne wykazały, że otrzymana w ten sposób mapa optymalnych prędkości obrotowych wrzeciona jest skutecznym narzędziem do eliminacji drgań chatter w procesie obróbki...
-
Speace frienly for the blind = Przestrzeń przyjazna dla niewidomych
PublikacjaThe article presents issues connected with accessibility of public space for people with eyesight disabilities. The use of the extravisual spatial stimuli in shaping the urban environment has been analysed. Spaces in which musltisensory spatial reception is feasible become user-friendly, as they come to meet the changing needs of their users. The article introduces a system of textures aiding spatial orientation, navigation and...