Search results for: VISUAL SPEECH RECOGNITION
-
Site-selective cation–π interaction as a way of selective recognition of the caesium cation using sumanene-functionalized ferrocenes
Publication -
Oscillating water-oil-water liquid membrane systems for molecular recognition of substances belonging to different taste classes
Publication: Oscillations of the electrochemical potential difference between the aqueous phases were studied. One aqueous phase contains a cationic or anionic surfactant, while the other aqueous phase contains the substance responsible for taste. The two aqueous phases are separated by an oil phase. The oscillations were analyzed by constructing phase portraits using the time-delay method. The shape of the phase portraits differs for oscillators with a cationic...
-
Endoscopic Videos Deinterlacing and On-Screen Text and Light Flashes Removal and Its Influence on Image Analysis Algorithms' Efficiency
Publication: In this article, methods for deinterlacing endoscopic video images and for removing on-screen text and light flashes from them are discussed. The research is intended to improve disease recognition algorithms' performance. In the article, four configurations of deinterlacing methods and another four configurations of text and flashes removal methods are described and examined. The efficiency of endoscopic video analysis algorithms is measured...
-
Ion recognition properties of new pyridine-2,6-dicarboxamide bearing propeller-like pendant residues: multi-spectroscopic approach
Publication: The synthesis and ion binding properties of a new amide derived from propeller-like tris(2-pyridyl)amine and 2,6-pyridinedicarboxylic acid chloride are described. The amide binds divalent metal cations: copper(II), nickel(II), zinc(II), and lead(II) in acetonitrile. In an acetonitrile:water mixture (9:1 v/v) the amide interacts only with copper(II) and nickel(II) cations, forming complexes of 1:1 stoichiometry. It was found that the introduction...
-
Quantum and carbon dots conjugated molecularly imprinted polymers as advanced nanomaterials for selective recognition of analytes in environmental, food and biomedical applications
Publication: Samples with complex matrices, analyzed when explaining the pathogenesis of various diseases and in food or environmental monitoring, require advanced analytical and instrumental devices. Among the materials used for these purposes, quantum dots (QDs) or carbon dots (CDs) coated with molecularly imprinted polymer (MIP) shells have gained widespread attention. Unique optical and physicochemical properties of QDs/CDs together with high...
-
Pattern Recognition Methods in Evaluation of the Structure of the Laboratory Data Biominerals, Antioxidant Enzymes, Selected Biochemical Parameters, and Pulmonary Function of Welders
Publication -
Chromatographic and Spectroscopic Identification and Recognition of Natural Dyes, Uncommon Dyestuff Components, and Mordants: Case Study of a 16th Century Carpet with Chintamani Motifs
Publication: A multi-tool analytical practice was used for the characterisation of a 16th century carpet manufactured in Cairo. A mild extraction method with hydrofluoric acid has been evaluated in order to isolate intact flavonoids and their glycosides, anthraquinones, tannins, and indigoids from fibre samples. High-performance liquid chromatography coupled to spectroscopic and mass spectrometric detectors was used for the identification of...
-
Synthesis of thiol derivatives of azobenzocrown ethers. The preliminary studies on recognition of alkali metal ions by gold nanoparticles functionalized with azobenzocrown and lipoic acid
Publication: The article presents the synthesis of novel 13- and 16-membered azobenzocrown derivatives with peripheral thiol moieties and preliminary studies assessing their possible application in plasmonic sensors based on gold nanoparticles. The effect of the length of the chain connecting the macrocycle with the thiol group and the effect of the presence of the additional functional compound,...
-
Novel Highly Thermostable Endolysin from Thermus scotoductus MAT2119 Bacteriophage Ph2119 with Amino Acid Sequence Similarity to Eukaryotic Peptidoglycan Recognition Proteins
Publication -
Anion–π recognition between [M(CN)6]3− complexes and HAT(CN)6: structural matching and electronic charge density modification
Publication -
Micro-cracking pattern recognition of hybrid CNTs/GNPs cement pastes under three-point bending loading using acoustic emission technique
Publication: The generation of microcracks has an important influence on the behavior of concrete structures. In this study, the acoustic emission (AE) technique was used to investigate the fracture phenomena and micro-cracking behavior of hybrid carbon nanotube (CNTs, the 1D allotrope of carbon atoms) and graphene nanoplatelet (GNPs, a 2D monolayer of sp2-hybridized carbon atoms) cement composites under three-point bending loading. In...
-
Discriminating macromolecular interactions based on an impedimetric fingerprint supported by multivariate data analysis for rapid and label-free Escherichia coli recognition in human urine
Publication: This manuscript presents a novel approach to address the challenges of electrode fouling and highly complex electrode nanoarchitecture, which are primary concerns for biosensors operating in real environments. The proposed approach utilizes multiparametric impedance discriminant analysis (MIDA) to obtain a fingerprint of the macromolecular interactions on flat glassy carbon surfaces, achieved through self-organized, drop-cast,...
-
High quality speech coding using combined parametric and perceptual modules. [Kodowanie sygnału mowy z zachowaniem wysokiej jakości przy wykorzystaniu modułu parametrycznego i perceptualnego]
Publication: The communication presents a new method of hybrid speech signal coding. Parametric and perceptual coding techniques were used to ensure high-quality coding of the speech signal. Results are presented for two codec architectures. One of them is based on an algorithm that separates voiced components, unvoiced components, and transients. Voiced components are coded with the perceptual method, unvoiced...
-
Using pattern recognition entropy to select mass chromatograms to prepare total ion current chromatograms from raw liquid chromatography–mass spectrometry data
Publication -
Improving signal quality in speech codec using hybrid perceptual-parametric algorithm. [Poprawa jakości sygnału w kodekach mowy przy użyciu hybrydowego, parametryczno-perceptualnego algorytmu kodowania]
Publication: A hybrid parametric-perceptual codec architecture is presented. The basic structure of the parametric CELP codec was extended with perceptual coding. The aim of hybridizing the codec is to achieve a significant improvement in the subjective quality of the decoded signal. Two hybrid structures are proposed. The first consists in perceptual coding of the voiced elements of the residual signal of the CELP codec. The second method divides the signal...
-
New approach for determining the QoS of MP3-coded voice signals in IP networks
Publication: Present-day IP transport platforms being what they are, it will never be possible to rule out conflicts between the available services. The logical consequence of this assertion is the inevitable conclusion that the quality of service (QoS) must always be quantifiable no matter what. This paper focuses on one method to determine QoS. It defines an innovative, simple model that can evaluate the QoS of MP3-coded voice data transported...
-
Modelling Of Commercial Websites. A New Perspective On Usability And Customer Relation
Publication: From an economic point of view, a critical aspect of online services is their ability to retain customers. The aim of the presented study was to apply the layered VIPR model (Visual - Interaction - Process - Relation) to commercial online services. The indicators of trust and of establishing lasting relationships were assessments collected from experienced users of commercial online services (n = 207), obtained by means of Web Credibility...
-
Badanie rozkładów parametrów sygnału mowy w zastosowaniach do prognozowania prawdopodobieństwa popełnienia błędów w systemach identyfikacji mówców = Examining distribution of speech signal parameters for the prognosis of error probability in speaker verification systems
Publication: The subject of this work is a text-dependent speaker identification system. Many different utterances by several dozen speakers were analyzed. The parameterization method used is based on the results of cepstral analysis of the speech signal. New parameters associated with elementary events in the speaker verification process were defined. On this basis, an estimation was made of the probability density function...
-
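Dynamic ASP.NET web application of an induction motor as an element of a virtual laboratory of electrical machines. [Dynamiczna aplikacja Internetowa ASP.NET silnika indukcyjnego jako elementu wirtualnego laboratorium maszyn elektrycznych]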
Publication: The paper concerns a dynamic web application that enables circuit simulation of the three-phase induction motor Sg 100 L2 through a web browser interface. The mathematical model of the motor is defined in the so-called natural axes and formulated on the basis of the Lagrange energy method. To implement the machine model in the web application, a Web Forms project type was chosen, which is a component of the environment...
-
AUTOMATIC CLASSIFICATION OF PATHOLOGICAL SPEECH [AUTOMATYCZNA KLASYFIKACJA MOWY PATOLOGICZNEJ]
Publication: The application presented in this chapter is used for the automatic detection of pathological speech based on a database of recordings. First, the assumptions underlying the research are presented, along with the choice of the pathological speech database. The algorithms and speech signal features used to distinguish undisturbed speech from pathological speech are also presented. The trained neural networks were then...
-
Prototype selection algorithms for distributed learning
Publication -
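Method and algorithms of signal modification for supporting speech comprehension by people with deteriorated temporal resolution of hearing. [Metoda i algorytmy modyfikacji sygnału do celu wspomagania rozumienia mowy przez osoby z pogorszoną rozdzielczością czasową słuchu]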
Publication: The subject of the research carried out in this dissertation is real-time methods of speech Time Scale Modification (TSM) and the assessment of their effect on the understanding of utterances by people with deteriorated temporal resolution of hearing. Deteriorated hearing resolution is one of the symptoms associated with central hearing disorders (Central Auditory Processing Disorder, CAPD). In contrast...
-
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention
Publication: This paper describes two novel complementary techniques that improve the detection of lexical stress errors in non-native (L2) English speech: attention-based feature extraction and data augmentation based on Neural Text-To-Speech (TTS). In a classical approach, audio features are usually extracted from fixed regions of speech such as the syllable nucleus. We propose an attention-based deep learning model that automatically de...
-
Application of speech slowing to improve speech comprehension by children at school. [Zastosowanie spowalniania wypowiedzi w celu poprawy rozumienia mowy przez dzieci w szkole]
Publication: This paper presents time-scale modification algorithms that could be used for hearing impairment therapy supported by real-time speech stretching. The OLA-based algorithms and the Phase Vocoder are described. In the experimental part, the usability of those algorithms for real-time speech stretching is discussed.
-
Instantaneous complex frequency for pipeline pitch estimation
Publication: In the paper a pipeline algorithm for estimating the pitch of a speech signal is proposed. The algorithm uses instantaneous complex frequencies (ICFs) estimated for four waveforms obtained by filtering the original speech signal through four bandpass complex Hilbert filters. The imaginary parts of ICFs from each channel give four candidates for pitch estimates. The decision regarding the final estimate is made based on the real parts of...
-
XVIII Międzynarodowe Sympozjum Inżynierii i Reżyserii Dźwięku
Publication: The subjective assessment of speech signals takes into account previous experiences and habits of an individual. Since the perception process deteriorates with age, differences should be noticeable among people from dissimilar age groups. In this work, we investigated the difference in speech quality assessment between high school students and university students. The study involved 60 participants, with 30 people in both the adolescents...
-
Engineering Candida albicans glucosamine-6-phosphate synthase for efficient enzyme purification
Publication: Rationally designed muteins of Candida albicans glucosamine-6-phosphate synthase, an enzyme known as a promising target for antifungal chemotherapy, were constructed, overexpressed in Escherichia coli and purified to near homogeneity. To facilitate and to optimize the purification of the enzyme, three recombinant versions containing internal oligoHis fragments were constructed: (i) by substituting residues 343 - 348...
-
Digital fingerprinting for color images based on the quaternion encryption scheme
Publication: In this paper we present a new quaternion-based encryption technique for color images. In the proposed encryption method, images are written as quaternions and are rotated in a three-dimensional space around another quaternion, which is an encryption key. The encryption process uses the cipher block chaining (CBC) mode. Further, this paper shows that our encryption algorithm enables digital fingerprinting as an additional feature....
-
Bridging challenges of clinical decision support systems with a semantic approach. A case study on breast cancer
Publication: The integration of Clinical Decision Support Systems (CDSS) in today's clinical environments has not been fully achieved yet. Although numerous approaches and technologies have been proposed since 1960, there are still open gaps that need to be bridged. In this work we present advances from the established state of the art, overcoming some of the most notorious reported difficulties in: (i) automating CDSS, (ii) clinical workflow...
-
Simultaneous determination of thermodynamic and kinetic parameters of aminopolycarbonate complexes of cobalt(II) and nickel(II) based on isothermal titration calorimetry data
Publication -
Zinc(II) complexation by some biologically relevant pH buffers
Publication -
Use of the ALEP-PL computer system in planning the development of local energy systems. [Wykorzystanie systemu komputerowego ALEP-PL w planowaniu rozwoju lokalnych systemów energetycznych]
Publication: The authors' own computer system ALEP-PL, which supports the process of planning the development of local energy systems, is presented. The tool was prepared with the advanced planning methodology in mind. The system consists of a web portal, a database, and business logic modules. The web portal was built in ASP.NET technology using the Visual Studio 2010 environment and the MS SQL Server 2008 database server...
-
Creating new voices using normalizing flows
Publication: Creating realistic and natural-sounding synthetic speech remains a big challenge for voice identities unseen during training. As there is growing interest in synthesizing voices of new speakers, here we investigate the ability of normalizing flows in text-to-speech (TTS) and voice conversion (VC) modes to extrapolate from speakers observed during training to create unseen speaker identities. Firstly, we create an approach for TTS...
-
Human voice modification using instantaneous complex frequency
Publication: The paper presents the possibilities of changing the human voice by modifying the instantaneous complex frequency (ICF) of the speech signal. The proposed method provides a flexible way of altering voice without the necessity of finding fundamental frequency and formants' positions or detecting voiced and unvoiced fragments of speech. The algorithm is simple and fast. Apart from ICF it uses signal factorization into two factors: one fully...
-
Training strategies for a neural pitch estimator using a synthetic vowel generator. [Strategie treningu neuronowego estymatora częstotliwości tonu krtaniowego z użyciem generatora syntetycznych samogłosek]
Publication: Many telecommunication applications involve processing or analyzing a speech signal, where a pitch estimator is used, often among the basic algorithms. The estimator considered in this work is based on a neural classifier that makes decisions on the basis of the instantaneous frequency and power determined in subbands of the analyzed speech signal. In this work we consider...
-
INVESTIGATION OF THE LOMBARD EFFECT BASED ON A MACHINE LEARNING APPROACH
Publication: The Lombard effect is an involuntary increase in the speaker’s pitch, intensity, and duration in the presence of noise. It makes it possible to communicate in noisy environments more effectively. This study aims to investigate an efficient method for detecting the Lombard effect in uttered speech. The influence of interfering noise, room type, and the gender of the person on the detection process is examined. First, acoustic parameters...
-
Multimodal Attention Stimulator
Publication: A multimodal attention stimulator was proposed and tested for improving auditory and visual attention, including in pupils with developmental dyslexia. Results of the conducted experiments showed that the designed stimulator can be used to improve comprehension during reading tasks. The changes in visual attention, observed in reading test results, translate into the overall reading performance.
-
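Real-time gas detection system using a Raspberry PI portable computer and an array of semiconductor gas sensors. [Pracujący w czasie rzeczywistym system detekcji gazów wykorzystujący przenośny komputer Raspberry PI oraz matrycę półprzewodnikowych czujników gazu]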
Publication: Gas-analyzing systems based on an array of partially selective gas sensors and pattern-recognition techniques are a potentially fast and low-cost alternative to other devices, such as gas analysers. They make it possible to recognize the type and the concentration of measured volatile compounds in their working environment. In this work we present the implementation of a gas recognition system, in which the signals from an...
-
Joint fingerprinting and decryption method for color images based on quaternion rotation with cipher quaternion chaining
Publication: This paper addresses the problem of unauthorized redistribution of multimedia content by malicious users (pirates). In this method three color channels of the image are considered a 3D space and each component of the image is represented as a point in this 3D space. The distribution side uses a symmetric cipher to encrypt perceptually essential components of the image with the encryption key and then sends the encrypted data via...
-
Variable Ratio Sample Rate Conversion Based on Fractional Delay Filter
Publication: In this paper a sample rate conversion algorithm which allows for a continuously changing resampling ratio is presented. The proposed implementation is based on a variable fractional delay filter which is implemented by means of a Farrow structure. Coefficients of this structure are computed on the basis of fractional delay filters which are designed using the offset window method. The proposed approach allows us to freely...
-
Interactions with recognized patients using smart glasses
Publication: Recently, different smart glasses solutions have been proposed on the market. The rapid development of this wearable technology has led to several research projects related to applications of smart glasses in healthcare. In this paper we propose a general architecture of a system enabling data integration for the recognized person. In the proposed system smart glasses integrate data obtained for the recognized patient from health...
-
Prof. Haitham Abu-Rub - A Visit to Poland's Gdansk University of Technology
Publication: Report on the visit of Prof. Haitham Abu-Rub to Gdansk University of Technology. Speech on the Smart Grid Centre. Visit to the new smart grid laboratory of the GUT, the Laboratory for Innovative Power Technologies and Integration of Renewable Energy Sources (LINTE^2).
-
A Comparison of STI Measured by Direct and Indirect Methods for Interiors Coupled with Sound Reinforcement Systems
Publication: This paper presents a comparison of STI (Speech Transmission Index) coefficient measurement results carried out by direct and indirect methods. First, acoustic parameters important in the context of public address and sound reinforcement systems are recalled. A measurement methodology is presented that employs various test signals to determine impulse responses. The process of evaluating sound system performance, signals enabling...
-
Investigation of educational processes with affective computing methods
Publication: This paper concerns the monitoring of educational processes with the use of new technologies for the recognition of human emotions. This paper summarizes results from three experiments, aimed at the validation of applying emotion recognition to e-learning. An analysis of the experiments’ executions provides an evaluation of the emotion elicitation methods used to monitor learners. The comparison of affect recognition algorithms...
-
Lighting conditions in Home Office and occupant’s perception: an international study
Publication: The global pandemic and physical distancing restrictions are forcing us to rethink how residential buildings are used regarding the visual environment. This paper describes home office lighting conditions within different countries and continents. The aim is to define the current limitations of home offices in providing a resilient visual environment. The work was developed by a team of international experts working together on...
-
Rediscovering Automatic Detection of Stuttering and Its Subclasses through Machine Learning—The Impact of Changing Deep Model Architecture and Amount of Data in the Training Set
Publication: This work deals with automatically detecting stuttering and its subclasses. An effective classification of stuttering along with its subclasses could find wide application in determining the severity of stuttering by speech therapists, preliminary patient diagnosis, and enabling communication with the previously mentioned voice assistants. The first part of this work provides an overview of examples of classical and deep learning...
-
Gesture-based computer control system
Publication: In the paper a system for controlling computer applications by hand gestures is presented. First, selected methods used for gesture recognition are described. Next, the system hardware and a way of controlling a computer by gestures are described. The architecture of the software, along with the hand gesture recognition methods and algorithms used, is presented. Examples of basic and complex gestures recognized by the system are given.
-
Automatic Classification of Polish Sign Language Words
Publication: In the article we present an approach to automatic recognition of hand gestures using the eGlove device. We present the research results of a system for detection and classification of static and dynamic words of Polish language. The results indicate that using eGlove yields good recognition quality, which can be further improved using additional data sources such as RGB cameras.
-
Comparative analysis of various transformation techniques for voiceless consonants modeling
Publication: In this paper, a comparison of various transformation techniques, namely Discrete Fourier Transform (DFT), Discrete Cosine Transform (DCT) and Discrete Walsh Hadamard Transform (DWHT), is performed in the context of their application to voiceless consonant modeling. Speech features based on these transformation techniques are extracted. These features are mean and derivative values of cepstrum coefficients, derived from each transformation....
-
Modeling and Designing Acoustical Conditions of the Interior – Case Study
Publication: The primary aim of this research study was to model acoustic conditions of the Courtyard of the Gdańsk University of Technology Main Building, and then to design a sound reinforcement system for this interior. First, results of measurements of the parameters of the acoustic field are presented. Then, the comparison between measured and predicted values using the ODEON program is shown. Collected data indicate a long reverberation...