Search results for: AUDIOVISUAL SPEECH RECOGNITION
-
Conditions for increasing the recognition of degradation in thermal-flow diagnostics, taking into account environmental legal aspects
Publication: The ever-increasing demand for electricity and the need for conventional sources to cooperate with renewable ones make it necessary to increase the efficiency and safety of generation sources. Therefore, it is necessary to find a way to operate existing facilities more efficiently with full detection of emerging faults. These are the requirements of Polish, European and international law, which demands that energy facilities...
-
Tailoring Diffusional Fields in Zwitterion/Dopamine Copolymer Electropolymerized at Carbon Nanowalls for Sensitive Recognition of Neurotransmitters
Publication: The importance of neurotransmitter sensing in the diagnosis and treatment of many psychological illnesses and neurodegenerative diseases is non-negotiable. For electrochemical sensors to become widespread and accurate, a long journey must be undertaken for each device, from understanding the materials at the molecular level to real applications in biological fluids. We report a modification of diamondized boron-doped carbon nanowalls...
-
On the new possibility of applying oscillating liquid membrane systems for molecular recognition substances responsible for taste.
Publication: It was previously suggested that oscillating liquid membrane systems could be used to develop a taste sensor. The influence of taste substances belonging to four taste classes on the oscillation characteristics of a liquid membrane oscillator with the cationic surfactant benzyldimethyltetradecylammonium chloride was investigated. It was shown that, regardless of the nature of the organic solvent in the liquid membrane, the oscillation characteristics...
-
Цифровой анализ сигналов речи как инструмент сравнительного языкознания [A digital analysis of speech signals as an instrument in comparative linguistics]
Publication
-
System przetwarzania i wizualizacji sygnału mowy dla potrzeb lingwistycznych = System of speech signal processing and visualisation of the results
Publication: The article presents a method of processing and visualising the speech signal in the form of an easy-to-use and relatively inexpensive device for recording the acoustic signal, digitally processing selected fragments, and visualising the results of the transformations. A computer with a sound card was used for this purpose. Digital processing and visualisation were performed using MATLAB directly...
-
Mechanism of recognition of parallel G-quadruplexes by DEAH/RHAU helicase DHX36 explored by molecular dynamics simulations
Publication
-
Electrochemical Recognition of Aromatic Species with Ferrocenylated 1,3,5‐Triazine‐ or 1,3,5‐Triphenylbenzene‐Containing Highly Organized Molecules
Publication
-
Stable nanoconjugates of transferrin with alloyed quaternary nanocrystals Ag–In–Zn–S as a biological entity for tumor recognition
Publication: One way to limit the negative effects of anti-tumor drugs on healthy cells is targeted therapy employing functionalized drug carriers. Here we present a biocompatible and stable nanoconjugate of transferrin anchored to Ag-In-Zn-S quantum dots modified with 11-mercaptoundecanoic acid (Tf-QD) as a drug carrier versus typical anticancer drug, doxorubicin. Detailed investigations of Tf-QD nanoconjugates without and with doxorubicin...
-
Mechanism of recognition of parallel G-quadruplexes by DEAH/RHAU helicase DHX36 explored by molecular dynamics simulations
Publication: Because of high stability and slow unfolding rates of G-quadruplexes (G4), cells have evolved specialized helicases that disrupt these non-canonical DNA and RNA structures in an ATP-dependent manner. One example is DHX36, a DEAH-box helicase, which participates in gene expression and replication by recognizing and unwinding parallel G4s. Here, we studied the molecular basis for the high affinity and specificity of DHX36 for parallel-type...
-
Developing a Low SNR Resistant, Text Independent Speaker Recognition System for Intercom Solutions - A Case Study
Publication: This article presents a case study on the development of a biometric voice verification system for an intercom solution, utilizing the DeepSpeaker neural network architecture. Despite the variety of solutions available in the literature, there is a noted lack of evaluations for "text-independent" systems under real conditions and with varying distances between the speaker and the microphone. This article aims to bridge this gap....
-
System przetwarzania i wizualizacji sygnału mowy dla potrzeb lingwistycznych [A system of speech signal processing and visualisation for linguistic purposes]
Publication
-
Site-selective cation–π interaction as a way of selective recognition of the caesium cation using sumanene-functionalized ferrocenes
Publication
-
A new phosphonium calix[4]arene for selective anion recognition: synthesis and studies in solution and in ion selective electrodes
Publication: The synthesis and characterisation of tetra(triphenylphosphonium) p-tert-butylcalix[4]arene 2 are presented. Interactions with anions were studied using (1)H and (31)P NMR and UV absorption spectrophotometry. The studies revealed interactions with the anions ClO4-, I-, and SCN-. The selectivity of ionophore 2 was also studied in PVC/o-NPOE membrane ion-selective electrodes (ISEs). The electrode containing compound 2 generates a potentiometric response...
-
Oscillating water-oil-water liquid membrane systems for molecular recognition of substances belonging to different taste classes
Publication: Oscillations of the electrochemical potential difference between the aqueous phases were studied. One aqueous phase contains a cationic or anionic surfactant, while the other aqueous phase contains the substance responsible for taste. The two aqueous phases are separated by an oil phase. The oscillations were analysed by constructing phase portraits using the time-delay method. The shape of the phase portraits differs for oscillators with a cationic...
-
Ion recognition properties of new pyridine-2,6-dicarboxamide bearing propeller-like pendant residues: multi-spectroscopic approach
Publication: The synthesis and ion binding properties of a new amide derived from propeller-like tris(2-pyridyl)amine and 2,6-pyridinedicarboxylic acid chloride are described. The amide binds divalent metal cations: copper(II), nickel(II), zinc(II), and lead(II) in acetonitrile. In an acetonitrile:water mixture (9:1 v/v) the amide interacts only with copper(II) and nickel(II) cations, forming complexes of 1:1 stoichiometry. It was found that the introduction...
-
Pattern Recognition Methods in Evaluation of the Structure of the Laboratory Data Biominerals, Antioxidant Enzymes, Selected Biochemical Parameters, and Pulmonary Function of Welders
Publication
-
Quantum and carbon dots conjugated molecularly imprinted polymers as advanced nanomaterials for selective recognition of analytes in environmental, food and biomedical applications
Publication: Samples with complex matrices, analyzed when investigating the pathogenesis of various diseases and in food or environmental monitoring, require advanced analytical and instrumental devices. Among the materials used for these purposes, quantum dots (QDs) or carbon dots (CDs) coated with molecularly imprinted polymer (MIP) shells have gained widespread attention. The unique optical and physicochemical properties of QDs/CDs together with high...
-
Chromatographic and Spectroscopic Identification and Recognition of Natural Dyes, Uncommon Dyestuff Components, and Mordants: Case Study of a 16th Century Carpet with Chintamani Motifs
Publication: A multi-tool analytical practice was used for the characterisation of a 16th century carpet manufactured in Cairo. A mild extraction method with hydrofluoric acid has been evaluated in order to isolate intact flavonoids and their glycosides, anthraquinones, tannins, and indigoids from fibre samples. High-performance liquid chromatography coupled to spectroscopic and mass spectrometric detectors was used for the identification of...
-
Synthesis of thiol derivatives of azobenzocrown ethers. The preliminary studies on recognition of alkali metal ions by gold nanoparticles functionalized with azobenzocrown and lipoic acid
Publication: The article presents the synthesis of novel 13- and 16-membered azobenzocrown derivatives with peripheral thiol moieties and preliminary studies assessing their possible application in plasmonic sensors based on gold nanoparticles. The effect of the length of the chain connecting the macrocycle with the thiol group and the effect of the presence of the additional functional compound,...
-
Novel Highly Thermostable Endolysin from Thermus scotoductus MAT2119 Bacteriophage Ph2119 with Amino Acid Sequence Similarity to Eukaryotic Peptidoglycan Recognition Proteins
Publication
-
Micro-cracking pattern recognition of hybrid CNTs/GNPs cement pastes under three-point bending loading using acoustic emission technique
Publication: The generation of microcracks has an important influence on the behaviour of concrete structures. In this study, the acoustic emission (AE) technique was used to investigate the fracture phenomena and micro-cracking behavior of hybrid carbon nanotube (CNT, the 1-D allotrope of carbon atoms) and graphene nanoplatelet (GNP, a 2-D monolayer of sp2-hybridized carbon atoms) cement composites under three-point bending loading. In...
-
Anion–π recognition between [M(CN)6]3− complexes and HAT(CN)6: structural matching and electronic charge density modification
Publication
-
Discriminating macromolecular interactions based on an impedimetric fingerprint supported by multivariate data analysis for rapid and label-free Escherichia coli recognition in human urine
Publication: This manuscript presents a novel approach to address the challenges of electrode fouling and highly complex electrode nanoarchitecture, which are primary concerns for biosensors operating in real environments. The proposed approach utilizes multiparametric impedance discriminant analysis (MIDA) to obtain a fingerprint of the macromolecular interactions on flat glassy carbon surfaces, achieved through self-organized, drop-cast,...
-
High quality speech coding using combined parametric and perceptual modules. [Kodowanie sygnału mowy z zachowaniem wysokiej jakości przy wykorzystaniu modułu parametrycznego i perceptualnego]
Publication: The paper presents a new method of hybrid speech signal coding. Parametric and perceptual coding techniques were used to ensure high-quality speech coding. Results are presented for two codec architectures. One of them is based on an algorithm that separates voiced components, unvoiced components, and transients. The voiced components are coded with the perceptual method, the unvoiced...
-
Using pattern recognition entropy to select mass chromatograms to prepare total ion current chromatograms from raw liquid chromatography–mass spectrometry data
Publication
-
Improving signal quality in speech codec using hybrid perceptual-parametric algorithm. [Poprawa jakości sygnału w kodekach mowy przy użyciu hybrydowego, parametryczno-perceptualnego algorytmu kodowania]
Publication: A hybrid parametric-perceptual codec architecture is presented. The basic structure of the parametric CELP codec was extended with perceptual coding. The aim of hybridising the codec is to achieve a significant improvement in the subjective quality of the decoded signal. Two hybrid structures are proposed. The first consists in perceptual coding of the voiced elements of the CELP codec's residual signal. The second method divides the signal...
-
New approach for determining the QoS of MP3-coded voice signals in IP networks
Publication: Present-day IP transport platforms being what they are, it will never be possible to rule out conflicts between the available services. The logical consequence of this assertion is the inevitable conclusion that the quality of service (QoS) must always be quantifiable no matter what. This paper focuses on one method to determine QoS. It defines an innovative, simple model that can evaluate the QoS of MP3-coded voice data transported...
-
Badanie rozkładów parametrów sygnału mowy w zastosowaniach do prognozowania prawdopodobieństwa popełnienia błędów w systemach identyfikacji mówców = Examining distribution of speech signal parameters for the prognosis of error probability in speaker verification systems
Publication: The subject of this work is a text-dependent speaker identification system. Many different utterances from several dozen speakers were analysed. The parameterisation method used is based on the results of cepstral analysis of the speech signal. New parameters associated with elementary events in the speaker verification process were defined. On this basis, an estimate was made of the probability density function...
-
Evaluation of Six Degrees of Freedom 3D Audio Orchestra Recording and Playback using multi-point Ambisonic interpolation
Publication: This paper describes a strategy for recording sound and enabling six-degrees-of-freedom playback, making use of multiple simultaneous and synchronized Higher Order Ambisonics (HOA) recordings. Such a strategy enables users to navigate in a simulated 3D space and listen to the six-degrees-of-freedom recordings from different perspectives. For the evaluation of the proposed approach, an Unreal Engine-based navigable 3D audiovisual...
-
AUTOMATYCZNA KLASYFIKACJA MOWY PATOLOGICZNEJ [Automatic classification of pathological speech]
Publication: The application presented in this chapter is used for the automatic detection of pathological speech based on a database of recordings. First, the assumptions underlying the research are presented, together with the choice of the pathological speech database. The algorithms applied and the speech signal features that make it possible to distinguish undisturbed speech from pathological speech are also presented. The trained neural networks were then...
-
Prototype selection algorithms for distributed learning
Publication
-
Metoda i algorytmy modyfikacji sygnału do celu wspomagania rozumienia mowy przez osoby z pogorszoną rozdzielczością czasową słuchu [A method and algorithms for signal modification to support speech comprehension by people with impaired temporal resolution of hearing]
Publication: The subject of the research carried out in this dissertation is real-time time scale modification (TSM) methods for speech and the assessment of their impact on the comprehension of utterances by people with impaired temporal resolution of hearing. Impaired hearing resolution is one of the symptoms associated with central auditory disorders (Central Auditory Processing Disorder, CAPD). In contrast...
-
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention
Publication: This paper describes two novel complementary techniques that improve the detection of lexical stress errors in non-native (L2) English speech: attention-based feature extraction and data augmentation based on Neural Text-To-Speech (TTS). In a classical approach, audio features are usually extracted from fixed regions of speech such as the syllable nucleus. We propose an attention-based deep learning model that automatically de...
-
Zastosowanie spowalniania wypowiedzi w celu poprawy rozumienia mowy przez dzieci w szkole [Using speech slowing to improve speech comprehension by children at school]
Publication: This paper presents time-scale modification algorithms that could be used for hearing impairment therapy supported by real-time speech stretching. The OLA-based algorithms and the Phase Vocoder are described. In the experimental part, the usability of those algorithms for real-time speech stretching is discussed.
-
Instantaneous complex frequency for pipeline pitch estimation
Publication: In the paper a pipeline algorithm for estimating the pitch of speech signal is proposed. The algorithm uses instantaneous complex frequencies estimated for four waveforms obtained by filtering the original speech signal through four bandpass complex Hilbert filters. The imaginary parts of ICFs from each channel give four candidates for pitch estimates. The decision regarding the final estimate is made based on the real parts of...
-
Engineering Candida albicans glucosamine-6-phosphate synthase for efficient enzyme purification
Publication: Rationally designed muteins of Candida albicans glucosamine-6-phosphate synthase, an enzyme known as a promising target for antifungal chemotherapy, were constructed, overexpressed in Escherichia coli and purified to near homogeneity. To facilitate and to optimize the purification of the enzyme, three recombinant versions containing internal oligoHis fragments were constructed: (i) by substituting residues 343 - 348...
-
Digital fingerprinting for color images based on the quaternion encryption scheme
Publication: In this paper we present a new quaternion-based encryption technique for color images. In the proposed encryption method, images are written as quaternions and are rotated in a three-dimensional space around another quaternion, which is an encryption key. The encryption process uses the cipher block chaining (CBC) mode. Further, this paper shows that our encryption algorithm enables digital fingerprinting as an additional feature....
-
Bridging challenges of clinical decision support systems with a semantic approach. A case study on breast cancer
Publication: The integration of Clinical Decision Support Systems (CDSS) in today's clinical environments has not been fully achieved yet. Although numerous approaches and technologies have been proposed since 1960, there are still open gaps that need to be bridged. In this work we present advances from the established state of the art, overcoming some of the most notorious reported difficulties in: (i) automating CDSS, (ii) clinical workflow...
-
Simultaneous determination of thermodynamic and kinetic parameters of aminopolycarbonate complexes of cobalt(II) and nickel(II) based on isothermal titration calorimetry data
Publication
-
Zinc(II) complexation by some biologically relevant pH buffers
Publication
-
XVIII Międzynarodowe Sympozjum Inżynierii i Reżyserii Dźwięku
Publication: The subjective assessment of speech signals takes into account previous experiences and habits of an individual. Since the perception process deteriorates with age, differences should be noticeable among people from dissimilar age groups. In this work, we investigated the difference in speech quality assessment between high school students and university students. The study involved 60 participants, with 30 people in both the adolescents...
-
Creating new voices using normalizing flows
Publication: Creating realistic and natural-sounding synthetic speech remains a big challenge for voice identities unseen during training. As there is growing interest in synthesizing voices of new speakers, here we investigate the ability of normalizing flows in text-to-speech (TTS) and voice conversion (VC) modes to extrapolate from speakers observed during training to create unseen speaker identities. Firstly, we create an approach for TTS...
-
Human voice modification using instantaneous complex frequency
Publication: The paper presents the possibilities of changing human voice by modifying instantaneous complex frequency (ICF) of the speech signal. The proposed method provides a flexible way of altering voice without the necessity of finding fundamental frequency and formants' positions or detecting voiced and unvoiced fragments of speech. The algorithm is simple and fast. Apart from ICF it uses signal factorization into two factors: one fully...
-
Strategie treningu neuronowego estymatora częstotliwości tonu krtaniowego z użyciem generatora syntetycznych samogłosek [Training strategies for a neural pitch estimator using a synthetic vowel generator]
Publication: Many telecommunication applications involve processing or analysing the speech signal, a task in which a pitch estimator is used, often among the basic algorithms. The estimator considered in this work is based on a neural classifier that makes decisions on the basis of the instantaneous frequency and power determined in subbands of the analysed speech signal. In this work we consider...
-
Auditory-visual attention stimulator
Publication: A new approach to the formation of lateralization irregularities was proposed. The emphasis is put on the relationship between visual and auditory attention stimulation. In this approach hearing is stimulated using time scale modified speech and sight is stimulated by rendering the text of the currently heard speech. Moreover, the displayed text is modified using several techniques, such as zooming and highlighting. In the experimental part of...
-
INVESTIGATION OF THE LOMBARD EFFECT BASED ON A MACHINE LEARNING APPROACH
Publication: The Lombard effect is an involuntary increase in the speaker’s pitch, intensity, and duration in the presence of noise. It makes it possible to communicate in noisy environments more effectively. This study aims to investigate an efficient method for detecting the Lombard effect in uttered speech. The influence of interfering noise, room type, and the gender of the person on the detection process is examined. First, acoustic parameters...
-
Audio-visual aspect of the Lombard effect and comparison with recordings depicting emotional states.
Publication: In this paper an analysis of audio-visual recordings of the Lombard effect is shown. First, the audio signal is analyzed, indicating the presence of this phenomenon in the recorded sessions. The principal aim, however, was to discuss problems related to extracting differences caused by the Lombard effect that are present in the video, i.e. visible as tension and work of facial muscles aligned to an increase in the intensity of the articulated...
-
Pracujący w czasie rzeczywistym system detekcji gazów wykorzystujący przenośny komputer Raspberry PI oraz matrycę półprzewodnikowych czujników gazu [A real-time gas detection system using a Raspberry Pi portable computer and an array of semiconductor gas sensors]
Publication: Gas-analyzing systems based on an array of partially selective gas sensors and pattern-recognition techniques are a potentially fast and low-cost alternative to other devices, such as gas analysers. They make it possible to recognise the type and the concentration of measured volatile compounds in their working environment. In this work we present the implementation of a gas recognition system, in which the signals from an...
-
Variable Ratio Sample Rate Conversion Based on Fractional Delay Filter
Publication: In this paper a sample rate conversion algorithm which allows for continuously changing resampling ratio has been presented. The proposed implementation is based on a variable fractional delay filter which is implemented by means of a Farrow structure. Coefficients of this structure are computed on the basis of fractional delay filters which are designed using the offset window method. The proposed approach allows us to freely...
-
Interactions with recognized patients using smart glasses
Publication: Recently, different smart glasses solutions have been proposed on the market. The rapid development of this wearable technology has led to several research projects related to applications of smart glasses in healthcare. In this paper we propose a general architecture of the system enabling data integration for the recognized person. In the proposed system smart glasses integrate data obtained for the recognized patient from health...