Search results for: VOICE RECOGNITION

System for automatic singing voice recognition

Publication

- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2008

The article presents a system for automatic recognition of singing voice quality and type. The database and the implemented parameters are described. An artificial neural network serves as the decision algorithm. The trained decision system achieves an accuracy of about 90% in both recognition categories. In addition, statistical methods were used to show that the results of the system for automatic assessment of technical quality...

Automatic Singing Voice Recognition Employing Neural Networks and Rough Sets

Publication

- Year 2008

The aim of the research is the automatic recognition of singing voices with respect to voice type and the technical quality of singing. The article describes the voice database that was created, which contains voice samples of professional and amateur singers. It then describes parameters defined on the basis of biomechanical phenomena in the vocal organ during singing. Based on the resulting parameter matrices, automatic classifiers were trained and compared...

Automatic singing voice recognition employing neural networks and rough sets

Publication

- Year 2007

The aim of the work described in the paper is the automatic recognition of singing voices. For this purpose, a database of recordings of professional and amateur singing samples was created. The samples were parameterized using parameters proposed by the authors specifically for this purpose. The paper presents how the parameters are derived and their physical interpretation. The parameters are fed into decision systems, classifiers based...

Voice command recognition using hybrid genetic algorithm

Publication

- TASK Quarterly - Year 2010

Abstract: Speech recognition is a process of converting the acoustic signal into a set of words, whereas voice command recognition consists in the correct identification of voice commands, usually single words. Voice command recognition systems are widely used in the military, control systems, electronic devices, such as cellular phones, or by people with disabilities (e.g., for controlling a wheelchair or operating a computer...

Full text available to download

Comparison of Acoustic and Visual Voice Activity Detection for Noisy Speech Recognition

Publication

- Year 2016

The problem of accurately differentiating between the speaker's utterance and the noise parts of a speech signal is considered. The influence of applying voice activity detection to speech signals on the accuracy of an automatic speech recognition (ASR) system is presented. The examined methods of voice activity detection are based on acoustic and visual modalities. The problem of detecting the voice activity in clean and noisy...

Automatic singing quality recognition employing artificial neural networks

Publication

P. Żwan

- Archives of Acoustics - Year 2008

The aim of the article is to demonstrate that the technical quality of singing voices can be assessed automatically. It briefly presents the singing voice database that was created and the implemented parameters. Using artificial neural networks, a decision system was designed that rates the technical quality of a voice on a five-point scale. Statistical methods were used to show that the results generated by this system...

Full text available to download

Examining Influence of Distance to Microphone on Accuracy of Speech Recognition

Publication

- Year 2015

The problem of controlling a machine by the distant-talking speaker without a necessity of handheld or body-worn equipment usage is considered. A laboratory setup is introduced for examination of performance of the developed automatic speech recognition system fed by direct and by distant speech acquired by microphones placed at three different distances from the speaker (0.5 m to 1.5 m). For feature extraction from the voice signal...

Full text to download in external service

Speaker Recognition Using Convolutional Neural Network with Minimal Training Data for Smart Home Solutions

Publication

M. Wang
T. Sirlapu
A. Kwaśniewska
M. Szankin
M. Bartscherer
R. Nicolas

- Year 2018

With the technology advancements in the smart home sector, voice control and automation are key components that can make a real difference in people's lives. The voice recognition technology market continues to evolve rapidly as almost all smart home devices are providing speaker recognition capability today. However, most of them provide cloud-based solutions or use very deep Neural Networks for the speaker recognition task, which are...

Full text to download in external service

A survey of automatic speech recognition deep models performance for Polish medical terms

Publication

- Year 2023

Among the numerous applications of speech-to-text technology is the support of documentation created by medical personnel. There are many available speech recognition systems for doctors. Their effectiveness in languages such as Polish should be verified. In connection with our project in this field, we decided to check how well the popular speech recognition systems work, employing models trained for the general Polish language....

Full text to download in external service

The project IDENT: Multimodal biometric system for bank client identity verification

Publication

- Year 2017

Biometric identity verification methods are implemented inside a real banking environment comprising: dynamic handwritten signature verification, face recognition, bank client voice recognition and hand vein distribution verification. A secure communication system based on an intra-bank client-server architecture was designed for this purpose. Hitherto achieved progress within the project is reported in this paper with a focus...

Full text to download in external service

Introduction to the special issue on machine learning in acoustics

Publication

Z. Michalopoulou
P. Gerstoft
B. Kostek
M. A. Roch

- Journal of the Acoustical Society of America - Year 2021

When we started our Call for Papers for a Special Issue on “Machine Learning in Acoustics” in the Journal of the Acoustical Society of America, our ambition was to invite papers in which machine learning was applied to all acoustics areas. They were listed, but not limited to, as follows: • Music and synthesis analysis • Music sentiment analysis • Music perception • Intelligent music recognition • Musical source separation • Singing...

Full text available to download

Sensors integration in the smart home environment - a proposal to solve the problem with user identification

Publication

- Year 2019

In this preliminary study, we investigate user recognition techniques suitable for smart home devices such as chairs and beds, aiming for low power, high accuracy and quick response time. We propose two well-known techniques, voice speaker recognition and an accelerometer signal from a device mounted on the chair, and a third one, an optical system based on an IR LED transmitter/receiver circuit. The preliminary results proved...

Full text to download in external service

Validating data acquired with experimental multimodal biometric system installed in bank branches

Publication

- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Year 2019

An experimental system was engineered and implemented in 100 copies inside a real banking environment comprising: dynamic handwritten signature verification, face recognition, bank client voice recognition and hand vein distribution verification. The main purpose of the presented research was to analyze questionnaire responses reflecting user opinions on: comfort, ergonomics, intuitiveness and other aspects of the biometric enrollment...

Full text available to download

The Impact of Foreign Accents on the Performance of Whisper Family Models Using Medical Speech in Polish

Publication

S. Zaporowski

- Year 2024

The article presents preliminary experiments investigating the impact of accent on the performance of the Whisper automatic speech recognition (ASR) system, specifically for the Polish language and medical data. The literature review revealed a scarcity of studies on the influence of accents on speech recognition systems in Polish, especially concerning medical terminology. The experiments involved voice cloning of selected individuals...

Full text available to download

PHONEME DISTORTION IN PUBLIC ADDRESS SYSTEMS

Publication

- Year 2015

The quality of voice messages in speech reinforcement and public address systems is often poor. The sound engineering projects of such systems take care of sound intensity and possible reverberation phenomena in public space without, however, considering the influence of acoustic interference related to the number and distribution of loudspeakers. This paper presents the results of measurements and numerical simulations of the...

Client-server Approach in the Navigation System for the Blind

Publication

- TransNav - The International Journal on Marine Navigation and Safety of Sea Transportation - Year 2013

The article presents the client‐server approach in the navigation system for the blind ‐ “Voice Maps”. The authors were among the main creators of the prototype and currently the commercialization phase is being finished. In the implemented prototype only exemplary, limited spatial data were used, therefore they could be stored and analysed (for path-finding process) in the mobile device’s memory without any difficulties. The...

Full text available to download

WYKORZYSTANIE SIECI NEURONOWYCH DO SYNTEZY MOWY WYRAŻAJĄCEJ EMOCJE [Using neural networks for the synthesis of speech expressing emotions]

Publication

- Year 2018

This article presents an analysis of speech-based emotion recognition solutions and of their possible use in emotional speech synthesis with neural networks. Current solutions for recognizing emotion in speech and methods of neural-network speech synthesis are presented. A significant increase in interest in and use of deep learning is currently observed in applications related...

Performance Analysis of Developed Multimodal Biometric Identity Verification System

Publication

- Elektronika : konstrukcje, technologie, zastosowania - Year 2018

The bank client identity verification system developed in the course of the IDENT project is presented. The total number of five biometric modalities including: dynamic handwritten signature proofing, voice recognition, face image verification, face contour extraction and hand blood vessels distribution comparison have been developed and studied. The experimental data were acquired employing multiple biometric sensors installed...

Full text to download in external service

Pilot Testing of Developed Multimodal Biometric Identity Verification System

Publication

- Year 2017

The bank client identity verification system developed in the course of the IDENT project is presented. The total number of five biometric modalities including: dynamic signature proofing, voice recognition, face image verification, face contour extraction and hand blood vessels distribution comparison have been developed and studied. The experimental data were acquired employing multiple biometric sensors installed at engineered...

The data exchange between smart glasses and healthcare information systems using the HL7 FHIR standard

Publication

J. Rumiński
A. Bujnowski
T. Kocejko
A. Andrushevich
M. Biallas
R. Kistler

- Year 2016

In this study we evaluated system architecture for the use of smart glasses as a viewer of information, as a source of medical data (vital sign measurements: temperature, pulse rate, and respiration rate), and as a filter of healthcare information. All activities were based on patient/device identification procedures using graphical markers or features based on visual appearance. The architecture and particular use cases were implemented...

Full text to download in external service

Enhanced voice user interface employing spatial filtration of signals from acoustic vector sensor

Publication

- Year 2015

Spatial filtration of sound is introduced to enhance speech recognition accuracy in noisy conditions. An acoustic vector sensor (AVS) is employed. The signals from the AVS probe are processed in order to attenuate the surrounding noise. As a result the signal to noise ratio is increased. An experiment is featured in which speech signals are disturbed by babble noise. The signals before and after spatial filtration are processed...

Full text to download in external service

Developing a Low SNR Resistant, Text Independent Speaker Recognition System for Intercom Solutions - A Case Study

Publication

- Year 2024

This article presents a case study on the development of a biometric voice verification system for an intercom solution, utilizing the DeepSpeaker neural network architecture. Despite the variety of solutions available in the literature, there is a noted lack of evaluations for "text-independent" systems under real conditions and with varying distances between the speaker and the microphone. This article aims to bridge this gap....

Full text available to download
