Search results for: LOMBARD EFFECT, SPEECH DETECTION, NOISE SIGNAL, SELF-SIMILARITY MATRIX, CONVOLUTIONAL NEURAL NETWORK

Neural network training with limited precision and asymmetric exponent

Publication

- Journal of Big Data - Year 2022

Along with the rapidly increasing number of mobile devices, sensors and other smart utilities, an unprecedented growth of data can be observed in today’s world. In order to address multiple challenges facing the big data domain, machine learning techniques are often leveraged for data analysis, filtering and classification. Wide usage of artificial intelligence with large amounts of data creates growing demand not only for storage...

Full text available to download

The influence of image masks definition on segmentation results of histopathological images using convolutional neural network

Publication

- Year 2019

In the era of collecting large amounts of tissue materials, assisting the work of histopathologists with various electronic and IT tools is an undeniable fact. The traditional interaction between a human pathologist and the glass slide is changing to an interaction between an AI pathologist and whole slide images. One of the important tasks is the segmentation of objects (e.g. cells) in such images. In this...

Full text available to download

Signal conditioning for examination of shallow-water acoustic noise correlation properties

Publication

- HYDROACOUSTICS - Year 2016

The article describes the process of signal conditioning for examination of acoustic noise correlation properties in shallow water. Knowledge of these properties is very important for the design processes of passive and active hydroacoustic systems. This paper focuses on the above issue from the point of view of passive sonar. In sonar systems, signal processing algorithms operate on both useful acoustic signals, and accompanying...

Full text available to download

Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency

Publication

- Year 2007

In this paper we propose a novel method of pitch estimation based on instantaneous complex frequency (ICF). A new iterative algorithm for the analysis of the ICF of a speech signal is presented. The obtained results are compared with commonly used methods to demonstrate its accuracy and the connection between ICF and pitch, particularly for narrowband-filtered speech signals.

Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency

Publication

- Elektronika : konstrukcje, technologie, zastosowania - Year 2008

In this paper we propose a novel method of pitch estimation based on instantaneous complex frequency (ICF). A new iterative algorithm for the analysis of the ICF of a speech signal is presented. The obtained results are compared with commonly used methods to demonstrate its accuracy and the connection between ICF and pitch, particularly for narrowband-filtered speech signals.

Bees Detection on Images: Study of Different Color Models for Neural Networks

Publication

- Year 2019

This paper presents an approach to bee detection in video streams using a neural network classifier. We describe the motivation for our research and the methodology of data acquisition. The main contribution of this work is a comparison of different color models used as an input format for a feedforward convolutional architecture applied to bee detection. The detection process is based on a neural binary classifier that classifies...

Full text available to download

Robustness in Compressed Neural Networks for Object Detection

Publication

- Year 2021

Model compression techniques make it possible to significantly reduce the computational cost associated with data processing by deep neural networks with only a minor decrease in average accuracy. At the same time, reducing the model size may have a large effect on noisy cases or objects belonging to less frequent classes. It is a crucial problem from the perspective of the models' safety, especially for object detection in the autonomous driving...

Full text available to download

Noise effect on parameters of quiet sonar with code modulation

Publication

- Vibrations in Physical Systems - Year 2023

Earlier publications by the authors have shown that the use of code keying mixed with the CW FM sound signal allows a significant reduction in the distance measurement error, compared to classic silent CW FM sonar. In addition to the code modulation parameters, the magnitude of this error is influenced by the received input acoustic noise. The article shows the dependence of the input signal-to-noise ratio and the sound...

Full text available to download

Application of TMS320c67xx signal processors for SONIC-self-optimizing narrowband interference canceler

Publication

K. Cisowski

- Year 2010

The paper presents a laboratory system for testing active control algorithms for acoustic noise in ducts. The applied algorithm, the self-optimizing narrowband interference canceller (SONIC), allows one to remove narrowband disturbances of constant or slowly time-varying frequencies. Example experimental results of using the laboratory system for suppression of a sinusoidal disturbance are described. An electronic part of the system was...

Wireless Body Area Network for Preventing Self-Inoculation Transmission of Respiratory Viral Diseases

Publication

Ł. Pawlicki
A. Fotyga
J. Rewieński
M. Groth
Ł. Kulas
G. Fotyga

- SENSORS - Year 2023

This paper proposes the idea of Wireless Body Area Networks (WBANs) based on Bluetooth Low-Energy (BLE) standards to recognize a face-touching gesture and raise an alarm, and in effect, to prevent self-inoculation of respiratory viral diseases, such as COVID-19 or influenza A, B, or C. The proposed network comprises wireless modules placed in bracelets and a necklace. It relies on the received signal strength indicator (RSSI) measurements...

Full text available to download

Implementation of constant component filter in measurements of random telegraph signal noise

Publication

- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Year 2012

Noise is generated in all semiconductor devices. The intensity of these fluctuations depends on the elements used, the manufacturing process, the operating conditions and the device type. The resulting noise is a superposition of different kinds of fluctuations, such as thermal noise, generation-recombination noise, 1/f noise, shot noise and Random Telegraph Signal (RTS) noise. The last one, RTS noise, is observed as nonstationary impulse fluctuations....

Full text available to download

Global Surrogate Modeling by Neural Network-Based Model Uncertainty

Publication

L. Leifsson
J. Nagawkar
L. Barnet
K. Bryden
S. Kozieł
A. Pietrenko-Dąbrowska

- Year 2022

This work proposes a novel adaptive global surrogate modeling algorithm which uses two neural networks, one for prediction and the other for the model uncertainty. Specifically, the algorithm proceeds in cycles and adaptively enhances the neural network-based surrogate model by selecting the next sampling points guided by an auxiliary neural network approximation of the spatial error. The proposed algorithm is tested numerically...

Full text to download in external service

Supply current signal and artificial neural networks in the induction motor bearings diagnostics

Publication

- Year 2013

This paper presents research results on the diagnostics of induction motor bearings based on measurement of the supply current with the use of artificial neural networks. Bearing failures account for more than 40% of all motor failures, which makes their damage-free operation crucial. Tests were performed on motors with intentionally introduced bearing defects. Chapter 2 introduces the concept of artificial neural networks. It presents...

Ranking Speech Features for Their Usage in Singing Emotion Classification

Publication

- Year 2020

This paper aims to retrieve speech descriptors that may be useful for the classification of emotions in singing. For this purpose, Mel Frequency Cepstral Coefficients (MFCC) and selected Low-Level MPEG 7 descriptors were calculated based on the RAVDESS dataset. The database contains recordings of emotional speech and singing of professional actors presenting six different emotions. Employing the algorithm of Feature Selection based...

Full text available to download

APPLICATION OF STATISTICAL FEATURES AND MULTILAYER NEURAL NETWORK TO AUTOMATIC DIAGNOSIS OF ARRHYTHMIA BY ECG SIGNALS

Publication

A. B. Slama
Ł. Lentka
A. Mouelhi
M. F. Diouani
M. Sayadi
J. Smulko

- Metrology and Measurement Systems - Year 2018

Abnormal electrical activity of the heart can produce a cardiac arrhythmia. The electrocardiogram (ECG) is a non-invasive technique which is used as a diagnostic tool for cardiac diseases. Non-stationarity and irregularity of the heartbeat signal impose many difficulties on clinicians (e.g., in the case of myocardial infarction arrhythmia). Fortunately, signal processing algorithms can expose hidden information within ECG signal contaminated...

Full text available to download

Towards bees detection on images: study of different color models for neural networks

Publication

- Year 2019

This paper presents an approach to bee detection in video streams using a neural network classifier. We describe the motivation for our research and the methodology of data acquisition. The main contribution of this work is a comparison of different color models used as an input format for a feedforward convolutional architecture applied to bee detection. The detection process is based on a neural...

Data Compression in Ultrasonic Network Communication via Sparse Signal Processing

Publication

B. Zima
O. Reyes Márquez
M. Mohammadgholiha
J. Moll
L. Marchi De

- Year 2022

This document presents an approach that uses compressed sensing for signal encoding and information transfer within a guided wave sensor network composed of specially designed frequency steerable acoustic transducers (FSATs). Wave propagation in a damaged plate was simulated using the commercial FEM-based software COMSOL. Guided waves were excited by means of FSATs, characterized by the special shape of their electrodes, and modeled...

Full text available to download

Self-Organizing Wireless Nodes Monitoring Network

Publication

- POLISH JOURNAL OF ENVIRONMENTAL STUDIES - Year 2009

The concept of a data monitoring system and a self-organizing network of multipurpose data transfer nodes is presented. Two practical applications of this system are also presented. The first is a wireless monitoring system for containers, and the second is a mobile monitoring system for gaseous air pollution measurements.

Weakly-Supervised Word-Level Pronunciation Error Detection in Non-Native English Speech

Publication

D. Korzekwa
J. Lorenzo-Trueba
T. Drugman
S. Calamaro
B. Kostek

- Year 2021

We propose a weakly-supervised model for word-level mispronunciation detection in non-native (L2) English speech. To train this model, phonetically transcribed L2 speech is not required and we only need to mark mispronounced words. The lack of phonetic transcriptions for L2 speech means that the model has to learn only from a weak signal of word-level mispronunciations. Because of that and due to the limited amount of mispronounced...

Full text available to download

Self-optimizing narrowband interference canceller - can reference signal help?

Publication

- Year 2011

SONIC (Self-Optimizing Narrowband Interference Canceller) is an acronym of the recently proposed active noise control algorithm with interesting adaptivity and robustness properties. SONIC is a purely feedback controller, capable of rejecting nonstationary sinusoidal disturbances (with time-varying amplitudes and/or frequencies) in the presence of plant (secondary path) uncertainties. We show that even though SONIC can work reliably...

Network on Chip implementation using FPGAs resources

Publication

M. Kłosowski

- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Year 2006

The article presents an implementation of a "Network on Chip" in FPGA devices. Networks on Chip have become a very interesting and promising solution for "System on Chip" designs that are characterized by intensive internal communication. Because of the different design paradigms involved, no efficient platforms are currently available for building and verifying Network on Chip prototypes....

Vehicle Detection with Self-Training for Adaptative Video Processing Embedded Platform

Publication

- Applied Sciences-Basel - Year 2020

Traffic monitoring from closed-circuit television (CCTV) cameras on embedded systems is the subject of the performed experiments. Solving this problem encounters difficulties related to the hardware limitations, and possible camera placement in various positions which affects the system performance. To satisfy the hardware requirements, vehicle detection is performed using a lightweight Convolutional Neural Network (CNN), named...

Full text available to download

DEEP CONVOLUTIONAL NEURAL NETWORKS AS A DECISION SUPPORT TOOL IN MEDICAL PROBLEMS – MALIGNANT MELANOMA CASE STUDY

Publication

- Year 2017

The paper presents the use of one of the latest tools from the group of machine learning techniques, namely Deep Convolutional Neural Networks (CNN), in the process of decision making in selected medical problems. After a survey of the most successful applications of CNN in solving medical problems, the paper focuses on the very difficult problem of automatic analysis of skin lesions. The authors propose the CNN structure...

Full text to download in external service

Visual Lip Contour Detection for the Purpose of Speech Recognition

Publication

- Year 2014

A method for visual detection of lip contours in frontal recordings of speakers is described and evaluated. The purpose of the method is to facilitate speech recognition with visual features extracted from the mouth region. Different Active Appearance Models are employed for finding lips in video frames and for statistical description of lip shape and texture. A search initialization procedure is proposed and error measure values are...

Controlling computer by lip gestures employing neural network

Publication

- Year 2010

Results of experiments regarding lip gesture recognition with an artificial neural network are discussed. The neural network module forms the core element of a multimodal human-computer interface called LipMouse. This solution allows a user to work on a computer using lip movements and gestures. The user's face is detected in a video stream from a standard web camera using a cascade of boosted classifiers working with Haar-like features....

Full text to download in external service

Evolving neural network as a decision support system — Controller for a game of “2048” case study

Publication

- Year 2016

The paper proposes an approach to designing a neuro-genetic self-learning decision support system. The system is based on neural networks trained adaptively by an evolutionary mechanism, forming an evolved neural network. The presented learning algorithm enables the selection of the neural network structure by establishing or removing connections between the neurons, and then finding the best-suited values of the network...

Full text to download in external service

Diagnostic potential for a serum miRNA neural network for detection of ovarian cancer

Publication

K. Elias
W. Fendler
K. Stawiski
S. Fiascone
A. Vitonis
R. Berkowitz
G. Frendl
P. Konstantinopoulos
C. Crum
M. Kedzierska... and 2 others

- eLife - Year 2017

Full text to download in external service

TOXIC GASES IDENTIFICATION USING SINGLE ELECTROCATALYTIC SENSOR RESPONSES AND ARTIFICIAL NEURAL NETWORK

Publication

- Year 2013

The need for precise detection of toxic gases drives the development of new gas sensor structures and methods of processing the output signals from the sensors. In the literature, artificial neural networks are considered one of the most effective tools for the analysis of gas sensor or sensor array responses. In this paper a method of toxic gas components identification using an electrocatalytic gas sensor as a detector and an artificial...

The impact of the AC922 Architecture on Performance of Deep Neural Network Training

Publication

- Year 2020

Practical deep learning applications require more and more computing power. New computing architectures emerge, specifically designed for the artificial intelligence applications, including the IBM Power System AC922. In this paper we confront an AC922 (8335-GTG) server equipped with 4 NVIDIA Volta V100 GPUs with selected deep neural network training applications, including four convolutional and one recurrent model. We report...

Full text to download in external service

Automated detection of pronunciation errors in non-native English speech employing deep learning

Publication

D. Korzekwa

- Year 2023

Despite significant advances in recent years, the existing Computer-Assisted Pronunciation Training (CAPT) methods detect pronunciation errors with a relatively low accuracy (precision of 60% at 40%-80% recall). This Ph.D. work proposes novel deep learning methods for detecting pronunciation errors in non-native (L2) English speech, outperforming the state-of-the-art method in AUC metric (Area under the Curve) by 41%, i.e., from...

Full text available to download

Longitudinal drug synergy assessment using convolutional neural network image-decoding of glioblastoma single-spheroid cultures

Publication

A. Giczewska
K. Pastuszak
M. Houweling
U. K. Abdul
N. Faaij
L. Wedekind
D. Noske
T. Würdinger
A. Supernat
B. Westerman

- Neuro-Oncology Advances - Year 2023

Background: In recent years, drug combinations have become increasingly popular to improve therapeutic outcomes in various diseases, including difficult-to-cure cancers such as the brain cancer glioblastoma. Assessing the interaction between drugs over time is critical for predicting drug combination effectiveness and minimizing the risk of therapy resistance. However, as viability readouts of drug combination experiments...

Full text available to download

Computer-assisted pronunciation training—Speech synthesis is almost all you need

Publication

D. Korzekwa
J. Lorenzo-Trueba
T. Drugman
B. Kostek

- SPEECH COMMUNICATION - Year 2022

The research community has long studied computer-assisted pronunciation training (CAPT) methods in non-native speech. Researchers focused on studying various model architectures, such as Bayesian networks and deep learning methods, as well as on the analysis of different representations of the speech signal. Despite significant progress in recent years, existing CAPT methods are not able to detect pronunciation errors with high...

Full text available to download

A Bayesian regularization-backpropagation neural network model for peeling computations

Publication

S. Gouravaraju
J. Narayan
R. Sauer
S. S. Gautam

- JOURNAL OF ADHESION - Year 2023

A Bayesian regularization-backpropagation neural network (BRBPNN) model is employed to predict some aspects of the gecko spatula peeling, viz. the variation of the maximum normal and tangential pull-off forces and the resultant force angle at detachment with the peeling angle. K-fold cross validation is used to improve the effectiveness of the model. The input data is taken from finite element (FE) peeling results. The neural network...

Full text available to download

Bimodal classification of English allophones employing acoustic speech signal and facial motion capture

Publication

- Journal of the Acoustical Society of America - Year 2018

A method for automatic transcription of English speech into the International Phonetic Alphabet (IPA) system is developed and studied. The principal objective of the study is to evaluate to what extent the visual data related to lip reading can enhance recognition accuracy of the transcription of English consonantal and vocalic allophones. To this end, motion capture markers were placed on the faces of seven speakers to obtain lip...

Full text to download in external service

A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces

Publication

G. Tamulevicius
G. Korvel
A. B. Yayak
P. Treigys
J. Bernataviciene
B. Kostek

- Electronics - Year 2020

In this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset of more than 10,000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation...

Full text available to download

Age Prediction from Low Resolution, Dual-Energy X-ray Images Using Convolutional Neural Networks

Publication

- Applied Sciences-Basel - Year 2022

Age prediction from X-rays is an interesting research topic important for clinical applications such as biological maturity assessment. It is also useful in many other practical applications, including sports or forensic investigations for age verification purposes. Research on these issues is usually carried out using high-resolution X-ray scans of parts of the body, such as images of the hands or images of the chest. In this...

Full text available to download

GPU Power Capping for Energy-Performance Trade-Offs in Training of Deep Convolutional Neural Networks for Image Recognition

Publication

- Year 2022

In the paper we present an investigation of the performance-energy trade-off in training Deep Convolutional Neural Networks for image recognition. Several representative and widely adopted network models, such as Alexnet, VGG-19, Inception V3, Inception V4, Resnet50 and Resnet152, were tested using systems with Nvidia Quadro RTX 6000 as well as Nvidia V100 GPUs. Using GPU power capping, we found non-default configurations minimizing...

Full text available to download

Intelligent turbogenerator controller based on artificial neural network

Publication

H. Tiliouine

- Measurement Automation Monitoring - Year 2011

The paper presents a design of an intelligent controller based on a neural network (ICNN). The ICNN simultaneously ensures two fundamental functions: maintaining the generator voltage at the desired value and damping the electromechanical oscillations. Its performance is evaluated on a single machine infinite bus power system through computer simulations. The dynamic and transient operation of the proposed controller...

Electromagnetic Modeling of Microstrip Elements Aided with Artificial Neural Network

Publication

Ł. Sorokosz
W. Zieniutycz

- Year 2020

The paper proposes an electromagnetic modeling principle aided with an artificial neural network for designing microwave wideband elements/networks prepared in microstrip technology. It is assumed that the complete information is known for the prototype design, which is prepared on a certain substrate with a certain thickness and electric permittivity. The longitudinal and transversal dimensions of new design...

Full text available to download

Real-time speech-rate modification experiments

Publication

- Year 2010

An algorithm designed for real-time speech time scale modification (stretching) is proposed. It combines a typical synchronous overlap and add based time scale modification algorithm with signal redundancy detection algorithms that allow parts of the speech signal to be removed and replaced with stretched speech signal fragments. The effectiveness of the signal processing algorithms is examined experimentally together...

Full text to download in external service

DIAGNOSIS OF MALIGNANT MELANOMA BY NEURAL NETWORK ENSEMBLE-BASED SYSTEM UTILISING HAND-CRAFTED SKIN LESION FEATURES

Publication

- Metrology and Measurement Systems - Year 2019

Malignant melanomas are the deadliest type of skin cancer but, when detected early, have a high chance of successful treatment. In the last twenty years, interest in automated melanoma recognition, detection and classification has grown rapidly, partially because of the appearance of public datasets with dermatoscopic images of skin lesions. Automated computer-aided skin cancer detection in dermatoscopic images is a very challenging task...

Full text available to download

Detection of the First Component of the Received LTE Signal in the OTDoA Method

Publication

- WIRELESS COMMUNICATIONS & MOBILE COMPUTING - Year 2019

In the modern world there is a growing demand for localization services of various kinds. Position estimation can be realized via cellular networks, especially in the currently widely deployed LTE (Long Term Evolution) networks. However, it is not an easy task in harsh propagation conditions, which often occur in dense urban environments. Recently, time-methods of terminal localization within the network have been the focus of attention,...

Full text available to download

Improved method for real-time speech stretching

Publication

- Year 2012

An algorithm for real-time speech stretching is presented. It was designed to modify the input signal depending on its content and on its relation to the historical input data. The proposed algorithm is a combination of speech signal analysis algorithms, i.e. voice, vowels/consonants, stuttering detection and SOLA (Synchronous-Overlap-and-Add) based speech stretching algorithm. This approach enables stretching input speech signal...

Full text to download in external service

Real‐Time PPG Signal Conditioning with Long Short‐Term Memory (LSTM) Network for Wearable Devices

Publication

M. Wójcikowski

- SENSORS - Year 2022

This paper presents an algorithm for real‐time detection of the heart rate measured on a person’s wrist using a wearable device with a photoplethysmographic (PPG) sensor and accelerometer. The proposed algorithm consists of an appropriately trained LSTM network and the Time‐Domain Heart Rate (TDHR) algorithm for peak detection in the PPG waveform. The Long Short‐Term Memory (LSTM) network uses the signals from the accelerometer...

Full text available to download

Smart Approach for Glioma Segmentation in Magnetic Resonance Imaging using Modified Convolutional Network Architecture (U-NET)

Publication

N. Sohail
S. M. Anwar
F. Majeed
E. Szczerbicki

- CYBERNETICS AND SYSTEMS - Year 2021

Segmentation of a brain tumor from magnetic resonance multimodal images is a challenging task in the field of medical imaging. The vast diversity in potential target regions, appearance and multifarious intensity threshold levels of various tumor types are a few of the major factors that affect segmentation results. An accurate diagnosis and its treatment demand strict delineation of the tumor-affected tissues. Herein, we focus on...

Full text available to download

Modeling and Simulation for Exploring Power/Time Trade-off of Parallel Deep Neural Network Training

Publication

P. Rościszewski

- Procedia Computer Science - Year 2017

In the paper we tackle the bi-objective execution time and power consumption optimization problem concerning the execution of parallel applications. We propose using a discrete-event simulation environment for exploring this power/time trade-off in the form of a Pareto front. The solution is verified by a case study based on a real deep neural network training application for automatic speech recognition. A simulation lasting over 2 hours...

Full text available to download

Detection, classification and localization of acoustic events in the presence of background noise for acoustic surveillance of hazardous situations

Publication

- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2016

Evaluation of sound event detection, classification and localization of hazardous acoustic events in the presence of background noise of different types and changing intensities is presented. The methods for discerning between the events being in focus and the acoustic background are introduced. The classifier, based on a Support Vector Machine algorithm, is described. The set of features and samples used for the training of the...

Full text available to download

Difference in Perceived Speech Signal Quality Assessment Among Monolingual and Bilingual Teenage Students

Publication

P. Falkowski-Gilski

- Year 2021

The user perceived quality is a mixture of factors, including the background of an individual. The process of auditory perception is discussed in a wide variety of fields, ranging from engineering to medicine. Many studies examine the difference between musicians and non-musicians. Since musical training develops musical hearing and other various auditory capabilities, similar enhancements should be observable in case of bilingual...

Full text to download in external service


Bożena Kostek prof. dr hab. inż.