total: 2574
displaying 1000 best results
Search results for: SPECH PROCESSING
-
A Workflow Application for Parallel Processing of Big Data from an Internet Portal
PublicationThe paper presents a workflow application for efficient parallel processing of data downloaded from an Internet portal. The workflow partitions input files into subdirectories which are further split for parallel processing by services installed on distinct computer nodes. This way, analysis of the first ready subdirectories can start fast and is handled by services implemented as parallel multithreaded applications using multiple...
-
Processing of acoustical data in a multimodal bank operating room surveillance system
PublicationAn automatic surveillance system capable of detecting, classifying and localizing acoustic events in a bank operating room is presented. Algorithms for detection and classification of abnormal acoustic events, such as screams or gunshots, are introduced. Two types of detectors are employed to detect impulsive sounds and vocal activity. A Support Vector Machine (SVM) classifier is used to discern between the different classes of...
-
Using Rule-Based System for Monitoring Marine Navigation Data Processing
PublicationProcessing marine navigational data requires sophisticated software solutions. Typically, specialized tools called processors analyze raw data from different sensors. It becomes important to create monitoring software that is able to validate and verify processing components integrated into the final system. The Drools® business rule management platform provides a core business rules engine, web authoring and rules management...
-
Processing, mechanical and thermal behavior assessments of polycaprolactone/agricultural wastes biocomposites
PublicationIn this paper, brewer’s spent grain (BSG) was applied as a potential lignocellulose biofiller in biocomposites based on polycaprolactone (PCL). The PCL/BSG biocomposites filled with varying content of biofillers were prepared via low-temperature melt-compounding. These conditions limit thermal degradation of the biofillers during processing. The influence of biofiller content (ranging from 25 to 200 parts by weight on 100...
-
Processing data on sea bottom structure obtained by means of the parametric sounding
PublicationThe aim of the paper is to analyze data obtained during sounding the Gdansk Bay sea bed by means of the parametric echo-sounder. The accuracy of the sea bottom structure investigation needs correct configuration of research equipment and proper calibration of peripheral devices (GPS, heading sensor, MRU-Z motion sensor and navigation instruments which provide necessary data to bathymetrical measurement system, enabling its work...
-
Reliable Document-Centric Processing in Loosely Coupled Email-Based Systems
PublicationEmail is a simple way to exchange digital documents of any kind. The Mobile INteractive Document architecture (MIND) enables self-coordination and self-steering of document agent systems based on commonly available email services. In this paper, a mechanism for providing integrity and reliability of such an email-based agent system is proposed to cope with soft or hard message bounces, user interrupts, and other unexpected events....
-
Usage of the Gstreamer framework for generation, analysis, processing and visualization of sonar signal
PublicationIn this paper, a novel method of bearing estimation in a passive sonar system with a towed array is introduced. The classical approach to bearing estimation based on the spatial spectrum is extended by using the synchrosqueezing method, which is a part of the reassignment method introduced by Kodera et al. The use of this method leads to precise bearing estimation. The proposed method requires a relatively small amount...
-
Processing and structure–property relationships of natural rubber/wheat bran biocomposites
PublicationIn this work, wheat bran was used as cellulosic filler in biocomposites based on natural rubber. The impact of wheat bran content [ranging from 10 to 50 parts per hundred rubber (phr)] on processing, structure, dynamic mechanical properties, thermal properties, physico-mechanical properties and morphology of resulting biocomposites was investigated. For better characterization of interfacial interactions between natural rubber...
-
Probe signal processing for channel estimation in underwater acoustic communication system
PublicationUnderwater acoustic communication channels are characterized by a large variety of propagation conditions. Designing a reliable communication system requires knowledge of the transmission parameters of the channel, namely multipath delay spread, Doppler spread, coherence time, and coherence bandwidth. However, the possibilities of estimating them in a real-time underwater communication system are limited, mainly due to the computational...
-
Measurement of the Development of a Learning IT Organization Supported by a Model of Knowledge Acquisition and Processing
PublicationThe paper presents a model of knowledge acquisition and processing for the development of learning organizations. The theory of a learning organization provides neither metrics nor tools to measure its development. The authors' studies in this field are based on experience gathered from projects carried out in real IT organizations. The authors have described the construction of the model and the methods of its verification...
-
Prediction of Processor Utilization for Real-Time Multimedia Stream Processing Tasks
PublicationUtilization of MPUs in a computing cluster node for multimedia stream processing is considered. Non-linear increase of processor utilization is described and a related class of algorithms for multimedia real-time processing tasks is defined. For such conditions, experiments measuring the processor utilization and output data loss were proposed and their results presented. A new formula for prediction of utilization was proposed...
-
Data Compression in Ultrasonic Network Communication via Sparse Signal Processing
PublicationThis document presents the approach of using compressed sensing in signal encoding and information transferring within a guided wave sensor network, comprised of specially designed frequency steerable acoustic transducers (FSATs). Wave propagation in a damaged plate was simulated using commercial FEM-based software COMSOL. Guided waves were excited by means of FSATs, characterized by the special shape of its electrodes, and modeled...
-
Jan Daciuk dr hab. inż.
PeopleJan Daciuk received his M.Sc. from the Faculty of Electronics of Gdańsk University of Technology in 1986, and his Ph.D. from the Faculty of Electronics, Telecommunications and Informatics of Gdańsk University of Technology in 1999. He has been working at the Faculty since 1988. His research interests include finite state methods in natural language processing and computational linguistics, including speech processing. Dr. Daciuk...
-
Objective, observer-independent evaluation of myocardial perfusion and function: the role of SPECT
Publication -
Speech formant frequency and pitch estimation using instantaneous complex frequency
PublicationThis paper describes an algorithm for estimating the fundamental frequency as well as the center frequencies and bandwidths of speech formants using instantaneous complex frequency. The article also presents the results of the algorithm for Polish vowels.
-
Time-scale modification of speech signals for supporting hearing impaired schoolchildren
PublicationA study of time-scale modification algorithms applied to supporting hearing-impaired schoolchildren is presented. A variety of algorithms are considered, namely: overlap and add, two variations of synchronized overlap and add, and the phase vocoder. Their effectiveness as well as real-time processing capabilities are examined.
-
Corrupted speech intelligibility improvement using adaptive filter based algorithm
PublicationA technique for improving the quality of speech signals recorded in strong noise is presented. The proposed algorithm employing adaptive filtration is described and additional possibilities of speech intelligibility improvement are discussed. Results of the tests are presented.
-
Estimation of the short-term predictor parameters of speech under noisy conditions
Publication -
Recognition of Emotions in Speech Using Convolutional Neural Networks on Different Datasets
PublicationArtificial Neural Network (ANN) models, specifically Convolutional Neural Networks (CNN), were applied to extract emotions based on spectrograms and mel-spectrograms. This study uses spectrograms and mel-spectrograms to investigate which feature extraction method better represents emotions and how big the differences in efficiency are in this context. The conducted studies demonstrated that mel-spectrograms are a better-suited...
-
Investigating Noise Interference on Speech Towards Applying the Lombard Effect Automatically
PublicationThe aim of this study is two-fold. First, we perform a series of experiments to examine the interference of different noises on speech processing. For that purpose, we concentrate on the Lombard effect, an involuntary tendency to raise speech level in the presence of background noise. Then, we apply this knowledge to detecting speech with the Lombard effect. This is for preparing a dataset for training a machine learning-based...
-
Analysis of 2D Feature Spaces for Deep Learning-based Speech Recognition
Publicationconvolutional neural network (CNN) which is a class of deep, feed-forward artificial neural network. We decided to analyze audio signal feature maps, namely spectrograms, linear and Mel-scale cepstrograms, and chromagrams. The choice was made upon the fact that CNN performs well in 2D data-oriented processing contexts. Feature maps were employed in the Lithuanian word recognition task. The spectral analysis led to the highest word...
-
High quality speech codec employing sines+noise+transients model
PublicationA method of high quality wideband speech signal representation employing sines+transients+noise model is presented. The need for a wideband speech coding approach as well as various methods for analysis and synthesis of sines, residual and transient states of speech signal is discussed. The perceptual criterion is applied in the proposed approach during encoding of sines amplitudes in order to reduce bandwidth requirements and...
-
A non-uniform real-time speech time-scale stretching method
PublicationAn algorithm for non-uniform real-time speech stretching is presented. It combines the typical SOLA algorithm (Synchronous Overlap and Add) with vowel, consonant and silence detectors. Based on the information about the content and the estimated value of the rate of speech (ROS), the algorithm adapts the scaling factor value. The ability of real-time speech stretching and the resultant quality of voice were...
-
SYNTHESIZING MEDICAL TERMS – QUALITY AND NATURALNESS OF THE DEEP TEXT-TO-SPEECH ALGORITHM
PublicationThe main purpose of this study is to develop a deep text-to-speech (TTS) algorithm designated for an embedded system device. First, a critical literature review of state-of-the-art speech synthesis deep models is provided. The algorithm implementation covers both hardware and algorithmic solutions. The algorithm is designed for use with the Raspberry Pi 4 board. 80 synthesized sentences were prepared based on medical and everyday...
-
An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics
PublicationThe speech with the Lombard effect has been extensively studied in the context of speech recognition or speech enhancement. However, few studies have investigated the Lombard effect in the context of speech synthesis. The aim of this paper is to create a mathematical model that allows for retaining the Lombard effect. These models could be used as a basis of a formant speech synthesizer. The proposed models are based on dividing...
-
Comparison of Acoustic and Visual Voice Activity Detection for Noisy Speech Recognition
PublicationThe problem of accurately differentiating between the speaker utterance and the noise parts in a speech signal is considered. The influence of utilizing voice activity detection in speech signals on the accuracy of the automatic speech recognition (ASR) system is presented. The examined methods of voice activity detection are based on acoustic and visual modalities. The problem of detecting the voice activity in clean and noisy...
-
Processing the complex signal in the acoustic processor of a sonobuoy system
PublicationThe article presents methods of digital processing of the complex signal in the acoustic processor of a sonobuoy system. The general form of the system and of the complex signal is discussed. Two alternative signal processing methods are described: the first in the time domain, the second in the frequency domain. Block diagrams of the algorithms for both processing methods are presented. Problems of the practical implementation of individual...
-
Processing of polymer‐derived, aerogel‐filled, SiC foams for high‐temperature insulation
Publication -
Processing and thermal characterization of polymer derived SiCN(O) and SiOC reticulated foams
Publication -
Teleportation seen from spacetime: on 2-spinor aspects of quantum information processing
PublicationApplication of the 2-spinor formalism to quantum information processing, illustrated by the example of teleportation and relativistic error correlation.
-
Architecture of Request/Response and Publish/Subscribe System Capable of Processing Multimedia Streams
PublicationOn-the-fly analysis of multimedia streams containing high-quality image and sound data still poses a challenge for software designers. The paper presents the architecture of a system capable of real-time processing of multimedia streams using components operating in the Publish/Subscribe and Request/Response architectures, which take advantage of the capabilities of the Java Multimedia Framework...
-
Architecture of Request/Response and Publish/Subscribe System Capable of Processing Multimedia Streams
PublicationOn-the-fly analysis of multimedia streams containing high-quality image and sound data still poses a challenge for software designers. The paper presents the architecture of a system capable of real-time processing of multimedia streams using components operating in the Publish/Subscribe and Request/Response architectures, which take advantage of the capabilities of the Java Multimedia...
-
EIGHT OLD CULTIVARS OF APPLE TREES – AN EVALUATION OF THEIR POTENTIAL FOR USE BY THE PROCESSING INDUSTRY
Publication -
The Research into the Effect of Conditions of Combined Electric Powered Diamond Processing on Cutting Power
Publication -
Research of Influence Electric Conditions Combined ElectroDiamond Processing by on Specific Consumption of Wheel*
Publication -
Architecture for Aggregation, Processing and Provisioning of Data from Heterogeneous Scientific Information Services
Publication -
Interfacial properties of PET and PET/starch polymers developed by air plasma processing
Publication -
The Use of Effective Microorganisms as a Sustainable Alternative to Improve the Quality of Potatoes in Food Processing
Publication -
Measurements of Two-phase Flows in Pipelines Using Radioisotopes and Statistical Signal Processing
PublicationThis paper presents an application of radiotracers and the gamma absorption method to two-phase flow measurements in pipelines. Two different methods were used to analyze the acquired signals. The investigated methods are based on the cross-correlation function and the phase of the cross-spectral density distribution. The examples presented in the article illustrate the application of the radioisotopes to evaluation of liquid-gas...
-
Effect of Processing Parameters on Strength and Corrosion Resistance of Friction Stir-Welded AA6082
PublicationThe friction stir welding method is increasingly attracting interest in the railway sector due to its environmental friendliness, low cost, and ease of producing high-quality joints. Using aluminum alloys reduces the weight of structures, increasing their payload and reducing fuel consumption and running costs. The following paper presents studies on the microstructure, strength, and corrosion resistance of AA6082 aluminum alloy...
-
Intelligent Audio Signal Processing − Do We Still Need Annotated Datasets?
PublicationIn this paper, examples of intelligent audio signal processing are briefly described. The focus is, however, on the machine learning approach and the datasets needed, especially for deep learning models. Years of intense research produced many important results in this area; however, the goal of fully intelligent signal processing, characterized by its autonomous acting, is not yet achieved. Therefore, a review of state-of-the-art concerning...
-
Musical Instrument Tagging Using Data Augmentation and Effective Noisy Data Processing
PublicationDeveloping signal processing methods to extract information automatically has potential in several applications, for example searching for multimedia based on its audio content, making context-aware mobile applications (e.g., tuning apps), or pre-processing for an automatic mixing system. However, the last-mentioned application needs a significant amount of research to reliably recognize real musical instruments in recordings....
-
Can high hydrostatic pressure processing be the best way to preserve human milk?
PublicationBreastfeeding is one of the most important factors influencing proper child development. When a mother cannot breastfeed, the best alternative, especially for feeding premature infants, is to use human milk (HM) which has been collected, preserved and stored in Human Milk Banks (HMB). Scope and approach: In this review, the impact of some stages of the management of HM in HMB on its final biological value and microbiological...
-
Task Allocation and Scalability Evaluation for Real-Time Multimedia Processing in a Cluster Environment
PublicationAn allocation algorithm for stream processing tasks is proposed (Modified best Fit Descendent, MBFD). A comparison with another solution (BFD) is provided. Tests of the algorithms in an HPC environment are described and the results are presented. A proper scalability metric is proposed and used for the evaluation of the allocation algorithm.
-
The influence of roasting and additional processing on the content of bioactive components in special purpose coffees
PublicationCoffee, a beverage consumed worldwide, is also a very competitive commodity. Consequently, producers seek ways of attracting consumers by proposing, e.g., novel ingredient combinations, usually without evaluating their health quality. In this study, variations in health-promoting determinants for five special purpose coffee brews were characterized. The major bioactive components - chlorogenic acids (CAs) - detected by HPLC-DAD-MS...
-
Methods of Assessing Odour Emissions from Biogas Plants Processing Municipal Waste
Publication -
Assessment of image processing methods for the determination of propagation of squat-type defects in rails
PublicationWe demonstrate the idea of squat-type defect measurement in the rail and the concept of tracking of the defect development using the techniques of image acquisition and image processing as well as the methods of metric spaces. We introduce the concepts of a set diameter δ(A) and the metric ρ1, which come from the properties of plane figures, to compare and to observe the development of the defects. We characterize the feasibility...
-
A Solution to Image Processing with Parallel MPI I/O and Distributed NVRAM Cache
PublicationThe paper presents a new approach to parallel image processing using byte addressable, non-volatile memory (NVRAM). We show that our custom built MPI I/O implementation of selected functions that use a distributed cache that incorporates NVRAMs located in cluster nodes can be used for efficient processing of large images. We demonstrate performance benefits of such a solution compared to a traditional implementation without NVRAM...
-
Impact of Shifting Time-Window Post-Processing on the Quality of Face Detection Algorithms
PublicationWe consider binary classification algorithms, which operate on single frames from video sequences. Such a class of algorithms is named OFA (One Frame Analyzed). Two such algorithms for facial detection are compared in terms of their susceptibility to the FSA (Frame Sequence Analysis) method. It introduces a shifting time-window improvement, which includes the temporal context of frames in a post-processing step that improves the...
-
An application of advanced data processing methods to response analysis of electrocatalytic gas sensor
PublicationMethods used to date for analyzing the responses of electrocatalytic sensors are presented and new ones are proposed. Their properties are compared.