Search results for: audio parametrization

coMpliAnce with evideNce-based cliniCal guidelines in the managemenT of acute biliaRy pancreAtitis): The MANCTRA-1 international audit

Publication

S. Di
D. Damaskos
V. Agnoletti
D. Mole
C. Gerardi
F. Virdis
D. Pacella
K. Jayant
M. Sartelli
A. Leppaniemi... and 568 others

- PANCREATOLOGY - Year 2022

Full text to download in external service

How it audit can help your communications branch company increase effectiveness of using it technologies and contributes to implementation of business

Publication

- Year 2008

Text based on the example of Cable Television Ltd. Co. in Koszalin.

Multimodal English corpus for automatic speech recognition

Publication

- Year 2013

A multimodal corpus developed for research of speech recognition based on audio-visual data is presented. Besides usual video and sound excerpts, the prepared database contains also thermovision images and depth maps. All streams were recorded simultaneously, therefore the corpus enables to examine the importance of the information provided by different modalities. Based on the recordings, it is also possible to develop a speech...

Automatic Clustering of EEG-Based Data Associated with Brain Activity

Publication

- Year 2018

The aim of this paper is to present a system for automatic assigning electroencephalographic (EEG) signals to appropriate classes associated with brain activity. The EEG signals are acquired from a headset consisting of 14 electrodes placed on skull. Data gathered are first processed by the Independent Component Analysis algorithm to obtain estimates of signals generated by primary sources reflecting the activity of the brain....

Full text to download in external service

Examining Classifiers Applied to Static Hand Gesture Recognition in Novel Sound Mixing System

Publication

- Advances in Intelligent Systems and Computing - Year 2013

The main objective of the chapter is to present the methodology and results of examining various classifiers (Nearest Neighbor-like algorithm with non-nested generalization (NNge), Naive Bayes, C4.5 (J48), Random Tree, Random Forests, Artificial Neural Networks (Multilayer Perceptron), Support Vector Machine (SVM) used for static gesture recognition. A problem of effective gesture recognition is outlined in the context of the system...

Full text to download in external service

INFLUENCE OF DATA NORMALIZATION ON THE EFFECTIVENESS OF NEURAL NETWORKS APPLIED TO CLASSIFICATION OF PAVEMENT CONDITIONS – CASE STUDY

Publication

- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Year 2018

In recent years automatic classification employing machine learning seems to be in high demand for tele-informatic-based solutions. An example of such solutions are intelligent transportation systems (ITS), in which various factors are taken into account. The subject of the study presented is the impact of data pre-processing and normalization on the accuracy and training effectiveness of artificial neural networks in the case...

Low-Level Music Feature Vectors Embedded as Watermarks

Publication

- Year 2013

In this paper a method consisting in embedding low-level music feature vectors as watermarks into a musical signal is proposed. First, a review of some recent watermarking techniques and the main goals of development of digital watermarking research are provided. Then, a short overview of parameterization employed in the area of Music Information Retrieval is given. A methodology of non-blind watermarking applied to music-content...

Full text to download in external service

Measuring and Analyzing Audio Levels in Film, Commercials, and Movie Trailers Using Leq(A) Values and the LUFS Loudness Model . Analiza pomiarów dźwięku w filmie oraz w reklamach filmowych z wykorzystaniem modelu głośności

Publication

- Year 2015

The purpose of this paper is to describe the measurement of loudness levels in movies, movie trailers, and commercials displayed before feature films at movie theaters. In the initial section, the paper discusses the issues related to measurement of loudness levels, provides recommendations regarding permissible loudness levels during movie screenings, and mentions the applied units of measurement. The following section of the...

Bimodal classification of English allophones employing acoustic speech signal and facial motion capture

Publication

- Journal of the Acoustical Society of America - Year 2018

A method for automatic transcription of English speech into International Phonetic Alphabet (IPA) system is developed and studied. The principal objective of the study is to evaluate to what extent the visual data related to lip reading can enhance recognition accuracy of the transcription of English consonantal and vocalic allophones. To this end, motion capture markers were placed on the faces of seven speakers to obtain lip...

Full text to download in external service

Comparison of the effectiveness of automatic EEG signal class separation algorithms

Publication

- JOURNAL OF INTELLIGENT & FUZZY SYSTEMS - Year 2019

In this paper, an algorithm for automatic brain activity class identification of EEG (electroencephalographic) signals is presented. EEG signals are gathered from seventeen subjects performing one of the three tasks: resting, watching a music video and playing a simple logic game. The methodology applied consists of several steps, namely: signal acquisition, signal processing utilizing z-score normalization, parametrization and...

Full text available to download

Speech Analytics Based on Machine Learning

Publication

- Year 2019

In this chapter, the process of speech data preparation for machine learning is discussed in detail. Examples of speech analytics methods applied to phonemes and allophones are shown. Further, an approach to automatic phoneme recognition involving optimized parametrization and a classifier belonging to machine learning algorithms is discussed. Feature vectors are built on the basis of descriptors coming from the music information...

Full text to download in external service

Rough Sets Applied to Mood of Music Recognition

Publication

- Year 2016

With the growth of accessible digital music libraries over the past decade, there is a need for research into automated systems for searching, organizing and recommending music. Mood of music is considered as one of the most intuitive criteria for listeners, thus this work is focused on the emotional content of music and its automatic recognition. The research study presented in this work contains an attempt to music emotion recognition...

Approximate models and parameter analysis of the flow process in transmission pipelines

Publication

- Year 2016

the paper deals with the problem of early leak detection in transmission pipelines. First we present the derivation of state-space equations of the flow process in the pipelines. This description is then aggregated in order to obtain a principal model. Next, the problem of process model parameterization is addressed, taking into account the maximization of a model stability margin. The location of the maximum is determined using...

Full text to download in external service

Assessment of Therapeutic Progress After Acquired Brain Injury Employing Electroencephalography and Autoencoder Neural Networks

Publication

- Year 2018

A method developed for parametrization of EEG signals gathered from participants with acquired brain injuries is shown. Signals were recorded during therapeutic session consisting of a series of computer assisted exercises. Data acquisition was performed in a neurorehabilitation center located in Poland. The presented method may be used for comparing the performance of subjects with acquired brain injuries (ABI) who are involved...

Full text to download in external service

Automatic sound recognition for security purposes

Publication

P. Żwan

- Year 2008

In the paper an automatic sound recognition system is presented. It forms a part of a bigger security system developed in order to monitor outdoor places for non-typical audio-visual events. The analyzed audio signal is being recorded from a microphone mounted in an outdoor place thus a non stationary noise of a significant energy is present in it. In the paper an especially designed algorithm for outdoor noise reduction is presented,...

QoS/QoE in the Heterogeneous Internet of Things (IoT)

Publication

K. Nowicki
T. Uhl

- Year 2017

Applications provided in the Internet of Things can generally be divided into three categories: audio, video and data. This has given rise to the popular term Triple Play Services. The most important audio applications are VoIP and audio streaming. The most notable video applications are VToIP, IPTV, and video streaming, and the service WWW is the most prominent example of data-type services. This chapter elaborates on the most...

Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions

Publication

- Year 2016

Automatic speech recognition (ASR) is under constant development, especially in cases when speech is casually produced or it is acquired in various environment conditions, or in the presence of background noise. Phonetic transcription is an important step in the process of full speech recognition and is discussed in the presented work as the main focus in this process. ASR is widely implemented in mobile devices technology, but...

Full text to download in external service

Testing A Novel Gesture-Based Mixing Interface

Publication

- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2013

With a digital audio workstation, in contrast to the traditional mouse-keyboard computer interface, hand gestures can be used to mix audio with eyes closed. Mixing with a visual representation of audio parameters during experiments led to broadening the panorama and a more intensive use of shelving equalizers. Listening tests proved that the use of hand gestures produces mixes that are aesthetically as good as those obtained using...

Full text available to download

Examining Acoustic Emission of Engineered Ultrasound Loudspeakers

Publication

- Year 2014

Measurement results of the sound emitted from an ultrasound custom-made system with high spatial directivity are presented. The proposed system is using modulated ultrasound waves which demodulate in nonlinear medium resulting in audible sound. The system is aimed at enhancing the users’ personal audio space, therefore the measurements are performed using the Head and Torso Simulator which provides the realistic reproduction of...

A concept of Signal Equalization Method Based on Music Genre and the Listener's Room Characteristics

Publication

- Year 2016

A research study that investigates the influence of the room acoustics environment on the frequency characteristic of the audio signal playback is presented. First, a novel spectral equalization method of the room acoustic conditions is introduced. On the basis of the frequency response of the room, a system for room acoustics compensation based on eight-band equalizer is proposed. The system settings depend on music genre. In...

Flow Process Models for Pipeline Diagnosis

Publication

- Year 2021

This chapter examines the problem of modeling and parameterization of the transmission pipeline flow process. First, the base model for discrete time is presented, which is a reference for other developed models. Then, the diagonal approximation (AMDA) method is proposed, in which the tridiagonal sub-matrices of the recombination matrix are approximated by their diagonal counterparts, which allows for a simple determination of...

Full text to download in external service

Measurements and Simulations of Engineered Ultrasound Loudspeakers

Publication

- Computational Methods in Science and Technology - Year 2015

Simulation and measurement results of the sound emitted from an ultrasound custom-made system with high spatial directivity are presented. The proposed system is using modulated ultrasound waves which demodulate in nonlinear medium resulting in audible sound. The system is aimed at enhancing the users’ personal audio space, therefore the measurements are performed using the Head and Torso Simulator which provides realistic reproduction...

Full text to download in external service

Quality Aspects in Digital Broadcasting and Webcasting Systems: Bitrate versus Loudness

Publication

- Journal of Telecommunications and Information Technology - Year 2017

In this paper the quality aspects of bitrate and loudness in digital broadcasting and webcasting systems are examined. The authors discuss a survey concerning user preferences related with processing and managing audio content. The coding efficiency of a popular audio format is analyzed in the context of storing media. An objective study on a representative group of signal samples, as well as a subjective study of the perceived...

Full text available to download

Intelligent multimedia solutions supporting special education needs.

Publication

- LECTURE NOTES IN COMPUTER SCIENCE - Year 2011

The role of computers in school education is briefly discussed. Multimodal interfaces development history is shortly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality....

Bimodal deep learning model for subjectively enhanced emotion classification in films

Publication

D. Weber
B. Kostek

- INFORMATION SCIENCES - Year 2024

This research delves into the concept of color grading in film, focusing on how color influences the emotional response of the audience. The study commenced by recalling state-of-the-art works that process audio-video signals and associated emotions by machine learning. Then, assumptions of subjective tests for refining and validating an emotion model for assigning specific emotional labels to selected film excerpts were presented....

Full text to download in external service

Online sound restoration system for digital library applications

Publication

- Year 2013

Audio signal processing algorithms were introduced to the new online non-commercial service for audio restoration intended to enhance the content of digitized audio repositories. Missing or distorted audio samples are predicted using neural networks and a specific implementation of the Jannsen interpolation method based on the autoregressive model (AR) combined with the iterative restoring of missing signal samples. Since the distortion...

Full text to download in external service

Wow defect reduction based on interpolation techniques

Publication

P. Maziewski

- Year 2005

W referacie przedstawiono wyniki badania różnych technik interpolacji wykorzystanych w redukcji kołysania dźwięku. W badaniach użyto: interpolację liniową, dwie techniki interpolacji wielomianowej (Hermite i spline), i technikę sumowania okienkowanych funkcji sink. Jakość rekonstrukcji wykonano wykorzystując sztucznie spreparowany sygnał audio, rekonstruowany wymienionymi metodami interpolacji. Jakość rekonstrukcji oceniono wykorzystując...

Design and implementation principles of FIReWORK ONLINE - the VHDL autogenerator for hardware structures

Publication

R. Smyk

- Year 2013

The paper presents an aspects of remote autogeneration of hardware structures. The solution is an online application, that is running on the server side and allows to design a particular filters and other selected hardware and generate its structure in the form of VHDL, dedicated to FPGA design environments. The paper also addresses the problem of parameterization of algorithms used to generate the hardware structures and current...

Transmitting Alarm Information in DAB+ Broadcasting System

Publication

P. Falkowski-Gilski

- Year 2018

The main goal of digital broadcasting is to deliver high-quality content with the lowest possible bitrate. This paper is focused on transmitting alarm information, such as emergency warning and alerting, in the DAB+ (Digital Audio Broadcasting plus) broadcasting system. These additional services should be available at the lowest possible bitrate, in order to provide a clear and understandable voice message to people. Furthermore, additional...

In uence of Low-Level Features Extracted from Rhythmic and Harmonic Sections on Music Genre Classi cation

Publication

A. Rosner
F. Weninger
B. Schuller
M. Michalak
B. Kostek

- Year 2013

We present a comprehensive evaluation of the infuence of 'harmonic' and rhythmic sections contained in an audio file on automatic music genre classi cation. The study is performed using the ISMIS database composed of music files, which are represented by vectors of acoustic parameters describing low-level music features. Non-negative Matrix Factorization serves for blind separation of instrument components. Rhythmic components...

Online sound restoration system for digital library applications.

Publication

- Journal of the Acoustical Society of America - Year 2013

Audio signal processing algorithms were introduced to the new online non-commercial service for audio restoration intended to enhance the content of digitized audio repositories. Missing or distorted audio samples are predicted using neural networks and a specific implementation of the Jannsen interpolation method based on the autoregressive model (AR) combined with the iterative restoring of missing signal samples. Since the distortion...

Reduction of parasitic pitch variations in archival musical recordings

Publication

- SIGNAL PROCESSING - Year 2010

A new method for reducing parasitic pitch variations in archival audio recordings is presented. The method is intended for analyzing movie soundtracks recorded in optical films. It utilizes image processing for calculating and reducing effects of tape shrinkage being one of the main reasons for parasitic pitch variations in audio accompanying moving images. As long as the film tape characteristics are known the new method can be...

Full text available to download

Building Knowledge for the Purpose of Lip Speech Identification

Publication

- Advances in Intelligent Systems and Computing - Year 2017

Consecutive stages of building knowledge for automatic lip speech identification are shown in this study. The main objective is to prepare audio-visual material for phonetic analysis and transcription. First, approximately 260 sentences of natural English were prepared taking into account the frequencies of occurrence of all English phonemes. Five native speakers from different countries read the selected sentences in front of...

Full text to download in external service

Fitting the mobile device characteristics to the user's hearing preferences

Publication

- Year 2014

A method for fitting the mobile computer audio characteristics to the user's hearing preferences is proposed. The process consists of two stages: calibration and dynamics processing. During the calibration phase the user performs a loudness scaling test giving their response regarding the perceived loudness. The dynamics processing made on above basis sets the loudness to the most comfortable level. The processing accounts both...

Full text to download in external service

Data, Information, Knowledge, Wisdom Pyramid Concept Revisited in the Context of Deep Learning

Publication

B. Kostek

- Year 2023

In this paper, the data, information, knowledge, and wisdom (DIKW) pyramid is revisited in the context of deep learning applied to machine learningbased audio signal processing. A discussion on the DIKW schema is carried out, resulting in a proposal that may supplement the original concept. Parallels between DIWK pertaining to audio processing are presented based on examples of the case studies performed by the author and her collaborators....

Full text to download in external service

Comparison of perforator location in dynamic and static thermographic imaging with Doppler ultrasound in breast reconstruction surgery

Publication

S. Kołacz
M. Moderhak
J. Jankau

- Year 2016

This paper co mpares the effectiveness of the dTnorm and t90_10 parametrizations in dynamic thermography for imaging location of perforators in TRAM flaps in the intraoperative period. The results were compared with the location detected in a Doppler ultrasound examination. Cold and heat stimulation was used in dynamic thermography. Additionally, these results were compared with static...

Full text to download in external service

Real and imaginary motion classification based on rough set analysis of EEG signals for multimedia applications

Publication

P. Szczuko

- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2017

Rough set-based approach to the classification of EEG signals of real and imaginary motion is presented. The pre-processing and signal parametrization procedures are described, the rough set theory is briefly introduced, and several classification scenarios and parameters selection methods are proposed. Classification results are provided and discussed with their potential utilization for multimedia applications controlled by the...

Full text available to download

Report of the ISMIS 2011 Contest : Music Information Retrieval

Publication

B. Kostek
A. Kupryjanow
P. Żwan
W. Jiang
Z. W. Raś
M. Wojnarski
J. Świetlicka

- Year 2011

This report presents an overview of the data mining contestorganized in conjunction with the 19th International Symposiumon Methodologies for Intelligent Systems (ISMIS 2011), in days betweenJan 10 and Mar 21, 2011, on TunedIT competition platform. The contestconsisted of two independent tasks, both related to music information retrieval:recognition of music genres and recognition of instruments, for agiven music sample represented...

Postprodukcja nagrania wideo z dzwiekiem dookolnym

Publication

- Year 2009

One of the aims of this paper is to present issues related to audio-video correlation. This is presented on the basis of a short film realization employing surround microphone techniques. First, some related works in the domain of sound and vision correlation are presented. Then assumptions concerning scene creation related to both audio and video are shortly described. Another objective is to discuss results of subjective tests...

1D convolutional context-aware architectures for acoustic sensing and recognition of passing vehicle type

Publication

- Year 2020

A network architecture that may be employed to sensing and recognition of a type of vehicle on the basis of audio recordings made in the proximity of a road is proposed in the paper. The analyzed road traffic consists of both passenger cars and heavier vehicles. Excerpts from recordings that do not contain vehicles passing sounds are also taken into account and marked as ones containing silence....

Evaluation of a Novel Approach to Virtual Bass Synthesis Strategy

Publication

- Year 2015

The aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) strategy applied to portable computers. The developed algorithms involve intelligent, rule-based settings of bass synthesis parameters with regard to music genre of an audio excerpt and the type of a portable device in use. The Smart VBS algorithm performs the synthesis based on a nonlinear device (NLD) with artificial controlling synthesis...

Full text to download in external service

Musical Instrument Tagging Using Data Augmentation and Effective Noisy Data Processing

Publication

- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2020

Developing signal processing methods to extract information automatically has potential in several applications, for example searching for multimedia based on its audio content, making context-aware mobile applications (e.g., tuning apps), or pre-processing for an automatic mixing system. However, the last-mentioned application needs a significant amount of research to reliably recognize real musical instruments in recordings....

Full text available to download

Physics-Based Coarse-Grained Modeling in Bio- and Nanochemistry

Publication

A. Liwo
A. K. Sieradzan
A. S. Karczyńska
E. Lubecka
S. A. Samsonov
C. Czaplewski
P. Krupa
M. Mozolewska

- Year 2021

Coarse-grained approaches, in which groups of atoms are represented by single interaction sites, are very important in biological and materials sciences because they enable us to cover the size- and time-scales by several orders of magnitude larger than those available all-atom simulations, while largely keeping the details of the systems studied. The coarse-grained approaches differ by the scheme of reduction and by the origin...

Full text to download in external service

Characterizing the Performance of <span class="sc">xor</span> Games and the Shannon Capacity of Graphs

Publication

R. Ramanathan
A. Kay
G. Murta
P. Horodecki

- PHYSICAL REVIEW LETTERS - Year 2014

In this Letter we give a set of necessary and sufficient conditions such that quantum players of a two-party xor game cannot perform any better than classical players. With any such game, we associate a graph and examine its zero-error communication capacity. This allows us to specify a broad new class of graphs for which the Shannon capacity can be calculated. The conditions also enable the parametrization of new families of games...

Full text to download in external service

Music genre classification applied to bass enhancement for mobile technology

Publication

- Elektronika : konstrukcje, technologie, zastosowania - Year 2015

The aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The proposed algorithm is related to intelligent, rule-based setting of synthesis parameters according to music genre of an audio excerpt. The classification of music genres is automatically executed employing MPEG 7 parameters and the Principal Component Analysis method applied to reduce information...

Full text to download in external service

Machine learning applied to acoustic-based road traffic monitoring

Publication

- Procedia Computer Science - Year 2022

The motivation behind this study lies in adapting acoustic noise monitoring systems for road traffic monitoring for driver’s safety. Such a system should recognize a vehicle type and weather-related pavement conditions based on the audio level measurement. The study presents the effectiveness of the selected machine learning algorithms in acoustic-based road traffic monitoring. Bases of the operation of the acoustic road traffic...

Full text available to download

Machine learning applied to acoustic-based road traffic monitoring

Publication

- Year 2022

The motivation behind this study lies in adapting acoustic noise monitoring systems for road traffic monitoring for driver’s safety. Such a system should recognize a vehicle type and weather-related pavement conditions based on the audio level measurement. The study presents the effectiveness of the selected machine learning algorithms in acoustic-based road traffic monitoring. Bases of the operation of the acoustic road traffic...

Full text available to download

Impact of maintenance of floodplains of the Vistula River on high water levels on the section from Włocławek to Toruń

Publication

- Acta Energetica - Year 2013

This article describes the methodology of hydraulic calculations to estimate the water levels in open channels for steady gradually varied flow. The presented method has been used to analyse the water level on the Vistula River from Włocławek cross-section to Toruń cross-section. The HEC-RAS modelling system has been used for parameterization of the river channel and floodplains, as well as for flow simulation. The results obtained...

Full text available to download

Theoretical calculation of the physico-chemical properties of 1-butyl-4-methylpyridinium based ionic liquids

Publication

A. Giełdoń
M. Bobrowski
A. Bielicka-giełdoń
C. Czaplewski

- JOURNAL OF MOLECULAR LIQUIDS - Year 2017

ACCEPTED MAIonic liquids (ILs) have attracted much attention for their unique physicochemical properties, which can be designed as needed by altering the ion combinations. Besides experimental work, numerous computational studies have been concerned with prediction of physical properties of ILs. The results of molecular dynamics simulations of ILs depend strongly on the proper force field parameterization. Classical force fields...

Full text available to download

Adaptive Personal Tuning of Sound in Mobile Computers

Publication

- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2016

An integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of their acoustic track to changing acoustic conditions of the environment and to users’ individual preferences. Signal processing algorithms are introduced that concern: linearization of frequency response, dialogue intelligibility enhancement, and dynamics processing tuned up to the users’...

Full text available to download

Search

Filters

Catalog

Category

Year

Options

Search results for: audio parametrization