Search results for: MPEG 7 LOW-LEVEL AUDIO DESCRIPTORS

MPEG-7-based low level descriptor effectiveness in the automatic musical sound classification.

Publication

- Year 2004

The aim of the paper is to determine which MPEG-7 descriptors are most useful for classifying musical instrument sounds. The pitch of the sound is determined first, and then the values of the parameters defined in the MPEG-7 standard are calculated. The resulting parameter vector undergoes statistical analysis to eliminate redundant data. For automatic classification and testing, two systems were designed...

Ranking Speech Features for Their Usage in Singing Emotion Classification

Publication

- Year 2020

This paper aims to retrieve speech descriptors that may be useful for the classification of emotions in singing. For this purpose, Mel Frequency Cepstral Coefficients (MFCC) and selected Low-Level MPEG 7 descriptors were calculated based on the RAVDESS dataset. The database contains recordings of emotional speech and singing of professional actors presenting six different emotions. Employing the algorithm of Feature Selection based...

Full text available to download

Comparison of sound of organ pipes in contemporary and historical instruments

Publication

- Year 2020

The aim of this research is to examine the differences in the timbre of organ pipes’ sound between a historical and a contemporary organ instrument. The historical instrument is the Oliwa organ from Gdansk, Poland, and the contemporary one is from Kartuzy, Poland. Recordings are made of single notes played by an open labial pipe that belongs to the Principal rank. The analyses and comparison of several sound features compatible...

Full text to download in external service

Processing of musical data employing rough sets and artificial neural networks

Publication

- Year 2004

The article describes the assumptions of a system for automatic identification of music and musical sounds. The MPEG-7 standard is reviewed, with particular emphasis on audio descriptors. Problems of audio data analysis related to applications using MPEG-7 are discussed. Based on experiments, the effectiveness of low-level descriptors in the automatic recognition of musical instrument sounds is presented. The article also discusses...

Processing of musical data employing rough sets and artificial neural networks

Publication

- Year 2005

The article describes the assumptions of a system for automatic identification of music and musical sounds. The MPEG-7 standard is reviewed, with particular emphasis on audio descriptors. Problems of audio data analysis related to applications using MPEG-7 are discussed. Based on experiments, the effectiveness of low-level descriptors in the automatic recognition of musical instrument sounds is presented. The article also discusses...

Verification of the Parameterization Methods in the Context of Automatic Recognition of Sounds Related to Danger

Publication

- Journal of Digital Forensic Practice - Year 2010

The article describes an application that automatically detects sound events such as breaking glass, gunshots, explosions, and screams. The described system consists of a parameterization block and a classifier. The article compares parameters dedicated to this application with standard MPEG-7 descriptors. Two classifiers are also compared: one based on a perceptron (neural networks) and the other on a Support Vector Machine...

Full text to download in external service

Music genre classification applied to bass enhancement for mobile technology

Publication

- Elektronika : konstrukcje, technologie, zastosowania - Year 2015

The aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The proposed algorithm is related to intelligent, rule-based setting of synthesis parameters according to music genre of an audio excerpt. The classification of music genres is automatically executed employing MPEG 7 parameters and the Principal Component Analysis method applied to reduce information...

Full text to download in external service

An Approach to Bass Enhancement in Portable Computers Employing Smart Virtual Bass Synthesis Algorithms

Publication

- Year 2014

The aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The developed algorithms are related to intelligent, rule-based setting of synthesis parameters according to music genre of an audio excerpt and to the type of a portable device in use. To find optimum synthesis parameters of the VBS algorithms, subjective listening tests based on a parametric procedure...

Full text to download in external service

Improving automatic surveillance by sound analysis

Publication

- Year 2010

An automatic surveillance system based on event detection in the video image can be improved by implementing algorithms for audio analysis. Dangerous or illegal actions are often connected with distinctive sound events like screams or sudden bursts of energy. A method for detection and classification of alarming sound events is presented. Detection is based on the observation of sudden changes in sound level in distinctive sub-bands...

Evaluation of a Novel Approach to Virtual Bass Synthesis Strategy

Publication

- Year 2015

The aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) strategy applied to portable computers. The developed algorithms involve intelligent, rule-based settings of bass synthesis parameters with regard to music genre of an audio excerpt and the type of a portable device in use. The Smart VBS algorithm performs the synthesis based on a nonlinear device (NLD) with artificial controlling synthesis...

Full text to download in external service

MACHINE LEARNING–BASED ANALYSIS OF ENGLISH LATERAL ALLOPHONES

Publication

M. Piotrowska
G. Korvel
B. Kostek
T. Ciszewski
A. Czyżewski

- International Journal of Applied Mathematics and Computer Science - Year 2019

Automatic classification methods, such as artificial neural networks (ANNs), the k-nearest neighbor (kNN) and self-organizing maps (SOMs), are applied to allophone analysis based on recorded speech. A list of 650 words was created for that purpose, containing positionally and/or contextually conditioned allophones. For each word, a group of 16 native and non-native speakers were audio-video recorded, from which seven native speakers’...

Full text available to download

Smart Virtual Bass Synthesis Algorithm Based on Music Genre Classification

Publication

- Year 2014

The aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The proposed algorithm employed automatic music genre recognition to determine the optimum parameters for the synthesis of additional frequencies. The synthesis was carried out using the non-linear device (NLD) and phase vocoder (PV) methods depending on the music excerpt genre. Classification of musical...

TRANSPORT POSSIBILITY FOR MPEG-4/AVC- AND MPEG-2-ENCODED VIDEO DATA IN IPTV: A COMPARISON STUDY

Publication

T. Uhl
S. Paulsen
K. Nowicki

- Year 2013

IPTV (Television over IP) is a modern service with a great potential to expand. It uses the IP transport platform, that is already in worldwide operation. At the time of writing, two techniques are used to transport the video and audio data of IPTV: MPEG-2 TS and Native RTP. The two techniques quite definitely have an influence on both quality of service (QoS) and quality of experience (QoE). This paper sets out to demonstrate...

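ANALIZA PARAMETRÓW SYGNAŁU MOWY W KONTEKŚCIE ICH PRZYDATNOŚCI W AUTOMATYCZNEJ OCENIE JAKOŚCI EKSPRESJI ŚPIEWU = Analysis of speech signal parameters in the context of their usefulness in the automatic assessment of singing expression quality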

Publication

- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Year 2019

The work concerns an approach to parameterization for classifying emotions in singing and its comparison with emotion classification in speech. For this purpose, the RAVDESS database (Ryerson Audio-Visual Database of Emotional Speech and Song) of emotionally expressive speech and singing was used, containing recordings of professional actors presenting six different emotions. Next, the authors calculated mel-frequency cepstral coefficients (MFCC) and selected descriptors...

Full text available to download

Badanie efektywności kodeków źródłowych w radiofonii cyfrowej DAB+ = A study of source codec efficiency in DAB+ digital radio

Publication

- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Year 2015

In Poland, digital radio has been available to listeners since 2013. However, there is a lack of publicly available scientific publications or research reports justifying the bitrates adopted for audio streams. The article presents a study of the coding efficiency and subjective quality assessment of the MPEG-4 HE-AAC v2 codec used in the DAB+ standard. The tests were carried out using the MUSHRA comparative technique on two groups,...

Full text to download in external service

Quality Analysis of Audio-Video Transmission in an OFDM-Based Communication System

Publication

M. Zamłyńska
G. Debita
P. Falkowski-Gilski

- Year 2022

Application of a reliable audio-video communication system brings many advantages. With the spoken word we can exchange ideas, provide descriptive information, as well as aid another person. With the availability of visual information one can monitor the surroundings, working environment, etc. As the amount of available bandwidth continues to shrink, researchers focus on novel types of transmission. Currently, orthogonal frequency...

Full text to download in external service

Rough Sets Applied to Mood of Music Recognition

Publication

- Year 2016

With the growth of accessible digital music libraries over the past decade, there is a need for research into automated systems for searching, organizing and recommending music. Mood of music is considered one of the most intuitive criteria for listeners, thus this work is focused on the emotional content of music and its automatic recognition. The research study presented in this work contains an attempt at music emotion recognition...

Classification of Music Genres Based on Music Separation into Harmonic and Drum Components . Klasyfikacja gatunków muzycznych wykorzystująca separację instrumentów muzycznych

Publication

A. Rosner
B. Schuller
B. Kostek

- Archives of Acoustics - Year 2014

This article presents a study on music genre classification based on music separation into harmonic and drum components. For this purpose, audio signal separation is executed to extend the overall vector of parameters by new descriptors extracted from harmonic and/or drum music content. The study is performed using the ISMIS database of music files represented by vectors of parameters containing music features. The Support Vector...

Full text available to download

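Eksperymentalna weryfikacja przydatności wybranych parametrów standardu MPEG-7 w procesie klasyfikacji dźwięków instrumentów muzycznych = Experimental verification of the usefulness of selected MPEG-7 parameters in the classification of musical instrument sounds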

Publication

P. Żwan

- Prace Naukowe Instytutu Telekomunikacji i Akustyki Politechniki Wrocławskiej. Konferencje - Year 2003

Current methods of music information retrieval on the Internet rely on a parametric description of multimedia content. The audio part of the MPEG-7 standard provides a description based largely on spectral analysis; for musical sounds, the FFT spectrum of a quasi-steady-state segment is parameterized.

MPEG-7 jako format zapisu cyfrowych dokumentów muzycznych = MPEG-7 as a storage format for digital music documents

Publication

W. Szwoch

- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Year 2006

The paper addresses the problem of storing various forms of music in digital music documents. A general description of the MPEG-7 format is presented. The possibilities arising from using MPEG-7 for the digital representation of music documents are shown. Attention is drawn to the possibility of extending the standard to adapt it to storing music documents that contain complete musical information.

Speech Analytics Based on Machine Learning

Publication

- Year 2019

In this chapter, the process of speech data preparation for machine learning is discussed in detail. Examples of speech analytics methods applied to phonemes and allophones are shown. Further, an approach to automatic phoneme recognition involving optimized parametrization and a classifier belonging to machine learning algorithms is discussed. Feature vectors are built on the basis of descriptors coming from the music information...

Full text to download in external service

Low-Level Music Feature Vectors Embedded as Watermarks

Publication

- Year 2013

In this paper a method consisting in embedding low-level music feature vectors as watermarks into a musical signal is proposed. First, a review of some recent watermarking techniques and the main goals of development of digital watermarking research are provided. Then, a short overview of parameterization employed in the area of Music Information Retrieval is given. A methodology of non-blind watermarking applied to music-content...

Full text to download in external service

Enhanced method of DS-CDMA low level signals detection

Publication

R. Katulski
K. Bronk
J. Stefański
R. Studański
R. Wąs

- POLISH JOURNAL OF ENVIRONMENTAL STUDIES - Year 2009

The following article comprises three main parts. The first one generally describes two methods of low-level signal detection that are of interest in this study. The signal spectrum averaging technique is shown as well as the method exploiting the averaged spectrum of the signal raised to the power of 2n (n ∈ N). Additionally, this section briefly presents proposed enhancements and modifications of these two solutions, which allow...

Influence of Low-Level Features Extracted from Rhythmic and Harmonic Sections on Music Genre Classification

Publication

A. Rosner
F. Weninger
B. Schuller
M. Michalak
B. Kostek

- Year 2013

We present a comprehensive evaluation of the influence of 'harmonic' and rhythmic sections contained in an audio file on automatic music genre classification. The study is performed using the ISMIS database composed of music files, which are represented by vectors of acoustic parameters describing low-level music features. Non-negative Matrix Factorization serves for blind separation of instrument components. Rhythmic components...

Airborne Laser Scanning Point Cloud Update by Used of the Terrestrial Laser Scanning and the Low-Level Aerial Photogrammetry

Publication

- Year 2017

Laser scanning technology is a spatial information gathering technique that is commonly used all over the world. Systems in which a red-light beam is used are divided into terrestrial, mobile and airborne scanning systems. The main differences between them are the accuracy, the data acquisition solution (e.g., ALS and MLS require an inertial navigation system in addition to the laser scanner) and the covered area in one...

Full text available to download

Low-Level Aerial Photogrammetry as a Source of Supplementary Data for ALS Measurements

Publication

K. Bobkowska
A. Inglot
M. Przyborski
J. Sieniakowski
P. Tysiac
P. Tysiąc

- Year 2017

Full text to download in external service

Low-Level Aerial Photogrammetry as a Source of Supplementary Data for ALS Measurements

Publication

- Year 2017

The development of airborne laser scanning (ALS) technology allows high-resolution measurements of large areas, resulting in a significant reduction of costs. The main stakeholder in height data obtained from airborne laser scanning is the state administration. State institutions take part in projects such as ISOK. Each point is classified in accordance with the LAS 1.2 standard; our research focuses on class 6 -...

Full text available to download

The time spent sitting does not always mean a low level of physical activity

Publication

E. Matusiak-Wieczorek
A. Lipert
E. Kochan
A. Jegier

- BMC PUBLIC HEALTH - Year 2020

Full text to download in external service

The effect of cabbage juice and its active components on the protein level and expression of CYP19 in MCF-7 breast cancer cells

Publication

H. Szaefer
B. Licznerska
V. Krajka-Kuźniak
A. Bartoszek-Pączkowska
W. Bear-Dubowska

- Acta Biochimica Polonica - Year 2009

It is known that bioactive components of cabbage have a chemopreventive effect in breast cancer, partly because they inhibit the activity of aromatase, an enzyme involved in estrogen biosynthesis. The study was conducted on breast cancer cells and found that cabbage juices, from both organic and conventional cultivation, significantly inhibited the expression of the CYP19 gene encoding this enzyme. This suggests...

Full text to download in external service

Factors determining accumulation of bisphenol A and alkylphenols at a low trophic level as exemplified by mussels Mytilus trossulus

Publication

M. Staniszewska
B. Graca
A. Sokołowski
I. Nehring
A. Wasik
A. Jendzul

- ENVIRONMENTAL POLLUTION - Year 2017

The aim of the study was to investigate abiotic and biotic factors influencing the accumulation of endocrine disrupting compounds (EDCs) such as bisphenol A (BPA), 4-tert-octylphenol (OP) and 4-nonylphenol (NP) in mussels Mytilus trossulus from the Gulf of Gdansk (Southern Baltic). The key abiotic factor influencing BPA, OP and NP accumulation in mussels is their hydrophilicity/lipophilicity, which affects their main assimilation...

Full text to download in external service

Products of Vitamin D3 or 7-Dehydrocholesterol Metabolism by Cytochrome P450scc Show Anti-Leukemia Effects, Having Low or Absent Calcemic Activity

Publication

A. Slominski
Z. Janjetovic
B. Fuller
M. Zmijewski
R. Tuckey
M. Nguyen
T. Sweatman
W. Li
J. Zjawiony
D. Miller... and 3 others

- PLOS ONE - Year 2010

Full text to download in external service

Protective impact of extract from Aronia melanocarpa berries against low-level exposure to cadmium-induced liver damage: a study in a rat model

Publication

M. Mezynska
M. Tomczyk
J. Rogalska
B. Pilat-Marcinkiewicz
M. Galazyn-Sidoczuk
M. Brzoska

- PLANTA MEDICA - Year 2016

Full text to download in external service

Protective impact of extract from Aronia melanocarpa berries against low-level exposure to cadmium-induced lipid peroxidation in the bone tissue: a study in a rat model

Publication

M. Brzóska
M. Tomczyk
J. Rogalska
M. Galazyn-Sidorczuk
M. Jurczuk
A. Roszczenko

- PLANTA MEDICA - Year 2015

Full text to download in external service

Wykorzystanie bezzałogowych aparatów latających (mini śmigłowców) do wykonywania fotogrametrycznych zdjęć lotniczych z niskich pułapów = The use of unmanned aerial vehicles (mini helicopters) in photogrammetry from low level

Publication

B. Szczechowski

- Archiwum Fotogrametrii, Kartografii i Teledetekcji - Year 2008

The article presents the assumptions behind using an unmanned aerial vehicle to take photogrammetric images from low altitudes.

Full text available to download

Cyfrowa biblioteka dokumentów muzycznych = Digital library of music documents

Publication

M. Szwoch

- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Year 2006

The paper presents the main goal of the Moniuszko research project, which covers the design and implementation of an interactive digital library of music documents. It also presents the concept of a digital music document and identifies the MPEG-7 standard as best meeting the requirements for describing such documents. An extension of this standard is proposed to allow a full bibliographic description of a music document, as well as...

System rozpoznawania dźwięków instrumentów muzycznych. = A system for recognizing musical instrument sounds.

Publication

- Year 2004

This paper presents the operation of a system for automatic recognition of single musical instrument sounds. The system consists of three blocks: fundamental frequency detection, sound parameterization, and classification. The detection block uses a modified Schroeder algorithm. Parameterization is based mainly on parameters defined in the MPEG-7 standard. For the system, the authors implemented...

Creating a Reliable Music Discovery and Recommendation System

Publication

- Year 2014

The aim of this paper is to show problems related to creating a reliable music discovery system. The SYNAT database that contains audio files is used for the purpose of experiments. The files are divided into 22 classes corresponding to music genres with different cardinality. Of utmost importance for a reliable music recommendation system are the assignment of audio files to their appropriate genres and optimum parameterization...

Full text to download in external service

Testing Watermark Robustness against Application of Audio Restoration Algorithms

Publication

- Year 2013

The purpose of this study was to test to what extent watermarks embedded in distorted audio signals are immune to the operation of audio restoration algorithms. Several restoration routines such as noise reduction, spectrum expansion, clipping or click reduction were applied in the online website system. The online service was extended with some copyright protection mechanisms proposed by the authors. They contain low-level music features...

Full text to download in external service

Music Recommendation System

Publication

- Journal of Telecommunications and Information Technology - Year 2014

The paper focuses on optimizing the content of the feature vector for the music recommendation system. For the purpose of experiments a database is created consisting of excerpts of music files. They are assigned to 22 classes corresponding to different music genres. Various feature vectors based on low-level signal descriptors are tested and then optimized using correlation analysis and Principal Component Analysis (PCA). Results of the experiments...

Full text available to download

Online sound restoration system for digital library applications

Publication

- Year 2013

Audio signal processing algorithms were introduced to the new online non-commercial service for audio restoration intended to enhance the content of digitized audio repositories. Missing or distorted audio samples are predicted using neural networks and a specific implementation of the Janssen interpolation method based on the autoregressive model (AR) combined with the iterative restoring of missing signal samples. Since the distortion...

Full text to download in external service

Formaty zapisu cyfrowych dokumentów muzycznych = Storage formats for digital music documents

Publication

- Year 2005

The paper addresses the problem of storing the various forms of music found in digital music documents. It gives a general overview of existing music data storage formats and describes selected digital music formats. It also proposes creating a universal format for describing music data based on the existing MPEG-7 standard.

Online sound restoration system for digital library applications.

Publication

- Journal of the Acoustical Society of America - Year 2013

Audio signal processing algorithms were introduced to the new online non-commercial service for audio restoration intended to enhance the content of digitized audio repositories. Missing or distorted audio samples are predicted using neural networks and a specific implementation of the Janssen interpolation method based on the autoregressive model (AR) combined with the iterative restoring of missing signal samples. Since the distortion...

Dangerous sound event recognition using Support Vector Machine classifiers

Publication

- Year 2010

A method of recognizing events connected to danger based on their acoustic representation through Support Vector Machine classification is presented. The method proposed is particularly useful in an automatic surveillance system. The set of 28 parameters used in the classifier consists of dedicated parameters and MPEG-7 features. Methods for parameter calculation are presented, as well as a design of SVM model used for classification....

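Skuteczność klasyfikacji gatunków muzycznych za pomocą sieci neuronowej w zależności od typu danych wejściowych = Effectiveness of music genre classification with a neural network depending on the type of input data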

Publication

- Year 2021

Music genre recognition is one of the basic elements of intelligent systems for automatic playlist generation. Streaming platforms offering such a service need solutions that determine a track's genre as accurately as possible. According to the current state of knowledge, the most effective classifiers are artificial neural networks (including deep learning variants), for which...

Full text to download in external service

Further Developments of the Online Sound Restoration System for Digital Library Applications

Publication

- Year 2014

New signal processing algorithms were introduced to the online service for audio restoration available at the web address: www.youarchive.net. Missing or distorted audio samples are estimated using a specific implementation of the Janssen interpolation method. The algorithm is based on the autoregressive model (AR) combined with the iterative complementation of signal samples. Since the interpolation algorithm is computationally...

Full text to download in external service

Exploiting audio-visual correlation by means of gaze tracking

Publication

- International Journal of Computer Science and Applications - Year 2010

This paper presents a novel means for increasing the reliability of audio-visual correlation analysis. This is done based on gaze tracking technology engineered at the Multimedia Systems Department of the Gdansk University of Technology, Poland. In the paper, the history of and current research in the area of audio-visual perception analysis are briefly reviewed. Then the methodology employing gaze tracking is presented along with the...

Full text available to download

The data exchange between smart glasses and healthcare information systems using the HL7 FHIR standard

Publication

J. Rumiński
A. Bujnowski
T. Kocejko
A. Andrushevich
M. Biallas
R. Kistler

- Year 2016

In this study we evaluated system architecture for the use of smart glasses as a viewer of information, as a source of medical data (vital sign measurements: temperature, pulse rate, and respiration rate), and as a filter of healthcare information. All activities were based on patient/device identification procedures using graphical markers or features based on visual appearance. The architecture and particular use cases were implemented...

Full text to download in external service

Multibeam sonar data processing for seafloor characterisation

Publication

Z. Łubniewski

- Year 2011

An approach to seafloor characterisation was investigated. It relies on calculation of several descriptors (parameters) related to seabed type using three types of multibeam sonar data obtained during seafloor sensing: 1) the grey-level sonar images of seabed, 2) the 3D model of the seabed surface which consists of (x, y, z) points, 3) the set of time domain echo envelopes corresponding to several beams. The proposed method has...

Combined method of multibeam sonar signal processing and image analysis for seafloor classification

Publication

- Year 2011

The combined approach to seafloor characterisation was investigated. It relies on calculation of several descriptors (parameters) related to seabed type using three types of multibeam sonar data obtained during seafloor sensing: 1) the grey-level sonar images (echograms) of seabed, 2) the 3D model of the seabed surface which consists of bathymetric data, 3) the set of time domain bottom echo envelopes received in the consecutive...

Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention

Publication

D. Korzekwa
R. Barra-Chicote
S. Zaporowski
G. Beringer
J. Lorenzo-trueba
A. Serafinowicz
J. Droppo
T. Drugman
B. Kostek

- Year 2021

This paper describes two novel complementary techniques that improve the detection of lexical stress errors in non-native (L2) English speech: attention-based feature extraction and data augmentation based on Neural Text-To-Speech (TTS). In a classical approach, audio features are usually extracted from fixed regions of speech such as the syllable nucleus. We propose an attention-based deep learning model that automatically de...

Full text available to download
