Wyniki wyszukiwania dla: BIMODAL SPEECH RECOGNITION - MOST Wiedzy

Wyszukiwarka

Wyniki wyszukiwania dla: BIMODAL SPEECH RECOGNITION

Wyniki wyszukiwania dla: BIMODAL SPEECH RECOGNITION

  • Magdalena Szuflita-Żurawska

    Magdalena Szuflita-Żurawska jest kierownikiem Sekcji Informacji Naukowo-Technicznej na Politechnice Gdańskiej oraz Liderem Centrum Kompetencji Otwartej Nauki przy Bibliotece Politechniki Gdańskiej. Jej główne zainteresowania badawcze koncentrują się w obszarze komunikacji naukowej oraz otwartych danych badawczych, a także motywacji i produktywności naukowej. Jest odpowiedzialna między innymi za prowadzenie szkoleń dla pracowników...

  • Chirp Rate and Instantaneous Frequency Estimation: Application to Recursive Vertical Synchrosqueezing

    Publikacja

    - IEEE SIGNAL PROCESSING LETTERS - Rok 2017

    This letter introduces new chirp rate and instantaneous frequency estimators designed for frequency-modulated signals. These estimators are first investigated from a deterministic point of view, then compared together in terms of statistical efficiency. They are also used to design new recursive versions of the vertically synchrosqueezed short-time Fourier transform, using a previously published method (D. Fourer, F. Auger, and...

    Pełny tekst do pobrania w portalu

  • Distributed Representations Based on Geometric Algebra: the Continuous Model

    Publikacja

    - Informatica - Rok 2011

    Authors revise the concept of a distributed representation of data as well as two previously developed models: Holographic Reduced Representation (HRR) and Binary Spatter Codes (BSC). A Geometric Analogue (GAc - ''c'' stands for continuous as opposed to its discrete version) of HRR is introduced - it employs role-filler binding based on geometric products. Atomic objects are real-valued vectors in n-dimensional Euclidean space...

    Pełny tekst do pobrania w portalu

  • Validating data acquired with experimental multimodal biometric system installed in bank branches

    An experimental system was engineered and implemented in 100 copies inside a real banking environment comprising: dynamic handwritten signature verification, face recognition, bank client voice recognition and hand vein distribution verification. The main purpose of the presented research was to analyze questionnaire responses reflecting user opinions on: comfort, ergonomics, intuitiveness and other aspects of the biometric enrollment...

    Pełny tekst do pobrania w portalu

  • Improving the Accuracy in Sentiment Classification in the Light of Modelling the Latent Semantic Relations

    Publikacja

    - Information - Rok 2018

    The research presents the methodology of improving the accuracy in sentiment classification in the light of modelling the latent semantic relations (LSR). The objective of this methodology is to find ways of eliminating the limitations of the discriminant and probabilistic methods for LSR revealing and customizing the sentiment classification process (SCP) to the more accurate recognition of text tonality. This objective was achieved...

    Pełny tekst do pobrania w portalu

  • Comparison of Methods for Real and Imaginary Motion Classification from EEG Signals

    Publikacja

    A method for feature extraction and results of classification of EEG signals obtained from performed and imagined motion are presented. A set of 615 features was obtained to serve for the recognition of type and laterality of motion using 8 different classifications approaches. A comparison of achieved classifiers accuracy is presented in the paper, and then conclusions and discussion are provided. Among applied algorithms the...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • AUGMENTATION OF THE CRITICAL HEAT FLUX IN WATER-Al2O3, WATER-TiO2 AND WATER-Cu NANOFLUIDS

    Publikacja

    The main aim of the proposed study is therefore recognition of the phenomena accompanying nucleate boiling crisis of selected nanofluids during boiling on horizontal tubes of various outside diameters. Of particular interest is impact of contact angle and tube diameter on the value of critical heat flux. The results obtained should give more light on the nature of nucleate boiling crisis and will serve as a basis for future theoretical...

  • Activated Sludge Process Development

    Publikacja

    - Rok 2014

    This paper summarizes the most significant steps in the activated sludge process development and recognizes key contributors. Recognition of the roles of oxygen and living organisms was the first step (1882-1914). Ardern and Lockett (1914) named the accumulated olids "activated sludge". The process was rapidly accepted and applied in the period 1914-1930. The most dramatic changes in the activated sludge process understanding and...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Biometryczna kontrola dostępu

    Opisano szczegółowo algorytm detekcji oraz identyfikacji człowieka na podstawie punktów nodalnych twarzy. Zdefiniowano pojęcia: biometria, proces pomiaru biometrycznego, metody biometrycznej identyfikacji oraz kontrola dostępu. Przedstawiono opis opracowanego systemu biometrycznej identyfikacji wykorzystującego sztuczne sieci neuronowe. Podano wyniki badań oraz przeprowadzono ich wnikliwą dyskusję.Biometrics is the study of automated...

  • Determining Pronunciation Differences in English Allophones Utilizing Audio Signal Parameterization

    Publikacja

    - Rok 2017

    An allophonic description of English plosive consonants, based on audio-visual recordings of 600 specially selected words, was developed. First, several speakers were recorded while reading words from a teleprompter. Then, every word was played back from the previously recorded sample read by a phonology expert and each examined speaker repeated a particular word trying to imitate correct pronunciation. The next step consisted...

  • Ultrawideband transmission in physical channels: a broadband interference view

    The superposition of multipath components (MPC) of an emitted wave, formed by reflections from limiting surfaces and obstacles in the propagation area, strongly affects communication signals. In the case of modern wideband systems, the effect should be seen as a broadband counterpart of classical interference which is the cause of fading in narrowband systems. This paper shows that in wideband communications, the time- and frequency-domain...

    Pełny tekst do pobrania w portalu

  • Robust Object Detection with Multi-input Multi-output Faster R-CNN

    Publikacja

    Recent years have seen impressive progress in visual recognition on many benchmarks, however, generalization to the out-of-distribution setting remains a significant challenge. A state-of-the-art method for robust visual recognition is model ensembling. However, recently it was shown that similarly competitive results could be achieved with a much smaller cost, by using multi-input multi-output architecture (MIMO). In this work,...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Robust Object Detection with Multi-input Multi-output Faster R-CNN

    Publikacja

    Recent years have seen impressive progress in visual recognition on many benchmarks, however, generalization to the out-of-distribution setting remains a significant challenge. A state-of-the-art method for robust visual recognition is model ensembling. However, recently it was shown that similarly competitive results could be achieved with a much smaller cost, by using multi-input multi-output architecture (MIMO). In this work,...

    Pełny tekst do pobrania w portalu

  • Towards Contactless, Hand Gestures-Based Control of Devices

    Publikacja

    Gesture-based intuitive interactions with electronic devices can be an important part of smart home systems. In this paper, we adapt the contactless linear gesture sensor for the navigation of smart lighting system. Set of handled gestures allow to propose two methods of active light source selection, continuous dimming, and turning on and off based on discrete gestures. The average gesture recognition accuracy was 97.58% in the...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Using MusicXML to evaluate accuracy of OMR Systems

    Publikacja

    - Rok 2008

    In this paper a methodology for automatic accuracy evaluation in optical music recognition (OMR) applications is proposed. Presented approach assumes using ground truth images together with digital music scores describing their content. The automatic evaluation algorithm measures differences between the tested score and the reference one, both stored in MusicXML format. Some preliminary test results of this approach are presented...

  • Affective reactions to playing digital games

    Publikacja

    The paper presents a study of emotional states during a gameplay. An experiment of two-player Tetris game is reported, followed by the analysis of the results - self-reported emotional states as well as physiological signals measurements interpretation. The study reveals the diversity of emotional reactions and concludes, that a representative player's emotional model is hard to define. Instead, an adaptive approach to emotion...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Detection and localization of selected acoustic events in acoustic field for smart surveillance applications

    A method for automatic determination of position of chosen sound events such as speech signals and impulse sounds in 3-dimensional space is presented. The evens are localized in the presence of sound reflections employing acoustic vector sensors. Human voice and impulsive sounds are detected using adaptive detectors based on modified peak-valley difference (PVD) parameter and sound pressure level. Localization based on signals...

    Pełny tekst do pobrania w portalu

  • A Novel Approach to the Assessment of Cough Incidence

    Publikacja

    In this paper we consider the problem of identication of cough events in patients suffering from chronic respiratory diseases. The information about frequency of cough events is necessary to medical treatment. The proposed approach is based on bidirectional processing of a measured vibration signal - cough events are localized by combining the results of forward-time and backward-time analysis. The signal is at rst transformed...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Extracting concepts from the software requirements specification using natural language processing

    Publikacja

    - Rok 2018

    Extracting concepts from the software require¬ments is one of the first step on the way to automating the software development process. This task is difficult due to the ambiguity of the natural language used to express the requirements specification. The methods used so far consist mainly of statistical analysis of words and matching expressions with a specific ontology of the domain in which the planned software will be applicable....

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Quality Evaluation of Novel DTD Algorithm Based on Audio Watermarking

    Publikacja

    Echo cancellers typically employ a doubletalk detection (DTD) algorithm in order to keep the adaptive filter from diverging in the presence of near-end speech signal or other disruptive sounds in the microphone signal. A novel doubletalk detection algorithm based on techniques similar to those used for audio signal watermarking was introduced by the authors. The application of the described DTD algorithm within acoustic echo cancellation...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Detection and localization of selected acoustic events in 3D acoustic field for smart surveillance applications

    A method for automatic determination of position of chosen sound events such as speech signals and impulse sounds in 3-dimensional space is presented. The events are localized in the presence of sound reflections employing acoustic vector sensors. Human voice and impulsive sounds are detected using adaptive detectors based on modified peak-valley difference (PVD) parameter and sound pressure level. Localization based on signals...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Several notes on technical modernization of a historical building

    Publikacja

    - Rok 2013

    This paper describes methods and means used during the renovation process of several elements of a building – the assembly hall, inner courtyards and attics of the main building at Gdansk University of Technology, both in design and construction domains. A tissue of a historic building required special approach to these tasks, with special emphasis on good recognition of problems, careful planning and consistent implementing of...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • TspGWI, a thermophilic class-IIS restriction endonuclease from Thermus sp.,recognizes novel asymmetric sequence 5´-ACGGA(N11/9)-3

    Publikacja
    • A. Żylicz-Stachula
    • R. I. Harasimowicz-Słowińska
    • I. Sobolewski
    • P. Skowron

    - NUCLEIC ACIDS RESEARCH - Rok 2002

    A novel prototype class-IIS restriction endonuclease, TspGWI, was isolated from the thermophilic bacterium Thermus sp. GW. The recognition sequence and cleavage positions have been established: TspGWI recognizes the non-palindromic 5-bp sequence 5′-ACGGA-3′ and cleaves the DNA 11 and 9 nt downstream in the top and bottom strand, respectively. In addition, an accompanying endonuclease, TspGWII, an isoschizomer of Pst I, was found...

    Pełny tekst do pobrania w portalu

  • Identification of acoustic event of selected noise sources in a long-term environmental monitoring systems

    Publikacja
    • M. Kłaczyński
    • W. Cioch
    • T. Wszołek
    • W. Wszołek
    • D. Mleczko
    • P. Pawlik
    • A. Grzeczka

    - Rok 2014

    ABSTRACT Undertaking long-term acoustic measurements on sites located near an airport is related to a problem of large quantities of recorded data, which very often represents information not related to flight operations. In such areas, usually defined as zone of limited use, often other sources of noise exist, such as roads or railway lines treated is such context as acoustic background. Manual verification of such recorded data...

  • Szymon Olewniczak mgr inż.

    Osoby

    Jestem związany z Politechniką Gdańską od 2013 roku, kiedy to rozpocząłem studia inżynierskie na kierunku informatyka na Wydziale Elektroniki, Telekomunikacji i Informatyki. Po uzyskaniu tytułu magistra w 2019 roku podjąłem pracę jako asystent w Katedrze Architektury Systemów Komputerowych. Od 2024 roku pełnię również funkcję zastępcy kierownika katedry. Moje zainteresowania badawcze koncentrują się wokół tematów związanych z przetwarzaniem...

  • Non-Contact Temperature Measurements Dataset

    Publikacja

    - Rok 2022

    The dataset titled The influence of the distance of the pyrometer from the surface of the radiating object on the accuracy of measurements contains temperature measurements using a selection of four commercially available pyrometers (CHY 314P, TM-F03B, TFA 31.1125 and AB-8855) as a function of the measuring distance. The dataset allows a comparison of the accuracy and measuring precision of the devices, which are very important...

    Pełny tekst do pobrania w portalu

  • Towards better understanding of context-aware knowledge transformation

    Publikacja

    Considering different aspects of knowledge functioning, context is poorly understood in spite of intuitively identifying this concept with environmental recognition. For dynamic knowledge, context especially seems to be an essential factor of change. Investigation on the impact of context on knowledge dynamics or more generally on the relationship between knowledge and its contextual interpretation is important in order to understand...

    Pełny tekst do pobrania w portalu

  • Design Elements of Affect Aware Video Games

    Publikacja

    - Rok 2015

    In this paper issues of design and development process of affect-aware video games are presented. Several important design aspects of such games are pointed out. A concept of a middleware framework is proposed that separates the development of affect-aware video games from emotion recognition algorithms and support from input sensors. Finally, two prototype affect-aware video games are presented that conform to the presented architecture...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Musical Instrument Separation Applied to Music Genre Classification . Separacja instrumentów muzycznych w zastosowaniu do rozpoznawania gatunków muzycznych

    Publikacja

    - Rok 2015

    This paper outlines first issues related to music genre classification and a short description of algorithms used for musical instrument separation. Also, the paper presents proposed optimization of the feature vectors used for music genre recognition. Then, the ability of decision algorithms to properly recognize music genres is discussed based on two databases. In addition, results are cited for another database with regard to...

  • THE POSSIBILITIES OF ESTIMATING THE RELIABILITY OF SHIP PIPELINES’ ELEMENTS INCLUDING DESTRUCTIVE PHENOMENA ACTING ON THEM

    Publikacja

    In the article an approach to the problem of estimating reliability data based on physical models is proposed. The possibility of reliability assessment for selected elements of ship pipelines, based on the recognition of the destructive physical phenomena taking place in them, is discussed. To do this, an overview of these phenomena has been made. In addition, a preliminary review of existing measures of destruction of materials...

    Pełny tekst do pobrania w portalu

  • Knowledge representation of motor activity of patients with Parkinson’s disease

    An approach to the knowledge representation extraction from biomedical signals analysis concerning motor activity of Parkinson disease patients is proposed in this paper. This is done utilizing accelerometers attached to their body as well as exploiting video image of their hand movements. Experiments are carried out employing artificial neural networks and support vector machine to the recognition of characteristic motor activity...

    Pełny tekst do pobrania w portalu

  • Two-step mechanism of J-domain action in driving Hsp70 function

    Publikacja
    • B. Tomiczek
    • W. Delewski
    • Ł. Nierzwicki
    • M. Stolarska
    • I. Grochowina
    • B. Schilke
    • R. Dutkiewicz
    • M. A. Uzarska
    • S. Ciesielski
    • J. Czub... i 2 innych

    - PLoS Computational Biology - Rok 2020

    J-domain proteins (JDPs), obligatory Hsp70 cochaperones, play critical roles in protein homeostasis. They promote key allosteric transitions that stabilize Hsp70 interaction with substrate polypeptides upon hydrolysis of its bound ATP. Although a recent crystal structure revealed the physical mode of interaction between a J-domain and an Hsp70, the structural and dynamic consequences of J-domain action once bound and how Hsp70s...

    Pełny tekst do pobrania w portalu

  • Subjective and Objective Comparative Study of DAB+ Broadcast System

    Broadcasting services seek to optimize their use of bandwidth in order to maximize user’s quality of experience. They aim to transmit high-quality digital speech and music signals at the lowest bitrate. They intend to offer the best quality under available conditions. Due to bandwidth limitations, audio quality is in conflict with the number of transmitted radio programs. This paper analyzes whether the quality of real-time digital...

    Pełny tekst do pobrania w portalu

  • Analysis of a caustic formed by a spherical reflector: Impact of a caustic on architectural acoustics

    Publikacja

    Focusing sound in rooms intended for listening to music or speech is an acoustic defect. Design recommendations provide remedial steps to effectively prevent this. However, there is a category of objects of high historical or architectural value in which the sound focus correction is limited or even abandoned. This also applies to indoor or outdoor concert shells, installations for teaching and acoustic presentations, etc. The...

    Pełny tekst do pobrania w portalu

  • Cross-domain applications of multimodal human-computer interfaces

    Publikacja

    - Rok 2015

    Developed multimodal interfaces for education applications and for disabled people are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with mouth gestures and audio interface for speech stretching for hearing impaired and stuttering people and intelligent pen allowing for diagnosing and ameliorating developmental dyslexia. The eye-gaze tracking system named...

  • Impact of the glazed roof on acoustics of historic interiors

    Publikacja

    - Rok 2018

    The paper discusses the adverse acoustic phenomena occurring in the semi-open interiors (courtyards, yards) covered with a glass roof. Particularly negative is the rever-beration noise, which leads to the degradation of the utility functions of the resulting spaces. It involves the drastically reducing the intelligibility of speech, loss of natural sounding of music, problems with the sound system, as well as disturbances in the...

  • Thermal imaging in automatic rodent’s social behaviour analysis

    Publikacja

    - Rok 2016

    Laboratory rodent social behaviour analysis is an extremely important task for biological, medical and pharmacological researches. In this work thermal images features that facilitate analysis are presented. Methods to distinguish objects on the basis of thermal distribution are tested. Actions of grooming or biting one rodent by another - important social behaviour incidents - are clearly visible...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Poly-L-Lysine-modified boron-doped diamond electrodes for the amperometric detection of nucleic acid bases

    Publikacja

    - JOURNAL OF ELECTROANALYTICAL CHEMISTRY - Rok 2015

    Boron-doped diamond (BDD) is a very promising supporting material used in the construction of biosensors for molecular recognition. The direct immobilization of structurally-organized huge molecules, such as poly-L-Lysine (PLL) provides the possibility of determining organic molecules, e.g. nucleic acid bases (e.g. adenine, guanine) or peptides and proteins. This paper describes the direct method for chemical and electrochemical...

    Pełny tekst do pobrania w portalu

  • Report of the ISMIS 2011 Contest : Music Information Retrieval

    Publikacja

    - Rok 2011

    This report presents an overview of the data mining contestorganized in conjunction with the 19th International Symposiumon Methodologies for Intelligent Systems (ISMIS 2011), in days betweenJan 10 and Mar 21, 2011, on TunedIT competition platform. The contestconsisted of two independent tasks, both related to music information retrieval:recognition of music genres and recognition of instruments, for agiven music sample represented...

  • PCR detection of Scopulariopsis brevicaulis

    Scopulariopsis brevicaulis is known as a most common etiological factor of the mould toenail infections. There are also reports indicating that S. brevicaulis could cause organ and disseminated infections. Nowadays microscopic observations from the direct sample and culture are crucial for the appropriate recognition of the infection. In this paper is presented a PCR-based method for S. brevicaulis detection. The specificity of...

    Pełny tekst do pobrania w portalu

  • Hostility bias or sadness bias in excluded individuals: Does anodal transcranial direct current stimulation of right VLPFC vs. left DLPFC have a mitigating effect?

    Publikacja
    • J. Rajchert
    • A. Zajenkowska
    • I. Nowakowska
    • M. Bodecka-Zych
    • A. Abramiuk

    - COGNITIVE AFFECTIVE & BEHAVIORAL NEUROSCIENCE - Rok 2022

    Exclusion has multiple adverse effects on individual’s well-being. It induces anger and hostile cognitions leading to aggressive behavior. The purpose of this study was to test whether exclusion would affect recognition of anger on ambivalent faces of the excluders. We hypothesized that exclusion would elicit more anger encoding (hostility bias) than inclusion, but this effect would be mitigated by anodal tDCS of right VLPFC...

  • AffecTube — Chrome extension for YouTube video affective annotations

    Publikacja

    The shortage of emotion-annotated video datasets suitable for training and validating machine learning models for facial expression-based emotion recognition stems primarily from the significant effort and cost required for manual annotation. In this paper, we present AffecTube as a comprehensive solution that leverages crowdsourcing to annotate videos directly on the YouTube platform, resulting in ready-to-use emotion-annotated...

    Pełny tekst do pobrania w portalu

  • Techniques of acquiring additional features of the responses of individual gas sensors

    Gas sensors usually exhibit lack of selectivity, require fre quent calibration, exhibit drift of the response and a lot of factors, such as humidity or ambient temperature, influen ce their performance. Different approaches can be used to overcome this shortcomings. Building arrays of different sensors and usage of pattern recognition methods to analyze responses of elements...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Recognizing emotions on the basis of keystroke dynamics

    Publikacja

    - Rok 2015

    The article describes a research on recognizing emotional states on the basis of keystroke dynamics. An overview of various studies and applications of emotion recognition based on data coming from keyboard is presented. Then, the idea of an experiment is presented, i.e. the way of collecting and labeling training data, extracting features and finally training classifiers. Different classification approaches are proposed to be...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Przegląd rodzajów chiralnych faz stacjonarnych oraz możliwości ich zastosowań w chromatografii cieczowej

    Chromatograficzne rozdzielanie związków optycznie czynnych ma ogromne znaczenie nie tylko w przemyśle farmaceutycznym, ale i agrochemicznym, a także w badaniach naukowych różnego rodzaju. W niniejszym opracowaniu scharakteryzowano komercyjnie dostępne chiralne fazy stacjonarne na bazie, cyklodekstryn, polisacharydów, makrocyklicznych antybiotyków, eterów koronowych, a także fazy proteinowe, ligandowymienne, jonowymienne oraz fazy...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Pursuing Analytically the Influence of Hearing Aid Use on Auditory Perception in Various Acoustic Situations

    Publikacja

    - Vibrations in Physical Systems - Rok 2022

    The paper presents the development of a method for assessing auditory perception and the effectiveness of applying hearing aids for hard-of-hearing people during short-term (up to 7 days) and longer-term (up to 3 months) use. The method consists of a survey based on the APHAB questionnaire. Additional criteria such as the degree of hearing loss, technological level of hearing aids used, as well as the user experience are taken...

    Pełny tekst do pobrania w portalu

  • Highlighting interlanguage phoneme differences based on similarity matrices and convolutional neural network

    Publikacja

    - Journal of the Acoustical Society of America - Rok 2021

    The goal of this research is to find a way of highlighting the acoustic differences between consonant phonemes of the Polish and Lithuanian languages. For this purpose, similarity matrices are employed based on speech acoustic parameters combined with a convolutional neural network (CNN). In the first experiment, we compare the effectiveness of the similarity matrices applied to discerning acoustic differences between consonant...

    Pełny tekst do pobrania w portalu

  • Smartphone application supporting independent movement of the blind

    Improving comfort of life of blind people is a problem of great importance. Neither a white canenor a guide dog, although both very useful, can be considered as a tool for achieving fullindependence in everyday movement around the city. On the market there are some navigation toolsinspired by car navigation systems, but they have many flaws, ranging from positioninginaccuracies to high prices. The authors present their own solution...

  • ANALIZA PARAMETRÓW SYGNAŁU MOWY W KONTEKŚCIE ICH PRZYDATNOŚCI W AUTOMATYCZNEJ OCENIE JAKOŚCI EKSPRESJI ŚPIEWU

    Praca dotyczy podejścia do parametryzacji w przypadku klasyfikacji emocji w śpiewie oraz porównania z klasyfikacją emocji w mowie. Do tego celu wykorzystano bazę mowy i śpiewu nacechowanego emocjonalnie RAVDESS (Ryerson Audio-Visual Database of Emotional Speech and Song), zawierającą nagrania profesjonalnych aktorów prezentujących sześć różnych emocji. Następnie obliczono współczynniki mel-cepstralne (MFCC) oraz wybrane deskryptory...

    Pełny tekst do pobrania w portalu

  • ALOFON corpus

    The ALOFON corpus is one of the multimodal database of word recordings in English, available at http://www.modality-corpus.org/.  The ALOFON corpus is oriented towards the recording of the speech equivalence variants. For this purpose, a total of 7 people who are or speak English with native speaker fluency and a variety of Standard Southern British...