Wyniki wyszukiwania dla: NEURAL TEXT-TO-SPEECH MULTILINGUAL SYNTHESIS VOICE CONVERSION SYNTHETIC DATA NORMALISING FLOWS

Wyniki wyszukiwania dla: NEURAL TEXT-TO-SPEECH MULTILINGUAL SYNTHESIS VOICE CONVERSION SYNTHETIC DATA NORMALISING FLOWS

wyników na stronę:
osadź ten widok na swojej stronie

Filtry

wszystkich: 10878

wyczyść wszystkie filtry niedostępne

wyświetlamy 1000 najlepszych wyników Pomoc

Offshore project management based on jack-up rig conversion
Publikacja
- R. Bielski
- M. Bielski
- M. Łukanowska
- Rok 2015
Chapter presents aspects of Offshore project management during conversion drilling rig to accommodation rig in Batam Indonesia
Database of speech and facial expressions recorded with optimized face motion capture settings
Publikacja
- A. Czyżewski
- M. Kawaler
- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Rok 2019
The broad objective of the present research is the analysis of spoken English employing a multiplicity of modalities. An important stage of this process, discussed in the paper, is creating a database of speech accompanied with facial expressions. Recordings of speakers were made using an advanced system for capturing facial muscle motion. A brief historical outline, current applications, limitations and the ways of capturing face...

Pełny tekst do pobrania w portalu
Anna Zielińska-Jurek prof. dr hab. inż.

Osoby

Katedra Inżynierii Procesowej i Technologii Chemicznej

Research work on photocatalysis by Prof. Anna Zielinska-Jurek started in 2006 at Gdansk University of Technology (Poland), including 3-month research stay at Hokkaido University, Training School “Environmental Applications of TiO2 Photocatalysis” at the University of Oulu in Finland granted by COST Program and Training School “NanoBiophotonics” at the Beckmann Institute, Urbana-Champaign in the USA granted by University of Illinois....
The Workshop on Multi-Phase Flows
Publikacja
- J. Cieśliński
- D. Butrymowicz
- Rok 2005
Przedstawiono ideę warsztatów ''Modelowanie przepływów wielofazowych w układach termochemicznych'' - organizowanych corocznie - od 2000 roku, przez Podsekcję Przepływów Wielofazowych Komitetu Mechaniki PAN. Omówiono tematykę VI Warsztatów, które były poświęcone głównie metodom eksperymentalnym.
Foundation text of St. Mary's Church in Gdańsk
Dane Badawcze
open access
- E. Starek
- G. Kotłowski
The data set concerns epigraphy. It refers to the medieval foundation preserved on the wall above the sacristy entrance in St. Mary’s Church in Gdańsk, which confirms that the foundation stone of the temple was laid on 28th of March 1343. The data set contains one general photo of the foundation text, transcription of its text in Latin and its Polish...
Nuclear Magnetic Resonance data for the synthesis of esterase cleavable antifungal conjugates containing fatty acids as molecular carriers
Dane Badawcze
open access
- M. Nowak
- S. Milewski
NMR data for novel organic compounds - conjugates composed of C2-18 fatty acid (FA) residues as a molecular carrier and 5-fluorocytosine (5-FC) as an active agent, released upon the action of intracellular esterases on the ester bond between FA and “trimethyl lock” intramolecular linker.
Data Analysis 2023/24
Kursy Online
- K. Flisikowski
Data Analysisdr inż. Karol Flisikowski, prof. PG - winter semester 2023/24
Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency
Publikacja
- Elektronika : konstrukcje, technologie, zastosowania - Rok 2008
In this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). New iterative algorithm for analysis of ICF of speech signal in presented. Obtained results are compared with commonly used methods to prove its accuracy and connection between ICF and pitch, particularly for narrowband-filtered speech signal.
Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency
Publikacja
- T. Bandurski
- Ł. Hamerski
- M. Papaj
- A. Paruzel
- K. Świder
- Rok 2007
In this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). New iterative algorithm for analysis of ICF of speech signal in presented. Obtained results are compared with commonly used methods to prove its accuracy and connection between ICF and pitch, particularly for narrowband-filtered speech signal.
Results of tests on speech intelligibility in reverberant conditions
Dane Badawcze
open access
The dataset contains the results of tests that aimed to provide a relationship between the rate of speech (RoS) and reverberation conditions characterized by the Speech Transmission Index (STI).
Resource constrained neural network training
Publikacja
- M. Pietrołaj
- M. Blok
- Scientific Reports - Rok 2024
Modern applications of neural-network-based AI solutions tend to move from datacenter backends to low-power edge devices. Environmental, computational, and power constraints are inevitable consequences of such a shift. Limiting the bit count of neural network parameters proved to be a valid technique for speeding up and increasing efficiency of the inference process. Hence, it is understandable that a similar approach is gaining...

Pełny tekst do pobrania w portalu
Comparison of noise reduction methods in radiometric correlation measurements of two-phase liquid-gas flows
Publikacja
- M. Zych
- R. Hanus
- B. Wilk
- L. Petryka
- D. Świsulski
- MEASUREMENT - Rok 2018
Two-phase liquid-gas flows occur frequently in the mining, energy, chemical, and petrochemical industries. One of non-contact methods used to analyse these flows is the gamma ray absorption method. However, the signals received from radiation detectors contain a significant stochastic noise, which makes them difficult to analyse. The article describes four methods of noise reduction in cross-correlation measurements of water-air...

Pełny tekst do pobrania w portalu
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
Publikacja
- Rok 2014
The problem of video framerate and audio/video synchronization in audio-visual speech recogni-tion is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
Publikacja
- Rok 2014
The problem of video framerate and audio/video synchronization in audio-visual speech recognition is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
Toward Intelligent Recommendations Using the Neural Knowledge DNA
Publikacja
- G. Ning
- C. Wu
- H. Zhang
- E. Szczerbicki
- CYBERNETICS AND SYSTEMS - Rok 2021
In this paper we propose a novel recommendation approach using past news click data and the Neural Knowledge DNA (NK-DNA). The Neural Knowledge DNA is a novel knowledge representation method designed to support discovering, storing, reusing, improving, and sharing knowledge among machines and computing systems. We examine our approach for news recommendation tasks on the MIND benchmark dataset. By taking advantages of NK-DNA, deep...

Pełny tekst do pobrania w portalu
Catalytic Asymmetric Synthesis of Isochroman Derivatives
Publikacja
- K. Joshi
- Rok 2020
The isochroman scaffold constitutes an important structural unit, which is present in various bioactive natural products and synthetic pharmaceutical compounds exhibiting wide arrays of biological properties. Hence the synthesis of this class of heterocyclic compounds in a stereoselective fashion is highly significant and desirable. In the last decade, a substantial advancement has been witnessed ln the catalytic asymmetric...
One-Step Synthesis of b-Lactams with Retro-Amide Side Chain
Publikacja
- SYNTHESIS-STUTTGART - Rok 2011
Abstract: A one pot synthesis for preparation of 1,4-disubstituted-2-oxo-azetidine-3-carboxylic acid amides was developed. 5-(α-N-substituted-amino-α'-hydroxy)methylene Meldrum's acids act as a source of ketenes that react with aldimines in boiling toluene to give b-lactams with retro-amid side chain.

Pełny tekst do pobrania w portalu
Technology and Energy Conversion Machines
Kursy Online
- Z. Kneba
The course covers the basics of mechanical, electrical and thermal energy production in industry and maritime transport. Describes installations supporting high-power engines. Particular attention has been paid to the fuel systems of internal combustion engines. The treatment of engine exhaust gases is described.
Proteolysis of whey protein isolates in nanoemulsion systems: impact of nanoemulsification and additional synthetic emulsifiers
Publikacja
- FOOD CHEMISTRY - Rok 2021
Nanoemulsions are currently of interest in the functional food sector because their small droplet size (100–500 nm) provides a number of potential advantages over conventional emulsions. This study concerned the behavior of nanoemulsions stabilized with whey proteins and two synthetic emulsifiers (Tween 80 and Croduret), and exposed to conditions simulating the human upper gastrointestinal tract. In particular, the effect of synthetic...

Pełny tekst do pobrania w serwisie zewnętrznym
''Voice Maps'' - system supporting navigation of the blind
Publikacja
- HYDROACOUSTICS - Rok 2012
Referat wygłoszony na Konferencji SHA 2012,Gołuń, 22-25.V.2012.

Pełny tekst do pobrania w portalu
MEMS based voice message system for elevators
Publikacja
- M. Kłosowski
- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Rok 2007
W artykule przedstawiono implementację systemu głosowych komunikatów w windach. Prezentowany system posiada unikalną cechę polegającą na tym, że do działania nie potrzebuje połączenia z systemem sterującym windy. Zasilany z baterii lub akumulatorów może być zamontowany w ścianie windy, wymaga tylko prostej kalibracji. System oparty jest na akcelerometrach MEMS dokonujących pomiaru przeciążeń w kabinie windy. W artykule przedstawiono...
A Novel Method for Intelligibility Assessment of Nonlinearly Processed Speech in Spaces Characterized by Long Reverberation Times
Publikacja
- SENSORS - Rok 2022
Objective assessment of speech intelligibility is a complex task that requires taking into account a number of factors such as different perception of each speech sub-bands by the human hearing sense or different physical properties of each frequency band of a speech signal. Currently, the state-of-the-art method used for assessing the quality of speech transmission is the speech transmission index (STI). It is a standardized way...

Pełny tekst do pobrania w portalu
Weakly-Supervised Word-Level Pronunciation Error Detection in Non-Native English Speech
Publikacja
- D. Korzekwa
- J. Lorenzo-trueba
- T. Drugman
- S. Calamaro
- B. Kostek
- Rok 2021
We propose a weakly-supervised model for word-level mispronunciation detection in non-native (L2) English speech. To train this model, phonetically transcribed L2 speech is not required and we only need to mark mispronounced words. The lack of phonetic transcriptions for L2 speech means that the model has to learn only from a weak signal of word-level mispronunciations. Because of that and due to the limited amount of mispronounced...

Pełny tekst do pobrania w portalu
Big Data i 5V – nowe wyzwania w świecie danych (Big Data and 5V – New Challenges in the World of Data)
Publikacja
- K. Goczyła
- Rok 2014
Rodzaje danych, składające się na zbiory typu Big Data, to m.in. dane generowane przez użytkowników portali internetowych, dane opisujące transakcje dokonywane poprzez Internet, dane naukowe (biologiczne, astronomiczne, pomiary fizyczne itp.), dane generowane przez roboty w wyniku automatycznego przeszukiwania przez nie Internetu (Web mining, Web crawling), dane grafowe obrazujące powiązania pomiędzy stronami WWW itd. Zazwyczaj,...
Text (new tilte Text and Talk)

Czasopisma

ISSN: 0165-4888
Topological invariants for equivariant flows: Conley index and degree
Publikacja
- M. Styborski
- Rok 2010
About forty years have passed since Charles Conley defined the homotopy index. Thereby, he generalized the ideas that go back to the calculus of variations work of Marston Morse. Within this long time the Conley index has proved to be a valuable tool in nonlinear analysis and dynamical systems. A significant development of applied methods has been observed. Later, the index theory has evolved to cover such areas as discrete dynamical...
On Sample Rate Conversion Based on Variable Fractional Delay Filters
Publikacja
- M. Blok
- International Journal of Computer Science and Application - Rok 2013
The sample rate conversion algorithm based on variable fractional delay filters is often used if the resampling ratio cannot be expressed as the ratio of small integer numbers or if it is not constant. The main advantage of such solution is that it allows for arbitrary resampling ratios which can even be changed during the resampling process. In this paper a discussion on influence of different approaches to fractional filter...

Pełny tekst do pobrania w serwisie zewnętrznym
Data Analytics Meeting

Wydarzenia

17-05-2024 08:30 - 18-05-2024 15:00

Data Analytics Meeting Konferencja studentów i doktorantów
Materials for energy storage and conversion devices 2022/2023
Kursy Online
Electrodes: Metals as electrodes in aqueous and non-aqueous systems, metal nanoparticles. Collectors current. 3D, 2D, 1 D carbons, carbon nanomaterials. Organic semiconductors "Synthetic metals" - p-type, n-type. Inorganic semiconductors: oxides, selenides, sulfides, others. Intercalated electrodes. Mixed conductors (MIEC). Photo-active semiconductor materials. Electrolytes. Water electrolytes in commercial products. Electroactive...
Synthesis Methods of nanomaterials & Experimental nanotechnology
Kursy Online
- M. S. Łapiński
Outlier detection method by using deep neural networks
Publikacja
- O. Aydin
- S. Erpolat Tasabat
- Rok 2017
Detecting outliers in the data set is quite important for building effective predictive models. Consistent prediction can not be made through models created with data sets containing outliers, or robust models can not be created. In such cases, it may be possible to exclude observations that are determined to be outlier from the data set, or to assign less weight to these points of observation than to other points of observation....

Pełny tekst do pobrania w serwisie zewnętrznym
Text classifiers for automatic articles categorization
Publikacja
- Rok 2012
The article concerns the problem of automatic classification of textual content. We present selected methods for generation of documents representation and we evaluate them in classification tasks. The experiments have been performed on Wikipedia articles classified automatically to their categories made by Wikipedia editors.
Gesture Recognition With the Linear Optical Sensor and Recurrent Neural Networks
Publikacja
- IEEE SENSORS JOURNAL - Rok 2018
In this paper, the optical linear sensor, a representative of low-resolution sensors, was investigated in the multiclass recognition of near-field hand gestures. The recurrent neural network (RNN) with a gated recurrent unit (GRU) memory cell was utilized as a gestures classifier. A set of 27 gestures was collected from a group of volunteers. The 27 000 sequences obtained were divided into training, validation, and test subsets....

Pełny tekst do pobrania w portalu
Complexes of silanethiolate ligands: Synthesis, structure, properties and application
Publikacja
- A. Pladzyk
- D. Kowalkowska-Zedler
- A. Ciborska
- A. Schnepf
- A. Dołęga
- COORDINATION CHEMISTRY REVIEWS - Rok 2021
The purposeful syntheses of silanethiolate complexes started approximately in the mid-eighties of the 20th century but no summary of the synthetic efforts has been reported till now. The synthetic methods and the resulting complexes have some common features, which are emphasized throughout the review. Thereby specific difficulties during synthesis are outlined and the structures, properties and possible applications of the resulting...

Pełny tekst do pobrania w portalu
Third Text

Czasopisma

ISSN: 0952-8822 , eISSN: 1475-5297
Social Text

Czasopisma

ISSN: 0164-2472 , eISSN: 1527-1951
Word and Text

Czasopisma

ISSN: 2069-9271
Neural Development

Czasopisma

ISSN: 1749-8104
NEURAL NETWORKS

Czasopisma

ISSN: 0893-6080 , eISSN: 1879-2782
Neural Computation

Czasopisma

ISSN: 0899-7667 , eISSN: 1530-888X
Text & Talk

Czasopisma

ISSN: 1860-7330 , eISSN: 1860-7349
Visual Lip Contour Detection for the Purpose of Speech Recognition
Publikacja
- Rok 2014
A method for visual detection of lip contours in frontal recordings of speakers is described and evaluated. The purpose of the method is to facilitate speech recognition with visual features extracted from a mouth region. Different Active Appearance Models are employed for finding lips in video frames and for lip shape and texture statistical description. Search initialization procedure is proposed and error measure values are...
Objectivization of phonological evaluation of speech elements by means of audio parametrization
Publikacja
- Rok 2018
This study addresses two issues related to both machine- and subjective-based speech evaluation by investigating five phonological phenomena related to allophone production. Its aim is to use objective parametrization and phonological classification of the recorded allophones. These allophones were selected as specifically difficult for Polish speakers of English: aspiration, final obstruent devoicing, dark lateral /l/, velar nasal...
Minimizing Distribution and Data Loading Overheads in Parallel Training of DNN Acoustic Models with Frequent Parameter Averaging
Publikacja
- P. Rościszewski
- J. Kaliski
- Rok 2017
In the paper we investigate the performance of parallel deep neural network training with parameter averaging for acoustic modeling in Kaldi, a popular automatic speech recognition toolkit. We describe experiments based on training a recurrent neural network with 4 layers of 800 LSTM hidden states on a 100-hour corpora of annotated Polish speech data. We propose a MPI-based modiﬁcation of the training program which minimizes the...

Pełny tekst do pobrania w serwisie zewnętrznym
Robustness in Compressed Neural Networks for Object Detection
Publikacja
- S. Cygert
- A. Czyżewski
- Rok 2021
Model compression techniques allow to significantly reduce the computational cost associated with data processing by deep neural networks with only a minor decrease in average accuracy. Simultaneously, reducing the model size may have a large effect on noisy cases or objects belonging to less frequent classes. It is a crucial problem from the perspective of the models' safety, especially for object detection in the autonomous driving...

Pełny tekst do pobrania w portalu
Neural network agents trained by declarative programming tutors
Publikacja
- J. Dobrosolski
- J. Szymański
- H. Mora
- K. Draszawka
- Rok 2024
This paper presents an experimental study on the development of a neural network-based agent, trained using data generated using declarative programming. The focus of the study is the application of various agents to solve the classic logic task – The Wumpus World. The paper evaluates the effectiveness of neural-based agents across different map configurations, offering a comparative analysis to underline the strengths and limitations...

Pełny tekst do pobrania w serwisie zewnętrznym
Design and Analysis of Artificial Neural Network (ANN) Models for Achieving Self-Sustainability in Sanitation
Publikacja
- M. Ganesapillai
- A. Sinha
- R. Mehta
- A. Tiwari
- V. Chellappa
- J. Drewnowski
- Applied Sciences-Basel - Rok 2022
The present study investigates the potential of using fecal ash as an adsorbent and demonstrates a self-sustaining, optimized approach for urea recovery from wastewater streams. Fecal ash was prepared by heating synthetic feces to 500 °C and then processing it as an adsorbent for urea adsorption from synthetic urine. Since this adsorption approach based on fecal ash is a promising alternative for wastewater treatment, it increases...

Pełny tekst do pobrania w portalu
ATOMIC DATA AND NUCLEAR DATA TABLES

Czasopisma

ISSN: 0092-640X , eISSN: 1090-2090
Elimination of clicks from archive speech signals using sparse autoregressive modeling
Publikacja
- M. Niedźwiecki
- M. Ciołek
- Rok 2012
This paper presents a new approach to elimination of impulsivedisturbances from archive speech signals. The proposedsparse autoregressive (SAR) signal representation is given ina factorized form - the model is a cascade of the so-called formantfilter and pitch filter. Such a technique has been widelyused in code-excited linear prediction (CELP) systems, as itguarantees model stability. After detection of noise pulses usinglinear...

Pełny tekst do pobrania w serwisie zewnętrznym
Toward Intelligent Vehicle Intrusion Detection Using the Neural Knowledge DNA
Publikacja
- F. Li
- H. Zhang
- J. Wang
- C. Sanin
- E. Szczerbicki
- CYBERNETICS AND SYSTEMS - Rok 2018
In this paper, we propose a novel intrusion detection approach using past driving experience and the neural knowledge DNA for in-vehicle information system security. The neural knowledge DNA is a novel knowledge representation method designed to support discovering, storing, reusing, improving, and sharing knowledge among machines and computing systems. We examine our approach for classifying malicious vehicle control commands...

Pełny tekst do pobrania w portalu

Wyszukiwarka

Filtry

Katalog

Wyniki wyszukiwania dla: NEURAL TEXT-TO-SPEECH MULTILINGUAL SYNTHESIS VOICE CONVERSION SYNTHETIC DATA NORMALISING FLOWS

Anna Zielińska-Jurek prof. dr hab. inż.