Search results for: NEURAL TEXT-TO-SPEECH MULTILINGUAL SYNTHESIS VOICE CONVERSION SYNTHETIC DATA NORMALISING FLOWS
-
Offshore project management based on jack-up rig conversion
Publication: The chapter presents aspects of offshore project management during the conversion of a drilling rig into an accommodation rig in Batam, Indonesia.
-
Database of speech and facial expressions recorded with optimized face motion capture settings
Publication: The broad objective of the present research is the analysis of spoken English employing a multiplicity of modalities. An important stage of this process, discussed in the paper, is creating a database of speech accompanied by facial expressions. Recordings of speakers were made using an advanced system for capturing facial muscle motion. A brief historical outline, current applications, limitations and the ways of capturing face...
-
Anna Zielińska-Jurek prof. dr hab. inż.
People: Research work on photocatalysis by Prof. Anna Zielińska-Jurek started in 2006 at Gdansk University of Technology (Poland) and included a 3-month research stay at Hokkaido University, the Training School “Environmental Applications of TiO2 Photocatalysis” at the University of Oulu in Finland, funded by the COST Program, and the Training School “NanoBiophotonics” at the Beckman Institute, Urbana-Champaign, USA, funded by the University of Illinois....
-
The Workshop on Multi-Phase Flows
Publication: Presents the idea behind the workshop ''Modelowanie przepływów wielofazowych w układach termochemicznych'' (Modelling of Multiphase Flows in Thermochemical Systems), organized annually since 2000 by the Multiphase Flows Subsection of the Committee on Mechanics of the Polish Academy of Sciences (PAN). Discusses the topics of the 6th Workshop, which was devoted mainly to experimental methods.
-
Foundation text of St. Mary's Church in Gdańsk
Research Data: The data set concerns epigraphy. It refers to the medieval foundation text preserved on the wall above the sacristy entrance in St. Mary’s Church in Gdańsk, which confirms that the foundation stone of the church was laid on 28 March 1343. The data set contains one general photo of the foundation text, transcription of its text in Latin and its Polish...
-
Nuclear Magnetic Resonance data for the synthesis of esterase cleavable antifungal conjugates containing fatty acids as molecular carriers
Research Data: NMR data for novel organic compounds - conjugates composed of C2-18 fatty acid (FA) residues as a molecular carrier and 5-fluorocytosine (5-FC) as an active agent, released upon the action of intracellular esterases on the ester bond between FA and “trimethyl lock” intramolecular linker.
-
Data Analysis 2023/24
Online Courses: Data Analysis, dr inż. Karol Flisikowski, prof. PG - winter semester 2023/24
-
Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency
Publication: In this paper we propose a novel method of pitch estimation based on instantaneous complex frequency (ICF). A new iterative algorithm for analysis of the ICF of a speech signal is presented. The results are compared with commonly used methods to demonstrate its accuracy and the connection between ICF and pitch, particularly for narrowband-filtered speech signals.
-
Results of tests on speech intelligibility in reverberant conditions
Research Data: The dataset contains the results of tests that aimed to establish a relationship between the rate of speech (RoS) and reverberation conditions characterized by the Speech Transmission Index (STI).
-
Resource constrained neural network training
Publication: Modern applications of neural-network-based AI solutions tend to move from datacenter backends to low-power edge devices. Environmental, computational, and power constraints are inevitable consequences of such a shift. Limiting the bit count of neural network parameters proved to be a valid technique for speeding up and increasing efficiency of the inference process. Hence, it is understandable that a similar approach is gaining...
-
Comparison of noise reduction methods in radiometric correlation measurements of two-phase liquid-gas flows
Publication: Two-phase liquid-gas flows occur frequently in the mining, energy, chemical, and petrochemical industries. One of the non-contact methods used to analyse these flows is the gamma ray absorption method. However, the signals received from radiation detectors contain significant stochastic noise, which makes them difficult to analyse. The article describes four methods of noise reduction in cross-correlation measurements of water-air...
-
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
Publication: The problem of video framerate and audio/video synchronization in audio-visual speech recognition is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
-
Toward Intelligent Recommendations Using the Neural Knowledge DNA
Publication: In this paper we propose a novel recommendation approach using past news click data and the Neural Knowledge DNA (NK-DNA). The Neural Knowledge DNA is a novel knowledge representation method designed to support discovering, storing, reusing, improving, and sharing knowledge among machines and computing systems. We examine our approach for news recommendation tasks on the MIND benchmark dataset. By taking advantage of NK-DNA, deep...
-
Catalytic Asymmetric Synthesis of Isochroman Derivatives
Publication: The isochroman scaffold constitutes an important structural unit, which is present in various bioactive natural products and synthetic pharmaceutical compounds exhibiting wide arrays of biological properties. Hence the synthesis of this class of heterocyclic compounds in a stereoselective fashion is highly significant and desirable. In the last decade, a substantial advancement has been witnessed in the catalytic asymmetric...
-
One-Step Synthesis of β-Lactams with Retro-Amide Side Chain
Publication: A one-pot synthesis for the preparation of 1,4-disubstituted-2-oxo-azetidine-3-carboxylic acid amides was developed. 5-(α-N-substituted-amino-α'-hydroxy)methylene Meldrum's acids act as a source of ketenes that react with aldimines in boiling toluene to give β-lactams with a retro-amide side chain.
-
Technology and Energy Conversion Machines
Online Courses: The course covers the basics of mechanical, electrical and thermal energy production in industry and maritime transport. It describes installations supporting high-power engines. Particular attention is paid to the fuel systems of internal combustion engines. The treatment of engine exhaust gases is also described.
-
Proteolysis of whey protein isolates in nanoemulsion systems: impact of nanoemulsification and additional synthetic emulsifiers
Publication: Nanoemulsions are currently of interest in the functional food sector because their small droplet size (100–500 nm) provides a number of potential advantages over conventional emulsions. This study concerned the behavior of nanoemulsions stabilized with whey proteins and two synthetic emulsifiers (Tween 80 and Croduret), and exposed to conditions simulating the human upper gastrointestinal tract. In particular, the effect of synthetic...
-
''Voice Maps'' - system supporting navigation of the blind
Publication: Paper presented at the SHA 2012 Conference, Gołuń, 22-25 May 2012.
-
MEMS based voice message system for elevators
Publication: The article presents an implementation of a voice message system for elevators. The system's unique feature is that it needs no connection to the elevator's control system. Powered by disposable or rechargeable batteries, it can be mounted in the elevator wall and requires only simple calibration. The system is based on MEMS accelerometers that measure accelerations in the elevator cabin. The article presents...
-
A Novel Method for Intelligibility Assessment of Nonlinearly Processed Speech in Spaces Characterized by Long Reverberation Times
Publication: Objective assessment of speech intelligibility is a complex task that requires taking into account a number of factors, such as the different perception of each speech sub-band by the human hearing sense or the different physical properties of each frequency band of a speech signal. Currently, the state-of-the-art method used for assessing the quality of speech transmission is the speech transmission index (STI). It is a standardized way...
-
Weakly-Supervised Word-Level Pronunciation Error Detection in Non-Native English Speech
Publication: We propose a weakly-supervised model for word-level mispronunciation detection in non-native (L2) English speech. To train this model, phonetically transcribed L2 speech is not required and we only need to mark mispronounced words. The lack of phonetic transcriptions for L2 speech means that the model has to learn only from a weak signal of word-level mispronunciations. Because of that and due to the limited amount of mispronounced...
-
Big Data i 5V – nowe wyzwania w świecie danych (Big Data and 5V – New Challenges in the World of Data)
Publication: The types of data that make up Big Data sets include, among others, data generated by users of web portals, data describing transactions made over the Internet, scientific data (biological, astronomical, physical measurements, etc.), data generated by robots automatically crawling the Internet (Web mining, Web crawling), graph data depicting links between web pages, and so on. Usually,...
-
Text (new title Text and Talk)
Journals
-
Topological invariants for equivariant flows: Conley index and degree
Publication: About forty years have passed since Charles Conley defined the homotopy index. Thereby, he generalized the ideas that go back to the calculus of variations work of Marston Morse. Within this long time the Conley index has proved to be a valuable tool in nonlinear analysis and dynamical systems. A significant development of applied methods has been observed. Later, the index theory has evolved to cover such areas as discrete dynamical...
-
On Sample Rate Conversion Based on Variable Fractional Delay Filters
Publication: The sample rate conversion algorithm based on variable fractional delay filters is often used if the resampling ratio cannot be expressed as the ratio of small integer numbers or if it is not constant. The main advantage of such a solution is that it allows for arbitrary resampling ratios which can even be changed during the resampling process. In this paper a discussion on the influence of different approaches to fractional filter...
-
Data Analytics Meeting
Events: Data Analytics Meeting, a conference for students and doctoral candidates
-
Materials for energy storage and conversion devices 2022/2023
Online Courses: Electrodes: Metals as electrodes in aqueous and non-aqueous systems, metal nanoparticles. Current collectors. 3D, 2D, 1D carbons, carbon nanomaterials. Organic semiconductors "Synthetic metals" - p-type, n-type. Inorganic semiconductors: oxides, selenides, sulfides, others. Intercalated electrodes. Mixed conductors (MIEC). Photo-active semiconductor materials. Electrolytes. Aqueous electrolytes in commercial products. Electroactive...
-
Synthesis Methods of nanomaterials & Experimental nanotechnology
Online Courses
-
Outlier detection method by using deep neural networks
Publication: Detecting outliers in a data set is important for building effective predictive models. Consistent predictions cannot be made with models built on data sets containing outliers, nor can robust models be created. In such cases, it may be possible to exclude observations that are determined to be outliers from the data set, or to assign less weight to these points of observation than to other points of observation....
-
Text classifiers for automatic articles categorization
Publication: The article concerns the problem of automatic classification of textual content. We present selected methods for generating document representations and evaluate them in classification tasks. The experiments were performed on Wikipedia articles automatically classified into the categories assigned by Wikipedia editors.
-
Gesture Recognition With the Linear Optical Sensor and Recurrent Neural Networks
Publication: In this paper, the optical linear sensor, a representative of low-resolution sensors, was investigated in the multiclass recognition of near-field hand gestures. The recurrent neural network (RNN) with a gated recurrent unit (GRU) memory cell was utilized as a gesture classifier. A set of 27 gestures was collected from a group of volunteers. The 27 000 sequences obtained were divided into training, validation, and test subsets....
-
Complexes of silanethiolate ligands: Synthesis, structure, properties and application
Publication: The purposeful syntheses of silanethiolate complexes started approximately in the mid-eighties of the 20th century but no summary of the synthetic efforts has been reported until now. The synthetic methods and the resulting complexes have some common features, which are emphasized throughout the review. Thereby specific difficulties during synthesis are outlined and the structures, properties and possible applications of the resulting...
-
Third Text
Journals
-
Social Text
Journals
-
Word and Text
Journals
-
Neural Development
Journals
-
NEURAL NETWORKS
Journals
-
Neural Computation
Journals
-
Text & Talk
Journals
-
Visual Lip Contour Detection for the Purpose of Speech Recognition
Publication: A method for visual detection of lip contours in frontal recordings of speakers is described and evaluated. The purpose of the method is to facilitate speech recognition with visual features extracted from a mouth region. Different Active Appearance Models are employed for finding lips in video frames and for lip shape and texture statistical description. A search initialization procedure is proposed and error measure values are...
-
Objectivization of phonological evaluation of speech elements by means of audio parametrization
Publication: This study addresses two issues related to both machine- and subjective-based speech evaluation by investigating five phonological phenomena related to allophone production. Its aim is to use objective parametrization and phonological classification of the recorded allophones. These allophones were selected as specifically difficult for Polish speakers of English: aspiration, final obstruent devoicing, dark lateral /l/, velar nasal...
-
Minimizing Distribution and Data Loading Overheads in Parallel Training of DNN Acoustic Models with Frequent Parameter Averaging
Publication: In the paper we investigate the performance of parallel deep neural network training with parameter averaging for acoustic modeling in Kaldi, a popular automatic speech recognition toolkit. We describe experiments based on training a recurrent neural network with 4 layers of 800 LSTM hidden states on a 100-hour corpus of annotated Polish speech data. We propose an MPI-based modification of the training program which minimizes the...
-
Robustness in Compressed Neural Networks for Object Detection
Publication: Model compression techniques can significantly reduce the computational cost associated with data processing by deep neural networks with only a minor decrease in average accuracy. At the same time, reducing the model size may have a large effect on noisy cases or objects belonging to less frequent classes. It is a crucial problem from the perspective of the models' safety, especially for object detection in the autonomous driving...
-
Neural network agents trained by declarative programming tutors
Publication: This paper presents an experimental study on the development of a neural network-based agent trained on data generated using declarative programming. The focus of the study is the application of various agents to solve the classic logic task – The Wumpus World. The paper evaluates the effectiveness of neural-based agents across different map configurations, offering a comparative analysis to underline the strengths and limitations...
-
Design and Analysis of Artificial Neural Network (ANN) Models for Achieving Self-Sustainability in Sanitation
Publication: The present study investigates the potential of using fecal ash as an adsorbent and demonstrates a self-sustaining, optimized approach for urea recovery from wastewater streams. Fecal ash was prepared by heating synthetic feces to 500 °C and then processing it as an adsorbent for urea adsorption from synthetic urine. Since this adsorption approach based on fecal ash is a promising alternative for wastewater treatment, it increases...
-
ATOMIC DATA AND NUCLEAR DATA TABLES
Journals
-
Elimination of clicks from archive speech signals using sparse autoregressive modeling
Publication: This paper presents a new approach to elimination of impulsive disturbances from archive speech signals. The proposed sparse autoregressive (SAR) signal representation is given in a factorized form - the model is a cascade of the so-called formant filter and pitch filter. Such a technique has been widely used in code-excited linear prediction (CELP) systems, as it guarantees model stability. After detection of noise pulses using linear...
-
Toward Intelligent Vehicle Intrusion Detection Using the Neural Knowledge DNA
Publication: In this paper, we propose a novel intrusion detection approach using past driving experience and the neural knowledge DNA for in-vehicle information system security. The neural knowledge DNA is a novel knowledge representation method designed to support discovering, storing, reusing, improving, and sharing knowledge among machines and computing systems. We examine our approach for classifying malicious vehicle control commands...