Total results: 1365
Showing the top 1000 results
Search results for: text-to-speech transcription
-
Method and algorithms for signal modification to support speech understanding by people with impaired temporal resolution of hearing
Publication: The research carried out for this dissertation concerns real-time methods for time-scale modification (TSM) of the speech signal and the assessment of their effect on speech understanding in people with impaired temporal resolution of hearing. Impaired hearing resolution is one of the symptoms associated with central auditory processing disorder (CAPD). In contrast...
-
USING NEURAL NETWORKS TO SYNTHESIZE SPEECH EXPRESSING EMOTIONS
Publication: This article analyzes speech-based emotion recognition solutions and the possibility of using them, by means of neural networks, in the synthesis of emotional speech. It presents current solutions for emotion recognition in speech and methods of speech synthesis using neural networks. A significant increase in interest in, and use of, deep learning is currently observed in applications related to...
-
Regulated assembly of lipopolysaccharide and sensing of its alterations in Escherichia coli
Publication: This thesis describes the mechanism that regulates transcription of the rpoE gene, which encodes an essential RNA polymerase subunit in Escherichia coli. RpoE regulates the extracytoplasmic stress response regulon and is required to initiate transcription of genes whose products are involved in the folding of periplasmic proteins and in the synthesis and transport of outer membrane components. The transcriptional regulation of the rpoE...
-
Investigating Feature Spaces for Isolated Word Recognition
Publication: Over the past decades, researchers have given much attention to speech processing in automatic speech recognition (ASR). The study investigates the appropriateness of a two-dimensional representation of speech feature spaces for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and time-frequency signal representation...
-
Enhanced voice user interface employing spatial filtration of signals from acoustic vector sensor
Publication: Spatial filtration of sound is introduced to enhance speech recognition accuracy in noisy conditions. An acoustic vector sensor (AVS) is employed. The signals from the AVS probe are processed in order to attenuate the surrounding noise. As a result, the signal-to-noise ratio is increased. An experiment is featured in which speech signals are disturbed by babble noise. The signals before and after spatial filtration are processed...
-
Slowing down speech to improve speech understanding by children at school
Publication: This paper presents time-scale modification algorithms that could be used for hearing impairment therapy supported by real-time speech stretching. The OLA-based algorithms and the Phase Vocoder are described. In the experimental part, the usability of those algorithms for real-time speech stretching is discussed.
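As an illustration of the overlap-add (OLA) idea named in the abstract above, here is a minimal Python sketch. It is not the paper's implementation: the function name `ola_stretch`, the Hann window, the frame length and the hop sizes are all assumptions chosen for illustration, and the sketch works offline on a list of samples rather than in real time. Plain OLA ignores waveform alignment between frames, so it tends to produce audible artifacts on voiced speech; the abstract does not say which OLA variants the authors evaluated.

```python
import math


def hann(n):
    """Periodic Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / n) for i in range(n)]


def ola_stretch(signal, rate, frame_len=256, synth_hop=128):
    """Change the duration of `signal` by roughly 1/rate with plain OLA.

    rate < 1 slows the signal down, rate > 1 speeds it up.
    All parameter defaults are illustrative assumptions.
    """
    if rate <= 0:
        raise ValueError("rate must be positive")
    # Frames are read every `analysis_hop` samples and written every
    # `synth_hop` samples; the ratio of the two sets the stretch factor.
    analysis_hop = max(1, round(synth_hop * rate))
    window = hann(frame_len)
    n_frames = max(1, 1 + (len(signal) - frame_len) // analysis_hop)
    out_len = (n_frames - 1) * synth_hop + frame_len
    out = [0.0] * out_len
    norm = [0.0] * out_len  # accumulated window weight per output sample
    for k in range(n_frames):
        a = k * analysis_hop  # read position in the input
        s = k * synth_hop     # write position in the output
        for i in range(frame_len):
            if a + i >= len(signal):
                break
            out[s + i] += window[i] * signal[a + i]
            norm[s + i] += window[i]
    # Divide by the summed window weight so overlapping frames do not
    # change the overall amplitude.
    return [o / w if w > 1e-8 else 0.0 for o, w in zip(out, norm)]
```

Calling `ola_stretch(samples, 0.5)` yields a signal close to twice as long, which corresponds to slowing speech to about half speed.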
-
ENGLISH SPEECH CORPUS FOR MULTIMODAL AUTOMATIC SPEECH RECOGNITION
Publication: The paper presents an audio-visual speech corpus containing 31 hours of English speech recordings. The corpus is intended for automatic audio-visual speech recognition. It contains video recordings from a high-frame-rate stereo vision camera and audio recorded by a microphone array and a laptop microphone. Thanks to the inclusion of recordings made in noisy conditions, the corpus...
-
Piotr Krajewski dr
People: Piotr Krajewski works as a senior librarian at the Gdańsk University of Technology Library. As a member of the Scientific and Technical Information Section, he focuses primarily on issues related to the Open Access movement and the role of institutional repositories in its development. He is also the author of articles on the standardization of usage statistics for electronic resources and on the problem of “predatory publishers”...
-
Analyzing and Visualizing Uncertain Knowledge: The Use of TEI Annotations in the PROVIDEDH Open Science Platform
Publication
-
The database of localization and expression of aquaporin 3 (AQP3), aquaporin 7 (AQP7) and aquaporin 9 (AQP9) in the male reproductive system in cattle. Morphometric studies. Localization of zinc finger transcription factor GATA-4.
Research Data: The data present research results that are part of the OPUS-22 project entitled “In search of new markers of male fertility in cattle. Aquaporins expression in the reproductive organs and sperm of the bulls (Bos taurus)” obtained from the National Science Center in Poland (grant no. 2021/43/B/NZ9/00204). The aim of this part of the project was (i) to determine...
-
Interaction of the conserved region 4.2 of sigma(E) with the RseA anti-sigma factor
Publication: Esigma(E) RNA polymerase transcribes a regulon of folding factors for the bacterial envelope and is induced by physical and chemical stresses. The RseA anti-sigma factor inhibits the activity of Esigma(E) RNA polymerase. It is shown here that the N-terminal portion of sigma(E), residues 1-153, binds core RNA polymerase. RseA interacts with residues 154-191 of sigma(E), a site that is homologous to region 4, the sigma factor binding...
-
Data on solutions of Hes1 system
Research Data: Hes1 protein (hairy and enhancer of split 1) belongs to the basic helix-loop-helix (bHLH) family of transcription proteins, i.e. proteins that bind DNA in the promoter region or in another region where transcription is regulated. The database collects data on solutions of the Hes1 systems with multiple binding sites and the dimer formation...
-
Instantaneous complex frequency for pipeline pitch estimation
Publication: The paper proposes a pipeline algorithm for estimating the pitch of a speech signal. The algorithm uses instantaneous complex frequencies (ICFs) estimated for four waveforms obtained by filtering the original speech signal through four bandpass complex Hilbert filters. The imaginary parts of the ICFs from each channel give four candidates for pitch estimates. The decision regarding the final estimate is made based on the real parts of...
-
XVIII Międzynarodowe Sympozjum Inżynierii i Reżyserii Dźwięku
Publication: The subjective assessment of speech signals takes into account previous experiences and habits of an individual. Since the perception process deteriorates with age, differences should be noticeable among people from dissimilar age groups. In this work, we investigated the difference in speech quality assessment between high school students and university students. The study involved 60 participants, with 30 people in both the adolescents...
-
PHONEME DISTORTION IN PUBLIC ADDRESS SYSTEMS
Publication: The quality of voice messages in speech reinforcement and public address systems is often poor. The sound engineering projects of such systems take care of sound intensity and possible reverberation phenomena in public space without, however, considering the influence of acoustic interference related to the number and distribution of loudspeakers. This paper presents the results of measurements and numerical simulations of the...
-
Human voice modification using instantaneous complex frequency
Publication: The paper presents the possibilities of changing the human voice by modifying the instantaneous complex frequency (ICF) of the speech signal. The proposed method provides a flexible way of altering voice without the necessity of finding fundamental frequency and formants' positions or detecting voiced and unvoiced fragments of speech. The algorithm is simple and fast. Apart from ICF it uses signal factorization into two factors: one fully...
-
Wyspiański Pavilion
Publication: Text on Wyspiański Pavilion in Cracow.
-
Gdańsk Shakespeare Theatre
Publication: Text on Gdańsk Shakespeare Theatre.
-
Review on Wikification methods
Publication: The paper reviews methods for automatic annotation of texts with Wikipedia entries. The process, called Wikification, aims at building references between concepts identified in the text and Wikipedia articles. Wikification finds many applications, especially in text representation, where it enables one to capture the semantic similarity of the documents. Also, it can be considered as automatic tagging of the text. We describe typical...
-
Accountability of political power as an element of strengthening democracy and raising its quality: the example of Poland
Publication: The aim of the article is to identify appropriate,...
-
European Solidarity Centre
Publication: Text on European Solidarity Centre in Gdansk.
-
National Music Forum
Publication: Text on National Music Forum in Wroclaw.
-
Investigating Feature Spaces for Isolated Word Recognition
Publication: The study addresses the appropriateness of a two-dimensional representation of the speech signal for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and time-frequency signal representation converted to the investigated feature spaces. In particular, waveforms and fractal dimension features of the signal were chosen for the time domain, and...
-
Training strategies for a neural pitch frequency estimator using a synthetic vowel generator
Publication: Many telecommunication applications involve processing or analyzing a speech signal, and a pitch frequency estimator is often used there as one of the basic algorithms. The estimator considered in this work is based on a neural classifier that makes decisions from the instantaneous frequency and power determined in subbands of the analyzed speech signal. In this work we consider...
-
Elektrownia Masovia Centre for Contemporary Art
Publication: Text on Elektrownia Masovia Centre for Contemporary Art in Radom.
-
Gracjana Klein-Raina dr hab.
People
-
INVESTIGATION OF THE LOMBARD EFFECT BASED ON A MACHINE LEARNING APPROACH
Publication: The Lombard effect is an involuntary increase in the speaker’s pitch, intensity, and duration in the presence of noise. It makes it possible to communicate in noisy environments more effectively. This study aims to investigate an efficient method for detecting the Lombard effect in uttered speech. The influence of interfering noise, room type, and the gender of the person on the detection process is examined. First, acoustic parameters...
-
How IT audit can help your communications branch company increase effectiveness of using IT technologies and contributes to implementation of business
Publication: Text based on the example of Cable Television Ltd. Co. in Koszalin.
-
Audio-visual aspect of the Lombard effect and comparison with recordings depicting emotional states.
Publication: This paper presents an analysis of audio-visual recordings of the Lombard effect. First, the audio signal is analyzed, indicating the presence of this phenomenon in the recorded sessions. The principal aim, however, was to discuss problems related to extracting differences caused by the Lombard effect that are present in the video, i.e. visible as tension and work of facial muscles aligned to an increase in the intensity of the articulated...
-
Regulatory approach to competition law in the practice of the Polish Competition Authority – critical assessment
Publication: The text provides a critical analysis of the Polish practice of using competition law provisions as regulation provisions.
-
Cabbage Juices and Indoles Modulate the Expression Profile of AhR, ERα, and Nrf2 in Human Breast Cell Lines
Publication: Our previous studies showed the diversified effect of cabbage juices and indoles on the key enzymes of estrogen metabolism (CYP1A1, CYP1A2, CYP1B1) in breast epithelial cells differing in ER status, i.e., in tumorigenic (MCF7, MDA-MB-231) and non-tumorigenic (MCF10A) cells. The aim of the present study was to further investigate the mechanism of chemopreventive action of cabbage juice and its active components by evaluating their effect...
-
Auditory Brainstem Responses recorded employing Audio ABR device
Research Data: The dataset consists of ABR measurements employing click, burst and speech stimuli. Parameters of the particular stimuli were as follows:
-
Radar with rotary head
Publication: Nowadays the use of radar is no longer reserved for military purposes. It finds many applications in various areas of science and industry. It may be used to obtain extended information about the state of critical infrastructure, like shipyards or petrochemical plants. Furthermore, it may be applied in vision-denied environments. The aim of this project...
-
Analysis of a gene expression model
Publication: We study a mathematical model of gene transcription and protein synthesis with negative feedback. We consider a system of equations taking into account the number of active binding sites, the way in which dimers bind to DNA and the time delay in the translation process. For a simplified model that consists of three ordinary differential equations with time delay we derive conditions for stability of the positive steady state and for the...
-
Embedded system using Bluetooth Low Energy sensors for smart farming applications
Publication: The main goal of this Bachelor of Engineering project titled Embedded system using Bluetooth Low Energy sensors for smart farming applications is to create a prototype of a system consistent with the Agriculture 4.0 concept using Bluetooth Low Energy (BLE) technology. The developed solution shall be easy to implement and its main functionality shall be periodic gathering of data from environmental sensors...
-
Exterior Plasterwork in Gdynia's Modernist Architecture and Its Preservation
Publication: The text presents the historical plasterwork on the facades of modernist monuments built before WWII as an important expression in architecture, and its conservation.
-
MEMORYSCAPES OF EASTERN POLAND
Publication: The text investigates new phenomena emerging in the field of social memory and commemoration in contemporary Poland. On the basis of field analyses, case studies and theoretical, transdisciplinary approaches, the paper discusses the issue of contemporary memoryscapes in eastern Poland (Bialystok and Lublin). These emerging forms of remembrance are the result of the sophisticated interplay between different actors involved in the...
-
Variable Ratio Sample Rate Conversion Based on Fractional Delay Filter
Publication: This paper presents a sample rate conversion algorithm that allows for a continuously changing resampling ratio. The proposed implementation is based on a variable fractional delay filter which is implemented by means of a Farrow structure. Coefficients of this structure are computed on the basis of fractional delay filters which are designed using the offset window method. The proposed approach allows us to freely...
-
Mathematical analysis of a generalised p53-Mdm2 protein gene expression model
Publication: We propose a generalisation of the p53-Mdm2 protein gene expression model introduced by Monk (2003). We investigate the stability of a unique positive steady state and formulate conditions which guarantee the occurrence of the Hopf bifurcation. We show that oscillatory behaviour can be caused not only by a time lag in the protein transcription process, but can also be present in the model without time delay. Moreover, we investigate...
-
Prof. Haitham Abu-Rub - A Visit to Poland's Gdansk University of Technology
Publication: Report on the visit of Prof. Haitham Abu-Rub to Gdansk University of Technology. Speech on the Smart Grid Centre. Visit to the new smart grid laboratory of the GUT, the Laboratory for Innovative Power Technologies and Integration of Renewable Energy Sources (LINTE^2).
-
Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling
Publication: A common approach to the automatic detection of mispronunciation in language learning is to recognize the phonemes produced by a student and compare them to the expected pronunciation of a native speaker. This approach makes two simplifying assumptions: a) phonemes can be recognized from speech with high accuracy, b) there is a single correct way for a sentence to be pronounced. These assumptions do not always hold, which can result...
-
A Comparison of STI Measured by Direct and Indirect Methods for Interiors Coupled with Sound Reinforcement Systems
Publication: This paper presents a comparison of STI (Speech Transmission Index) coefficient measurement results carried out by direct and indirect methods. First, acoustic parameters important in the context of public address and sound reinforcement systems are recalled. A measurement methodology is presented that employs various test signals to determine impulse responses. The process of evaluating sound system performance, signals enabling...
-
Molecular and structural basis of inner core lipopolysaccharide alterations in Escherichia coli: incorporation of glucuronic acid and phosphoethanolamine in the heptose region.
Publication: It is well established that lipopolysaccharide (LPS) often carries nonstoichiometric substitutions in lipid A and in the inner core. In this work, the molecular basis of inner core alterations and their physiological significance are addressed. A new inner core modification of LPS is described, which arises due to the addition of glucuronic acid on the third heptose with a concomitant loss of phosphate on the second heptose. This...
-
Rediscovering Automatic Detection of Stuttering and Its Subclasses through Machine Learning—The Impact of Changing Deep Model Architecture and Amount of Data in the Training Set
Publication: This work deals with automatically detecting stuttering and its subclasses. An effective classification of stuttering along with its subclasses could find wide application in determining the severity of stuttering by speech therapists, preliminary patient diagnosis, and enabling communication with the previously mentioned voice assistants. The first part of this work provides an overview of examples of classical and deep learning...
-
Neotenic phenomenon in gene expression in the skin of Foxn1-deficient (nude) mice - a projection for regenerative skin wound healing
Publication: Mouse fetuses up to day 16 of embryonic development and nude (Foxn1-deficient) mice are examples of animals that undergo regenerative (scar-free) skin healing. The expression of transcription factor Foxn1 in the epidermis of mouse fetuses begins at embryonic day 16.5, which coincides with the transition point from scar-free to scar-forming skin wound healing. In the present study, we tested the hypothesis that Foxn1 expression...
-
The awareness of the profession and the self-reflection of the primary, secondary and upper secondary school teachers on their own practice in the light of empirical studies
Publication: The article presents the issue of awareness of the profession and the self-reflection of primary, secondary and upper secondary school teachers on their own practice. The text refers to data based on empirical studies.
-
A comparative study of English viseme recognition methods and algorithms
Publication: The paper considers an elementary visual unit, the viseme, in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative study of their effectiveness. In the course of the study an optimal feature vector construction...
-
A comparative study of English viseme recognition methods and algorithm
Publication: The paper considers an elementary visual unit, the viseme, in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative study of their effectiveness. In the course of the study an optimal feature vector...
-
Comparative analysis of various transformation techniques for voiceless consonants modeling
Publication: In this paper, a comparison of various transformation techniques, namely Discrete Fourier Transform (DFT), Discrete Cosine Transform (DCT) and Discrete Walsh Hadamard Transform (DWHT), is performed in the context of their application to voiceless consonant modeling. Speech features based on these transformation techniques are extracted. These features are mean and derivative values of cepstrum coefficients, derived from each transformation....
-
Modeling and Designing Acoustical Conditions of the Interior – Case Study
Publication: The primary aim of this research study was to model acoustic conditions of the Courtyard of the Gdańsk University of Technology Main Building, and then to design a sound reinforcement system for this interior. First, results of measurements of the parameters of the acoustic field are presented. Then, the comparison between measured and predicted values using the ODEON program is shown. Collected data indicate a long reverberation...