Wyniki wyszukiwania dla: SPEAKER AUTHENTICATION
-
MODALITY corpus - SPEAKER 33 - COMMANDS C1
Dane BadawczeThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - COMMANDS C1
Dane BadawczeThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 21 - COMMANDS C2
Dane BadawczeThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 01 - SEQUENCE S5
Dane BadawczeThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
The influence of selected parameters of idd on vibrations of contact in ac hybrid breaker
PublikacjaWzrost mocy systemów zasilania na statkach wymaga poszukiwania nowych rozwiązań wyłączników dc oraz ac. Od kilku lat trwają prace nad modelowaniem ultra szybkiego hybrydowego wyłącznika prądu. Czas wyłączenia prądu wyłącznika zależy od wielu parametrów. W pracy przedstawiono wyniki badań eksperymentalnych i symulacyjnych wyłącznika ze szczególnym uwzględnieniem rezystancji zestyku.
-
Comparative Study of Self-Organizing Maps vs. Subjective Evaluation of Quality of Allophone Pronunciation for Nonnative English Speakers
PublikacjaThe purpose of this study was to apply Self-Organizing Maps to differentiate between the correct and the incorrect allophone pronunciations and to compare the results with subjective evaluation. Recordings of a list of target words, containing selected allophones of English plosive consonants, the velar nasal and the lateral consonant, were made twice. First, the target words were read from the list by 9 non-native speakers and...
-
Cobalt(II) tri-tert-butoxysilanethiolates with bidentate spacer ligands
PublikacjaOtrzymano szereg nowych tri-tert-butoksysilanotiolanów Co(II) z dwudonorowymi N,N'-ligandami takimi jak pirazyna, chinoksalina i 4,4'-bipy. W przypadku pirazyny i chinoksaliny otrzymane kompleksy są bimetaliczne, w których atomy metali połączone są odpowiednią aminą. Użyty do syntezy 4,4'-bipy pozwolił na otrzymanie dwóch odmian polimorficznych kompleksu {[Co{SSi(tBuO)3}2]2(μ-4,4'bipy)}, a także polimeru koordynacyjnego o wzorze...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S1
Dane BadawczeThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
Current commutation process in a ultra-fast fuse-IGBT hybrid circuit breaker.
PublikacjaUkład hybrydowy bezpiecznika topikowego i tranzystora IGBT umożliwia bardzo szybkie włączenie do obwodu zwarciowego impedancji ograniczającej wartość prądu. Po eksplozji krótkiego topika bezpiecznika wywołanej narastającym prądem zwarciowym wywołuje w czasie poniżej 2 ms wzrost napięcia na topiku do wartości ok.20 - 25V, po czym prąd jest komutowany do gałęzi z przyrządem półprzewodnikowym typu GTO lub IGBT. Ostateczne wyłączenie...
-
Influence of ci rcuit parameters on the procces of current transfer in thehybrid circuit breaker.
PublikacjaW artykule przedstawiono działanie hybrydowego bezstykowego ogranicznika prądów zwarciowych w oparciu o symulację komputerową oraz porównanie eksperymentu z symulacją komputerową przy pomocy programu MATLAB i PSPICE. Przedstawiono również wady i zalety obliczeń numerycznych przeprowadzonych z zastosowaniem komercyjnych programów na analizę działania łącznika hybrydowego.W analizie uwzględniono model hybrydowego ogranicznika...
-
Influence of circuit breaker operation upon transformeros residual flux and inrush current
PublikacjaPrzedstawiono model i wyniki obliczeń prądu i strumienia magnetycznego w układzie zawierającym transformator trójfazowy i wyłącznik. Opracowano obwodowy model łącznika z łukiem elektrycznym. Model łuku zbudowano w oparciu o jego statyczną charakterystykę prądowo-napięciową z uwzględnieniem uproszczonej charakterystyki dynamicznej. Przeprowadzone symulacje wykazały, że wartość maksymalna prądu włączania transformatora zależy zarówno...
-
Optymalizacja procedur dyskryminacyjnych w procesie weryfikacji mówców - metodyka doboru wag parametrów = Optimization of discriminative procedures in speaker verification process - a method for selecting parameter weights
PublikacjaPoddano testowaniu system weryfikacji mówców, działający w sposób zależny od tekstu, oparty na parametrach cepstralnych. Wstępnie przyjęto wagi wyrównane przypisane do zdefiniowanego w ten sposób wektora wag, właściwego dla obranego systemu parametryzacyjnego. Uzyskane wyniki przedstawiono w postaci macierzy pomyłek (''confusion matrix''). Dobór wartości wektora wag odbywał się w oparciu o część treningową bazy danych przy użyciu...
-
Ribosomal intergenic spacer analysis as a tool for monitoring methanogenic archaea changes in an anaerobic digester
PublikacjaThe applicability of a newly-designed PCR primer pair in examination of methanogenic Archaea in a digester treating plant biomass was evaluated by Ribosmal Intergenic Spacer Analysis (RISA). To find a suitable approach, three variants of RISA were tested: (1) standard, polyacrylamide gel-based, (2) automated, utilized capillary electrophoresis (GA-ARISA), and (3) automated microfluidics-based (MF-ARISA). All three techniques yielded...
-
Emotions in polish speech recordings
Dane BadawczeThe data set presents emotions recorded in sound files that are expressions of Polish speech. Statements were made by people aged 21-23, young voices of 5 men. Each person said the following words / nie – no, oddaj - give back, podaj – pass, stop - stop, tak - yes, trzymaj -hold / five times representing a specific emotion - one of three - anger (a),...
-
Calculations of notch stress factor of a thin-walled spreader bracket fillet weld with the use of a local stress approach
PublikacjaPresence of geometric notches in welded joints causes concentration of strains and stresses, therefore reducing fatigue strength of such joints. This article presents an analysis of stress concentrations in a fillet weld of a spreader mounting bracket on a small sailing yacht. The aim of this article is to direct the attention of designers, manufacturers and regulatory bodies to issues of fatigue cracks that form in brackets fastening...
-
Creating new voices using normalizing flows
PublikacjaCreating realistic and natural-sounding synthetic speech remains a big challenge for voice identities unseen during training. As there is growing interest in synthesizing voices of new speakers, here we investigate the ability of normalizing flows in text-to-speech (TTS) and voice conversion (VC) modes to extrapolate from speakers observed during training to create unseen speaker identities. Firstly, we create an approach for TTS...
-
Badanie rozkładów parametrów sygnału mowy w zastosowaniach do prognozowania prawdopodobieństwa popełnienia błędów w systemach identyfikacji mówców = Examining distribution of speech signal parameters for the prognosis of error probability in speaker verification systems
PublikacjaPrzedmiotem pracy jest system identyfikacji mówców w sposób zależny od tekstu ("text dependent''). Dokonano analizy wielu różnych wypowiedzi kilkudziesięciu mówców. Zastosowana metoda parametryzacji to metoda oparta na wynikach analizy cepstralnej sygnału mowy. Zdefiniowane zostały nowe parametry skojarzone z elementarnymi zdarzeniami w procesie weryfikacji mówców. Na tej podstawie dokonano estymacji funkcji gęstości prawdopodobieństwa...
-
Andrzej Czyżewski prof. dr hab. inż.
OsobyProf. zw. dr hab. inż. Andrzej Czyżewski jest absolwentem Wydziału Elektroniki PG (studia magisterskie ukończył w 1982 r.). Pracę doktorską na temat związany z dźwiękiem cyfrowym obronił z wyróżnieniem na Wydziale Elektroniki PG w roku 1987. W 1992 r. przedstawił rozprawę habilitacyjną pt.: „Cyfrowe operacje na sygnałach fonicznych”. Jego kolokwium habilitacyjne zostało przyjęte jednomyślnie w czerwcu 1992 r. w Akademii Górniczo-Hutniczej...
-
Webinarium o Programie SPINAKER
WydarzeniaCelem szkolenia jest zaprezentowanie możliwości złożenia wniosku projektowego w konkursie SPINAKER – intensywne międzynarodowe programy kształcenia NAWA
-
Actions Speak Louder Than Words: Health Behaviours and Literacy of Future Healthcare Professionals
PublikacjaOur everyday behaviours in life can positively and negatively impact our health, thus cumulatively shaping our lifestyles as more or less healthy. These behaviours are often determined by our knowledge, literacy, motivations and socioeconomic backgrounds. The authors aimed to assess health behaviours and explore variables that may affect persons studying to become future healthcare professionals in Poland. This study was conducted...
-
Actions Speak Louder Than Words: Health Behaviours and the Literacy of Future Healthcare Professionals
Publikacja -
Cross-Lingual Knowledge Distillation via Flow-Based Voice Conversion for Robust Polyglot Text-to-Speech
PublikacjaIn this work, we introduce a framework for cross-lingual speech synthesis, which involves an upstream Voice Conversion (VC) model and a downstream Text-To-Speech (TTS) model. The proposed framework consists of 4 stages. In the first two stages, we use a VC model to convert utterances in the target locale to the voice of the target speaker. In the third stage, the converted data is combined with the linguistic features and durations...
-
The cement-bone bond is weaker than cement-cement bond in cement-in-cement revision arthroplasty. A comparative biomechanical study
PublikacjaThis study compares the strength of the native bone-cement bond and the old-new cement bond under cyclic loading, using third generation cementing technique, rasping and contamination of the surface of the old cement with biological tissue. The possible advantages of additional drilling of the cement surface is also taken into account. Femoral heads from 21 patients who underwent a total hip arthroplasty performed for hip arthritis...
-
Playback detection using machine learning with spectrogram features approach
PublikacjaThis paper presents 2D image processing approach to playback detection in automatic speaker verification (ASV) systems using spectrograms as speech signal representation. Three feature extraction and classification methods: histograms of oriented gradients (HOG) with support vector machines (SVM), HAAR wavelets with AdaBoost classifier and deep convolutional neural networks (CNN) were compared on different data partitions in respect...
-
Determining Pronunciation Differences in English Allophones Utilizing Audio Signal Parameterization
PublikacjaAn allophonic description of English plosive consonants, based on audio-visual recordings of 600 specially selected words, was developed. First, several speakers were recorded while reading words from a teleprompter. Then, every word was played back from the previously recorded sample read by a phonology expert and each examined speaker repeated a particular word trying to imitate correct pronunciation. The next step consisted...
-
Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling
PublikacjaA common approach to the automatic detection of mispronunciation in language learning is to recognize the phonemes produced by a student and compare it to the expected pronunciation of a native speaker. This approach makes two simplifying assumptions: a) phonemes can be recognized from speech with high accuracy, b) there is a single correct way for a sentence to be pronounced. These assumptions do not always hold, which can result...
-
Objectivization of phonological evaluation of speech elements by means of audio parametrization
PublikacjaThis study addresses two issues related to both machine- and subjective-based speech evaluation by investigating five phonological phenomena related to allophone production. Its aim is to use objective parametrization and phonological classification of the recorded allophones. These allophones were selected as specifically difficult for Polish speakers of English: aspiration, final obstruent devoicing, dark lateral /l/, velar nasal...
-
Evidence for widespread occurrence of copper in Late Neolithic Poland? A deposit of Funnel Beaker Culture bone products at site 2 in Osłonki (Kuyavia, central Poland)
Publikacja -
Development of an orbital shaker-assisted fatty acid-based switchable solvent microextraction procedure for rapid and green extraction of amoxicillin from complex matrices: Central composite design
PublikacjaIn this study, a cheap, fast and simple orbital shaker-assisted fatty acid-based switchable solvent microextraction (OS-FASS-ME) procedure was developed for the extraction of amoxicillin (AMOX) in dairy products, pharmaceutical samples and wastewater prior to its spectrophotometric analysis. Fatty acid-based switchable solvents were investigated for extracting AMOX. The key factors of the OS-FASS-ME procedure were optimized using...
-
Technika wyłączania, zmniejszająca skutki zwarć łukowych. - Tł. z Plant Engineering, March 2007. - Tyt. oryg.: Circuit breaker technology reduces arc flash risk / Patricia E. Chandler
PublikacjaW artykule podano sposoby zmniejszania skutków zwarć łukowych oraz opisano zalety nowoczesnych wyłączników ograniczających prądy zwarciowe. Podano możliwości układów elektronicznych w monitorowaniu parametrów elektrycznych instalacji, stanu pracy wyłączników i wykrywaniu zwarć.
-
Analysis of allophones based on audio signal recordings and parameterization
PublikacjaThe aim of this study is to develop an allophonic description of English plosive consonants based on recordings of 600 specially selected words. Allophonic variations addressed in the study may have two sources: positional and contextual. The former one depends on the syllabic or prosodic position in which a particular phoneme occurs. Contextual allophony is conditioned by the local phonetic environment. Co-articulation overlapping...
-
Magnetic hydrophobic deep eutectic solvents for orbital shaker-assisted dispersive liquid-liquid microextraction (MAGDES-OS-DLLME) - determination of nickel and copper in food and water samples by FAAS
PublikacjaIn this work, a cheap and widely applicable dispersive liquid-liquid microextraction (DLLME) method was developed for the extraction of Ni(II) and Cu(II) from water and food samples and analysis using flame atomic absorption spectrometry. DLLME was assisted by orbital shaker, while ferrofluid as an extractant was based on deep eutectic solvent (DES). This ferrofluid was made of hydrophobic DES (hDES), composed of lauric acid and...
-
Wrzeszcz z wyższej rzędnej – parki publiczne w historii Gdańska" w cyklu Dojrzały Smak Przygody (Centrum Informacji i Edukacji Ekologicznej Pomorskich Parków Krajobrazowych)-spacer edukacyjny
PublikacjaSpacer zaczęto w parku miejskim przy ul. Grunwaldzkiej, skąd ruszono do lasu do Parku Jaśkowej Doliny. Zwiedzono najstarszy park publiczny w Europie, podążając XIX wiecznymi ścieżkami, żeby zrobić piknik w teatrze! Następnie wędrówka przecięła Jaśkową Dolinę w kierunku Łąki Festynowej gdzie odwiedzono wzniesienie zwane Ślimakiem (Góra Sobótki 90 m npm) i odwiedzono punkt widokowy ‘Spojrzenie na Gdańsk’ w połowie wyprawy. Następnie...
-
Bimodal classification of English allophones employing acoustic speech signal and facial motion capture
PublikacjaA method for automatic transcription of English speech into International Phonetic Alphabet (IPA) system is developed and studied. The principal objective of the study is to evaluate to what extent the visual data related to lip reading can enhance recognition accuracy of the transcription of English consonantal and vocalic allophones. To this end, motion capture markers were placed on the faces of seven speakers to obtain lip...
-
Edge-Computing based Secure E-learning Platforms
PublikacjaImplementation of Information and Communication Technologies (ICT) in E-Learning environments have brought up dramatic changes in the current educational sector. Distance learning, online learning, and networked learning are few examples that promote educational interaction between students, lecturers and learning communities. Although being an efficient form of real learning resource, online electronic resources are subject to...
-
Spotkanie i spacer po kampusie z autorami albumu
WydarzeniaSpotkanie i spacer po kampusie z autorami albumu KAMPUS POLITECHNIKI GDAŃSKIEJ. Szczegóły: https://pg.edu.pl/pg-otwarta/
-
Improving the quality of speech in the conditions of noise and interference
PublikacjaThe aim of the work is to present a method of intelligent modification of the speech signal with speech features expressed in noise, based on the Lombard effect. The recordings utilized sets of words and sentences as well as disturbing signals, i.e., pink noise and the so-called babble speech. Noise signal, calibrated to various levels at the speaker's ears, was played over two loudspeakers located 2 m away from the speaker. In...
-
Visual perception of vowels from static and dynamic cues
PublikacjaThe purpose of the study was to analyse human identification of Polish vowels from static and dynamic durationally slowed visual cues. A total of 152 participants identified 6 Polish vowels produced by 4 speakers from static (still images) and dynamic (videos) cues. The results show that 59% of static vowels and 63% of dynamic vowels were successfully identified. There was a strong confusion between vowels within front, central,...
-
Audio Feature Analysis for Precise Vocalic Segments Classification in English
PublikacjaAn approach to identifying the most meaningful Mel-Frequency Cepstral Coefficients representing selected allophones and vocalic segments for their classification is presented in the paper. For this purpose, experiments were carried out using algorithms such as Principal Component Analysis, Feature Importance, and Recursive Parameter Elimination. The data used were recordings made within the ALOFON corpus containing audio signal...
-
Examining Influence of Distance to Microphone on Accuracy of Speech Recognition
PublikacjaThe problem of controlling a machine by the distant-talking speaker without a necessity of handheld or body-worn equipment usage is considered. A laboratory setup is introduced for examination of performance of the developed automatic speech recognition system fed by direct and by distant speech acquired by microphones placed at three different distances from the speaker (0.5 m to 1.5 m). For feature extraction from the voice signal...
-
Machine Learning Applied to Aspirated and Non-Aspirated Allophone Classification—An Approach Based on Audio "Fingerprinting"
PublikacjaThe purpose of this study is to involve both Convolutional Neural Networks and a typical learning algorithm in the allophone classification process. A list of words including aspirated and non-aspirated allophones pronounced by native and non-native English speakers is recorded and then edited and analyzed. Allophones extracted from English speakers’ recordings are presented in the form of two-dimensional spectrogram images and...
-
Piotr Marek Smolnicki dr inż. arch.
OsobyAutor, prelegent, architekt-urbanista, doktor nauk inżynieryjno-technicznych w temacie nowoczesnej zautomatyzowanej i współdzielonej mobilności metropolitalnej, ekspert doradzający miastom nt. prowadzenia procesów partycypacji społecznej przy planowaniu przestrzennym, projektował tzw. uchwały krajobrazowe.
-
The Transmission Protocol of Sensor Ad Hoc Networks
PublikacjaThis paper presents a secure protocol for a radio Ad Hoc sensor network. This network uses the TDMA multiple access method. The transmission rate on the radio channel is 57.6 kbps. The paper presents the construction of frames, types of packets and procedures for the authentication, assignment of time slots available to the node, releasing assigned slots and slots assignment conflict detection.
-
The secure transmission protocol of sensor Ad Hoc network
PublikacjaThe paper presents a secure protocol of radio Ad Hoc sensor network. This network operates based on TDMA multiple access method. Transmission rate on the radio channel is 57.6 kbps. The paper presents the construction of frames, types of packets and procedures for the authentication, assignment of time slots available to the node, releasing assigned slots and slots assignment conflict detection.
-
Biometryczna kontrola dostępu
PublikacjaOpisano szczegółowo algorytm detekcji oraz identyfikacji człowieka na podstawie punktów nodalnych twarzy. Zdefiniowano pojęcia: biometria, proces pomiaru biometrycznego, metody biometrycznej identyfikacji oraz kontrola dostępu. Przedstawiono opis opracowanego systemu biometrycznej identyfikacji wykorzystującego sztuczne sieci neuronowe. Podano wyniki badań oraz przeprowadzono ich wnikliwą dyskusję.Biometrics is the study of automated...
-
Visual Lip Contour Detection for the Purpose of Speech Recognition
PublikacjaA method for visual detection of lip contours in frontal recordings of speakers is described and evaluated. The purpose of the method is to facilitate speech recognition with visual features extracted from a mouth region. Different Active Appearance Models are employed for finding lips in video frames and for lip shape and texture statistical description. Search initialization procedure is proposed and error measure values are...
-
Uwierzytelnienie i autoryzacja w systemie STRADAR
PublikacjaPrzedstawiono rozwiązanie serwera uwierzytelnienia i autoryzacji (AA) w rozproszonym systemie STRADAR, udostępniającym funkcjonalności dla prowadzenia działań operacyjnych Morskiego Oddziału Straży Granicznej. System umożliwia prezentację na stanowisku wizualizacji zdarzeń (SWZ) bieżącej i archiwalnej sytuacji na mapie (AIS, radary), obrazu z kamer, zdjęć, notatek, rozmów telefonicznych oraz plików i wiadomości tekstowych (SMS)...
-
Evaluation of aspiration problems in L2 English pronunciation employing machine learning
PublikacjaThe approach proposed in this study includes methods specifically dedicated to the detection of allophonic variation in English. This study aims to find an efficient method for automatic evaluation of aspiration in the case of Polish second-language (L2) English speakers’ pronunciation when whole words are analyzed instead of particular allophones extracted from words. Sample words including aspirated and unaspirated allophones...
-
MACHINE LEARNING–BASED ANALYSIS OF ENGLISH LATERAL ALLOPHONES
PublikacjaAutomatic classification methods, such as artificial neural networks (ANNs), the k-nearest neighbor (kNN) and selforganizing maps (SOMs), are applied to allophone analysis based on recorded speech. A list of 650 words was created for that purpose, containing positionally and/or contextually conditioned allophones. For each word, a group of 16 native and non-native speakers were audio-video recorded, from which seven native speakers’...
-
Facial data registration facility for biometric protection of electronic documents
PublikacjaIn modern world, information is crucial, and its leakage may lead to serious losses. Documents as the main medium of information must be therefore highly protected. Nowadays, the most common way of protecting data is using passwords, however it seems inconvenient to type complex passwords, when it is needed many times a day. For that reason a significant research has been conducted on biometric authentication...