Filtry
wszystkich: 678
wybranych: 427
-
Katalog
Filtry wybranego katalogu
Wyniki wyszukiwania dla: Query by Sketch
-
Instantaneous complex frequency for pipeline pitch estimation
PublikacjaIn the paper a pipeline algorithm for estimating the pitch of speech signal is proposed. The algorithm uses instantaneous complex frequencies estimated for four waveforms obtained by filtering the original speech signal through four bandpass complex Hilbert filters. The imaginary parts of ICFs from each channel give four candidates for pitch estimates. The decision regarding the final estimate is made based on the real parts of...
-
XVIII Międzynarodowe Sympozjum Inżynierii i Reżyserii Dźwięku
PublikacjaThe subjective assessment of speech signals takes into account previous experiences and habits of an individual. Since the perception process deteriorates with age, differences should be noticeable among people from dissimilar age groups. In this work, we investigated the difference of speech quality assessment between high school students and university students. The study involved 60 participants, with 30 people in both the adolescents...
-
KORPUS MOWY ANGIELSKIEJ DO CELÓW MULTIMODALNEGO AUTOMATYCZNEGO ROZPOZNAWANIA MOWY
PublikacjaW referacie zaprezentowano audiowizualny korpus mowy zawierający 31 godzin nagrań mowy w języku angielskim. Korpus dedykowany jest do celów automatycznego audiowizualnego rozpoznawania mowy. Korpus zawiera nagrania wideo pochodzące z szybkoklatkowej kamery stereowizyjnej oraz dźwięk zarejestrowany przez matrycę mikrofonową i mikrofon komputera przenośnego. Dzięki uwzględnieniu nagrań zarejestrowanych w warunkach szumowych korpus...
-
Creating new voices using normalizing flows
PublikacjaCreating realistic and natural-sounding synthetic speech remains a big challenge for voice identities unseen during training. As there is growing interest in synthesizing voices of new speakers, here we investigate the ability of normalizing flows in text-to-speech (TTS) and voice conversion (VC) modes to extrapolate from speakers observed during training to create unseen speaker identities. Firstly, we create an approach for TTS...
-
PHONEME DISTORTION IN PUBLIC ADDRESS SYSTEMS
PublikacjaThe quality of voice messages in speech reinforcement and public address systems is often poor. The sound engineering projects of such systems take care of sound intensity and possible reverberation phenomena in public space without, however, considering the influence of acoustic interference related to the number and distribution of loudspeakers. This paper presents the results of measurements and numerical simulations of the...
-
New Technology or Adaptation at the Frontier? Butchery as a Signifier of Cultural Transitions in the Medieval Eastern Baltic
Publikacja -
Nowe wyzwania, nowe rozwiązania. Jak przedsiębiorstwo branży IT odnajduje się w erze VUCA?
PublikacjaZmienność, niepewność, złożoność i niejednoznaczność, określane akronimem VUCA (volatility, uncertainty, complexity, ambiguity) towarzyszą funkcjonowaniu każdego przedsiębiorstwa. Jak wykorzystać te nieodłączne cechy otoczenia przedsiębiorstw jako szanse dla ich rozwoju to wy-zwania stojące przed nimi, a zarazem pytanie badawcze artykułu. Celem artykułu jest wykazanie, na przykładzie przedsiębiorstwa branży IT, że analiza zmienności,...
-
Początek Gdyńskiego Systemu Wodociągowego Wodociąg wiejski w gminie Oksywie w latach 1911 – 1929. Część I.
PublikacjaPrzedmiotem badań był wodociąg wiejski na Oksywiu z początku XX wieku, jako najstarszy na ziemiach polskich pod zaborami. Po przeprowadzeniu żmudnej kwerendy odtworzono przebieg procesu decyzyjnego jego budowy i eksploatacji. Szczególnie ważkie informacje odkryto w protokołach z posiedzeń Rady Gminnej Oksywia napisanych w języku staroniemieckim w latach 1911 – 1920. W rezultacie ustalono parametry techniczne sieci i urządzeń wodociągowych...
-
Applying ground penetrating radar to tracking of ancient architectural transformations: the case of the monastery St. Peter on the Island of Rab (Croatia)
PublikacjaThe ground-penetrating radar (GPR) method has been used for many years in archaeological research. However, thismethod is still not widely used in studies of past architecture. The biggest problem with the implementation of the GPRmethod at such sites is usually connected with extensive debris layers, plant cover and standing relics of walls and otherfeatures that restrict the available measurement area. Despite of these, properly...
-
Flexible Knowledge–Vision–Integration Platform for Personal Protective Equipment Detection and Classification Using Hierarchical Convolutional Neural Networks and Active Leaning
PublikacjaThis work is part of an effort to develop of a Knowledge-Vision Integration Platform for Hazard Control (KVIP-HC) in industrial workplaces, adaptable to a wide range of industrial environments. The paper focuses on hazards resulted from the non-use of personal protective equipment (PPE). The objective is to test the capability of the platform to adapt to different industrial environments by simulating the process of randomly selecting...
-
Human voice modification using instantaneous complex frequency
PublikacjaThe paper presents the possibilities of changing human voice by modifying instantaneous complex frequency (ICF) of the speech signal. The proposed method provides a flexible way of altering voice without the necessity of finding fundamental frequency and formants' positions or detecting voiced and unvoiced fragments of speech. The algorithm is simple and fast. Apart from ICF it uses signal factorization into two factors: one fully...
-
THREE-LEVEL F-TYPE INVERTER
PublikacjaGiven the recent available IGBT switch modules up to 6.5 kV, 1200 A rating, the prospect of the diode-free variant topology of the three-level neutral-point-clamped (3-level, T-type) inverter in certain medium voltage applications is bright; due to its small part count and low conduction losses compared to the diode-clamped NPC inverter. However, within this voltage range, the input dc voltage rating of 50% of the switches per...
-
Investigating Feature Spaces for Isolated Word Recognition
PublikacjaThe study addresses the issues related to the appropriateness of a two-dimensional representation of speech signal for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and time-frequency signal representation converted to the investigated feature spaces. In particular, waveforms and fractal dimension features of the signal were chosen for the time domain, and...
-
Quantum security and theory of decoherence
PublikacjaWe sketch a relation between two crucial, yet independent, fields in quantum information research, viz. quantum decoherence and quantum cryptography. We investigate here how the standard cryptographic assumption of shielded laboratory, stating that data generated by a secure quantum device remain private unless explicitly published, is disturbed by the einselection mechanism of quantum Darwinism explaining the measurement process...
-
Auditory-visual attention stimulator
PublikacjaNew approach to lateralization irregularities formation was proposed. The emphasis is put on the relationship between visual and auditory attention stimulation. In this approach hearing is stimulated using time scale modified speech and sight is stimulated by rendering the text of the currently heard speech. Moreover, displayed text is modified using several techniques i.e. zooming, highlighting etc. In the experimental part of...
-
Asking Data in a Controlled Way with Ask Data Anything NQL
PublikacjaWhile to collect data, it is necessary to store it, to understand its structure it is necessary to do data-mining. Business Intelligence (BI) enables us to make intelligent, data-driven decisions by the mean of a set of tools that allows the creation of a potentially unlimited number of machine-generated, data-driven reports, which are calculated by a machine as a response to queries specified by humans. Natural Query Languages...
-
INVESTIGATION OF THE LOMBARD EFFECT BASED ON A MACHINE LEARNING APPROACH
PublikacjaThe Lombard effect is an involuntary increase in the speaker’s pitch, intensity, and duration in the presence of noise. It makes it possible to communicate in noisy environments more effectively. This study aims to investigate an efficient method for detecting the Lombard effect in uttered speech. The influence of interfering noise, room type, and the gender of the person on the detection process is examined. First, acoustic parameters...
-
Audio-visual aspect of the Lombard effect and comparison with recordings depicting emotional states.
PublikacjaIn this paper an analysis of audio-visual recordings of the Lombard effect is shown. First, audio signal is analyzed indicating the presence of this phenomenon in the recorded sessions. The principal aim, however, was to discuss problems related to extracting differences caused by the Lombard effect, present in the video , i.e. visible as tension and work of facial muscles aligned to an increase in the intensity of the articulated...
-
Accurate modeling of quasi-resonant inverter fed IM drive
PublikacjaIn this paper wide-band modeling methodology of a parallel quasi-resonant dc link inverter (PQRDCLI) fed induction machine (IM) is presented. The modeling objective is early-design stage prediction of conductive electromagnetic interference (EMI) emissions of the considered converter fed IM drive system. Operation principles of the selected topology of PQRDCLI feeding IM drive are given. Modeling of the converter drive system is...
-
Jacek Krenz : City in my eyes
PublikacjaSketch City: Tips and Inspiration for Drawing on Location is the companion for any creative traveler, urban explorer, or budding artist. 50 artists scattered around the globe have been brought together to share tips about their favorite urban settings, including locations in South America, Europe, Canada, Asia, and beyond. Citified sketchers are guiding through techniques for capturing frenetic street scenes as well as the rare...
-
Effective configuration of a double triad planar parallel manipulator for precise positioning of heavy details during their assembling process
PublikacjaIn the paper, dynamics analysis of a parallel manipulator is presented. It is an atypical manipulator, devoted to help in assembling of heavy industrial constructions. Few atypical properties are required: small workspace; slow velocities; high loads. Initially, a short discussion about definition of the parallel manipulators is presented, as well as the sketch of the proposed structure. In parallel, some definitions, assumptions...
-
SiMiSnoRNA: Collection of siRNA, miRNA, and snoRNA database for RNA Interference
PublikacjaObjective:The discovery of sequence specific gene silencing which occurs due to the presence of double-stranded RNAs has considerable impact on biology, revealing an unknown level of regulation of gene expression. This process is known as RNA interference (RNAi) or RNA silencing in which RNA molecules inhibit gene expression, typically by causing the destruction of specific mRNA molecule. Two types of small RNA molecules - small...
-
Strategie treningu neuronowego estymatora częstotliwości tonu krtaniowego z użyciem generatora syntetycznych samogłosek
PublikacjaW wielu zastosowaniach telekomunikacyjnych pojawia się problem przetwarzania lub analizy sygnału mowy, w ramach którego, często w obszarze podstawowych algorytmów, stosuje się estymator częstotliwości tonu krtaniowego. Estymator rozpatrywany w tej pracy bazuje na neuronowym klasyfikatorze podejmującym decyzje na podstawie częstotliwości oraz mocy chwilowej wyznaczanych w podpasmach analizowanego sygnału mowy. W pracy rozważamy...
-
Problemy napędów maszyn do konfekcjonowania folii opakowaniowych.
PublikacjaPrzedstawiono konstrukcje maszyn do konfekcjonowania folii stretch, samoprzylepnej i wstępnego rozciągania. Problemem jest napęd wałów rozciągających folię. Pokazano złożony napęd w maszynie do konfekcjonowania folii samoprzylepnej.
-
RDF dataset profiling - a survey of features, methods, vocabularies and applications
PublikacjaThe Web of Data, and in particular Linked Data, has seen tremendous growth over the past years. However, reuse and take-up of these rich data sources is often limited and focused on a few well-known and established RDF datasets. This can be partially attributed to the lack of reliable and up-to-date information about the characteristics of available datasets. While RDF datasets vary heavily with respect to the features related...
-
Testing Question Order Effects of Self-perception of Risk Propensity on Simple Lottery Choices as Measures of the Actual Risk Propensity
PublikacjaUncertainty together with the necessity of making choices inevitably results in risky decisions. For many years now, scientists have been studying notions connected with risk such as risk management, risk perception or risk propensity. While many sophisticated methods regarding measurement of risk propensity have been developed so far, it seems that little attention has been paid to checking whether they are not inherently flawed....
-
Graph Representation Integrating Signals for Emotion Recognition and Analysis
PublikacjaData reusability is an important feature of current research, just in every field of science. Modern research in Affective Computing, often rely on datasets containing experiments-originated data such as biosignals, video clips, or images. Moreover, conducting experiments with a vast number of participants to build datasets for Affective Computing research is time-consuming and expensive. Therefore, it is extremely important to...
-
Approximation Strategies for Generalized Binary Search in Weighted Trees
PublikacjaWe consider the following generalization of the binary search problem. A search strategy is required to locate an unknown target node t in a given tree T. Upon querying a node v of the tree, the strategy receives as a reply an indication of the connected component of T\{v} containing the target t. The cost of querying each node is given by a known non-negative weight function, and the considered objective is to minimize the total...
-
Variable Ratio Sample Rate Conversion Based on Fractional Delay Filter
PublikacjaIn this paper a sample rate conversion algorithm which allows for continuously changing resampling ratio has been presented. The proposed implementation is based on a variable fractional delay filter which is implemented by means of a Farrow structure. Coefficients of this structure are computed on the basis of fractional delay filters which are designed using the offset window method. The proposed approach allows us to freely...
-
Prof. Haitham Abu-Rub - A Visit to Poland's Gdansk University of Technology
PublikacjaReport on visit of Prof. Haitham Abu-Rub in Gdansk University of Technology. Speech on the Smart Grid Centre. Visit in the new smart grid laboratory of the GUT, the Laboratory for Innovative Power Technologies and Integration of Renewable Energy Sources (LINTE^2).
-
A Comparison of STI Measured by Direct and Indirect Methods for Interiors Coupled with Sound Reinforcement Systems
PublikacjaThis paper presents a comparison of STI (Speech Transmission Index) coefficient measurement results carried out by direct and indirect methods. First, acoustic parameters important in the context of public address and sound reinforcement systems are recalled. A measurement methodology is presented that employs various test signals to determine impulse responses. The process of evaluating sound system performance, signals enabling...
-
Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling
PublikacjaA common approach to the automatic detection of mispronunciation in language learning is to recognize the phonemes produced by a student and compare it to the expected pronunciation of a native speaker. This approach makes two simplifying assumptions: a) phonemes can be recognized from speech with high accuracy, b) there is a single correct way for a sentence to be pronounced. These assumptions do not always hold, which can result...
-
Rediscovering Automatic Detection of Stuttering and Its Subclasses through Machine Learning—The Impact of Changing Deep Model Architecture and Amount of Data in the Training Set
PublikacjaThis work deals with automatically detecting stuttering and its subclasses. An effective classification of stuttering along with its subclasses could find wide application in determining the severity of stuttering by speech therapists, preliminary patient diagnosis, and enabling communication with the previously mentioned voice assistants. The first part of this work provides an overview of examples of classical and deep learning...
-
Modeling and Designing Acoustical Conditions of the Interior – Case Study
PublikacjaThe primary aim of this research study was to model acoustic conditions of the Courtyard of the Gdańsk University of Technology Main Building, and then to design a sound reinforcement system for this interior. First, results of measurements of the parameters of the acoustic field are presented. Then, the comparison between measured and predicted values using the ODEON program is shown. Collected data indicate a long reverberation...
-
A comparative study of English viseme recognition methods and algorithm
PublikacjaAn elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector...
-
Comparative analysis of various transformation techniques for voiceless consonants modeling
PublikacjaIn this paper, a comparison of various transformation techniques, namely Discrete Fourier Transform (DFT), Discrete Cosine Transform (DCT) and Discrete Walsh Hadamard Transform (DWHT) are performed in the context of their application to voiceless consonant modeling. Speech features based on these transformation techniques are extracted. These features are mean and derivative values of cepstrum coefficients, derived from each transformation....
-
A comparative study of English viseme recognition methods and algorithms
PublikacjaAn elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector construction...
-
Ocena kondycji ekonomiczno-finansowej spółek teleinformatycznych notowanych na Giełdzie Papierów Wartościowych w Warszawie
PublikacjaW rozdziale oceniono kondycję ekonomiczno-finansową spółek segmentu SiTech notowanych na Giełdzie Papierów Wartościowych w Warszawie. Dokonano wyboru zmiennych diagnostycznych oraz zaproponowano wskaźnik syntetyczny. Przedstawiona metodologia i analizy mogą stanowić cenne źródło dla inwestorów giełdowych.
-
Broadband/Dual-band Metal-Mountable UHF RFID Tag Antennas: A Systematic Review. Taxonomy Analysis, Standards of Seamless RFID System Operation, Supporting IoT implementations, Recommendations and Future Directions
PublikacjaThe employment of broadband/dual-band ultra-high frequency (UHF) radio frequency identification (RFID) tag antennas contributes to the growth of RFID technology, with many potential implications, such as the increase of international trade, and reducing costs thereof. This study presents all reported articles on RFID tags for metal objects that can work seamlessly across different countries. Moreover, it addresses all available...
-
Research project BRIK: development of an innovative method for determining the precise trajectory of a railway vehicle
PublikacjaIn the paper the essential assumptions regarding a research project implemented by a consortium of Gdansk University of Technology and Gdynia Maritime University are presented. The project has been commissioned by National Center of Research and Development with cooperation with Polish Railways (PKP Polskie Linie Kolejowe S.A.). The project is focused in implementation of modern measurement techniques using Global Navigation Positioning...
-
Can Web Search Queries Predict Prices Change on the Real Estate Market?
PublikacjaThis study aims to explore whether the intensity of internet searches, according to the Google Trends search volume index (SVI), is a predictor of changes in real estate prices. The motivation of this study is the possibility to extend the understanding of the extra predictive power of Google search engine query volume of future housing price change (shift direction) by (i) the introduction of a research approach that combines...
-
Cable-stayed bridges. Basic static schemes
PublikacjaThe paper presents an overview of shaping of cable-stayed bridges. Historical background, basic static sketches and overview of selected bridges are included. Selected natural solutions and interesting unrealized projects were presented. Basic ideas and most important principals are discussed. The examples and sketches were given an author's comment. Static diagrams of two pylon structures with three variants of the arrangement...
-
Smart Knowledge Engineering for Cognitive Systems: A Brief Overview
PublikacjaCognition in computer sciences refers to the ability of a system to learn at scale, reason with purpose, and naturally interact with humans and other smart systems, such as humans do. To enhance intelligence, as well as to introduce cognitive functions into machines, recent studies have brought humans into the loop, turning the system into a human–AI hybrid. To effectively integrate and manipulate hybrid knowledge, suitable technologies...
-
Playback detection using machine learning with spectrogram features approach
PublikacjaThis paper presents 2D image processing approach to playback detection in automatic speaker verification (ASV) systems using spectrograms as speech signal representation. Three feature extraction and classification methods: histograms of oriented gradients (HOG) with support vector machines (SVM), HAAR wavelets with AdaBoost classifier and deep convolutional neural networks (CNN) were compared on different data partitions in respect...
-
Dual polarization antennas for UHF RFID readers
PublikacjaThis paper presents various concepts of switching polarization in patch antenna dedicated for UHF RFID readers. Proposed designs allow for switching between linear and circular polarization. The first design does not require electronic switching as the polarization can be changed by choosing one of two available feeding terminals. Two remaining designs use PIN diode or FET SPDT switch.
-
Photoelectron spectroscopy of a series of acetate and propionate esters
PublikacjaThe electronic state and photoionization spectroscopy of a series of acetate esters: methyl acetate, isopropyl acetate, butyl acetate and pentyl acetate as well as two propionates: methyl propionate and ethyl propionate, have been determined using vacuum-ultraviolet photoelectron spectroscopy. These experimental investigations are complemented by ab initio calculations. The measured first adiabatic and vertical ionization energies...
-
THE INFLUENCE OF PET MECHANICAL PROPERTIES ON SBM PROCESS PARAMETERS – LITERATURE REVIEW
PublikacjaIn the paper it is said about the influence of PET (polyethylene terephthalate) mechanical properties on SBM (stretch blow molding) process output parameters changes. The below paper mentions also about the influence of PET orientation and crystallization processes on mechanical and thermal properties of PET material during SBM process. All mechanical data of PET material and SBM process output parameters changes are from collected...
-
Evaluation Criteria for Affect-Annotated Databases
PublikacjaIn this paper a set of comprehensive evaluation criteria for affect-annotated databases is proposed. These criteria can be used for evaluation of the quality of a database on the stage of its creation as well as for evaluation and comparison of existing databases. The usefulness of these criteria is demonstrated on several databases selected from affect computing domain. The databases contain different kind of data: video or still...
-
Intelligent video and audio applications for learning enhancement
PublikacjaThe role of computers in school education is briefly discussed. Multimodal interfaces development history is shortly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality....
-
Intelligent multimedia solutions supporting special education needs.
PublikacjaThe role of computers in school education is briefly discussed. Multimodal interfaces development history is shortly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality....