total: 1043
showing the top 1000 results
Search results for: AUDIOVISUAL SPEECH RECOGNITION
-
Chirp Rate and Instantaneous Frequency Estimation: Application to Recursive Vertical Synchrosqueezing
PublikacjaThis letter introduces new chirp rate and instantaneous frequency estimators designed for frequency-modulated signals. These estimators are first investigated from a deterministic point of view, then compared together in terms of statistical efficiency. They are also used to design new recursive versions of the vertically synchrosqueezed short-time Fourier transform, using a previously published method (D. Fourer, F. Auger, and...
-
Julita Wasilczuk dr hab.
OsobyBorn on 5 April 1965 in Gdańsk. From 1987 to 1991 she studied at the Faculty of Transport Economics of the University of Gdańsk (now the Faculty of Economics). Since 1993 she has been employed as an assistant at the newly established Faculty of Management and Economics (WZiE) of Gdańsk University of Technology. In 1997 she obtained a doctorate in economics at WZiE, and in 2006 a habilitation in economics in the discipline of management science,...
-
Distributed Representations Based on Geometric Algebra: the Continuous Model
PublikacjaAuthors revise the concept of a distributed representation of data as well as two previously developed models: Holographic Reduced Representation (HRR) and Binary Spatter Codes (BSC). A Geometric Analogue (GAc - ''c'' stands for continuous as opposed to its discrete version) of HRR is introduced - it employs role-filler binding based on geometric products. Atomic objects are real-valued vectors in n-dimensional Euclidean space...
-
Validating data acquired with experimental multimodal biometric system installed in bank branches
PublikacjaAn experimental system was engineered and implemented in 100 copies inside a real banking environment comprising: dynamic handwritten signature verification, face recognition, bank client voice recognition and hand vein distribution verification. The main purpose of the presented research was to analyze questionnaire responses reflecting user opinions on: comfort, ergonomics, intuitiveness and other aspects of the biometric enrollment...
-
Improving the Accuracy in Sentiment Classification in the Light of Modelling the Latent Semantic Relations
PublikacjaThe research presents the methodology of improving the accuracy in sentiment classification in the light of modelling the latent semantic relations (LSR). The objective of this methodology is to find ways of eliminating the limitations of the discriminant and probabilistic methods for LSR revealing and customizing the sentiment classification process (SCP) to the more accurate recognition of text tonality. This objective was achieved...
-
Comparison of Methods for Real and Imaginary Motion Classification from EEG Signals
PublikacjaA method for feature extraction and results of classification of EEG signals obtained from performed and imagined motion are presented. A set of 615 features was obtained to serve for the recognition of type and laterality of motion using 8 different classification approaches. A comparison of the accuracy achieved by the classifiers is presented in the paper, and then conclusions and discussion are provided. Among applied algorithms the...
-
Activated Sludge Process Development
PublikacjaThis paper summarizes the most significant steps in the activated sludge process development and recognizes key contributors. Recognition of the roles of oxygen and living organisms was the first step (1882-1914). Ardern and Lockett (1914) named the accumulated solids "activated sludge". The process was rapidly accepted and applied in the period 1914-1930. The most dramatic changes in the activated sludge process understanding and...
-
AUGMENTATION OF THE CRITICAL HEAT FLUX IN WATER-Al2O3, WATER-TiO2 AND WATER-Cu NANOFLUIDS
PublikacjaThe main aim of the proposed study is therefore recognition of the phenomena accompanying nucleate boiling crisis of selected nanofluids during boiling on horizontal tubes of various outside diameters. Of particular interest is impact of contact angle and tube diameter on the value of critical heat flux. The results obtained should give more light on the nature of nucleate boiling crisis and will serve as a basis for future theoretical...
-
Ultrawideband transmission in physical channels: a broadband interference view
PublikacjaThe superposition of multipath components (MPC) of an emitted wave, formed by reflections from limiting surfaces and obstacles in the propagation area, strongly affects communication signals. In the case of modern wideband systems, the effect should be seen as a broadband counterpart of classical interference which is the cause of fading in narrowband systems. This paper shows that in wideband communications, the time- and frequency-domain...
-
Determining Pronunciation Differences in English Allophones Utilizing Audio Signal Parameterization
PublikacjaAn allophonic description of English plosive consonants, based on audio-visual recordings of 600 specially selected words, was developed. First, several speakers were recorded while reading words from a teleprompter. Then, every word was played back from the previously recorded sample read by a phonology expert and each examined speaker repeated a particular word trying to imitate correct pronunciation. The next step consisted...
-
Biometryczna kontrola dostępu
PublikacjaAn algorithm for detecting and identifying a person based on facial nodal points is described in detail. The terms biometrics, biometric measurement process, biometric identification methods and access control are defined. The developed biometric identification system, which uses artificial neural networks, is described. Test results are given and discussed in depth. Biometrics is the study of automated...
-
Towards Contactless, Hand Gestures-Based Control of Devices
PublikacjaGesture-based intuitive interactions with electronic devices can be an important part of smart home systems. In this paper, we adapt the contactless linear gesture sensor for the navigation of a smart lighting system. The set of handled gestures allows us to propose two methods of active light source selection, continuous dimming, and turning on and off based on discrete gestures. The average gesture recognition accuracy was 97.58% in the...
-
Robust Object Detection with Multi-input Multi-output Faster R-CNN
PublikacjaRecent years have seen impressive progress in visual recognition on many benchmarks, however, generalization to the out-of-distribution setting remains a significant challenge. A state-of-the-art method for robust visual recognition is model ensembling. However, recently it was shown that similarly competitive results could be achieved with a much smaller cost, by using multi-input multi-output architecture (MIMO). In this work,...
-
Using MusicXML to evaluate accuracy of OMR Systems
PublikacjaIn this paper a methodology for automatic accuracy evaluation in optical music recognition (OMR) applications is proposed. Presented approach assumes using ground truth images together with digital music scores describing their content. The automatic evaluation algorithm measures differences between the tested score and the reference one, both stored in MusicXML format. Some preliminary test results of this approach are presented...
-
Affective reactions to playing digital games
PublikacjaThe paper presents a study of emotional states during gameplay. An experiment with a two-player Tetris game is reported, followed by the analysis of the results - self-reported emotional states as well as interpretation of physiological signal measurements. The study reveals the diversity of emotional reactions and concludes that a representative player's emotional model is hard to define. Instead, an adaptive approach to emotion...
-
Detection and localization of selected acoustic events in acoustic field for smart surveillance applications
PublikacjaA method for automatic determination of position of chosen sound events such as speech signals and impulse sounds in 3-dimensional space is presented. The events are localized in the presence of sound reflections employing acoustic vector sensors. Human voice and impulsive sounds are detected using adaptive detectors based on modified peak-valley difference (PVD) parameter and sound pressure level. Localization based on signals...
-
Quality Evaluation of Novel DTD Algorithm Based on Audio Watermarking
PublikacjaEcho cancellers typically employ a doubletalk detection (DTD) algorithm in order to keep the adaptive filter from diverging in the presence of near-end speech signal or other disruptive sounds in the microphone signal. A novel doubletalk detection algorithm based on techniques similar to those used for audio signal watermarking was introduced by the authors. The application of the described DTD algorithm within acoustic echo cancellation...
-
Detection and localization of selected acoustic events in 3D acoustic field for smart surveillance applications
PublikacjaA method for automatic determination of position of chosen sound events such as speech signals and impulse sounds in 3-dimensional space is presented. The events are localized in the presence of sound reflections employing acoustic vector sensors. Human voice and impulsive sounds are detected using adaptive detectors based on modified peak-valley difference (PVD) parameter and sound pressure level. Localization based on signals...
-
A Novel Approach to the Assessment of Cough Incidence
PublikacjaIn this paper we consider the problem of identification of cough events in patients suffering from chronic respiratory diseases. The information about frequency of cough events is necessary for medical treatment. The proposed approach is based on bidirectional processing of a measured vibration signal - cough events are localized by combining the results of forward-time and backward-time analysis. The signal is at first transformed...
-
Extracting concepts from the software requirements specification using natural language processing
PublikacjaExtracting concepts from the software requirements is one of the first steps on the way to automating the software development process. This task is difficult due to the ambiguity of the natural language used to express the requirements specification. The methods used so far consist mainly of statistical analysis of words and matching expressions with a specific ontology of the domain in which the planned software will be applicable....
-
Several notes on technical modernization of a historical building
PublikacjaThis paper describes methods and means used during the renovation process of several elements of a building – the assembly hall, inner courtyards and attics of the main building at Gdansk University of Technology, both in design and construction domains. A tissue of a historic building required special approach to these tasks, with special emphasis on good recognition of problems, careful planning and consistent implementing of...
-
TspGWI, a thermophilic class-IIS restriction endonuclease from Thermus sp., recognizes novel asymmetric sequence 5´-ACGGA(N11/9)-3´
PublikacjaA novel prototype class-IIS restriction endonuclease, TspGWI, was isolated from the thermophilic bacterium Thermus sp. GW. The recognition sequence and cleavage positions have been established: TspGWI recognizes the non-palindromic 5-bp sequence 5′-ACGGA-3′ and cleaves the DNA 11 and 9 nt downstream in the top and bottom strand, respectively. In addition, an accompanying endonuclease, TspGWII, an isoschizomer of Pst I, was found...
-
Identification of acoustic event of selected noise sources in a long-term environmental monitoring systems
PublikacjaUndertaking long-term acoustic measurements on sites located near an airport is related to a problem of large quantities of recorded data, which very often represents information not related to flight operations. In such areas, usually defined as zones of limited use, other sources of noise often exist, such as roads or railway lines, treated in such context as acoustic background. Manual verification of such recorded data...
-
THE POSSIBILITIES OF ESTIMATING THE RELIABILITY OF SHIP PIPELINES’ ELEMENTS INCLUDING DESTRUCTIVE PHENOMENA ACTING ON THEM
PublikacjaIn the article an approach to the problem of estimating reliability data based on physical models is proposed. The possibility of reliability assessment for selected elements of ship pipelines, based on the recognition of the destructive physical phenomena taking place in them, is discussed. To do this, an overview of these phenomena has been made. In addition, a preliminary review of existing measures of destruction of materials...
-
Design Elements of Affect Aware Video Games
PublikacjaIn this paper issues of design and development process of affect-aware video games are presented. Several important design aspects of such games are pointed out. A concept of a middleware framework is proposed that separates the development of affect-aware video games from emotion recognition algorithms and support from input sensors. Finally, two prototype affect-aware video games are presented that conform to the presented architecture...
-
Knowledge representation of motor activity of patients with Parkinson’s disease
PublikacjaAn approach to the knowledge representation extraction from biomedical signals analysis concerning motor activity of Parkinson disease patients is proposed in this paper. This is done utilizing accelerometers attached to their body as well as exploiting video image of their hand movements. Experiments are carried out employing artificial neural networks and support vector machine to the recognition of characteristic motor activity...
-
Musical Instrument Separation Applied to Music Genre Classification. Separacja instrumentów muzycznych w zastosowaniu do rozpoznawania gatunków muzycznych
PublikacjaThis paper outlines first issues related to music genre classification and a short description of algorithms used for musical instrument separation. Also, the paper presents proposed optimization of the feature vectors used for music genre recognition. Then, the ability of decision algorithms to properly recognize music genres is discussed based on two databases. In addition, results are cited for another database with regard to...
-
Towards better understanding of context-aware knowledge transformation
PublikacjaConsidering different aspects of knowledge functioning, context is poorly understood in spite of intuitively identifying this concept with environmental recognition. For dynamic knowledge, context especially seems to be an essential factor of change. Investigation on the impact of context on knowledge dynamics or more generally on the relationship between knowledge and its contextual interpretation is important in order to understand...
-
Non-Contact Temperature Measurements Dataset
PublikacjaThe dataset titled The influence of the distance of the pyrometer from the surface of the radiating object on the accuracy of measurements contains temperature measurements using a selection of four commercially available pyrometers (CHY 314P, TM-F03B, TFA 31.1125 and AB-8855) as a function of the measuring distance. The dataset allows a comparison of the accuracy and measuring precision of the devices, which are very important...
-
Two-step mechanism of J-domain action in driving Hsp70 function
PublikacjaJ-domain proteins (JDPs), obligatory Hsp70 cochaperones, play critical roles in protein homeostasis. They promote key allosteric transitions that stabilize Hsp70 interaction with substrate polypeptides upon hydrolysis of its bound ATP. Although a recent crystal structure revealed the physical mode of interaction between a J-domain and an Hsp70, the structural and dynamic consequences of J-domain action once bound and how Hsp70s...
-
Quantifying inconsistencies in the Hamburg Sign Language Notation System
PublikacjaThe advent of machine learning (ML) has significantly advanced the recognition and translation of sign languages, bridging communication gaps for hearing-impaired communities. At the heart of these technologies is data labeling, crucial for training ML algorithms on a huge amount of consistently labeled data to achieve models that generalize well. The adoption of language-agnostic annotations is essential to connect different sign...
-
Szymon Olewniczak mgr inż.
OsobyI have been associated with Gdańsk University of Technology since 2013, when I began engineering studies in computer science at the Faculty of Electronics, Telecommunications and Informatics. After obtaining my master's degree in 2019, I started working as an assistant in the Department of Computer Systems Architecture. Since 2024 I have also served as deputy head of the department. My research interests focus on topics related to processing...
-
Impact of the glazed roof on acoustics of historic interiors
PublikacjaThe paper discusses the adverse acoustic phenomena occurring in semi-open interiors (courtyards, yards) covered with a glass roof. Particularly negative is the reverberation noise, which leads to the degradation of the utility functions of the resulting spaces. It involves a drastic reduction in the intelligibility of speech, loss of natural sounding of music, problems with the sound system, as well as disturbances in the...
-
Cross-domain applications of multimodal human-computer interfaces
PublikacjaDeveloped multimodal interfaces for education applications and for disabled people are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with mouth gestures and audio interface for speech stretching for hearing impaired and stuttering people and intelligent pen allowing for diagnosing and ameliorating developmental dyslexia. The eye-gaze tracking system named...
-
Subjective and Objective Comparative Study of DAB+ Broadcast System
PublikacjaBroadcasting services seek to optimize their use of bandwidth in order to maximize user’s quality of experience. They aim to transmit high-quality digital speech and music signals at the lowest bitrate. They intend to offer the best quality under available conditions. Due to bandwidth limitations, audio quality is in conflict with the number of transmitted radio programs. This paper analyzes whether the quality of real-time digital...
-
Analysis of a caustic formed by a spherical reflector: Impact of a caustic on architectural acoustics
PublikacjaFocusing sound in rooms intended for listening to music or speech is an acoustic defect. Design recommendations provide remedial steps to effectively prevent this. However, there is a category of objects of high historical or architectural value in which the sound focus correction is limited or even abandoned. This also applies to indoor or outdoor concert shells, installations for teaching and acoustic presentations, etc. The...
-
Report of the ISMIS 2011 Contest : Music Information Retrieval
PublikacjaThis report presents an overview of the data mining contest organized in conjunction with the 19th International Symposium on Methodologies for Intelligent Systems (ISMIS 2011), held between Jan 10 and Mar 21, 2011, on the TunedIT competition platform. The contest consisted of two independent tasks, both related to music information retrieval: recognition of music genres and recognition of instruments, for a given music sample represented...
-
PCR detection of Scopulariopsis brevicaulis
PublikacjaScopulariopsis brevicaulis is known as one of the most common etiological factors of mould toenail infections. There are also reports indicating that S. brevicaulis could cause organ and disseminated infections. Nowadays microscopic observations from the direct sample and culture are crucial for the appropriate recognition of the infection. This paper presents a PCR-based method for S. brevicaulis detection. The specificity of...
-
Poly-L-Lysine-modified boron-doped diamond electrodes for the amperometric detection of nucleic acid bases
PublikacjaBoron-doped diamond (BDD) is a very promising supporting material used in the construction of biosensors for molecular recognition. The direct immobilization of structurally-organized huge molecules, such as poly-L-Lysine (PLL) provides the possibility of determining organic molecules, e.g. nucleic acid bases (e.g. adenine, guanine) or peptides and proteins. This paper describes the direct method for chemical and electrochemical...
-
Thermal imaging in automatic rodent’s social behaviour analysis
PublikacjaLaboratory rodent social behaviour analysis is an extremely important task for biological, medical and pharmacological research. In this work, thermal image features that facilitate analysis are presented. Methods to distinguish objects on the basis of thermal distribution are tested. Actions of grooming or biting one rodent by another - important social behaviour incidents - are clearly visible...
-
Hostility bias or sadness bias in excluded individuals: Does anodal transcranial direct current stimulation of right VLPFC vs. left DLPFC have a mitigating effect?
PublikacjaExclusion has multiple adverse effects on individual’s well-being. It induces anger and hostile cognitions leading to aggressive behavior. The purpose of this study was to test whether exclusion would affect recognition of anger on ambivalent faces of the excluders. We hypothesized that exclusion would elicit more anger encoding (hostility bias) than inclusion, but this effect would be mitigated by anodal tDCS of right VLPFC...
-
Recognizing emotions on the basis of keystroke dynamics
PublikacjaThe article describes research on recognizing emotional states on the basis of keystroke dynamics. An overview of various studies and applications of emotion recognition based on data coming from the keyboard is presented. Then, the idea of an experiment is presented, i.e. the way of collecting and labeling training data, extracting features and finally training classifiers. Different classification approaches are proposed to be...
-
Techniques of acquiring additional features of the responses of individual gas sensors
PublikacjaGas sensors usually exhibit lack of selectivity, require frequent calibration, exhibit drift of the response and a lot of factors, such as humidity or ambient temperature, influence their performance. Different approaches can be used to overcome these shortcomings. Building arrays of different sensors and usage of pattern recognition methods to analyze responses of elements...
-
AffecTube — Chrome extension for YouTube video affective annotations
PublikacjaThe shortage of emotion-annotated video datasets suitable for training and validating machine learning models for facial expression-based emotion recognition stems primarily from the significant effort and cost required for manual annotation. In this paper, we present AffecTube as a comprehensive solution that leverages crowdsourcing to annotate videos directly on the YouTube platform, resulting in ready-to-use emotion-annotated...
-
Przegląd rodzajów chiralnych faz stacjonarnych oraz możliwości ich zastosowań w chromatografii cieczowej
PublikacjaThe chromatographic separation of optically active compounds is of great importance not only in the pharmaceutical industry but also in the agrochemical industry, as well as in scientific research of various kinds. This study characterizes commercially available chiral stationary phases based on cyclodextrins, polysaccharides, macrocyclic antibiotics and crown ethers, as well as protein, ligand-exchange and ion-exchange phases, and phases...
-
Smartphone application supporting independent movement of the blind
PublikacjaImproving comfort of life of blind people is a problem of great importance. Neither a white cane nor a guide dog, although both very useful, can be considered as a tool for achieving full independence in everyday movement around the city. On the market there are some navigation tools inspired by car navigation systems, but they have many flaws, ranging from positioning inaccuracies to high prices. The authors present their own solution...
-
ANALIZA PARAMETRÓW SYGNAŁU MOWY W KONTEKŚCIE ICH PRZYDATNOŚCI W AUTOMATYCZNEJ OCENIE JAKOŚCI EKSPRESJI ŚPIEWU
PublikacjaThe work concerns an approach to parameterization for emotion classification in singing and its comparison with emotion classification in speech. For this purpose, RAVDESS (Ryerson Audio-Visual Database of Emotional Speech and Song), a database of emotionally expressive speech and song, was used; it contains recordings of professional actors presenting six different emotions. Next, mel-cepstral coefficients (MFCC) were computed, along with selected descriptors...
-
Pursuing Analytically the Influence of Hearing Aid Use on Auditory Perception in Various Acoustic Situations
PublikacjaThe paper presents the development of a method for assessing auditory perception and the effectiveness of applying hearing aids for hard-of-hearing people during short-term (up to 7 days) and longer-term (up to 3 months) use. The method consists of a survey based on the APHAB questionnaire. Additional criteria such as the degree of hearing loss, technological level of hearing aids used, as well as the user experience are taken...
-
Highlighting interlanguage phoneme differences based on similarity matrices and convolutional neural network
PublikacjaThe goal of this research is to find a way of highlighting the acoustic differences between consonant phonemes of the Polish and Lithuanian languages. For this purpose, similarity matrices are employed based on speech acoustic parameters combined with a convolutional neural network (CNN). In the first experiment, we compare the effectiveness of the similarity matrices applied to discerning acoustic differences between consonant...