Search results for: audio-visual speech recognition system - Bridge of Knowledge

Search

Search results for: audio-visual speech recognition system

Search results for: audio-visual speech recognition system

  • Further Developments of the Online Sound Restoration System for Digital Library Applications

    Publication

    New signal processing algorithms were introduced to the online service for audio restoration available at the web address: www.youarchive.net. Missing or distorted audio samples are estimated using a specific implementation of the Jannsen interpolation method. The algorithm is based on the autoregressive model (AR) combined with the iterative complementation of signal samples. Since the interpolation algorithm is computationally...

    Full text to download in external service

  • Towards Audio Signal Equalization Based on Spectral Characteristics of a Listening Room and Music Content Reproduced

    Publication

    - Year 2018

    This study presents investigations of the influence of the room acoustics on the frequency characteristic of the audio signal playback. First, the concept of a novel spectral equalization method of the room acoustic conditions is introduced. On the basis of the room spectral response, a system for room acoustics compensation based on an equalizer designed is proposed. The system settings depend on music genre recognized automatically....

    Full text to download in external service

  • Music Genre Recognition in the Rough Set-Based Environment

    Publication

    - Year 2015

    The aim of this paper is to investigate music genre recognition in the rough set-based environment. Experiments involve a parameterized music data-base containing 1100 music excerpts. The database is divided into 11 classes cor-responding to music genres. Tests are conducted using the Rough Set Exploration System (RSES), a toolset for analyzing data with the use of methods based on the rough set theory. Classification effectiveness...

    Full text to download in external service

  • Stradar - Multimedia Dispatcher and Teleinformation System for the Border Guard

    Security of national borders requires utilization of multimedia surveillance systems automatically gathering, processing and sharing various data. The paper presents such a system developed for the Maritime Division of the Polish Border Guard within the STRADAR project. The system, apart from providing communication means, gathers data, such as map data from AIS, GPS and radar receivers, videos and photos from camera or audio from...

    Full text available to download

  • Communication Platform for Evaluation of Transmitted Speech Quality

    A voice communication system designed and implemented is described. The purpose of the presented platform was to enable a series of experiments related to the quality assessment of algorithms used in the coding and transmitting of speech. The system is equipped with tools for recording signals at each stage of processing, making it possible to subject them to subjective assessments by listening tests or, objective evaluation employing...

    Full text available to download

  • An electronic nose for quantitative determination of gas concentrations

    Publication

    The practical application of human nose for fragrance recognition is severely limited by the fact that our sense of smell is subjective and gets tired easily. Consequen tly, there is considerable need for an instrument that can be a substitution of the human sense of smell. Electronic nose devices from the mid 1980s are used in growing number of applications. They comprise an array of several electrochemical gas sensors...

    Full text to download in external service

  • Agnieszka Landowska dr hab. inż.

    Agnieszka Landowska works for Gdansk University of Technology, FETI, Department of Software Engineering.  Her research concentrates on usability, accessibility and technology adoption, as well as affective computing methods. She initiated Emotions in HCI Research Group and conducts resarch on User eXperiene evaluation of applications and other technologies.

  • Dependable Integration of Medical Image Recognition Components

    Computer driven medical image recognition may support medical doctors in the diagnosis process, but requires high dependability considering potential consequences of incorrect results. The paper presentsa system that improves dependability of medical image recognition by integration of results from redundant components. The components implement alternative recognition algorithms of diseases in thefield of gastrointestinal endoscopy....

  • Anita Maria Dąbrowicz-Tlałka dr

    Anita Dąbrowicz-Tlałka graduated from the Faculty of Mathematics and Physics at the University of Gdańsk with an outstanding grade, having written her thesis in the field of geometric topology. She concurrently obtained a diploma in Postgraduate Studies in the Basics of Computer Science at the University of Gdańsk. In 2001 she received a Ph.D. degree in mathematical studies at the Poznań University of Technology after defending...

  • Separability Assessment of Selected Types of Vehicle-Associated Noise

    Music Information Retrieval (MIR) area as well as development of speech and environmental information recognition techniques brought various tools in-tended for recognizing low-level features of acoustic signals based on a set of calculated parameters. In this study, the MIRtoolbox MATLAB tool, designed for music parameter extraction, is used to obtain a vector of parameters to check whether they are suitable for separation of...

    Full text to download in external service

  • Performance Analysis of the OpenCL Environment on Mobile Platforms

    Publication

    Today’s smartphones have more and more features that so far were only assigned to personal computers. Every year these devices are composed of better and more efficient components. Everything indicates that modern smartphones are replacing ordinary computers in various activities. High computing power is required for tasks such as image processing, speech recognition and object detection. This paper analyses the performance of...

    Full text to download in external service

  • Wykorzystanie sztucznych sieci neuronowych do wykrywania i rozpoznawania tablic rejestracyjnych na zdjęciach pojazdów

    W artykule przedstawiono koncepcję algorytmu wykrywania i rozpoznawania tablic rejestracyjnych (AWiRTR) na obrazach cyfrowych pojazdów. Detekcja i lokalizacja tablic rejestracyjnych oraz wyodrębnienie z obrazu tablicy rejestracyjnej poszczególnych znaków odbywa się z wykorzystaniem podstawowych technik przetwarzania obrazu (przekształcenia morfologiczne, wykrywanie krawędzi) jak i podstawowych danych statystycznych obiektów wykrytych...

    Full text available to download

  • Interactions with recognized objects

    Publication

    - Year 2014

    Implicit interaction combined with object recognition techniques opens a new possibility for gathering data and analyzing user behavior for activity and context recognition. The electronic eyewear platform, eGlasses, is being developed, as an integrated and autonomous system to provide interactions with smart environment. In this paper we present a method for the interactions with the recognized objects that can be used for electronic...

    Full text to download in external service

  • Creating a Remote Choir Performance Recording Based on an Ambisonic Approach

    Publication

    The aim of this paper is three-fold. First, the basics of binaural and ambisonic techniques are briefly presented. Then, details related to audio-visual recordings of a remote performance of the Academic Choir of the Gdańsk University of Technology are shown. Due to the COVID-19 pandemic, artists had a choice, namely, to stay at home and not perform or stay at home and perform. In fact, staying at home brought in the possibility...

    Full text available to download

  • Evaluation of Face Detection Algorithms for the Bank Client Identity Verification

    Results of investigation of face detection algorithms efficiency in the banking client visual verification system are presented. The video recordings were made in real conditions met in three bank operating outlets employing a miniature industrial USB camera. The aim of the experiments was to check the practical usability of the face detection method in the biometric bank client verification system. The main assumption was to provide...

    Full text available to download

  • A Visual Method of Measuring Railway-Track Weed Infestation Level

    Publication

    This paper concerns the assessment of railway track surface conditions in relation to the degree of weed infestation. The paper conceptually describes the proposed method using a visual system to analyse weed infestation level. The use of image analysis software for weed detection is also proposed. This new measurement method allows for a mobile assessment of the track’s weed infestation status. Validation of the assessment method...

    Full text available to download

  • Szymon Olewniczak mgr inż.

    People

    I've been a part of the Gdansk University of Technology since 2013, when I started my bachelor's degree in computer science at the Faculty of Electronics, Telecommunications and Informatics. After receiving my master's degree in 2019, I've been working as an assistant at the Department of Computer Architecture. Since 2024, I am also the deputy head of my department. My research interests revolve around various NLP related topics,...

  • Nowa metoda diagnostyki stanu technicznego nakładek stykowych

    Publication

    The current collection system, which consists of the overhead contact line and a current collector, is particularly important in electric rail vehicles, where their reliability is concerned. Faultless current collection is conditioned not only by suitable construction of these elements but also by their proper maintenance. Retaining permanent electrical contact is essential in DC systems, where current demand is relatively high. In...

  • An electronic nose based on the semiconducting and electrochemical gas sensors

    The practical application of human nose for fragrance recognition is severely limited by the fact that our sense of smell is subjective and gets tired easily. Consequently, there is a significant need for an instrument that can be a substitution of the human sense of smell. Development of an electronic nose devices is an active area of research starting from pioneering research of Dodd and Persuad in the mid-1980s. Such systems...

    Full text to download in external service

  • Moving object detection and tracking for the purpose of multimodal surveillance system in urban areas

    Publication

    - Year 2008

    Background subtraction method based on mixture of Gaussians was employed to detect all regions in a video frame denoting moving objects. Kalman filters were used for establishing relations between the regions and real moving objects in a scene and for tracking them continuously. The objects were represented by rectangles. The objects coupling with adequate regions including the relation of many-to-many was studied experimentally...

  • From Knowledge based Vision Systems to Cognitive Vision Systems: A Review

    Publication

    - Year 2018

    Computer vision research and applications have their origins in 1960s. Limitations in computational resources inherent of that time, among other reasons, caused research to move away from artificial intelligence and generic recognition goals to accomplish simple tasks for constrained scenarios. In the past decades, the development in machine learning techniques has contributed to noteworthy progress in vision systems. However,...

    Full text available to download

  • Diagnostic testing of marine propulsion systems with internal combustion engines by means of vibration measurement and results analysis

    Publication

    In this paper selected issues concerning vibration diagnosis of the mechanical system within marine propulsion units have been presented, carried out on the basis of experimental examinations of a real object in which an exceedance of the allowable vibration’s level had been observed. Used diagnosing system has been characterised. A procedure of longitudinal and transverse vibrations shaft lines of the mechanical system within...

    Full text available to download

  • DAB vs DAB+ Radio Broadcasting: a Subjective Comparative Study

    Publication

    In the age of digital media, delivering high quality content to consumers is one of the most demanding tasks. There exist numerous broadcasting standards, with different pros and cons, and the DAB/DAB (Digital Audio Broadcasting) system is one of the most popular among them. From an engineer’s perspective, efficient resource management under limited bandwidth conditions has always been a challenge. In this paper a subjective quality...

    Full text available to download

  • AUDIO SIGNAL EQUALIZATION BASED ON IMPULSE RESPONSE OF A LISTENING ROOM AND MUSIC CONTENT REPRODUCED

    A research study presents investigations of the influence of the room acoustics on the frequency characteristic of the audio signal playback. First, a concept of a novel spectral equalization method of the room acoustic conditions is introduced. On the basis of the room spectral response, a system for room acoustics compensation based on an equalizer designed is proposed. The system settings depend on music genre recognized automatically....

  • Towards More Realistic Probabilistic Models for Data Structures: The External Path Length in Tries under the Markov Model

    Publication

    - Year 2013

    Tries are among the most versatile and widely used data structures on words. They are pertinent to the (internal) structure of (stored) words and several splitting procedures used in diverse contexts ranging from document taxonomy to IP addresses lookup, from data compression (i.e., Lempel- Ziv'77 scheme) to dynamic hashing, from partial-match queries to speech recognition, from leader election algorithms to distributed hashing...

  • In-service measurement of the small wind turbine test stand for structural health monitoring

    Publication

    - Year 2013

    This paper presents the research activity performed on a Small Wind Turbine (SWT) test stand. Commercially available turbine was modified towards incorporation of the sensors system for condition monitoring. Installed sensors measure angular shaft position, torque applied from the wind loads, vibration accelerations and last but not least rotational speed. All gathered data are then transferred and processed in Test.Lab by means...

    Full text to download in external service

  • Smartphone application supporting independent movement of the blind

    Improving comfort of life of blind people is a problem of great importance. Neither a white canenor a guide dog, although both very useful, can be considered as a tool for achieving fullindependence in everyday movement around the city. On the market there are some navigation toolsinspired by car navigation systems, but they have many flaws, ranging from positioninginaccuracies to high prices. The authors present their own solution...

  • Performance of Watermarking-based DTD Algorithm Under Time-varying Echo Path Conditions

    Publication

    A novel double-talk detection (DTD) algorithm based on techniques similar to those used for audio signal watermarking was introduced by the authors. The application of the described DTD algorithm within acoustic echo cancellation system is presented. The problem of DTD robustness to time-varying conditions of acoustic echo path is discussed and explanation as to why such conditions occur in practical situations is provided. The...

  • Robustness analysis of watermarking-based dtd algorithm under time-variable echo conditions

    Publication

    A novel double-talk detection (DTD) algorithm based on techniques similar to those used for audio signal watermarking was introduced by the authors. The application of the described DTD algorithm within acoustic echo cancellation system is presented. The problem of DTD robustness to time-varying conditions of acoustic echo path is discussed and explanation as to why such conditions occur in practical situations is provided. The...

  • Influence of the Delay in Monitor System on the Motor Coordination of Musicians while Performing

    This paper provides a description and results of measurements of the maximum acceptable value of delay tolerated by a musician, while playing an instrument, that does not cause de-synchronization and discomfort. First, methodology of measurements comprising audio recording and a fast camera is described. Then, themeasurement procedure for acquiring the maximum value of delay conditioning...

    Full text to download in external service

  • TRANSMISJA GŁOSOWYCH KOMUNIKATÓW DROGOWYCH W RADIOFONII CYFROWEJ DAB+

    Publication

    Proces cyfryzacji radia jest nowym rozdziałem w historii radiofonii. Wiele rekomendacji i badań naukowych wskazuje na standard DAB+ (Digital Audio Broadcasting plus), który w niedalekiej przyszłości ma zastąpić analogową radiofonię FM. Ten system cyfrowy wprowadza wiele zmian, oferując przy tym lepszą jakość dźwięku oraz szereg usług dodatkowych. W pracy postanowiono zbadać minimalną wymaganą przepływność bitową potrzebną do transmisji...

    Full text to download in external service

  • Image Representation for Cognitive Systems Using SOEKS and DDNA: A Case Study for PPE Compliance

    Publication

    - Year 2020

    Cognitive Vision Systems have gained significant interest from academia and industry during the past few decade, and one of the main reasons behind this is the potential of such technologies to revolutionize human life as they intend to work under complex visual scenes, adapting to a comprehensive range of unforeseen changes, and exhibiting prospective behavior. The combination of these properties aims to mimic the human capabilities...

    Full text available to download

  • On practical application of Shannon theory to character recognition and more

    Publication

    - Year 2014

    Let us consider an optical character recognition system, which in particular can be used for identifying objects that were assigned strings of some length. The system is not perfect, for example, it sometimes recognizes wrongly the characters "Y" and "V". What is the largest set of strings of given length for the system under consideration, which can be mutually correctly recognized, and the corresponding objects correctly identified?...

  • A Device for Measuring Auditory Brainstem Responses to Audio

    Publication

    Standard ABR devices use clicks and tone bursts to assess subjects’ hearing in an objective way. A new device was developed that extends the functionality of a standard ABR audiometer by collecting and analyzing auditory brainstem responses (ABR). The developed accessory allows for the use of complex sounds (e.g., speech or music excerpts) as stimuli. Therefore, it is possible to find out how efficiently different types of sounds...

    Full text available to download

  • Consciousness Study of Subjects with Unresponsive Wakefulness Syndrome Employing Multimodal Interfaces

    Publication

    - Year 2014

    The paper presents a novel multimodal-based methodology for consciousness study of individuals with unresponsive wakefulness syndrome. Two interfaces were employed in the experiments: eye gaze tracking system – CyberEye developed at the Multimedia Systems Department, and EEG device with electrode placement in the international 10-20 standard. It was a pilot study for checking if it is possible to determine objective methods based...

    Full text to download in external service

  • Design and Evaluation of the Platform for Weight-Shifting Exercises with Compensatory Forces Monitoring

    Details of a platform for the rehabilitation of people with severe balance impairment are discussed in the paper. Based upon a commercially available static parapodium, modified to fit force sensors, this device is designed to give a new, safe tool to physiotherapists. It is designed for the patients who cannot maintain equilibrium during a bipedal stance and need to hold to or lean on something during the rehabilitation. Visual,...

    Full text available to download

  • Karolina Zielińska-Dąbkowska dr inż. arch.

    Karolina M. Zielinska-Dabkowska, Ph.D., Eng. Arch., M. Arch., is an Assistant Professor at the Faculty of Architecture of Gdańsk University of Technology (GUT). In 2002, she completed her studies of Architecture and Urban Planning at Gdańsk University of Technology (Gdańsk Tech) and in 2004, Architectural Engineering at the University of Applied Sciences and Arts (HAWK) in Hildesheim, Germany. After graduation, she worked for several...

  • Impact of the glazed roof on acoustics of historic interiors

    Publication

    - Year 2018

    The paper discusses the adverse acoustic phenomena occurring in the semi-open interiors (courtyards, yards) covered with a glass roof. Particularly negative is the rever-beration noise, which leads to the degradation of the utility functions of the resulting spaces. It involves the drastically reducing the intelligibility of speech, loss of natural sounding of music, problems with the sound system, as well as disturbances in the...

  • Developing a Framework for the Implementation of Landscape and Greenspace Indicators in Sustainable Urban Planning. Waterfront Landscape Management: Case Studies in Gdańsk, Poznań and Bristol

    Publication

    - Sustainability - Year 2019

    Urban landscape (UL) management and urban greenspace (UG) delivery require effective planning tools. The aim of the study is to develop a conceptual framework for the implementation of ecological, structural and visual landscape and greenspace indicators (LGI) in spatial development of urban areas. The UL and UG management provisions in Poland are identified at various levels of urban planning (local, municipal and regional). Furthermore,...

    Full text available to download

  • Uwierzytelnienie i autoryzacja w systemie STRADAR

    Przedstawiono rozwiązanie serwera uwierzytelnienia i autoryzacji (AA) w rozproszonym systemie STRADAR, udostępniającym funkcjonalności dla prowadzenia działań operacyjnych Morskiego Oddziału Straży Granicznej. System umożliwia prezentację na stanowisku wizualizacji zdarzeń (SWZ) bieżącej i archiwalnej sytuacji na mapie (AIS, radary), obrazu z kamer, zdjęć, notatek, rozmów telefonicznych oraz plików i wiadomości tekstowych (SMS)...

    Full text to download in external service

  • Emotion Recognition and Its Applications

    The paper proposes a set of research scenarios to be applied in four domains: software engineering, website customization, education and gaming. The goal of applying the scenarios is to assess the possibility of using emotion recognition methods in these areas. It also points out the problems of defining sets of emotions to be recognized in different applications, representing the defined emotional states, gathering the data and...

    Full text to download in external service

  • Waveguide model of the hearing aid earmold system

    Publication

    Background The earmold system of the Behind-The-Ear hearing aid is an acoustic system that modifies the spectrum of the propagated sound waves. Improper selection of the earmold system may result in deterioration of sound quality and speech intelligibility. Computer modeling methods may be useful in the process of hearing aid fitting, allowing physician to examine various earmold system configurations and choose the optimum one...

    Full text available to download

  • Waveguide model of the hearing aid earmold system

    Publication

    - Year 2006

    Background The earmold system of the Behind-The-Ear hearing aid is an acoustic system that modifies the spectrum of the propagated sound waves. Improper selection of the earmold system may result in deterioration of sound quality and speech intelligibility. Computer modeling methods may be useful in the process of hearing aid fitting, allowing physician to examine various earmold system configurations and choose the optimum one...

    Full text to download in external service

  • Virtual Whiteboard: A gesture-controlled pen-free tool emulating school whiteboard

    Publication

    In the paper the so-called Virtual Whiteboard is presented which may be an alternative solution for modern electronic whiteboards based on electronic pens and sensors. The presented tool enables the user to write, draw and handle whiteboard contents using his/her hands only. An additional equipment such as infrared diodes, infrared cameras or cyber gloves is not needed. The user's interaction with the Virtual Whiteboard computer...

  • Potential and Use of the Googlenet Ann for the Purposes of Inland Water Ships Classification

    Publication

    - Polish Maritime Research - Year 2020

    This article presents an analysis of the possibilities of using the pre-degraded GoogLeNet artificial neural network to classify inland vessels. Inland water authorities monitor the intensity of the vessels via CCTV. Such classification seems to be an improvement in their statutory tasks. The automatic classification of the inland vessels from video recording is a one of the main objectives of the Automatic Ship Recognition and...

    Full text available to download

  • Methodology of Affective Intervention Design for Intelligent Systems

    This paper concerns how intelligent systems should be designed to make adequate, valuable and natural affective interventions. The article proposes a process for choosing an affective intervention model for an intelligent system. The process consists of 10 activities that allow for step-by-step design of an affective feedback loop and takes into account the following factors: expected and desired emotional states, characteristics...

    Full text to download in external service

  • Emotion monitoring system for drivers

    This article describes a new approach to the issue of building a driver monitoring system. Actual systems focus, for example, on tracking eyelid and eyebrow movements that result from fatigue. We propose a different approach based on monitoring the state of emotions. Such a system assumes that by using the emotion model based on our own concept, referred to as the reverse Plutchik’s paraboloid of emotions, the recognition of emotions...

    Full text available to download

  • Zastosowanie metod eksploracji danych do analizy odpowiedzi czujników gazu

    Publication

    - Year 2018

    Zagadnienia poruszane w niniejszej rozprawie dotyczą zastosowania metod eksploracji danych do analizy odpowiedzi czujników gazu, umożliwiających poprawną identyfikację składu mieszaniny gazowej w elektronicznych systemach rozpoznawania gazu. Elektroniczne systemy rozpoznawania gazu to urządzenia wykorzystujące czujniki gazu oraz odpowiednio dobrane metody analizy danych pomiarowych, zdolne do określenia składu mierzonej mieszaniny...

    Full text available to download

  • Multi-Criteria Approach in Multifunctional Building Design Process

    Publication

    - Year 2017

    The paper presents new approach in multifunctional building design process. Publication defines problems related to the design of complex multifunctional buildings. Currently, contemporary urban areas are characterized by very intensive use of space. Today, buildings are being built bigger and contain more diverse functions to meet the needs of a large number of users in one capacity. The trends show the need for recognition of...

    Full text to download in external service

  • Visual Content Representation for Cognitive Systems: Towards Augmented Intelligence

    Publication

    - Year 2020

    Cognitive Vision Systems have gained significant attention from academia and industry during the past few decades. One of the main reasons behind this interest is the potential of such technologies to revolutionize human life since they intend to work robustly under complex visual scenes (which environmental conditions may vary), adapting to a comprehensive range of unforeseen changes, and exhibiting prospective behavior. The combination...

    Full text to download in external service