Filters
total: 30446
filtered: 8625
-
Catalog
Chosen catalog filters
displaying 1000 best results Help
Search results for: IMAGE CLASSIFICATION , TRAFFIC LIGHT RECOGNITION , VIDEO CLASSIFICATION , FSA METHOD , SHIFTING WINDOW
-
Bimodal Emotion Recognition Based on Vocal and Facial Features
PublicationEmotion recognition is a crucial aspect of human communication, with applications in fields such as psychology, education, and healthcare. Identifying emotions accurately is challenging, as people use a variety of signals to express and perceive emotions. In this study, we address the problem of multimodal emotion recognition using both audio and video signals, to develop a robust and reliable system that can recognize emotions...
-
AN ALGORITHM FOR PORTAL HYPERTENSIVE GASTROPATHY RECOGNITION ON THE ENDOSCOPIC RECORDINGS
PublicationSymptoms recognition of portal hypertensive gastropathy (PHG) can be done by analysing endoscopic recordings, but manual analysis done by physician may take a long time. This increases probability of missing some symptoms and automated methods may be applied to prevent that. In this paper a novel hybrid algorithm for recognition of early stage of portal hypertensive gastropathy is proposed. First image preprocessing is described....
-
Video Analytics-Based Algorithm for Monitoring Egress from Buildings
PublicationA concept and practical implementation of the algorithm for detecting of potentially dangerous situations of crowding in passages is presented. An example of such situation is a crush which may be caused by obstructed pedestrian pathway. Surveillance video camera signal analysis performed on line is employed in order to detect hold-ups near bottlenecks like doorways or staircases. The details of implemented algorithm which uses...
-
Uncertainty in emotion recognition
PublicationPurpose–The purpose of this paper is to explore uncertainty inherent in emotion recognition technologiesand the consequences resulting from that phenomenon.Design/methodology/approach–The paper is a general overview of the concept; however, it is basedon a meta-analysis of multiple experimental and observational studies performed over the past couple of years.Findings–The mainfinding of the paper might be summarized as follows:...
-
A Method and Device for 3D Recognition of Cutting Edge Micro Geometry
Publication -
Examining Classifiers Applied to Static Hand Gesture Recognition in Novel Sound Mixing System
PublicationThe main objective of the chapter is to present the methodology and results of examining various classifiers (Nearest Neighbor-like algorithm with non-nested generalization (NNge), Naive Bayes, C4.5 (J48), Random Tree, Random Forests, Artificial Neural Networks (Multilayer Perceptron), Support Vector Machine (SVM) used for static gesture recognition. A problem of effective gesture recognition is outlined in the context of the system...
-
Comparison of Deep Neural Network Learning Algorithms for Mars Terrain Image Segmentation
PublicationThis paper is dedicated to the topic of terrain recognition on Mars using advanced techniques based on the convolutional neural networks (CNN). The work on the project was conducted based on the set of 18K images collected by the Curiosity, Opportunity and Spirit rovers. The data were later processed by the model operating in a Python environment, utilizing Keras and Tensorflow repositories. The model benefits from the pretrained...
-
Gesture Recognition With the Linear Optical Sensor and Recurrent Neural Networks
PublicationIn this paper, the optical linear sensor, a representative of low-resolution sensors, was investigated in the multiclass recognition of near-field hand gestures. The recurrent neural network (RNN) with a gated recurrent unit (GRU) memory cell was utilized as a gestures classifier. A set of 27 gestures was collected from a group of volunteers. The 27 000 sequences obtained were divided into training, validation, and test subsets....
-
Speaker Recognition Using Convolutional Neural Network with Minimal Training Data for Smart Home Solutions
PublicationWith the technology advancements in smart home sector, voice control and automation are key components that can make a real difference in people's lives. The voice recognition technology market continues to involve rapidly as almost all smart home devices are providing speaker recognition capability today. However, most of them provide cloud-based solutions or use very deep Neural Networks for speaker recognition task, which are...
-
The Impact of Weather on Traffic Speed in Urban Area
PublicationThe issue of the impact of weather conditions on trip speed of vehicles has been studied for a long time and it is still the subject of many scientific researches. The impact of atmospheric conditions on the speed with which drivers drive their vehicles seems to be obvious. Good weather conditions, sunny weather with good visibility surely provokes higher speed while rainfall, wind...
-
Image simulation and annotation for color blinded
PublicationIn this paper methods for image simulation as seen by a color blinded and a method for constructing images of perceived color difference are presented. The work is also focused on the interactive color description of an image contents. As a result, the individuals having problems with color discrimination can identify colors in an image.W artykule prezentowane są metody symulacji kolorów w obrazach postrzeganych przez osoby ze...
-
Medical Image Segmentation Using Deep Semantic-based Methods: A Review of Techniques, Applications and Emerging Trends
PublicationSemantic-based segmentation (Semseg) methods play an essential part in medical imaging analysis to improve the diagnostic process. In Semseg technique, every pixel of an image is classified into an instance, where each class is corresponded by an instance. In particular, the semantic segmentation can be used by many medical experts in the domain of radiology, ophthalmologists, dermatologist, and image-guided radiotherapy. The authors...
-
Comparison of edge detection algorithms for electric wire recognition
PublicationEdge detection is the preliminary step in image processing for object detection and recognition procedure. It allows to remove useless information and reduce amount of data before further analysis. The paper contains the comparison of edge detection algorithms optimized for detection of horizontal edges. For comparison purposes the algorithms were implemented in the developed application dedicated to detection of electric line...
-
Recognition and sensing of anions
PublicationMolecular ion recognition is one of the most intensively studied areas of supramolecular technology. The reason for this is the essential role that ions play in many biological as well as industrial processes. On the other hand, however, it has been proved that ions can have a negative impact on human health and the environment. For these reasons, it is extremly important to develop rapid and simple methods allowing the determination...
-
Virtual touchpad - video-based multimodal interface
PublicationA new computer interface named Virtual-Touchpad (VTP) is presented. The Virtual-Touchpad provides a multimodal interface which enables controlling computer applications by hand gestures captured with a typical webcam. The video stream is processed in the software layer of the interface. Hitherto existing video-based interfaces analyzing frames of hand gestures are presented. Then, the hardware configuration and software features...
-
Impact of Intelligent Transport Systems Services on the Level of Safety and Improvement of Traffic Conditions
PublicationThe positive effects of the services of Intelligent Transport Systems (ITS) on the level of transport systems operation was confirmed by long-term studies conducted, inter alia, in the USA, Japan and Europe. Benefits resulting from the application of ITS services can be presented through performance indicators. The indicators represent in a numerical or qualitative manner to what extent ITS services can contribute to improving...
-
Markowitz’s portfolio theory – optimal length of estimation window for gold and the biggests companies on the Warsaw Stock Exchange
PublicationThe following article is dedicated to the construction of an investment portfolio consisting of 3 investments from the Polish capital market found in the WIG20 index and from investment in gold. The purpose of the study was to determine the optimal length of the estimation window for building a portfolio with minimal risk and maximum efficiency. The length of the estimation window was also assessed in terms of the rate of return...
-
Hand gesture recognition supported by fuzzy rules and Kalman filters
PublicationThe paper presents a system based on camera and multimediaprojector enabling a user to control computer applications by dynamic hand gestures. Gesture recognition methodology based on representing hand movement trajectory by motion vectors analysed using fuzzy rule-based inference is first given. For effective hand position tracking Kalman filters are employed. The system engineered is developed using J2SE and C++/OpenCV technology....
-
Language Models in Speech Recognition
PublicationThis chapter describes language models used in speech recognition, It starts by indicating the role and the place of language models in speech recognition. Mesures used to compare language models follow. An overview of n-gram, syntactic, semantic, and neural models is given. It is accompanied by a list of popular software.
-
Use of Daylight and Aesthetic Image of Glass Facades in Contemporary Buildings
PublicationThe paper deals with the architecture of contemporary buildings in respect to their aesthetic image created by the use of natural light. Sustainability is regarded as a principle of contemporary architecture, where daylighting is an important factor as it affects energy consumption and environmental quality of the space inside a building. Environmental awareness of architecture, however, involves a much wider and more holistic...
-
Comparison of selected off-the-shelf solutions for emotion recognition based on facial expressions
PublicationThe paper concerns accuracy of emotion recognition from facial expressions. As there are a couple of ready off-the-shelf solutions available in the market today, this study aims at practical evaluation of selected solutions in order to provide some insight into what potential buyers might expect. Two solutions were compared: FaceReader by Noldus and Xpress Engine by QuantumLab. The performed evaluation revealed that the recognition...
-
Integration in Multichannel Emotion Recognition
PublicationThe paper concerns integration of results provided by automatic emotion recognition algorithms. It presents both the challenges and the approaches to solve them. Paper shows experimental results of integration. The paper might be of interest to researchers and practitioners who deal with automatic emotion recognition and use more than one solution or multichannel observation.
-
Detection of moving objects in images combined from video and thermal cameras
PublicationAn algorithm for detection of moving objects in video streams from the monitoring cameras is presented. A system composed of a standard video camera and a thermal camera, mounted in close proximity to each other, is used for object detection. First, a background subtraction is performed in both video streams separately, using the popular Gaussian Mixture Models method. For the next processing stage, the authors propose an algorithm...
-
Thermal Image Processing for Respiratory Estimation from Cubical Data with Expandable Depth
PublicationAs healthcare costs continue to rise, finding affordable and non-invasive ways to monitor vital signs is increasingly important. One of the key metrics for assessing overall health and identifying potential issues early on is respiratory rate (RR). Most of the existing methods require multiple steps that consist of image and signal processing. This might be difficult to deploy on edge devices that often do not have specialized...
-
Human-Computer Interface Based on Visual Lip Movement and Gesture Recognition
PublicationThe multimodal human-computer interface (HCI) called LipMouse is presented, allowing a user to work on a computer using movements and gestures made with his/her mouth only. Algorithms for lip movement tracking and lip gesture recognition are presented in details. User face images are captured with a standard webcam. Face detection is based on a cascade of boosted classifiers using Haar-like features. A mouth region is located in...
-
Robust and Efficient Machine Learning Algorithms for Visual Recognition
PublicationIn visual recognition, the task is to identify and localize all objects of interest in the input image. With the ubiquitous presence of visual data in modern days, the role of object recognition algorithms is becoming more significant than ever and ranges from autonomous driving to computer-aided diagnosis in medicine. Current models for visual recognition are dominated by models based on Convolutional Neural Networks (CNNs), which...
-
An audio-visual corpus for multimodal automatic speech recognition
Publicationreview of available audio-visual speech corpora and a description of a new multimodal corpus of English speech recordings is provided. The new corpus containing 31 hours of recordings was created specifically to assist audio-visual speech recognition systems (AVSR) development. The database related to the corpus includes high-resolution, high-framerate stereoscopic video streams from RGB cameras, depth imaging stream utilizing Time-of-Flight...
-
Molecular profiles of thyroid cancer subtypes: Classification based on features of tissue revealed by mass spectrometry imaging
Publication -
Klasyfikacja osadów ściekowych na podstawie analizy elementarnej = sewage sludge classification based on elementary analysis
PublicationPrzedmiotem pracy jest analiza osadów ściekowych z oczyszczalni ścieków ze Swarzewa oraz Dębogórza. Analizy elementarne osadów na rożnym etapie przeróbki pozwoliły na znalezienie ich obrazu w systematyce Kempy, Van Krevelena oraz Jurkiewicza. Osady ściekowe charakteryzowane są prze szereg parametrów fizycznych, chemicznych i biologicznych. E. Kempa opracował podstawy systematyki osadów w oparciu o systematykę Van Krevelena oraz...
-
Rotor broken bar diagnostics in induction motor drive using Wavelet packet transform and ANFIS classification
Publication -
Challenges of Comparing Marine Microbiome Community Composition Data Provided by Different Commercial Laboratories and Classification Databases
Publication -
ColorNephroNet: Kidney tumor malignancy prediction using medical image colorization
PublicationRenal tumor malignancy classification is one of the crucial tasks in urology, being a primary factor included in the decision of whether to perform kidney removal surgery (nephrectomy) or not. Currently, tumor malignancy prediction is determined by the radiological diagnosis based on computed tomography (CT) images. However, it is estimated that up to 16% of nephrectomies could have been avoided because the tumor that had been...
-
Pedestrian Safety in Road Traffic in Poland
PublicationEvery third road accident in Poland involves a pedestrian as a participant or, most of the time, a casualty. Pedestrian accidents are usually the result of complex situations and the outcome of a number of factors related to driver and pedestrian behaviour and road infrastructure. Safety depends largely on how well the traffic condition is perceived and on visibility in traffic. The paper presents the results of analyses of methodologies...
-
Recovering Sound Produced by Wind Turbine Structures Employing Video Motion Magnification
PublicationThe recordings were made with a fast video camera and with a microphone. Using fast cameras allowed for observation of the micro vibrations of the object structure. Motion-magnified video recordings of wind turbines on a wind farm were made for the purpose of building a damage prediction system. An idea was to use video to recover sound & vibrations in order to obtain a contactless diagnostic method for wind turbines. The recovered signals...
-
Human emotion recognition with biosignals
PublicationThis chapter presents issues in the field of affective computing. Basic preliminary information for the recognition of emotions is given and models of emotions, various ways of evoking emotions, as well as their theoretical foundations are discussed. The particular attention is given to the use of physiological signals in recognizing emotions. This subject is outlined further below by presenting selected biosignals, their relationship...
-
Application of Particle Image Velocimetry method for monitoring the volume changes during silo flow on the basis of X-radiographs
PublicationW artykule przedstawiono wyniki badań nad zastosowaniem techniki pomiarowej PIV (Particle Image Velocimetry) do analizy zmian objętościowych zachodzących w materiale sypkim w czasie opróżniania silosu prostokątnego. Jako mateirły do analizy wykorzystano cyfrowe radiografy uzyskane z kontynualnej rejestracji z użyciem systemu tomografii promieni X. Szcególny nacisk położono na analizę zmian objętościowych zachodzącyh w kanale przepływu.
-
Radio system for monitoring and acquisition of data from traffic enforcement cameras - features and assumptions of the system
PublicationThe study presents the architecture and selected functional assumptions of Radio System for Monitoring and Acquisition of Data from Traffic Enforcement Cameras (RSMAD). Ultimately, the system will be used for transmission and archiving image data of traffic offenses, but can also perform other duties related to traffic safety. Implementation of the RSMAD system will facilitate, inter alia, issuing the fine process and supervision...
-
Image Segmentation of MRI image for Brain Tumor Detection
Publicationthis research work presents a new technique for brain tumor detection by the combination of Watershed algorithm with Fuzzy K-means and Fuzzy C-means (KIFCM) clustering. The MATLAB based proposed simulation model is used to improve the computational simplicity, noise sensitivities, and accuracy rate of segmentation, detection and extraction from MR...
-
Acquisition and indexing of RGB-D recordings for facial expressions and emotion recognition
PublicationIn this paper KinectRecorder comprehensive tool is described which provides for convenient and fast acquisition, indexing and storing of RGB-D video streams from Microsoft Kinect sensor. The application is especially useful as a supporting tool for creation of fully indexed databases of facial expressions and emotions that can be further used for learning and testing of emotion recognition algorithms for affect-aware applications....
-
Evolutionary approach to ship's trajectory planning within Traffic Separation Schemes
PublicationThe paper presents the continuation of the author's research on evolutionary approach to ship trajectory planning. While the general problem of the evolutionary trajectory planning has already been solved, no one has yet touched one of its specific aspects: evolutionary trajectory planning within Traffic Separation Schemes. Traffic Separation Scheme (TSS) is a traffic-management route-system complying with rules of the International...
-
Image normalization method for face identification under difficult lighting conditions
PublicationW pracy przedstawiono nową metodę normalizacji obrazu, opartą o proste techniki, takie jak binaryzacja czy wyrównywanie histogramu, która pozwala na skuteczne wyeliminowanie cieni oraz uzyskanie niezmienników znacznie poprawiających dokładność procesu rozpoznawania twarzy. Podczas wykonanych eksperymentów zaproponowana metoda uzyskała wyniki lepsze od referencyjnego algorytmu wykorzystującego anizotropowe wygładzanie.
-
Video of LEGO Bricks on Conveyor Belt Dataset Series
PublicationThe dataset series titled Video of LEGO bricks on conveyor belt is composed of 14 datasets containing video recordings of a moving white conveyor belt. The recordings were created using a smartphone camera in Full HD resolution. The dataset allows for the preparation of data for neural network training, and building of a LEGO sorting machine that can help builders to organise their collections.
-
Mining inconsistent emotion recognition results with the multidimensional model
PublicationThe paper deals with the challenge of inconsistency in multichannel emotion recognition. The focus of the paper is to explore factors that might influence the inconsistency. The paper reports an experiment that used multi-camera facial expression analysis with multiple recognition systems. The data were analyzed using a multidimensional approach and data mining techniques. The study allowed us to explore camera location, occlusions...
-
Noise Scattering Patterns Method for Recognition of RTS Noise in Semiconductor Components
PublicationOpisano nową metodę identyfikacji i wizualizacji szumów RTS. Metoda ta oparta na graficznym przedstawieniu przebiegu szumowego jest szczególnie użyteczna do szybkiej selekcji elektronicznych elementów półprzewodnikowych. Przedstawiono także rezultaty filtacji medianowej szumu zawierającego składową RTS. Filtrację medianową zastosowano do poprawienia obrazu szumu uzyskanego w wyniku zastosowania metody NSP.
-
Real-Time Gastrointestinal Tract Video Analysis on a Cluster Supercomputer
PublicationThe article presents a novel approach to medical video data analysis and recognition. Emphasis has been put on adapting existing algorithms detecting le- sions and bleedings for real time usage in a medical doctor's office during an en- doscopic examination. A system for diagnosis recommendation and disease detec- tion has been designed taking into account the limited mobility of the endoscope and the doctor's requirements. The...
-
Benchmark of the traffic congestion in electrical transport by means of multi criteria decision analysis
PublicationCongestion of the road traffic is an important aspect related to the issues of energy consumption in public transport. Due to the multiattribute nature, the expression of traffic congestion in a quantitative valueis difficult to achieve. The article presents a method of estimation of traffic congestion by means of a multiattribute decision analysis.
-
Traffic Modeling in IMS-based NGN Networks
PublicationIn the modern world the need for accurate and quickly delivered information is becoming more and more essential. In order to fulfill these requirements, next generation telecommunication networks should be fast introduced and correctly dimensioned. For this reason proper traffic models must be identified, which is the subject of this paper. In the paper standardization of IMS (IP Multimedia Subsystem) concept and IMS-based NGN...
-
Traffic Type Influence on Performance of OSPF QoS Routing
PublicationFeasibility studies with QoS routing proved that the network traffic type has influence on routing performance. In this work influence of self-similar traffic for network with DiffServ architecture and OSPF QoS routing has been verified. Analysis has been done for three traffic classes. Multiplexed On-Off model was used for self-similar traffic generation. Comparison of simulation results was presented using both relative and non-relative...
-
New Aspects of Virtual Sound Source Localization Research—Impact of Visual Angle and 3-D Video Content on Sound Perception
PublicationThe influence of image on virtual sound source localization, called the “image proximity effect” or the “ventriloquism effect”, is a well known phenomenon. This paper focuses on other aspects related to this effect, namely the impact of the visual angle of the presented object and 3D video content on sound perception. The research conducted confirmed that the visual angle of the presented object determines the image proximity effect...
-
Concurrent Video Denoising and Deblurring for Dynamic Scenes
PublicationDynamic scene video deblurring is a challenging task due to the spatially variant blur inflicted by independently moving objects and camera shakes. Recent deep learning works bypass the ill-posedness of explicitly deriving the blur kernel by learning pixel-to-pixel mappings, which is commonly enhanced by larger region awareness. This is a difficult yet simplified scenario because noise is neglected when it is omnipresent in a wide...