Wyniki wyszukiwania dla: SEGMENTATION METHODS
-
A Survey on the Datasets and Algorithms for Satellite Data Applications
PublikacjaThis survey compiles insights and describes datasets and algorithms for applications based on remote sensing. The goal of this review is twofold: datasets review for particular groups of tasks and high-level steps of data flow between satellite instruments and end applications from an implementation and development perspective. The article outlines the generalized data processing pipelines, taking into account the variations in...
-
Melanoma skin cancer detection using mask-RCNN with modified GRU model
PublikacjaIntroduction: Melanoma Skin Cancer (MSC) is a type of cancer in the human body; therefore, early disease diagnosis is essential for reducing the mortality rate. However, dermoscopic image analysis poses challenges due to factors such as color illumination, light reflections, and the varying sizes and shapes of lesions. To overcome these challenges, an automated framework is proposed in this manuscript. Methods: Initially, dermoscopic...
-
The influence of image masks definition onsegmentation results of histopathological imagesusing convolutional neural network
PublikacjaAbstract—In the era of collecting large amounts of tissue materials, assisting the work of histopathologists with various electronic and information IT tools is an undeniable fact. The traditional interaction between a human pathologist and the glass slide is changing to interaction between an AI pathologist with a whole slide images. One of the important tasks is the segmentation of objects (e.g. cells) in such images. In this...
-
CMGNet: Context-aware middle-layer guidance network for salient object detection
PublikacjaSalient object detection (SOD) is a critical task in computer vision that involves accurately identifying and segmenting visually significant objects in an image. To address the challenges of gridding issues and feature...
-
Vident-real: an intra-oral video dataset for multi-task learning
Dane BadawczeWe introduce Vident-real, a large dataset of 100 video sequences of intra-oral scenes from real conservative dental treatments performed at the Medical University of Gdańsk, Poland. The dataset can be used for multi-task learning methods including:
-
Autonomous pick-and-place system based on multiple 3Dsensors and deep learning
PublikacjaGrasping objects and manipulating them is the main way the robot interacts with its environment. However, for robots to operate in a dynamic environment, a system for determining the gripping position for objects in the scene is also required. For this purpose, neural networks segmenting the point cloud are usually applied. However, training such networks is very complex and their results are unsatisfactory. Therefore, we propose...
-
Autonomous Perception and Grasp Generation Based on Multiple 3D Sensors and Deep Learning
PublikacjaGrasping objects and manipulating them is the main way the robot interacts with its environment. However, for robots to operate in a dynamic environment, a system for determining the gripping position for objects in the scene is also required. For this purpose, neural networks segmenting the point cloud are usually applied. However, training such networks is very complex and their results are unsatisfactory. Therefore, we propose...
-
Smart Karyotyping Image Selection Based on Commonsense Knowledge Reasoning
PublikacjaKaryotyping requires chromosome instances to be segmented and classified from the metaphase images. One of the difficulties in chromosome segmentation is that the chromosomes are randomly positioned in the image, and there is a great chance for chromosomes to be touched or overlap with others. It is always much easier for operators and automatic programs to tackle images without overlapping chromosomes than ones with largely overlapped...
-
Shape-Based Pose Estimation of Robotic Surgical Instruments
PublikacjaWe describe a detector of robotic instrument parts in image-guided surgery. The detector consists of a huge ensemble of scale-variant and pose-dedicated, rigid appearance templates. The templates, which are equipped with pose-related keypoints and segmentation masks, allow for explicit pose estimation and segmentation of multiple end-effectors as well as fine-grained non-maximum suppression. We train the templates by grouping examples...
-
Automated Parking Management for Urban Efficiency: A Comprehensive Approach
PublikacjaEffective parking management is essential for ad-dressing the challenges of traffic congestion, city logistics, and air pollution in densely populated urban areas. This paper presents an algorithm designed to optimize parking management within city environments. The proposed system leverages deep learning models to accurately detect and classify street elements and events. Various algorithms, including automatic segmentation of...
-
Detection of the Oocyte Orientation for the ICSI Method Automation
PublikacjaAutomation or even computer assistance of the popular infertility treatment method: ICSI (Intracytoplasmic Sperm Injection) would speed up the whole process and improve the control of the results. This paper introduces a preliminary research for automatic spermatozoon injection into the oocyte cytoplasm. Here, the method for detection a correct orientation of the polar body of the oocyte is presented. Proposed method uses deep...
-
Expedited Multi-Objective Design Optimization of Miniaturized Microwave Structures Using Physics-Based Surrogates
PublikacjaIn this paper, a methodology for fast multi-objective design optimization of compact microwave circuits is presented. Our approach exploits an equivalent circuit model of the structure under consideration, corrected through implicit and frequency space mapping, then optimized by a multi-objective evolutionary algorithm. The correction/optimization of the surrogate is iterated by design space confinement and segmentation based on...
-
Training of Deep Learning Models Using Synthetic Datasets
PublikacjaIn order to solve increasingly complex problems, the complexity of Deep Neural Networks also needs to be constantly increased, and therefore training such networks requires more and more data. Unfortunately, obtaining such massive real world training data to optimize neural networks parameters is a challenging and time-consuming task. To solve this problem, we propose an easy-touse and general approach to training deep learning...
-
Pipelined division of signed numbers with the use of residue arithmetic for small number range with the programmable gate array
PublikacjaIn this work an architecture of the pipelined signed residue divider for the small number range is presented. Its operation is based on reciprocal calculation and multiplication by the dividend. The divisor in the signed binary form is used to compute the approximated reciprocal in the residue form by the table look-up. In order to limit the look-up table address an algorithm based on segmentation of the divisor into two segments...
-
Identification of Emotions Based on Human Facial Expressions Using a Color-Space Approach
PublikacjaHCI technology improves human-computer interaction. Such communication can be carried out with the use of emotions that are visible on the human face since birth. In this paper the Emotion system for detecting and recognizing facial expressions, developed in the MSc work, is presented. The system recognizes emotion from webcam video in real time. It is based on color segmentation and morphological operations. The system uses a...
-
Pipelined division of signed numbers with the use of residue arithmetic in FPGA
PublikacjaAn architecture of a pipelined signed residue divider for small number ranges is presented. The divider makes use of the multiplicative division algorithm where initially the reciprocal of the divisor is calculated and subsequently multiplied by the dividend. The divisor represented in the signed binary form is used to compute the approximated reciprocal in the residue form by the table look-up. In order to reduce the needed length...
-
Investigations of speech signal parameters with regard to articulation influences
PublikacjaW pracy zostało podjęte zagadnienie parametryzacji sygnału mowy w kontekście ekstrakcji cech biometrycznych. Analizowane parametry to parametry cepstralne (cepstrum liniowe i mel-cepstrum, czyli MFCC), parametry liniowej predykcji (LPC) oraz momenty widmowe i parametr F0. Zastosowano analize w krótkich stałych segmentach sygnału z zastosowaniem dużego zakładkowania, tzw. ''implicite segmentation''. Umożliwiło to zaobserwowanie...
-
Rapid multi-objective design optimisation of compact microwave couplers by means of physics-based surrogates
PublikacjaThe authors introduce a methodology for fast multi-objective design optimisation of miniaturised microwave couplers. The approach exploits the surrogate-based optimisation paradigm with an underlying low-fidelity model constructed from an equivalent circuit of the structure under consideration, corrected through implicit and frequency space mapping. A fast prediction tool obtained this way is subsequently optimised by a multi-objective...
-
Controlling computer by lip gestures employing neural network
PublikacjaResults of experiments regarding lip gesture recognition with an artificial neural network are discussed. The neural network module forms the core element of a multimodal human-computer interface called LipMouse. This solution allows a user to work on a computer using lip movements and gestures. A user face is detected in a video stream from a standard web camera using a cascade of boosted classifiers working with Haar-like features....
-
Human-Computer Interface Based on Visual Lip Movement and Gesture Recognition
PublikacjaThe multimodal human-computer interface (HCI) called LipMouse is presented, allowing a user to work on a computer using movements and gestures made with his/her mouth only. Algorithms for lip movement tracking and lip gesture recognition are presented in details. User face images are captured with a standard webcam. Face detection is based on a cascade of boosted classifiers using Haar-like features. A mouth region is located in...
-
Hazard Control in Industrial Environments: A Knowledge-Vision-Based Approach
PublikacjaThis paper proposes the integration of image processing techniques (such as image segmentation, feature extraction and selection) and a knowledge representation approach in a framework for the development of an automatic system able to identify, in real time, unsafe activities in industrial environments. In this framework, the visual information (feature extraction) acquired from video-camera images and other context based gathered...
-
Robust Object Detection with Multi-input Multi-output Faster R-CNN
PublikacjaRecent years have seen impressive progress in visual recognition on many benchmarks, however, generalization to the out-of-distribution setting remains a significant challenge. A state-of-the-art method for robust visual recognition is model ensembling. However, recently it was shown that similarly competitive results could be achieved with a much smaller cost, by using multi-input multi-output architecture (MIMO). In this work,...
-
Robust Object Detection with Multi-input Multi-output Faster R-CNN
PublikacjaRecent years have seen impressive progress in visual recognition on many benchmarks, however, generalization to the out-of-distribution setting remains a significant challenge. A state-of-the-art method for robust visual recognition is model ensembling. However, recently it was shown that similarly competitive results could be achieved with a much smaller cost, by using multi-input multi-output architecture (MIMO). In this work,...
-
AI-powered Customer Relationship Management – GenerativeAI-based CRM – Einstein GPT, Sugar CRM, and MS Dynamics 365
PublikacjaGenerative artificial intelligence (GenAI) and its implementation in successive business management support systems is a rapidly growing area of theoretical consideration, ongoing research, discourse and application in practice. Recently, the implementation of of GenAI in customer relationship management (CRM) systems has been observed. Accordingly, the aim of this article is to identify areas where GenAI can enhance CRM systems,...
-
Workforce mobility against the background of labour market duality theory – the example of selected OECD countries
PublikacjaThe paper aims to present an empirical study of labour market segmentation (LMS) hypothesis. According to the dual labour market theory jobs can be divided into two groups: primary and secondary jobs, with enter barriers into the first one. The primary jobs are usually described with relative high wages, whereas secondary jobs provide lower level of wages. In this paper we first examine the main sectors (according to the ISIC rev....
-
Video content analysis in the urban area telemonitoring system
PublikacjaThe task of constant monitoring of video streams from a large number of cameras and reviewing the recordings in order to find a specified event requires a considerable amount of time and effort from the system operators and it is prone to errors. A solution to this problem is an automatic system for constant analysis of camera images being able to raise an alarm if a predefined event is detected. The chapter presents various aspects...
-
Automatic Threat Detection for Historic Buildings in Dark Places Based on the Modified OptD Method
PublikacjaHistoric buildings, due to their architectural, cultural, and historical value, are the subject of preservation and conservatory works. Such operations are preceded by an inventory of the object. One of the tools that can be applied for such purposes is Light Detection and Ranging (LiDAR). This technology provides information about the position, reflection, and intensity values of individual points; thus, it allows for the creation...
-
Hardware-Software Implementation of a Sensor Network for CityTraffic Monitoring Using the FPGA- and ASIC-Based Sensor Nodes
PublikacjaArtykuł opisuje prototypową sieć sensorową do monitorowania ruchu pojazdów w mieście. Węzły sieci sensorowej, wyposażone w kamerę o niskiej rozdzielczości, obserwują ulice i wykrywają poruszające się obiekty. Detekcja obiektów jest realizowana w oparciu o własny algorytm segmentacji obrazów, wykorzystujący podwójne odejmowanie tła, wykrywanie krawędzi i cieni, działający na dedykowanym systemie mikroelektronicznym typu ''System...