Filters
total: 520
Search results for: image segmentation, computer vision, deep learning
-
Analogue CMOS ASICs in Image Processing Systems
PublicationIn this paper a survey of analog application specific integrated circuits (ASICs) for low-level image processing, called vision chips, is presented. Due to the specific requirements, the vision chips are designed using different architectures best suited to their functions. The main types of the vision chip architectures and their properties are presented and characterized on selected examples of prototype integrated circuits (ICs)...
-
Comparison of Selected Neural Network Models Used for Automatic Liver Tumor Segmentation
PublicationAutomatic and accurate segmentation of liver tumors is crucial for the diagnosis and treatment of hepatocellular carcinoma or metastases. However, the task remains challenging due to imprecise boundaries and significant variations in the shape, size, and location of tumors. The present study focuses on tumor segmentation as a more critical aspect from a medical perspective, compared to liver parenchyma segmentation, which is the...
-
Federated Learning in Healthcare Industry: Mammography Case Study
PublicationThe paper focuses on the role of federated learning in a healthcare environment. The experimental setup involved different healthcare providers, each with their datasets. A comparison was made between training a deep learning model using traditional methods, where all the data is stored in one place, and using federated learning, where the data is distributed among the workers. The experiment aimed to identify possible challenges...
-
Classification of Sea Going Vessels Properties Using SAR Satellite Images
PublicationThe aim of the project was to analyze the possibility of using machine learning and computer vision to identify (indicate the location) of all sea-going vessels located in the selected area of the open sea and to classify the main attributes of the vessel. The key elements of the project were to download data from the Sentinel-1 satellite [1], download data on the sea vessels [2], then automatically tag data and develop a detection...
-
Zdzisław Kowalczuk prof. dr hab. inż.
PeopleZdzislaw Kowalczuk received his M.Sc. degree in 1978 and Ph.D. degree in 1986, both in Automatic Control from Technical University of Gdańsk (TUG), Gdańsk, Poland. In 1993 he received his D.Sc. degree (Dr Habilitus) in Automatic Control from Silesian Technical University, Gliwice, Poland, and the title of Professor from the President of Poland in 2003. Since 1978 he has been with Faculty of Electronics, Telecommunications and Informatics...
-
Blended Learning in Teaching Safety of Electrical Installations
PublicationBlended learning becomes more commonly used in teaching information technology or other subjects, which involve practice in computer laboratories. In case of subjects with no access to computer rooms blended learning supports lecturing and teaching classes e.g. interactive lessons. The article presents the use of blended learning forms in Gdansk University of Technology in teaching the subject of Safety of Electrical Installations....
-
Multiplicative Long Short-Term Memory with Improved Mayfly Optimization for LULC Classification
PublicationLand Use and Land Cover (LULC) monitoring is crucial for global transformation, sustainable land control, urban planning, urban growth prediction, and the establishment of climate regulations for long-term development. Remote sensing images have become increasingly important in many environmental planning and land use surveys in recent times. LULC is evaluated in this research using the Sat 4, Sat 6, and Eurosat datasets. Various...
-
Video Semantic Analysis Framework based on Run-time Production Rules - Towards Cognitive Vision
PublicationThis paper proposes a service-oriented architecture for video analysis which separates object detection from event recognition. Our aim is to introduce new tools to be considered in the pathway towards Cognitive Vision as a support for classical Computer Vision techniques that have been broadly used by the scientific community. In the article, we particularly focus in solving some of the reported scalability issues found in current...
-
Multimodal human-computer interfaces based on advanced video and audio analysis
PublicationMultimodal interfaces development history is reviewed briefly in the introduction. Some applications of multimodal interfaces to education software for disabled people are presented. One of them, the LipMouse is a novel, vision-based human-computer interface that tracks user’s lip movements and detect lips gestures. A new approach to diagnosing Parkinson’s disease is also shown. The progression of the disease can be measured employing...
-
CMOS realisation of analogue processor for early vision processing
PublicationThe architecture concept of a high-speed low-power analogue vision chip, which performs low-level real-time image algorithms ispresented. The proof-of-concept prototype vision chip containing 32 × 32 photosensor array and 32 analogue processors is fabricated usinga 0.35 μm CMOS technology. The prototype can be configured to register and process images with very high speed, reaching 2000 framesper second, or achieve very low power...
-
Medical Image Computing and Computer-Assisted Intervention
Conferences -
Computational Methods for Liver Vessel Segmentation in Medical Imaging: A Review
PublicationThe segmentation of liver blood vessels is of major importance as it is essential for formulating diagnoses, planning and delivering treatments, as well as evaluating the results of clinical procedures. Different imaging techniques are available for application in clinical practice, so the segmentation methods should take into account the characteristics of the imaging technique. Based on the literature, this review paper presents...
-
Benchmarking Deep Neural Network Training Using Multi- and Many-Core Processors
PublicationIn the paper we provide thorough benchmarking of deep neural network (DNN) training on modern multi- and many-core Intel processors in order to assess performance differences for various deep learning as well as parallel computing parameters. We present performance of DNN training for Alexnet, Googlenet, Googlenet_v2 as well as Resnet_50 for various engines used by the deep learning framework, for various batch sizes. Furthermore,...
-
Self-Supervised Learning to Increase the Performance of Skin Lesion Classification
PublicationTo successfully train a deep neural network, a large amount of human-labeled data is required. Unfortunately, in many areas, collecting and labeling data is a difficult and tedious task. Several ways have been developed to mitigate the problem associated with the shortage of data, the most common of which is transfer learning. However, in many cases, the use of transfer learning as the only remedy is insufficient. In this study,...
-
Comparison of the exponential thermal transient parameterization methods with the SMTP method in the unipedicled DIEP flap computer modelling and simulation
PublicationThe aim of this paper is to compare the spatial contrast of the image descriptors obtained via three different thermal transient parameterization methods in Active Dynamic Thermography. The thermal constants and amplitude values of the one- and two- exponential parametrization are compared to the Simplified Magnitude-Temporal Parametrization method (SMTP). The comparison is performed using the data obtained by simulating the cold...
-
IEEE Conference on Computer Vision and Pattern Recognition
Conferences -
Soft Computing in Computer Graphics, Imaging, and Vision
Conferences -
Deep Features Class Activation Map for Thermal Face Detection and Tracking
PublicationRecently, capabilities of many computer vision tasks have significantly improved due to advances in Convolutional Neural Networks. In our research, we demonstrate that it can be also used for face detection from low resolution thermal images, acquired with a portable camera. The physical size of the camera used in our research allows for embedding it in a wearable device or indoor remote monitoring solution for elderly and disabled...
-
Examining Quality of Hand Segmentation Based on Gaussian Mixture Models
PublicationResults of examination of various implementations of Gaussian mix-ture models are presented in the paper. Two of the implementations belonged to the Intel’s OpenCV 2.4.3 library and utilized Background Subtractor MOG and Background Subtractor MOG2 classes. The third implementation presented in the paper was created by the authors and extended Background Subtractor MOG2 with the possibility of operating on the scaled version of...
-
An Analysis of Uncertainty and Robustness of Waterjet Machine Positioning Vision System
PublicationThe paper presents a new Automatic Waterjet Positioning Vision System (AWPVS) and investigates components of workpiece positioning accuracy. The main purpose of AWPVS is to precisely identify the position and rotation of a workpiece placed on a waterjet machine table. Two webcams form a basis for the system, and constitute its characteristics. The proposed algorithm comprises various image processing techniques to assure a required...
-
Joint Australia and New Zealand Biennial Conference on Digital Image and Vision Computing
Conferences -
Smart Approach for Glioma Segmentation in Magnetic Resonance Imaging using Modified Convolutional Network Architecture (U-NET)
PublicationSegmentation of a brain tumor from magnetic resonance multimodal images is a challenging task in the field of medical imaging. The vast diversity in potential target regions, appearance and multifarious intensity threshold levels of various tumor types are few of the major factors that affect segmentation results. An accurate diagnosis and its treatment demand strict delineation of the tumor affected tissues. Herein, we focus on...
-
International Conferences in Central Europe on Computer Graphics, Visualization and Computer Vision
Conferences -
Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications
Conferences -
A novel approach exploiting properties of convolutional neural networks for vessel movement anomaly detection and classification
PublicationThe article concerns the automation of vessel movement anomaly detection for maritime and coastal traffic safety services. Deep Learning techniques, specifically Convolutional Neural Networks (CNNs), were used to solve this problem. Three variants of the datasets, containing samples of vessel traffic routes in relation to the prohibited area in the form of a grayscale image, were generated. 1458 convolutional neural networks with...
-
Marta Kuc-Czarnecka dr
PeopleMarta Kuc-Czarnecka is the deputy head of the Department of Statistics and Economics at the Faculty of Management and Economics of the Gdańsk University of Technology. She also serves as the Dean's proxy for AMBA accreditation. She is a co-founder of Rethinking Economics Gdańsk and a member of the Foundation Edward Lipiński for the promotion of pluralism in economic sciences. In 2018-2022, she was Eurofound’s quality of life and...
-
Fusion-based Representation Learning Model for Multimode User-generated Social Network Content
PublicationAs mobile networks and APPs are developed, user-generated content (UGC), which includes multi-source heterogeneous data like user reviews, tags, scores, images, and videos, has become an essential basis for improving the quality of personalized services. Due to the multi-source heterogeneous nature of the data, big data fusion offers both promise and drawbacks. With the rise of mobile networks and applications, UGC, which includes...
-
Experience-Oriented Knowledge Management for Internet of Things
PublicationIn this paper, we propose a novel approach for knowledge management in Internet of Things. By utilizing Decisional DNA and deep learning technologies, our approach enables Internet of Things of experiential knowledge discovery, representation, reuse, and sharing among each other. Rather than using traditional machine learning and knowledge discovery methods, this approach focuses on capturing domain’s decisional events via Decisional...
-
Paweł Rościszewski dr inż.
PeoplePaweł Rościszewski received his PhD in Computer Science at Gdańsk University of Technology in 2018 based on PhD thesis entitled: "Optimization of hybrid parallel application execution in heterogeneous high performance computing systems considering execution time and power consumption". Currently, he is an Assistant Professor at the Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, Poland....
-
An Overview of Image Analysis Techniques in Endoscopic Bleeding Detection
PublicationAuthors review the existing bleeding detection methods focusing their attention on the image processing techniques utilised in the algorithms. In the article, 18 methods were analysed and their functional components were identified. The authors proposed six different groups, to which algorithms’ components were assigned: colour techniques, reflecting features of pixels as individual values, texture techniques, considering spatial...
-
CMOS implementation of an analogue median filter for image processing in real time
PublicationAn analogue median filter, realised in a 0.35 μm CMOS technology, is presented in this paper. The key advantages of the filter are: high speed of image processing (50 frames per second), low-power operation (below 1.25 mW under 3.3 V supply) and relatively high accuracy of signal processing. The presented filter is a part of an integrated circuit for image processing (a vision chip), containing: a photo-sensor matrix, a set of...
-
Shape-Based Pose Estimation of Robotic Surgical Instruments
PublicationWe describe a detector of robotic instrument parts in image-guided surgery. The detector consists of a huge ensemble of scale-variant and pose-dedicated, rigid appearance templates. The templates, which are equipped with pose-related keypoints and segmentation masks, allow for explicit pose estimation and segmentation of multiple end-effectors as well as fine-grained non-maximum suppression. We train the templates by grouping examples...
-
Improving Accuracy of Contactless Respiratory Rate Estimation by Enhancing Thermal Sequences with Deep Neural Networks
PublicationEstimation of vital signs using image processing techniques have already been proved to have a potential for supporting remote medical diagnostics and replacing traditional measurements that usually require special hardware and electrodes placed on a body. In this paper, we further extend studies on contactless Respiratory Rate (RR) estimation from extremely low resolution thermal imagery by enhancing acquired sequences using Deep...
-
Music information retrieval—The impact of technology, crowdsourcing, big data, and the cloud in art.
PublicationThe exponential growth of computer processing power, cloud data storage, and crowdsourcing model of gathering data bring new possibilities to music information retrieval (mir) field. Mir is no longer music content retrieval only; the area also comprises the discovery of expressing feelings and emotions contained in music, incorporating other than hearing modalities for helping this issue, users’ profiling, merging music with social...
-
Explainable machine learning for diffraction patterns
PublicationSerial crystallography experiments at X-ray free-electron laser facilities produce massive amounts of data but only a fraction of these data are useful for downstream analysis. Thus, it is essential to differentiate between acceptable and unacceptable data, generally known as ‘hit’ and ‘miss’, respectively. Image classification methods from artificial intelligence, or more specifically convolutional neural networks (CNNs), classify...
-
Paweł Burdziakowski dr inż.
PeoplePaweł Burdziakowski, PhD, is a professional in low-altitude aerial photogrammetry and remote sensing, marine and aerial navigation. He is also a licensed flight instructor and software developer. His main areas of interest are digital photogrammetry, navigation of unmanned platforms and unmanned systems, including aerial, surface, underwater. He conducts research in algorithms and methods to improve the quality of spatial measurements...
-
KEMR-Net: A Knowledge-Enhanced Mask Refinement Network for Chromosome Instance Segmentation
PublicationThis article proposes a mask refinement method for chromosome instance segmentation. The proposed method exploits the knowledge representation capability of Neural Knowledge DNA (NK-DNA) to capture the semantics of the chromosome’s shape, texture, and key points, and then it uses the captured knowledge to improve the accuracy and smoothness of the masks. We validate the method’s effectiveness on our latest high-resolution chromosome...
-
Artificial intelligence for software development — the present and the challenges for the future
PublicationSince the time when first CASE (Computer-Aided Software Engineering) methods and tools were developed, little has been done in the area of automated creation of code. CASE tools support a software engineer in creation the system structure, in defining interfaces and relationships between software modules and, after the code has been written, in performing testing tasks on different levels of detail. Writing code is still the task...
-
Urban scene semantic segmentation using the U-Net model
PublicationVision-based semantic segmentation of complex urban street scenes is a very important function during autonomous driving (AD), which will become an important technology in industrialized countries in the near future. Today, advanced driver assistance systems (ADAS) improve traffic safety thanks to the application of solutions that enable detecting objects, recognising road signs, segmenting the road, etc. The basis for these functionalities...
-
Assessment of student language skills in an e-learning environment
PublicationThis article presents the role of various assessment structures that can be used in a VLE. e-Learning language courses offer tutors a wide range of traditional and computer-generated formative and summative assessment procedures and tools. They help to evaluate each student’s progress, monitor their activities and provide varied support, which comes from the tutor, the course structure and materials as well as other participants....
-
An Analog Sub-Miliwatt CMOS Image Sensor With Pixel-Level Convolution Processing
PublicationA new approach to an analog ultra-low power medium-resolution vision chip design is presented. The prototype chip performs low-level image processing algorithms in real time. Only a photo-diode, MOS switches and two capacitors are used to create an analog processing element (APE) that is able to realize any convolution algorithm based on a full 3x3 kernel. The proof-of-concept circuit is implemented in 0.35 µm CMOS technology,...
-
Mask Detection and Classification in Thermal Face Images
PublicationFace masks are recommended to reduce the transmission of many viruses, especially SARS-CoV-2. Therefore, the automatic detection of whether there is a mask on the face, what type of mask is worn, and how it is worn is an important research topic. In this work, the use of thermal imaging was considered to analyze the possibility of detecting (localizing) a mask on the face, as well as to check whether it is possible to classify...
-
Identification of Emotional States Using Phantom Miro M310 Camera
PublicationThe purpose of this paper is to present the possibilities associated with the use of remote sensing methods in identifying human emotional states, and to present the results of the research conducted by the authors in this field. The studies presented involved the use of advanced image analysis to identify areas on the human face that change their activity along with emotional expression. Most of the research carried out in laboratories...
-
Realization, programming and controlling of the Stewart-Gough platform
PublicationThis paper presents realizaon, programming, and controlling of a low cost Stewart-Gough plaorm (SGP) with rotary actuators. The realized SGP is applied in a ball & plate control system. Developed dedicated software consists of embedded and applicaon soware for both the SGP posioning system and the ball & plate control. system. A ball posion is being obtained using computer vision. The paper contains tests results for both an SGP...
-
Three-dimensional modeling and automatic analysis of the human nasal cavity and paranasal sinuses using the computational fluid dynamics method
PublicationPurpose The goal of this study was to develop a complete workflow allowing for conducting computational fluid dynam- ics (CFD) simulation of airflow through the upper airways based on computed tomography (CT) and cone-beam computed tomography (CBCT) studies of individual adult patients. Methods This study is based on CT images of 16 patients. Image processing and model generation of the human nasal cavity and paranasal sinuses...
-
Pedestrian detection in low-resolution thermal images
PublicationOver one million people die in car accidents worldwide each year. A solution that will be able to reduce situations in which pedestrian safety is at risk has been sought for a long time. One of the techniques for detecting pedestrians on the road is the use of artificial intelligence in connection with thermal imaging. The purpose of this work was to design a system to assist the safety of people and car intelligence with the use...
-
Multimodal human-computer interfaces based on advanced video and audio analysis
PublicationMultimodal interfaces development history is reviewed briefly in the introduction. Examples of applications of multimodal interfaces to education software and for the disabled people are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with mouth gestures and the audio interface for speech stretching for hearing impaired and stuttering people. The Smart...
-
An Intelligent Approach to Short-Term Wind Power Prediction Using Deep Neural Networks
PublicationIn this paper, an intelligent approach to the Short-Term Wind Power Prediction (STWPP) problem is considered, with the use of various types of Deep Neural Networks (DNNs). The impact of the prediction time horizon length on accuracy, and the influence of temperature on prediction effectiveness have been analyzed. Three types of DNNs have been implemented and tested, including: CNN (Convolutional Neural Networks), GRU (Gated Recurrent...
-
Stereo image visualization for a VISROBOT system
PublicationThe article describes a novel approach to robotic vision in mobile robot systems. The system implements a Visrobot system which implements a generic idea of using mobile robots for exploring an indoor environment. The task of such a robot is to visualize a stereo image properly for an operator. The system uses different stereo baseline values. Variable baseline can result in increasing depth resolution for distant objects. We assume...
-
PPAM 2022
EventsThe PPAM 2022 conference, will cover topics in parallel and distributed computing, including theory and applications, as well as applied mathematics.