Search results for: IMAGE SEGMENTATION, COMPUTER VISION, DEEP LEARNING - Bridge of Knowledge

Search

Search results for: IMAGE SEGMENTATION, COMPUTER VISION, DEEP LEARNING

Search results for: IMAGE SEGMENTATION, COMPUTER VISION, DEEP LEARNING

  • Adaptive Hounsfield Scale Windowing in Computed Tomography Liver Segmentation

    Publication

    In computed tomography (CT) imaging, the Hounsfield Unit (HU) scale quantifies radiodensity, but its nonlinear nature across organs and lesions complicates machine learning analysis. This paper introduces an automated method for adaptive HU scale windowing in deep learning-based CT liver segmentation. We propose a new neural network layer that optimizes HU scale window parameters during training. Experiments on the Liver Tumor...

    Full text to download in external service

  • Deep CNN based decision support system for detection and assessing the stage of diabetic retinopathy

    Publication

    - Year 2018

    The diabetic retinopathy is a disease caused by long-standing diabetes. Lack of effective treatment can lead to vision impairment and even irreversible blindness. The disease can be diagnosed by examining digital color fundus photographs of retina. In this paper we propose deep learning approach to automated diabetic retinopathy screening. Deep convolutional neural networks (CNN) - the most popular kind of deep learning algorithms...

    Full text to download in external service

  • Image and Vision Computing Conference

    Conferences

  • Karol Zdzisław Zalewski mgr inż.

    People

  • Leveraging spatio-temporal features for joint deblurring and segmentation of instruments in dental video microscopy

    Publication

    - Year 2021

    In dentistry, microscopes have become indispensable optical devices for high-quality treatment and micro-invasive surgery, especially in the field of endodontics. Recent machine vision advances enable more advanced, real-time applications including but not limited to dental video deblurring and workflow analysis through relevant metadata obtained by instrument motion trajectories. To this end, the proposed work addresses dental...

    Full text to download in external service

  • Patryk Ziółkowski dr inż.

    Assistant Professor at Gdansk Tech. He participated in international projects, including projects for the Ministry of Transportation of the State of Alabama (2015), he is also the winner of a grant from the Kosciuszko Foundation for conducting research in the USA, which he completed in 2018. An expert in the field of artificial intelligence. His main area of research interest is the application of artificial intelligence in Civil...

  • Podstawy uczenia głębokiego 2022

    e-Learning Courses
    • K. Draszawka
    • S. Olewniczak
    • J. Szymański

    {mlang pl}Kurs podstaw uczenia głębokiego przeznaczony dla studentów kierunku Informatyka.{mlang} {mlang en}This is a course about deep learning basics dedicated for Computer Science students.{mlang}

  • PROPRIETARY SOFTWARE IN TECHNICAL HIGHER EDUCATION

    The authors present a relatively easy way to extend the quality of education in professional studies (engineering) on major “Geodesy and Cartography”. They indicate the possibility to deepen students’ knowledge by using in the educational process proprietary software enriching education. The authors use their own experiences, results of the cooperation with employers, as well as the effects of scientific research to introduce into...

    Full text to download in external service

  • PROPRIETARY SOFTWARE IN TECHNICAL HIGHER EDUCATION

    The authors present a relatively easy way to extend the quality of education in professional studies (engineering) on major “Geodesy and Cartography”. They indicate the possibility to deepen students’ knowledge by using in the educational process proprietary software enriching education. The authors use their own experiences, results of the cooperation with employers, as well as the effects of scientific research to introduce...

    Full text to download in external service

  • Review of Segmentation Methods for Coastline Detection in SAR Images

    Synthetic aperture radar (SAR) images acquired by airborne sensors or remote sensing satellites contain the necessary information that can be used to investigate various objects of interest on the surface of the Earth, including coastlines. The coastal zone is of great economic importance and is also very densely populated. The intensive and increasing use of coasts and changes of coastlines motivate researchers to try to assess...

    Full text available to download

  • Segmentation-Based BI-RADS ensemble classification of breast tumours in ultrasound images

    Publication

    - INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS - Year 2024

    Background: The development of computer-aided diagnosis systems in breast cancer imaging is exponential. Since 2016, 81 papers have described the automated segmentation of breast lesions in ultrasound images using arti- ficial intelligence. However, only two papers have dealt with complex BI-RADS classifications. Purpose: This study addresses the automatic classification of breast lesions into binary classes (benign vs. ma- lignant)...

    Full text available to download

  • Detection of Alzheimer's disease using Otsu thresholding with tunicate swarm algorithm and deep belief network

    Publication

    - Frontiers in Physiology - Year 2024

    Introduction: Alzheimer’s Disease (AD) is a degenerative brain disorder characterized by cognitive and memory dysfunctions. The early detection of AD is necessary to reduce the mortality rate through slowing down its progression. The prevention and detection of AD is the emerging research topic for many researchers. The structural Magnetic Resonance Imaging (sMRI) is an extensively used imaging technique in detection of AD, because...

    Full text available to download

  • Efkleidis Katsaros

    People

    Efklidis Katsaros received the B.Sc. degree in mathematics from the Aristotle University of Thessaloniki, Greece, in 2016, and the M.Sc. degree (cum laude) in data science: statistical science from Leiden University, The Netherlands, in 2019. He is currently pursuing the Ph.D. degree in deep video multi-task learning with the Department of Biomedical Engineering, Gdańsk University of Technology, Poland. Since 2020, he has been...

  • Human Feedback and Knowledge Discovery: Towards Cognitive Systems Optimization

    Publication

    - Procedia Computer Science - Year 2020

    Current computer vision systems, especially those using machine learning techniques are data-hungry and frequently only perform well when dealing with patterns they have seen before. As an alternative, cognitive systems have become a focus of attention for applications that involve complex visual scenes, and in which conditions may vary. In theory, cognitive applications uses current machine learning algorithms, such as deep learning,...

    Full text available to download

  • Human-Computer Interface Based on Visual Lip Movement and Gesture Recognition

    The multimodal human-computer interface (HCI) called LipMouse is presented, allowing a user to work on a computer using movements and gestures made with his/her mouth only. Algorithms for lip movement tracking and lip gesture recognition are presented in details. User face images are captured with a standard webcam. Face detection is based on a cascade of boosted classifiers using Haar-like features. A mouth region is located in...

    Full text to download in external service

  • Grzegorz Szwoch dr hab. inż.

    Grzegorz Szwoch was born in 1972 in Gdansk. In 1991-1996 he studied at the Technical University of Gdansk. In 1996 he graduated as a student from the Sound Engineering Department. His thesis was related to physical modeling of musical instruments. Since that time he has been a member of the research staff at the Multimedia Systems Department as a PhD student (1996-2001), Assistant (2001-2004), Assistant professor (2004-2020) and...

  • MEAN SHIFT BASED SEGMENTATION FOR BLEEDING REGIONS IN ENDOSCOPIC VIDEOS

    Publication

    With a set of 38 manually marked bleeding regions form endoscopic videos, the authors attempted to find an optimal image segmentation method for reproducing doctor’s markup. Mean shift segmentation combined with HSV histogram segmentation were used as a segmentation method, which was then optimized by tuning the parameters of the method using global optimization algorithm. A target function for measuring the quality of segmentation was...

  • Reinforcement Learning Algorithm and FDTD-based Simulation Applied to Schroeder Diffuser Design Optimization

    Publication

    The aim of this paper is to propose a novel approach to the algorithmic design of Schroeder acoustic diffusers employing a deep learning optimization algorithm and a fitness function based on a computer simulation of the propagation of acoustic waves. The deep learning method employed for the research is a deep policy gradient algorithm. It is used as a tool for carrying out a sequential optimization process the goal of which is...

    Full text available to download

  • Deep neural networks for data analysis

    e-Learning Courses
    • K. Draszawka

    The aim of the course is to familiarize students with the methods of deep learning for advanced data analysis. Typical areas of application of these types of methods include: image classification, speech recognition and natural language understanding. Celem przedmiotu jest zapoznanie studentów z metodami głębokiego uczenia maszynowego na potrzeby zaawansowanej analizy danych. Do typowych obszarów zastosowań tego typu metod należą:...

  • Designing acoustic scattering elements using machine learning methods

    Publication

    - Year 2021

    In the process of the design and correction of room acoustic properties, it is often necessary to select the appropriate type of acoustic treatment devices and make decisions regarding their size, geometry, and location of the devices inside the room under the treatment process. The goal of this doctoral dissertation is to develop and validate a mathematical model that allows predicting the effects of the application of the scattering...

    Full text available to download

  • Vident-lab: a dataset for multi-task video processing of phantom dental scenes

    We introduce a new, asymmetrically annotated dataset of natural teeth in phantom scenes for multi-task video processing: restoration, teeth segmentation, and inter-frame homography estimation. Pairs of frames were acquired with a beam splitter. The dataset constitutes a low-quality frame, its high-quality counterpart, a teeth segmentation mask, and...

  • WEB-CAM AS A MEANS OF INFORMATION ABOUT EMOTIONAL ATTEMPT OF STUDENTS IN THE PROCESS OF DISTANT LEARNING

    Publication

    - Year 2014

    New methods in education become more popular nowadays. Distant learning is a good example when teacher and student meet in virtual environment. Because interaction in this virtual world might be complicated it seems necessary to assure as much methods of conforming that student is still engaged in the process of learning as it is possible. We would like to present assumption that by means of web-cam we will be able to track facial...

  • Ensembling noisy segmentation masks of blurred sperm images

    Background: Sperm tail morphology and motility have been demonstrated to be important factors in determining sperm quality for in vitro fertilization. However, many existing computer-aided sperm analysis systems leave the sperm tail out of the analysis, as detecting a few tail pixels is challenging. Moreover, some publicly available datasets for classifying morphological defects contain images limited only to the sperm head. This...

    Full text available to download

  • International Machine Vision and Image Processing Conference

    Conferences

  • Assessment of particular abdominal aorta section extraction from contrast-enhanced computed tomography angiography

    Publication

    The aim of this work is to improve the accuracy of extraction of a particular abdominal aorta section and to reduce the distortion in three-dimensional Computed Tomography Angiography (CTA) images. Imaging modality and quality plays crucial role in the medical diagnostic process, thus ensuring high quality of images is essential at every stage of acquisition and processing.Noise is defined as a disturbance of the image quality...

    Full text to download in external service

  • THE ROLE OF INFERENCE IN MOBILE MEDICAL APPLICATION DESIGN

    Publication

    - Year 2021

    In the early 21st century, artificial intelligence began to be used to process medical information. However, before this happened, predictive models used in healthcare could only consider a limited number of variables, and only in properly structured and organised medical data. Today, advanced tools based on machine learning techniques - which, using artificial neural networks, can explore extremely complex relationships - and...

  • The influence of image masks definition onsegmentation results of histopathological imagesusing convolutional neural network

    Publication

    Abstract—In the era of collecting large amounts of tissue materials, assisting the work of histopathologists with various electronic and information IT tools is an undeniable fact. The traditional interaction between a human pathologist and the glass slide is changing to interaction between an AI pathologist with a whole slide images. One of the important tasks is the segmentation of objects (e.g. cells) in such images. In this...

    Full text available to download

  • DevEmo—Software Developers’ Facial Expression Dataset

    The COVID-19 pandemic has increased the relevance of remote activities and digital tools for education, work, and other aspects of daily life. This reality has highlighted the need for emotion recognition technology to better understand the emotions of computer users and provide support in remote environments. Emotion recognition can play a critical role in improving the remote experience and ensuring that individuals are able...

    Full text available to download

  • Semantic segmentation training using imperfect annotations and loss masking

    One of the most significant factors affecting supervised neural network training is the precision of the annotations. Also, in a case of expert group, the problem of inconsistent data annotations is an integral part of real-world supervised learning processes, well-known to researchers. One practical example is a weak ground truth delineation for medical image segmentation. In this paper, we have developed a new method of accurate...

    Full text to download in external service

  • Smart Karyotyping Image Selection Based on Commonsense Knowledge Reasoning

    Publication

    - CYBERNETICS AND SYSTEMS - Year 2024

    Karyotyping requires chromosome instances to be segmented and classified from the metaphase images. One of the difficulties in chromosome segmentation is that the chromosomes are randomly positioned in the image, and there is a great chance for chromosomes to be touched or overlap with others. It is always much easier for operators and automatic programs to tackle images without overlapping chromosomes than ones with largely overlapped...

    Full text available to download

  • Impact of Visual Image Quality on Lymphocyte Detection Using YOLOv5 and RetinaNet Algorithms

    Lymphocytes, a type of leukocytes, play a vital role in the immune system. The precise quantification, spatial arrangement and phenotypic characterization of lymphocytes within haematological or histopathological images can serve as a diagnostic indicator of a particular lesion. Artificial neural networks, employed for the detection of lymphocytes, not only can provide support to the work of histopathologists but also enable better...

    Full text to download in external service

  • Controlling computer by lip gestures employing neural network

    Publication

    - Year 2010

    Results of experiments regarding lip gesture recognition with an artificial neural network are discussed. The neural network module forms the core element of a multimodal human-computer interface called LipMouse. This solution allows a user to work on a computer using lip movements and gestures. A user face is detected in a video stream from a standard web camera using a cascade of boosted classifiers working with Haar-like features....

    Full text to download in external service

  • Assessing the attractiveness of human face based on machine learning

    Publication

    The attractiveness of the face plays an important role in everyday life, especially in the modern world where social media and the Internet surround us. In this study, an attempt to assess the attractiveness of a face by machine learning is shown. Attractiveness is determined by three deep models whose sum of predictions is the final score. Two annotated datasets available in the literature are employed for training and testing...

    Full text available to download

  • Instance segmentation of stack composed of unknown objects

    The article reviews neural network architectures designed for the segmentation task. It focuses mainly on instance segmentation of stacked objects. The main assumption is that segmentation is based on a color image with an additional depth layer. The paper also introduces the Stacked Bricks Dataset based on three cameras: RealSense L515, ZED2, and a synthetic one. Selected architectures: DeepLab, Mask RCNN, DEtection TRansformer,...

    Full text available to download

  • Automated Parking Management for Urban Efficiency: A Comprehensive Approach

    Publication

    - Year 2024

    Effective parking management is essential for ad-dressing the challenges of traffic congestion, city logistics, and air pollution in densely populated urban areas. This paper presents an algorithm designed to optimize parking management within city environments. The proposed system leverages deep learning models to accurately detect and classify street elements and events. Various algorithms, including automatic segmentation of...

    Full text to download in external service

  • Hazard Control in Industrial Environments: A Knowledge-Vision-Based Approach

    Publication

    This paper proposes the integration of image processing techniques (such as image segmentation, feature extraction and selection) and a knowledge representation approach in a framework for the development of an automatic system able to identify, in real time, unsafe activities in industrial environments. In this framework, the visual information (feature extraction) acquired from video-camera images and other context based gathered...

    Full text to download in external service

  • Olgun Aydin Dr

    People

    Olgun Aydin finished his PhD by publishing a thesis about Deep Neural Networks. He works as a Senior Data Scientist in PwC Poland, gives lectures in Gdansk University of Technology in Poland and member of WhyR? Foundation. Olgun is a very big fan of R and author of the book called “R Web Scraping Quick Start Guide” , two video courses are called “Deep Dive into Statistical Modelling using R” and “Applied Machine Learning and Deep...

  • Speaker Recognition Using Convolutional Neural Network with Minimal Training Data for Smart Home Solutions

    Publication

    - Year 2018

    With the technology advancements in smart home sector, voice control and automation are key components that can make a real difference in people's lives. The voice recognition technology market continues to involve rapidly as almost all smart home devices are providing speaker recognition capability today. However, most of them provide cloud-based solutions or use very deep Neural Networks for speaker recognition task, which are...

    Full text to download in external service

  • Predicting emotion from color present in images and video excerpts by machine learning

    Publication

    This work aims at predicting emotion based on the colors present in images and video excerpts using a machine-learning approach. The purpose of this paper is threefold: (a) to develop a machine-learning algorithm that classifies emotions based on the color present in an image, (b) to select the best-performing algorithm from the first phase and apply it to film excerpt emotion analysis based on colors, (c) to design an online survey...

    Full text available to download

  • European Conference on Computer Vision

    Conferences

  • Asian Conference on Computer Vision

    Conferences

  • Model-free and Model-based Reinforcement Learning, the Intersection of Learning and Planning

    Publication

    - Year 2022

    My doctoral dissertation is intended as the compound of four publications considering: structure and randomness in planning and reinforcement learning, continuous control with ensemble deep deterministic policy gradients, toddler-inspired active representation learning, and large-scale deep reinforcement learning costs.

    Full text to download in external service

  • Vident-synth: a synthetic intra-oral video dataset for optical flow estimation

    Open Research Data

    We introduce Vident-synth, a large dataset of synthetic dental videos with corresponding ground truth forward and backward optical flows and occlusion masks. It can be used for:

  • Smart Knowledge Engineering for Cognitive Systems: A Brief Overview

    Publication

    - CYBERNETICS AND SYSTEMS - Year 2022

    Cognition in computer sciences refers to the ability of a system to learn at scale, reason with purpose, and naturally interact with humans and other smart systems, such as humans do. To enhance intelligence, as well as to introduce cognitive functions into machines, recent studies have brought humans into the loop, turning the system into a human–AI hybrid. To effectively integrate and manipulate hybrid knowledge, suitable technologies...

    Full text available to download

  • Optymalizacja zasobów chmury obliczeniowej z wykorzystaniem inteligentnych agentów w zdalnym nauczaniu

    Publication

    - Year 2023

    Rozprawa dotyczy optymalizacji zasobów chmury obliczeniowej, w której zastosowano inteligentne agenty w zdalnym nauczaniu. Zagadnienie jest istotne w edukacji, gdzie wykorzystuje się nowoczesne technologie, takie jak Internet Rzeczy, rozszerzoną i wirtualną rzeczywistość oraz deep learning w środowisku chmury obliczeniowej. Zagadnienie jest istotne również w sytuacji, gdy pandemia wymusza stosowanie zdalnego nauczania na dużą skalę...

    Full text available to download

  • imPlatelet classifier: image‐converted RNA biomarker profiles enable blood‐based cancer diagnostics

    Publication
    • K. Pastuszak
    • A. Supernat
    • M. G. Best
    • S. In ‘t Veld
    • S. Łapińska‐Szumczyk
    • A. Łojkowska
    • R. Różański
    • A. Żaczek
    • J. Jassem
    • T. Würdinger
    • T. Stokowy

    - Molecular Oncology - Year 2021

    Liquid biopsies offer a minimally invasive sample collection, outperforming traditional biopsies employed for cancer evaluation. The widely used material is blood, which is the source of tumor-educated platelets. Here, we developed the imPlatelet classifier, which converts RNA-sequenced platelet data into images in which each pixel corresponds to the expression level of a certain gene. Biological knowledge from the Kyoto Encyclopedia...

    Full text available to download

  • Computer Supported Collaborative Learning

    Conferences

  • Vehicle detector training with minimal supervision

    Publication

    - Year 2019

    Recently many efficient object detectors based on convolutional neural networks (CNN) have been developed and they achieved impressive performance on many computer vision tasks. However, in order to achieve practical results, CNNs require really large annotated datasets for training. While many such databases are available, many of them can only be used for research purposes. Also some problems exist where such datasets are not...

  • Monitoring wizyjny w systemach zabezpieczenia transportu wodnego. Koncepcja implementacyjna

    Publication

    W artykule autorzy przedstawiają koncepcję zastosowania własnych badań nad pomiarem prędkości przepływu cieczy do zastosowań praktycznych w pomiarach przepływu wody w kanałach otwartych i rzekach. Jako narzędzie pomiarowe wykorzystują zestaw aparatów synchronicznych, które rejestrują indykatory przepływu znajdujące się na powierzchni analizowanej cieczy. Aparat matematyczny przedstawiony w rozwiązaniu sprowadza się do stosowania...

    Full text available to download

  • Book Review

    Acting over the last three decades as an Editor and Associate Editor for a number of international journals in the general area of cybernetics and AI, as well as a Chair and Co-Chair of numerous conferences in this field, I have had the exciting opportunity to closely witness and to be actively engaged in the stimulating research area of machine learning and its important augmentation with deep learning techniques and technologies. From...

    Full text to download in external service