Search results for: reinforcement learning

Search results for: reinforcement learning

results on page:
embed this view on your website

Filters

total: 125

clear all filters disabled

Structure and Randomness in Planning and Reinforcement Learning
Publication
- K. Czechowski
- P. Januszewski
- P. Kozakowski
- Ł. Kuciński
- P. Miłoś
- Year 2021
Planning in large state spaces inevitably needs to balance the depth and breadth of the search. It has a crucial impact on the performance of a planner and most manage this interplay implicitly. We present a novel method \textit{Shoot Tree Search (STS)}, which makes it possible to control this trade-off more explicitly. Our algorithm can be understood as an interpolation between two celebrated search mechanisms: MCTS and random...

Full text to download in external service
Model-free and Model-based Reinforcement Learning, the Intersection of Learning and Planning
Publication
- P. Januszewski
- Year 2022
My doctoral dissertation is intended as the compound of four publications considering: structure and randomness in planning and reinforcement learning, continuous control with ensemble deep deterministic policy gradients, toddler-inspired active representation learning, and large-scale deep reinforcement learning costs.

Full text to download in external service
The Role of Dopaminergic Genes in Probabilistic Reinforcement Learning in Schizophrenia Spectrum Disorders
Publication
- D. Frydecka
- B. Misiak
- P. Piotrowski
- T. Bielawski
- E. Pawlak
- E. Kłosińska
- M. Krefft
- K. Al
- J. Rymaszewska
- A. Moustafa
- J. Drapała
- Brain Sciences - Year 2021
Full text to download in external service
Confirmation Bias in the Course of Instructed Reinforcement Learning in Schizophrenia-Spectrum Disorders
Publication
- D. Frydecka
- P. Piotrowski
- T. Bielawski
- E. Pawlak
- E. Kłosińska
- M. Krefft
- K. Al
- J. Rymaszewska
- A. Moustafa
- J. Drapała
- B. Misiak
- Brain Sciences - Year 2022
Full text to download in external service
Optimizing Medical Personnel Speech Recognition Models Using Speech Synthesis and Reinforcement Learning
Publication
- A. Czyżewski
- Journal of the Acoustical Society of America - Year 2023
Text-to-Speech synthesis (TTS) can be used to generate training data for building Automatic Speech Recognition models (ASR). Access to medical speech data is because it is sensitive data that is difficult to obtain for privacy reasons; TTS can help expand the data set. Speech can be synthesized by mimicking different accents, dialects, and speaking styles that may occur in a medical language. Reinforcement Learning (RL), in the...

Full text available to download
Reinforcement Learning Algorithm and FDTD-based Simulation Applied to Schroeder Diffuser Design Optimization
Publication
- A. Kurowski
- B. Kostek
- IEEE Access - Year 2021
The aim of this paper is to propose a novel approach to the algorithmic design of Schroeder acoustic diffusers employing a deep learning optimization algorithm and a fitness function based on a computer simulation of the propagation of acoustic waves. The deep learning method employed for the research is a deep policy gradient algorithm. It is used as a tool for carrying out a sequential optimization process the goal of which is...

Full text available to download
Autonomous port management based AGV path planning and optimization via an ensemble reinforcement learning framework
Publication
- X. Chen
- S. Liu
- J. Zhao
- H. Wu
- J. Xian
- J. Montewka
- OCEAN & COASTAL MANAGEMENT - Year 2024
The rapid development of shipping trade pushes automated container terminals toward the direction of intelligence, safety and efficiency. In particular, the formulation of AGV scheduling tasks and the safety and stability of transportation path is an important part of port operation and management, and it is one of the basic tasks to build an intelligent port. Existing research mainly focuses on collaborative operation between...

Full text to download in external service
IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning

Conferences
Designing acoustic scattering elements using machine learning methods
Publication
- A. Kurowski
- Year 2021
In the process of the design and correction of room acoustic properties, it is often necessary to select the appropriate type of acoustic treatment devices and make decisions regarding their size, geometry, and location of the devices inside the room under the treatment process. The goal of this doctoral dissertation is to develop and validate a mathematical model that allows predicting the effects of the application of the scattering...

Full text available to download
Hossein Nejatbakhsh Esfahani Dr.

People

My research interests lie primarily in the area of Learning-based Safety-Critical Control Systems, for which I leverage the following concepts and tools:-Robust/Optimal Control-Reinforcement Learning-Model Predictive Control-Data-Driven Control-Control Barrier Function-Risk-Averse Controland with applications to:-Aerial and Marine robotics (fixed-wing UAVs, autonomous ships and underwater vehicles)-Multi-Robot and Networked Control...
JamesBot - an intelligent agent playing StarCraft II
Publication
- Year 2019
The most popular method for optimizing a certain strategy based on a reward is Reinforcement Learning (RL). Lately, a big challenge for this technique are computer games such as StarCraft II which is a real-time strategy game, created by Blizzard. The main idea of this game is to fight between agents and control objects on the battlefield in order to defeat the enemy. This work concerns creating an autonomous bot using reinforced...

Full text available to download
Chained machine learning model for predicting load capacity and ductility of steel fiber–reinforced concrete beams
Publication
- T. Shafighfard
- F. Kazemi
- F. Bagherzadeh
- M. Mieloszyk
- D. Yoo
- COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING - Year 2024
One of the main issues associated with steel fiber–reinforced concrete (SFRC) beams is the ability to anticipate their flexural response. With a comprehensive grid search, several stacked models (i.e., chained, parallel) consisting of various machine learning (ML) algorithms and artificial neural networks (ANNs) were developed to predict the flexural response of SFRC beams. The flexural performance of SFRC beams under bending was...

Full text available to download
Koncepcja systemu wspomagania decyzji nawigatora statku opartego na ewolucyjnym planowaniu manewrów antykolizyjnych
Publication
- R. Szłapczyński
- Logistyka - Year 2014
Artykuł przedstawia koncepcję systemu wspomagania decyzji nawigatora statku opartego na wątkach badań prowadzonych wcześniej przez autora. System będzie rozszerzał funkcjonalność systemów dotychczasowych o możliwość szczegółowego planowania bezpiecznej trajektorii statku na wodach zamkniętych, z dużą liczbą statków obcych i ograniczeniami toru wodnego. Artykuł zawiera dyskusję możliwych podejść do planowania manewrów, optymalizacji...
Reinforced Secure Gossiping Against DoS Attacks in Post-Disaster Scenarios
Publication
- C. Esposito
- Z. Zhao
- J. Rak
- IEEE Access - Year 2020
During and after a disaster, the perceived quality of communication networks often becomes remarkably degraded with an increased ratio of packet losses due to physical damages of the networking equipment, disturbance to the radio frequency signals, continuous reconfiguration of the routing tables, or sudden spikes of the network traffic, e.g., caused by the increased user activity in a post-disaster period. Several techniques have...

Full text available to download
Integracja bezprzewodowych heterogenicznych sieci IP dla poprawy efektywności transmisji danych na morzu
Publication
- M. Hoeft
- Year 2023
Wraz ze wzrostem istotności środowiska morskiego w naszym codziennym życiu np. w postaci zwiększonego wolumenu transportu realizowanego drogą morską. czy zintensyfikowanych prac dotyczących obserwacji i monitoringu środowiska morskiego, wzrasta również potrzeba opracowania efektywnych systemów komunikacyjnych dedykowanych dla tego środowiska. Heterogeniczne systemy łączności bezprzewodowej integrowane na poziomie warstwy sieciowej...

Full text available to download
Load effect impact on the exploitation of concrete machine foundations used in the gas and oil industry
Publication
- P. Ziółkowski
- M. Niedostatkiewicz
- S. Demczyński
- Year 2019
Machine foundations is a critical topic in the gas and oil industry, which design and exploitation require extensive technical knowledge. Machine foundations are the constructions which are intended for mounting on it a specific type of machine. The foundation has to transfer dynamic and static load from machine to the ground. The primary difference between machine foundations and building foundations is that the machine foundations...
Investigations on fracture in reinforced concrete beams in 3-point bending using continuous micro-CT scanning
Publication
- Ł. Skarżyński
- J. Tejchman
- CONSTRUCTION AND BUILDING MATERIALS - Year 2021
This study explores a fracture process in rectangular reinforced concrete (RC) beams subjected to quasi-static three-point bending. RC beams were short and long with included longitudinal reinforcement in the form of a steel or basalt bar. The ratio of the shear span to the effective depth was 1.5 and 0.75. The focus was on the load–deflection diagram and crack formation. Three-dimensional (3D) analyses of the size and distribution...

Full text available to download
Hybrid all-cellulose reinforcement in polypropylene matrix biocomposites for injection moulding - influence of particle geometry and volume fraction on hybrid effect
Publication
- P. Franciszczak
- J. Smoliński
- COMPOSITE STRUCTURES - Year 2023
The presented study is focused on evaluation of influence of reinforcement volume fraction and geometry on the occurrence of positive hybrid effect by the hybridisation of man-made cellulose fibres (rayon viscose) with cellulose microparticle fillers applied in polypropylene matrix. Four volume fractions of reinforcement were used at 1:1 combination of short man-made cellulose fibres with cellulose microfillers of different aspect...

Full text to download in external service
Influence of effective width of flange on calculation and reinforcement dimensioning of beam of reinforced concrete frame
Publication
- M. T. Solarczyk
- Budownictwo i Architektura - Year 2021
The paper analyses the influence of modelling the cross-section of a beam in two-storey reinforced concrete frame of industrial warehouse with dimensions: 18.0 m × 32.0 m using bar elements on the results of bending moments, the value of elastic deflection and the dimensioning of reinforcement due to bending. Six options were considered: a beam as a rectangular section and five T-beam variants with different definitions of effective...

Full text available to download
Modeling and Designing Acoustical Conditions of the Interior – Case Study
Publication
- Archives of Acoustics - Year 2016
The primary aim of this research study was to model acoustic conditions of the Courtyard of the Gdańsk University of Technology Main Building, and then to design a sound reinforcement system for this interior. First, results of measurements of the parameters of the acoustic field are presented. Then, the comparison between measured and predicted values using the ODEON program is shown. Collected data indicate a long reverberation...

Full text available to download

Search

Filters

Catalog

Search results for: reinforcement learning

Hossein Nejatbakhsh Esfahani Dr.