Filters
total: 125
Search results for: reinforcement learning
-
Structure and Randomness in Planning and Reinforcement Learning
PublicationPlanning in large state spaces inevitably needs to balance the depth and breadth of the search. It has a crucial impact on the performance of a planner and most manage this interplay implicitly. We present a novel method \textit{Shoot Tree Search (STS)}, which makes it possible to control this trade-off more explicitly. Our algorithm can be understood as an interpolation between two celebrated search mechanisms: MCTS and random...
-
Model-free and Model-based Reinforcement Learning, the Intersection of Learning and Planning
PublicationMy doctoral dissertation is intended as the compound of four publications considering: structure and randomness in planning and reinforcement learning, continuous control with ensemble deep deterministic policy gradients, toddler-inspired active representation learning, and large-scale deep reinforcement learning costs.
-
The Role of Dopaminergic Genes in Probabilistic Reinforcement Learning in Schizophrenia Spectrum Disorders
Publication -
Confirmation Bias in the Course of Instructed Reinforcement Learning in Schizophrenia-Spectrum Disorders
Publication -
Optimizing Medical Personnel Speech Recognition Models Using Speech Synthesis and Reinforcement Learning
PublicationText-to-Speech synthesis (TTS) can be used to generate training data for building Automatic Speech Recognition models (ASR). Access to medical speech data is because it is sensitive data that is difficult to obtain for privacy reasons; TTS can help expand the data set. Speech can be synthesized by mimicking different accents, dialects, and speaking styles that may occur in a medical language. Reinforcement Learning (RL), in the...
-
Reinforcement Learning Algorithm and FDTD-based Simulation Applied to Schroeder Diffuser Design Optimization
PublicationThe aim of this paper is to propose a novel approach to the algorithmic design of Schroeder acoustic diffusers employing a deep learning optimization algorithm and a fitness function based on a computer simulation of the propagation of acoustic waves. The deep learning method employed for the research is a deep policy gradient algorithm. It is used as a tool for carrying out a sequential optimization process the goal of which is...
-
Autonomous port management based AGV path planning and optimization via an ensemble reinforcement learning framework
PublicationThe rapid development of shipping trade pushes automated container terminals toward the direction of intelligence, safety and efficiency. In particular, the formulation of AGV scheduling tasks and the safety and stability of transportation path is an important part of port operation and management, and it is one of the basic tasks to build an intelligent port. Existing research mainly focuses on collaborative operation between...
-
IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning
Conferences -
Designing acoustic scattering elements using machine learning methods
PublicationIn the process of the design and correction of room acoustic properties, it is often necessary to select the appropriate type of acoustic treatment devices and make decisions regarding their size, geometry, and location of the devices inside the room under the treatment process. The goal of this doctoral dissertation is to develop and validate a mathematical model that allows predicting the effects of the application of the scattering...
-
Hossein Nejatbakhsh Esfahani Dr.
PeopleMy research interests lie primarily in the area of Learning-based Safety-Critical Control Systems, for which I leverage the following concepts and tools:-Robust/Optimal Control-Reinforcement Learning-Model Predictive Control-Data-Driven Control-Control Barrier Function-Risk-Averse Controland with applications to:-Aerial and Marine robotics (fixed-wing UAVs, autonomous ships and underwater vehicles)-Multi-Robot and Networked Control...
-
JamesBot - an intelligent agent playing StarCraft II
PublicationThe most popular method for optimizing a certain strategy based on a reward is Reinforcement Learning (RL). Lately, a big challenge for this technique are computer games such as StarCraft II which is a real-time strategy game, created by Blizzard. The main idea of this game is to fight between agents and control objects on the battlefield in order to defeat the enemy. This work concerns creating an autonomous bot using reinforced...
-
Chained machine learning model for predicting load capacity and ductility of steel fiber–reinforced concrete beams
PublicationOne of the main issues associated with steel fiber–reinforced concrete (SFRC) beams is the ability to anticipate their flexural response. With a comprehensive grid search, several stacked models (i.e., chained, parallel) consisting of various machine learning (ML) algorithms and artificial neural networks (ANNs) were developed to predict the flexural response of SFRC beams. The flexural performance of SFRC beams under bending was...
-
Koncepcja systemu wspomagania decyzji nawigatora statku opartego na ewolucyjnym planowaniu manewrów antykolizyjnych
PublicationArtykuł przedstawia koncepcję systemu wspomagania decyzji nawigatora statku opartego na wątkach badań prowadzonych wcześniej przez autora. System będzie rozszerzał funkcjonalność systemów dotychczasowych o możliwość szczegółowego planowania bezpiecznej trajektorii statku na wodach zamkniętych, z dużą liczbą statków obcych i ograniczeniami toru wodnego. Artykuł zawiera dyskusję możliwych podejść do planowania manewrów, optymalizacji...
-
Reinforced Secure Gossiping Against DoS Attacks in Post-Disaster Scenarios
PublicationDuring and after a disaster, the perceived quality of communication networks often becomes remarkably degraded with an increased ratio of packet losses due to physical damages of the networking equipment, disturbance to the radio frequency signals, continuous reconfiguration of the routing tables, or sudden spikes of the network traffic, e.g., caused by the increased user activity in a post-disaster period. Several techniques have...
-
Integracja bezprzewodowych heterogenicznych sieci IP dla poprawy efektywności transmisji danych na morzu
PublicationWraz ze wzrostem istotności środowiska morskiego w naszym codziennym życiu np. w postaci zwiększonego wolumenu transportu realizowanego drogą morską. czy zintensyfikowanych prac dotyczących obserwacji i monitoringu środowiska morskiego, wzrasta również potrzeba opracowania efektywnych systemów komunikacyjnych dedykowanych dla tego środowiska. Heterogeniczne systemy łączności bezprzewodowej integrowane na poziomie warstwy sieciowej...
-
Load effect impact on the exploitation of concrete machine foundations used in the gas and oil industry
PublicationMachine foundations is a critical topic in the gas and oil industry, which design and exploitation require extensive technical knowledge. Machine foundations are the constructions which are intended for mounting on it a specific type of machine. The foundation has to transfer dynamic and static load from machine to the ground. The primary difference between machine foundations and building foundations is that the machine foundations...
-
Investigations on fracture in reinforced concrete beams in 3-point bending using continuous micro-CT scanning
PublicationThis study explores a fracture process in rectangular reinforced concrete (RC) beams subjected to quasi-static three-point bending. RC beams were short and long with included longitudinal reinforcement in the form of a steel or basalt bar. The ratio of the shear span to the effective depth was 1.5 and 0.75. The focus was on the load–deflection diagram and crack formation. Three-dimensional (3D) analyses of the size and distribution...
-
Hybrid all-cellulose reinforcement in polypropylene matrix biocomposites for injection moulding - influence of particle geometry and volume fraction on hybrid effect
PublicationThe presented study is focused on evaluation of influence of reinforcement volume fraction and geometry on the occurrence of positive hybrid effect by the hybridisation of man-made cellulose fibres (rayon viscose) with cellulose microparticle fillers applied in polypropylene matrix. Four volume fractions of reinforcement were used at 1:1 combination of short man-made cellulose fibres with cellulose microfillers of different aspect...
-
Influence of effective width of flange on calculation and reinforcement dimensioning of beam of reinforced concrete frame
PublicationThe paper analyses the influence of modelling the cross-section of a beam in two-storey reinforced concrete frame of industrial warehouse with dimensions: 18.0 m × 32.0 m using bar elements on the results of bending moments, the value of elastic deflection and the dimensioning of reinforcement due to bending. Six options were considered: a beam as a rectangular section and five T-beam variants with different definitions of effective...
-
Modeling and Designing Acoustical Conditions of the Interior – Case Study
PublicationThe primary aim of this research study was to model acoustic conditions of the Courtyard of the Gdańsk University of Technology Main Building, and then to design a sound reinforcement system for this interior. First, results of measurements of the parameters of the acoustic field are presented. Then, the comparison between measured and predicted values using the ODEON program is shown. Collected data indicate a long reverberation...