Search results for: reinforcement learning - Bridge of Knowledge

Search

Search results for: reinforcement learning

Search results for: reinforcement learning

  • Structure and Randomness in Planning and Reinforcement Learning

    Publication

    - Year 2021

    Planning in large state spaces inevitably needs to balance the depth and breadth of the search. It has a crucial impact on the performance of a planner and most manage this interplay implicitly. We present a novel method \textit{Shoot Tree Search (STS)}, which makes it possible to control this trade-off more explicitly. Our algorithm can be understood as an interpolation between two celebrated search mechanisms: MCTS and random...

    Full text to download in external service

  • Model-free and Model-based Reinforcement Learning, the Intersection of Learning and Planning

    Publication

    - Year 2022

    My doctoral dissertation is intended as the compound of four publications considering: structure and randomness in planning and reinforcement learning, continuous control with ensemble deep deterministic policy gradients, toddler-inspired active representation learning, and large-scale deep reinforcement learning costs.

    Full text to download in external service

  • Reinforcement Learning Algorithm and FDTD-based Simulation Applied to Schroeder Diffuser Design Optimization

    Publication

    The aim of this paper is to propose a novel approach to the algorithmic design of Schroeder acoustic diffusers employing a deep learning optimization algorithm and a fitness function based on a computer simulation of the propagation of acoustic waves. The deep learning method employed for the research is a deep policy gradient algorithm. It is used as a tool for carrying out a sequential optimization process the goal of which is...

    Full text available to download

  • IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning

    Conferences

  • Designing acoustic scattering elements using machine learning methods

    Publication

    - Year 2021

    In the process of the design and correction of room acoustic properties, it is often necessary to select the appropriate type of acoustic treatment devices and make decisions regarding their size, geometry, and location of the devices inside the room under the treatment process. The goal of this doctoral dissertation is to develop and validate a mathematical model that allows predicting the effects of the application of the scattering...

    Full text available to download

  • JamesBot - an intelligent agent playing StarCraft II

    Publication

    The most popular method for optimizing a certain strategy based on a reward is Reinforcement Learning (RL). Lately, a big challenge for this technique are computer games such as StarCraft II which is a real-time strategy game, created by Blizzard. The main idea of this game is to fight between agents and control objects on the battlefield in order to defeat the enemy. This work concerns creating an autonomous bot using reinforced...

    Full text available to download

  • Hossein Nejatbakhsh Esfahani PhD

    People

    Since 2012 when I graduated in master of mechatronics engineering I've been dealing with kinds of control theory problems in both theoretical and practical perspective. I have five years of work experience in industrial automation area in Iran where I was swamped with some industrial-based control algorithms such as PID and MPC algorithms which were adopted to control some processes including steam turbine, gas turbine, casting...

  • Koncepcja systemu wspomagania decyzji nawigatora statku opartego na ewolucyjnym planowaniu manewrów antykolizyjnych

    Publication

    Artykuł przedstawia koncepcję systemu wspomagania decyzji nawigatora statku opartego na wątkach badań prowadzonych wcześniej przez autora. System będzie rozszerzał funkcjonalność systemów dotychczasowych o możliwość szczegółowego planowania bezpiecznej trajektorii statku na wodach zamkniętych, z dużą liczbą statków obcych i ograniczeniami toru wodnego. Artykuł zawiera dyskusję możliwych podejść do planowania manewrów, optymalizacji...

  • Reinforced Secure Gossiping Against DoS Attacks in Post-Disaster Scenarios

    Publication

    - IEEE Access - Year 2020

    During and after a disaster, the perceived quality of communication networks often becomes remarkably degraded with an increased ratio of packet losses due to physical damages of the networking equipment, disturbance to the radio frequency signals, continuous reconfiguration of the routing tables, or sudden spikes of the network traffic, e.g., caused by the increased user activity in a post-disaster period. Several techniques have...

    Full text available to download

  • Load effect impact on the exploitation of concrete machine foundations used in the gas and oil industry

    Publication

    - Year 2019

    Machine foundations is a critical topic in the gas and oil industry, which design and exploitation require extensive technical knowledge. Machine foundations are the constructions which are intended for mounting on it a specific type of machine. The foundation has to transfer dynamic and static load from machine to the ground. The primary difference between machine foundations and building foundations is that the machine foundations...

  • Towards neural knowledge DNA

    Publication

    - JOURNAL OF INTELLIGENT & FUZZY SYSTEMS - Year 2017

    In this paper, we propose the Neural Knowledge DNA, a framework that tailors the ideas underlying the success of neural networks to the scope of knowledge representation. Knowledge representation is a fundamental field that dedicates to representing information about the world in a form that computer systems can utilize to solve complex tasks. The proposed Neural Knowledge DNA is designed to support discovering, storing, reusing,...

    Full text to download in external service