Q Learning Algorithm - Search News

Wind turbine control systems: From PID to reinforcement learning

In an RL-based control system, the turbine (or wind farm) controller is realized as an agent that observes the state of the ...

GitHub

n-step-deep-q-learning

SnakeRL is a Python project implementing the Snake game with Deep Q-Learning. The agent learns to navigate, collect food, and avoid collisions using a neural network, dynamic rewards, and customizable ...

Frontiers

Using reinforcement learning in genome assembly: in-depth analysis of a Q-learning assembler

Genome assembly remains an unsolved problem, and de novo strategies (i.e., those run without a reference) are relevant but computationally complex tasks in genomics. Although de novo assemblers have ...

IEEE

Optimizing Successive Over-relaxation Q-learning with Deterministic Perturbation Gradient Search

Abstract: Successive Over-Relaxation Q-learning (SOR-QL) has been proposed recently as an alternative to the widely popular Q-learning algorithm as it is seen to provide better performance where ...

Semiconductor Engineering

SpiNNaker2 Neuromorphic Platform: HW-Aware Fine-Tuning of Spiking Q-Networks (TU Dresden Et Al.)

A new technical paper titled “Hardware-Aware Fine-Tuning of Spiking Q-Networks on the SpiNNaker2 Neuromorphic Platform” was published by researchers at TU Dresden, ScaDS.AI and Centre for Tactile ...

Frontiers

Optimization method of electric vehicle energy system based on machine learning

College of Mechanical and Electronic Engineering, Shanghai Jianqiao University, Shanghai, China Introduction: To enhance energy management in electric vehicles (EVs), this study proposes an ...

IEEE

Q-Learning-Based Control for Discrete-Time Switched Affine Systems and Its Application to DC–DC Converter

Abstract: In this paper, a new data-based Q-learning algorithm is proposed to address the optimal control issue for a class of discrete-time switched affine systems (SASs). The algorithm shifts the ...

Microsoft

Stochastic Approximation and Reinforcement Learning: Hidden Theory and New Super-Fast Algorithms

Stochastic approximation algorithms are used to approximate solutions to fixed point equations that involve expectations of functions with respect to possibly unknown distributions. Among many ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results