DBpedia 2014 |

DBpedia 2014

Matches in DBpedia 2014 for { <http://dbpedia.org/resource/Q-learning> ?p ?o. }

Showing items 1 to 27 of 27 with 100 items per page.

Q-learning abstract "Q-learning is a model-free reinforcement learning technique. Specifically, Q-learning can be used to find an optimal action-selection policy for any given (finite) Markov decision process (MDP). It works by learning an action-value function that ultimately gives the expected utility of taking a given action in a given state and following the optimal policy thereafter. When such an action-value function is learned, the optimal policy can be constructed by simply selecting the action with the highest value in each state. One of the strengths of Q-learning is that it is able to compare the expected utility of the available actions without requiring a model of the environment. Additionally, Q-learning can handle problems with stochastic transitions and rewards, without requiring any adaptations. It has been proven that for any finite MDP, Q-learning eventually finds an optimal policy.".
Q-learning wikiPageExternalLink Reinforcement%20Learning%20Maze.
Q-learning wikiPageExternalLink 352693.html.
Q-learning wikiPageExternalLink master.
Q-learning wikiPageExternalLink citation.cfm?id=1143955.
Q-learning wikiPageExternalLink piqle.
Q-learning wikiPageExternalLink thesis.html.
Q-learning wikiPageExternalLink the-book.html.
Q-learning wikiPageExternalLink node65.html.
Q-learning wikiPageExternalLink node4.html.
Q-learning wikiPageID "1281850".
Q-learning wikiPageRevisionID "605560131".
Q-learning hasPhotoCollection Q-learning.
Q-learning subject Category:Machine_learning_algorithms.
Q-learning comment "Q-learning is a model-free reinforcement learning technique. Specifically, Q-learning can be used to find an optimal action-selection policy for any given (finite) Markov decision process (MDP). It works by learning an action-value function that ultimately gives the expected utility of taking a given action in a given state and following the optimal policy thereafter.".
Q-learning label "Q-learning".
Q-learning label "Q-learning".
Q-learning label "Q-learning".
Q-learning label "Q-обучение".
Q-learning label "Q学習".
Q-learning sameAs Q-learning.
Q-learning sameAs Q学習.
Q-learning sameAs m.04pvn7.
Q-learning sameAs Q2664563.
Q-learning sameAs Q2664563.
Q-learning wasDerivedFrom Q-learning?oldid=605560131.
Q-learning isPrimaryTopicOf Q-learning.