Matches in DBpedia 2014 for { <http://dbpedia.org/resource/Q-learning> ?p ?o. }
Showing items 1 to 27 of
27
with 100 items per page.
- Q-learning abstract "Q-learning is a model-free reinforcement learning technique. Specifically, Q-learning can be used to find an optimal action-selection policy for any given (finite) Markov decision process (MDP). It works by learning an action-value function that ultimately gives the expected utility of taking a given action in a given state and following the optimal policy thereafter. When such an action-value function is learned, the optimal policy can be constructed by simply selecting the action with the highest value in each state. One of the strengths of Q-learning is that it is able to compare the expected utility of the available actions without requiring a model of the environment. Additionally, Q-learning can handle problems with stochastic transitions and rewards, without requiring any adaptations. It has been proven that for any finite MDP, Q-learning eventually finds an optimal policy.".
- Q-learning wikiPageExternalLink Reinforcement%20Learning%20Maze.
- Q-learning wikiPageExternalLink 352693.html.
- Q-learning wikiPageExternalLink master.
- Q-learning wikiPageExternalLink citation.cfm?id=1143955.
- Q-learning wikiPageExternalLink piqle.
- Q-learning wikiPageExternalLink thesis.html.
- Q-learning wikiPageExternalLink the-book.html.
- Q-learning wikiPageExternalLink node65.html.
- Q-learning wikiPageExternalLink node4.html.
- Q-learning wikiPageID "1281850".
- Q-learning wikiPageRevisionID "605560131".
- Q-learning hasPhotoCollection Q-learning.
- Q-learning subject Category:Machine_learning_algorithms.
- Q-learning comment "Q-learning is a model-free reinforcement learning technique. Specifically, Q-learning can be used to find an optimal action-selection policy for any given (finite) Markov decision process (MDP). It works by learning an action-value function that ultimately gives the expected utility of taking a given action in a given state and following the optimal policy thereafter.".
- Q-learning label "Q-learning".
- Q-learning label "Q-learning".
- Q-learning label "Q-learning".
- Q-learning label "Q-обучение".
- Q-learning label "Q学習".
- Q-learning sameAs Q-learning.
- Q-learning sameAs Q学習.
- Q-learning sameAs m.04pvn7.
- Q-learning sameAs Q2664563.
- Q-learning sameAs Q2664563.
- Q-learning wasDerivedFrom Q-learning?oldid=605560131.
- Q-learning isPrimaryTopicOf Q-learning.