Questions about MCTS

Hi, in the update_branch function, you update the parent node's value with the same total_reward as the current node. Why? I think the two should differ, because the action leading from the parent node to the current node yields a reward that should be added to the parent's total_reward.

    def update_branch(self, total_reward):
        self.update(total_reward)
        if self.parent:
            self.parent.update_branch(total_reward)
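To make the two readings of the backup concrete, here is a minimal sketch rather than the library's actual code. The `Node` class, its `count`, `value`, and `reward` fields, the running-average `update`, and the `update_branch_to_go` method are all assumptions introduced for illustration; only `update_branch` mirrors the snippet quoted above.

```python
class Node:
    """Toy tree node. The `count`, `value`, and `reward` fields are assumed."""

    def __init__(self, parent=None, reward=0.0):
        self.parent = parent
        self.reward = reward  # assumed: reward collected on the edge parent -> self
        self.count = 0
        self.value = 0.0

    def update(self, total_reward):
        # Assumed update rule: running average of the returns backed up so far.
        self.count += 1
        self.value += (total_reward - self.value) / self.count

    def update_branch(self, total_reward):
        # Behaviour quoted in the issue: every ancestor receives the same return.
        self.update(total_reward)
        if self.parent:
            self.parent.update_branch(total_reward)

    def update_branch_to_go(self, future_reward):
        # Hypothetical variant matching the question: each node stores only the
        # reward collected from that node onward (undiscounted).
        self.update(future_reward)
        if self.parent:
            self.parent.update_branch_to_go(self.reward + future_reward)


def build_chain():
    # root -> a (edge reward 1.0) -> b (edge reward 2.0)
    root = Node()
    a = Node(parent=root, reward=1.0)
    b = Node(parent=a, reward=2.0)
    return root, a, b


# Suppose the rollout from b collects 5.0, so the full return from the root is 8.0.
root, a, b = build_chain()
b.update_branch(8.0)
shared = [n.value for n in (root, a, b)]

root2, a2, b2 = build_chain()
b2.update_branch_to_go(5.0)
to_go = [n.value for n in (root2, a2, b2)]

print(shared, to_go)
```

In this toy chain the two variants agree at the root and differ at each deeper node by exactly the reward collected on the way down to it, which is the same for all children of a given parent.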

Issue Analytics

  • State: closed
  • Created 3 years ago
  • Comments: 13 (9 by maintainers)

Top GitHub Comments

1 reaction
sritee commented, Jan 3, 2021

I think the two formulations are equivalent (i.e., when choosing the next action to take, comparing past + future1 with past + future2 is the same as just comparing future1 with future2). But I agree that your proposal is more intuitive; maybe I should change it. I don’t really remember why I implemented it this way in the first place.

That makes sense. I suppose the shift invariance of the UCB rule means that the magnitude of the value function doesn’t really matter, but for non-argmax or other policies it might make a difference.
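The shift-invariance argument can be checked with a small sketch. The `ucb_scores` helper, the exploration constant `c`, and all the numbers below are made up for illustration and are not taken from the library; the point is only that adding the same `past` term to every sibling's value leaves the argmax unchanged.

```python
import math


def ucb_scores(values, counts, c=1.0):
    # A common UCB form (assumed here): value plus an exploration bonus.
    total = sum(counts)
    return [v + c * math.sqrt(math.log(total) / n) for v, n in zip(values, counts)]


def argmax(scores):
    # Index of the largest score.
    return max(range(len(scores)), key=lambda i: scores[i])


future = [0.4, 1.3, 0.9]  # hypothetical future returns of three sibling actions
counts = [3, 5, 4]        # hypothetical visit counts
past = 2.5                # reward already collected before reaching the parent

shifted = [past + f for f in future]

best_future = argmax(ucb_scores(future, counts))
best_shifted = argmax(ucb_scores(shifted, counts))
print(best_future, best_shifted)
```

Because `past` is shared by all children of the same parent, it moves every score by the same amount, so the selected action is identical under both formulations.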

0 reactions
eleurent commented, Jan 3, 2021

should the weight for the first term be 1 - regularization?

Yes, it should! Thank you for catching this, I’ll fix it.

but I suppose for other non-argmax or other policies it might make a difference

Indeed, it would matter for a Boltzmann policy, for instance, in which case it is probably best to ignore the past.
