Dev Observability
Product
Pricing
Docs
Resources
Blog
Company
Debug Wordle

question-mark

Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Questions about MCTS

See original GitHub issue

Hi, in update_branch function, you update the parent node value with the same total_reward with the current node. Why? I think they have different values, because action from parent node to current node will get a reward which should be added in parent total_reward.

    def update_branch(self, total_reward):
        self.update(total_reward)
        if self.parent:
            self.parent.update_branch(total_reward)

Issue Analytics

State:
Created 3 years ago
Comments:13 (9 by maintainers)

Top GitHub Comments

1reaction

sriteecommented, Jan 3, 2021

I think the two formulations are equivalent (ie when choosing the next action to take, comparing past + future1 with past + future 2 is the same as just comparing future1 with future2. But I agree that your proposal is more intuitive, maybe I should change it. I don’t really remember why I implemented it this way in the first place

That makes sense, I suppose the shift invariance of the UCB makes it so that the magnitude of the value function doesn’t quite matter, but I suppose for other non-argmax or other policies it might make a difference.

0reactions

eleurentcommented, Jan 3, 2021

should the weight for the first term be 1 - regularization?

Yes it should! thank you for catching this, I’ll fix it.

but I suppose for other non-argmax or other policies it might make a difference

Indeed, it would matter for a Boltzmann policy for instance. In which case, it is probably best to ignore the past.

Read more comments on GitHub >

Top Results From Across the Web

5 questions with answers in MCTS | Science topic

Review and cite MCTS protocol, troubleshooting and other methodology information | Contact experts in MCTS to get answers.

Free MCTS 70-432 Practice Test Questions - Accelerated Ideas

Welcome to our free 70-432 practice test which covers questions for the Microsoft MCTS certification for Database implementation and maintenance.

Questions or Comments? - MCTS | Real-Time

MCTS welcomes customer comments regarding Real-Time. Please email bustime@cleverdevices.com or contact MCTS Customer Service at 414-344-6711.

What questions did they ask during your interview at MCTS?

Find answers to 'What questions did they ask during your interview at MCTS?' from MCTS employees. Get answers to your biggest company ...

Beginner questions about MCTS : r/deeplearning - Reddit

- Looking ahead a bit, is there any good articles or discussion different approaches of combining MCTS with a neural net that can...

Top Related Medium Post

No results found

Top Related StackOverflow Question

No results found

Troubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.

Top Related Reddit Thread

No results found

Top Related Hackernoon Post

No results found

Top Related Tweet

No results found

Top Related Dev.to Post

No results found

Top Related Hashnode Post

No results found

Errors while setting up

How to analyze the data after being successfully trained.