
📚 Documentation/Examples

Hello, I’m trying to implement a variational multi-output model by following the example at https://docs.gpytorch.ai/en/v1.1.1/examples/04_Variational_and_Approximate_GPs/SVGP_Multitask_GP_Regression.html

In my case, the number of tasks is defined by nTasks. The input dimension is n x d and the output dimension is n x nTasks. The motivation was mentioned in my question from yesterday: the outputs need to be constrained to the [0, 1] range, and I was advised to use BetaLikelihood.

My question is: what should the shape of the inducing points be for the multi-output case? Is it nTasks x nLatentSpace x d? The example documentation is not very clear on this. Also, is there a multitask version of BetaLikelihood somewhere? Thank you in advance from a newbie.

    import torch
    import gpytorch

    class MultitaskVarGPModel(gpytorch.models.ApproximateGP):
        def __init__(self, inducing_points, nTasks):
            # One variational distribution per task, over the inducing points
            variational_distribution = gpytorch.variational.CholeskyVariationalDistribution(
                inducing_points.size(-2), batch_shape=torch.Size([nTasks])
            )
            # Wrap the batched strategy so the model returns a MultitaskMultivariateNormal
            variational_strategy = gpytorch.variational.MultitaskVariationalStrategy(
                gpytorch.variational.VariationalStrategy(
                    self, inducing_points, variational_distribution, learn_inducing_locations=True
                ),
                num_tasks=nTasks,
            )
            super().__init__(variational_strategy)
            self.mean_module = gpytorch.means.ConstantMean(batch_shape=torch.Size([nTasks]))
            self.covar_module = gpytorch.kernels.ScaleKernel(
                gpytorch.kernels.RBFKernel(batch_shape=torch.Size([nTasks])),
                batch_shape=torch.Size([nTasks]),
            )

        def forward(self, x):
            # The forward function should be written as if we were dealing with each output
            # dimension in batch
            mean_x = self.mean_module(x)
            covar_x = self.covar_module(x)
            return gpytorch.distributions.MultivariateNormal(mean_x, covar_x)
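For reference on the shape question: in this batched setup each task gets its own set of inducing points, so a tensor of shape nTasks x m x d (m inducing points, d input dimensions) is one workable choice that matches the batch_shape=torch.Size([nTasks]) of the variational distribution. A minimal sketch, with purely illustrative sizes:

```python
import torch

# Illustrative sizes (assumptions, not from the thread):
n_tasks, n_inducing, d = 4, 16, 2

# One independent set of inducing points per task/output.
# The leading dimension must match the batch_shape passed to
# CholeskyVariationalDistribution above.
inducing_points = torch.rand(n_tasks, n_inducing, d)
print(inducing_points.shape)  # torch.Size([4, 16, 2])
```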


**Is there documentation missing?**

**Is documentation wrong?**

**Is there a feature that needs some example code?**

**Think you know how to fix the docs?** (If so, we’d love a pull request from you!)

Issue Analytics

  • State: closed
  • Created: 3 years ago
  • Comments: 11 (5 by maintainers)

Top GitHub Comments

5 reactions
gpleiss commented, Oct 5, 2020

@ZhiliangWu LMCVariationalStrategy and IndependentMultitaskVariationalStrategy both convert a batch of GPs into a multitask GP. For example, the IndependentMultitaskVariationalStrategy converts the output of d Gaussian processes into a n x d MultitaskMultivariateNormal.

Both IndependentMultitaskVariationalStrategy and LMCVariationalStrategy must operate on a batch of Gaussian processes, since multiple GPs are needed for multiple output dimensions. (See section 4.2 of this paper for more information).

However, it’s possible that you might want a batch of c multi-output GPs, each with d output dimensions. In that case, IndependentMultitaskVariationalStrategy would have a batch shape of [c, d], where d corresponds to the multiple outputs and c to the actual batch of multi-output GPs; the task_dim argument then specifies that the d dimension corresponds to the GP outputs.

1 reaction
gpleiss commented, Jun 9, 2021

> I am trying to model a vector of values using a multi-output/multi-task GP. The dimensions are, e.g., 4 for bounding-box predictions. The inputs are, e.g., 10-dimensional feature vectors for each sample. As far as I understood, this can be done in GPyTorch with IndependentMultitaskVariationalStrategy or LMCVariationalStrategy.
>
> In IndependentMultitaskVariationalStrategy the dependencies between the outputs are not considered, so we have a separate GP for each output. batch_shape=torch.Size([4]) is used for CholeskyVariationalDistribution(). (A question here: is this basically the same as training a separate GP model for each output? In the example, it would be 4 separate GPs for 4 output values.)
>
> Meanwhile, LMCVariationalStrategy considers possible dependencies between outputs. If we have batch_shape=torch.Size([3]) in CholeskyVariationalDistribution(), it means each output would be modeled by all three of these latent functions from the corresponding GPs.
>
> In both previous cases, we set task_dim=-1 as in the default setting.

Your understanding is correct. If you want 4 outputs, then use IndependentMultitaskVariationalStrategy or LMCVariationalStrategy as you have described.

> However, from your previous answers, you claimed that one can actually set batch_shape=(c, d), where c is the number of involved GPs and d is the number of actual outputs.

Just ignore this. What I’m saying is that you can train c independent vector-valued GP models, each with d output dimensions. It sounds like, from the way you describe the issue, you understand how to use IndependentMultitaskVariationalStrategy and LMCVariationalStrategy for your purposes.

This thread is quite long and difficult for me to follow/context-switch to, so I am going to close it for now. If you still have questions, please open up a new discussion.
