Issues: Dynamic Batching
(Posting a couple of issues to get features upstreamed from OpenNMT-py, cc @da03)
For the transformer model, we need some improvements to dynamic batching. In particular, the `batch_size_fn` interface has some issues. Let me give an example.
We need batches of 4096 tokens (including padding).
- We can't really do this with `batch_size_fn` because, while it lets us count the total number of tokens, it doesn't let us account for padding (the max size in the batch). One bad example either causes tons of padding, or a huge batch and an OOM.
Our current terrible hack:
global max_src_in_batch, max_tgt_in_batch
def batch_size_fn(new, count, sofar):
global max_src_in_batch, max_tgt_in_batch
if count == 1:
max_src_in_batch = 0
max_tgt_in_batch = 0
max_src_in_batch = max(max_src_in_batch, len(new.src) + 2)
max_tgt_in_batch = max(max_tgt_in_batch, len(new.tgt) + 1)
src_elements = count * max_src_in_batch
tgt_elements = count * max_tgt_in_batch
return max(src_elements, tgt_elements)
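For reference, a minimal sketch of how this plugs into the (legacy) torchtext iterator; the `train` dataset and its `src`/`tgt` fields are assumptions here:

```python
import torchtext.data as data

# Assumed: `train` is a torchtext Dataset whose examples have .src and .tgt.
train_iter = data.Iterator(
    train,
    batch_size=4096,              # interpreted as a token budget via batch_size_fn
    batch_size_fn=batch_size_fn,  # the hack above
    sort_key=lambda ex: len(ex.src),
    train=True,
)
```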
- The Iterator uses this line to buffer data for batching:

```python
for p in batch(data, batch_size * 100, batch_size_fn):
```

https://github.com/pytorch/text/blob/master/torchtext/data/iterator.py#L271
Unfortunately, even though there is a 100 here, it doesn't help: if we are counting padding in `batch_size_fn`, then one long example will make every other sentence in the buffer take a ton of space. We think we need control over this.
Our current hack (don't use `batch_size_fn` for buffering):

```python
for p in torchtext.data.batch(data, self.batch_size * 100):
```
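Putting the two hacks together, this is roughly the Iterator subclass we end up with; a sketch only, and the class name `TokenIterator` is illustrative. It buffers by plain example count, sorts the buffer, then cuts token-sized batches with `batch_size_fn`:

```python
import torchtext.data as data

class TokenIterator(data.Iterator):
    # Illustrative subclass: buffer 100x the batch size by *count*, then
    # cut token-budgeted batches out of the sorted buffer.
    def create_batches(self):
        if self.train:
            def pool(d, random_shuffler):
                for p in data.batch(d, self.batch_size * 100):
                    p_batch = data.batch(
                        sorted(p, key=self.sort_key),
                        self.batch_size, self.batch_size_fn)
                    # Shuffle batch order so training doesn't see batches
                    # strictly by length.
                    for b in random_shuffler(list(p_batch)):
                        yield b
            self.batches = pool(self.data(), self.random_shuffler)
        else:
            self.batches = []
            for b in data.batch(self.data(), self.batch_size,
                                self.batch_size_fn):
                self.batches.append(sorted(b, key=self.sort_key))
```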
- Minor: batching uses `sort` for two different purposes: one to find the batches themselves, and one for the order in which each batch is created. I would like to be able to have a `batch_construction_sort` to find sentences of the same length, and then a `batch_sort` within each batch. For example, in MT I would like to sort by a weighted src x tgt length during batch construction (to minimize padding), but then have the batch itself sorted by src length to make cudnn work (see the sketch below).
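To make the proposal concrete, here is a minimal sketch of the two-key scheme. `batch_construction_key`, `batch_sort_key`, and the 0.5 weight are hypothetical, not an existing torchtext API:

```python
import torchtext.data as data

def batch_construction_key(ex):
    # Hypothetical weighted src x tgt length: group similar-cost examples
    # together so batches carry little padding.
    return len(ex.src) + 0.5 * len(ex.tgt)

def batch_sort_key(ex):
    # cudnn RNN kernels want each batch sorted by descending source length.
    return -len(ex.src)

# Assumed: `examples`, `batch_size`, and `batch_size_fn` are in scope.
buffered = sorted(examples, key=batch_construction_key)
batches = [sorted(b, key=batch_sort_key)
           for b in data.batch(buffered, batch_size, batch_size_fn)]
```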
Top GitHub Comments
@patelrajnath I'll instead suggest that you have a look at huggingface, who have a similar implementation (which I only found out about recently), which you can use to compare against or take inspiration from: https://github.com/huggingface/transformers/blob/c2cd02ac625cd0ab64cf42124ad71ce9158fb67c/src/transformers/trainer_pt_utils.py#L501

@patelrajnath For my use case I've implemented a subclass of `torch.utils.data.Sampler` that simply generates lists of indices, corresponding to dynamically generated batches, based on the number of tokens within the batch. For each call of `__iter__` I do the following:
- Build a list of `Tuple[int, Tuple[int, int]]` which maps `Dataset` indices to their `(src_len, tgt_len)` tuples.
- Sort by `src_len`, then by `tgt_len`.
- Grow each batch until `batch_size * max(src_len) + batch_size * max(tgt_len)` reaches your desired max batch tokens (see the sketch below).
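A minimal sketch of that idea, assuming precomputed per-example lengths; the class name `TokenBatchSampler` and all details are mine, not the commenter's actual code:

```python
import random
from typing import Iterator, List, Sequence, Tuple

from torch.utils.data import Sampler


class TokenBatchSampler(Sampler):
    """Yield lists of dataset indices whose padded token cost stays under
    max_tokens. Hypothetical implementation of the description above."""

    def __init__(self, lengths: Sequence[Tuple[int, int]], max_tokens: int,
                 shuffle: bool = True):
        # lengths[i] == (src_len, tgt_len) for dataset example i
        self.lengths = lengths
        self.max_tokens = max_tokens
        self.shuffle = shuffle

    def __iter__(self) -> Iterator[List[int]]:
        # 1) Map dataset indices to their (src_len, tgt_len) tuples.
        indexed = list(enumerate(self.lengths))
        # 2) Sort by src_len, then tgt_len, so neighbours pad similarly.
        indexed.sort(key=lambda item: (item[1][0], item[1][1]))

        batches: List[List[int]] = []
        batch: List[int] = []
        max_src = max_tgt = 0
        for idx, (src_len, tgt_len) in indexed:
            new_src = max(max_src, src_len)
            new_tgt = max(max_tgt, tgt_len)
            n = len(batch) + 1
            # 3) Flush when adding this example would push the padded
            #    cost past the cap.
            if batch and n * new_src + n * new_tgt > self.max_tokens:
                batches.append(batch)
                batch = []
                new_src, new_tgt = src_len, tgt_len
            batch.append(idx)
            max_src, max_tgt = new_src, new_tgt
        if batch:
            batches.append(batch)

        if self.shuffle:
            random.shuffle(batches)  # shuffle batch order, not contents
        return iter(batches)
```

You can then pass it to a `DataLoader` via `batch_sampler=TokenBatchSampler(lengths, max_tokens=4096)`, together with a collate function that pads each side to its batch max.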