Implementation of a random sampling node -> minibatch layout troubles · Microsoft/CNTK#1371

Repository metrics

Stars: (16,085 stars)
PR merge metrics: (No merged PRs in 30d)

Description

Hi CNTK Team!

I'm currently trying to implement a Variational Autoencoder using CNTK. I'd need functions for random sampling (like random_normal() in Tensorflow) for this. CNTKs backend already has random distributions in it's matrix lib so I started implementing a ComputationNode that would use these to fill the value matrix with random numbers during the forward pass. So far, not too difficult...

However, I'm struggling to get the minibatch layout consistent. Since my random sampling node has no inputs (parameters of the random distribution are static), I have found no proper way to infer the minibatch layout for the output automatically.

I tried specifying the minibatch layout via static parameters, but this is not very elegant and leads to other nodes in the computation graph complaining about an inconsistent layout with regard to sequences.

I could also use a node in the computation graph that has the correct minibatch layout as additional input to my random sampling node and then use this only to get the minibatch layout from. However, this would mean introducing an input variable to my node and then not use it's actual values. Also not very elegent...

Do you have any tips on how to do this correctly (or maybe I'm just missing something obvious)?

If this will result in a usable implementation, I'd be happy to contribute my code.

Enrico

Contributor guide

Research direction: Examine how existing computation nodes handle minibatch layout (e.g., input nodes). Consider using a dummy input to propagate layout or inspect the layout inference mechanism in the codebase. Look at the Matrix library for random distribution functions.
Tech stack: None
Domain: machine learning
Issue type: Feature
Difficulty: 3
Estimated time: 1-2 days
Activity status: Active
Clarity: Clear
Prerequisites: C++CNTK internals
Newbie friendliness: 50

Repository metrics

Description

Contributor guide

Get fresh easy issues in your inbox.