- Discriminator network: should be nothing but a flatten MLP.
- Update the actor-critic in SAC.
- Write the discriminator loss
- Update the reward
- Write a function for merge and extract.
- Append z's size to all input dimensions.
- Implement GMM
- Run SAC today.
- Run the TensorFlow version of DIAYN on HalfCheetah, Humanoid, and Ant.
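
A rough sketch of the merge/extract helpers, the discriminator loss, and the reward update from the list above. It is plain Python so it stays framework-agnostic. The function names, the one-hot encoding of z, and the uniform skill prior are assumptions for illustration, not the final design. The reward follows the usual DIAYN form, log q(z | s) - log p(z).

```python
import math


def merge(state, z_onehot):
    """Concatenate an observation with a one-hot skill vector (assumed encoding)."""
    return list(state) + list(z_onehot)


def extract(merged, z_size):
    """Split a merged vector back into (state, z_onehot)."""
    split = len(merged) - z_size
    return merged[:split], merged[split:]


def log_softmax(logits):
    """Numerically stable log-softmax over a list of logits."""
    m = max(logits)
    lse = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - lse for x in logits]


def discriminator_loss(logits, z_index):
    """Cross-entropy of the skill prediction, i.e. -log q(z | s)."""
    return -log_softmax(logits)[z_index]


def diayn_reward(logits, z_index, num_skills):
    """Pseudo-reward log q(z | s) - log p(z), assuming p(z) is uniform."""
    return log_softmax(logits)[z_index] + math.log(num_skills)
```

With this layout, the policy and Q-functions would take `merge(state, z)` as input, while the discriminator sees only the state part returned by `extract`.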